- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Prachumwat, A.
- Articles by Palopoli, M. F.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Prachumwat, A.
- Articles by Palopoli, M. F.
Intron Size Correlates Positively With Recombination Rate in Caenorhabditis elegans
Anuphap Prachumwata, Laura DeVincentisa, and Michael F. Palopoliaa Department of Biology, Bowdoin College, Brunswick, Maine 04011
Corresponding author: Michael F. Palopoli, Bowdoin College, 6500 College Station, Brunswick, ME 04011., mpalopol{at}bowdoin.edu (E-mail)
Communicating editor: S. HENIKOFF
| ABSTRACT |
|---|
A negative correlation between intron size and recombination rate has been reported for the Drosophila melanogaster and human genomes. Population-genetic models suggest that this pattern could be caused by an interaction between recombination rate and the efficacy of natural selection. To test this idea, we examined variation in intron size and recombination rate across the genome of the nematode Caenorhabditis elegans. Interestingly, we found that intron size correlated positively with recombination rate in this species.
SPLICEOSOMAL introns are widespread and abundant in eukaryotic genomes (![]()
![]()
26, 11, and 24% of the Caenorhabditis elegans, Drosophila melanogaster, and human genome sequences, respectively (C. ELEGANS SEQUENCING CONSORTIUM 1998; ![]()
![]()
Several beneficial functions that are associated with introns have been identified. For example, introns are required for alternative splicing, a post-transcriptional mechanism that allows a single stretch of DNA to code for more than one functional protein (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
A negative correlation between intron size and local recombination rate has been reported for both the D. melanogaster and human genomes (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
To test these models further, we analyzed intron size and recombination rate variation within the genome of the nematode C. elegans. Recombination rates and intron sizes vary substantially in this species (![]()
![]()
| MATERIALS AND METHODS |
|---|
Data collection and analysis:
The first and last nucleotide positions of both exons and introns for every predicted and confirmed gene were obtained from the flat text file format of the C. elegans genome database (Wormbase, http://www.wormbase.org, release WS46, April 2001; ![]()
Intron size and location:
The initial sample included 21,049 predicted genes and 109,128 predicted introns. On the basis of comparisons of genomic positions, we determined that 7706 introns (7.1% of the initial sample) were duplicated in the database. Manual inspection of several hundred of these duplicates indicated that they were in gene sequences that had been assigned more than one name in the database. Only one copy of each duplicated intron was retained for further analysis. A total of 64 predicted introns were <20 bp in length, whereas the shortest known intron size that allows for a successful splicing reaction is
20 bp (![]()
![]()
Recombination rate:
Recombination rate was estimated as a function of nucleotide position along a chromosome by taking the first derivative of the polynomial function that described the best-fit curve for recombination-map position vs. nucleotide coordinate in the genomic sequence, as described in ![]()
| RESULTS |
|---|
Intron size correlated positively with recombination rate for the entire C. elegans genome (Fig 1A; Spearman's rank correlation, R = 0.174, P < 0.0001). A similar pattern was observed when each autosome was considered separately (Fig 1B), but not when the X chromosome was considered separately (Fig 1C). Spearman's rank correlation coefficients were R = 0.206, 0.179, 0.247, 0.212, 0.145, and 0.018 for chromosomes I, II, III, IV, V, and X, respectively. All five autosomes exhibited a rank correlation significantly different from zero, whereas the X chromosome did not (after Bonferroni correction, P < 0.001 for each autosome, P > 0.05 for the X chromosome). Consistent with the correlation analysis, the slope of the least-squares line for the X chromosome was much closer to zero than that observed for any of the autosomes separately or for the genome as a whole (Fig 1, a and b vs. c).
|
When each chromosome was divided into 10 regions of equal length, average intron size and average recombination rate exhibited parallel distributions throughout the genome (Fig 2), and this positive correlation between regional averages was statistically significant (Spearman's rank correlation, R = 0.750, P < 0.0001). Both intron sizes and recombination rates tended to be much greater on the autosomal arms than in the autosomal centers. On the X chromosome, however, average intron size did not exhibit much regional variation, and regional averages in intron size on the X chromosome were generally intermediate between those observed for autosomal centers and arms.
|
Results were similar when the sample was limited to introns from confirmed genes only or to introns from within the known recombination map. For example, on the basis of introns from confirmed genes only, intron size correlated positively with recombination rate across the entire genome (Spearman's rank correlation, R = 0.204, P < 0.0001). Similar trends were observed when each autosome was examined separately: Spearman's rank correlation coefficients for introns from confirmed genes only were R = 0.268, 0.243, 0.258, 0.132, and 0.281 for chromosomes I, II, III, IV, and V, respectively. In contrast, no significant correlation was observed for the X chromosome (Spearman's rank correlation, R = 0.0001, P = 0.993).
| DISCUSSION |
|---|
In the C. elegans genome, intron size correlated positively with recombination rate (Fig 1A). This result contrasts with the negative correlation between these variables observed for the D. melanogaster and human genomes and is not predicted by current models for the evolution of intron size (![]()
![]()
There were consistent, regional trends in average intron size and average recombination rate across the C. elegans genome (Fig 2): (1) autosomal arms tended to have large introns and high recombination rates; (2) autosomal centers tended to have small introns and low recombination rates; and (3) the X chromosome exhibited much less regional variation in average intron size than did any of the autosomes, with average intron sizes intermediate between those observed for autosomal arms and centers. These consistent patterns ruled out the possibility that the genome-wide positive correlation was due to a few regions with widely divergent intron sizes and/or recombination rates.
Population-genetic models have assumed that regional variation in intron size across the genome is determined largely by an interaction between recombination rate and the efficacy of natural selection, termed the Hill-Robertson effect (![]()
![]()
First, one of the current models invoking the Hill-Robertson effect might be valid for flies and humans, but not for nematodes. This explanation for our results implies that the balance of factors determining intron size varies from one evolutionary lineage to another and is supported by the apparent lack of a Hill-Robertson effect on regional variation in both transposon density and codon bias in the C. elegans genome (![]()
![]()
![]()
Second, it is possible that the Hill-Robertson effect is not a major determinant of intron size variation in any of these organisms. This explanation for our results fails to explain the observed correlations (either positive or negative) between intron size and recombination rate. Nevertheless, the fact that recombination rate can be positively correlated with intron size in some species, but negatively correlated in others, raises the question of what other factors might be driving the evolution of intron size.
One interesting possibility is that intron size varies systematically across the genome because the insertion of nonfunctional ("junk") DNA imposes a greater fitness cost in some chromosomal regions than in others. In eukaryotic cells, chromosomal regions harboring dense clusters of active genes are often located toward the center of the nucleus, adjacent to an interchromatin compartment (![]()
![]()
![]()
![]()
This chromosome territory model could explain the observed regional variation in intron size in the C. elegans genome. In general, this model predicts that regions of the genome that are often exposed to interchromatin compartments should tend to have less noncoding DNA than regions that are usually distant from interchromatin compartments. If the autosomal centers are the regions of the C. elegans genome that are most often exposed to interchromatin compartments, then this model could explain the consistent tendency for introns to be smaller in the autosomal centers. According to this interpretation, introns would have evolved to be smaller in the autosomal centers so that more coding DNA could fit into a limited chromosomal region.
Recently, it was reported that genes expressed at higher levels tend to have shorter introns in both humans and C. elegans (![]()
| ACKNOWLEDGMENTS |
|---|
We are grateful to two anonymous reviewers, Patsy Dickinson, Bruce Kohorn, and Anne McBride for making suggestions to improve the manuscript. We thank the members of the Biology Department for many stimulating discussions on these and related topics. This work was supported by National Science Foundation grant 0110994 to M.F.P.
Manuscript received October 2, 2003; Accepted for publication December 10, 2003.
| LITERATURE CITED |
|---|
ADAMS, M. D., S. E. CELNIKER, R. A. HOLT, C. A. EVANS, and J. D. GOCAYNE et al., 2000 The genome sequence of Drosophila melanogaster.. Science 287:2185-2195.
BARNES, T. M., Y. KOHARA, A. COULSON, and S. HEKIMI, 1995 Meiotic recombination, noncoding DNA and genomic organization in Caenorhabditis elegans.. Genetics 141:159-179.[Abstract]
Genome sequence of the nematode C. elegans: a platform for investigating biology. (1998) Science 282:2012-2018.
CACERES, J. F. and A. R. KORNBLIHTT, 2002 Alternative splicing: multiple control mechanisms and involvement in human disease. Trends Genet. 18:186-193.[CrossRef][Medline]
CARVALHO, A. B. and A. G. CLARK, 1999 Intron size and natural selection. Nature 401:344.[CrossRef][Medline]
CASTILLO-DAVIS, C. I., S. L. MEKHEDOV, D. L. HARTL, E. V. KOONIN, and F. A. KONDRASHOV, 2002 Selection for short introns in highly expressed genes. Nat. Genet. 31:415-418.[Medline]
COMERON, J. M. and M. KREITMAN, 2000 The correlation between intron length and recombination in Drosophila: dynamic equilibrium between mutational and selective forces. Genetics 156:1175-1190.
COMERON, J. M. and M. KREITMAN, 2002 Population, evolutionary and genomic consequences of interference selection. Genetics 161:389-410.
CREMER, T. and C. CREMER, 2001 Chromosome territories, nuclear architecture and gene regulation in mammalian cells. Nat. Rev. Genet. 2:292-301.[CrossRef][Medline]
DEUTSCH, M. and M. LONG, 1999 Intron-exon structures of eukaryotic model organisms. Nucleic Acids Res. 27:3219-3228.
DIBB, N. J., 1993 Why do genes have introns? FEBS Lett. 325:135-139.[CrossRef][Medline]
DURET, L. and P. BUCHER, 1997 Searching for regulatory elements in human noncoding sequences. Curr. Opin. Struct. Biol. 7:399-406.[CrossRef][Medline]
DURET, L., G. MARAIS, and C. BIEMONT, 2000 Transposons but not retrotransposons are located preferentially in regions of high recombination rate in Caenorhabditis elegans.. Genetics 156:1661-1669.
FELSENSTEIN, J., 1974 The evolutionary advantage of recombination. Genetics 78:737-756.
HANKE, J., D. BRETT, I. ZASTROW, A. AYDIN, and S. DELBRUCK et al., 1999 Alternative splicing of human genes: More the rule than the exception? Trends Genet. 15:389-390.[CrossRef][Medline]
HAWKINS, J. D., 1988 A survey on intron and exon lengths. Nucleic Acids Res. 16:9893-9908.
HILL, W. G. and A. ROBERTSON, 1966 The effect of linkage on the limits to artificial selection. Genet. Res. 8:269-294.[Medline]
KENT, W. J. and A. M. ZAHLER, 2000 Conservation, regulation, synteny, and introns in a large-scale C. briggsaeC. elegans genomic alignment. Genome Res. 10:1115-1125.
KLIMAN, R. M. and J. HEY, 1993 Reduced natural selection associated with low recombination in Drosophila melanogaster.. Mol. Biol. Evol. 10:1239-1258.[Abstract]
LAMOND, A. I. and W. C. EARNSHAW, 1998 Structure and function in the nucleus. Science 280:547-553.
MAHY, N. L., P. E. PERRY, S. GILCHRIST, R. A. BALDOCK, and W. A. BICKMORE, 2002 Spatial organization of active and inactive genes and noncoding DNA within chromosome territories. J. Cell Biol. 157:579-589.
MARAIS, G. and G. PIGANEAU, 2002 Hill-Robertson interference is a minor determinant of variations in codon bias across Drosophila melanogaster and Caenorhabditis elegans genomes. Mol. Biol. Evol. 19:1399-1406.
MARAIS, G., D. MOUCHINOUD, and L. DURET, 2001 Does recombination improve selection on codon usage? Lessons from nematode and fly complete genomes. Proc. Natl. Acad. Sci. USA 98:5688-5692.
MOURIER, T. and D. C. JEFFARES, 2003 Eukaryotic intron loss. Science 300:1393.
OGATA, H., W. FUJIBUCHI, and M. KANEHISA, 1996 The size differences among mammalian introns are due to the accumulation of small deletions. FEBS Lett. 390:99-103.[CrossRef][Medline]
OPHIR, R. and D. GRAUR, 1997 Patterns and rates of indel evolution in processed pseudogenes from human and murids. Gene 205:191-202.[CrossRef][Medline]
PETROV, D. A. and D. L. HARTL, 1998 High rate of DNA loss in the Drosophila melanogaster and Drosophila virilis species groups. Mol. Biol. Evol. 15:293-302.[Abstract]
PETROV, D. A. and D. L. HARTL, 2000 Pseudogene evolution and natural selection for a compact genome. J. Hered. 91:221-227.
PETROV, D. A., E. R. LOZOVSKAYA, and D. L. HARTL, 1996 High intrinsic rate of DNA loss in Drosophila. Nature 384:346-349.[CrossRef][Medline]
ROBERTSON, H. M., 2000 The large srh family of chemoreceptor genes in Caenorhabditis nematodes reveals processes of genome evolution involving large duplications and deletions and intron gains and losses. Genome Res. 10:192-203.
RUSSELL, C. B., D. FRAGA, and R. D. HINRICHSEN, 1994 Extremely short 2033 nucleotide introns are the standard length in Paramecium tetraurelia.. Nucleic Acids Res. 22:1221-1225.
SHABALINA, S. A. and A. KONDRASHOV, 1999 Pattern of selective constraint in C. elegans and C. briggsae genomes. Genet. Res. 74:23-30.[CrossRef][Medline]
SHABALINA, S. A., A. Y. OGURTSOV, V. A. KONDRASHOV, and A. S. KONDRASHOV, 2001 Selective constraint in intergenic regions of human and mouse genomes. Trends Genet. 17:373-376.[CrossRef][Medline]
STEIN, L. D., P. STERNBERG, R. DURBIN, J. THIERRY-MIEG, and J. SPIETH, 2001 WormBase: network access to the genome and biology of Caenorhabditis elegans.. Nucleic Acids Res. 29:82-86.
TANABE, H., S. MÜLLER, M. NEUSSER, J. VON HASE, and E. CALCAGNO et al., 2002 Evolutionary conservation of chromosome territory arrangements in cell nuclei from higher primates. Proc. Natl. Acad. Sci. USA 99:4424-4429.
VENTER, J. C., M. D. ADAMS, E. W. MYERS, P. W. LI, and R. J. MURAL et al., 2001 The sequence of the human genome. Science 291:1304-1351.
This article has been cited by other articles:
![]() |
S. S. Hughes, C. O. Buckley, and D. E. Neafsey Complex Selection on Intron Size in Cryptococcus neoformans Mol. Biol. Evol., February 1, 2008; 25(2): 247 - 253. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Beye, I. Gattermeier, M. Hasselmann, T. Gempe, M. Schioett, J. F. Baines, D. Schlipalius, F. Mougel, C. Emore, O. Rueppell, et al. Exceptionally high levels of recombination across the honey bee genome Genome Res., November 1, 2006; 16(11): 1339 - 1344. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. C. Presgraves Intron Length Evolution in Drosophila Mol. Biol. Evol., November 1, 2006; 23(11): 2203 - 2213. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Marais, P. Nouvellet, P. D. Keightley, and B. Charlesworth Intron Size and Exon Evolution in Drosophila Genetics, May 1, 2005; 170(1): 481 - 485. [Abstract] [Full Text] [PDF] |
||||
- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Prachumwat, A.
- Articles by Palopoli, M. F.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Prachumwat, A.
- Articles by Palopoli, M. F.




