- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Hey, J.
- Articles by Kliman, R. M.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Hey, J.
- Articles by Kliman, R. M.
Interactions Between Natural Selection, Recombination and Gene Density in the Genes of Drosophila
Jody Heya and Richard M. Klimanba Department of Genetics, Rutgers University, Piscataway, New Jersey 08854-8082
b Department of Biological Sciences, Kean University, Union, New Jersey 07083
Corresponding author: Jody Hey, Rutgers University, Nelson Biological Labs, 604 Allison Rd., Piscataway, NJ 08854-8082., jhey{at}mbcl.rutgers.edu (E-mail)
Communicating editor: S. W. SCHAEFFER
| ABSTRACT |
|---|
In Drosophila, as in many organisms, natural selection leads to high levels of codon bias in genes that are highly expressed. Thus codon bias is an indicator of the intensity of one kind of selection that is experienced by genes and can be used to assess the impact of other genomic factors on natural selection. Among 13,000 genes in the Drosophila genome, codon bias has a slight positive, and strongly significant, association with recombinationas expected if recombination allows natural selection to act more efficiently when multiple linked sites segregate functional variation. The same reasoning leads to the expectation that the efficiency of selection, and thus average codon bias, should decline with gene density. However, this prediction is not confirmed. Levels of codon bias and gene expression are highest for those genes in an intermediate range of gene density, a pattern that may be the result of a tradeoff between the advantages for gene expression of close gene spacing and disadvantages arising from regulatory conflicts among tightly packed genes. These factors appear to overlay the more subtle effect of linkage among selected sites that gives rise to the association between recombination rate and codon bias.
THE redundancy of the genetic code has often been used to tease out manifestations of natural selection that would be beyond the resolution of most experimental approaches. The classic example is the contrast between substitution rates (and polymorphism levels) for mutations that do, and do not, alter the amino acid sequences of proteins. The latter class, which falls within the redundancy of the code, is commonly assumed to be selectively neutral and thus provides a baseline for interpretation of the tempo and mode of natural selection on those mutations that do alter proteins (![]()
![]()
![]()
![]()
Another application of genetic code redundancy to the study of natural selection relies on evidence that natural selection does indeed act on synonymous mutations (evidence that partly undermines methods that assume neutrality of synonymous mutations). The evidence is that genes that are expressed at high levels often show strongly biased codon usage in favor of those codons that correspond to the most common tRNAs (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Here we employ this basic ideathat codon bias is an indicator of the overall impact of one kind of natural selection experienced by a gene (i.e., selection for high gene expression)to address how other aspects of the Drosophila genome interact with this type of natural selection. Of course not all genes need be highly expressed, and one reason that a gene may have low codon bias is simply that mutations that raise the expression level of that gene do not increase fitness. However, we can still inquire whether the codon bias distribution among genes varies as a function of genomic factors that may have an effect on gene expression or that may affect how well natural selection acts on gene expression.
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Note that this hypothesis does not specify what types of selection cause the Hill-Robertson effect, but rather that if it occurs, then the overall effectiveness of selection will be reduced. Any and all mutations with impacts on fitness and in negative linkage disequilibrium with one another will contribute to a Hill-Robertson effect. The method of ![]()
![]()
However, a significant confounding factor is that the preferred codons of Drosophila (i.e., those that increase in frequency in highly biased and highly expressed genes) all end in either G or C, which in turn leads to a correlation between codon bias and GC content. Recently, ![]()
In recent years the concept that selection conflicts arise under tight linkage has played a central role in research on polymorphism levels in natural populations (![]()
![]()
![]()
![]()
![]()
![]()
| MATERIALS AND METHODS |
|---|
Gene sequences:
The nearly complete D. melanogaster genome sequence (![]()
115 Mb and the large majority of Release 2 of the Drosophila genome sequence. The small files of Release 2 were not included because they provide little information on gene density. The header information in each GenBank file contains the locations of all amino acid coding sequences (CDS). A gene was excluded from the analysis if its CDS was incomplete or if, in rare cases, the length of its CDS was not a multiple of three. For 436 genes (3.3% of those used in our analyses), more than one CDS was given. In these cases we simply used the first CDS listed. To check whether this could affect the analyses we compared the codon bias, using the effective number of codons (ENC) measure (![]()
The data for all 12,999 genes were compiled into a common spreadsheet for statistical analysis. That spreadsheet is available from http://lifesci.rutgers.edu/~heylab.
Measuring codon bias:
Codon bias in Drosophila and other organisms has been measured in two primary ways: departure from equal codon usage (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
As a measure of codon bias we used the primary factor found in a factor analysis of relative codon usage. Factor analysis is a multivariate statistical technique, similar to principal components analysis, which is designed for cases when multiple variables (codon usage frequencies in the present case) are thought to be shaped by a common factor. To avoid a basic GC content factor and a gene-length factor, synonymous codon frequencies (calculated for each of the 59 codons that fall within a set of synonymous codons) were first regressed on GCnc and on the length of the amino-acid-coding portions of genes, and the residuals from these regressions were used for the factor analysis. Factor analysis and the majority of other statistical analyses were done using SAS version 8.0.
Gene expression estimation:
We used counts of reported expressed sequence tags (ESTs) as a rough indicator of the level of gene expression (![]()
Recombination estimation:
Genome-wide recombination rate estimators (which generate estimates of the recombination rate per generation for every gene) can be generated from the relationship between genetic and physical maps of the Drosophila genome. To make sure that our findings were not an artifact of any particular approach, we considered five different measures of recombination.
KH93:
This measure was developed by ![]()
![]()
![]()
ACE:
The adjusted coefficient of exchange is the measure used by C. Aquadro and colleagues (![]()
![]()
![]()
![]()
![]()
RTE:
This measure is based on genetic and physical map data collected by ![]()
![]()
R: A current table of genes listing all those with independently determined genetic map locations was obtained from Flybase (A. DE GRAY, personal communication). Genes were excluded if they had multiple conflicting map estimates and if their genetic map position placed them out of sequence with nearby genes on the basis of the physical locations in the genome sequence. The resulting set of 493 X-linked and autosomal genes was ordered by physical position on each chromosome, and recombination was estimated by taking the slope, over eight flanking genes, of the genetic map position as a function of the DNA position on the chromosome. Recombination rates were then assigned to all the genes on the basis of those in the set of 493 to which they are closest.
RP: This measure, like KH93, is based on four-term polynomial regressions of genetic map position on physical map location, with a separate regression done for each chromosome arm. However, RP is based on the 493 loci used for R.
Gene density estimation:
We focused primarily on two different estimates of gene density, each calculated from the base positions of genes indicated in the GenBank files of the genome sequence. "Space between genes" (SBG) is the mean of the distances on either side of a gene, that is, between a gene's terminal codon and the closest terminal codon of the nearest gene. "Genes per kilobase" (GPK) was calculated from the number of genes, including fractions of a gene, observed over a 20,000-base region centered on the midpoint of a gene. In addition, analyses were also conducted using other measures of gene density, including measures like GPK, but measured over longer and shorter intervals, as well as measures of codon density. All measures, including those based on codon density, behaved very similarly to those reported here (results available upon request).
| RESULTS |
|---|
Table 1 shows the correlation coefficients among all five measures of recombination. All are highly correlated with each other; all are similarly, weakly, but highly significantly, correlated with codon bias; and none are significantly correlated with GCnc. Hereafter, analyses that included recombination are based on R, although all analyses using either RP or RTE are nearly identical to those obtained with R and those obtained using KH93 and ACE are qualitatively very similar to those obtained with R (results available upon request).
|
The factor analysis among the regression residuals of the frequencies of 59 codons revealed a strong primary factor with an eigenvalue over three times that associated with the second factor. For 12,999 genes, the distribution of factor scores for this primary factor has a mean of zero and is closely approximated by a normal distribution (Fig 1). The factor scores were highly correlated with other measures of codon bias (Fig 1), and hereafter this primary factor, denoted as F, is used as our measure of codon bias. Because F is based on residuals from linear regression of codon frequencies against GCnc and gene length, the product-moment correlation between F and GCnc, as well as between F and gene length, is zero.
|
Fig 2 shows F plotted against the recombination rate measure R and gene expression E with means and 95% confidence intervals shown for each bin of
1000 genes. Except for the lowest values of E (4383 genes had an EST count of either 0 or 1), gene expression is positively associated with F. The product-moment correlation between F and E is 0.146 (P < 0.0001).
|
From Fig 2 it appears that the association between F and R is primarily limited to the genes with the lowest values of recombination. The 4000 genes with the lowest estimated recombination rates have a product-moment correlation between R and F of 0.167 (P < 0.0001), whereas the correlation among the 9000 genes with the highest levels of R is -0.021 (not significant). The association between F and the lowest levels of recombination is especially apparent for the genes on the small fourth chromosome of Drosophila, which does not recombine. The fourth chromosome consists of interspersed regions of apparent euchromatin and heterochromatin (as evidenced by position effect variegation experienced by inserted genes; ![]()
The fourth chromosome genes are unusual in other respects, including a low mean value of GCnc (0.312 vs. 0.377 for other genes; P < 0.001; Kruskal-Wallis test) and low values of gene density as measured by SBG (8998 vs. 5984 bp; P < 0.001; Kruskal-Wallis test) and GPK (0.092 genes/kb vs. 0.212 genes/kb; P < 0.001; Kruskal-Wallis test). However, the low level of codon bias on the fourth chromosome is not solely due to the reduced GC content. Considering only those 1000 nonfourth chromosome genes that have the lowest values of GCnc, and that have a mean GCnc less than that for the fourth chromosome loci, the mean value of F is 0.037, far higher than that for the fourth chromosome genes. In the majority of analyses to follow, the fourth chromosome genes are not included.
If conflicting selection pressures that arise from linkage disequilibrium among sites that are under selection actually lead to reduced codon bias because of a reduced efficiency of selection, then we would expect that genes that are physically closer to other genes would experience more selection conflicts and, thus, would also have lower codon bias. The reasoning is simply that since genes are the likely location of most mutations that have effects on fitness, then genes that are closest to each other should be those most likely to experience selection conflicts due to linkage. Fig 3A shows that codon bias varies as a complex curvilinear function of SBG, with a less curvilinear relationship with GPK. Surprisingly, the major effect in both curves is an increase of codon bias with gene density, which is in the opposite direction of that expected by models of selection conflicts under linkage. Only for those genes for which there is little space separating them from flanking genes (low values of SBG) do we find the expected decline of codon bias with increasing gene density. As suggested by the 95% confidence intervals in Fig 3, the nonlinear relationships are highly significant. Quadratic polynomial regression of codon bias on the gene density measures reveals a highly significant curvilinear relationship. This is apparent for GPK (
, where F' is the predicted value of codon bias) and for those genes with SBG values <5000
. Overall regression of F on SBG is complicated by the highly nonuniform distribution of SBG values. However, a quadratic polynomial regression of F on ranked values of SBG (SBGr) for all genes also reveals a downward curve
. In each of these cases, the fit of the quadratic model, as well as the value of each estimated regression parameter, is highly significant (P < 0.001).
|
Measures of gene spacing and density also covary with intron length and intron number in such a way that all these variables seem to reflect a common factor of gene packing. For genes that have introns, both total intron length and the number of introns are lower for genes that are more densely packed (Fig 4). For GPK some of this covariation is a necessary consequence of the variable itself (more and longer introns must reduce local gene density that is measured over a fixed distance), but the same pattern holds for the length of SBG.
|
The observation that more closely packed genes have higher codon bias is not due to covariation with noncoding GC content. GCnc is positively correlated with SBG (r = 0.0283, P = 0.0013) and the correlation coefficient with GPK is negative, albeit not significantly different from zero (r = -0.010, P = 0.253). Both of these correlations are in the opposite direction of that required to explain the observed covariation between gene density and codon bias. The predominant association between gene density and codon bias is probably not caused by selection conflicts associated with linkage, as the covariation between these variables is primarily in the wrong directionhigher gene density should lead to more selection conflicts and lower codon bias, the reverse of what is seen. To be sure, we checked to see if polymorphism levels were correlated with gene density, as they are with recombination rates (![]()
![]()
![]()
![]()
The relationship between codon bias and gene density is partly mirrored by the relationship between gene expression and measures of gene density. Fig 5 reveals a curvilinear relationship between E and ranked measures of gene density. The genes with the highest gene expression are, on average, those that occur at intermediate levels of gene density. As in the case of codon bias, the curvilinear relationships between gene expression and gene density are highly significant. Quadratic polynomial regression of E on the gene density measures reveals downward curvilinear relationships:
(for genes with SBG <5000 bp) and
, where E' is the predicted value of gene expression. In each case, the fit of the quadratic model, as well as the value of each estimated regression parameter, is highly significant (P < 0.001). Interestingly, the genes with the very lowest values of SBG show a marked increase in intron length (Fig 4A). Part of the reason for this is that some genes contain other genes within their introns and thus have large introns and SBG values of zero.
|
| DISCUSSION |
|---|
In recent years the patterns of codon usage in the genes of Drosophila have often been used to study the interplay of natural selection and other factors, such as mutation and genetic drift, that shape the DNA sequences of genes (![]()
![]()
![]()
![]()
Codon usage is but one factor that has an effect on gene function, and generally we expect it to be a very minor one relative to other gene components (e.g., relative to the amino acid sequence and to the locations and sequences of regulatory sites). However, codon usage serves a unique role for investigators. Unlike the case with other aspects of a gene, investigators have a clear hypothesis about the pattern of codon usage and the magnitude of one kind of natural selection on a gene. Codon usage can be determined precisely, given a DNA sequence, and specific predictions regarding the action of natural selection can be generated simply on the basis of the pattern of codon usage. In particular, the idea that some codons are "preferred" and that others are "unpreferred" has emerged as highly explanatory for the understanding of codon bias in general (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Codon bias and recombination:
Codon bias (F) is highly significantly correlated with several measures of recombination (Table 1); however, the correlation coefficients are quite low, and the corresponding r2 values are miniscule, indicating that recombination explains just a tiny fraction of the variation found among genes in codon bias. The degree of association is also not a constant one over the range of recombination, and genes that have the lowest levels of recombination show a much stronger relationship between codon bias and recombination. This pattern was also reported in previous studies with fewer genes (![]()
![]()
The low degree of association between recombination rate and codon bias means that the relationship can be easily obscured if those variables are measured with high variance or if there are confounding covariates. Consider the associations between other measures of codon bias and R, GCnc, and gene length (Table 2). The ENC (![]()
![]()
![]()
|
The pattern of association between codon bias and recombination has important implications for models of the evolutionary origin and maintenance of recombination. If it is indeed the case that mutations that affect fitness are often in linkage disequilibrium simply due to the physical distance between them, then there arises a benefit to mutations that elevates the recombination rate (![]()
![]()
9000 genes with recombination rates >1.5 cM/Mb) show no association between recombination and codon bias. These genes appear to have recombination rates higher than necessary to dispel Hill-Robertson conflicts. In other words, selection conflicts that arise under disequilibrium due to linkage appear not to be a sufficient explanation of the high recombination rates found in most genes of Drosophila.
Recently ![]()
![]()
![]()
![]()
The recombination estimates of ![]()
![]()
![]()
![]()
|
![]()
![]()
![]()
Codon bias, gene expression, and gene density:
The prediction of an association between recombination and codon bias is based on the idea that genomic regions of low recombination will experience more linkage disequilibrium among mutations that have effects on fitness. If we assume that the regions within and near genes are where most such mutations occur, as opposed to regions between genes, then the same line of reasoning leads to the prediction that genes in regions of high gene density should also have more linkage disequilibrium between mutations with effects on fitness and should also reveal an association between codon bias and, in this case, gene density. However, this pattern was not observed, and we found instead a curvilinear relationship between measures of density (Fig 3) and recombination and between those measures and gene expression (Fig 5). As far as we have been able to determine from a review of the literature, these are novel observations. The presence of a curvilinear pattern suggests that multiple factors are acting to shape the relationship between F (and E) and measures of gene density. In what follows we develop a hypothesis of conflicting factors that predominate over different ranges of gene density.
In eukaryotes, including Drosophila, the local chromatin structure around genes is a major determinant of gene expression levels (![]()
![]()
![]()
![]()
![]()
![]()
This simple argument, if carried further, also entails a cost for high levels of gene packing. As genes move closer, they come closer to the regulatory domains of other genes, and their own regulatory elements (linked enhancers and silencers) become closer to other genes, which creates selection pressure for insulating elements that prevent regulatory conflicts (![]()
This specific tradeoff model is a hypothesis motivated by the curvilinear patterns of Fig 3 and by literature reports on the large role played by chromatin in gene expression. However, given the simple circumstantial nature of the evidence, other hypotheses for all or part of the curvilinear pattern could certainly be developed. For example, transposable elements may tend to accumulate preferentially in regions where genes are not expressed at high levels and where recombination is low, and given their length, such insertions would tend to reduce gene density. It could also be the case that baseline rates of other types of insertions and deletions vary over the genome in such a way as to contribute to the observed patterns and that this is for reasons not associated with natural selection to optimize gene expression.
Hill-Robertson effects and gene density:
In a complex multivariate context it can be difficult to tease out the pattern of interdependence among variables, particularly when patterns of covariation are not linear. In the present context we would especially like to understand whether Hill-Robertson effects and resulting selection pressures favoring recombination have played a role in shaping gene density. However, the complex patterns of covariation prevent clear conclusions. Nevertheless, three points can be made. First, the negative relationship between codon bias and gene density, found for the most tightly packed genes, is in the direction predicted by the Hill-Robertson effect. It is possible that the Hill-Robertson effect becomes strong only at the very highest levels of gene density, just as it appears to for the genes with lowest levels of recombination (Fig 2), and that this contributes to the positive association between gene density and both F and E for the highest levels of density. Second, if Hill-Robertson effects really do occur among genes with low recombination rates per base pair, then this will create a selection pressure to increase the space between genes. This point is essentially the same as that made to explain the observed negative correlation between intron length and recombination in Drosophila (![]()
![]()
![]()
![]()
The third point is that both gene density and recombination explain only small portions of the variance in codon bias and in gene expression and only small portions of the variance of each other. Thus it may be best to consider the effects of recombination on F and E as largely independent of the effects associated with gene density. To check this we assessed the correlation between F and R for low values of SBG (for SBG
1700 bp, n = 5441, r = 0.0518, P < 0.0001) and for high values of SBG (for SBG >1700 bp, n = 7558, r = 0.1001, P < 0.0001). These correlations changed negligibly when SBG was held constant via partial correlation (for SBG
1700 bp, n = 5441, r = 0.0515, P < 0.0001; for SBG >1700 bp, n = 7558, r = 0.0963, P < 0.0001). Finally, the independence of gene density and recombination is fairly striking in a simple plot of F against SBG for both high and low levels of recombination (Fig 7). The two curves closely parallel each other, with that for high recombination consistently higher than that for low recombination, and both curves have a peak in the middle of the span that is essentially identical in location to that for the combined data (Fig 3). These results are consistent with gene density and Hill-Robertson effects acting essentially independently on gene expression and codon bias. They also suggest that Hill-Robertson interference occurs across the gene density spectrum, as F is higher for high recombination genes across the spectrum of SBG values.
|
| ACKNOWLEDGMENTS |
|---|
We are grateful to Hiroshi Akashi, James Birchler, Sarah Elgin, and Adam Eyre-Walker for helpful comments and to two reviewers for helpful critique and suggestions. This research was supported by National Institutes of Health grants R01GM54684 to J.H. and R15HG02456 to R.M.K.
Manuscript received August 14, 2001; Accepted for publication November 19, 2001.
| LITERATURE CITED |
|---|
ADAMS, M. D., S. E. CELNIKER, R. A. HOLT, C. A. EVANS, and J. D. GOCAYNE et al., 2000 The genome sequence of Drosophila melanogaster.. Science 287:2185-2195
AKASHI, H., 1994 Synonymous codon usage in Drosophila melanogaster: natural selection and translational accuracy. Genetics 136:927-935[Abstract].
AKASHI, H., 1995 Inferring weak selection from patterns of polymorphism and divergence at "silent" sites in Drosophila DNA. Genetics 139:1067-1076[Abstract].
AKASHI, H., 1996 Molecular evolution between Drosophila melanogaster and D. simulans: reduced codon bias, faster rates of amino acid substitution, and larger proteins in D. melanogaster.. Genetics 144:1297-1307[Abstract].
AKASHI, H., 1997 Codon bias evolution in Drosophila. Population genetics of mutation-selection drift. Gene 205:269-278[Medline].
AQUADRO, C. F., D. J. BEGUN and E. C. KINDAHL, 1994 Selection, recombination, and DNA polymorphism in Drosophila, pp. 4656 in Non-neutral Evolution, edited by B. GOLDING. Chapman & Hall, London.
ASHBURNER, M., 1992 Flybase, a Drosophila genetic database, version 9209.
BEGUN, D. J., 2001 The frequency distribution of nucleotide variation in Drosophila simulans. Mol. Biol. Evol. 18:1343-1352
BEGUN, D. J. and C. F. AQUADRO, 1992 Levels of naturally occurring DNA polymorphism correlate with recombination rates in D. melanogaster.. Nature 356:519-520[Medline].
BELL, A. C., A. G. WEST, and G. FELSENFELD, 2001 Insulators and boundaries: versatile regulatory elements in the eukaryotic genome. Science 291:447-450
BENNETZEN, J. L. and B. D. HALL, 1982 Codon selection in yeast. J. Biol. Chem. 257:3026-3031
CARVALHO, A. B. and A. G. CLARK, 1999 Intron size and natural selection. Nature 401:344[Medline].
CHARLESWORTH, B., M. T. MORGAN, and D. CHARLESWORTH, 1993 The effect of deleterious mutations on neutral molecular evolution. Genetics 134:1289-1303[Abstract].
CHAVANCY, G., A. CHEVALLIER, A. FOURNIER, and J. P. GAREL, 1979 Adaptation of iso-tRNA concentration to mRNA codon frequency in the eukaryotic cell. Biochimie 61:71-78[Medline].
COHEN, B. A., R. D. MITRA, J. D. HUGHES, and G. M. CHURCH, 2000 A computational analysis of whole-genome expression data reveals chromosomal domains of gene expression. Nat. Genet. 26:183-186[Medline].
COMERON, J. M. and M. KREITMAN, 2000 The correlation between intron length and recombination in Drosophila: dynamic equilibrium between mutational and selective forces. Genetics 156:1175-1190
COMERON, J. M., M. KREITMAN, and M. AGUADÉ, 1999 Natural selection on synonymous sites is correlated with gene length and recombination in Drosophila. Genetics 151:239-249
DURET, L., 2001 Why do genes have introns? Recombination might add a new piece to the puzzle. Trends Genet. 17:172-175[Medline].
DURET, L. and D. MOUCHIROUD, 1999 Expression pattern and, surprisingly, gene length shape codon usage in Caenorhabditis, Drosophila, and Arabidopsis. Proc. Natl. Acad. Sci. USA 96:4482-4487
FARKAS, G., B. A. LEIBOVITCH, and S. C. ELGIN, 2000 Chromatin organization and transcriptional control of gene expression in Drosophila. Gene 253:117-136[Medline].
FELSENSTEIN, J., 1974 The evolutionary advantage of recombination. Genetics 78:737-756
GASSER, S. M., R. PARO, F. STEWART, and R. AASLAND, 1998 The genetics of epigenetics. Cell. Mol. Life Sci. 54:1-5[Medline].
GOUY, M. and C. GAUTIER, 1982 Codon usage in bacteria: correlation with gene expressivity. Nucleic Acids Res. 10:7055-7074
HENIKOFF, S., 1990 Position-effect variegation after 60 years. Trends Genet. 6:422-426[Medline].
HEY, J., 1998 Selfish genes, pleiotropy and the origin of recombination. Genetics 149:2089-2097
HILL, W. G. and A. ROBERTSON, 1966 The effect of linkage on limits to artificial selection. Genet. Res. 8:269-294[Medline].
IKEMURA, T., 1985 Codon usage and tRNA content in unicellular and multicellular organisms. Mol. Biol. Evol. 2:13-34[Abstract].
IKEMURA, T., 1991 Correlation between codon usage and tRNA content in microorganisms, pp. 87112 in Transfer RNA in Protein Synthesis, edited by D. L. HATFIELD, B. J. LEE and R. M. PIRTLE. CRC Press, Boca Raton, FL.
ISING, G. and K. BLOCK, 1984 A transposon as a cytogenetic marker in Drosophila melanogaster.. Mol. Gen. Genet. 196:6-16[Medline].
ISING, G., and C. RAMEL, 1976 The behavior of a transposing element in Drosophila melanogaster, pp. 947954 in Genetics and Biology of the Drosophila, edited by M. ASHBURNER and E. NOVITSKI. Academic Press, London.
KIMURA, M., 1977 Preponderance of synonymous changes as evidence for the neutral theory of molecular evolution. Nature 267:275-276[Medline].
KINDAHL, E. C., 1994 Recombination and DNA polymorphism on the third chromosome of Drosophila melanogaster. Ph.D. Thesis, Cornell University, Ithaca, NY.
KLIMAN, R. M., 1999 Recent selection on synonymous codon usage in Drosophila. J. Mol. Evol. 49:343-351[Medline].
KLIMAN, R. M. and J. HEY, 1993 Reduced natural selection associated with low recombination in Drosophila melanogaster.. Mol. Biol. Evol. 10:1239-1258[Abstract].
LI, W. H., C. I. WU, and C. C. LUO, 1985 A new method for estimating synonymous and nonsynonymous rates of nucleotide substitution considering the relative likelihood of nucleotide and codon changes. Mol. Biol. Evol. 2:150-174[Abstract].
MARAIS, G., D. MOUCHIROUD, and L. DURET, 2001 Does recombination improve selection on codon usage? Lessons from nematode and fly complete genomes. Proc. Natl. Acad. Sci. USA 98:5688-5692
MAYNARD SMITH, J. and J. HAIGH, 1974 The hitch-hiking effect of a favourable gene. Genet. Res. 23:23-35[Medline].
MCDONALD, J. H. and M. KREITMAN, 1991 Adaptive protein evolution at the Adh locus in Drosophila. Nature 351:652-654[Medline].
MCVEAN, G. A. and B. CHARLESWORTH, 2000 The effects of Hill-Robertson interference between weakly selected mutations on patterns of molecular evolution and variation. Genetics 155:929-944
MORIYAMA, E. N. and J. R. POWELL, 1997 Codon usage bias and tRNA abundance in Drosophila. J. Mol. Evol. 45:514-523[Medline].
MUNTE, A., M. AGUADÉ, and C. SEGARRA, 2001 Changes in the recombinational environment affect divergence in the yellow gene of Drosophila. Mol. Biol. Evol. 18:1045-1056
OTTO, S. P. and N. H. BARTON, 1997 The evolution of recombination: removing the limits to natural selection. Genetics 147:879-906[Abstract].
POWELL, J. R. and E. N. MORIYAMA, 1997 Evolution of codon usage bias in Drosophila. Proc. Natl. Acad. Sci. USA 94:7784-7790
RICE, W. R. and A. K. CHIPPINDALE, 2001 Sexual recombination and the power of natural selection. Science 294:555-559
SHARP, P. M. and K. M. DEVINE, 1989 Codon usage and gene expression level in Dictyostelium discoideum: highly expressed genes do prefer optimal codons. Nucleic Acids Res. 17:5029-5039
SHARP, P. M. and W. H. LI, 1986 An evolutionary perspective on synonymous codon usage in unicellular organisms. J. Mol. Evol. 24:28-38[Medline].
SHARP, P. M. and W. H. LI, 1987a The codon adaptation indexa measure of directional synonymous codon usage bias, and its potential applications. Nucleic Acids Res. 15:1281-1294
SHARP, P. M. and W. H. LI, 1987b The rate of synonymous substitution in enterobacterial genes is inversely related to codon usage bias. Mol. Biol. Evol. 4:222-230[Abstract].
SHARP, P. M., M. AVEROF, A. T. LLOYD, G. MATASSI, and J. F. PEDEN, 1995 DNA sequence evolution: the sounds of silence. Philos. Trans. R. Soc. Lond. B. Biol. Sci. 349:241-247[Medline].
SHIELDS, D., P. M. SHARP, D. G. HIGGINS, and F. WRIGHT, 1988 Silent sites in Drosophila genes are not neutral: evidence of selection among synonymous codons. Mol. Biol. Evol. 5:704-716[Abstract].
SORSA, V., 1988 Chromosome Maps of Drosophila. CRC Press, Boca Raton, FL.
STENICO, M., A. T. LLOYD, and P. M. SHARP, 1994 Codon usage in Caenorhabditis elegans: delineation of translational selection and mutational biases. Nucleic Acids Res. 22:2437-2446
SUN, F. L., M. H. CUAYCONG, C. A. CRAIG, L. L. WALLRATH, and J. LOCKE et al., 2000 The fourth chromosome of Drosophila melanogaster: interspersed euchromatic and heterochromatic domains. Proc. Natl. Acad. Sci. USA 97:5340-5345
WHITFIELD, L. S., R. LOVELL-BADGE, and P. N. GOODFELLOW, 1993 Rapid sequence evolution of the mammalian sex-determining gene sry. Nature 364:713-715[Medline].
WRIGHT, F., 1990 The effective number of codons used in a gene. Gene 87:23-29[Medline].
This article has been cited by other articles:
![]() |
J. F. Baines, S. A. Sawyer, D. L. Hartl, and J. Parsch Effects of X-Linkage and Sex-Biased Gene Expression on the Rate of Adaptive Protein Evolution in Drosophila |













