Genetics, Vol. 150, 767-775, October 1998, Copyright © 1998

The Correlation Between Synonymous and Nonsynonymous Substitutions in Drosophila: Mutation, Selection or Relaxed Constraints?

Josep M. Comerona and Martin Kreitmana
a Department of Ecology and Evolution, University of Chicago, Chicago, Illinois 60637

Corresponding author: Josep M. Comeron, Department of Ecology and Evolution, University of Chicago, 1101 E. 57th St., Chicago, IL 60637., jcomeron{at}midway.uchicago.edu (E-mail).

Communicating editor: J. HEY


*  ABSTRACT
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Codon usage bias, the preferential use of particular codons within each codon family, is characteristic of synonymous base composition in many species, including Drosophila, yeast, and many bacteria. Preferential usage of particular codons in these species is maintained by natural selection acting largely at the level of translation. In Drosophila, as in bacteria, the rate of synonymous substitution per site is negatively correlated with the degree of codon usage bias, indicating stronger selection on codon usage in genes with high codon bias than in genes with low codon bias. Surprisingly, in these organisms, as well as in mammals, the rate of synonymous substitution is also positively correlated with the rate of nonsynonymous substitution. To investigate this correlation, we carried out a phylogenetic analysis of substitutions in 22 genes between two species of Drosophila, Drosophila pseudoobscura and D. subobscura, in codons that differ by one replacement and one synonymous change. We provide evidence for a relative excess of double substitutions in the same species lineage that cannot be explained by the simultaneous mutation of two adjacent bases. The synonymous changes in these codons also cannot be explained by a shift to a more preferred codon following a replacement substitution. We, therefore, interpret the excess of double codon substitutions within a lineage as being the result of relaxed constraints on both kinds of substitutions in particular codons.


THE rate of synonymous evolution (Ks) in Drosophila genes is correlated with the base composition at synonymous sites and is negatively correlated with the degree of synonymous codon usage bias (SHIELDS et al. 1988 Down; SHARP and LI 1989 Down; MORIYAMA and GOJOBORI 1992 Down). In the bacterium Escherichia coli and the yeast Saccharomyces cerevisiae, both of which have a strongly biased codon usage, the degree of this bias is strongly positively correlated with the level of expression of the genes. In addition, the prevalent, or major, codon in each codon family is concordant with the most abundant tRNA (GROSJEAN and FIERS 1982 Down; SHARP et al. 1986 Down; SHARP and LI 1987 Down; BULMER 1991 Down). Both observations suggest that selection for codon bias acts at the level of translation. In Drosophila, selection pressures have also been mainly restricted to post-transcriptional events, and the nucleotide composition of synonymous sites within a gene cannot be explained by regional mutational biases (MORIYAMA and HARTL 1993 Down; KLIMAN and HEY 1994 Down). General translational efficiency has been proposed as the main factor determining both the rate of evolution and base composition at synonymous sites. In multicellular organisms, however, both the level of gene expression and tRNA abundances are difficult to quantify and can vary among tissues (CHEVALLIER and GAREL 1979 Down; GAREL 1982 Down) and developmental stages (WHITE et al. 1973 Down).

In Drosophila, bacteria, and also mammals, the rate of nonsynonymous substitution (Ka) is also positively correlated with the rate of synonymous substitution (Ks) (GRAUR 1985 Down; SHARP and LI 1987 Down; WOLFE and SHARP 1993 Down; AKASHI 1994 Down; COMERON and AGUADE 1996 Down). In mammals, the genome is structured into isochores of different nucleotide composition. The synonymous sites composition, and hence the codon usage, and the introns, reflect the different overall base composition of each isochore (AOTA and IKEMURA 1986 Down; BERNARDI and BERNARDI 1986 Down). Nucleotide compositional differences across the genome have been interpreted as being the result of differences in mutational patterns among isochores (IKEMURA 1985 Down; BULMER 1987 Down; FILIPSKY 1987) and do not exhibit a clear relationship with the rate of synonymous substitution (WOLFE et al. 1989 Down; WOLFE and SHARP 1993 Down; MOUCHIROUD et al. 1995 Down). Therefore, explanation of the correlation between Ks and Ka in mammals has focused on the possibility of double mutations at adjacent sites, hereafter called doublets. There is disagreement, however, as to whether doublet mutations are sufficiently common to explain the correlation (TICHER and GRAUR 1989 Down; WOLFE and SHARP 1993 Down; MOUCHIROUD et al. 1995 Down; OHTA and INA 1995 Down).

The basis of the correlation between Ks and Ka in Drosophila and prokaryotes is largely unexplored. In Drosophila, AKASHI 1994 Down found evidence for selection on translational accuracy by showing that the more highly functionally and evolutionarily conserved amino acids within a gene also had greater codon bias than less conserved amino acids. He proposed that selection for translational accuracy would lead to a positive correlation between Ks and Ka. The positive correlation between Ks and Ka in species with codon bias selection can also be explained by a hypothesis proposed by LIPMAN and WILBUR 1985 Down (and rejected) for mammals (WOLFE and SHARP 1993 Down; MOUCHIROUD et al. 1995 Down). Under their hypothesis, an amino acid replacement change, possibly driven by positive selection, will then favor a synonymous mutation in that amino acid for a more preferred codon.

No attempt has been made to test hypotheses to explain the correlation between Ks and Ka in Drosophila. We test these hypotheses by investigating the evolution of codons with single synonymous and single nonsynonymous substitutions between D. pseudoobscura and D. subobscura.


*  MATERIALS AND METHODS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

The genes and accession numbers used in this study are as follows: Adh (X78384, X62181, M15545), Adhr (X78384, Y00602, M55545), A/A-T/sesB (S43651, AF025798, AF025799), Aprt (M18432, L06281, AF025800), Arr2/ArrB (M32141, X54084), Bcd (X07870, X55735, X78058), Cp15 (X02497, X53423; BENSON 1995 Down), Cp16 (X16715, X53423), Cp18 (X02497, X53423), Cp19 (X02497, X53423; BENSON 1995 Down), Cpy1 (M62398, AF025803, AF025804), Ddc (X04426; WANG et al. 1996 Down), Eno (X17034, AF025805, AF025806), Esterase6/5b (J04167, M55907), Gad1 (X76198, AF025807, AF025808), Gapdh2 (M11255/256/259, AF025809, AF025810), Gart/ade3 (J02527, X06285), Gld (M29298/X07358/X13581/582, M29299, AF025811), Gpdh (X67650, U59682; WELLS 1996 Down), Mlc1 (M10125, L08052, AF025812), Pcp (J02527, X06285), Rh1/ninaE (K02135, X65877, AF025813), Rh2 (M12896, X65878), Rh3 (M17718, X65879), Rp49/RpL32 (X00848, S59382, M21333), sc (M17119, X96479), Sod (M24421, U47871, U47888), Sry-{alpha} (X03121, L19536, L19535), Sxl (M23636, X98370), Tpi (X57576/S70377, AF025814, AF025815), Ubx (X05723/24/25/27, X05179), Uro (X51940, X57113/S94076, AF025816), Xdh/ry (Y00307/308, M33977, Y08237), y (X04427, Y13909), and zen (X68347, X78058). Antp and hsp82 genes were not used in this study because for these genes the interspecific comparison is only possible for less than half of the entire coding sequence. Also, ATPsyn-ß and Vha14 genes were not used in the analyses because they exhibit significant variation in the synonymous substitution rate (P < 0.01) between lineages (ZENG et al. 1998 Down). All the other genes (longer than 100 codons) for which a comparison between D. melanogaster and both species of the obscura group, D. subobscura and D. pseudoobscura, is possible have been used in the analyses. The sequences were aligned after translation using CLUSTAL W (THOMPSON et al. 1994 Down) with minor manual adjustments to eliminate unnecessary gaps. The numbers of synonymous (Ks) and nonsynonymous (Ka) substitutions per site were estimated as described in COMERON 1995 Down. Ks1 and Ka1 also refer to the number of synonymous and nonsynonymous substitutions per site, respectively, but entail only those codons that differ among the homologous sequences by no or one position. The estimated numbers of substitutions per site were obtained by using the program K-Estimator v3.2 available upon request from J.M.C. or from ftp.bio.indiana.edu/molbio/mswin. Correlation probabilities were calculated by applying the z-transformation (z*) suggested by Hotelling (SOKAL and ROHLF 1995 Down; Chap. 15) for dealing with small sample size (n < 50).

Computer simulations:
We generated pseudorandom coding regions of 250 codons with a G + C content of 0.70 at third positions of codons. We assumed both selection coefficients on synonymous (ss) and nonsynonymous (sa) mutations to be constant within a gene and to follow a normal distribution of mean and variance of 1.1 and 0.35, and 4.4 and 1.4, respectively, among genes. We used the means Ss = -1.1 and Sa = -4.4 because they predict averages for Ks of 0.824 and for Ka of 0.082, very close to the observed averages between D. melanogaster and the obscura species, D. subobscura and D. pseudoobscura, assuming Kn = 1.5. The number of synonymous and nonsynonymous substitutions was obtained from a Poisson distribution with mean the product of the value K, predicted by s following KIMURA 1983 Down, and the number of sites under analysis. A transition:transversion ratio of 2:1 was applied. Partial correlations between ss and nonsynonymous sa were obtained by ss = fc x w x N(a) + (1 - fc) x N(s), where fc indicates the fraction of variance of ss explained by sa, w the average ratio ss/sa, and N(a) and N(s) the independent values obtained from the normal distribution of selection coefficients on nonsynonymous and synonymous mutations, respectively.


*  RESULTS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

As expected, we observed a significant positive correlation between Ks and Ka between D. melanogaster and the obscura group species (D. pseudoobscura and D. subobscura) for the 35 genes under analysis (r = 0.452, P = 0.006; Table 1 and Figure 1). This correlation is also detected when those codons with three substitutions are not taken into account (r = 0.437, P = 0.008). In Drosophila, one or two codons are preferred in each codon family (SHIELDS et al. 1988 Down; MORIYAMA and GOJOBORI 1992 Down; AKASHI 1995 Down), and they are always C- or G-ending. In genes with biased codon usage, most replacement changes will also be a mutation to another preferred codon for the new amino acid based on the definition of preferred codons proposed by AKASHI 1995 Down. Nevertheless, about 18% of replacement changes in a preferred codon generate an unpreferred codon; the exact percentage depends on the amino acid composition. Thus, one might expect natural selection to favor preferred synonymous mutations (from unpreferred to preferred codons or to a more frequent synonymous codon) in those codons where the first nonsynonymous substitution has been to an unpreferred codon (LIPMAN and WILBUR 1985 Down). On an evolutionary time scale, this codon selection is expected to lead to a relative excess of codons with both synonymous and nonsynonymous substitutions. We will refer to this adaptive explanation as hypothesis i.



View larger version (7K):
In this window
In a new window
Download PPT slide
 
Figure 1. Relationship between Ks and Ka for the compared 35 genes between D. melanogaster and D. subobscura or D. pseudoobscura.


 
View this table:
In this window
In a new window

 
Table 1. Divergence estimates for the 35 genes under study

Two other hypotheses have been proposed to explain this correlation: (ii) an excess of double mutation at adjacent sites, i.e., doublet mutations (WOLFE and SHARP 1993 Down), and (iii) correlated selective constraints on synonymous and nonsynonymous sites over the entire gene, or (iiia) over particular codon positions, where less conserved proteins or amino acids also have relaxed constraints on synonymous sites (LIPMAN and WILBUR 1985 Down; AKASHI 1994 Down). Selection for translational accuracy has been suggested as a driving force for correlated selective constraints predicted under hypothesis iii (AKASHI 1994 Down).

Doublet mutations do not appear to be the main cause of the correlation between Ks and Ka in these genes. According to the doublet mutation hypothesis ii, almost half (47.8%) of adjacent mutations, involving one synonymous and one nonsynonymous mutation, are expected to be in the third position of a codon and in the first position of the next codon because most, but not all (95.4%), substitutions at the first position of a codon are nonsynonymous, and 59.8% of substitutions at the third position are synonymous (substitutions at the second position are always nonsynonymous). Thus, under this hypothesis, Ks1 and Ka1 (see MATERIALS AND METHODS) should be positively correlated (WOLFE and SHARP 1993 Down) because adjacent mutations in the third position of a codon and the first position of the next codon, will usually be a synonymous and nonsynonymous change, respectively. Contradicting this prediction, there is no evidence of a positive correlation in the 35 genes between Ks1 and Ka1 (Table 1) for those codons with no or only one substitution (r = 0.159, P = 0.34). This lack of evidence for a positive correlation between Ks1 and Ka1 also rules out correlated selection pressures for the whole gene on both kinds of substitutions (hypothesis iii). The observed correlation between Ks and Ka must be due to a relative excess of two substitutions (one synonymous and one nonsynonymous) in the same codon. To further test this excess, we have studied the presence of synonymous and nonsynonymous substitutions in codons in the 22 genes where the comparison of the two obscura species (D. subobscura and D. pseudoobscura) is possible. A G-test of independence for the presence of one synonymous and one nonsynonymous substitution in the same codon reveals a strong excess of doubly substituted codons (G = 22.95, P < 0.0001; Table 2). A further analysis at the codon position of the synonymous and nonsynonymous substitutions in doubly substituted codons allowed us to compare the observed frequencies of substituted positions (first-second, first-third, and second-third) with the expected frequencies on the basis of the frequency of synonymous substitutions at the different codon positions assuming a random coding sequence. A test of goodness of fit (Table 3) fails in detecting an excess of codons substituted at adjacent positions (G = 0.793, P = 0.673, with the Williams correction for continuity). This result confirms that doublet and adjacent mutations are not the cause of the excess of codons with both one synonymous and one nonsynonymous substitution.


 
View this table:
In this window
In a new window

 
Table 2. G-test of independence for the presence of synonymous and nonsynonymous substitutions in the same codon between D. subobscura and D. pseudoobscura


 
View this table:
In this window
In a new window

 
Table 3. Substituted positions in codons with one synonymous and one nonsynonymous substitution between D. subobscura and D. pseudoobscura

Hypotheses i and iiia, adaptive synonymous substitutions and covarying constraints on particular codons, make distinguishable predictions about the independent occurrence of double mutations in the same codon. Under the covarying constraints hypothesis, independent occurrences of synonymous and replacement substitutions are expected to accumulate in more weakly constrained codons. In contrast, the adaptive substitution hypothesis predicts nonindependence of substitutions. To distinguish between these two predictions, we have carried out a phylogenetic analysis of substitutions in those doubly substituted codons. If synonymous and nonsynonymous substitutions in the same codon are independent events, then they will be expected to have occurred at equal frequency as single mutations in each lineage leading to D. pseudoobscura and D. subobscura and as double mutations in the same species lineage. The adaptive hypothesis, otherwise, predicts an excess occurrence of the two substitutions in the same species lineage.

We first identified all codons with one synonymous and one nonsynonymous substitution when comparing D. subobscura and D. pseudoobscura sequences. We then used D. melanogaster sequences for the 22 genes whose sequences are available in the three species (ZENG et al. 1998 Down; see Table 4) to identify the ancestral sequence for each of these codons. Of the 35 codons with one synonymous and one nonsynonymous substitution between D. subobscura and D. pseudoobscura, and for which we could unambiguously assign each substitution to one branch, a statistically significant number of them, 27, had both substitutions in one obscura lineage ({chi}2 = 10.31, P = 0.0013). Also, 48.1% of these codons show nonadjacent substitutions (first and third position), which is very close to the expected 47.8% (see above). Synonymous and nonsynonymous substitutions in the same codon are not independent occurrences, and they cannot be explained by doublet, and adjacent, mutations or by correlated selective constraints on particular codons. Only the adaptive substitution hypothesis is compatible with the data.


 
View this table:
In this window
In a new window

 
Table 4. Number of codons with one synonymous and one nonsynonymous substitution between D. subobscura and D. pseudoobscura

To investigate the possible selective mechanism driving the nonindependent occurrence of synonymous and nonsynonymous substitutions, we considered a further prediction of the adaptive substitution hypothesis. Under any equilibrium model of codon bias maintained by selection, the expected rates of fixation of preferred and unpreferred mutations must be equal, while mutation pressure favors the appearance of unpreferred codons. In contrast, under the adaptive substitution hypothesis for doubly substituted codons, preferred substitutions are expected to exceed unpreferred substitutions. This relative excess of preferred substitutions is expected because the fixation of a nonsynonymous substitution (either by selection or drift) that generates an unpreferred codon will lead to positive selection for a subsequent synonymous substitution to the new preferred codon for that amino acid. The selection mechanism underlying adaptive synonymous substitution is not critical to this argument: selection may be to enhance either translational efficiency or accuracy. The important point is that the consequence of this selection will be to increase the substitution rate of preferred substitutions compared to unpreferred substitutions in the codons that have both synonymous and nonsynonymous substitutions occurring in the same lineage.

The data do not support this prediction. Under the assumption that a nonsynonymous substitution preceded a synonymous substitution, 65% of the inferred ancestral codons are preferred (for the 20 out of the 27 codons that conform to this assumption), while this percentage becomes 40 and 30% for the intermediate and doubly substituted codons, respectively. Thus, there is no evidence of selection for either a preferred synonymous substitution following a nonsynonymous substitution, or equilibrium between preferred and unpreferred codons. A G-test of independence among the number of preferred and unpreferred codons of the ancestral and doubly substituted codons reveals a smaller ratio of preferred to unpreferred codons in this latter group (G = 4.837, P = 0.0279 with the Williams correction). The expected frequency of preferred and unpreferred codons on the basis of random mutation can be estimated from the inferred ancestral codons and a mutational bias toward an A + T to G + C ratio of 60/40. The frequency of preferred codons consequence of both the first nonsynonymous substitution and the subsequent synonymous substitution in the same codon can be explained by random fixation of both the nonsynonymous ({chi}2 = 0.695, P = 0.404) and the synonymous ({chi}2 = 1.377, P = 0.241) mutations. By the same argument it is also possible to reject the converse of this adaptive hypothesis for doubly mutated codons, in which the initial occurrence of an unpreferred synonymous substitution established positive selection for an amino acid replacement that would be a more preferred codon (see below). From this analysis, and the preceding ones, we conclude that the data do not support any of the three hypotheses as explanations for the observed correlation between synonymous and replacement rates among genes. An additional alternative hypothesis to explain the data will be considered below.


*  DISCUSSION
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Synonymous and replacement substitutions in the same codon:
Our interest in studying codons differing by both a synonymous and an amino acid replacement substitution was motivated by the knowledge that synonymous and amino acid replacement substitution rates are positively correlated in Drosophila genes. The detection of an excess of codons with both a synonymous and a nonsynonymous substitution, not explained by adjacent mutations, confirms that natural selection is acting at the codon, post-transcriptional, level in Drosophila. Thus, codon selection can give rise to coupled synonymous and nonsynonymous substitutions in the same codon. This result does not, however, rule out selection also acting on synonymous sites at the level of mRNA to maintain functional secondary structures (PARSH et al. 1997 Down; M. ANTEZANA and M. KREITMAN, unpublished data).

The analysis of doubly substituted codons between D. psedoobscura and D. subobscura, using D. melanogaster as an outgroup species, in a relatively large number of genes (22), allowed us to test several hypotheses. The discovery of a greater than expected number of these codons with both substitutions occurring in the same species lineage provides evidence for the nonindependence of replacement and synonymous substitutions in these codons. This nonindependence allowed us to reject the hypothesis of differential constraints among codons, predicted by the translational accuracy model.

The nonindependence of replacement and synonymous substitutions in doubly substituted codons is unlikely to be an artifact of the procedure used to assign changes to each lineage. The possible misassignment of a substitution to the wrong lineage, which might have occurred if the same mutational change had also occurred in the D. melanogaster lineage, or if a codon had multiple substitutions at the same nucleotide position, would be expected to obscure rather than enhance the nonrandom pattern of substitution we observed. This finding is also not likely to be due to unequal substitution rates in the D. pseudoobscura and the D. subobscura lineages, because these rates are very similar (ZENG et al. 1998 Down) and because the doubly substituted codons were found to be present in both lineages at nearly equal frequencies (13 and 14). Nor can the nonindependence of doubly substituted codons be due to any systematic shift in codon usage in any of the three lineages. In a separate study, we found the codon usage in these genes to be remarkably similar for almost all of the codon families (KREITMAN and ANTEZANA 1998 Down). Finally, the possibility of a single mutational event causing base changes at two adjacent positions can also be eliminated as the cause of this nonindependence of substitutions.

We therefore considered models of selection acting within the context of individual codons, i.e., codon selection. One attractive possibility is the idea that selection drives synonymous substitutions to a new preferred codon following the substitution of an amino acid replacement substitution to an unpreferred codon. Alternatively, one might also consider the reverse process, an adaptive substitution of an amino acid replacement substitution to a new preferred codon following a synonymous substitution to an unpreferred codon. Unfortunately, the evidence does not support either of these adaptive explanations: The majority of synonymous (or replacement) substitutions in these doubly substituted codons were to unpreferred rather than preferred codons. Thus, the second substitution, whichever it is, is not likely to be the result of positive codon selection.

An ad hoc explanation for the observed patterns of change in doubly substituted codons can be invoked by assuming a lineage-specific relaxation of selective constraints on particular amino acids within a protein. This relaxed selection at the amino acid level will allow for an increase in the amino acid replacement substitution rate, as well as for an increase in the synonymous substitution rate. The latter is expected to occur because translational accuracy will also be relaxed in these codons. Under this scenario, the first substitution is likely to be a synonymous change to a less preferred codon. The inferred ancestral codons are 68% preferred (for the 25 codons that conform to this assumption) whereas the intermediate codons, after the synonymous substitution, are 24% preferred. The clear increment of unpreferred codons can be explained by mutation on the basis of the expected frequency of preferred and unpreferred codons (see previous section; {chi}2 = 0.802, P = 0.37). The increased occurrence of the second, nonsynonymous, substitution would not require the action of positive selection but only the relaxation of selective constraints on amino acid substitutions in this codon ({chi}2 = 0.153, P = 0.696, based on the observed number of preferred codons and the expected frequency due to mutation). Such an explanation differs from the already rejected differential constraints hypothesis only in the additional requirement that selective constraints on individual codons change over time within a species lineage. This scenario is reminiscent of one aspect of Fitch's covarion model of protein evolution (FITCH 1971 Down; MIYAMOTO and FITCH 1995 Down) in its proposal that the substitution of one amino acid in a protein, either by drift or selection, influences the functional constraints on other amino acid positions. (Our ad hoc explanation for doubly substituted codons, however, does not claim for a constant fraction of invariable positions.) Under our model, shifting relaxed constraints at the amino acid level is extended to include codon selection by including translational accuracy.

Our analysis indicates, therefore, that only a covarion-like model can explain the observed excess of doubly substituted codons on single-species lineages. Can the covarion-like model also account for the correlation between Ks and Ka observed in Drosophila data (Figure 1)? And if so, does it provide a more plausible explanation for this correlation than does a noncovarion model? To investigate these questions, we first explored the expected relationship between Ks and Ka when selection coefficients for synonymous (ss) and replacement (sa) mutations are completely correlated, and selection is constant for all sites (i.e., a noncovarion substitution model). Following KIMURA 1983 Down,

where Kn is the fixation probability of neutral mutations and S = 4 Ne s. Figure 2 shows that for linearly related selection coefficients for the two types of mutations, Ks and Ka are positively related, and that the relationship is a strongly nonlinear one. In fact, a similar trend can be observed in the data comparing synonymous and replacement changes between D. melanogaster and D. subobscura or D. pseudoobscura (Figure 1). We have also investigated by simulation the correlation between Ks and Ka for this same model, but when the selection coefficients on synonymous and replacement mutations are not completely correlated. As indicated in Table 5, a correlation between Ks and Ka in the range observed for Drosophila requires a very strong correlation between ss and sa, approaching a value of 1.



View larger version (12K):
In this window
In a new window
Download PPT slide
 
Figure 2. Expected nonlinear relationship between Ks and Ka based on a linear relationship between selection coefficients on synonymous (ss) and nonsynonymous (sa) mutations assuming a number of neutral substitutions per site of 1.5.


 
View this table:
In this window
In a new window

 
Table 5. Correlation coefficients between Ks and Ka obtained by computer simulation

Table 5 also gives the results of simulations for a covarion-like model, and Figure 3 shows the data generated for one of the conditions examined. Under this model a fraction, fc, of amino acid substitutions generates another amino acid substitution as well as a synonymous substitution. As shown in the table, the covarion-like model predicts moderately strong correlations between Ks and Ka for more permissive conditions of the relationship between ss and sa than the alternative model. This covarion-like model is therefore at least a viable explanation for the observed correlation between Ks and Ka seen in Drosophila data.



View larger version (20K):
In this window
In a new window
Download PPT slide
 
Figure 3. Relationship between Ks and Ka obtained by computer simulation (see MATERIALS AND METHODS) under the proposed covarion-like model assuming that selection coefficients on nonsynonymous sites (sa) can explain 50% of the variance of the selection coefficients on synonymous sites (ss); 1000 replicates, fc = 0.5 (see text for details).

This analysis of the covarion-like model, together with the detected excess of codons with doubly substituted codons in the same species lineage (and with a tendency toward unpreferred codons), support a covarion hypothesis of shifting selective constraints over time on individual amino acids. We propose that selection at the level of translational accuracy couples synonymous and nonsynonymous substitutions in codons with relaxed constraints.


*  ACKNOWLEDGMENTS

We are grateful to P. Andolfatto, M. Antezana, C. Bergman, A. Llopart, E. Stahl, C. Toomajian, and M. Przeworski for helpful discussions and/or comments on the manuscript. We also thank J. Hey and one anonymous reviewer for valuable comments that substantially improved the manuscript. J.M.C. is funded by a Postdoctoral Fellowship from Ministerio de Educación y Ciencia, Spain. This work was supported by a National Institutes of Health grant GM-39355 to M.K.

Manuscript received January 22, 1998; Accepted for publication July 9, 1998.


*  LITERATURE CITED
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

AKASHI, H., 1994  Synonymous codon usage in Drosophila melanogaster: Natural selection and translational accuracy. Genetics 136:927-935[Abstract].

AKASHI, H., 1995  Inferring weak selection from patterns of polymorphism and divergence at "silent" sites in Drosophila DNA. Genetics 139:1067-1076[Abstract].

AOTA, S. and T. IKEMURA, 1986  Diversity in G+C content at the third positions of codons in vertebrate genes and its cause. Nucleic Acids Res. 14:6345-6355[Abstract/Free Full Text].

BENSON, A. R., 1995 The molecular evolution of the obscura group Chorion s15: a prominent role for codon bias. Ph.D. Thesis, Harvard University.

BERNARDI, G. and G. BERNARDI, 1986  Compositional constraints and genome evolution. J. Mol. Evol. 24:1-11[Medline].

BULMER, M., 1987  Coevolution of codon usage and tRNA abundance. Nature 325:728-730[Medline].

BULMER, M., 1991  The selection-mutation-drift theory of synonymous codon usage. Genetics 129:897-907[Abstract].

CHEVALLIER, A. and J. P. GAREL, 1979  Studies on tRNA adaptation, tRNA turnover, precursor tRNA and tRNA gene distribution in Bombix mori by using two-dimensional polyacrylmide gel electrophoresis. Biochemie 61:245-262[Medline].

COMERON, J. M., 1995  A method for estimating the numbers of synonymous and nonsynonymous substitutions per site. J. Mol. Evol. 41:1152-1159[Medline].

COMERON, J. M. and M. AGUADÉ, 1996  Synonymous substitutions in the Xdh gene of Drosophila: Heterogeneous distribution along the coding region. Genetics 144:1053-1062[Abstract].

FILIPSKI, J., 1987  Correlation between molecular clock ticking, codon usage, fidelity of DNA repair, chromosome banding and chromatin compactness in germline cells. FEBS Lett. 217:184-186[Medline].

FITCH, W. M., 1971  Rate of change of concomitantly variable codons. J. Mol. Evol. 1:84-96[Medline].

GAREL, J. P., 1982  The silkworm, a model for molecular and cellular biologist. Trends Biochem. Sci. 7:105-108.

GRAUR, D., 1985  Amino acid composition and the evolutionary rates of protein-coding genes. J. Mol. Evol. 22:53-63[Medline].

GROSJEAN, H. and W. FIERS, 1982  Preferential codon usage in prokaryotic genes: the optimal codon-anticodon interaction energy and the selective codon usage in efficiently expressed genes. Gene 18:199-209[Medline].

IKEMURA, T., 1985  Codon usage and tRNA content in unicellular and multicellular organisms. Mol. Biol. Evol. 2:13-34[Abstract].

JUKES, T. H., and C. R. CANTOR, 1969 Evolution of protein molecules, pp. 21–132 in Mammalian Protein Metabolism III, edited by H. N. MUNRO. Academic Press, New York.

KIMURA, M., 1980  A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. J. Mol. Evol. 16:111-120[Medline].

KIMURA, M., 1983 The Neutral Theory of Molecular Evolution. Cambridge University Press, Cambridge, UK.

KLIMAN, R. M. and J. HEY, 1994  The effects of mutation and natural selection on codon bias in the genes of Drosophila. Genetics 137:1049-1056[Abstract].

KREITMAN, M., and M. ANTEZANA, 1998 The population and evolutionary genetics of codon bias, in Evolutionary Genetics from Molecules to Morphology, edited by R. SING and C. KRIMBAS. Columbia University Press, New York. (in press).

LIPMAN, D. J. and W. J. WILBUR, 1985  Interaction of silent and replacement changes in eukaryotic coding sequences. J. Mol. Evol. 21:161-167.

MIYAMOTO, M. M. and W. M. FITCH, 1995  Testing the covarion hypothesis of molecular evolution. Mol. Biol. Evol. 12:503-513[Abstract].

MORIYAMA, E. N. and T. GOJOBORI, 1992  Rates of synonymous substitution and base composition of nuclear genes in Drosophila. Genetics 130:855-864[Abstract].

MORIYAMA, E. N. and D. L. HARTL, 1993  Codon usage bias and base composition of nuclear genes in Drosophila. Genetics 134:847-858[Abstract].

MOUCHIROUD, D., C. GAUTIER, and G. BERNARDI, 1995  Frequencies of synonymous substitutions in mammals are gene-specific and correlated with frequencies of nonsynonymous substitutions. J. Mol. Evol. 40:107-113[Medline].

OHTA, T. and Y. INA, 1995  Variation in synonymous substitution rates among mammalian genes and the correlation between synonymous and nonsynonymous divergences. J. Mol. Evol. 41:717-720[Medline].

PARSH, J., S. TANDA, and W. STEPHAN, 1997  Site-directed mutations reveal long-range compensatory interactions in the Adh gene of Drosophila melanogaster. Proc. Natl. Acad. Sci. USA 94:928-933[Abstract/Free Full Text].

SHARP, P. M. and W.-H. LI, 1987  The rate of synonymous substitution in enterobacterial genes is inversely related to codon usage bias. Mol. Biol. Evol. 4:222-230[Abstract].

SHARP, P. M. and W.-H. LI, 1989  On the rate of DNA sequence evolution in Drosophila. J. Mol. Evol. 28:398-402[Medline].

SHARP, P. M., T. M. F. TUOHY, and K. R. MOSURSKI, 1986  Codon usage in yeast: cluster analysis clearly differentiates highly and lowly expressed genes. Nucleic Acids Res. 14:5125-5139[Abstract/Free Full Text].

SHIELDS, D. C., P. M. SHARP, D. G. HIGGINS, and F. WRIGHT, 1988  "Silent" sites in Drosophila genes are not neutral: evidence of selection among synonymous codons. Mol. Biol. Evol. 5:704-716[Abstract].

SOKAL, R. R., and F. J. ROHLF, 1995 Biometry, Ed. 3. W. H. Freeman and Co., New York.

THOMPSON, J. D., D. G. HIGGINS, and T. J. GIBSON, 1994  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, positions-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22:4673-4680[Abstract/Free Full Text].

TICHER, A. and D. GRAUR, 1989  Nucleic acid composition, codon usage, and rate of synonymous substitution in protein-coding genes. J. Mol. Evol. 28:286-298[Medline].

WANG, D., J. L. MARSH, and F. J. AYALA, 1996  Evolutionary changes in the expression pattern of a developmentally essential gene in three Drosophila species. Proc. Natl. Acad. Sci. USA 93:7103-7107[Abstract/Free Full Text].

WELLS, R. S., 1996  Nucleotide variation at the Gpdh locus in the genus Drosophila. Genetics 143:375-384[Abstract].

WHITE, B. N., G. M. TENER, J. HOLDEN, and D. T. SUZUKI, 1973  Analysis of tRNAs during the development of Drosophila. Dev. Biol. 33:185-195[Medline].

WOLFE, K. H. and P. M. SHARP, 1993  Mammalian gene evolution: Nucleotide sequence divergence between mouse and rat. J. Mol. Evol. 37:441-456[Medline].

WOLFE, K. H., P. M. SHARP, and W.-H. LI, 1989  Mutation rates differ among regions of the mammalian genome. Nature 337:283-285[Medline].

ZENG, L.-W., J. M. COMERON, B. CHEN, and M. KREITMAN, 1998  The molecular clock revisited: the rate of synonymous vs. replacement change in Drosophila. Genetica 102(103):369-382.




This article has been cited by other articles:


Home page
Proc. Natl. Acad. Sci. USAHome page
J. M. Comeron
Weak selection and recent mutational changes influence polymorphic synonymous mutations in humans
PNAS, May 2, 2006; 103(18): 6940 - 6945.
[Abstract] [Full Text] [PDF]


Home page
Hum Mol GenetHome page
M. Fornage, C. R. Lee, P. A. Doris, M. S. Bray, G. Heiss, D. C. Zeldin, and E. Boerwinkle
The soluble epoxide hydrolase gene harbors sequence variation associated with susceptibility to and protection from incident ischemic stroke
Hum. Mol. Genet., October 1, 2005; 14(19): 2829 - 2837.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
T. M. Hambuch and J. Parsch
Patterns of Synonymous Codon Usage in Drosophila melanogaster Genes With Sex-Biased Expression
Genetics, August 1, 2005; 170(4): 1691 - 1700.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
N. Cobbe and M. M. S. Heck
The Evolution of SMC Proteins: Phylogenetic Analysis and Structural Implications
Mol. Biol. Evol., February 1, 2004; 21(2): 332 - 347.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
Y. Lu and M. D. Rausher
Evolutionary Rate Variation in Anthocyanin Pathway Genes
Mol. Biol. Evol., November 1, 2003; 20(11): 1844 - 1853.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
H.-Y. Wang, H. Tang, C.-K. J. Shen, and C.-I Wu
Rapidly Evolving Genes in Human. I. The Glycophorins and Their Possible Role in Evading Malaria Parasites
Mol. Biol. Evol., November 1, 2003; 20(11): 1795 - 1804.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
T. Domazet-Loso and D. Tautz
An Evolutionary Analysis of Orphan Genes in Drosophila
Genome Res., October 1, 2003; 13(10): 2213 - 2219.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Bot.Home page
Y.-C. Chiang, B. A. Schaal, C.-H. Chou, S. Huang, and T.-Y. Chiang
Contrasting selection modes at the Adh1 locus in outcrossing Miscanthus sinensis vs. inbreeding Miscanthus condensatus (Poaceae)
Am. J. Botany, April 1, 2003; 90(4): 561 - 570.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
K. S. Zigler and H. A. Lessios
Evolution of Bindin in the Pantropical Sea Urchin Tripneustes: Comparisons to Bindin of Other Genera
Mol. Biol. Evol., February 1, 2003; 20(2): 220 - 231.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
A. J. Betancourt and D. C. Presgraves
Linkage limits the power of natural selection in Drosophila
PNAS, October 15, 2002; 99(21): 13616 - 13620.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
J. M. Comeron and M. Kreitman
Population, Evolutionary and Genomic Consequences of Interference Selection
Genetics, May 1, 2002; 161(1): 389 - 410.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
K. J. Schmid and C. F. Aquadro
The Evolutionary Analysis of ""Orphans"" From the Drosophila Genome Identifies Rapidly Diverging and Incorrectly Annotated Genes
Genetics, October 1, 2001; 159(2): 589 - 598.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
S. Campesan, D. Chalmers, F. Sandrelli, A. Megighian, A. A. Peixoto, R. Costa, and C. P. Kyriacou
Comparative Analysis of the nonA Region in Drosophila Identifies a Highly Diverged 5' Gene That May Constrain nonA Promoter Evolution
Genetics, February 1, 2001; 157(2): 751 - 764.
[Abstract] [Full Text]


Home page
GeneticsHome page
K. A. Dunn, J. P. Bielawski, and Z. Yang
Substitution Rates in Drosophila Nuclear Genes: Implications for Translational Selection
Genetics, January 1, 2001; 157(1): 295 - 305.
[Abstract] [Full Text]


Home page
GeneticsHome page
F. Alvarez-Valin, J. F. Tort, and G. Bernardi
Nonrandom Spatial Distribution of Synonymous Substitutions in the GP63 Gene From Leishmania
Genetics, August 1, 2000; 155(4): 1683 - 1692.
[Abstract] [Full Text]


Home page
ScienceHome page
M. Averof, A. Rokas, K. H. Wolfe, and P. M. Sharp
Evidence for a High Frequency of Simultaneous Double-Nucleotide Substitutions
Science, February 18, 2000; 287(5456): 1283 - 1286.
[Abstract] [Full Text]


Home page
GeneticsHome page
A. Llopart and M. Aguadé
Synonymous Rates at the RpII215 Gene of Drosophila: Variation Among Species and Across the Coding Region
Genetics, May 1, 1999; 152(1): 269 - 280.
[Abstract] [Full Text]


Home page
GeneticsHome page
J. M. Comeron, M. Kreitman, and M. Aguadé
Natural Selection on Synonymous Sites Is Correlated With Gene Length and Recombination in Drosophila
Genetics, January 1, 1999; 151(1): 239 - 249.
[Abstract] [Full Text]


Home page
Proc. Natl. Acad. Sci. USAHome page
A. Wagner
Decoupled evolution of coding region and mRNA expression patterns after gene duplication: Implications for the neutralist-selectionist debate
PNAS, June 6, 2000; 97(12): 6579 - 6584.
[Abstract] [Full Text] [PDF]