Genetics, Vol. 165, 1221-1232, November 2003, Copyright © 2003

Protein Evolution and Codon Usage Bias on the Neo-Sex Chromosomes of Drosophila miranda

Doris Bachtroga
a Institute of Cell, Animal and Population Biology, University of Edinburgh, Edinburgh EH9 3JT, United Kingdom

Corresponding author: Doris Bachtrog, 227 Biotechnology Bldg., Cornell University, Ithaca, NY 14853., doris.bachtrog{at}cornell.edu (E-mail)

Communicating editor: D. BEGUN


*  ABSTRACT
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

The neo-sex chromosomes of Drosophila miranda constitute an ideal system to study the effects of recombination on patterns of genome evolution. Due to a fusion of an autosome with the Y chromosome, one homolog is transmitted clonally. Here, I compare patterns of molecular evolution of 18 protein-coding genes located on the recombining neo-X and their homologs on the nonrecombining neo-Y chromosome. The rate of protein evolution has significantly increased on the neo-Y lineage since its formation. Amino acid substitutions are accumulating uniformly among neo-Y-linked genes, as expected if all loci on the neo-Y chromosome suffer from a reduced effectiveness of natural selection. In contrast, there is significant heterogeneity in the rate of protein evolution among neo-X-linked genes, with most loci being under strong purifying selection and two genes showing evidence for adaptive evolution. This observation agrees with theory predicting that linkage limits adaptive protein evolution. Both the neo-X and the neo-Y chromosome show an excess of unpreferred codon substitutions over preferred ones and no difference in this pattern was observed between the chromosomes. This suggests that there has been little or no selection maintaining codon bias in the D. miranda lineage. A change in mutational bias toward AT substitutions also contributes to the decline in codon bias. The contrast in patterns of molecular evolution between amino acid mutations and synonymous mutations on the neo-sex-linked genes can be understood in terms of chromosome-specific differences in effective population size and the distribution of selective effects of mutations.


IN natural populations, the effective population size, Ne, is a key variable that determines the magnitude of random sampling effects on gene frequencies (i.e., genetic drift; KIMURA 1983 Down). Ne varies among species and across different regions of the genome within a species. In outcrossing taxa, sex chromosomes or mitochondria, for example, have a lower Ne than that of autosomes due to sex-specific inheritance and differences in ploidy levels (CABALLERO 1995 Down). Moreover, the recombinational landscape of a chromosome can also have an influence on Ne. Under the standard neutral model of sequence evolution, the amount of neutral variability—polymorphisms with no significant fitness effects—is directly proportional to the effective population size and the mutation rate (KIMURA 1983 Down). However, a locus under directional selection can increase the magnitude of random sampling effects at closely linked loci, thereby enhancing the influence of genetic drift (HILL and ROBERTSON 1966 Down). This "Hill-Robertson effect" predicts lower levels of neutral variability in low-recombining regions, due to the reduction in Ne imposed by linked selected sites. In accordance with this prediction, regions of reduced or no recombination in Drosophila harbor lower levels of variation at silent sites (BEGUN and AQUADRO 1992 Down; ZUROVCOVA and EANES 1999 Down; BACHTROG and CHARLESWORTH 2002 Down).

Population genetics theory predicts that Ne is a key determinant of the efficacy of natural selection. A mutation is effectively neutral (i.e., its fate is mostly determined by random genetic drift) if the product of the effective population size Ne and the selection coefficient s is below unity (|Nes| < 1; KIMURA 1983 Down). Thus, since selection at linked loci decreases Ne (the Hill-Robertson effect), the efficacy of selection is also expected to be reduced in regions of restricted recombination. Some empirical evidence suggests that the overall level of adaptation is indeed reduced in regions of the genome where genetic recombination is reduced or absent (WEINREICH and RAND 2000 Down; BACHTROG and CHARLESWORTH 2002 Down; BETANCOURT and PRESGRAVES 2002 Down). Clonally transmitted genomes like organelles show reduced levels of adaptation with respect to protein sequences (LYNCH and BLANCHARD 1998 Down; WEINREICH and RAND 2000 Down). Codon bias—the nonrandom usage of codons encoding the same amino acid—is reduced in regions of low recombination in Drosophila melanogaster (KLIMAN and HEY 1993 Down), consistent with Hill-Robertson effects (but see MARAIS and PIGANEAU 2002 Down). Also, the rate of replacement substitution is reduced in low-recombining regions in the D. melanogaster genome, compatible with a lower rate of protein adaptation in regions of restricted recombination (BETANCOURT and PRESGRAVES 2002 Down).

The Y chromosome provides an extreme example of the effects of genetic linkage on patterns of genome evolution. In species with male heterogamety, the Y chromosome lacks genetic recombination over most or all of its length. The X and Y chromosomes are thought to have descended from an ordinary pair of autosomes, with the almost complete erosion of gene function on the Y being a direct consequence of its lack of sexual recombination (BULL 1983 Down; CHARLESWORTH 1990 Down). Well-studied Y chromosome systems, like those of humans or D. melanogaster, are ancient and have lost most of their active genes (BACHTROG and CHARLESWORTH 2001 Down; CARVALHO et al. 2001 Down; LAHN et al. 2001 Down). Since the magnitude of interference between loci depends on the number of loci under selection (LI 1987 Down; MCVEAN and CHARLESWORTH 2000 Down), the opportunity for detecting the signatures of Hill-Robertson effects is considerably reduced compared to the early stages of Y chromosome evolution (CHARLESWORTH and CHARLESWORTH 2000 Down). Species where a neo-Y/neo-X chromosome pair has recently been formed by a fusion between an autosome and a sex chromosome offer excellent opportunities to study the processes involved in Y chromosome degeneration (LUCCHESI 1978 Down; CHARLESWORTH 1996 Down; STEINEMANN and STEINEMANN 1998 Down), since those systems still contain many functional loci under selection. As a result of such a fusion, one member of the pair of autosomes (the neo-Y) is transmitted from father to son like the true Y chromosome, while its homolog (the neo-X) cosegregates with the X (LUCCHESI 1978 Down). In Drosophila, the absence of crossing over in males ensures that such a neo-Y chromosome is genetically isolated from its partner (GETHMANN 1988 Down).

In D. miranda, a close relative of D. pseudoobscura, the Y chromosome has become fused to an autosome (element C; MULLER 1940 Down; see Fig 1). There has been partial loss of gene activity on the neo-Y, with some genes still retaining their activity (BACHTROG and CHARLESWORTH 2002 Down), others showing little or no expression (STEINEMANN et al. 1993 Down), and others having been apparently completely lost (STEINEMANN and STEINEMANN 1999 Down). This degeneration is associated with the acquisition of dosage compensation by some genes on the neo-X, by the standard Drosophila molecular mechanism (MARIN et al. 2000 Down). DNA sequence divergence between neo-X and neo-Y homologs indicates that they began to diverge ~1–1.5 MYA (i.e., ~10 Ne generations; BACHTROG and CHARLESWORTH 2002 Down).



View larger version (19K):
In this window
In a new window
Download PPT slide
 
Figure 1. Schematic of phylogenetic relationships of the sequences surveyed. The neo-sex chromosomes of D. miranda formed ~1 MYA, while D. miranda and D. pseudoobscura diverged ~2 MYA. Parsimony assumptions and outgroup sequence information from D. pseudoobscura were used to infer whether fixations occurred in either the neo-X or the neo-Y lineage.

Evolutionary theory predicts that deleterious mutations should accumulate on a nonrecombining chromosome (FELSENSTEIN 1974 Down), while beneficial mutations are less likely to be fixed (PECK 1994 Down; ORR and KIM 1998 Down). In a previous study based on three protein-coding genes, BACHTROG and CHARLESWORTH 2002 Down demonstrated a significantly higher rate of protein evolution for one gene on the neo-X-branch of the phylogeny (CycB), and a significantly higher rate on the neo-Y branch for another gene (robo). While this pattern was suggestive, here I compare patterns of molecular evolution at 18 genes located on the neo-sex chromosomes of D. miranda. I investigate two aspects of molecular evolution. First, the rate of replacement substitutions is analyzed, to more generally characterize rates of protein evolution on the neo-X and neo-Y. Second, I investigate several aspects of the evolution of codon bias. We have previously shown a 30-fold reduction in Ne of the neo-Y chromosome in D. miranda compared to that of the neo-X (BACHTROG and CHARLESWORTH 2000 Down; BACHTROG and CHARLESWORTH 2002 Down). Since the efficacy of selection is dependent on Ne, both lower levels of codon bias and reduced protein adaptation are expected on the neo-Y chromosome.


*  MATERIALS AND METHODS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Sequence information on the genes investigated:
Table 1 lists all the genes surveyed and the source of the sequences. The genes investigated were isolated either from a genomic library constructed from D. miranda (CG11136, CG11159, CG13437, CG16799, CG30152, CG9025, Cyclin B, exuperantia 1, roundabout, deadpan, zipper, and maleless) or using degenerate primers for PCR amplification (even-skipped and engrailed) or were taken from GenBank (Lcp1, Lcp2, Lcp3, and Lcp4). All genes investigated are located on the neo-sex chromosomes of D. miranda, as confirmed by in situ hybridization (primary sources; results not shown). For CG11136, CG11159, CG13437, CG16799, CG30152, CG9025, Cyclin B, exuperantia 1, roundabout, deadpan, even-skipped, Lcp1, Lcp2, Lcp3, and Lcp4 the entire coding sequence was investigated; for the remaining genes, partial coding sequences were analyzed. All genes were also sequenced in a strain of D. pseudoobscura, which allows the inference of lineage-specific effects on the neo-X and the neo-Y chromosome (Fig 1). Information on library screening and PCR amplification can be found in the primary sources (STEINEMANN et al. 1996 Down; BACHTROG and CHARLESWORTH 2002 Down; BACHTROG 2003 Down) or is available from the author on request. Sequences of D. miranda and D. pseudoobscura were aligned manually. Sites with gaps were excluded from the analyses.


 
View this table:
In this window
In a new window

 
Table 1. Genes investigated located on the neo-sex chromosome of D. miranda

Rates of protein evolution:
Several methods were used to estimate rates of protein evolution on the neo-sex chromosomes of D. miranda. First, rates of replacement and synonymous substitution per site, Ka and Ks, respectively, between the neo-sex chromosomes of D. miranda and D. pseudoobscura were estimated using the method of COMERON 1995 Down. Second, amino acid mutations and synonymous substitutions were assigned to either the neo-X or the neo-Y branch of the evolutionary tree assuming parsimony and using the D. pseudoobscura sequence as an outgroup (AKASHI 1995 Down). Sites with multiple substitutions were excluded from the parsimony analyses. Also, codons containing multiple substitutions were excluded from the analysis; thus, some values differ slightly from previous analyses (BACHTROG and CHARLESWORTH 2002 Down; BACHTROG 2003 Down). Finally, a maximum-likelihood method was used to estimate Ka and Ks that accounts for unequal transition and transversion rates and unequal base and codon frequencies, as implemented in the PAML software package (http://abacus.gene.ucl.ac.uk/software/paml.html). Two approaches were used to assess differences in the rates of substitutions on the neo-X and neo-Y chromosome. First, the numbers of amino acid substitutions on the neo-sex chromosomes were compared using Tajima's relative rate test (TAJIMA 1993 Down). Second, maximum-likelihood analysis of data on the neo-sex-linked genes was used to compare a model in which Ka/Ks was assumed to be the same on the neo-X and neo-Y branch with a model in which it was allowed to differ, as implemented in PAML. Significance was assessed using the chi-square test; twice the difference in log-likelihood between models is assumed to be approximately chi-square distributed with degrees of freedom equal to the difference in the numbers of parameters.

Codon bias analysis:
As a measure of codon bias, the effective number of codons (ENC; see WRIGHT 1990 Down) as well as the frequency of optimal codons (FOP; see STENICO et al. 1994 Down) was used. Optimal (preferred) codons can be identified as those whose frequencies (within a synonymous family) show a positive correlation with the degree of bias at other codons in the same genes (AKASHI 1995 Down). Optimal codons have been found to be conserved between the distantly related species D. melanogaster and D. pseudoobscura (AKASHI and SCHAEFFER 1997 Down); thus, I used the preferred codons identified in D. pseudoobscura (AKASHI and SCHAEFFER 1997 Down) to classify synonymous mutations in D. miranda. A parsimony approach was used to map preferred and unpreferred synonymous substitutions to either the neo-X or the neo-Y branch of the phylogenetic tree, and the numbers were compared using Tajima's relative rate test (TAJIMA 1993 Down). Sites with multiple hits were excluded from the parsimony analyses, as were codons with multiple substitutions.


*  RESULTS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Rates of protein evolution:
Table 2 and Table 3 summarize the rates of synonymous and nonsynonymous substitutions on the neo-sex chromosomes of D. miranda. Two genes, exu1 and CycB, have an exceptionally high rate of replacement substitutions on the neo-X lineage (see Table 2 and Fig 2). For both genes, I have demonstrated that the elevated Ka is due to the action of recent positive selection (BACHTROG and CHARLESWORTH 2002 Down; BACHTROG 2003 Down), as indicated by significant McDonald-Kreitman (MCDONALD and KREITMAN 1991 Down) and Hudson-Kreitman-Aguadé tests (HUDSON et al. 1987 Down). Most other genes, however, show a higher rate of amino acid evolution on the branch of the phylogeny leading to the neo-Y compared to the neo-X branch (Table 2; Fig 2). Table 2 gives the pairwise estimates of synonymous and nonsynonymous divergence between the D. miranda neo-X and neo-Y chromosome and D. pseudoobscura for all loci analyzed.



View larger version (23K):
In this window
In a new window
Download PPT slide
 
Figure 2. Maximum-likelihood estimates of the rate of amino acid substitutions (a) and synonymous substitutions (b) on the neo-X and the neo-Y chromosome, as well as the frequency of optimal codons (c). The arrows indicate the positions of the CycB and exu1 genes.


 
View this table:
In this window
In a new window

 
Table 2. Synonymous and nonsynonymous substitution rates between D. pseudoobscura and the D. miranda neo-X and neo-Y chromosome, using the method of COMERON 1995 Down


 
View this table:
In this window
In a new window

 
Table 3. Parsimony-based patterns of molecular evolution of genes located on the neo-sex chromosomes of D. miranda

Excluding the two genes that have undergone positive selection on the neo-X chromosome (i.e., exu1 and CycB), the total number of replacement substitutions is about sixfold higher on the branch of the phylogeny leading to the neo-Y compared to the neo-X (83 vs. 14 replacement mutations; see Table 3). This excess of replacement substitutions on the neo-Y is highly significant by Tajima's relative rate test (P < 0.001). Including exu1 and CycB, however, still indicates a significant excess of replacement substitutions on the neo-Y chromosome (98 vs. 55 replacement mutations; P = 0.015). In contrast, the numbers of synonymous substitutions are similar on the two branches (100 vs. 80 mutations; P = 0.35; Table 3). Assuming that synonymous substitutions accumulate neutrally, this indicates that the mutation rates do not greatly differ between the neo-Y and the neo-X chromosome. The elevation in the rate of amino acid substitutions on the neo-Y is consistent with a reduction in the efficacy of selection against deleterious mutations for Y-linked genes. Six of the loci investigated appear to be nonfunctional on the neo-Y chromosome, as indicated by frameshift mutations, large deletions, or stop codons (see Table 1). Excluding these loci from the analysis, the rate of replacement accumulation is still about sixfold higher on the neo-Y than on the neo-X (64 vs. 11 substitutions; P < 0.001).

Using a likelihood-based approach reveals a similar pattern. A model with a fixed rate of Ka/Ks for all branches was compared to a model with three different rates (neo-X branch, neo-Y branch, and one for the branch including pseudoobscura and the time in the miranda lineage before the split of the neo-sex chromosomes; see Fig 1). Again, most of the genes showed a higher rate of Ka/Ks on the branch leading to the neo-Y chromosome (results not shown). For individual loci, however, the likelihood-ratio test is significant for only a few cases, probably due to a lack of statistical power. Maximum-likelihood analysis on the combined data set (excluding CycB and exu1) indicates that a model that allows for a different Ka/Ks in each branch fits the data significantly better than a model that assumes the same rate for each branch of the phylogeny (2{Delta}L = 34.9; P < 0.001). Thus, the neo-Y chromosome has, on average, a significantly higher Ka/Ks ratio than the neo-X chromosome (0.26 vs. 0.07). Restricting the analysis to those genes that show intact open reading frames on the neo-Y chromosome results in the same conclusion (Ka/Ks of 0.06 and 0.24 on neo-X and neo-Y, respectively; 2{Delta}L = 24.4; P < 0.001).

Fig 2 shows the maximum-likelihood estimates of Ka and Ks for the neo-X and the neo-Y branches. The variance in the number of amino acid substitutions among genes on the neo-Y is much smaller than that on the neo-X (Fig 2). Amino acid substitutions occur uniformly among neo-Y-linked genes (see Fig 2; {chi}2 = 16.9; P > 0.5), as expected if all loci on the neo-Y are suffering from a reduced effectiveness of natural selection. In addition, synonymous substitutions occur uniformly among neo-Y genes ({chi}2 = 6.6; P > 0.9). In contrast, there is significant heterogeneity in the rate of amino acid evolution among genes on the neo-X chromosome (see Fig 2; {chi}2 = 154.5; P < 0.001), yet no heterogeneity is observed for synonymous substitutions ({chi}2 = 9.3; P > 0.9). This strongly suggests that frequent positive selection is limited to only a fraction of genes on the neo-X.

Codon bias analysis:
Reduced efficacy of natural selection for Y-linked genes predicts a reduction in level of codon bias for neo-Y- compared with neo-X-linked genes. In contrast to this expectation, the average codon bias for neo-X- and neo-Y-linked sequences is identical (mean ENC is 41 and mean FOP is 0.70 for both chromosomes; Fig 2) and only slightly higher than that in D. pseudoobscura (mean ENC is 41 and mean FOP is 0.71). Thus, there is no overall indication that natural selection is less effective at preventing unpreferred codons from accumulating on the neo-Y chromosome compared to the neo-X. However, for such closely related sequences as the neo-X, neo-Y, and D. pseudoobscura, a large number of sites are occupied by identical codons due to shared ancestry. Thus, the absolute values of codon bias will give little information about ongoing selection pressures on codon usage bias in the different lineages.

I also compared the numbers of substitutions from preferred to unpreferred synonymous codons ("unpreferred substitutions") to the number of unpreferred to preferred synonymous codons ("preferred substitutions"; see AKASHI 1995 Down). If selection for codon bias has recently been reduced on the neo-Y lineage, an excess of unpreferred substitutions should be detected among neo-Y-linked genes compared to neo-X-linked genes. Surprisingly, both the neo-X and the neo-Y chromosome show a strong excess of unpreferred over preferred codon substitutions (Table 4; Fig 3). On the neo-X, 58 unpreferred and 13 preferred substitutions were detected, and on the neo-Y chromosome, 62 unpreferred and 18 preferred substitutions occurred. This excess of unpreferred over preferred substitutions on the neo-sex-linked genes is highly significant, for both chromosomes independently, as well as for the combined data set (Tajima's relative rate test, P < 0.001 for all cases). The same is seen if only genes that show intact open reading frames on the neo-Y are analyzed (52 vs. 13 unpreferred over preferred substitutions on the neo-X and 55 vs. 15 on the neo-Y; P < 0.001). The excess of unpreferred fixations on both neo-sex chromosomes suggests a genome-wide reduction in selection intensity for codon usage bias in the D. miranda lineage.



View larger version (20K):
In this window
In a new window
Download PPT slide
 
Figure 3. Comparisons of patterns of codon usage bias and amino acid replacements at the neo-X and neo-Y chromosome. The number of unpreferred substitutions (P -> U), the number of preferred substitutions (U -> P), the number of other substitutions (i.e., P -> P or U -> U), and the number of replacement fixations (excluding exu1 and CycB) are shown.


 
View this table:
In this window
In a new window

 
Table 4. Patterns of preferred (U -> P) and unpreferred (P -> U) substitutions on the neo-sex chromosomes of D. miranda


*  DISCUSSION
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

As predicted by population genetics theory, the effective population size of the nonrecombining neo-Y chromosome of D. miranda is strongly reduced compared with its homolog, the neo-X chromosome. Both microsatellite and nucleotide polymorphism data suggest that Ne of the neo-Y chromosome is ~30-fold lower than that of the neo-X (BACHTROG and CHARLESWORTH 2000 Down, BACHTROG and CHARLESWORTH 2002 Down). This reduction in Ne is, in turn, predicted to result in a lower efficacy of both positive and negative selection on the Y chromosome. Here, I investigate two aspects of molecular evolution for 18 genes located on the neo-sex chromosomes of D. miranda: the rate of amino acid substitution and the level of codon bias. Apart from two loci that have undergone adaptive protein evolution on the neo-X chromosome, loci on the neo-Y chromosome generally show a higher rate of amino acid substitutions than do loci on the neo-X. Moreover, amino acid substitutions were found to be accumulating uniformly among neo-Y-linked genes. This is consistent with the observed lower effective population size of the neo-Y compared to the neo-X, reducing the efficacy of purifying selection against deleterious amino acid mutations on the neo-Y. However, neo-X and neo-Y chromosomes do not differ only in their recombination rate; other factors—such as differences in patterns of mutation or selection—could also contribute to differences in patterns of protein evolution observed between neo-X and neo-Y chromosomes.

Reduced selective constraint on the neo-Y?
Not all of the amino acid mutations that have accumulated on the neo-Y chromosome are expected to be deleterious. There are at least two reasons why genes on the neo-Y chromosome might be under reduced selective constraint compared to their neo-X-linked homologs. First, some of the genes surveyed on the neo-Y chromosome are clearly nonfunctional, implying that amino acid mutations can accumulate in a neutral manner among those genes. Also, some genes might be dosage compensated on the neo-X, and the homologs on the neo-Y could evolve under reduced selective constraint. Measuring levels of constraint (C) allows us to crudely quantify the difference between the neo-X and the neo-Y in patterns of amino acid evolution. Level of constraint—following the method of EYRE-WALKER and KEIGHTLEY 1999 Down—was estimated to be 0.92 on the neo-X chromosome, suggesting that 92% of the amino acid mutations are deleterious enough to be eliminated by purifying selection on the neo-X. As expected, the level of constraint is much lower on the neo-Y chromosome; only 68% of all amino acid changing mutations are purged from the neo-Y. Thus, a large fraction of replacement mutations (i.e., {cong}24%) behave effectively neutrally on the neo-Y. However, if only genes that show intact open reading frames on the neo-Y chromosome are analyzed, the level of constraint estimated for the neo-Y-linked genes is similarly low (i.e., 71%). Further, considering only those genes for which expression of the neo-Y-linked copy has been demonstrated (i.e., eve, robo, and Lcp3), the level of constraint estimated is 76%. Thus, there is no indication that the high rate of amino acid replacements of the neo-Y-linked genes is solely the consequence of some genes containing frameshift mutations. The issue of reduced functional constraint due to dosage compensation on the neo-X is more difficult to resolve. Cytogenetic analysis has shown that parts of the neo-X chromosome in D. miranda may be dosage compensated (MARIN et al. 2000 Down). The loci analyzed here are spread over various locations on the neo-X chromosome; however, it is unclear which neo-Y genes might be dosage compensated on the neo-X. Thus, some fraction of the neo-Y-linked genes might evolve under reduced functional constraint due to dosage compensation on the neo-X chromosome.

Differences in mutation bias and mutation rate?
A departure from stationarity in nucleotide composition between the neo-X and the neo-Y chromosome could also contribute to the differences observed in rates of nucleotide substitutions. To explore this possibility, intron-derived DNA was investigated. A total of ~6.4 kb of intron DNA could be aligned unambiguously between D. pseudoobscura and the D. miranda neo-X and neo-Y. AT content is identical on the neo-sex chromosomes (54.8%) and only slightly lower in D. pseudoobscura (54.4%). Moreover, the proportion of GC -> AT vs. AT -> GC substitutions is almost identical on the neo-X and neo-Y chromosome (2.1 vs. 1.9), and no significant heterogeneity in the two classes of substitutions between the neo-sex chromosomes is observed (P from Fisher's exact test >0.8 for 29:14 vs. 49:26; Table 5). Also, synonymous substitutions show more GC -> AT vs. AT -> GC substitutions on the neo-X and neo-Y chromosome, but no significant heterogeneity between chromosomes (59:13 vs. 66:18; P > 0.4; Table 6). Thus, while there is an excess of GC -> AT mutations on both neo-sex chromosomes (see below), differences in nucleotide composition between the chromosomes cannot account for the differences in rates of amino acid evolution observed. Selection on amino acid substitutions could have opposed the GC -> AT mutation bias; indeed, the pattern of GC -> AT vs. AT -> GC substitutions at replacement sites differs between the neo-X and neo-Y chromosome (25:15 vs. 59:14; P = 0.03).


 
View this table:
In this window
In a new window

 
Table 5. Numbers of substitutions in intron-derived DNA on the neo-X and neo-Y chromosomes


 
View this table:
In this window
In a new window

 
Table 6. Total numbers of substitutions in all genes studied on the neo-X and neo-Y branch

The number of silent mutations (i.e., synonymous and noncoding) is significantly higher on the neo-Y chromosome than on the neo-X. A total of 196 silent substitutions were observed on the neo-Y, but only 149 on the neo-X (Tajima's test, P < 0.05). If synonymous sites and introns experience no selective constraint, this might indicate a higher mutation rate on the neo-Y chromosome than on the neo-X. Alternatively, introns and synonymous sites might be under weak selective constraints; if mutations are slightly deleterious, this also can result in a faster rate of molecular evolution on the neo-Y. If there is indeed a higher mutation rate on the neo-Y, this could partly contribute to the faster rate of protein evolution observed, if amino acid changes are very weakly selected. However, the excess accumulation of slightly deleterious mutations on the neo-Y due to increased mutation pressure would be only 1.3-fold. This clearly cannot account for the observed 6-fold increase in amino acid substitutions on the neo-Y.

Sampling a single allele from the neo-X and the neo-Y does not allow distinguishing between fixed and segregating mutations (AKASHI 1996 Down). The 30-fold reduction in Ne of the neo-Y chromosome compared to the neo-X implies that neo-Y-derived alleles have a much shorter time back to their most recent common ancestor. While some mutations sampled from the neo-X allele might actually be segregating in the population, nearly all mutations on the neo-Y will be fixed. Thus, if slightly deleterious amino acid mutations that eventually will be eliminated by natural selection segregate on the neo-X, we would slightly underestimate the excess of protein evolution on the neo-Y chromosome.

Reduced codon bias in the D. miranda lineage:
In contrast to the striking difference between the neo-X and neo-Y in patterns of protein evolution, unpreferred synonymous substitutions do not accumulate differentially on the neo-sex chromosomes. Codon usage bias is thought to be maintained by a balance of mutation, selection, and drift (SHARP and LI 1986 Down; BULMER 1988 Down). The strength of selection acting on synonymous codons is probably much weaker on average than that for amino acid replacements; for D. simulans, the selection coefficient against unpreferred codons was estimated to be on the order of 10-6 (AKASHI 1995 Down). Assuming a statistical equilibrium (i.e., if selection intensity, mutation rates, and population sizes remain constant), the number of preferred substitutions should be the same as that of unpreferred ones in a given lineage (AKASHI 1995 Down). However, both the neo-X and the neo-Y in D. miranda show a significant excess of unpreferred over preferred substitutions, suggesting an overall reduction in the levels of codon bias in the species. Since the intensity of selection maintaining codon bias has been estimated to be low in D. pseudoobscura (Nes {cong} 1; AKASHI and SCHAEFFER 1997 Down) it is likely to be even lower in D. miranda given the approximately sixfold lower effective population size (based on estimates of silent-site diversity; YI et al. 2003 Down). Thus, a reduction in the effective population size of D. miranda compared to an ancestor could have caused synonymous codons to behave as effectively neutral on both the neo-X and the neo-Y chromosome. Since the rate of substitution for neutral mutations does not depend on the population size but only on their mutation rate (KIMURA 1983 Down), unpreferred codons should accumulate similarly on both neo-sex chromosomes.

Mutation bias can also contribute to the decrease in codon bias. All preferred codons in D. pseudoobscura end with a G or C nucleotide (AKASHI and SCHAEFFER 1997 Down). A change in mutational bias toward AT mutations can be analyzed by studying substitutions in intron DNA (assuming introns are not under selective constraint). This analysis indicates that base composition of introns between D. pseudoobscura and D. miranda is similar. However, the number of GC -> AT substitutions on both the neo-X and neo-Y is almost twice as large as the number of AT -> GC substitutions (29 vs. 14 on the neo-X and 49 vs. 26 on the neo-Y; P < 0.05 for both cases; Table 5). This large excess of GC -> AT substitutions indicates that nucleotide composition is not at equilibrium in D. miranda, but is evolving toward a higher AT content. Thus, a change in mutational bias is likely to have contributed to the decrease of codon bias in D. miranda.

A similar pattern for synonymous substitutions was also observed in the D. melanogaster and D. simulans lineages (AKASHI 1996 Down; MCVEAN and VIEIRA 2001 Down; BEGUN 2002 Down). Both D. simulans and D. melanogaster show a genome-wide reduction in levels of codon bias compared to their ancestor (AKASHI 1996 Down; MCVEAN and VIEIRA 2001 Down). D. melanogaster shows no evidence of current selection on codon usage (AKASHI 1996 Down; MCVEAN and VIEIRA 2001 Down), while this relaxation in selection appears to be less extreme for D. simulans (MCVEAN and VIEIRA 2001 Down). This pattern is consistent with a somewhat lower Ne for D. melanogaster than for D. simulans, as suggested by differences in overall levels of nucleotide variability (ANDOLFATTO 2001 Down). However, D. melanogaster also has an apparent mutation bias toward AT substitutions (TAKANO-SHIMIZU 2001 Down), confounding the effects of selection and mutation on the evolution of codon bias.

Reduced adaptation in regions of reduced recombination in Drosophila:
The higher rate of amino acid substitutions on the nonrecombining neo-Y chromosome and the larger variance in Ka on the neo-X relate to the recent finding that the rate of amino acid substitutions and the variance in Ka is elevated in regions of high recombination in D. melanogaster (BETANCOURT and PRESGRAVES 2002 Down). The pattern observed in D. melanogaster suggests that adaptive protein evolution may be constrained in low-recombining regions in the D. melanogaster genome, but that purifying selection against deleterious amino acid substitutions is operating genome-wide (BETANCOURT and PRESGRAVES 2002 Down). In D. miranda, while the average Ka is higher on the neo-Y chromosome, the variance in Ka among loci is much larger on the neo-X (see Fig 2). Most loci on the neo-X show very few amino acid substitutions, indicating purifying selection against deleterious amino acid changes, and two genes show rapid amino acid evolution due to positive selection (BACHTROG and CHARLESWORTH 2002 Down; BACHTROG 2003 Down). The data from the neo-sex chromosomes of D. miranda support the hypothesis that linkage limits the rate of adaptation, with adaptive protein evolution being mainly restricted to the recombining neo-X chromosome. However, the reduced efficacy of purifying selection is more obvious for the neo-Y in D. miranda (i.e., the general increase in Ka) than in regions of low recombination in D. melanogaster (although data for the latter are sparse; see BETANCOURT and PRESGRAVES 2002 Down). Part of this difference may be explained by the size of the regions of reduced recombination. The nonrecombining neo-Y chromosome constitutes an entire chromosome arm, with an estimate of ~2500 genes (ADAMS et al. 2000 Down). In contrast, the various regions of reduced recombination in D. melanogaster each contain at most ~100 genes (ADAMS et al. 2000 Down). Hill-Robertson effects—which are proportional to the number of linked loci under selection (LI 1987 Down; MCVEAN and CHARLESWORTH 2000 Down)—are therefore expected to be much more pronounced for the neo-Y in D. miranda.

I have emphasized the importance of the effective population size Ne in explaining patterns of evolution on the neo-X and neo-Y chromosome, as it is helpful to explain certain features of the data, such as a reduction in polymorphism on the neo-Y chromosome or the evolutionary dynamics of very weakly selected mutations (CHARLESWORTH and CHARLESWORTH 2000 Down). However, this is clearly not the only factor involved. Noteworthy is that the effective population size inferred from nucleotide diversity on the neo-Y chromosome (i.e., ~30,000) is on the same order as that estimated for humans (i.e., ~10,000). Indeed, in accordance with their small population size, human populations appear to be accumulating slightly deleterious mutations (EYRE-WALKER et al. 2002 Down). However, a key difference between the neo-Y of D. miranda and human populations is the absence of sexual recombination in the former. Thus, while both might suffer from the accumulation of weakly deleterious mutations due to a small Ne, recombination in human populations may be sufficient to prevent the fixation of strongly deleterious mutations. In contrast, the lack of recombination on the neo-Y chromosome leaves it susceptible to processes like Muller's ratchet or hitchhiking of linked detrimental alleles with strongly advantageous mutations; both processes can fix strongly deleterious mutations (CHARLESWORTH and CHARLESWORTH 2000 Down). This emphasizes the role of recombination (and not just simply differences in Ne) in explaining patterns of molecular evolution on the neo-X and neo-Y chromosomes.


*  ACKNOWLEDGMENTS

I am grateful to P. Andolfatto, B. Charlesworth, and S. Wright for helpful discussions and comments on the manuscript. D.B. is supported by a European Molecular Biology Organization Fellowship. This research was funded by a Biotechnology and Biological Sciences Research Council grant to B. Charlesworth and by a Royal Society grant to D.B.

Manuscript received April 21, 2003; Accepted for publication July 11, 2003.


*  LITERATURE CITED
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

ADAMS, M. D., S. E. CELNIKER, R. A. HOLT, C. A. EVANS, and J. D. GOCAYNE et al., 2000  The genome sequence of Drosophila melanogaster.. Science 287:2185-2195.[Abstract/Free Full Text]

AKASHI, H., 1995  Inferring weak selection from patterns of polymorphism and divergence at "silent" sites in Drosophila DNA. Genetics 139:1067-1076.[Abstract]

AKASHI, H., 1996  Molecular evolution between Drosophila melanogaster and D. simulans: reduced codon bias, faster rates of amino acid substitution, and larger proteins in D. melanogaster.. Genetics 144:1297-1307.[Abstract]

AKASHI, H. and S. W. SCHAEFFER, 1997  Natural selection and the frequency distributions of "silent" DNA polymorphism in Drosophila. Genetics 146:295-307.[Abstract]

ANDOLFATTO, P., 2001  Contrasting patterns of X-linked and autosomal nucleotide variation in Drosophila melanogaster and Drosophila simulans.. Mol. Biol. Evol. 18:279-290.[Abstract/Free Full Text]

BACHTROG, D., 2003  Adaptation shapes patterns of genome evolution in sexual and asexual genomes in Drosophila.. Nat. Genet. 34:215-219.[Medline]

BACHTROG, D. and B. CHARLESWORTH, 2000  Reduced levels of microsatellite variability on the neo-Y chromosome of Drosophila miranda.. Curr. Biol. 10:1025-1031.[Medline]

BACHTROG, D. and B. CHARLESWORTH, 2001  Towards a complete sequence of the human Y chromosome. Genome Biol. 2:1016.

BACHTROG, D. and B. CHARLESWORTH, 2002  Reduced adaptation of a non-recombining neo-Y chromosome. Nature 416:323-326.[Medline]

BEGUN, D. J., 2002  Protein variation in Drosophila simulans, and comparison of genes from centromeric versus noncentromeric regions of chromosome 3. Mol. Biol. Evol. 19:201-203.[Free Full Text]

BEGUN, D. J. and C. F. AQUADRO, 1992  Levels of naturally occurring DNA polymorphism correlate with recombination rates in Drosophila melanogaster.. Nature 356:519-520.[Medline]

BETANCOURT, A. J. and D. C. PRESGRAVES, 2002  Linkage limits the power of natural selection in Drosophila.. Proc. Natl. Acad. Sci. USA 99:13616-13620.[Abstract/Free Full Text]

BULL, J. J., 1983 Evolution of Sex Determining Mechanisms. Benjamin Cummings, Menlo Park, CA.

BULMER, M., 1988  Codon usage and intragenic position. J. Theor. Biol. 133:67-71.[Medline]

CABALLERO, A., 1995  On the effective size of populations with separate sexes, with particular reference to sex-linked genes. Genetics 139:1007-1011.[Abstract]

CARVALHO, A. B., B. A. DOBO, M. D. VIBRANOVSKI, and A. G. CLARK, 2001  Identification of five new genes on the Y chromosome of Drosophila melanogaster.. Proc. Natl. Acad. Sci. USA 98:13225-13230.[Abstract/Free Full Text]

CHARLESWORTH, B., 1990  The evolution of sex chromosomes. Science 251:1030-1033.

CHARLESWORTH, B., 1996  The evolution of chromosomal sex determination and dosage compensation. Curr. Biol. 6:149-162.[Medline]

CHARLESWORTH, B. and D. CHARLESWORTH, 2000  The degeneration of Y chromosomes. Philos. Trans. R. Soc. Lond. B Biol. Sci. 355:1563-1572.[Medline]

COMERON, J. M., 1995  A method for estimating the numbers of synonymous and nonsynonymous substitutions per site. J. Mol. Evol. 41:1152-1159.[Medline]

EYRE-WALKER, A. and P. D. KEIGHTLEY, 1999  High genomic deleterious mutation rates in hominids. Nature 397:344-347.[Medline]

EYRE-WALKER, A., P. D. KEIGHTLEY, N. G. SMITH, and D. GAFFNEY, 2002  Quantifying the slightly deleterious mutation model of molecular evolution. Mol. Biol. Evol. 19:2142-2149.[Abstract/Free Full Text]

FELSENSTEIN, J., 1974  The evolutionary advantage of recombination. Genetics 78:737-756.[Abstract/Free Full Text]

GETHMANN, R. C., 1988  Crossing over in males of higher Diptera (Brachycera). J. Hered. 79:344-350.[Abstract/Free Full Text]

HILL, W. G. and A. ROBERTSON, 1966  The effect of linkage on limits to artificial selection. Genet. Res. 8:269-294.[Medline]

HUDSON, R. R., M. KREITMAN, and M. AGUADE, 1987  A test of neutral molecular evolution based on nucleotide data. Genetics 116:153-159.[Abstract/Free Full Text]

KIMURA, M., 1983 The Neutral Theory of Molecular Evolution. Cambridge University Press, Cambridge, UK.

KLIMAN, R. M. and J. HEY, 1993  Reduced natural selection associated with low recombination in Drosophila melanogaster.. Mol. Biol. Evol. 10:1239-1258.[Abstract]

LAHN, B. T., N. M. PEARSON, and K. JEGALIAN, 2001  The human Y chromosome, in the light of evolution. Nat. Rev. Genet. 2:207-216.[Medline]

LI, W. H., 1987  Models of nearly neutral mutations with particular implications for nonrandom usage of synonymous codons. J. Mol. Evol. 24:337-345.[Medline]

LUCCHESI, J. C., 1978  Gene dosage compensation and the evolution of sex chromosomes. Science 202:711-716.[Abstract/Free Full Text]

LYNCH, M. and J. L. BLANCHARD, 1998  Deleterious mutation accumulation in organelle genomes. Genetica 102(103):29-39.

MARAIS, G. and G. PIGANEAU, 2002  Hill-Robertson interference is a minor determinant of variations in codon bias across Drosophila melanogaster and Caenorhabditis elegans genomes. Mol. Biol. Evol. 19:1399-1406.[Abstract/Free Full Text]

MARIN, I., M. L. SIEGAL, and B. S. BAKER, 2000  The evolution of dosage-compensation mechanisms. Bioessays 22:1106-1114.[Medline]

MCDONALD, J. H. and M. KREITMAN, 1991  Adaptive protein evolution at the Adh locus in Drosophila.. Nature 351:652-654.[Medline]

MCVEAN, G. A. and J. VIEIRA, 2001  Inferring parameters of mutation, selection and demography from patterns of synonymous site evolution in Drosophila. Genetics 157:245-257.[Abstract/Free Full Text]

MCVEAN, G. A. T. and B. CHARLESWORTH, 2000  The effects of Hill-Robertson interference between weakly selected mutations on patterns of molecular evolution and variation. Genetics 155:929-944.[Abstract/Free Full Text]

MULLER, H. J., 1940 Bearings of the Drosophila work on systematics, pp. 185–268 in The New Systematics, edited by J. S. HUXLEY. Oxford University Press, London.

NEI, M. and T. GOJOBORI, 1986  Simple methods for estimating the numbers of synonymous and nonsynonymous nucleotide substitutions. Mol. Biol. Evol. 3:418-426.[Abstract]

ORR, H. A. and Y. KIM, 1998  An adaptive hypothesis for the evolution of the Y chromosome. Genetics 150:1693-1698.[Abstract/Free Full Text]

PECK, J. R., 1994  A ruby in the rubbish: beneficial mutations, deleterious mutations and the evolution of sex. Genetics 137:597-606.[Abstract]

SHARP, P. M. and W. H. LI, 1986  An evolutionary perspective on synonymous codon usage in unicellular organisms. J. Mol. Evol. 24:28-38.[Medline]

STEINEMANN, M. and S. STEINEMANN, 1998  Enigma of Y chromosome degeneration: neo-Y and neo-X chromosomes of Drosophila miranda a model for sex chromosome evolution. Genetica 102(103):409-420.

STEINEMANN, M., S. STEINEMANN, and F. LOTTSPEICH, 1993  How Y chromosomes become genetically inert. Proc. Natl. Acad. Sci. USA 90:5737-5741.[Abstract/Free Full Text]

STEINEMANN, M., S. STEINEMANN, and W. PINSKER, 1996  Evolution of the larval cuticle proteins coded by the secondary sex chromosome pair: X2 and neo-Y of Drosophila miranda. I. Comparison at the DNA sequence level. J. Mol. Evol. 43:405-412.[Medline]

STEINEMANN, S. and M. STEINEMANN, 1999  The Amylase gene cluster on the evolving sex chromosomes of Drosophila miranda.. Genetics 151:151-161.[Abstract/Free Full Text]

STENICO, M., A. T. LLOYD, and P. M. SHARP, 1994  Codon usage in Caenorhabditis elegans: delineation of translational selection and mutational biases. Nucleic Acids Res. 22:2437-2446.[Abstract/Free Full Text]

TAJIMA, F., 1993  Simple methods for testing the molecular evolutionary clock hypothesis. Genetics 135:599-607.[Abstract]

TAKANO-SHIMIZU, T., 2001  Local changes in GC/AT substitution biases and in crossover frequencies on Drosophila chromosomes. Mol. Biol. Evol. 18:606-619.[Abstract/Free Full Text]

WEINREICH, D. M. and D. M. RAND, 2000  Contrasting patterns of nonneutral evolution in proteins encoded in nuclear and mitochondrial genomes. Genetics 156:385-399.[Abstract/Free Full Text]

WRIGHT, F., 1990  The "effective number of codons" used in a gene. Gene 87:23-29.[Medline]

YI, S., D. BACHTROG, and B. CHARLESWORTH, 2003  A survey of chomosomal and nucleotide sequence variation in Drosophila miranda.. Genetics 164:1369-1381.[Abstract/Free Full Text]

ZUROVCOVA, M. and W. F. EANES, 1999  Lack of nucleotide polymorphism in the Y-linked sperm flagellar dynein gene Dhc-Yh3 of Drosophila melanogaster and D. simulans.. Genetics 153:1709-1715.[Abstract/Free Full Text]




This article has been cited by other articles:


Home page
GeneticsHome page
C. Diaz-Castillo and K. G. Golic
Evolution of Gene Sequence in Response to Chromosomal Location
Genetics, September 1, 2007; 177(1): 359 - 374.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
R. Bergero, A. Forrest, E. Kamau, and D. Charlesworth
Evolutionary Strata on the X Chromosomes of the Dioecious Plant Silene latifolia: Evidence From New Sex-Linked Genes
Genetics, April 1, 2007; 175(4): 1945 - 1954.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
D. Bachtrog and P. Andolfatto
Selection, Recombination and Demographic History in Drosophila miranda
Genetics, December 1, 2006; 174(4): 2045 - 2059.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
C. Bartolome and B. Charlesworth
Evolution of Amino-Acid Sequences and Codon Usage on the Drosophila miranda Neo-Sex Chromosomes
Genetics, December 1, 2006; 174(4): 2033 - 2044.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
W.-Y. Ko, S. Piao, and H. Akashi
Strong Regional Heterogeneity in Base Composition Evolution on the Drosophila X Chromosome
Genetics, September 1, 2006; 174(1): 349 - 362.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
H. Akashi, W.-Y. Ko, S. Piao, A. John, P. Goel, C.-F. Lin, and A. P. Vitins
Molecular Evolution in the Drosophila melanogaster Species Subgroup: Frequent Parameter Fluctuations on the Timescale of Molecular Divergence
Genetics, March 1, 2006; 172(3): 1711 - 1726.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
D. Bachtrog
Sex chromosome evolution: Molecular aspects of Y-chromosome degeneration in Drosophila
Genome Res., October 1, 2005; 15(10): 1393 - 1401.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
N. D. Singh, J. C. Davis, and D. A. Petrov
X-Linked Genes Evolve Higher Codon Bias in Drosophila and Caenorhabditis
Genetics, September 1, 2005; 171(1): 145 - 155.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
C. Bartolome, X. Maside, S. Yi, A. L. Grant, and B. Charlesworth
Patterns of Selection on Synonymous and Nonsynonymous Variants in Drosophila miranda
Genetics, March 1, 2005; 169(3): 1495 - 1507.
[Abstract] [Full Text] [PDF]