Genetics, Vol. 153, 497-506, September 1999, Copyright © 1999

The Age of Nonsynonymous and Synonymous Mutations in Animal mtDNA and Implications for the Mildly Deleterious Theory

Rasmus Nielsena and Daniel M. Weinreich1,b
a Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts 02138
b Museum of Comparative Zoology, Harvard University, Cambridge, Massachusetts 02138

Corresponding author: Rasmus Nielsen, Department of Organismic and Evolutionary Biology, Harvard University, 288 Biol. Labs., Cambridge, MA 02138., rnielsen{at}oeb.harvard.edu (E-mail)

Communicating editor: A. G. CLARK


*  ABSTRACT
*TOP
*ABSTRACT
*METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

McDonald/Kreitman tests performed on animal mtDNA consistently reveal significant deviations from strict neutrality in the direction of an excess number of polymorphic nonsynonymous sites, which is consistent with purifying selection acting on nonsynonymous sites. We show that under models of recurrent neutral and deleterious mutations, the mean age of segregating neutral mutations is greater than the mean age of segregating selected mutations, even in the absence of recombination. We develop a test of the hypothesis that the mean age of segregating synonymous mutations equals the mean age of segregating nonsynonymous mutations in a sample of DNA sequences. The power of this age-of-mutation test and the power of the McDonald/Kreitman test are explored by computer simulations. We apply the new test to 25 previously published mitochondrial data sets and find weak evidence for selection against nonsynonymous mutations.


ONE of the greatest standing controversies in evolutionary biology is the debate over the causes of molecular variation in natural populations (e.g., KIMURA 1983 Down; GILLESPIE 1991 Down). The neutral theory asserts that the vast majority of variation is selectively neutral and when two or more alleles coexist in a population, natural selection cannot distinguish between the alleles. Under the neutral theory, allelic frequencies are determined by genetic drift. Similarly the theory holds that most allelic fixation events are the consequence of genetic drift and not of natural selection. In contrast, selective theories, including the mildly deleterious model (OHTA 1973 Down), claim that a significant amount of the allelic variation and fixations in natural populations is determined by natural selection.

The McDonald/Kreitman test (MCDONALD and KREITMAN 1991 Down) analyzes the neutral assumption that the same processes determine allelic polymorphism and allelic fixation by comparing the number of synonymous and nonsynonymous sites segregating within and fixed between species in a 2 x 2 contingency table. Recently, neutrality has been rejected by the McDonald/Kreitman test in a number of mitochondrial DNA (mtDNA) data sets (BALLARD and KREITMAN 1994 Down; NACHMAN et al. 1994 Down, NACHMAN et al. 1996 Down; Rand et al. 1994 Down; Rand AND KANN 1996 Down, Rand AND KANN 1998 Down; NACHMAN 1998 Down; HASEGAWA et al. 1998 Down; WISE et al. 1998 Down). The direction of deviation from neutrality in each of these data sets is the same: an excess number of polymorphic nonsynonymous sites segregating within species relative to the number of nonsynonymous sites fixed between species. Mildly deleterious alleles may segregate in a population in low or intermediate frequencies but are not as likely to become fixed (KIMURA 1983 Down), and thus if some nonsynonymous mutations are mildly deleterious, we expect that they will segregate within populations but rarely reach fixation. This is the interpretation commonly given to the significant McDonald/Kreitman tests observed for animal mtDNA (BALLARD and KREITMAN 1994 Down; NACHMAN et al. 1994 Down, NACHMAN et al. 1996 Down; NACHMAN 1998 Down; Rand AND KANN 1996 Down, Rand AND KANN 1998 Down; Rand et al. 1994 Down; HASEGAWA et al. 1998 Down; WISE et al. 1998 Down).

We present a new method for testing the hypothesis that segregating nonsynonymous sites in animal mtDNA are mildly deleterious, based on a comparison of the age of segregating nonsynonymous and synonymous mutations. Throughout we assume that there is no recombination between adjacent nucleotide sites. This implies that nonsynonymous and synonymous sites share the same underlying gene genealogy. We first show that the mean age of a segregating deleterious mutation is less than the mean age of a segregating neutral mutation, even in the absence of recombination. Thus under the hypothesis that nonsynonymous mutations are mildly deleterious, the mean age of nonsynonymous segregating sites should be less than the mean age of synonymous segregating sites despite the fact that the underlying gene genealogy is identical for the two types of sites. We present a test for comparing the mean ages of two arbitrary classes of segregating sites (e.g., synonymous and nonsynonymous) in a sample and determine its power as a function of the strength of purifying selection. We then determine the power of the McDonald/Kreitman test to detect purifying selection. Finally we apply our test to a set of 25 previously published mtDNA data sets.


*  METHODS
*TOP
*ABSTRACT
*METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Computer simulations:
Throughout, computer simulations were used to test theoretical predictions and to examine the properties of the suggested statistical tests. In these simulations a haploid population of size N chromosomes was simulated under the Wright-Fisher model. To model the mutational process an infinite sites model was used (WATTERSON 1975 Down). The mutational makeup of each individual chromosome and the generation in which each segregating mutation arose were stored in computer memory. In each generation the population of chromosomes underwent mutation, selection, and genetic drift. The probability of a new mutation occurring in a given generation on a chromosome is µ = µs + µn, where µs is the mutation rate of selected mutations and µn is the mutation rate of neutral mutations. Simple multiplicative fitness was employed; thus the relative fitness of the ith chromosome is given by

(1)

where s is the selection coefficient of selected mutations and ni is the number of selected mutations carried by the ith chromosome. For the sake of simplicity we assume that s does not vary in time or among selected mutations and that there are no epistatic interactions. Selection together with genetic drift is simulated by sampling multinomially between generations among the k segregating chromosomes according to the probabilities specified by the wi, i = 1, 2, . . , k given by (1). In general, samples were taken from the population every 10N generations after the first 100N generations. For each sample, some function of the sample was evaluated, such as the mean age of selected or neutral segregating sites or the P value under one of the suggested tests. Denote the obtained value of the evaluated function in the ith sample by fi. Assuming that stationary distributions exist for these functions, the theorem of ergodicity ensures that

where E(f) is the expectation of the function f and n is the number of samples. To examine the power of the McDonald/Kreitman test, we also scored the number of fixed differences between the sequences in each sample and a single sequence sampled t generations in the past. Population sizes of N = 1000 chromosomes were used in the simulations throughout. Except where otherwise noted, the parameter values used in the simulations were chosen to fit the estimated values based on the analyzed data sets (see Data analysis); i.e., the number of sequences in a sample was set to n = 25, {theta} = 10 ({theta} = 4Nµ), and the level of intraspecific divergence was set to µt = 180. The ratio of nonsynonymous to synonymous mutations was set to µns = 1, corresponding to assuming that ~3/4 of all new nonsynonymous mutations are immediately lethal to the organism.

Test statistics:
We explored several tests of the hypothesis that the mean ages of segregating sites in two distinct classes of sites (e.g., neutral and selected) in a sample are equal. The basics of these tests are the same: a score is calculated for each segregating site. Denote the value of this score for the ith site by {omega}i. Then, we define the test statistic

(2)

where Ii is an indicator function that returns 0 if site i is a neutral site and 1 if site i is selected. Thus {Omega} is the ratio of the average of {omega} for selected sites to the average of {omega} for neutral sites. The null distribution of {Omega} under the hypothesis that the value of {omega}i is independent of the class of site i can be approximated by repeatedly recalculating {Omega} after random relabeling of sites such that the total number of selected sites remains constant and the total number of neutral sites remains constant. A total of 100,000 replicates of this resampling scheme were used to establish critical values of {Omega}. If we denote the ith value of {Omega} by {Omega}i and the observed value of {Omega} by {Omega}obs, then a test based on k resampled samples rejects at the 5% level if (){Sigma}ki=1I({Omega}obs>={Omega}i) < 0.05, where I() is an indicator function that returns 1 if the Boolean condition is true and 0 otherwise. We refer to the statistic {Omega} as the age-of-mutation (AOM) statistic.

The power of such a test to detect differences in mean age between classes is obviously dependent on how strongly {omega}i is correlated with the age of the mutation responsible for polymorphic site i. The power of the test using the different estimators for {omega} was evaluated as described in Computer simulations. One obvious candidate for {omega} is an estimator of the age of a mutation under a neutral equilibrium infinite sites model suggested by GRIFFITHS and TAVARE 1999 Down. This estimator uses E(T | X) (where T is the age of a mutation and X is the observed data) as an estimator of T. E(T | X) is calculated by integrating over all possible gene genealogies (branch lengths and topologies) while weighting each genealogy by its likelihood. The integral is evaluated by the Monte Carlo method described in GRIFFITHS and TAVARE 1995 Down, GRIFFITHS and TAVARE 1999 Down. This estimator may have desirable statistical properties because all the information in the data regarding T is applied in the estimation of T. However, application of this estimator is computationally very time consuming. Furthermore, in cases where the data do not conform to the infinite sites model, it is necessary to remove conflicting sites or conflicting sequences. For this reason, considerable effort was devoted to developing additional estimators that could be calculated fast and did not require elimination of any data.

Given only b, the observed number of chromosomes bearing a mutation in a sample of n chromosomes, the expected age (a) of the mutation under strict neutrality is given by

(3)

(GRIFFITHS and TAVARE 1998 Down). Results such as this, based on the marginal frequency of a mutation in a single site, can obviously be applied to estimate ages of individual mutations. However, they were in general found to perform much worse than estimators employing more of the data. Also, estimators based on neutrality such as E(T | X) may not perform well when the assumption of selective neutrality is violated. Instead, a heuristic estimator was devised on the basis of estimation of the structure of the underlying genealogy. In this method, an estimate of the underlying genealogy was obtained by the unweighted pair group method using arithmetic averages (UPGMA; SOKAL and MICHENER 1958 Down). Mutations were subsequently assigned to branches (edges) using Fitch and Wagner parsimony (e.g., FITCH 1971 Down), and the expected age of a mutation was estimated as the average age of the branch on which the mutation occurred, i.e., the average of the time of the start of the branch and the time of the end of the branch. In cases where two or more mutations were needed to account for the observed site pattern, the average of the estimates obtained for the different assignments of the mutation was used as an estimator. We denote this estimator by â. Note that under the infinite sites model, â is the expected age of a mutation conditional on the estimated genealogy.

This estimator (â) is applied in addition to E(T | X) because it can be calculated quickly, making it suitable for simulation purposes, and because it does not require the elimination of data when the assumptions of the infinite sites model are violated. However, we do not suggest that this heuristically derived estimator should be used for more general purposes. It was chosen here solely because it provided more power in the test than estimators based on the marginal site pattern such as estimators derived from Equation 3.

Power analysis:
The power of the test statistic given in Equation 2 and of the McDonald/Kreitman test to detect the presence of deleterious mutations was evaluated by computer simulations as described in Computer simulations for different values of the selection coefficient. The proportion of simulated data sets that gave significant test statistic values was tabulated for a range of values of Ns for {theta} = 10 and for a range of values of {theta} for Ns = -3.

McDonald/Kreitman tests (MCDONALD and KREITMAN 1991 Down) of simulated data were performed as a G-test in a 2 x 2 contingency table using the Williams correction for small sample size (see, for example, SOKAL and ROHLF 1981 Down). The numbers entering the contingency table were the number of fixed selected and neutral mutations and the number of segregating selected and neutral mutations in the sample. Empty cell entries were ignored when calculating the test statistic. This mimics a common method for performing the McDonald/Kreitman test and should not inflate the chosen significance level markedly under biologically reasonable parameter values.

Data analysis:
The null hypothesis that we wish to test is that the average age of nonsynonymous mutations is the same as the average age of synonymous mutations. To analyze the data, all polymorphisms were partitioned into nonsynonymous and synonymous polymorphisms. To do this, the following heuristic approach was applied: the most common sequence was chosen as a reference sequence representing a hypothesized ancestral state. This was done only to establish an unambiguous criterion to determine if a mutation was synonymous or nonsynonymous. The identity of a variable site in a codon was determined by comparison to the homologous codon in the reference sequence. In this manner, all polymorphisms could be scored unambiguously as nonsynonymous or synonymous even when there are multiple segregating sites in a single codon. If three nucleotides were segregating in the same site, two mutations were inferred and represented as two variable sites. In most cases only one mutation was observed in each codon, suggesting that the effect of this heuristic approach does not seriously bias the analysis. Also, because the approach was applied blindly with regard to the class of the mutation, it should not inflate the chosen significance level in the resampling test.

For each data set, two resampling tests were performed, one using E(T | X) and one using the heuristically derived â as estimators of the age of a mutation. In each case, the data were resampled 100,000 times to provide a P value. When E(T | X) was applied, the WATTERSON 1975 Down estimator of {theta} was used to drive the simulations, and 100,000 genealogies (runs of the Markov chain) were sampled to provide estimates of the age of each mutation (see GRIFFITHS and TAVARE 1995 Down). Calculation of E(T | X) requires that the data strictly conform to the infinite sites model. For example, HARDING et al. 1997 Down chose to eliminate apparent recombinant sequences. In the present analysis we instead eliminated sites to obtain a subset of sites that are consistent with the infinite sites model. The following heuristic algorithm was implemented for this purpose. Two diallelic sites are in conflict if all of the patterns in Table 1 are observed, where the labeling of state (0, 1) is arbitrary. Let Ci be the number of sites that conflict with site i. Then site i is eliminated from the data set when Ci = max{C1, C2, ... , Ch}, where h is the number of sites. This elimination scheme is repeated until C1 = C2 = ... = Ch = 0. The test based on E(T | X) was performed on these reduced data sets, whereas the test based on â was performed on the full data sets. Data sets so small that a significant result was not theoretically possible were not included in the analysis. Four data sets were eliminated for this reason.


 
View this table:
In this window
In a new window

 
Table 1. Conflicting site patterns

Amino acid saturation dynamics:
To examine the relative degree of mutational saturation in nonsynonymous and synonymous substitutions in mtDNA, the number of nonsynonymous and synonymous nucleotide differences between pairs of species was plotted for 13 mtDNA genes (NADH1, NADH2, NADH3, NADH4, NADH4L, NADH5, NADH6, ATPase 6, ATPase 8, COI, COII, COIII, Cyt. b). The species included in this analysis were Homo sapiens (ANDERSON et al. 1981 Down); Pan paniscus, Pongo pygmaeus (HORAI et al. 1995 Down); Mus musculus (BIBB et al. 1981 Down); Gallus gallus (DESJARDINS and MORAIS 1990 Down); and Drosophila melanogaster (GARESSE 1988 Down). Sequences were aligned by eye and nonsynonymous and synonymous nucleotide differences were counted as described in Data analysis.


*  RESULTS
*TOP
*ABSTRACT
*METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Expected ages of selected and linked neutral segregating mutations:
Several results regarding the ages of neutral mutations are known. For example, the age of a neutral mutation with frequency b in a sample of size n is given by Equation 3. However, selection will affect the underlying gene genealogy and thus the distribution of coalescence times in those genealogies (e.g., KAPLAN et al. 1988 Down; HUDSON and KAPLAN 1994 Down) and will therefore affect the expected age of a sampled selected mutation and of linked neutral mutations. In addition, the age of a selected mutation may differ from the neutral expectation because selected mutations may not be distributed on the genealogy according to a Poisson process.

We are interested in the ratio of the age of selected to the age of neutral mutations in models without recombination. We show that in the models of interest, selection tends to reduce this ratio below unity and that this property of models with selection can be used to test neutrality. First, consider a haploid model in which there is only one selected site in the population and assume that there are only two possible states for this site (A and a). A haplotype carrying state A has fitness 1 and a haplotype carrying state a has fitness 1 + s. Furthermore, assume that mutations occur from state A to state a at rate {upsilon} and from state a to state A at rate {upsilon}. Assume that neutral mutations occur according to an infinite sites model at rate µ and that there is no recombination. Following HUDSON and KAPLAN 1994 Down, we analyze this model in the limit of large population sizes so that the frequency of A is maintained at the equilibrium value. Under this assumption the mutation/selection model can be analyzed as a model of migration between two populations. Mutation between allele A and a is equivalent to migration between two isolated subpopulations with population sizes Nf(A) and Nf(a), respectively, where N is the population size and f(A) and f(a) are the frequencies of A and a, respectively, in the population. The equilibrium frequency of a for biologically plausible values of s and {upsilon} is given by

(4)

(see, for example, NAGYLAKI 1992 Down, pp. 9–13). The probability that an A allele arose by mutation in the previous generation is given by

(5)

and the probability that an a allele arose by mutation in the previous generation is given by

(6)

Equation 5 and Equation 6 correspond to Equation 4 of HUDSON and KAPLAN 1994 Down. Let T(A, a) be the expected coalescence time of a sample of size 2 given that one A allele and one a allele were obtained in the sample, T(2a) be the expected coalescence time of a sample of size 2 given that two a alleles were obtained in the sample, and T(2A) be the expected coalescence time of a sample of size 2 given that two A alleles were obtained in the sample. Assuming that only one mutation or coalescence event can happen in each generation, we can establish the following system of recursive equations for T(A, a), T(2A), and T(2a),

(7)

which can easily be solved for T(A, a), T(2A), and T(2a). Then the mean coalescence time can be found as

(8)

(HUDSON and KAPLAN 1994 Down). Using this result, the expected age of a segregating neutral mutation in a sample of size 2 is then simply given by T2/2. The expected age of a detectable selected mutation in a sample of size 2 is given by

(9)

We can therefore evaluate the ratio of the expected age of a detectable selected mutation to the expected age of a neutral mutation in a sample of size 2 (Figure 1). The absolute ratio of the ages in the figure does not approach 1.0 as Ns approaches 0 because it is a function of {upsilon}. However, note that the ratio is a monotonically decreasing function of s. The reason is that selection has only a marginal effect on the distribution of coalescence times but has a strong effect on the age of the selected mutation. That selection has only little effect on the expected coalescence time in a diallelic model at equilibrium has previously been noted by NEUHAUSER and KRONE 1997 Down. The model described above does not easily pertain to real data because of the assumption that the frequency of A is maintained at the equilibrium value and because only one selected site is considered. A more realistic model would assume an infinite sites model for both selected and neutral mutations and would allow the frequency of an allele to be a random variable through time. However, such a model does not lend itself to analytical treatment. Instead, we examined this model using computer simulations as described in METHODS (Figure 2). Note that the ratio of the mean age of a selected segregating mutation to the age of a segregating neutral mutation is a monotonically increasing function of the selection coefficient in the examined range of Ns and that this relationship extends to positive values of Ns. These results confirm the notion that the differences in the average age of nonsynonymous and synonymous mutations may be used to detect selection against slightly deleterious mutations. If a large fraction of nonsynonymous mutations is slightly deleterious, their average age should be less than the average age of synonymous mutations. However, if the excess of nonsynonymous mutations segregating in mtDNA within populations is caused by other forces such as balancing selection, one would not necessarily expect the age of nonsynonymous mutations to be less than the age of synonymous mutations. Indeed, balancing selection is expected to increase the age of the selected mutation relative to the neutral expectation.



View larger version (9K):
In this window
In a new window
Download PPT slide
 
Figure 1. The ratio of the expected age of a detectable selected mutation to the age of a neutral mutation (age ratio) in the model described in the text with N{upsilon} = 1.0.



View larger version (13K):
In this window
In a new window
Download PPT slide
 
Figure 2. The ratio of the average age of nonsynonymous mutations to synonymous mutations in a sample when nonsynonymous mutations are under constant positive or negative selection. Parameter values were n = 25 (sample size), {theta} = 10, N = 1000, µsn = 1. A total of 1000 data sets were obtained to evaluate each point. Simulations were performed as described in METHODS.

Power analyses:
The power of the McDonald/Kreitman test (MCDONALD and KREITMAN 1991 Down) has not received much attention in the literature, although AKASHI 1999 Down recently explored its power under the assumption of free recombination between adjacent sites. We were interested in examining its power against the hypothesis of purifying selection in the absence of recombination. This is the case applicable to mitochondrial DNA. Such analysis may provide a crude estimate of the size of the selection coefficients required to explain the repeated observations of departures from neutrality in animal mtDNA (BALLARD and KREITMAN 1994 Down; NACHMAN et al. 1994 Down, NACHMAN et al. 1996 Down; Rand et al. 1994 Down; Rand AND KANN 1996 Down, Rand AND KANN 1998 Down; NACHMAN 1998 Down; WISE et al. 1998 Down). We examined the power of the test as described in METHODS (Figure 3) and found that, although it has a maximum at ~Ns = -10, the test has power >50% over the range -2 > Ns > -40. Thus selection coefficients acting on nonsynonymous mutations in mtDNA should fall in this range if they are to account for the many observed deviations from neutrality.



View larger version (16K):
In this window
In a new window
Download PPT slide
 
Figure 3. The power (the proportion of tests significant at the 5% level) of the McDonald/Kreitman test (a) and the test of the ratio of mean ages of selected to neutral mutations (b) under negative selection. The parameter values are as in Figure 2; µt = 180.

Having demonstrated that under purifying selection the mean age of segregating selected sites is less than the mean age of segregating neutral sites in a population (Figure 1 and Figure 2), we next turned to the problem of comparing mean ages of selected and neutral sites segregating in a sample of chromosomes drawn from a population. We examined the power of the AOM test (Equation 2) when using â as an estimator of the age of mutations by computer simulations as described in METHODS and found it to increase as Ns is reduced from 0 to -10 with a maximum in the range -10 > Ns > -50 (Figure 3A). Power analysis of the AOM test using E(T | X) as an estimator of age of mutations by simulation is computationally too expensive to perform.

Data analysis:
The hypothesis that the ages of nonsynonymous and synonymous polymorphisms are equal was tested for 25 previously published animal mtDNA data sets (BALLARD and KREITMAN 1994 Down; NACHMAN et al. 1994 Down, NACHMAN et al. 1996 Down; Rand AND KANN 1996 Down; WISE et al. 1998 Down; and references in NACHMAN 1998 Down and Rand AND KANN 1998 Down). The results of the tests are described in Table 2. Note that in only 6 cases using â as the estimator of relative age and in only 4 cases using E(T | X) as the estimator of age was the hypothesis of an equal average age of nonsynonymous and synonymous mutations rejected by the AOM test, whereas the McDonald/Kreitman test rejected neutrality in 14 of 25 cases.


 
View this table:
In this window
In a new window

 
Table 2. The results of the test of equal age of nonsynonymous and synonymous segregating mutations


*  DISCUSSION
*TOP
*ABSTRACT
*METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Application of the McDonald/Kreitman test (MCDONALD and KREITMAN 1991 Down) to animal mtDNA has repeatedly demonstrated that the evolution of this genetic element is not consistent with strict neutrality (BALLARD and KREITMAN 1994 Down; NACHMAN et al. 1994 Down, NACHMAN et al. 1996 Down; Rand et al. 1994 Down; Rand AND KANN 1996 Down, Rand AND KANN 1998 Down; NACHMAN 1998 Down; HASEGAWA et al. 1998 Down; WISE et al. 1998 Down). Relative to the number of nonsynonymous sites fixed between species, these data sets all show an excess number of segregating nonsynonymous sites within species. KIMURA 1983 Down has shown that deleterious alleles contribute more to (within-species) heterozygosity than to (between-species) divergence and thus, these observations may be consistent with the notion that most segregating synonymous mutations are roughly neutral but that most segregating nonsynonymous mutations are mildly deleterious. We set out to explicitly test this hypothesis. On the basis of theoretical predictions (Figure 1 and Figure 2) we chose the ratio of the estimated mean age of nonsynonymous mutations to the estimated mean age of synonymous mutations as a test statistic. Two estimators of the age of a mutation were used to analyze published mtDNA data sets: E(T | X) and â (an estimator based on estimation of the underlying gene genealogy using classical phylogenetic methods). E(T | X) has the advantage of using all the information in a sample, but assumes that the data fit the infinite sites model. As shown in Table 2, most of the published data sets (18/25) do not fit this model. Furthermore, the calculation of E(T | X) is very time consuming, making it unsuitable for simulation purposes. This is the motivation for using â as an age estimator of the age of a mutation in addition to E(T | X).

However, note that the selective removal of sites when E(T | X) is used will not lead to falsely significant results. To realize this, note that under the null hypothesis of strict neutrality, both nonsynonymous and synonymous mutations should be distributed according to a Poisson process on the underlying shared gene genealogy. Because the distribution of the times of events in a Poisson process is given by the order statistics of a uniform random variable, the expected age of a mutation in a site is independent of the number of mutations occurring at the site. Thus the expected age of mutations in a data set will not change by the removal of sites with many mutations nor will the ratio of the expected ages of nonsynonymous to synonymous mutations change.

On the basis of an analysis of the power of the McDonald/Kreitman test (Figure 3A), we conclude that if purifying selection alone is responsible for the observed deviations from neutrality in animal mtDNA, selection coefficients must be in the range -2 > Ns > -40. It was surprising to us that the McDonald/Kreitman test retains power over such a wide range and that such large (negative) values of Ns may be responsible for the observations. AKASHI 1999 Down found that little power remained for values of Ns <-10 and that essentially no power remained for values of Ns <-20. The discrepancy between the results can presumably be explained by the effect of linkage, which is known to reduce the efficacy of selection (e.g., HILL and ROBERTSON 1966 Down).

The AOM test based on â has less power to reject the hypothesis of neutrality across most of the ranges of Ns examined (Figure 3B) than the McDonald/Kreitman test. Power rises as Ns is reduced from -1 to -10 after which it appears to reach a plateau of ~65%. It is not surprising that the McDonald/Kreitman test has more power in most of the parameter space because this test employs more data (interspecific comparisons in addition to intraspecific comparisons) than the test based on â. However, the new test based on â uses information in the data not employed by the McDonald/Kreitman test, and it is able to distinguish between hypotheses of balancing selection and selection on deleterious mutations. It should therefore provide additional information regarding selection coefficients when applied in addition to the McDonald/Kreitman test. Another advantage of the new test is that it is quite robust to the model assumptions. Under strict neutrality, the distribution of site patterns in nonsynonymous sites and in synonymous sites should be identical. If the weights assigned to different site patterns are poorly chosen, this will lead to a reduction in power but not to an inflation of the chosen significance level. This is also true even if the population is not at stationarity because of the decoupling of the mutational and the genealogical processes under neutrality.

Six out of 25 tests were significant using â as an estimator of the age of an allele (Table 2). The probability of randomly obtaining 6 or more significant results in 25 tests at the 5% significance level is P = 0.0012. One of the data sets (Mesomys) has a P value so low (P < 0.001) that the null hypothesis of no difference in age of synonymous and nonsynonymous mutations for all data sets can be rejected by a Bonferroni test. It is curious to note that all six significant data sets contain a relatively large number of segregating sites; indeed three of the six are the three largest data sets shown in Table 2 with respect to number of segregating sites. This may suggest that the small number of significant tests is caused by a lack of statistical power when only few polymorphic sites are available. In contrast there appears to be no bias among significant data sets with respect to number of chromosomes examined. Also, in 17/25 cases, the estimated mean age of nonsynonymous mutations is less than the estimated mean age of synonymous substitution (not shown). Together, these results provide weak support for the hypothesis that the average age of nonsynonymous mutations is less than the average age of synonymous mutations.

To test the hypothesis that the small number of significant AOM results in Table 2 could be the consequence of small sample size (polymorphic sites), additional power analyses of both the McDonald/Kreitman and the AOM tests were performed in which Ns was held constant and {theta} varied (Figure 4). Indeed the reduction in power when {theta} is small is somewhat more severe for the AOM test than for the McDonald/Kreitman test.



View larger version (13K):
In this window
In a new window
Download PPT slide
 
Figure 4. The power (the proportion of tests significant at the 5% level) of the two tests ({square}, McDonald/Kreitman test; {triangleup}, AOM test) when Ns = 3 and {theta} is varied. The remaining parameters are as in Figure 2.

Supposing that purifying selection is acting on amino acid replacement mutations, the next question is which range of values of Ns are consistent with the observations from both the McDonald/Kreitman test and the AOM test presented here. Figure 5 (solid bars) shows the expected frequency distributions of P values for the test based on â derived from simulation with Ns = 3. Note that Ns = 3 is near the bottom of the power curve of the AOM test (Figure 3B); however, if Ns is larger, the expected distribution would be even more skewed toward low P values. In contrast the frequency distribution of P values observed in the 25 data sets is nearly uniform (Figure 5, hatched bars). It appears that Ns must be <3 to account for the distribution of P values obtained in the test based on â. However, if selection against deleterious mutations is to explain the many significant McDonald/Kreitman tests, Ns should be >2. There is therefore only a small window of values of Ns that are consistent with the findings in this article. Given the variability of population sizes in naturally occurring populations, it is difficult to explain why all values of Ns should fall in this range.



View larger version (23K):
In this window
In a new window
Download PPT slide
 
Figure 5. The observed distribution of P values (solid bars) for the 25 data sets and the expected distribution (hatched bars) for Ns = 3 and the same parameter values as in Figure 2.

The model applied to describe selection against deleterious mutations is very simple in that it assumes that all nonlethal nonsynonymous mutations have the same selection coefficient and that this selection coefficient is constant in time. Likewise, it is assumed in the simulations that the populations are in stationarity, i.e., a constant population size and no population subdivision. Although there is no reason to assume that a more complex demographic model or a more complex distribution of negative selection coefficients would change the relative power of the McDonald/Kreitman test vs. the AOM test, we cannot rule out this possibility.

If purifying selection alone is not responsible for the repeated observations of deviation from neutrality, what other processes are consistent with the data? One obvious explanation is that the significant results partly are artifacts caused by the simple methods used to estimate the number of nonsynonymous and synonymous mutations (e.g., HASEGAWA et al. 1998 Down). Such an explanation would require that saturation occur faster in nonsynonymous than in synonymous sites, i.e., the number of nonsynonymous nucleotide differences should be a convex function of the number of synonymous nucleotide differences (NIELSEN 1997 Down). To examine this hypothesis, we plotted the number of nonsynonymous nucleotide differences between pairs of species against the number of synonymous nucleotide differences for several mitochondrial genes (Figure 6). As expected, saturation occurs faster in synonymous sites than in nonsynonymous sites. Thus the use of nucleotide differences instead of the estimated number of substitutions in the McDonald/Kreitman test does not appear to be responsible for the apparent pattern of deviation from neutrality in mtDNA evolution. In fact, saturation by multiple hits should make the McDonald/Kreitman test more conservative against the alternative of segregating deleterious mutations.



View larger version (12K):
In this window
In a new window
Download PPT slide
 
Figure 6. Observed nonsynonymous differences per site (dn) and synonymous differences per site (ds) in pairwise comparisons for multiple mtDNA-encoded proteins.

Another problem is sequencing errors. Such errors could inflate the significance level of the McDonald/Kreitman test because they would tend preferentially to elevate the number of intraspecific nonsynonymous mutations. Likewise, sequencing errors could be of importance in explaining the significant results from the AOM test presented in this article. However, sequencing errors should also elevate the ratio of nonsynonymous mutations to synonymous mutations for small levels of divergence in interspecific data. Because no such pattern is observed in Figure 6, we do not believe that this explanation is likely. Moreover, elevated intraspecific nonsynonymous mutations have not been observed in nuclear data sets (MCDONALD and KREITMAN 1991 Down; D. M. WEINREICH and D. RAND, unpublished results). However, we cannot conclusively rule out that sequencing errors may be of some importance in explaining the large number of significant McDonald/Kreitman tests.

Theoretically, balancing selection acting on mtDNA could maintain levels of nonsynonymous polymorphism within species above that seen between species (NACHMAN et al. 1994 Down; Rand AND KANN 1996 Down). Balancing selection could act through cytoplasmic-nuclear interaction (MCRAE and ANDERSON 1988 Down; BABCOCK and ASMUSSEN 1996 Down; GOODISMAN and ASMUSSEN 1997 Down). However, it is difficult to imagine that this type of selection can act on sufficiently many nucleotide sites simultaneously to account for the observed results. Indeed, selection against slightly deleterious mutations appears at the moment to be the most plausible explanation for the significant results of the McDonald/Kreitman test in animal mtDNA. However, a combination of forces cannot be excluded.


*  FOOTNOTES

1 Present address: Department of Ecology and Evolutionary Biology, Brown University, Providence, RI 02912. Back


*  ACKNOWLEDGMENTS

This study was supported by a fellowship to R.N. from the Danish Research Council. David Rand and Michael Nachman provided several animal mtDNA data sets prior to publication.

Manuscript received December 20, 1998; Accepted for publication June 2, 1999.


*  LITERATURE CITED
*TOP
*ABSTRACT
*METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

AKASHI, H., 1999  Inferring the fitness effects of DNA mutations from polymorphism and divergence data: statistical power to detect directional selection under stationarity and free recombination. Genetics 151:221-238[Abstract/Free Full Text].

ANDERSON, S., A. T. BANKIER, B. G. BARRELL, M. H. L. DEBRUIJN, and A. R. COULSON et al., 1981  Sequence and organization of the human mitochondrial genome. Nature 290:457-465[Medline].

BABCOCK, D. S. and M. A. ASMUSSEN, 1996  Effects of differential selection in the sexes on cytonuclear polymorphism and disequilibria. Genetics 144:839-853[Abstract].

BAKER, R. J., R. A. VANDENBUSSCHE, A. J. WRIGHT, L. E. WIGGINS, and M. J. HAMILTON et al., 1996  High levels of genetic change on rodents of Chernobyl. Nature 380:707-708[Medline].

BALLARD, J. W. O. and M. KREITMAN, 1994  Unraveling selection in the mitochondrial genome of Drosophila.. Genetics 138:875-888.

BIBB, M. J., R. A. VAN ETTEN, C. T. WRIGHT, M. W. WALBERG, and D. A. CLAYTON, 1981  Sequence and gene organization of mouse mitochondrial DNA. Cell 26:167-180[Medline].

BRUNA, E. M., R. N. FISHER, and T. J. CASE, 1996  Morphological and genetic evolution is decoupled in Pacific skinks (Squamata: Scinidae: Emoia). Proc. R. Soc. Lond. Ser. B. 263:681-688[Abstract/Free Full Text].

DESJARDINS, P. and R. MORAIS, 1990  Sequence and gene organization of the chicken mitochondrial genome. J. Mol. Biol. 212:599-634[Medline].

FITCH, W. M., 1971  Toward defining the course of evolution: minimum change for a specific tree topology. Syst. Zool. 20:406-416.

GARESSE, R., 1988  Drosophila melanogaster mitochondrial DNA: gene organization and evolutionary considerations. Genetics 118:649-663[Abstract/Free Full Text].

GILLESPIE, J. H., 1991 The Causes of Molecular Evolution. Oxford University Press, Oxford.

GOODISMAN, M. A. D. and M. A. ASMUSSEN, 1997  Cytonuclear theory for haplodiploid species and X-linked genes. I. Hardy-Weinburg dynamics and continent-island, hybrid zone models. Genetics 147:321-338[Abstract].

GRIFFITHS, R. C. and S. TAVARÉ, 1995  Unrooted genealogical tree probabilities in the infinitely-many-sites model. Math. Biosci. 127:77-98[Medline].

GRIFFITHS, R. C. and S. TAVARÉ, 1998  The age of a mutation in a general coalescent tree. Stoch. Model. 14:273-295.

GRIFFITHS, R. C. and S. TAVARÉ, 1999  The ages of mutations in gene trees. Ann. Appl. Prob. in press.

HARDING, R. M., S. M. FULLERTON, R. C. GRIFFITHS, J. BOND, and M. J. COX et al., 1997  Archaic African and Asian lineages in the genetic ancestry of modern humans. Am. J. Hum. Genet. 60:772-789[Medline].

HASEGAWA, M., Y. CAO, and Z. YANG, 1998  Preponderance of slightly deleterious polymorphism in mitochondrial DNA: nonsynonymous/synonymous rate ratio is much higher within species than between species. Mol. Biol. Evol. 15:1499-1505[Free Full Text].

HILL, W. G. and A. ROBERTSON, 1966  The effect of linkage on the limits to artificial selection. Genet. Res 8:269-294[Medline].

HORAI, S., K. HAYASAKA, R. KONDO, K. TSUGANE, and N. TAKAHATA, 1995  Recent African origin of modern humans revealed by complete sequences of hominoid mitochondrial DNAs. Proc. Natl. Acad. Sci. USA 92:532-536[Abstract/Free Full Text].

HUDSON, R. R., and N. L. KAPLAN, 1994 Gene trees with background selection, pp. 140–153 in Non-Neutral Evolution, edited by B. GOLDING. Chapman and Hall, New York.

KAPLAN, N. L., T. DARDEN, and R. R. HUDSON, 1988  The coalescent in models with selection. Genetics 120:819-829[Abstract/Free Full Text].

KIMURA, M., 1983 The Neutral Theory of Molecular Revolution. Cambridge University Press, Cambridge, United Kingdom.

MCDONALD, J. H. and M. KREITMAN, 1991  Adaptive evolution in the ADH locus in Drosophila. Nature 351:652-654[Medline].

MCRAE, A. F. and W. W. ANDERSON, 1988  Evidence for non-neutrality of mitochondrial DNA haplotypes in Drosophila pseudoobscura.. Genetics 120:485-494[Abstract/Free Full Text].

MORITZ, C., C. J. SCHNEIDER, and D. B. WAKE, 1992  Evolutionary relationships within the Ensatina eschscholtzii complex confirm the ring species interpretation. Syst. Biol. 41:273-291.

NACHMAN, M. W., 1998  Deleterious mutations in animal mitochondrial DNA. Genetica 102(103):61-69.

NACHMAN, M. W., S. N. BOYER, and C. F. AQUADRO, 1994  Non-neutral evolution at the mitochondrial NADH dehydrogenase subunit 3 gene in mice. Proc. Natl. Acad. Sci. USA 91:6364-6368[Abstract/Free Full Text].

NACHMAN, M. W., W. M. BROWN, M. STONEKING, and C. F. AQUADRO, 1996  Nonneutral mitochondrial DNA variation in humans and chimpanzees. Genetics 142:953-963[Abstract].

NAGYLAKI, T., 1992 Introduction to Theoretical Population Genetics. Springer-Verlag, Berlin.

NEUHAUSER, C. and S. M. KRONE, 1997  The genealogy of samples in models with selection. Genetics 145:519-534[Abstract].

NIELSEN, R., 1997  The ratio of replacement to silent divergence and tests of neutrality. J. Evol. Biol. 10:217-231.

OHTA, T., 1973  Slightly deleterious mutant substitutions in evolution. Nature 246:96-98[Medline].

RAND, D. M. and L. KANN, 1996  Excess amino acid polymorphism in mitochondrial DNA: contrasts among genes from Drosophila, mice and humans. Mol. Biol. Evol. 13:735-748[Abstract].

RAND, D. M. and L. KANN, 1998  Mutation and selection at silent and replacement sites in the evolution of animal mitochondrial DNA. Genetica 102(103):393-407.

RAND, D. M., M. DORFSMAN, and L. M. KANN, 1994  Neutral and nonneutral evolution of Drosophila mitochondrial DNA. Genetics 138:741-756[Abstract].

RUVOLO, M., D. PAN, S. ZEHR, T. GOLDBERG, and T. R. DISOTELL et al., 1994  Gene trees and hominoid phylogeny. Proc. Natl. Acad. Sci. USA 91:8900-8904[Abstract/Free Full Text].

SOKAL, R. R. and C. D. MICHENER, 1958  A statistical method for evaluating systematic relationship. Univ. Kansas Sci. Bull. 28:1409-1438.

SOKAL, R. R., and F. J. ROHLF, 1981 Biometry. W. H. Freeman and Company, New York.

SUMMERS, K., E. BERMINGHAM, L. WEIGT, S. MCCAFFERTY, and L. DAHLSTROM, 1997  Phenotypic and genetic divergence in three species of dart-poison frogs with contrasting parental behavior. J. Hered. 88:8-13[Abstract/Free Full Text].

WATTERSON, G. A., 1975  On the number of segregating sites in genetical models without recombination. Theor. Popul. Biol. 7:256-276[Medline].

WISE, C. A., S. MICHEALA, and S. EASTEAL, 1998  Departure from neutrality at the mitochondrial NADH dehydrogenase subunit 2 gene in humans, but not in chimpanzees. Genetics 148:409-421[Abstract/Free Full Text].

WOOD, T. C. and C. KRAJEWSKI, 1996  Mitochondrial DNA sequence variation among the subspecies of Sarus Crane (Grus antigone).. Auk 113:655-663.

ZINK, R. M. and R. C. BLACKWELL, 1996  Patterns of allozyme, mitochondrial DNA, and morphometric variation in four sparrow genera. Auk 113:59-67.




This article has been cited by other articles:


Home page
J. Virol.Home page
D. C. Tully and M. A. Fares
Shifts in the Selection-Drift Balance Drive the Evolution and Epidemiology of Foot-and-Mouth Disease Virus
J. Virol., January 15, 2009; 83(2): 781 - 790.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
P. R. Haddrill, D. Bachtrog, and P. Andolfatto
Positive and Negative Selection on Noncoding DNA in Drosophila simulans
Mol. Biol. Evol., September 1, 2008; 25(9): 1825 - 1834.
[Abstract] [Full Text] [PDF]


Home page
J. Virol.Home page
S. Kryazhimskiy, G. A. Bazykin, and J. Dushoff
Natural Selection for Nucleotide Usage at Synonymous and Nonsynonymous Sites in Influenza A Virus Genes
J. Virol., May 15, 2008; 82(10): 4938 - 4945.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
K. Popadin, L. V. Polishchuk, L. Mamirova, D. Knorre, and K. Gunbin
Accumulation of slightly deleterious mutations in mitochondrial protein-coding genes of large versus small mammals
PNAS, August 14, 2007; 104(33): 13390 - 13395.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
O. G. Pybus, A. Rambaut, R. Belshaw, R. P. Freckleton, A. J. Drummond, and E. C. Holmes
Phylogenetic Evidence for Deleterious Mutation Load in RNA Viruses and Its Contribution to Viral Evolution
Mol. Biol. Evol., March 1, 2007; 24(3): 845 - 852.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
C. T. T. Edwards, E. C. Holmes, O. G. Pybus, D. J. Wilson, R. P. Viscidi, E. J. Abrams, R. E. Phillips, and A. J. Drummond
Evolution of the Human Immunodeficiency Virus Envelope Gene Is Dominated by Purifying Selection
Genetics, November 1, 2006; 174(3): 1441 - 1453.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
S. Y. W. Ho, M. J. Phillips, A. Cooper, and A. J. Drummond
Time Dependency of Molecular Rate Estimates and Systematic Overestimation of Recent Divergence Times
Mol. Biol. Evol., July 1, 2005; 22(7): 1561 - 1568.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
A. Eyre-Walker, P. D. Keightley, N. G. C. Smith, and D. Gaffney
Quantifying the Slightly Deleterious Mutation Model of Molecular Evolution
Mol. Biol. Evol., December 1, 2002; 19(12): 2142 - 2149.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
R. Nielsen
Mutations as Missing Data: Inferences on the Ages and Distributions of Nonsynonymous and Synonymous Mutations
Genetics, September 1, 2001; 159(1): 401 - 411.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
B. C. Verrelli and W. F. Eanes
Clinal Variation for Amino Acid Polymorphisms at the Pgm Locus in Drosophila melanogaster
Genetics, April 1, 2001; 157(4): 1649 - 1663.
[Abstract] [Full Text]


Home page
GeneticsHome page
B. C. Verrelli and W. F. Eanes
Extensive Amino Acid Polymorphism at the Pgm Locus Is Consistent With Adaptive Protein Evolution in Drosophila melanogaster
Genetics, December 1, 2000; 156(4): 1737 - 1752.
[Abstract] [Full Text]


Home page
GeneticsHome page
D. M. Weinreich and D. M. Rand
Contrasting Patterns of Nonneutral Evolution in Proteins Encoded in Nuclear and Mitochondrial Genomes
Genetics, September 1, 2000; 156(1): 385 - 399.
[Abstract] [Full Text]