Genetics, Vol. 161, 835-848, June 2002, Copyright © 2002

Muller's Ratchet and the Pattern of Variation at a Neutral Locus

Isabel Gordoa, Arcadio Navarrob, and Brian Charlesworthb
a Instituto Gulbenkian da Ciência, P-2781-901 Oeiras, Portugal
b Institute of Cell, Animal and Population Biology, University of Edinburgh, Edinburgh EH9 3JT, United Kingdom

Corresponding author: Isabel Gordo, Instituto Gulbenkian da Ciência, Rua da Quinta Grande 6, Apartado 14, P-2781-901 Oeiras, Portugal.

Communicating editor: N. TAKAHATA


*  ABSTRACT
*TOP
*ABSTRACT
*SIMULATION METHODS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*LITERATURE CITED

The levels and patterns of variation at a neutral locus are analyzed in a haploid asexual population undergoing accumulation of deleterious mutations due to Muller's ratchet. We find that the movement of Muller's ratchet can be associated with a considerable reduction in genetic diversity below classical neutral expectation. The extent to which variability is reduced is a function of the deleterious mutation rate, the fitness effects of the mutations, and the population size. Approximate analytical expressions for the expected genetic diversity are compared with simulation results under two different models of deleterious mutations: a model where all deleterious mutations have equal effects and a model where there are two classes of deleterious mutations. We also find that Muller's ratchet can produce a considerable distortion in the neutral frequency spectrum toward an excess of rare variants.


EVERY population is continuously exposed to newly occurring mutations, the majority of which are probably deleterious. MULLER 1964 Down pointed out that, in the absence of recombination and back mutation, a finite population would suffer an accumulation of deleterious mutations due to the irreversible loss of the classes of individuals with the least numbers of mutations—the least-loaded classes. Consider the simplest case of a nonrecombining haploid population of N breeding individuals, subject to mutations with a deleterious effect s, occurring at a rate U per individual per generation. With multiplicative fitness interactions between loci and a Poisson distribution of the number of mutations occurring per individual per generation, the population is divided into classes of individuals according to the number of mutations they carry. In an effectively infinite population at deterministic equilibrium, the frequencies of the classes, after mutation but prior to selection, are given by a Poisson distribution with mean U/s (HAIGH 1978 Down). In particular, the size of the class free of deleterious mutations is . If the size of this class is not very large, genetic drift will cause it to be lost. After such an event, it will be replaced by the next least-loaded class. Then a new deterministic equilibrium will be approached and, after some time, the least-loaded class will be lost again.

This is the repetitive irreversible process known as Muller's ratchet. Muller's ratchet has been thought to play a role in providing an evolutionary advantage to sex and recombination and to cause the degeneration of large nonrecombining portions of the genome of sexual organisms, such as Y chromosomes, and the extinction of small asexual populations (MULLER 1964 Down; FELSENSTEIN 1974 Down; CHARLESWORTH 1978 Down; MAYNARD SMITH 1978 Down; LYNCH et al. 1993 Down; GESSLER and XU 1999 Down). A great deal of theoretical work has been done to understand the circumstances under which Muller's ratchet can produce evolutionary effects. In particular, one can ask the question: How many generations on average will it take for an asexual population to lose its present best class of individuals (HAIGH 1978 Down; PAMILO et al. 1987 Down; STEPHAN et al. 1993 Down; GESSLER 1995 Down; HIGGS and WOODCOCK 1995 Down; CHARLESWORTH and CHARLESWORTH 1997 Down; GORDO and CHARLESWORTH 2000A Down, GORDO and CHARLESWORTH 2000B Down)?

Although the biological importance of the ratchet can be assessed by the quantification of this time and the associated decline in mean fitness, its extremely high sensitivity to small changes in the parameters (GORDO and CHARLESWORTH 2000A Down, GORDO and CHARLESWORTH 2000B Down), together with our lack of knowledge of the exact values of the relevant parameters of mutation and selection (KEIGHTLEY and EYRE-WALKER 1999 Down), make it hard to draw definitive conclusions about the ratchet's role. For example, if the majority of deleterious mutations are mildly deleterious (with s << 1%), then, with our present knowledge of the deleterious mutation rate, Muller's ratchet could potentially be a major process driving the degeneration of Y chromosomes, even in very large populations such as those of Drosophila (GORDO and CHARLESWORTH 2000B Down). But if this is not the case, its operation may be biologically negligible in that context.

One signature of the operation of Muller's ratchet is the fixation of deleterious alleles as a consequence of the recurrent loss of the best class (HIGGS and WOODCOCK 1995 Down; CHARLESWORTH and CHARLESWORTH 1997 Down; BERGSTROM and PRITCHARD 1998 Down). In this article, we try to evaluate another signature of its operation. It is well known that the elimination of strongly deleterious mutations can substantially reduce variation levels at linked neutral sites—an effect known as "background selection" (CHARLESWORTH et al. 1993 Down). Background selection, as classically stated, assumes that no irreversible accumulation of deleterious mutations occurs, which is not the case when the ratchet is turning. We therefore asked the following question: What is the level and pattern of neutral variation in a population where Muller's ratchet is operating? Observations on neutral variability in asexual populations or on Y chromosomes may detect the signature of processes such as the ratchet (CHARLESWORTH and CHARLESWORTH 2000 Down), so that it is important to have theoretical predictions of what to expect.

This question is examined using Monte Carlo stochastic simulations of a neutral locus embedded in a set of selected loci that accumulate deleterious mutations by the ratchet mechanism. Variability at the neutral locus is measured and compared with both analytical and simulated results based on the structured coalescent. Tajima's D statistic (TAJIMA 1989 Down) and Fu and Li's D* statistic (FU and LI 1993 Down), commonly used to test deviations from the standard neutral model, are calculated, and their power to reject neutrality in a population undergoing a ratchet mechanism is assessed. Recent studies on the rate of occurrence and selection coefficients of deleterious mutations have suggested that a simple discrete distribution with two classes of mutation effects seems to fit the data better than a continuous distribution (KEIGHTLEY 1996 Down; DAVIES et al. 1999 Down). We therefore also studied a model in which the population is subject to deleterious mutations with two major types of effects and compared the level of neutral variability under such a model to an analytical approximation.


*  SIMULATION METHODS
*TOP
*ABSTRACT
*SIMULATION METHODS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*LITERATURE CITED

Multilocus Monte Carlo simulations:
Following the previous work of GORDO and CHARLESWORTH 2000A Down, GORDO and CHARLESWORTH 2000B Down, a haploid, nonrecombining population of N chromosomes was simulated with the following life cycle: mutation, reproduction, and selection. In each generation, mutations to deleterious alleles occur according to a Poisson distribution with mean U. Multiplicative fitness effects of the deleterious mutations are assumed. Two kinds of model of the effects of deleterious mutations are considered: a model in which all mutations have the same selection coefficient (s) and a model where mutations can have two types of deleterious effect (ss and sb, with ss < sb; GORDO and CHARLESWORTH 2001 Down). At a neutral locus, mutations are generated according to the infinite-sites model; i.e., each mutation occurs at a new site, at a total rate of µ at the locus, which has 250 neutral sites. Every N generations, a sample of size n = 25 chromosomes is taken from the population, and variability measures at the neutral locus are computed. For each set of parameter values, 10 simulation runs were done, with 5 samples taken per run (no significant correlation between samples was observed). A total of 50 samples was used for the computations.

Coalescent simulations:
A model in which all mutations have identical selection coefficients (s) was simulated using the coalescent process. Our method is based on the structured coalescent described in CHARLESWORTH et al. 1995 Down, with a slight modification. The program follows the genealogies of a sample of neutral alleles backward in time. These can move between the different classes of individuals defined by the number of deleterious mutations harbored by each individual. The frequencies of these classes, after selection, are given by their deterministic expectation, i.e., according to a Poisson distribution with parameter when n0 > 1. When , the distribution is replaced by a shifted Poisson distribution with parameter , where (GESSLER 1995 Down; see below for details). The first step in the program is to generate a transition matrix, Qij, of the probabilities that an individual with i mutations in a given generation has an ancestor with j mutations in the previous generation, according to Equation 3 of CHARLESWORTH et al. 1995 Down. Then, the haplotypes in which the neutral alleles in the sample are embedded are generated randomly from the equilibrium distribution after selection (this is different from CHARLESWORTH et al. 1995 Down, who considered the equilibrium distribution after mutation and prior to selection—although in practice, this makes no difference for the values of s considered here).

After these two preliminary steps, the backward process starts. Every generation, the number of mutations in the ancestor of each individual is obtained randomly by using the probabilities in the matrix Qij as expected values. When all the ancestors have been assigned to a class, coalescence is allowed to occur between individuals belonging to the same class, with probability ki(ki - 1)/2/Nf*i, where ki is the number of lineages with i deleterious mutations present in the sample at a given generation and Nf*i is the deterministic equilibrium size of class i, after selection. The possibility of more than one coalescent event within a class is neglected. As commonly implemented in previous algorithms for the structured coalescent (HUDSON 1990 Down; CHARLESWORTH et al. 1995 Down), simultaneous coalescent events are possible in the same generation only if they occur in different classes. Once the most recent common ancestor of the whole sample is reached, neutral mutations are distributed over the gene tree generated by the simulation according to the infinite-sites model. For each set of parameter values, 104 trees were generated.

Measures of genetic diversity at a neutral locus:
Two measures of genetic variation in a sample of alleles at the neutral locus are considered: the mean number of pairwise differences between randomly sampled sequences, k, and the number of segregating sites, S. Under the infinite-sites model in the absence of deleterious mutations, the expectations of these quantities for a haploid population are

(EWENS 1979 Down), where , where the subscript 0 refers to the strictly neutral model. In the absence of recombination, these expectations are reduced by a factor of in a large population that is at equilibrium between recurrent mutation to strongly deleterious alleles and their elimination by purifying selection (background selection). This approximation was shown previously to be accurate in a population where Muller's ratchet does not operate (CHARLESWORTH et al. 1993 Down).

The mutational frequency spectrum:
Selection against deleterious mutations is expected to affect k more than S, since k is weighted toward variants at intermediate frequencies (TAJIMA 1989 Down; CHARLESWORTH et al. 1993 Down). This is observed when computing statistics, such as Tajima's D, designed to test deviations from the frequency spectrum predicted under strict neutrality. Tajima's D is defined as

where and Var(k - {theta}w) is calculated assuming no recombination (TAJIMA 1989 Down). Negative values of D are associated with a skew in the distribution of frequencies of neutral mutations toward an excess of rare variants. Because we expect a negative D in the presence of purifying selection (CHARLESWORTH et al. 1993 Down), we asked how often, in the presence of a ratchet, we can reject the neutral model because of very negative D's.

For a given {theta} value, we ran standard coalescent simulations (HUDSON 1990 Down) of the neutral infinite-sites mutational model and calculated the critical values [at the 95% confidence interval (C.I.)] of the statistic D. We then used forward simulations to compute the rejection power, given by the proportion of forward simulations whose observed D was lower than the critical value expected under neutrality: Pow 1. Although in some species, such as Drosophila melanogaster, one has information about genome-wide levels of variability, from which one can estimate {theta}, in others such information is not available. Because {theta} is generally not known, we also performed a power analysis assuming a fixed number of segregating sites, S (HUDSON 1993 Down). We generated standard neutral genealogies, distributed S mutations onto them, and obtained the 95% critical values of D. Afterward, we ran structured coalescent simulations, also distributing S mutations onto the genealogical trees, and calculated Pow 2, the proportion of times the value of D obtained in the structured coalescent simulations was lower than the critical D in the neutral simulations. A total of 104 genealogical trees were run for every set of parameters.


*  RESULTS
*TOP
*ABSTRACT
*SIMULATION METHODS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*LITERATURE CITED

Muller's ratchet and genetic diversity:
Suppose that the accumulation of deleterious alleles is occurring due to the repetitive "clicks" of Muller's ratchet. What is the expected level of variability at a locus evolving neutrally? In Fig 1 we show the reduction in the mean number of pairwise differences caused by deleterious mutations, i.e., the ratio of the observed k to that expected in the absence of purifying selection, k0, as a function of s. We also plot the deterministic equilibrium frequency of the least-loaded class, f0. For any value of N, with a sufficiently large value of s the reduction in k is independent of N and is very well approximated by f0. With recurrent mutations with very large deleterious effects, the rate at which the ratchet operates is extremely low (if it operates at all), and the level of variation at a neutral locus reflects the size of the class of individuals with the highest fitness. This is because any neutral variant arising in less fit classes is quickly driven to extinction (FISHER 1930 Down, p. 122), or, putting it in another way, any neutral variant sampled from mutated classes is very recently derived from gametes belonging to the fittest class. Thus, for these cases we recover the classical background selection approximation E(k) {cong} 2Nf0µ (CHARLESWORTH et al. 1993 Down).



View larger version (19K):
In this window
In a new window
Download PPT slide
 
Figure 1. Simulations of the effect of Muller's ratchet on mean number of pairwise differences at a neutral locus k (relative to that under strict neutrality, k0) as a function of the selection coefficient against deleterious mutations (s). The mutation rate is 0.05 and N is 1000 (triangles), 3000 (circles), and 8000 (squares). We start observing clicks of the ratchet when s < 0.03 for N = 1000, s < 0.02 for N = 3000, and s < 0.015 for N = 8000. The continuous line is the frequency of the least-loaded class at the mutation-selection deterministic equilibrium, f0. The error bars correspond to two standard errors.

For intermediate selection coefficients, Muller's ratchet starts to operate at a reasonable rate. Two phenomena start to occur: The size of the best class fluctuates around its deterministic equilibrium value and is driven to 0 with a time lag that varies stochastically (HAIGH 1978 Down), and fixations of deleterious alleles in the population start to occur (HIGGS and WOODCOCK 1995 Down; CHARLESWORTH and CHARLESWORTH 1997 Down). Therefore, neither is the size of the least-loaded class constant nor is the frequency of every deleterious allele predicted by the mutation-selection deterministic equilibrium. For example, for s = 0.01 and U = 0.05, as in Fig 1, the ratchet clicks on average every 110 generations for N = 1000, every 170 generations for N = 3000, and every 331 generations for N = 8000. We find that, under conditions favorable to the operation of Muller's ratchet, variability is always higher than the value predicted by f0 (although always lower than the expectation under the strictly neutral model) and that the reduction in variability is dependent on N.

The conditions for the operation of the ratchet require that s is not very large and/or Nf0 is relatively small. This implies that the mean time that a gamete with a deleterious mutation persists in the population can be larger than the mean coalescent time within the least-loaded class (1/s >> Nf0), which means that more loaded classes can contribute significantly to variability. Hence, in these circumstances the relative genetic diversity (k/k0) is higher than the value predicted simply by f0, as seen in Fig 1. For a given N and U, there is a value of s, smin, that produces a minimum in diversity. If we plot the results of Fig 1 as a function of Nf0s we observe that the minimum occurs for Nf0s ~ 1. When Nf0s >> 1, increasing s increases diversity through the increase in f0; when Nf0s << 1, decreasing s increases diversity due to the contribution of classes other than the least-loaded one. With very weak selection, the reduction in variability becomes very small and is negligible in the limiting case s << 1/N, as deleterious alleles then become effectively neutral and do not interfere with the dynamics of the linked neutral locus at which variation is being measured (CROW and KIMURA 1970 Down, p. 322).

We try to approximate the reduction in genetic diversity as follows. Because , where T2 is the expected time to the most recent common ancestor of two randomly sampled gametes, we approximate T2 using the coalescent approach of HUDSON and KAPLAN 1994 Down by assuming that a population subject to recurrent deleterious mutations can be thought of as a subdivided population in which mutation plays the role of migration. Under conditions where an approximate mutation-selection balance can be attained, i.e., when n0 >> 1, (STEPHAN et al. 1993 Down; GESSLER 1995 Down), the sizes of the mutational classes are close to their deterministic expectation most of the time, and the assumption of mutation-selection balance to calculate the coalescent time should produce reasonable results. The expressions for the coalescent time are given in Appendix A, which are equivalent to Equation 12 of HUDSON and KAPLAN 1994 Down.

When Nf0 < 1, the distribution of mutations can deviate considerably from a Poisson with mean U/s. For these cases, GESSLER 1995 Down has suggested that a shifted Poisson distribution of mean {lambda}, where with , is a better approximation. For these cases we replace Equation A2 of Appendix A by

(1)

where Qi,i-1 is the probability that a gamete with i mutations derives from a gamete with i - 1 mutations. T2 can be calculated in the same way as before, but using Equation 1.

In Fig 2, we compare the results of these analytical approximations (leading to Equation A5 in Appendix A) with those from the exact Monte Carlo forward simulations. The deleterious mutation rate is 0.05, and two values of s are considered: s = 0.005 and s = 0.015. Fig 2 shows that variability is more reduced for larger values of N and slowly approaches the value f0 as N -> {infty} (absence of the ratchet). The analytical expressions provide reasonably good approximations to the simulation results. Note that, for the case s = 0.005 in the range of values of N considered, the deterministic value of n0 is <1, so that Equation 1 was used to calculate the mean coalescent time. Simulations of the structured coalescent were run and compared with both the results of the exact Monte Carlo forward simulations and Equation A5. As expected, no difference is observed between the mean number of pairwise differences predicted by Equation A5 and the one obtained in the coalescent simulations, since they are based on the same assumptions (results not shown).



View larger version (16K):
In this window
In a new window
Download PPT slide
 
Figure 2. Relation between the mean number of pairwise differences, relative to that under strict neutrality, and population size (N). The deleterious mutation rate is 0.05. The simulation results for s = 0.005 (squares) and 0.015 (circles) are shown. The dashed and solid lines are the corresponding theoretical values calculated using Equation A5. For sufficiently large values of N, k/k0 would be ~5 x 10-5 for s = 0.005 and ~0.04 for s = 0.015. The error bars correspond to two standard errors.

In Fig 3, we consider the effects of different values of U with a constant population size of 2000 individuals. As expected, the larger the value of U, the bigger the reduction in variability, for any given value of s. The reduction in expected variability predicted by the coalescent approach is a reasonably good approximation to the means obtained in the forward simulations, even for cases where n0 < 1. However, for the cases where n0 << 1, with the smaller values of s and large values of U in Fig 3, coalescent predictions (and, therefore, coalescent simulations) underestimate the mean pairwise differences in the forward simulations. For example, in Fig 3 with U = 0.1 and s = 0.003, the reduction in the mean pairwise differences observed in forward simulations is 0.217 (with 95% C.I. 0.027) while the prediction from Equation A5 is 0.164. A similar behavior is detected upon close examination of Fig 2, although the difference there is much smaller.



View larger version (18K):
In this window
In a new window
Download PPT slide
 
Figure 3. The dependence of the relative reduction in mean number of pairwise differences, k/k0, on the deleterious mutation rate and selection coefficient, with N = 2000. Squares are the simulation results for U = 0.01, and the solid line is the analytical prediction; circles and the dashed-dot line are the simulation and analytical results for U = 0.05; diamonds and the dashed line are the results for U = 0.1. Error bars represent two standard errors. Clicks of the ratchet were observed in the simulations when s < 0.01 for U = 0.01, s < 0.025 for U = 0.05, and s < 0.04 for U = 0.1.

There are at least two reasons to expect a discrepancy between the coalescence approximations and the forward simulations in these cases. The first is that, when n0 << 1, the time between clicks of the ratchet is so small that it is very difficult to maintain the stability assumed in the approximations over reasonable periods of time. The second is that, due to this fact, the frequency of the least-loaded class experiences large fluctuations and spends a considerable amount of time above the expected value assumed in the coalescent approximations. This implies that the level of genetic diversity is likely to be underestimated by the coalescent approach. We observe such underestimation whenever selection is very weak and the mutation rate is very high, so that the ratchet clicks >100 times over N generations.

From the results presented here, we conclude that Muller's ratchet can considerably reduce genetic diversity at a neutral locus. The extent to which this variation is reduced depends strongly on s (Fig 1), N (Fig 2), and U (Fig 3). For large values of U, the reduction is essentially unaffected by changes in s over a wide range of intermediate selection coefficients (Fig 3), which is important since the exact value of s is poorly known.

Muller's ratchet and the frequency spectrum:
We now consider the effect of Muller's ratchet on the frequency spectrum of mutations at the neutral locus. As explained above, we examined Tajima's D, which is widely used for this purpose (FU 1997 Down). In Table 1, we show, for different values of s, the time between clicks of the ratchet and average values of D obtained from forward and coalescent simulations, assuming a fixed value of {theta}. Coalescent simulations were run to compare with the results obtained from forward simulations. As can be seen in Table 1, they agree quite well with each other. The power to detect deviations from neutrality in samples of size 25 is also shown in Table 1. Table 2 shows the results of coalescent simulations where a fixed number of segregating mutations, S, was distributed over the trees (see SIMULATION METHODS).


 
View this table:
In this window
In a new window

 
Table 1. Mean Tajima's D and power with fixed {theta}


 
View this table:
In this window
In a new window

 
Table 2. Mean Tajima's D and power with fixed S

We find that the operation of Muller's ratchet produces negative values of Tajima's D in samples of realistic size. The mean value of D for different values of N and intermediate values of s is ~ -1. For the {theta} and S values considered, with intermediate values of Ns there is considerable power to detect deviations from neutrality in samples of size 25. For sample sizes of 10, however, we generally found no power to reject neutrality (results not shown). For a given N, the maximum negative average value of Tajima's D is observed for intermediate values of s. We observe that, as the time between turns of the ratchet becomes very large, by increasing s and N (or decreasing U), the average value of Tajima's D becomes less negative and the frequency spectrum becomes closer to that expected under neutrality (D -> 0), as expected from previous results on background selection (CHARLESWORTH et al. 1995 Down). We also computed values of Fu and Li's D* (not shown) and found that this statistic is less powerful than Tajima's D to detect deviations from neutrality, for the sample size considered and this range of parameter values.

Muller's ratchet with two types of deleterious mutations:
Assume now that there are two major types of deleterious mutations: one class of mutations causing very strongly deleterious effects (sb) and another class with weak deleterious effects (ss), occurring at rates Ub and Us, respectively. Although this mutational model is probably too simplistic biologically, it has been suggested that it provides a reasonably good fit to data from experiments on the fitness effects of induced mutations, at least in Caenorhabditis elegans (DAVIES et al. 1999 Down). In addition, it allows us to explore the combined operation of two processes: Muller's ratchet and background selection (see below; CHARLESWORTH 1996B Down; GORDO and CHARLESWORTH 2001 Down). The deterministic equilibrium frequency of the class with i mutations of effect ss and j mutations of effect sb, after selection, is the product of the relevant Poisson distributions (JOHNSON 1999 Down). In particular the size of the least-loaded class, after selection, is

We can easily extend the coalescent approach used above to this two-type mutation model. The expression for the mean number of pairwise differences relative to the neutral case is given in Appendix B.

If n00 > 1, the population will be close to the deterministic equilibrium most of the time and the sizes of the classes can be well approximated by Equation B3. When ss is small and/or Us is large, such that n00 < 1, we approximate the distribution of the classes with respect to these mutations by a shifted Poisson with parameter {lambda}s (see Appendix B; GESSLER 1995 Down).

In Table 3, we show the mean number of pairwise differences relative to the neutral case, in populations of size 3000 and 6000 subject to both types of deleterious mutations. We also show the case when the deleterious mutations with selection coefficient sb are absent, for comparison, and the results from Equation B3, which are referred to as "theoretical." The distortion of the neutral frequency spectrum, as measured by the mean Tajima's D, is given for every set of parameters.


 
View this table:
In this window
In a new window

 
Table 3. The reduction in the mean number of pairwise differences (k/k0) due to Muller's ratchet with two classes of mutations

There are several distinct cases that can occur in a two-type mutational model. The first is the accumulation of mutations of effect ss in the presence of much more strongly deleterious mutations, for which there is no ratchet—i.e., the combined operation of Muller's ratchet and background selection (GORDO and CHARLESWORTH 2001 Down). The large effect mutations are expected to reduce variability by a fraction f0b, and the additional presence of the other mutations, which are accumulating due to Muller's ratchet, is expected to reduce variability even more. For the cases where this occurs (a, b, g, h, i, and k in Table 3), we see that Equation B3 gives good predictions of the relative diversity observed in the simulations. Strongly deleterious mutations reduce diversity at neutral sites by a fraction f0b, but they also reduce the effective population size experienced by the small effect mutations by approximately the same amount (GORDO and CHARLESWORTH 2001 Down). The small effect mutations will then cause a reduction in genetic diversity according to this new effective size (Nf0b). It follows that, in this case, the resulting reduction in the mean number of pairwise differences caused by both types of mutations is given by

with k/k0 calculated with Equation A5.

The average values of Tajima's D are ~ -0.9 and there is some power to reject neutrality in samples of reasonable size (25 chromosomes and {theta} = 6, in the cases in Table 3).

The second case occurs when both types of mutations are accumulating due to the ratchet. In Table 3, we show some examples of this (c, d, e, f, l, m, o, and p). We see that Equation B3 predicts the expected mean number of pairwise differences relative to that under strict neutrality reasonably well. Average values of Tajima's D are between -0.8 and -1, for the {theta} value considered, and there is some power to detect a distortion in the frequency spectrum, for a sample size of 25.

The third case occurs if the effects of both types of mutations are very large and/or the mutation rates are very small, such that none will accumulate. This corresponds to the classical background selection model, with no recombination and two mutational classes. In Table 3, we see that, when we did not observe any clicks of the ratchet (cases j and n) and when n00 >> 1/ss and n00 >> 1/sb, the reduction in genetic diversity is well approximated by f00 (as expected from the expressions in Appendix B). Note that this is the result expected from a one-class deleterious mutational model in which the relevant selection coefficient is the harmonic mean of the selection coefficients in the two-class mutational model (CHARLESWORTH 1996A Down).

The fourth case occurs when the presence of strongly deleterious mutations reduces the effective population size by such a large amount that the smaller mutations become effectively neutral, i.e., Nf0b ss < 1 (CHARLESWORTH 1996B Down). Under these conditions, genetic drift is the major force determining the dynamics of the small effect mutations and driving them to fixation. Some examples of this case are considered in Table 4, with two different mutational models for the small effect mutations: one considering irreversible mutation and another, more realistic model, allowing for back mutation (MCVEAN and CHARLESWORTH 2000 Down). In these cases, the reduction in the mean number of pairwise differences is very close to the one caused by the strong mutations, since the weak ones are effectively neutral (KIMURA 1983 Down) and do not have any significant effect on the neutral locus at which variation is being measured. Therefore, k/k0 can essentially be approximated by f0b. In this case, the average Tajima's D is much less negative than in some of the previous cases, and it is very difficult to detect distortions in the frequency spectrum (cf. CHARLESWORTH et al. 1995 Down), especially when allowing for back mutation.


 
View this table:
In this window
In a new window

 
Table 4. The effects of background selection on weakly selected mutations with and without back mutation

As in the previous model, in this two-type mutational model we also find that, when Muller's ratchet starts to operate, the level of k/k0 is roughly the same over intermediate values of the selection coefficient, for a fixed population size and mutation rate.


*  DISCUSSION
*TOP
*ABSTRACT
*SIMULATION METHODS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*LITERATURE CITED

Muller's ratchet and neutral variation:
Genetic diversity at a neutral locus results from the balance between the rate at which variation is generated (mutation pressure) and the rate with which it is lost (genetic drift). In a population that is permanently at equilibrium under recurrent mutation to deleterious alleles, in the absence of recombination neutral genetic diversity is expected to be smaller than the strict neutral expectation (CHARLESWORTH et al. 1993 Down). This results from the fact that a large fraction of individuals in such a population are destined to be eliminated relatively quickly, so that its effective size is reduced to the class of individuals that do not carry deleterious mutations, Nf0 (CHARLESWORTH et al. 1993 Down).

In this article, we have quantified the expected genetic diversity when a population is not permanently at equilibrium, but is losing its least-loaded class at a given rate. We have shown that the operation of Muller's ratchet is consistent with a considerable reduction in genetic diversity. The extent to which such variation is reduced is a function, not only of the relevant mutation and selection parameters, but also of population size. In particular, in a population where Muller's ratchet does not operate, or does so at an exceedingly slow rate, which is expected when Nf0s >> 10 (GORDO and CHARLESWORTH 2000A Down, GORDO and CHARLESWORTH 2000B Down), the effective size is well approximated by Nf0. But when the ratchet starts to operate, the effective size is higher than Nf0. Although it has been suggested that the operation of Muller's ratchet causes a different value of genetic diversity from that given by the simple 2Nf0µ approximation (CHARLESWORTH et al. 1993 Down), this study is the first attempt to formally demonstrate that it does so and to estimate by how much.

We have shown that the mean coalescent time of two randomly sampled alleles derived from a structured coalescent model with fixed class size (HUDSON and KAPLAN 1994 Down) is a good predictor of expected genetic diversity when the ratchet is operating. Just as is observed in the full Monte Carlo simulations, the analytical approximation predicts a minimum genetic diversity for an intermediate value of the selection coefficient. Our results are closely related to those of HIGGS and WOODCOCK 1995 Down, who studied the effect of deleterious mutations on genealogies in very small populations and showed that the probability of common parentage was maximal for some intermediate value of the selection coefficient against deleterious alleles. The results from Figure 8 of HIGGS and WOODCOCK 1995 Down can be obtained by our approximation, provided that we correct them for sampling after selection.

Our results are also related to those of TACHIDA 2000 Down, who found that diversity at neutral sites was minimal for an intermediate strength of selection, although his model is different from the one we consider here. In Tachida's independent multicodon (IMC) model, a gene is composed of a set of completely linked sites. One-third of the sites are neutral and two-thirds are selected, with selection coefficients drawn from a normal distribution. In our model, the selection coefficient is constant, but the qualitative effect on neutral diversity is the same. The simulation results regarding genetic diversity at neutral sites in Table 1 of TACHIDA 2000 Down can be obtained by our approximation, if we substitute s in our approximation by the value corresponding to the mean strength of selection ({alpha}) considered in his Table 1 ({alpha} {cong} 2Ns). With our formula, we obtain good estimates of the average genetic diversity at neutral sites observed in his simulations, except when {alpha} < 5. As an example, with per site (implying for the whole nonrecombining region) and {alpha} = 5 (implying s = 0.005), the value of neutral variability observed in Tachida's simulations is 0.00857 and the value predicted by our approximation is 0.00836.

In contrast to the classical background selection model with strong selection (HUDSON and KAPLAN 1994 Down; CHARLESWORTH et al. 1995 Down), if Muller's ratchet is operating under weak selection, a considerable distortion of the frequency spectrum at the neutral locus, toward an excess of rare variants, is expected in samples of realistic size (as seen in Table 1). But such an effect may be difficult to detect when the ratchet causes a very large reduction in variation, as may be the case in large populations (see below). This signature of the ratchet is quite close to that of selective sweeps, but not as extreme (see below). For the parameter values tested we found more power for Tajima's D test than for Fu and Li's D* test to reject neutrality under the operation of the ratchet than in its absence.

Because a model that considers that all deleterious mutations have the same effect on fitness is a simplification, we also studied the pattern of neutral variation under a two-type deleterious mutational model (GORDO and CHARLESWORTH 2000B Down). We considered several distinct cases. In the case where none of the deleterious mutations accumulate, the classical background selection scenario, we recover the expected prediction: The reduction in genetic diversity is well approximated by considering the harmonic mean of the selection coefficients of the two-type model. Tajima's D is negative on average, but distortions of the frequency spectrum are hard to detect (see Table 3). In the case where one of the mutational types is sufficiently strongly selected against that it does not accumulate, but the other type of deleterious mutations does accumulate, we essentially observe the effects of the ratchet in a population of reduced size Nf0. In the case where both mutational types accumulate, we again observe negative values of Tajima's D and obtain reasonably good predictions of the genetic diversity by the extended coalescent approach.

It would be of interest to study a potentially more realistic case that considers a continuous distribution of selection coefficients, but we have not pursued this here. In qualitative terms, one would expect that a continuous distribution with a high frequency of weakly deleterious mutations, such as the case of an exponential distribution, would produce similar results to those in the two-type model considered here: i.e., negative Tajima's D values and a reduction in genetic diversity. But the quantitative effects will depend on the shape of the distribution of selection coefficients and the deleterious mutation rate, which are presently a matter for debate (DAVIES et al. 1999 Down; KEIGHTLEY and EYRE-WALKER 1999 Down).

Muller's ratchet and the Y chromosome:
It has been suggested that Muller's ratchet has been a potentially major process in shaping the evolution of Y chromosomes (CHARLESWORTH 1978 Down). It is therefore interesting to try to quantify the levels of neutral variation expected under its operation. It is of special interest to ask about the diversity levels expected under the ratchet in systems with relatively young Y chromosomes (RICE 1996 Down; CHARLESWORTH and CHARLESWORTH 2000 Down). Some examples of these systems are the Y chromosomes of the plant species Silene latifolia and S. dioica (FILATOV et al. 2000 Down) and the neo-Y chromosomes (resulting from fusions between an autosome and the old Y chromosome) of some Drosophila species such as D. miranda (CHARLESWORTH and CHARLESWORTH 2000 Down).

In Fig 4 and Fig 5 we show some expectations for the signatures of Muller's ratchet. Fig 4A shows the expected number of fixations of deleterious alleles over a period of 500,000 generations (in the case of D. miranda, this corresponds to ~0.1 million years). Fig 4B shows the reduction in the mean number of pairwise differences. Fig 5A and Fig B, shows the average values of Tajima's D caused by the ratchet. The number of fixations is estimated from the number of clicks of Muller's ratchet over the time period, since there is a one-to-one correspondence between clicks and fixation events (CHARLESWORTH and CHARLESWORTH 1997 Down). The parameter values N and U were assigned in the light of the data presently available (DRAKE et al. 1998 Down; KEIGHTLEY and EYRE-WALKER 1999 Down; FILATOV et al. 2000 Down; YI 2000 Down). As previously discussed (GORDO and CHARLESWORTH 2000B Down) substantial declines in fitness of the nonrecombining chromosome can be produced, especially for intermediate s values.



View larger version (19K):
In this window
In a new window
Download PPT slide
 
Figure 4. The signatures of Muller's ratchet in large hypothetical populations of nonrecombining Y or neo-Y chromosomes. The parameters are N = 125,000 for dashed lines and N = 500,000 for solid lines, with U = 0.01 for circles and U = 0.03 for squares. (A) The expected number of fixations over a period of 500,000 generations. These are based on the expressions for the time between clicks of Muller's ratchet in GORDO and CHARLESWORTH 2000A Down, GORDO and CHARLESWORTH 2000B Down for the cases when Nf0 > 1 and the results of GESSLER 1995 Down for the cases when Nf0 < 1. (B) Expected mean number of pairwise differences relative to that in the absence of deleterious mutations, calculated using the analytical prediction.



View larger version (19K):
In this window
In a new window
Download PPT slide
 
Figure 5. Average value of Tajima's D for a sample size of 12 for {theta} = 5 (A) and sample size 12 for {theta} = 100 (B). Lines and symbols are as in Fig 4.

Associated with the fixation of deleterious mutations by the ratchet, a reduction in genetic diversity of ~10- to 100-fold is expected (as calculated by Equation B3 and the simulations of the structured coalescent). This is expected across a wide range of values of selection coefficients for which the ratchet can operate, since the mutation rate is the major determinant of the level of variation expected (Fig 4B). In samples of moderate size (n = 12 in Fig 5), average Tajima's D values of ~ -1 are expected when the selection coefficient is intermediate. For larger samples, the average values of Tajima's D become more negative. As an example, with a sample size of 40, with N = 125,000, U = 0.01, and s = 0.1–0.2%, we obtained an average value of Tajima's D of -1.7 for {theta} = 50 and -1.9 for {theta} = 100. The power to reject neutrality for these two examples was >80% (assuming a fixed {theta}). A large amount of sequence information and large samples are, however, needed to detect this effect. For example, in Drosophila, where normal levels of variability are ~1–3% per nucleotide site (MORIYAMA and POWELL 1996 Down), the above example implies sequencing ~5000–10,000 neutral sites. One can ask if increasing sample size (n) will produce higher power than increasing {theta} by increasing the number of sites sequenced. From simulations of the structured coalescent with n{theta} held constant, we found that increasing n seems to give more power than increasing {theta}.

The additional presence of much more strongly deleterious mutations, causing background selection, will result both in an increase in the number of fixations (GORDO and CHARLESWORTH 2001 Down) and in a bigger reduction in genetic diversity, as expected from the results presented before. There is evidence for considerably reduced levels of variability in some Y chromosome systems (CHARLESWORTH and CHARLESWORTH 2000 Down). For the Y chromosome of the dioecious plant S. latifolia and the neo-Y chromosome of D. miranda, nucleotide variability is 20- to 30-fold lower than for the X chromosome (FILATOV et al. 2000 Down; BACHTROG and CHARLESWORTH 2002 Down). If the simple process we have studied was the sole cause of the observed reduction, the results in Fig 4B imply that the deleterious mutation rate for such nonrecombining chromosomes is unlikely to be >0.01.

Selective sweeps vs. the ratchet:
A large reduction in variability could, of course, be caused by another process, such as a recent selective sweep. When an advantageous mutation arises and goes to fixation in a nonrecombining population, it wipes out linked neutral variation—the hitchhiking effect (MAYNARD SMITH and HAIGH 1974 Down). After such a sweep, variation is slowly restored by mutation, with most of the new neutral variants being at low frequency. Selective sweeps therefore cause distortions of the neutral frequency spectrum (BRAVERMAN et al. 1995 Down; SIMONSEN et al. 1995 Down), just as with repetitive clicks of the ratchet. In Table 5, we compare the pattern of variability under the ratchet and under a recent sweep. We assume knowledge of the neutral equilibrium value of {theta}, in the absence of any of these processes, and study conditions under which genetic diversity is reduced by ~20- to 30-fold. As is clear from Table 5, a recent sweep generally produces more negative average Tajima's D than the ratchet, for a given reduction in diversity. In small samples, a sweep is more likely to be detected than the operation of the ratchet, but it is clear that there is a wide range of parameter space in which no unambiguous conclusion can be drawn. Other statistics, such as patterns of linkage disequilibrium, could also be helpful in trying to distinguish between these and other models (CHARLESWORTH and CHARLESWORTH 2000 Down). Further work is needed to assess the power of such statistics to distinguish between background selection, Muller's ratchet, and selective sweeps.


 
View this table:
In this window
In a new window

 
Table 5. Comparison of the ratchet with the hitchhiking model


*  ACKNOWLEDGMENTS

We thank Peter Andolfatto, Carolina Bartolome, Xulio Maside, and Gil McVean for helpful discussions throughout the course of this study. I. Gordo specially thanks Frantz Depaulis for numerous comments that greatly improved the manuscript. I. Gordo is supported by the Gulbenkian Foundation and program PRAXIS XXI of Portugal, A. Navarro by the BBSRC/EPSRC, and B. Charlesworth by the Royal Society.

Manuscript received August 27, 2001; Accepted for publication February 22, 2002.


*  APPENDIX A
*TOP
*ABSTRACT
*SIMULATION METHODS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*LITERATURE CITED

Assume that there are m mutational classes in the population, so that a sample may contain gametes with 0, 1, 2, up to m mutations. The probability that a gamete with i mutations derives from a gamete with i - 1 mutations is

which is a special case of Equation 3 in CHARLESWORTH et al. 1995, and f*i is the frequency of the i class, after selection. (We assume here that the mutation rate is sufficiently low that we can neglect mutations from classes other than the adjacent one.)

If the distribution of the frequencies of the classes is close to the deterministic expectation most of the time, then

(A1)

so that

(A2)

Suppose that we sample randomly two individuals and that these belong to classes i and j. If i and j != 0, there are two possible mutational events in the previous generation: Either gamete i came from the i - 1 class (with probability Qi,i-1), or gamete j came from the j - 1 class (with probability Qj,j-1); if i = j they can also coalesce, with probability 1/Nf*i, since the size of class i, after selection (which is when we are sampling) is Nf*i. Let Ti,j be the mean time (in generations) back to the common ancestor of a sample of two gametes with i and j (i, j >= 0) mutations. We then have

where otherwise. Rearranging, we have

(A3)

which is equivalent to Equation 12 of HUDSON and KAPLAN 1994 Down for a sample size of two, with the difference that we are counting individuals as postselection adults. The mean time for the most recent common ancestor of two randomly sampled sequences is then

(A4)

and the resulting mean number of pairwise differences relative to the neutral expectation will be

(A5)


*  APPENDIX B
*TOP
*ABSTRACT
*SIMULATION METHODS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*LITERATURE CITED

Suppose that we take a random sample of two individuals from a population subject to recurrent mutations with two types of effect, ss and sb, occurring at two different rates, Us and Ub, respectively. Suppose one individual carries i mutations of type ss and k mutations of type sb and the other carries j mutations of type ss and l mutations of type sb. Let Ti,k,j,l be the time to the most recent common ancestor of these individuals. If the population is close to the deterministic equilibrium this time will be given by

where

(B1)

which is the extension of the previous approximation for mutations of equal effects. When n00 < 1, because Nf0s < 1, we use, as previously, the shifted Poisson distribution with parameter , where , so that

(B2)

Using these approximations, the mean time to the most recent common ancestor of two random gametes is

(B3a)

and

(B3b)

with


*  LITERATURE CITED
*TOP
*ABSTRACT
*SIMULATION METHODS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*LITERATURE CITED

BACHTROG, D. and B. CHARLESWORTH, 2002  Reduced adaptation of a non-recombining neo-Y chromosome. Nature 416:323-326[Medline].

BERGSTROM, C. T. and J. PRITCHARD, 1998  Germline bottlenecks and the evolutionary maintenance of mitochondrial genomes. Genetics 149:2135-2146[Abstract/Free Full Text].

BRAVERMAN, J. M., R. R. HUDSON, N. L. KAPLAN, C. H. LANGLEY, and W. STEPHAN, 1995  The hitchhiking effect on the site frequency spectrum of DNA polymorphisms. Genetics 140:783-796[Abstract].

CHARLESWORTH, B., 1978  Model for evolution of Y chromosomes and dosage compensation. Proc. Natl. Acad. Sci. USA 75:5618-5622[Abstract/Free Full Text].

CHARLESWORTH, B., 1996a  Background selection and patterns of genetic diversity in Drosophila melanogaster.. Genet. Res. 68:131-149[Medline].

CHARLESWORTH, B., 1996b  The evolution of chromosomal sex determination and dosage compensation. Curr. Biol. 6:149-162[Medline].

CHARLESWORTH, B. and D. CHARLESWORTH, 1997  Rapid fixation of deleterious alleles can be caused by Muller's ratchet. Genet. Res. 70:63-73[Medline].

CHARLESWORTH, B. and D. CHARLESWORTH, 2000  The degeneration of Y chromosomes. Philos. Trans. R. Soc. Lond. B Biol. Sci. 355:1563-1572[Medline].

CHARLESWORTH, B., M. T. MORGAN, and D. CHARLESWORTH, 1993  The effect of deleterious mutations on neutral molecular variation. Genetics 134:1289-1303[Abstract].

CHARLESWORTH, D., B. CHARLESWORTH, and M. T. MORGAN, 1995  The pattern of neutral molecular variation under the background selection model. Genetics 141:1619-1632[Abstract].

CROW, J. F., and M. KIMURA, 1970 An Introduction to Population Genetics Theory. Burgess Publishing/Alpha Editions, Minneapolis.

DAVIES, E. K., A. D. PETERS, and P. D. KEIGHTLEY, 1999  High frequency of cryptic deleterious mutations in Caenorhabditis elegans.. Science 285:1748-1751[Abstract/Free Full Text].

DRAKE, J. W., B. CHARLESWORTH, D. CHARLESWORTH, and J. F. CROW, 1998  Rates of spontaneous mutation. Genetics 148:1667-1686[Abstract/Free Full Text].

EWENS, W. J., 1979 Mathematical Population Genetics. Springer-Verlag, Berlin.

FELSENSTEIN, J., 1974  The evolutionary advantage of recombination. Genetics 78:737-756[Abstract/Free Full Text].

FILATOV, D. A., F. MONEGER, I. NEGRUTIU, and D. CHARLESWORTH, 2000  Low variability in a Y-linked plant gene and its implications for Y-chromosome evolution. Nature 404:388-390[Medline].

FISHER, R. A., 1930 The Genetical Theory of Natural Selection. Oxford University Press, Oxford.

FU, Y., 1997  Statistical tests of neutrality of mutations against population growth, hitchhiking and background selection. Genetics 147:915-925[Abstract].

FU, Y. and W. LI, 1993  Statistical tests of neutrality of mutations. Genetics 133:693-709[Abstract].

GESSLER, D. D., 1995  The constraints of finite size in an asexual population and the rate of the ratchet. Genet. Res. 66:241-253.

GESSLER, D. D. and S. XU, 1999  On the evolution of recombination and meiosis. Genet. Res. 73:119-131[Medline].

GORDO, I. and B. CHARLESWORTH, 2000a  The degeneration of asexual haploid populations and the speed of Muller's ratchet. Genetics 154:1379-1387[Abstract/Free Full Text].

GORDO, I. and B. CHARLESWORTH, 2000b  On the speed of Muller's ratchet. Genetics 156:2137-2140[Free Full Text].

GORDO, I. and B. CHARLESWORTH, 2001  The speed of Muller's ratchet, with background selection and the degeneration of Y chromosomes. Genet. Res. 78:149-162[Medline].

HAIGH, J., 1978  The accumulation of deleterious genes in a population—Muller's Ratchet. Theor. Popul. Biol. 14:251-267[Medline].