Abstract
We calculate the fixation probability of a beneficial allele that arises as the result of a unique mutation in an asexual population that is subject to recurrent deleterious mutation at rate U. Our analysis is an extension of previous works, which make a biologically restrictive assumption that selection against deleterious alleles is stronger than that on the beneficial allele of interest. We show that when selection against deleterious alleles is weak, beneficial alleles that confer a selective advantage that is small relative to U have greatly reduced probabilities of fixation. We discuss the consequences of this effect for the distribution of effects of alleles fixed during adaptation. We show that a selective sweep will increase the fixation probabilities of other beneficial mutations arising during some short interval afterward. We use the calculated fixation probabilities to estimate the expected rate of fitness improvement in an asexual population when beneficial alleles arise continually at some low rate proportional to U. We estimate the rate of mutation that is optimal in the sense that it maximizes this rate of fitness improvement. Again, this analysis relaxes the assumption made previously that selection against deleterious alleles is stronger than on beneficial alleles.
IT is often useful to view adaptive evolution in an asexual population (for example, on a nonrecombining chromosome) as two separate processes. The first process is the origin of new beneficial alleles by mutation, and the second process is the fixation of some of those alleles by natural selection. This article is concerned with the second process and specifically with calculating the probability of fixation of a beneficial allele, assuming that it starts at a low frequency. This problem was first studied by modeling the copy number of the beneficial allele as a branching process (Fisher 1922, 1930; Haldane 1927), assuming that the number of descendants from each copy of the beneficial allele is drawn from a Poisson distribution with mean W, where W ≡ 1 + sb is the absolute fitness of an individual carrying the beneficial allele. In a large population of fixed size N, the probability of ultimate fixation of a single copy of a beneficial allele is Pfix = p[sb] (for 1/N ⪡ sb), where p[·] is the unique function satisfying
The geographical invariance principle states that the fixation probability of a beneficial allele is unaffected by spatial structuring of the population when there is no variation in W between demes (Maruyama 1970, 1971; Nagylaki 1982). However, almost any other departure from an idealized Wright-Fisher population at linkage equilibrium violates the assumptions made by Fisher (1922) and Haldane (1927). Specifically, their result does not hold when there is variation in W, and this is likely under a range of biologically realistic conditions. Examples include variation in space caused by local extinction and recolonization (Barton 1993; Barton and Whitlock 1997), variation in time caused by fluctuating population size (Ewens 1967; Otto and Whitlock 1997), variation across genetic backgrounds caused by selection at linked loci (Fisher 1930; Hill and Robertson 1966; Robertson 1970; Manning and Thompson 1984; Barton 1994, 1995; Charlesworth 1994; Peck 1994; Caballero and Santiago 1995; Gerrish and Lenski 1998; Stephanet al. 1999; Orr 2000b; Gerrish 2001; Rice and Chippindale 2001), and variation in the intensity of selection on the beneficial allele itself (for example, Pollak 1966b; Kimura and Ohta 1970; Barton 1987).
An accurate formula for fixation probabilities, based on a biologically appropriate model, is desirable for at least two reasons. First, it is one of the building blocks of more complex evolutionary models, in which the behavior of rare beneficial alleles is not explicitly modeled. Instead, the convenient assumption is made that a fraction Pfix of beneficial alleles reach frequencies large enough to actually be considered in the model, and the remaining fraction (1 - Pfix) are lost while still rare and can be ignored altogether (for example, Kimura 1983; Berg 1995; Hartl and Taubes 1996, 1998; Gerrish and Lenski 1998; Orr and Kim 1998; Orr 1998, 2000a,b; Poon and Otto 2000; Gerrish 2001). In some models this assumption is in fact a limiting property of the model, obtained when selection is strong and mutation is weak relative to drift (Gillespie 1983a,b). Second, laboratory evolution experiments with Escherichia coli (Gerrish and Lenski 1998; Imhof and Schlötterer 2001) and vesicular stomatitis virus (Miralleset al. 1999) have allowed inferences to be made about the number of beneficial mutations that have arisen over the course of the experiment, on the basis of an observation of the number of selective sweeps that have occurred. Clearly, making accurate estimates requires an accurate formula for fixation probability (as pointed out by Orr 2000b).
This article is primarily concerned with how the fixation probability of a beneficial allele at one locus is influenced by segregating deleterious alleles at other loci, in a completely asexual population or along a completely nonrecombining chromosome. In the absence of recombination, beneficial mutations that arise in a given genetic background are effectively “trapped” in it, unable to recombine into other backgrounds (Fisher 1930, p. 122). Variation in fitness between the different backgrounds, caused by segregating deleterious alleles, can reduce or outweigh the advantage conferred by a beneficial allele, resulting in a substantially reduced fixation probability. The magnitude of this reduction has been the subject of several theoretical investigations (Manning and Thompson 1984; Charlesworth 1994; Peck 1994; Orr 2000b) and also of a recent empirical study (Rice and Chippindale 2001). This effect has also been studied theoretically for sexual populations, but with the emphasis on either a small number of tightly linked loci (Barton 1995; Stephanet al. 1999) or a large number of loosely linked loci (Peck 1994; Barton 1995).
Previous analytical work on asexual models has either assumed a fixed selection coefficient against deleterious alleles (sd) with sb < sd (Manning and Thompson 1984; Charlesworth 1994; Peck 1994) or has allowed sd to vary across loci assuming that sb < min{sd} (Barton 1995; Orr 2000b). Under either of these assumptions, the frequency of the beneficial allele is expected to increase only if it is present in the most fit genetic background, as first pointed out by Fisher (1930, p. 122). If, prior to the origination of the beneficial allele, the frequency of the most fit background or genotype is f0, and if a beneficial allele that originates on a lower fitness background has negligible probability of subsequently moving onto the most fit background, then
In this article we assume deleterious alleles of fixed effect but relax the requirement that sb < sd. This case has been studied previously by Manning and Thompson (1984). However (as discussed by Peck 1994), their analysis made an erroneous assumption that Pfix = 0 if the beneficial allele was lost from the background in which it first arose, and also only a table of numerical results was presented. Peck (1994) conducted some simulation work for sb > sd. We present an algorithm for exact solution of the model studied by Manning and Thompson (1984) and Peck (1994). This solution is valid when sb ⪢ 1/N and exp[-U/sd] ⪢ 1/N. We also study the case where the beneficial mutation arises in a population that is not at equilibrium with respect to deleterious allele frequencies. A population will be perturbed from equilibrium by a selective sweep or a population bottleneck, and if selection against deleterious alleles is weak then the timescale of the approach back to mutation-selection equilibrium will be very slow (Johnson 1999). Indeed, in a rapidly evolving asexual population it is likely that frequencies of deleterious alleles will fluctuate continually and it is therefore of some interest to consider how fixation probabilities are altered in out-of-equilibrium populations. The more specific question addressed here is: What effect does the sweep to fixation of one beneficial allele have on the probability of subsequently arising beneficial alleles also becoming fixed?
We apply our results for fixation probabilities by estimating the expected rate of fitness improvement in an asexual population when beneficial alleles arise continually by mutation at rate kU per individual per generation, with k ⪡ 1. Our analysis is essentially an extension of the work of Orr (2000b), who used Equation 2 to estimate fixation probabilities and so derived results that are valid only when sb < sd. Following Orr (2000b), we estimate the rate of mutation that is optimal in the sense that it maximizes this rate of fitness improvement.
GENERAL METHODS FOR CALCULATING FIXATION PROBABILITIES
The branching process model as originally developed (Fisher 1922, 1930; Haldane 1927) assumes that W is the same for all copies of the beneficial allele and is constant over time. Building on work by Pollak (1966a, 1972), Barton (1995 and references therein) has applied the theory of multitype branching processes (see, e.g., Harris 1963) to study how fixation probabilities are influenced when W varies according to the “site” in which a given copy of the beneficial allele finds itself. This is a flexible approach in which sites can represent demes or microhabitats (Pollak 1966a, 1972; Barton 1987, 1993), genetic backgrounds (Barton 1994, 1995), age classes, or any other aspect of population structure. Relatively complex models can be analyzed in a straightforward way by considering the probability Qi,t that a single copy of the beneficial allele present in site i in generation t is ultimately lost from the population. An expression for Qi,t can be found by summing over all possible numbers of offspring and all possible movements of offspring between sites. If each copy of the allele present in site i at time t independently gives rise to n offspring, where n is drawn from a Poisson distribution with mean Wi,t, and the independent probability that each of these offspring will be in site j at time t + 1 is mi,j, and each offspring in site j has independent probability Qj,t+1 of being lost, then
The assumption that separate copies of the allele are independent means that the branching process is an appropriate model only when the number of copies of the beneficial allele is small relative to the total population size. When sb ⪢ 1/N it is reasonable to assume that deterministic forces will prevail when the branching process model breaks down in this way, and in this case an allele that arises in background i at time t and that is never lost is said to become established. This occurs with probability Pi,t = 1 - Qi,t. The probability of establishment for a beneficial mutation that arises in a random genetic background is denoted Pfix,t and is calculated as an average of the Pi,t, weighted by the probability of the beneficial mutation initially occurring in each background i. If an allele is established, it is not actually guaranteed fixation, but its frequency might approach a polymorphic equilibrium or it might become fixed only in some sites.
Because the branching process model assumes that the fate of an allele of interest is determined while it is rare, it cannot be used to calculate fixation probabilities for slightly deleterious mutations or for beneficial mutations that confer an advantage that is weak relative to the effects of genetic drift. For the same reason, branching process models cannot be used to determine the distribution of times taken until ultimate fixation of an allele. To address these types of questions Kimura (1957) used a diffusion approximation, which is valid when change is approximately continuous in time (large population sizes and weak selection). This approach has been used extensively to study the expected time to fixation (Kimura and Ohta 1969) and how fixation probabilities are influenced by population subdivision (Maruyama 1970, 1971; Nagylaki 1982; Barton 1987, 1993; Barton and Rouhani 1987), by “background selection” due to segregating deleterious alleles (Charlesworth 1994), or by changing population size (Otto and Whitlock 1997). Despite the advantages of the diffusion approximation, the branching process model remains a useful tool because it can often be analyzed much more easily.
MODEL
The model used is identical to the one studied by Manning and Thompson (1984) and Peck (1994). The notation used is summarized in Table 1. We consider the fate of a beneficial allele within a haploid population of fixed size N. Individuals without the beneficial allele are called “wild type.” We assume that N is sufficiently large for deterministic results to apply to the wild-type subpopulation and for the effect of Muller’s ratchet (loss of the most fit genotype by genetic drift; see Muller 1964; Haigh 1978; Gordo and Charlesworth 2000) within the wild-type subpopulation to be negligible over the timescale of interest. (A sufficient condition for this is exp[-U/sd] ⪢ 1/N.) We assume that deleterious mutations arise at a constant rate U per genome per generation, that there are an effectively infinite number of equivalent biallelic loci, and that at each locus the deleterious allele reduces fitness by a factor (1 - sd) with multiplicative effects across loci. Genetic backgrounds (sites) can therefore be enumerated by i, where i is the number of deleterious alleles and takes discrete values i = 0, 1, 2,.... Back mutations are ignored. We first consider a wild-type population at mutation-selection equilibrium. We then consider the more complex scenario of a wild-type population that is initially free of deleterious alleles and is approaching equilibrium. The correspondingly more complex calculations have been placed in appendices a and b.
We assume that a single beneficial allele arises in a randomly chosen wild-type individual, and its presence increases relative fitness by a factor (1 + sb) regardless of the genetic background on which it is expressed; that is, we assume no epistasis for fitness. Except for its small size, the subpopulation carrying the beneficial allele is identical to the large wild-type (sub)population. That is, deleterious mutations arise at the same rate U and have the same effect on fitness (1 - sd). Because the number of copies of the beneficial allele is initially small, we consider the progress of Muller’s ratchet within this subpopulation. We calculate the fixation probability for the beneficial allele, Pfix, by considering its copy number in different genetic backgrounds as a multitype branching process. We assume a Poisson distribution of offspring number.
Frequently used notations
To estimate the long-term average rate of adaptation, measured as the rate of fitness improvement, we embed our model for fixation probabilities within a more complex model. This is a generalization of the model of Orr (2000b). We make the standard assumption that the beneficial mutation origination process is Poisson and occurs at rate kU per individual per generation, where k ⪡ 1 is the ratio of beneficial to deleterious mutations. Because the population size is N the total number of new beneficial alleles arising per generation is NkU. We then make the questionable assumption that each such mutation is fixed with equal and independent probability Pfix, so that the mutation fixation process is also Poisson and occurs at rate NkUPfix. This assumption requires that the perturbations due to beneficial substitutions are small and transient enough that the fixation probabilities of most beneficial mutations are well approximated by the equilibrium population results. In making this assumption we also ignore interference between multiple beneficial alleles (the Hill-Robertson effect; Fisher 1930; Muller 1932; Hill and Robertson 1966; Felsenstein 1974; Barton 1995; Gerrish and Lenski 1998; Gerrish 2001). We assume that successively fixed beneficial alleles have multiplicative effects on fitness.
ANALYSIS
Fixation probabilities: Fitness relative to the fittest wild-type individual is denoted w. Hence, when the beneficial allele is present in the fittest possible individual it has relative fitness w = w0 = (1 + sb). A beneficial allele present in genetic background i has relative fitness
In Equations 3 and 4, Qi,t is the probability of loss of a single copy of the beneficial allele present after selection followed by movement between sites. Since here “movement between sites” represents deleterious mutation, the probability of the beneficial allele arising in site i is calculated using the frequencies of the different sites after deleterious mutation. Because the number of deleterious alleles, i, carried by a randomly chosen wild-type individual is Poisson distributed with mean λ= U/sd (Haigh 1978), we have that
It is more convenient to rewrite Equation 4 in terms of the probability P* i that an allele in background i is never lost (where P* i = 1 - Q* i). The probability that a given copy of the beneficial allele in site i is moved to site i + j by deleterious mutation is simply
As mentioned briefly earlier in this article and discussed at more length by Peck (1994), there may be backgrounds in which the beneficial allele confers a net advantage in the short term but that will ultimately give rise to a lineage with a lower mean fitness than the wild-type population; such beneficial alleles are assumed not to ultimately fix. Consider a beneficial allele subpopulation that has become large and that includes individuals with i, i + 1, i + 2,... deleterious alleles. The most fit individual in this subpopulation has fitness max{wB} = (1 + sb)(1 - sd)i and so the subpopulation will ultimately approach mutation-selection balance with w¯B = (1 + sb)(1 - sd)i e-U. The wild-type subpopulation ultimately approaches w¯W = e-U and if w¯B > w¯W, equivalently (1 + sb)(1 - sd)i > 1 or
As t → ∞ with N constant the state of the wild-type population becomes constant over time, and at equilibrium both Pi,t+1 and Pi,t converge to the single value Pi (an abbreviation for Pi,∞).
By making the simplifications described in the previous two paragraphs, we obtain a set of simultaneous equations in Pi for i = 0, 1, 2,..., imax. In general, we can start by solving
The rate of adaptation: Our model for long-term adaptation assumes that beneficial mutations are rare enough that they can be considered, to a good approximation, to arise in close-to-equilibrium populations, so that Pfix is relevant. We wish to find an approximation for C, the expected increase in mean log-fitness per generation. To do this we first find an approximation for ΔC, the expected increase in log-fitness per beneficial substitution, which in turn requires an approximation for ΔCi, the expected increase in log-fitness per beneficial substitution conditional on the beneficial allele arising in background i.
If the beneficial allele arises on background i and is ultimately fixed, then some number hi of deleterious alleles will be fixed by hitchhiking (allowing the possibility of h0 = 0). Clearly hi ≥ i, but the problem is that hi conditional on fixation is a random variable and may exceed i in the event that the beneficial mutation is lost from the background on which it originally arose but ultimately does becomes fixed. We make use of the decomposition Pi = pi + xi, which is detailed in appendix c. Here pi = p[wi - 1] is the probability of fixation given that the beneficial allele fixes in the background in which it arose (i), and xi is the probability of fixation given that the beneficial allele is lost from the background in which it arose. xi can be calculated directly by numerical solution of Equation C2 or simply by taking the difference between Pi and pi. This partitioning of fixation events into two mutually exclusive possibilities suggests that for U ⪡ 1 it might be reasonable to suppose that the fate of a beneficial allele lost by mutation from background i, conditional on eventual fixation, is identical to the fate of a beneficial allele that arises in background i + 1, again conditional on ultimate fixation. This leads to the approximation
As explained above, the assumption of our model for continued adaptation is that the mutation fixation process is Poisson and occurs at rate NkUPfix. Because we assume multiplicative effects of multiple beneficial alleles, the long-term average rate of mean log-fitness increase is given approximately by
When U is varied and the other model parameters are held constant, C is maximized at a particular value of U, which we call the optimum mutation rate Uopt. Orr (2000b) proved that when sb < sd (so that imax = 0) the optimum mutation rate is given simply by Uopt = sd. This result can be seen by substituting (2) into (18) and differentiating to obtain
NUMERICAL RESULTS
Fixation probabilities when many genetic backgrounds are relevant: As we described in the Introduction (see also Fisher 1930; Manning and Thompson 1984; Charlesworth 1994; Peck 1994; Barton 1995; Orr 2000b), when sb ≤ sd a beneficial allele must arise in the single most fit genetic background to have any chance of fixation, and the net fixation probability is given by Equation 2. However, when sb ⪢ sd there are many genetic backgrounds in which a beneficial allele can arise and have some probability of fixation and so the situation is more complex. This is illustrated in Figure 1, which shows Pi (top, line), the fixation probability of a beneficial allele arising in a background with i deleterious alleles, assuming a wild-type population at equilibrium and parameter values sb = 5 × 10-3, sd = 5 × 10-4, and U = 3 × 10-3. The probability of arising in each background, fi (top, dots), and the way that Pi and fi combine (bottom) to determine the net fixation probability Pfix are also shown. For beneficial alleles arising in the least fit background of relevance, with i = imax = 9 deleterious alleles, the fixation probability is very small (wimax < ∼1 + sd and hence Pimax < ∼2sd for sd ⪡ 1) because the advantage of the beneficial allele is almost totally eliminated by the deleterious alleles it is linked to and because any new deleterious mutation will eliminate that advantage altogether. As i decreases Pi increases and approaches an asymptote, which is P0 ≃ 2(U + sb) = 1.6 × 10-2 for U ⪡ 1, sb ⪡ 1. This asymptote occurs because the beneficial allele is in a genotype with absolute fitness approaching (1 + sb)exp[U] and because many new deleterious mutations must occur to reduce that advantage. A mathematical description of this behavior, which is a good approximation when U ⪢ sb, is detailed in appendix c. In this example the main contribution to the net fixation probability Pfix is from moderate fitness backgrounds with i intermediate between zero and imax. For the parameter values used in Figure 1 a beneficial allele that fixes is most likely to be one that arose on a background of i = 5 deleterious alleles, and therefore at least one-half its selective advantage will be negated by the deleterious alleles that hitchhike to fixation with it.
—A beneficial allele has probability fi (top, dots) of arising in a background of i deleterious alleles. Given that it does, it has fixation probability Pi (top, line). The sum of fiPi (bottom) over all backgrounds gives the net fixation probability Pfix ≃ 7.8 × 10-3 for the parameter values used here (sb = 5 × 10-3, sd = 5 × 10-4, U = 3 × 10-3).
Fixation probabilities in an equilibrium population: Figure 2 shows results for the situation where the beneficial allele arises in a population at equilibrium under selection and deleterious mutations of fixed effect. It explores the region of the parameter space where U/sd ≤ 10, so that f0 ≥ e-10 ≃ 4.5 × 10-5 will represent a large number of individuals for population sizes that are realistic, at least for bacteria. Shown is the relative fixation probability when there is interference, R = Pfix/ p[sb], where the fixation probability of the beneficial allele is Pfix and the value it would take in the absence of any interference is p[sb]. A built-in Mathematica algorithm (Wolfram 1996) was used to fit contours to a lattice of values calculated by numerical solution of Equations 11 and 12. The irregularities in the contour lines are not caused by numerical inaccuracies but occur near where the number of relevant genetic backgrounds imax changes from one integer value to the next, which causes the gradient of Pfix to change abruptly.
For the region of the parameter space considered in Figure 2 the relative fixation probability spans almost five orders of magnitude. A logarithmic scale of fixation probabilities is appropriate if one wishes to understand in which regions of the parameter space evolution essentially cannot proceed. However, for other questions a linear scale is more appropriate. In the case of competing subpopulations, a small difference in rates of adaptation is critical, and the rate of adaptation is approximately linear in the fixation probability. In a log-log-linear space the (sb, sd, R) surfaces show regions with roughly constant either high (R ≃ 1) or low (R ≃ 0) fixation probability. To help visualize this we show additional contour lines at R = 0.9 (dotted lines) and R = 0.5 (dashed lines), which together with the first solid contour at R = 0.1 roughly define the transition from no effect to a severe effect of interference from deleterious alleles.
In all of Figure 2, the area above and to the left of the diagonal sb = sd represents the region of the parameter space where Equation 2 applies, and the area below and to the right of this diagonal represents the region of the parameter space where the methods developed above are necessary to calculate the fixation probability.
Several trends worthy of comment are evident in Figure 2. First, by comparing across all three panels we can see that fixation probabilities are reduced more severely by interference as the rate of deleterious mutation U increases. This result is expected. Second, the severity of the reduction in fixation probability often but not always increases monotonically as the strength of selection against deleterious alleles sd decreases. An example of nonmonotonicity in sd can be seen when U = 10-3 and sb = 10-2.5 (Figure 2, top). This perhaps counterintuitive result occurs because there are two opposing forces at work here. As sd decreases the frequency of deleterious alleles increases, and hence they are more likely to be present in the background on which the beneficial allele arises, but the effect of any one deleterious allele in reducing the advantage of the beneficial allele is less. The graphs show that, for the parameter space explored, the former force tends to dominate the latter. For sb < sd examination of Equation 2 shows that R increases as U increases and sd decreases, but it was not at all clear that this dependency would be true for most but not all sb.
—The fixation probability Pfix of a beneficial allele conferring a selective benefit sb is reduced by interference from segregating deleterious alleles causing disadvantage sd that arise at completely linked loci at rate U. The relative fixation probability in an equilibrium population (R = Pfix/ p[sb], measured relative to the fixation probability without interference) is shown as a function of sb and sd. From top to bottom U = 10-3, U = 10-2, and U = 10-1.
A third visible trend is that beneficial alleles of larger effect have fixation probabilities that are less influenced by segregating deleterious alleles. This is an intuitively reasonable result, but one that is not true when sb < sd. Equation 2 shows that R is independent of sb, which can be seen to be true in the parts of Figure 2 where this equation applies, i.e., everything above and left of the line sb = sd. To the right of and below the line we see that the situation is more complex. The final trend worth noting is that in some parts of the parameter space there is a catastrophic reduction in fixation probability caused by interference from segregating deleterious alleles. To a coarse approximation, it can be said that R ≃ 0.5 when sb = max{sd,U} and R will be very small when both sb ⪡ sd and sb ⪡ U.
The three parts of Figure 2 are almost perfect replicas of each other, offset by the value of U, suggesting that R depends only on two compound parameters sb/U and sd/U. Equation 2 shows that this is true when sb < sd. In appendix c we show that this is also true when sd ⪡ sb ⪡ 1 and U ⪡ 1, that is, when selection and mutation are both weak and many genetic backgrounds are relevant.
Nonequilibrium populations: In appendices a and b we derive results for fixation probabilities when the population of interest was free of segregating deleterious alleles at some time t = 0 in the past. This initial condition approximates the effect of a rapid selective sweep, a severe population bottleneck with rapid recovery, or the founding of a laboratory evolution experiment from a single clone. The fixation probability of a beneficial mutation that arises at some subsequent time t =τ is denoted Pfix,τ. The results above for an equilibrium population are a special limiting case of this scenario, where τ → ∞.
Consider first the simplest case τ= 0. At that time a beneficial mutation is guaranteed to arise in a background free of deleterious alleles (f0 = 1), which would increase its net fixation probability. However, at the same time the population mean fitness w¯ would be unity, causing the absolute fitness of genotypes containing the beneficial allele to be lower than in an equilibrium population (where w¯ = e-U; see Equation 6), which would cause a decrease in its net fixation probability. There are therefore two opposing forces at work, and their combined effect on the fixation probability of a beneficial allele is not obvious. It is not necessarily sufficient to argue that the net fixation probability is reduced by variance in fitness across backgrounds and is therefore higher when there is zero variance at τ= 0, because this argument does not take into account the changing mean fitness of the wild-type population. (Indeed a variance-based argument fails to predict how Pfix depends on sd in an equilibrium population.)
In appendix b we prove that, for the special case where sb < sd, the fixation probability for a beneficial mutation occurring at time τ< ∞ is always greater than in an equilibrium population (τ= ∞). To see the importance of this result, consider further the case of a weakly selected beneficial allele with sb ⪡ U, sb ⪡ sd arising at time τ= 0. Genotypes containing such a beneficial allele all give rise to, on average, less than one offspring of the same genotype at t = 0 and for some period of time thereafter (because e-UWi,0 < 1, see appendix a). To have any probability of fixation at least one copy of the allele must persist until the wild-type mean fitness has decayed significantly to have absolute fitness greater than one. It is therefore quite surprising to find that such beneficial alleles always have greater fixation probability than they would if they arose in an equilibrium population. We offer the following explanation for our perhaps counterintuitive result. As we go backward in time there is a decrease in fixation probability due to increasing w¯t and an increase in fixation probability due to increasing f0,t. Because f0,t+1/f0,t = e-U/w¯t these two forces are coupled, and the nature of this coupling ensures that, working backward in time from an equilibrium fixation probability, the fixation probability will always increase. On the basis of our numerical results for sb > sd we speculate that this is true for all values of sb and sd.
—The fixation probability Pfix of a beneficial allele conferring a selective benefit sb is inflated if it arises some short time τ after segregating deleterious alleles have been purged from an asexual population. The degree of inflation (I = Pfix,τ/Pfix,∞, measured relative to the fixation probability in a population at mutation-selection equilibrium) is shown as a function of τ and sb. Left, U = 10-2, sd = 10-2, and the maximum inflation is I ≃ 1.58. Right, U = 10-1, and sd = 10-2, and the maximum inflation is I ≃ 37.7. In both sides the same shading scheme is used, solid contours are at geometric intervals of ∼1.44, and dashed contours are at geometric intervals of ∼1.048.
We measure the effect of a nonequilibrium population by the inflation in fixation probability, I = Pfix,τ/ Pfix,∞, measured relative to an equilibrium population. Figure 3 shows I as a function of sb and of τ. In these calculations we assumed that after 1000 generations the population would be close to equilibrium. The choice of sd = 10-2 for these plots was influenced by the computer time required for the calculations (see appendix a), and the choices of U = 10-2 (left plot) and U = 10-1 (right plot) represent situations where there is a moderate and a large reduction in fixation probability at equilibrium. For these parameter values, any inflation in fixation probability is a relatively short-lived effect and is negligible [in the sense that (I - 1)/(max{I} - 1) < 0.05] after 300 generations. This is to be expected because a population perturbed from mutation-selection balance decays toward its equilibrium state on a timescale proportional to 1/sd = 100 generations (Johnson 1999). What is more remarkable is that the inflation in fixation probability occurs only for beneficial alleles with certain selection coefficients. For U = 10-2 only beneficial alleles with sb ≃ 10-2 have inflated fixation probabilities and the effect is moderate (I ≤ 1.6). For U = 10-1 beneficial alleles with selection coefficients sb ≃ 10-2 or greater have inflated fixation probabilities and the effect is substantial (I ≃ 40). To understand why this is so, it is necessary to consider these fixation probabilities relative to the case with no interference, measured by R = Pfix,τ/p[sb], which are shown in Figure 4. The abrupt changes in gradient visible in the graph are caused when the number of relevant genetic backgrounds imax changes from one integer value to the next.
—Relative fixation probabilities, R, in a nonequilibrium population. Top, sd = 10-2, U = 10-2; bottom, sd = 10-2, U = 10-1. Note the very different scales of the y-axes. Curves show τ= 0 (dotted), τ= 100 (dashed), and τ= 1000 ≃ ∞ (solid).
Assuming that the fixation probability is always less than it would be in the complete absence of deleterious mutation (we prove this for sb < sd in appendix b and speculate that it is true always), then if fixation probabilities are only moderately reduced, as in the case sd = 10-2, U = 10-2 (Figure 4, top), then the inflation in fixation probability I can be at most moderate. On the other hand when fixation probabilities are substantially reduced, as in the case sd = 10-2, U = 10-1 (Figure 4, bottom), then the inflation in fixation probability I can also be substantial. The slightly mysterious peak of inflation I at sb ≃ 10-2 for U = 10-2 can be partly explained because, for sb ⪢ 10-2, there is no reduction in fixation probability at equilibrium and hence there can be no transient inflation.
—Under the model assumptions (see text and Figure 6), the optimum deleterious mutation rate Uopt maximizes the long-term average rate of fitness improvement in an asexual population. Uopt depends only on sb and sd, against which it is plotted here.
Figures 3 and 4 show that, for the specific departure from equilibrium that we have studied, there is little effect on fixation probability for weakly selected beneficial alleles. It is possible that this is because the fate of such alleles takes a long time to be determined, and the transient nonequilibrium state of the wild-type population is therefore of little relevance.
The rate of adaptation and the optimum mutation rate: Figure 5 shows numerical calculations of the optimum mutation rate Uopt as a function of the two parameters on which it depends, sb and sd. The optimum mutation rate is the rate that maximizes the long-term average rate of fitness increase, as estimated using Equation 18. These results are in agreement with Orr’s (2000b) finding that Uopt = sd when sb ≤ sd (Figure 5, left). However, when sb > sd the dependence on sd is much weaker, and to a very coarse approximation Uopt ≃ max{sb, sd} for the whole parameter space examined here. This result can be explained in terms of our results for fixation probabilities. For any chosen combination of sb and sd imagine the appropriate points in Figure 2. Consider first a very small value of U. The fixation probability is high. Now imagine increasing U so that a “hole” starts to appear in the corner of the plane. At first, there is a less-than-linear decline in fixation probability with U and the rate of beneficial mutations increases linearly with U, so the rate of fitness improvement C increases. Suddenly, when U exceeds max{sb, sd} there is a catastrophic decline in fixation probability and an associated decline in C. Hence the rate of adaptation is maximized just before this catastrophe, at Uopt ≃ max{sb, sd}. This is illustrated in Figure 6.
DISCUSSION
We have described a method for calculating fixation probabilities when segregating deleterious alleles of fixed effect jointly influence the fate of a beneficial allele in an asexual population. Our analysis includes as a special case the situation studied previously where any single deleterious allele overwhelms the advantage of the beneficial allele (Manning and Thompson 1984; Charlesworth 1994; Peck 1994; Orr 2000b). However, our analysis applies for any strength of selection on both the deleterious alleles and the beneficial allele, provided that selection is strong relative to drift and that Muller’s ratchet is not operating rapidly. In particular, it allows study of the fate of a beneficial allele of large effect, where “large” means larger than the effect of a single deleterious allele. Our method allows fixation probabilities to be calculated to arbitrary precision by solution of a set of equations, but we cannot in general write down an equation for the fixation probability that improves our understanding of the effect we are studying. The value of our numerical results is that a reasonable intuition about the effect of interfering deleterious alleles on fixation probabilities can be developed by studying Figure 2. This numerical work together with the approximate analytical results derived in appendix c justifies the following “rules of thumb” concerning R = Pfix/p[sb], a statistic that describes the fixation probability in the presence of segregating deleterious alleles (Pfix) in an equilibrium population, relative to what it would be in the absence of deleterious alleles (p[sb]).
For weak selection and mutation R depends only on the compound parameters sb/U and sd/U, which describe the strength of selection relative to the rate of deleterious mutation.
The relative fixation probability R cannot be predicted from only the variance in fitness in the wild-type population, which is Usd.
For sb < sd the relative fixation probability is R = exp[-U/sd].
For sd ⪡ U and sb > U there is negligible reduction in fixation probability and R ≃ 1.
For sd ⪡ U and sb < U there is substantial reduction in fixation probability and R ≃ 0.
Conclusions 1, 2, 4, and 5 are novel.
—Assuming the ratio of beneficial to deleterious mutations, k, is fixed and small, the long-term average rate of fitness improvement C in an asexual population is maximized when the deleterious mutation rate U takes a particular value, Uopt, which we term the optimum. Uopt is independent of k and of the population size N. The optimum is clear on plots of C against U, shown here for three choices of parameter values: sb = sd = 10-2 (solid); sb = 10-2, sd = 10-3 (dotted); and sb = 10-3, sd = 10-2 (dashed). For all curves Nk = 1.
Previous studies of the effect of linked deleterious alleles on fixation probabilities have all assumed sb < min{sd}, with three relatively minor exceptions. Peck (1994) presented simulation results where sb > sd, but only for two choices of parameter values and thus gave little indication of how Pfix depends on model parameters. The earlier numerical work of Manning and Thompson (1984) was similarly cursory, and their analysis contained an error (as discussed by Peck 1994 and described above also). Orr (2000b, Figure 2) showed simulation results where both sb and sd were drawn from continuous distributions for each mutational event and in a fraction (∼2.5%) of pairwise cases sb > sd.
The emphasis on sb < sd in previous studies has been influenced by a combination of mathematical convenience and the belief that a major component of adaptation is due to beneficial alleles of small effect. The view that sb is typically small was argued by Fisher (1930) on the basis of a “geometrical” model, which involves evolution toward an optimum in high-dimensional phenotype space. In this model, mutations are assumed to alter the phenotype in a random “direction” and the probability that a given mutation produces a phenotype closer to the optimum increases as the effect of that mutation becomes smaller. Thus, the density function for the distribution of sb (conditional on sb > 0) is expected to be monotonically decreasing. This tendency becomes more marked as the dimensionality of phenotype space increases. Later studies (Kimura 1983, pp. 154-155; Orr 1998, 1999, 2000a) describe the distribution of sb conditional on fixation and so do not alter Fisher’s (1930) conclusion concerning the distribution of effects of newly arising beneficial alleles. We note that these later studies have all assumed Pfix ≃ 2sb, and so a possible area for further work is to examine expressions for fixation probability that are nonlinear in sb, such as those derived here, in the context of Fisher’s geometrical model.
There is an accumulating body of empirical evidence (reviewed by Orr 1999) suggesting that adaptation involves factors of moderate or large effect and so a complete theory of the population genetics of adaptation must include the parameter space where sb > sd. Whether more general evolutionary theories that deal with the evolution of sex (e.g., Charlesworth 1994; Peck 1994) or the degeneration of Y chromosomes (e.g., Charlesworth 1996; Orr and Kim 1998) must also deal with this part of the parameter space is difficult to answer. Clearly the absolute magnitude of sb is of little relevance, and what matters (for models with fixed sd) is whether sb < sd is typical or at least provides a good approximation. It can be argued that sb < sd for most mutations, because there is a class of deleterious mutations with sd ≃ 0.02 and a class of lethal mutations (see, e.g., Mukai 1964; Keightley 1996; Keightley and Eyre-Walker 1999; Lynchet al. 1999). However, deleterious alleles of large or lethal effect segregate at low frequency or not at all and hence have little influence on the fixation probability of beneficial alleles. It may be more important that there is a substantial “nearly neutral” class of deleterious mutations with very small effects on fitness (Ohta 1973, 1992; Ohta and Gillespie 1996; Davieset al. 1999; Keightley and Eyre-Walker 1999). It seems certain that there will be beneficial alleles that have greater selective advantage than any individual deleterious mutation in this latter class, but the combined effects of many such very slightly deleterious mutations could still have a substantial effect on fixation probabilities. The analysis for large sb presented here may therefore be of importance for beneficial alleles that would not be considered (from a Fisherian viewpoint) to have a particularly large effect. However, whether sb < sd or sb > sd for “most” mutations (or pairs of mutations) may not be the relevant question. In reality sd varies across loci and the effects of newly arising deleterious alleles are drawn from an approximately continuous distribution. Fixation probabilities for beneficial alleles depend on the approximately continuous distribution of fitnesses of genetic backgrounds on which beneficial alleles arise and on the way that subsequent deleterious mutation moves beneficial alleles from one background to another after they arise. An analysis of such a model will be described in a subsequent article.
In the results we remarked that there is a region of the parameter space where fixation probabilities are catastrophically reduced. For weak selection against deleterious alleles, sd ⪡ U, there is a transition between high and low fixation probabilities that occurs in the region around sb ≃ U. This transition becomes more marked as sd becomes smaller and in fact approaches a step function in the limit sd 0. This conclusion and in fact any application of our→model when sd ⪡ U must be carefully qualified. As sd becomes smaller the distribution of fitness in the wild-type population piles up around w = e-U with very small variance. In our analysis, we measure the fitness of the beneficial allele relative to the fitness of an individual with w = 1, and such individuals become vanishingly rare as sd 0 with constant U. Technically speaking this would not → be a problem if we held by the stated assumption that the distribution of fitness in the wild-type population is constant over time, but clearly the validity of this assumption becomes questionable when f0 = exp[-U/sd] ⪢ /1/N. A second reason for our argument breaking down as sd → 0 is that we assume the beneficial allele subpopulation reaches mutation-selection equilibrium before displacing the wild-type subpopulation (see Equation 10). This is equivalent to assuming that the time taken for a selective sweep (∼ln[N]/sb generations) is at least as long as the time taken to approach mutation-selection balance (∼1/sd generations). The minimum population size N required for this assumption to hold is of magnitude exp[sb/sd] and as sd becomes small compared to sb this rapidly becomes unrealistic. As sd becomes smaller still, the frequencies of deleterious alleles begin to fluctuate under the influence of genetic drift and our model breaks down in yet another way. Finally when sd is small compared to the per site mutation rate deleterious alleles reach high frequencies and back mutation can no longer be ignored. There is therefore a need for a more thorough analysis of how fixation probabilities are influenced by very weak selection at very many linked loci in large but not infinite populations.
We have argued that a rapid selective sweep or bottleneck of one individual will purge segregating deleterious alleles from an asexual population and that this will inflate the fixation probabilities of subsequently arising beneficial mutations. In other words, adaptation can trigger further adaptation. This effect would cause selectively driven substitutions to tend to occur in bursts, and the substitution process would tend to be overdispersed relative to a Poisson process. However, the effect seems to be small in many regions of the parameter space. Even when the effect is substantial (e.g., when sb = 3 × 10-2, sd = 10-2, and U = 10-1) our results should not be taken to imply that the net effect would be to speed up adaptation in the long term. This is because the population size of individuals carrying the sweeping beneficial mutation, or the population size during recovery after the bottleneck, will be reduced and less beneficial mutations will arise. Our result shows only that the temporary purging of segregating deleterious alleles will cause a substitution process that is not Poisson in time. [The fixation probabilities of these beneficial alleles will also be increased because they arise in a growing subpopulation (Ewens 1967).] Interestingly, Gerrish (2001) has recently shown that interference between multiple simultaneously segregating beneficial alleles (the Hill-Robertson effect; Fisher 1930; Muller 1932; Hill and Robertson 1966; Felsenstein 1974; Barton 1995; Gerrish and Lenski 1998; Gerrish 2001) will cause the selected substitutions to be “rhythmic,” that is, underdispersed or more uniformly spaced in time than random. Determining the interaction between these effects, and whether the net effect is an under- or overdispersed substitution process, requires further work. We note that although Gerrish (2001) performed simulations that explicitly included deleterious alleles, he assumed that f0 = 0.9 at all times, so that only 10% of beneficial alleles arose on a background containing a deleterious allele and nonequilibrium effects were not allowed for at all. We speculate that the index of dispersion for selectively driven substitutions will depend on population size N, since increasing N increases the intensity of the Hill-Robertson effect but has little effect on interference from segregating deleterious alleles. Furthermore, both our argument and the model of Gerrish (2001) assume an unlimited supply of beneficial alleles. More realistic assumptions, such as the mutational landscape model studied by Gillespie (1984), will also cause the substitution process to deviate from a Poisson process. An area for further work is to examine both the effects described here and by Gerrish (2001) in the context of molecular evolution, where there are a considerable amount of data suggesting that the substitution process is overdispersed (see Cutler 2000 for a review).
It is important to note that our results for nonequilibrium populations consider one of the most extreme nonequilibrium situations possible. Our assumption of a large and completely homogenous population at time t = 0 is motivated mostly by mathematical tractability, but it should be a good approximation for some biologically realistic situations, including a population recovering by unchecked binary division after a bottleneck of one individual or a population immediately after a strongly selected (sb > 1) beneficial allele has swept to fixation. In these cases the population size could reach 109 in <30 generations, which is rapid compared to the timescale of approach to mutation-selection equilibrium when sd ⪡ 0.03 (Johnson 1999). Burch and Chao (2000) bottlenecked two populations of the RNA virus ϕ6 to size 1 and observed a subsequent approach to mutation-selection equilibrium over a period of ∼100 generations. This provides some empirical evidence that nonequilibrium situations such as the one modeled here are relevant. More generally, we hope that our results will motivate further research on evolutionary processes in nonequilibrium populations, under more widely applicable assumptions.
We have shown that the optimal mutation rate, Uopt, which is the mutation rate that maximizes the long-term rate of fitness improvement, can depend on the strengths of selection on both deleterious and beneficial alleles. Orr (2000b) previously found that, when selection for beneficial alleles is weaker than against deleterious alleles (sb < min{sd}), the optimum mutation rate depends only on the strength of selection on deleterious alleles and specifically that Uopt = Eh(sd). Orr (2000b) tentatively suggested that this result might explain Drake’s rule, that a range of DNA-based microorganisms with great variation in life history and genome size all have roughly the same per genome per replication mutation rate μg ≃ 0.003 (Drake 1991; Drakeet al. 1998; Sniegowskiet al. 2000), arguing that it is plausible that Eh(sd) might be roughly conserved across the group of organisms for which Drake’s rule applies. Our results show that this explanation is sensitive to Orr’s (2000b) assumption that sb < min{sd}. Our results suggest that if on the other hand sb > sd is typical for microorganisms, then a possible alternative explanation for Drake’s rule is that the strength of selection on beneficial alleles could be roughly conserved across the relevant populations. In this respect we note that for the bacterium E. coli growing on minimal media Kibota and Lynch (1996) estimated that s¯d ≤ ∼0.012 and that Imhof and Schlötterer (2001) have estimated selection coefficients for beneficial alleles that were often >0.01, conditional on them both becoming fixed and being observed. However, as pointed out by Orr (2000b) and Sniegowski et al. (2000), there is only a weak theoretical basis for supposing that asexual populations will evolve a mutation rate that is optimal. Areas for further theoretical research include finding the optimal mutation rate when beneficial and deleterious alleles each have selection coefficients drawn from some distribution of effects and determining under what conditions an asexual population will evolve its optimal mutation rate.
APPENDIX A: NONEQUILIBRIUM WILD-TYPE POPULATION
There are many ways in which the wild-type population could be out of equilibrium. For simplicity, we study only one possibility. We assume that the population of interest is descended from one that was free of segregating deleterious alleles at time t = 0, where t is measured in generations, and that any deleterious alleles that were fixed at t = 0 do not revert to wild-type alleles by back mutation. This initial condition approximates the effect of a rapid selective sweep, a severe population bottle-neck with rapid recovery, or the founding of a laboratory evolution experiment from a single clone. As time increases, the frequencies of deleterious alleles will increase and the population will approach an equilibrium at mutation-selection balance. We assume that a single beneficial allele arises at some time t =τ> 0. The analysis in the main part of the article is therefore a special case with τ= ∞. Since the fate of the beneficial allele depends on the time of its origin, we write Pfix,τ for its fixation probability.
Because we assume that there are no segregating deleterious alleles in the wild-type population at time zero, the mean fitness and distribution of number of deleterious alleles per wild-type individual will change over time. Johnson (1999) gives equations describing these dynamics, in terms of the Laplace transforms (see, e.g., Feller 1971; Pinkus and Zafrany 1997) of the distributions involved. When deleterious alleles have a fixed effect (1 - sd), the distribution of effects on log fitness has Laplace transform d~(z) = (1 - sd)z, where z is a dummy variable. Equation 12 of Johnson (1999) shows that the Laplace transform of negative log fitness at time t is
Equations A3 and A4 can also be derived using factorial cumulant-generating functions, as described by Dawson (1999).
The fixation probability of a beneficial mutation arising at time t =τ can be obtained by choosing some large time t = T, setting Pi,T = Pi,∞, and then finding the Pi,t recursively in decreasing order of t using Equation 9 with the upper limit in the sum set to imax. Results obtained in this way, using T = 1000, are shown in Figures 3 and 4. The choice of T = 1000 is justified by noting that w¯t and λt approach their equilibria over a timescale proportional to 1/sd = 100 generations. Similar calculations for smaller sd would be time-consuming at present because both the appropriate value for t and the number of genetic backgrounds to be considered for a given sb (imax) scale with 1/sd and hence the number of calculations scales with
Given the computationally intensive nature of these calculations, an approximation is desirable. It is tempting to assume that w¯t changes slowly relative to the time taken for the fate of the branching process to be determined (which is typically “rapid”; see, e.g., Feller 1968, p. 297), which would give Pi,t+1 ≃ Pi,t ≃ Pi,τ. Then the method used for τ= ∞ could be followed with appropriately modified values for the Wi and fi. Numerical results (not shown) show that this approximation is highly unsatisfactory for many combinations of parameter values. Specifically, this approximation often gives Pfix,τ < Pfix,∞ when more accurate calculations show that Pfix,τ > Pfix,∞. Because this approximation assumes that the fate of the beneficial allele is determined rapidly relative to the mean fitness dynamics in the wild-type subpopulation, it is most accurate for large sb and/or large τ. However, sufficiently large sb are so large that there is negligible effect from interfering deleterious alleles and sufficiently large τ are so large that solutions are indistinguishable from solutions for equilibrium wild-type populations.
APPENDIX B: NONEQUILIBRIUM POPULATION AND BENEFICIAL ALLELE OF SMALL EFFECT
For the special case sb ≤ sd we can prove an important result. This is that the net fixation probability of a beneficial allele arising in a nonequilibrium population (for the specific departure from equilibrium considered in appendix a) is greater than in an equilibrium population. More specifically, going backward in time from a close-to-equilibrium population, the net fixation probability increases. We speculate that this result is probably true in general but do not see an easy way to prove it. We can also show that for sb ≤ sd the net fixation probability increases by at most a factor (1 + sb) per generation backward in time. This may explain why beneficial alleles of small effect do not have inflated fixation probabilities in nonequilibrium populations.
When sb ≤ sd it is necessary only to consider the fate of beneficial alleles in a background of zero deleterious alleles; that is, imax = 0. The recursion for the fixation probability of a beneficial allele on such a background is
Our proof that net fixation probabilities increase as we go backward in time seems a little contorted, but we have been unable to find a more transparent proof. We first assume that a beneficial allele that arises in a nonequilibrium population at some large time τ= T has net fixation probability equal to that for an equilibrium population
To prove Equation B8, we note first that, from (B4), the equation
APPENDIX C: APPROXIMATIONS
In the analysis section of the article we described a method that allows us to calculate Pfix exactly (given the model assumptions) for any parameter values. However, this analysis makes no real improvement to our understanding of how fixation probabilities are influenced by segregating deleterious alleles. There does not appear to be a more concise exact expression for Pfix, except for the special case where sb < sd and hence imax = 0, which was solved by Peck (1994).
In this section we assume an equilibrium wild-type population, but the methods used could in principle be applied to nonequilibrium wild-type populations.
It is useful to write Pi = pi + xi. Here the first term gives the probability that the beneficial allele fixes within the genetic background in which it arose, and the second term gives the probability, conditional on it being lost from that background and that it was lost by mutation, that it fixes in some lower fitness background where it still confers a net advantage. ximax = 0 but Manning and Thompson’s (1984) analysis erroneously assumed xi = 0 for all i. By making this decomposition of Pi, separating out the j = 0 term from the sum in Equation 12 and then rearranging we can obtain
When selection and mutation are weak, we can assume 1 - pi ≃ 1, e-UWi ≃ 1, and ignore terms in U 2 and higher powers in the summation to obtain
We can use Equation C6 to show that when sb ⪢ sd the relative fixation probability R = Pfix/p[sb] depends only on the compound parameters σb = sb/U and σd = sd/U. First note that under these conditions imax ≃ sb/sd =σb/σd and that the fi depend only on λ= U/sd = 1/σd. Let the contribution to R from background i be Ri = Pi/p[sb] ≃ Pi/2sb. Then from Equation C6, which assumes weak selection and weak mutation, we have
Acknowledgments
We thank Brian Charlesworth, Arcadi Navarro, Allen Orr, Sally Otto, Mario Pineda-Krch, Rosie Redfield, Olivier Tenaillon, and two anonymous reviewers for discussions and/or helpful comments on the manuscript. T.J. is supported by Wellcome Trust International Prize Travelling Research Fellowship no. 061530. N.B. is supported by the Biotechnology and Biological Sciences Research Council and by the Natural Environment Research Council.
Footnotes
-
Communicating editor: W. Stephan
- Received November 26, 2001.
- Accepted May 17, 2002.
- Copyright © 2002 by the Genetics Society of America