| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Genetics, Vol. 172, 611-626, January 2006, Copyright © 2006
doi:10.1534/genetics.105.046680
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Laboratoire Génome, Populations, Interactions, Adaptation, USTL-IFREMER-CNRS UMR 5171, Université des Sciences et Techniques du Languedoc, 34095 Montpellier, France
1 Corresponding author: Department of Biology, Queen's University, Kingston, ON K7L3N6, Canada.
E-mail: jeanbaptisteandre{at}gmail.com
| ABSTRACT |
|---|
|
|
|---|
The general framework undertaken in numerous preceding works, as well as in the present one, is game theory. Let us consider a population fixed for a given mutation rate (the resident) and introduce a rare variant bearing a modifier gene affecting replication accuracy. The aim of analytical models is to predict the fate of the variant and ultimately to find a resident mutation rate in which no variant can invade (i.e., an evolutionarily stable mutation rate; MAYNARD-SMITH 1982). In this purpose, the fitness of modifier-bearing individuals must be calculated. This fitness may be affected directly by the mutation rate modifier. For instance, owing to the thermodynamic cost of replication fidelity, a modifier increasing mutation rate may have a positive effect on fitness (DAWSON 1998, 1999). In addition, fitness is also affected indirectly by the carriage of the modifier, through linkage disequilibrium with other loci. This indirect effect, the most peculiar effect of mutation rate, is double sided. Consider the case of a modifier increasing mutation rate. On one side, when they replicate their DNA, modifier-bearing individuals generate more unfavorable errors than wild-type individuals. Therefore, the modifier allele is positively linked with deleterious mutations, which generates an indirect fitness cost. However, on the other side, modifier-bearing individuals are also more likely to generate favorable errors at replication, which generates an indirect fitness benefit for the modifier. The aim of theoretical models is to measure the respective strength of these three effects.
At first, one could get the impression that mutation rate evolution depends simply on the average effect of DNA alterations. If, in expectation, random mutations increase fitness, then mutation is good; if they decrease fitness then it is bad. Mutation rate evolution would then be a trivial issue. Indeed, even when a population is in the course of adaptation, the vast majority of mutations are still deleterious. Therefore, in expectation, altering an individual's DNA by mutation always leads to a reduction of its fitness. An individual who must choose between producing mutated or nonmutated offspring should always prefer nonmutated ones, and mutation rate should be fixed at the minimum one possible.
Theoretical models have shown that this simple answer is not accurate because the indirect cost of deleterious mutations and the indirect benefit of advantageous ones are not symmetric effects. Consider again a modifier increasing mutation rate. In a large asexual population, any individual carrying more than the minimum number of deleterious mutations ultimately leaves no descendants; i.e., it is evolutionarily dead (FISHER 1930, p. 136). Therefore, the indirect cost of the modifier is characterized by a slight increase in the number of "dead" offspring generated by modifier-bearing individuals (LEIGH 1970; DAWSON 1998, 1999; JOHNSON 1999a). The indirect effect of beneficial mutations is different. When a new favorable mutation appears, it rises in frequency until complete fixation and, in the absence of recombination, yields the simultaneous fixation, by hitchhiking, of the genetic background in which it appeared (MAYNARD-SMITH and HAIGH 1974). In the following we call this phenomenon a selective sweep. This is important in the evolution of mutation rate, because a modifier increasing mutation rate is more likely than at random to generate beneficial mutations and reach fixation with them. Therefore, selective sweeps generate a very strong indirect benefit for the modifier, and this benefit is much stronger than the cost of deleterious mutations (see, for instance, LEIGH 1970).
To describe the long-term fate of a modifier, LEIGH (1970) and JOHNSON (1999a) consider a population for a large number of generations and assume that a selective sweep occurs with a constant probability K per generation (e.g., after the environment has changed). The expected frequency of a modifier gene after T generations is then the product of selection due to deleterious mutations along the T generations times the selection owing to the KT selective sweeps that have occurred. From this analysis, LEIGH (1970) and JOHNSON (1999a) find a resident's mutation rate in which no modifier's expected frequency can increase. They show that, in the absence of recombination, the evolutionarily stable strategy is to generate errors at the exact same rate that the environment is changing.
However, this approach involves several assumptions that should be relaxed for a more profound analysis. First, to be able to measure the selection upon the modifier in each generation, LEIGH (1970) and JOHNSON (1999a) had to assume that the modifier stays rare in all the considered generations. This is reasonable for a weak-effect modifier in an infinite population (LEIGH 1970) or in a finite sexual population because the strong recombination rate between modifier and advantageous mutations prevents important modifiers' increases during selective sweeps (JOHNSON 1999a). In a finite asexual population, however, each selective sweep is a bottleneck, reducing polymorphism everywhere in the genome, including at the modifier locus. Particularly, if the sweep is the fixation of a single advantageous mutation, then the final modifier's frequency is either one or zero, which represents a strong variation relative to an initial low frequency (see discussion on this point by JOHNSON 1999a). Therefore, in a finite asexual population, the modifier cannot be assumed rare along all the generations considered, which implies important modifications of the model.
Second, LEIGH (1970) assumes that the rate of selective sweeps is controlled solely by the environment. However, in a finite population the generation of genetic polymorphism also affects the frequency of adaptation events. In other words, the frequency of selective sweeps relies both on external factors, i.e., the pace of environmental changes, and on internal factors, i.e., the population size and mutation rate. As a first consequence, the evolution of mutation rate might depend on population size. This has been observed in simulations of the dynamics of strong-effect modifiers (TADDEI et al. 1997; TENAILLON et al. 1999); a convincing model of mutation rate evolution should therefore account for this effect. As a second consequence, the fate of modifiers may rely on the resident mutation rate itself, which should change importantly the long-term evolutionary trajectories of mutation rate.
In this article, we describe the evolutionary trajectory of mutation rate owing to the recurrent appearance and fixation of weak-effect modifiers. In this aim, we first develop analytical tools to measure the effect of each selection component on the modifier's frequency (cost of accuracy, deleterious mutations, and adaptive mutations). For clarity, the cost of accuracy is described only in APPENDIX B. Second, we develop three complete models of mutation rate evolution corresponding to different ecological scenarios; each of these models is described in the main text of this article (models A, B, and C). All models are analyzed under two distinct hypotheses regarding the interference between adaptive and deleterious mutations (hypotheses 1 and 2). Hypothesis 1 is described in the main text, and hypothesis 2 is developed in APPENDIX A.
| MEASURES OF SELECTION COMPONENTS |
|---|
|
|
|---|
Deleterious mutations:
Let us first describe the indirect cost of a high mutation rate, owing to the generation of deleterious mutations. Modifiers are assumed to stay rare in all the generations where deleterious mutations affect their frequency. Note that this is not equivalent to assuming that they stay rare in all generations (LEIGH 1970; JOHNSON 1999a), as is made clear later on.
An asexual and haploid population of constant size N and discrete generations is considered. The population is first fixed for wild-type individuals, generating errors in DNA replication at a rate µ per base pair. Mutations are deleterious to fitness when they affect one among L base pairs, hence
is the genomewide deleterious mutation rate. A rare modifier gene, appearing by random mutation from wild-type individuals, is in frequency
at time
and yields a base pair mutation rate
and a deleterious mutation rate
.
Before the appearance of the modifier, the wild-type population is at a stationary distribution regarding the number of deleterious mutations per individual. Assuming that no deleterious mutation ever reaches fixation, the average relative fitness at this equilibrium is
(KIMURA and MARUYAMA 1966). This equilibrium is not affected by the appearance of a rare modifier. Let us consider the modifier lineage. At mutationselection balance, it will ultimately have an average fitness
. The modifier's frequency will hence be multiplied in expectation by
in each generation (where
is the average relative fitness in the population). If the modifier is rare and of relatively weak effect, then
is controlled only by wild types and is given by
; hence the modifier's frequency will follow
![]() | (1) |
stands for the expectation of the random variable X and
is the realized frequency of the modifier in generation t.
However, Equation 1 is valid only after the modifier has reached its own stationary distribution, which is not the case immediately after appearance (see JOHNSON 1999b). Indeed, the modifier initially appears within the wild-type background with an expected fitness
. To circumvent this problem we adopt a slightly different approach.
Let us first follow the number of modifier individuals that carry none but the minimum number of deleterious mutations (called nondeleterious modifiers in the following). This number in generation t is written
, while the number of modifiers carrying k deleterious mutations is
, and the total number of modifiers is
. At appearance, modifiers are drawn randomly from the resident population. Assuming that the resident population is at stationary distribution and that every deleterious mutation has the same multiplicative effect on fitness (
), the proportion of individuals from the resident population who carry none but the minimum number of deleterious mutations is
(HAIGH 1978). Therefore, at appearance, the number of nondeleterious modifiers has the expectation
![]() | (2) |
Let us now derive the expectation of the number of nondeleterious modifiers in any generation
, conditional on the realized number of nondeleterious modifiers being
in the previous generation. This gives
. Assuming that modifiers stay rare in all generations, the average relative fitness in the population is determined solely by the resident and is a constant
. In consequence, the effect of selection is linear, and the conditional expectation of
can be averaged over all realized values of
, giving the unconditional expression
. Solving this recurrence equation, and using Equation 2 for
, we derive the expected number of nondeleterious modifiers as a function of time,
![]() | (3) |
Dividing by the constant total population size, N, this finally gives the expected frequency of nondeleterious modifiers,
![]() | (4) |
However, we might need also to follow the total frequency of modifier individuals (
). From Equation 3, this quantity can be deduced when the modifier subpopulation reaches its own mutationselection stationary distribution. By definition, at stationary distribution, the ratio of the expected number of modifiers of each deleterious class relative to the expected number of modifiers of the zero class is constant. This gives the condition
![]() | (5) |
is the number of modifiers carrying
deleterious mutations (modifiers of the k-class). From Equation 3, we have
, and Equation 5 then becomes
![]() | (6) |
In other words, at stationary distribution, the number of modifier individuals of each deleterious class varies according to the same rate [i.e., it is multiplied by
in each generation]. Following HAIGH (1978), let us then assume that the number of new deleterious mutations per individual per generation follows a Poisson distribution of expectation
and express the expected number of modifiers of the k-class in generation
, conditionally on the realized number of modifiers of each class in the previous generation [given by the vector
]. This gives
![]() | (7) |
Assuming again that the modifier stays rare in all generations, the average fitness in the population is controlled solely by the resident and is the constant
. Therefore, here again, the effect of selection is linear, and Equation 7 can be expressed unconditionally as
![]() | (8) |
Introducing Equation 8 into condition (6) yields the following condition for the stationary distribution to be reached:
![]() | (9) |
To obtain the expected total number of modifiers at stationary distribution,
, Equation 9 is summed over all k, giving
, which simplifies to
![]() | (10) |
Therefore, the expected total number of modifier individuals in any generation at stationary distribution can be found as a function of the expected number of nondeleterious modifiers at the same generation. Finally, from Equations 3 and 10 the expected total number of modifiers in the population can be written as a function of time,
![]() | (11) |
Dividing by the total population size, N, this finally gives the expected total frequency of modifiers as a function of time, once the modifier subpopulation has reached stationary distribution:
![]() | (12) |
, which implies (i) that the total population size is large and (ii) that the modifier stays rare in all generations.
For comparison, let us now take the logarithm of Equation 12,
![]() | (13) |
![]() | (14) |
The log-slope of the modifier's frequency with time is the same in both cases [(13) and (14)] except that Equation 13 is shifted
generations back. Owing to the delay before reaching mutationselection balance, it is merely as if the cost of deleterious mutations was paid only
generations after appearance (see also JOHNSON 1999b). For instance, if deleterious mutations are lethal (
), then the shift is only of one generation because the mutationselection balance is attained in a single generation. Both expressions are compared with stochastic individual-based simulations (Figure 1). Equation 14 assumes that modifiers have an average fitness
immediately after introduction whereas they have the average fitness of wild types. As a consequence it yields an underestimation of the expected frequency of modifiers when they increase mutation rate (
) and an overestimation when they reduce it (
; not shown). Equation 13 provides a correct approximation, but only when the modifier has reached stationary distribution, which occurs a certain number of generations after introduction, this number being inversely proportional to the effect of deleterious mutations.
|
is the advantageous mutation rate). In this article, a selective sweep is assumed to be the total fixation of a single mutant (see also JOHNSON 1999a); i.e., the population size is small with regard to the average advantageous mutation rate (
, where
is the average fixation probability of an advantageous mutant,
is the average advantageous mutation rate, and
is the number of generations needed for the fixation of the advantageous mutation to occur). Hence the simultaneous or successive generation of several advantageous mutants during one selective sweep is not considered.
At this point, two alternative hypotheses need to be envisaged. The first hypothesis (hyp. 1) assumes that the beneficial mutations are much stronger than the deleterious ones and hence that selective sweeps involve random individuals, irrespective of the number of deleterious mutations they carry. According to this hypothesis, the expected total frequency of modifier individuals {
} needs to be considered as they may all participate in selective sweeps; this frequency is evaluated from Equation 12. The second hypothesis (hyp. 2), known as the "ruby in the rubbish" hypothesis (PECK 1994; ORR 2000; see also BACHTROG and GORDO 2004), assumes that beneficial mutations can fix only in nondeleterious backgrounds. This is true provided that deleterious mutations are stronger than advantageous ones. According to this hypothesis, the expected frequency of nondeleterious modifiers
} is the only relevant quantity; it is evaluated from Equation 4. These two hypotheses should be seen as two extremes of a continuum; in the general case indeed, the average fixation probability of an advantageous mutation is quantitatively reduced by the presence of deleterious mutation(s) in the same genome (JOHNSON and BARTON 2002), which yields an interference between the background (modifier or not) and the probability to generate a selective sweep. However, this general case being extremely difficult to model, the two extremes are considered instead (hyps. 1 and 2). In the following the methods employed are detailed only for the first hypothesis; the analysis of the ruby in the rubbish case is detailed in APPENDIX A.
Under hypothesis 1, conditional on the presence of a selective pressure, the expected probabilities that in generation t the modifier {in expected frequency
} and wild-type subpopulations generate an adaptive mutation destined to fix are, respectively,
![]() | (15) |
![]() | (16) |
is the average fixation probability of an advantageous mutation occurring in the modifier or wild-type background. | MODELS AND ANALYSES |
|---|
|
|
|---|
Here we present models valid for weak selection (
). Recall that the effect of deleterious mutations on the frequency of modifiers (Equation 12) is valid only at mutationselection balance. Selective sweeps must therefore not occur before this equilibrium is reached by the modifier subpopulation. Our model is then valid if selective sweeps are rare and if the cost of deleterious mutations is relatively high (rapid convergence to mutationselection equilibrium). Remember finally that, in this article, a selective sweep is assumed to be the total fixation of a single mutant (see also JOHNSON 1999a), which is valid only if population size is small with regard to the average advantageous mutation rate (
).
Leigh's model:
LEIGH (1970) assumes that, each generation, a selective sweep can occur with a fixed probability K. The overall probability that a selective sweep begins in generation t is
(Equations 15 and 16). Let us take a modifier gene in total frequency p in any focal generation, and let us assume that a selective sweep begins in this focal generation; the expected frequency of the modifier at the end of the selective sweep (conditional on the occurrence of the sweep) is
. From Equations 15 and 16, this yields
![]() | (17) |
is the average mutation rate per base pair in the population. This is the classic result of LEIGH (1970, 1973), derived also in JOHNSON (1999a), giving the expected variation of the frequency of a modifier owing to one selective sweep. If the modifier is rare (
), then the average mutation rate is controlled only by wild types (
) and Equation 17 is simplified to
.
LEIGH (1970) aims at predicting the expected variation in a modifier's frequency during a very large number of generations T, where exactly KT sweeps occur (see also JOHNSON 1999a). The expected frequency of the modifier should be given simply by the product of selection due to deleterious mutations along the T generations times the selection owing to the KT selective sweeps. The difficulty of this approach comes from the fact that selection on the modifier depends upon its own frequency, which varies in a stochastic manner along the T generations. To circumvent this problem, LEIGH (1970) considers only infinite populations. Therefore, weak-effect modifiers can be assumed to stay rare in the entire period of study. Note that in JOHNSON (1999a), population size is finite, but recombination is assumed to be frequent between modifiers and beneficial mutations; therefore, the modifier remains rare during the
generations as well. In LEIGH (1970), the expected frequency of the modifier in generation T is given by
![]() | (18) |
In this model, the delay in the cost of deleterious mutations does not need to be considered but only the long-term, per-generation, variation in the modifier's frequency owing to deleterious mutations does. Indeed, replacing the effect of deleterious mutations
(Equation 18) by
(from Equation 12) would be equivalent merely to a variation of the initial frequency of modifiers. As stated in the Introduction, from Equation 18, LEIGH (1970) finds the evolutionarily stable mutation rate, as given by
(see also JOHNSON 1999a).
However, in an asexual finite population, if each sweep is the fixation of a single advantageous mutation, then the modifier's frequency after each selective sweep is either one or zero. As a result, after the first selective sweep, mutation-rate polymorphism is lost and selection is absent. Therefore, the effect of indirect selection on the modifier's frequency cannot be measured as
and
in all generations, even for a weak (or even neutral) modifier. In other words, selection must not be measured through its long-term effect on the modifier's frequency (Equation 18; LEIGH 1970; JOHNSON 1999a) but through its effect on the probability that the modifier is fixed or lost during the first selective sweep. Our approach is therefore very different from that of LEIGH (1970) and JOHNSON (1999a), as described in the following section.
Model Aconstant rate of selective sweeps:
In a first model, let us follow LEIGH (1970) and JOHNSON (1999a) and assume that, each generation, a selective sweep can occur with a fixed probability K.
After its introduction in generation 0, the modifier is rare and its expected frequency varies according to Equation 12. After an infinite number of generations, a selective sweep has necessarily occurred; hence the modifier is either fixed or lost. The modifier's expected frequency,
, is then given by the probability that it is fixed,
. Let us first derive Ft (and Gt), the probability that the modifier is fixed (and lost), in generation t and then find
as
.
The modifier is assumed rare in generation 0 (and of relatively weak effect); therefore, at any generation t, preceding the first selective sweep, we have
and
. The average mutation rate in the population is controlled only by wild types, and the probability that the modifier fixes during the first selective sweep is
, as described above (Equation 17). The probabilities of fixation (Ft) and loss (Gt) of the modifier are given by the following recurrence equations:
![]() | (19) |
Finally, from the hypothesis of the modifier's rarity and relatively weak effect, the probability that the modifier is fixed in any generation is negligible relative to the probability that it is lost (
). In other words, the model describes a situation where the modifier is lost most of the time but occasionally goes to fixation from its original low frequency. The recurrence equation for Gt therefore simplifies to
and Gt is given by
, the probability that at least one selective sweep has occurred in t generations. As a result, the recurrence equation for Ft becomes
. The limit
can therefore be expressed as a sum from 0 to
. If the modifier is of weak effect, then
, and this sum converges, giving
![]() | (20) |
is the expected number of descendants, after an infinite number of generations, of an individual modifier with per base pair mutation rate
in a resident population with per base pair mutation rate
. One can verify that
; e.g., the expected number of descendants of a neutral variant is one. A modifier of mutation rate is considered favored by natural selection if its ultimate fixation probability is higher than the fixation probability of a neutral allele, e.g.,
, and disfavored if
.
Developing
into a Taylor series around
gives
![]() |
![]() |
![]() | (21) |
If
any rare mutant with small
increases in frequency in expectation (e.g., selection favors a higher mutation rate), while if
mutants increase in frequency if
(e.g., selection favors a lower mutation rate).
If
then the resident mutation rate
is a singular strategy (GERITZ et al. 1998); the second-order derivative of
must be taken into account in the Taylor development, giving
![]() |
![]() |
) is favored by natural selection; i.e., the singular strategy is not evolutionarily stable. In contrast, if
, then any nonneutral modifier is counterselected; e.g., the singular strategy is an evolutionarily stable strategy (ESS) (MAYNARD-SMITH 1982).
From Equation 21,
; if the resident's mutation rate is lower than a threshold, then the direction of selection on mutation rate is positive. In contrast,
; when the resident's mutation rate is higher than the threshold, selection favors a reduction of mutation rate. Evolution will therefore tend to bring the resident's mutation rate toward a convergence stable strategy (GERITZ et al. 1998)
, which is found as the unique solution of
from (21), giving
![]() | (22) |
First, note that, owing to the delay, the convergence stable mutation rate decreases with the cost of deleterious mutations,
(Figure 2). Second, let us consider the effect of
, the rate of selective sweeps. For low K relative to the cost of deleterious mutations (
), the genomic mutation rate toward which evolution is converging is approximately equal to the rate of sweep (
), which resembles the classic result of LEIGH (1970). However, if selective sweeps are not extremely rare relative to the effect of deleterious mutations, then (i) the expected amount of time open for selection in favor of the reduction of the mutation rate is short, and (ii) a newly introduced modifier with large mutation rate is likely to experience a sweep before having paid entirely the cost of generating deleterious mutations. For these two reasons, as selective sweeps become more and more frequent, the weight of deleterious mutations in the fate of modifiers becomes increasingly negligible, especially if deleterious mutations are of weak effect. As a consequence, for relatively large K, the mutation rate toward which evolution is converging is higher than that predicted by LEIGH (1970).
|
|
is not an evolutionarily stable mutation rate (MAYNARD-SMITH 1982; DIECKMANN 1997; GERITZ et al. 1998). The second-order derivative of
in
is
![]() |
, selection favors any nonneutral modifier. Hence evolution converges to the singular strategy
, but then any modifier can invade. We will discuss more generally the instable nature of mutation rate in a subsequent part of this analysis.
Under hypothesis 2 (ruby in the rubbish) we obtain very similar results (see APPENDIX A): the singular strategy is given by
and the second-order derivative of
in
is positive (not shown). The singular strategy is convergence stable but not ESS stable. Surprisingly, here the cost of deleterious mutations has no effect on the fate of modifiers. This is because (i) the first cost of deleterious mutationsa reduction of the efficient initial frequency of modifiers by a factor
is the same as that paid by wild types, and (ii) the second cost of deleterious mutationsa factor
each generationis then paid by the modifier immediately after introduction, whatever is the effect of deleterious mutations.
Models B and Cinternal control of the rate of selective sweeps:
The preceding model assumes, as in LEIGH (1970), that the average probability of selective sweep per generation is a constant, which is valid if the rate of environmental changes is the only limiting factor. However, in a finite population, the frequency of selective sweeps should also be limited by the generation of adaptive mutations. Therefore, the rate of selective sweep K should depend itself on the mutation rate established in the population.
To take this mechanism into account, a new model is built. A population is fixed for a resident mutation rate
and a modifier with mutation rate
is introduced. Let us assume that the population is undergoing a selective pressure (i.e., some mutations are adaptive) with a probability S in each generation (the value of S will be calculated later on, in two submodels). Conditional on the existence of this pressure, and making the same assumptions as in model A (modifier initially rare and of relatively weak effect), the population has a probability
to generate a selective sweep in each generation, where we recall that
is the population size,
is the fixation probability of favorable mutations, and l is the number of base pairs where mutations are favorable. Therefore, the overall per generation probability of occurrence of a selective sweep is the product
.
Following the approach of model A, we then derive the probability that the modifier is ultimately fixed in the population,
, by writing recurrence equations. These equations are shown to be identical to Equation 19 with the constant rate of selective sweep, K, replaced simply by the product
(not shown). Therefore, if only weak-effect modifiers are present, the direction of selection on mutation rate is given by
![]() | (23) |
is not a constant but actually depends on the mutation rate itself.
Model Becological scenario:
This model describes a population undergoing occasional environmental changes, with a probability
per generation (
). The population may then adapt by the fixation of one adaptive mutation. Successive environmental changes do not add up: if adaptation to a first change has not occurred before the next change takes place, the second change effaces the first and only one selective sweep is required for adaptation. Before the modifier is introduced in the population, the adaptive mutation rate is
, and the probability that the population adapts in any generation is
(conditional on the presence of a maladaptation). At balance between environmental changes and adaptation, the probability per generation that the population is maladapted is found as
![]() | (24) |
Equation 24 remains true if a rare modifier of relatively weak effect is present in the population; it can therefore be introduced as an expression for S in Equation 23, and the direction of selection on mutation rate can be derived after simplification as
![]() | (25) |
Slow environmental changes:
First, we consider the case where the environment is changing rarely
:
), then
for
and
for
,
being the only solution of
, giving
![]() | (26) |
in
,
, is strictly positive as long as
(not shown); hence
is convergence stable but not ESS stable. The model is valid only when each sweep is the fixation of one advantageous mutant and never more; this should be true around
also for the model to be correct, This implies that
, which gives
. This condition will be fulfilled, even for large populations, provided that the rate of environmental changes is low and the effect of deleterious mutations is not too weak. The results here are qualitatively similar to the results found with a constant rate of selective sweeps (model A). Evolution converges to a mutation rate
, but then any weak modifier is favored.
), then
for all
; selection always favors weak-effect modifiers with lower mutation rate. Small-step evolution converges to
, which is a locally stable strategy as shown by the analysis of
(not shown).
In conclusion, let us sum up the effect of population size in the case of slow environmental changes. Supposing that only weak-effect modifiers are present, evolution should tend to move the mutation rate close to a value
, which is nil if
and strictly positive if
. The threshold effect of population size is illustrated in Figure 4a. As population size increases, the convergence stable mutation rate fits more and more the rate of environmental changes (if
, then
, as in LEIGH 1970).
|
), the less influential are adaptive events and hence population size in the evolution of the mutation rate. This is illustrated in Figure 4. When the cost of accuracy is present (Figure 4, b and c), the effect of population size on
becomes continuous and moderate.
Rapid environmental changes:
Second, we consider the case where the environment is changing frequently [
].
) then
for all
; in this case selection always favors a higher mutation rate, until infinity. This result is obtained also when the cost of accuracy is taken into account. The reason is that the higher is the resident mutation rate, the higher is the probability of selective sweep, and thus the lower is the importance of selection against deleterious mutations. This positive feedback yields the infinite escalation of mutation rate. However, the model will then depart from its own limits of validity, as the number of mutants implied in a selective sweep will become higher than one. In other words, the generation of polymorphism will become less limiting in the rate of selective sweeps. For that reason, the frequency of sweeps will stop increasing linearly with mutation rate and will begin to saturate. This effect has been called "clonal interference" in asexuals by GERRISH and LENSKI (1998), even though it is strictly equivalent to what was previously known as the HillRobertson effect (HILL and ROBERTSON 1966). Such saturation should reduce the positive feedback of the rate of sweeps on the evolution of mutation rate and probably yield a stabilization of mutation rate. The evolutionary outcome of the system in this case is impossible to predict from our model. Note that the attained mutation rate should not maximize any global parameter of the population such as the rate of adaptation or the average relative fitness.
), then
![]() |
), while
for all
. Furthermore we have
and also
![]() |
is an unstable singular strategy (neither convergence stable nor ESS stable). If the initial mutation rate is lower than
, then small-step evolution converges to
, which is the local evolutionary equilibrium of the system. In contrast, if the initial mutation rate is higher than
, then selection favors an ever-higher mutation rate, because the rate of sweep is accelerating. Here also, the same results are obtained when the cost of accuracy is taken into account. The mere difference, again, is that the mutation rate never goes down to zero but reaches instead the lowest attainable mutation rate (such as in Figure 4, b and c). All results are qualitatively equivalent under hypothesis 2 as well (see APPENDIX A for more details).
Model Cpermanent selective pressure:
This last model describes a population that is never adapted to its environment, because environmental changes are very frequent and/or imply too many selective sweeps. The probability that a selective pressure is present in any generation is therefore
, and the direction of selection on mutation rate can be derived from Equation 23 (not shown).
The outcome of this model is the same as that of Model B in the case of rapid environmental changes [
]. If
, then selection favors an ever-higher mutation rate. Such as in model B, the model will then depart from its own limit of validity as several independent mutants will sweep together. If
, then the system is bistable with a threshold resident mutation rate
(not shown). Here also, the results are qualitatively identical when the cost of accuracy is taken into account, except that, again, the mutation rate never reaches zero but only the lowest attainable value. All results are qualitatively equivalent under hypothesis 2 as well (see APPENDIX A)
Cost of accuracy and stability:
We introduced the cost of DNA replication, as a function of mutation rate, in all the above models (models A, B, and C, each with hypotheses 1 and 2). Apart from the results already mentioned, the cost of accuracy has another important effect. When the cost of accurate replication is taken into account and is large enough (large f), we showed numerically that the second-order derivative of the fitness function is then always negative around
. In other words, the cost of accuracy stabilizes the evolutionary trajectory of mutation rate at an ESS. The antagonistic interplay of adaptation vs. deleterious mutations cannot lead to a stabilization of mutation rate, while the interplay of deleterious mutations vs. accuracy costs can. However, the significance of this stability must be tempered, as we see in the following section.
The instability of mutation rate and the emergence of mutators:
Our model applies primarily for weak-effect modifiers, but interestingly it can also apply under certain conditions to the case of strong-effect modifiers. Indeed, the basic assumption required for the model to work is that the average mutation rate in a population is controlled merely by resident individuals and is not influenced by modifiers. This hypothesis is always valid for weak-effect modifiers. Further, it may remain valid also in the case of strong-effect modifiers under two conditions. First, the modifiers must be rare when they first appear. Second, they must increase mutation rate, so that they always diminish in frequency after appearance, until the next selective sweep.
Let us then consider a modifier increasing the mutation rate by
. In model A, under hypothesis 1, the expected number of descendants of such a modifier is given by
![]() |
, where
is the initial frequency of modifiers].
Interestingly, when the modifier has a strong effect (large
), then its fitness can be approximated by
, which increases monotonously with
. This result also holds in all models (A, B, and C, with both hypotheses, whatever may be the cost of accuracy). Therefore, in all cases with no exception, as modifiers become stronger and stronger, their probability of fixation increases monotonously. In consequence, a sufficiently strong modifier always exists whose probability of fixation is significant. More precisely, in evolutionary terms, modifiers whose fitness is greater than one exist. In consequence, even if a locally stable singular strategy exists, there never is a globally stable mutation rate.
Strong-effect modifiers increasing mutation rate are called mutators (MILLER 1996). In model A, we can show that the minimal relative strength [
] required for a mutator to be favored is of the order of magnitude of 1/K, the number of generations needed for a sweep to arise. Therefore, in practice, mutators may arise only when the rate of selective sweeps is large enough. More interestingly, in model B, when population size is very small, mutators may theoretically emerge but they must have an unreasonably strong effect to do so. Therefore, in practice, they may emerge only in large populations. In other words, assuming that the effects of modifiers have any given maximum, then the larger the populations are, the more likely they are to be invaded by mutators, while a small population will remain stabilized at
.
| DISCUSSION |
|---|
|
|
|---|
The evolution of mutation rate is affected by the combination of three factors: the cost of exact replication, the cost of deleterious mutations, and the advantage of beneficial mutations. Since beneficial mutations ultimately reach fixation, they cause in the same time the fixation of all other polymorphisms (MAYNARD-SMITH and HAIGH 1974); this is a selective sweep. In particular, selective sweeps can cause the fixation of the modifier gene itself. This is especially likely if the modifier increases mutation rate and modifier individuals thus generate beneficial mutations at a larger rate than wild types. In this article, we analyzed the outcome of each of these forces (cost of accuracy, deleterious mutations, and adaptive mutations) on the fate of modifiers and on the long-term evolution of mutation rate.
Instability:
A previous classical analytic study of mutation rate evolution in asexuals (LEIGH 1970) showed that evolution leads to an intermediate evolutionarily stable mutation rate (ESS), such that the overall genomic mutation rate (
) is exactly equal to the rate of adaptation events (i.e., selective sweeps). In other words, selection leads to a mutation rate perfectly fitting the pace of environmental changes. The results we obtained in this article share similarity with those of LEIGH (1970). We showed that the effect of selection on mutation rate leads to a convergence stable mutation rate. In other words, when the resident mutation rate is below (above) a certain level, any modifier with a larger (smaller) mutation rate is favored. However, this convergence stable mutation rate differs from the ESS found by LEIGH (1970) in two respects.
First, the overall genomic mutation rate at convergence stability indeed increases with the rate of selective sweep but, in contrast to that in LEIGH (1970), it is not equal to but always larger than this rate. In other words, selection does not lead to a perfect fit between genomic mutation rate and the pace of environmental changes. Second, and more importantly, the convergence stable mutation rate is not globally stable. When it is fixed in a population, modifiers with a larger-than-resident mutation rate that are able to invade the population always exist. Further, in the case where the cost of replication accuracy is low, the convergence stable mutation rate is not even locally stable. When it is fixed in a population, any modifier of mutation rate is favored by selection.
How can these two discrepancies between the present model and LEIGH's (1970) model be explained? An assumption, differing between both models, is actually the key to these differences: the hypotheses on population size. In LEIGH (1970), population size is infinite. In consequence, each selective sweep corresponds to the fixation of an infinite number of independent mutations. At the end of the sweep, the modifier is neither fixed nor lost and its frequency is not even strongly affected by the sweep. In contrast, we consider populations of finite and "reasonable" sizes: each selective sweep is the fixation of a single independent mutation (see also JOHNSON 1999a). In this case, at the end of a sweep, the modifier and the resident are not segregating anymore: one of them is fixed. This dramatic event constitutes a novel mechanism acting on mutation rate evolution. Consider, for instance, a modifier with a lower-than-resident mutation rate. This modifier benefits from replication accuracy only during the period preceding the selective sweep. Afterward, it is either fixed or lost and thus not affected anymore by selection. In other words, the occurrence of selective sweeps, and hence of strong population bottlenecks, not only favors a larger mutation rate, but also cancels out the advantage of replication accuracy. This relaxation of selection, due to bottlenecks, explains the finding of a larger convergence stable mutation rate than that in LEIGH (1970). In addition, modifiers with a larger-than-resident mutation rate pay only the cost of deleterious mutations in the phase preceding the next selective sweep. In consequence, modifiers with a very strong mutation rate have an opportunity to arise and play a destabilizing role, which is not the case in LEIGH (1970) as they are selected against in all generations.
The instability of mutation rate has actually an important consequence. If potential modifiers all have weak effects, then the evolutionary trajectory of mutation rate should not be very different from what is expected in the case of evolutionary stability. Mutation rate should converge toward the convergence-stable value and stay in this region with only minor nondetectable variations. However, if strong-effect modifiers can also be generated, they may strongly affect the evolutionary trajectory of mutation rate, as they can be favored at any time, even if the population reaches local stability. In consequence, the evolutionary trajectory of the mutation rate is strongly dependent on its genetic control. In fact, this is perfectly concordant with empirical data on mutation rate evolution in bacteria. During phases of adaptation, bacterial populations are invaded by strong-effect modifiers called mutators, with a mutation rate 1010,000 times larger than that of wild types, owing to a deficiency in a DNA reparation enzyme (MILLER 1996). Such mutators have been observed repetitively both in vitro and in vivo (LECLERC et al. 1996; SNIEGOWSKI et al. 1997, 2000; MANSKY and CUNNINGHAM 2000; OLIVER et al. 2000; DENAMUR et al. 2002; RICHARDSON et al. 2002; SHAVER et al. 2002). Our theoretical results now suggest that the emergence of these mutators is not an anecdotic phenomenon, but is instead an inevitable and universal outcome of adaptation in asexuals (see also SNIEGOWSKI et al. 1997; TADDEI et al. 1997; KESSLER and LEVINE 1998; TENAILLON et al. 1999; TRAVIS and TRAVIS 2002; TANAKA et al. 2003; TANNENBAUM et al. 2003).
Further, from results such as LEIGH's (1970) it has often been believed that, in asexuals, the conjunction of the need for adaptation and the necessity to preserve the genome could lead to an intermediate "optimal" mutation rate, perfectly fit to the rate of environmental changes. This idea was even sometimes used to explain the finding of a constant genomic mutation rate among DNA microbes (DRAKE 1991; see also ORR 2000). Our model shows that this is probably not an accurate explanation. In asexuals, the need for both adaptation and genome preservation is not an evolutionary force that can stabilize mutation rate at an intermediate optimum. The cost of accurate DNA replication is a more likely force yielding a stabilization of mutation rate. Further, in the situations where this stabilizing cost is important, then the actual influence of adaptation on mutation rate evolution becomes negligible (see Figures 3 and 4). Therefore, when adaptation has a significant role, it primarily destabilizes mutation rate and yields the emergence of mutators during adaptation phases. In addition, in sexual species, JOHNSON (1999a) and SNIEGOWSKI et al. (2000) suggested that the effect of adaptation on mutation rate evolution was unlikely to be important because of recombination. Therefore, it seems that, in both sexuals and asexuals, intermediate stable mutation rates cannot be explained by the effect of adaptation but more probably by the joint effects of replication cost and the need for genome preservation (see, for instance, DAWSON 1998, 1999).
LEIGH (1970) considers populations of infinite sizes. We consider instead populations with finite and "reasonable" sizes such that each selective sweep is the fixation of a single mutation. What might happen in intermediate cases where a finite number of independent adaptive mutations generate selective sweeps? Note, first, that adaptive mutation rates are usually relatively small. For instance, measuring the rate of evolution in Escherichia coli populations, ROZEN et al. (2002) and GERRISH and LENSKI (1998) estimated that the numbers of advantageous mutations per genome per replication were, respectively, 
and
(see also WILKE 2004). Therefore, the independent generation of several adaptive mutations occurs only in very large populations. However, this may occur for sure in certain cases and is indeed very difficult to model analytically. After each selective sweep, the modifier reaches an intermediate frequency, very different from the initial one, and highly stochastic. Subsequently, the effect of selection depends on this frequency, until the next sweep, and so on. We cannot make a general prediction on the evolution of mutation rate in this case, apart from the obvious statement that the larger population size gets, the closer to LEIGH (1970) the outcome will be. TADDEI et al. (1997), TENAILLON et al. (1999), and TANAKA et al. (2003) actually performed simulations in this general case. But simulations allow considering only the dynamics of a given modifier in a given resident population and not the long-term evolution of mutation rate.
Delayed cost:
In classical views (e.g., LEIGH 1970; JOHNSON 1999a), the quantitative cost of deleterious mutations does not affect the evolution of mutation rate. Let us briefly explain why. Consider a lineage with a given mutation rate, and thus a given mutational load, and calculate the average fitness of individuals in that lineage. Deleterious mutations with strong cost remain in small frequency in the lineage, while those with moderate cost become frequent. Owing to this compensation, the mutational load in a lineage depends only on mutation rate and is not affected by the quantitative cost of each deleterious mutation (KIMURA and MARUYAMA 1966)