Abstract

Understanding adaptation in changing environments is an important topic in evolutionary genetics, especially in the light of climatic and environmental change. In this work, we study one of the most fundamental aspects of the genetics of adaptation in changing environments: the establishment of new beneficial mutations. We use the framework of time-dependent branching processes to derive simple approximations for the establishment probability of new mutations assuming that temporal changes in the offspring distribution are small. This approach allows us to generalize Haldane’s classic result for the fixation probability in a constant environment to arbitrary patterns of temporal change in selection coefficients. Under weak selection, the only aspect of temporal variation that enters the probability of establishment is a weighted average of selection coefficients. These weights quantify how much earlier generations contribute to determining the establishment probability compared to later generations. We apply our results to several biologically interesting cases such as selection coefficients that change in consistent, periodic, and random ways and to changing population sizes. Comparison with exact results shows that the approximation is very accurate.

THE process of adaptation depends on the establishment of new beneficial mutations. To be successful, it is not sufficient that mutations simply enter a population; they have to survive an initial phase of strong random genetic drift during which they can be lost. Once a beneficial mutation rises to a sufficiently large number of copies, the strength of drift becomes negligible and selection drives them to fixation or maintains them at intermediate frequency. The probability that mutations survive loss has been called the probability of fixation, the probability of establishment, or the probability of invasion, depending on the context. This probability plays an important role in determining the rate of adaptation of populations (Gillespie 2000; Orr 2000).

Fixation probabilities of beneficial, neutral, or deleterious mutations have been at the core of population genetics theory since the early days of the field. Traditionally, two alternative frameworks have been used: branching processes (Fisher 1922) and diffusion approximations (Kimura 1962). Branching processes allow for the derivation of simple analytical results. Simulations show that these results are accurate, provided the population size is sufficiently large such that N s ≫ 1. The disadvantage of this approach is that it is limited to beneficial and initially rare mutations. Diffusion approximations on the other hand are more powerful and flexible. One can study deleterious or neutral mutations of arbitrary initial frequency. The downside is that it is often impossible to obtain simple analytical results and the underlying assumptions are often less intuitive than in branching processes.

Using branching processes, Haldane (1927) derived his famous result that the probability of fixation of a beneficial mutation is approximately twice its selective advantage. Haldane assumed Poisson-distributed offspring and a small, constant selection coefficient. Later, Haldane’s result was generalized to arbitrary offspring distributions (see Bartlett 1955; Haccou et al. 2005; Lambert 2006).

The impacts of several genetic and ecological factors on the probability of fixation have been investigated (reviewed in Patwa and Wahl 2008). These include the effects of population structure (Whitlock 2003) and spatial heterogeneity (Whitlock and Gomulkiewicz 2005), interference due to selection on linked loci (Barton 1995; Hartfield and Otto 2011) or due to epistatic interaction (Takahasi and Tajima 2005), and changing population sizes (Ewens 1967; Otto and Whitlock 1997; Wahl and Gerrish 2001; Orr and Unckless 2008). In all of these studies, the selection coefficients are assumed constant in time but other aspects of the environment changed over time or space.

Most studies of time-dependent selection have focused on a stationary distribution of selection coefficients, which, at least in a probabilistic sense, can be interpreted as a time-homogeneous case (Jensen 1973; Karlin and Levikson 1974; Takahata et al. 1975; Huillet 2011). Ohta and Kojima (1968) and Kimura and Ohta (1970) studied models with gradual change in selection coefficients. Their derivations are quite general but require solutions to differential equations that can be obtained only in special cases. Using branching processes, Pollak (1966) studied some aspects of the effect of periodically or consistently changing selection coefficients on the probability of fixation.

Two recent articles have investigated the fixation of beneficial mutations when there is an explicit trend in the change of selection coefficients and population sizes. Uecker and Hermisson (2011) study a continuous-time birth–death process with time-dependent parameters and find results for the probability of fixation and the expected time to fixation. Their approach, based on results from Kendall (1948), is general and flexible but typically requires numerical evaluations. Waxman (2011) uses a diffusion approach that assumes arbitrary changes in population size and selection coefficients for a fixed number of generations, after which the environment stays constant. Again, numerical methods are necessary to calculate fixation probabilities. The strengths and weaknesses of these two approaches are discussed in Wahl (2011).

Despite these recent advances, many questions remain unresolved. No analytical results are available for stochastically changing environments if there is trend and autocorrelation. More generally, no simple analytic results are available for general patterns of environmental change. Such results are needed to improve our understanding of the genetics of adaptation and to guide future studies, both theoretical and empirical.

This article presents the first analytical approximation for the probability of establishment in arbitrarily changing environments. The foundation is a novel approach for studying time-dependent branching processes that assumes environmental fluctuations are small but is general otherwise. Temporal fluctuations can be deterministic or stochastic, and the source of variation in the strength of selection and drift is arbitrary. We also allow for different types of offspring distributions in different generations and variation among offspring distributions within generations. Our main result is the generalization of Haldane’s result for the probability of fixation to arbitrary stochastic or deterministic change in environments. We show that probability of establishment is ∼2se, where se is a weighted average of the selection coefficients across generations. We use this result to calculate the probability of establishment in several biologically interesting cases and compare these approximations to simulations.

Model and Background

Branching process methods have been studied extensively in population genetics (reviewed in Haccou et al. 2005; Patwa and Wahl 2008). Before we present our results we give a brief outline of the basic results needed in our analysis.

We begin by considering an infinitely large population at demographic equilibrium. Generations are discrete and nonoverlapping. A resident allele is fixed and a single mutant allele is introduced at generation 0. Each copy of the mutant allele in the population produces a random number of descendant copies in the following generation (”offspring”) that is independent of the number produced by other mutant copies. We denote the mean number of offspring that a mutant allele leaves in generation n by μn and the selection coefficient by sn = μn − 1. (In a haploid population, sn is the relative fitness advantage of a mutant, while in a randomly mating diploid population, it is the relative fitness advantage of a heterozygote. Because the ultimate fate of the mutation is decided while mutant homozygotes are still rare, we can ignore their fitness.) The variance in offspring produced by a mutant allele is denoted Vn. There are two possible outcomes: the mutant allele dies out or it becomes permanently established. The probabilities of these two events are denoted 1 − p and p, respectively. We call the first outcome extinction and the second outcome establishment.

Let Xn(= 0, 1, 2, …) be the number of copies of the mutant allele in generation n and denote the number of offspring produced by copy i as ξn(i)(=0,1,2,), i = 1, … , Xn. Changes in the environment are reflected by the temporal variation in their distributions. The number of mutant copies in generation n is given by
Xn=i=1Xn1ξn1(i).
(1)
The strength of selection in generation n is given by the mean number of offspring μn and the strength of drift is determined largely by the variance Vn.
We assume that the ξn(i) are a family of independent identically distributed random variables and set ξn=ξn(i). We define the probability generating function (PGF) of ξn as
φn(x)=k=0fn,kxk,
(2)
where fn,k = P(ξn = k) is the probability that a mutant copy in generation n has k offspring. A branching process is characterized completely by the initial condition, i.e., the number of copies at generation 0, and a sequence of PGFs {φ0, φ1, φ2, …}.
A simple extension of classical results for branching processes in constant environments can be used to find the probability of extinction when the offspring distribution changes in time. Appendix A shows that this probability is
1p=limnφ0(φ1(φn(0))).
(3)
Since the φn are monotonically increasing functions, p exists and p ε [0, 1]. The nested structure of (3) makes calculation of p impossible in most cases. In the following, we assume that a positive solution of (3) exists. For instance, this is the case if sn > 0 for all n.
We next introduce the concept of a reference environment in which the offspring distribution remains constant in time. The PGF for that distribution is denoted φ* and its mean is denoted μ* = 1 + s*. We assume that μ* > 1. The probability of fixation in the reference environment, denoted p*, is then ∼2s*/V*, where V* is the variance of the offspring distribution (Bartlett 1955). The PGF in generation n can be written in terms of its deviation from the PGF of the reference environment:
δn(x):=φn(x)φ*(x).
(4)
We call δn the perturbation function of generation n. Each δn is a smooth (i.e., arbitrarily often differentiable) and bounded function mapping [0, 1] into [−1, 1]. We note that δn(1) = 0. To proceed further, we assume that temporal variation in the distribution of offspring is small, i.e., that maxx,k[δk(x)] and maxx,k[δk(x)]1.
Replacing φn by φ* + δn in (3) and expanding in a Taylor series (see Appendix B for details) yields
p=p*k=0δk(1p*)ωk+O(δ2),
(5)
where ω = φ*′(1 − p*). By O(δ2) we mean terms of order maxx,k[δk(x)2] and maxx,l,k[δl(x)δk(x)]. Because the mean number of offspring in the reference process is >1, it follows that 0 < ω < 1. Consequently, the sum converges for every series of environments because every δn is bounded and ω < 1.

The sum in Equation 5 is a weighted sum over the temporal change in the environment. Typically, δn(x) will be negative if μn > μ* and positive if μn > μ*. The contribution of δn is weighted by ωn. Because ω < 1, earlier generations have more weight than later generations. In addition, it is easy to see that ω increases with decreasing p* (Karlin and Taylor 1975). Consequently, if the probability for fixation is small, a large number of generations will have an impact on the probability of establishment. On the other hand, the fate of new mutations will be decided early for mutations with a high chance of fixation. Let pn* be the probability of survival until generation n in the reference environment. Then, ω is the rate of convergence of pn*p* (Karlin and Taylor 1975). In that sense, the reference environment is used to quantify the rate at which the contribution of individual generations declines with increasing n.

Note that Equation 5 approximates the probability of establishment for one series of environments, for instance if the change in environmental conditions is deterministic. Deriving the probability establishment in random environments is straightforward. The only random variables in Equation 5 are the δn(1 − p*) terms. (Note that δn is a random function.) Since the approximation is linear in δ, we can simply replace every δn(1 − p*) in Equation 5 by its expectation. This holds for every stochastic process {δn(1p*)},nε0+.

We next present our main result, which is a corollary of (5) (see Appendix C). Assume that selection is weak, i.e., s*, sn ≪ 1, and let
πn:=s*(1s*)n.
(6)
Because k=0πk=1, the set {πk} can be interpreted as a probability distribution. Further, the value of πn decreases in time. Now define the effective selection coefficient
se:=k=0πksk.
(7)
The probability of establishment can then be written simply as
p2seV*,
(8)
where V* is again the variance in offspring number in the reference environment. An alternative representation of this result is
pp*k=0sk(1s*)k.
(9)

When selection is weak and fluctuations are small, the only aspect of temporal variation that enters the probability of establishment is the effective selection coefficient. Haldane’s classic result is recovered immediately by setting sn = s. Because the π that determine se can be interpreted as probabilities, we can view the effective selection coefficient as a weighted average. Strongest weight is given to the mutant’s fitness in the first generation that it appears, and weights in later generations are given approximately by es*n, where s* is again the selection coefficient in the reference environment. A similar rate appears in Equation 10 of Uecker and Hermisson (2011) (see also Kendall 1948) and in Equations 7 and 12 of Otto and Whitlock (1997).

The weight πn is closely related to the expected number of copies of mutant alleles in generation n, which is given by
k=0n(1+sk)(1+s*)nes*n.
(10)
Thus, the weight that measures the contribution of generation n is inversely proportional to the expected number of mutants in generation n. This reflects the fact that the strength of drift decreases with increasing number of mutants.
An important aspect of our approximation is choosing a suitable reference environment. A natural way to define the reference environment is using the arithmetic mean of selection coefficients,
s*=limn1nk=0nsk,
(11)
and the average variance in offspring,
V*=limn1nk=0nVk.
(12)

In the next section we demonstrate that this choice generally yields very accurate results. In few cases, usually when selection is very weak over long periods, better results may be obtained by using a different reference environment. It is a difficult mathematical problem, however, to give a general guideline for the choice of an optimal reference environment.

Examples

We next illustrate how our results can be applied to particular cases of changing environments. In all examples, we compare our analytical approximation for the probability of establishment (Equation 8) to exact results obtained by numerical iteration of (3) and stochastic simulations (see Appendix E for details on the simulation methods). For large populations (>50,000 individuals), results from individual-based simulations coincide with the numerical solution of (3). For the sake of clarity, when the probability of establishment can be obtained numerically, we do not show results from simulations. The R scripts used for the simulations can be found in supporting information, File S1. All numerical results are obtained using Poisson-distributed offspring. Other distributions yield similarly accurate results (data not shown). Unless otherwise stated, we use the time average of selection coefficients (11) and variance in offspring (12) to define the reference environment.

Monotonic change

Most published studies on the genetics of adaptation rely on the idea that ecology and evolution can be decoupled (see Orr 2010 for a review). This separation of timescales may not always be appropriate. Here, we investigate the effect of a gradually changing environment.

We model the change in mean number of offspring of mutants from μ0 = 1 + s0 to μ = 1 + s by
μn=1+s(1eκn)+s0eκn,
(13)
where κ measures the speed of environmental change and defines an ecological timescale. Note that Equation 13 can describe monotonically increasing or decreasing fitness, depending on the values of s0 and s.
Equation 8 yields
p2s[1+s0s1eκ(1s)].
(14)
One immediate consequence of (14) is that extinction is certain if the final selection coefficient is negative.

Figure 1 shows comparison of (14) with results obtained by numerical evaluation of (3). The behavior of p as a function of κ can be described by distinguishing three different scenarios. If the timescale of selection is much faster than the timescale of temporal change (κs; κ < 0.001 in Figure 1), the probability of establishment is determined largely by the initial selection coefficient and is ∼2s0. If temporal change occurs on a much faster timescale than selection (κs; κ > 0.1 in Figure 1), temporal change has very little effect and p ≈ 2s. If environmental change and selection operate on similar timescales (κs; 0.001 < κ < 0.1 in Figure 1), p is intermediate between 2s0 and 2s.

Figure 1

Monotonic change in the environment. Solid curves show the analytic approximation (14), and the dashed curves show the numerical solution of (3). Parameter values are s = 0.01 and (from top to bottom): s0 = 0.02, 0.015, 0.005, and 0.

In general, Figure 1 shows that (8) is a very good approximation for the probability of establishment. If s0 and κs, we find, however, that the approximation underestimates the true probability of establishment. The reason for this seems to be that we approximate the decay of the impact of generations by (1 − s*)n. The exact weight of generation n depends, however, on the history of environments until generation n. Hence, if s0 and κs, we use a timescale of selection that is too fast, which leads to the observed underestimation of p. In this case, a better approximation may be obtained by using a smaller value for s*.

Cyclic variation

Populations often experience periodic fluctuations, for example, seasonal changes in humidity or temperature. Consider a sinusoidal change in the mean number of offspring,
μn=1+s+Δscos(nα+φ),
(15)
where s is the mean selection coefficient and Δs is the amplitude of the fluctuations. The parameter α controls the length of one cycle of fluctuations and the parameter φ determines initial selection coefficient s0. In this case, our approximation yields
p2s[1+Δs(1s)cos(αϕ)cos(ϕ)2(1s)cos(α)(1s)21].
(16)

Figure 2 shows comparison of this approximation with numerical calculations. As expected, the initial condition (determined by ϕ) is most important if fluctuations occur on a slower timescale as selection does (αs). In contrast, the effect of initial conditions vanishes and the mean selective advantage determines the probability of establishment if αs. For αs, the dependence on parameters of se is more complicated and both α and φ have a strong influence on p (see 0.001 < α < 0.1 in Figure 2).

Figure 2

Periodically changing selection coefficients. Solid curves show the analytic approximation (16) and the dashed curves the numerical solution of (3). Parameter values are s=0.01,Δs=0.005, and (from top to bottom) ϕ = 0, 3π/2, π/2, and π.

Random fluctuations in selection coefficients

Next, we study a stochastic first-order autoregressive model for environmental change: a so-called AR(1) process (Mills and Markellos 2008). Autoregressive processes are commonly used to model time series, for instance, patterns of climate change (Bloomfield and Nychka 1992; Hay et al. 2002).

We write the selection coefficients as
sn=(1ρ)s+ρsn1+εn,
(17)
where s is the mean of the process, ρ ε [0, 1] is the correlation coefficient of sn and sn−1, and εn is a white noise term with mean 0 and variance σ2. This process is stationary in the sense that the distribution of the process is time independent. The variance is given by σ2/(1ρ2). Note that strong autocorrelation increases the total variance of the process considerably.
If we condition the process on an initial value s0, the process becomes time dependent. In the conditioned process, the expected selection coefficient in generation n is given by
E[sn|s0]=s(1ρn)+s0ρn.
(18)
Note that (18) is equivalent to (13) with κ = −log(ρ). Thus, we can view this process as a stochastic version of (13). Figure 3A shows realizations of a conditioned process with strong autocorrelation ρ = 0.99. This process includes random noise (εn), autocorrelation (ρ), and dynamics (s0 and s).
Figure 3

The autoregressive model. (A) Examples of realizations with s0 = 0.02, s=0.01, σ = 0.001, and ρ = 0.99. The dashed curve shows the mean selection coefficient, and the solid curve shows a single realization. Shaded lines show 50 additional realizations. (B) Probability of establishment as a function of the initial selection coefficient s0. The solid curve shows the analytical approximation (19), solid dots show results from simulations, and the whiskers show 99% confidence intervals. The shaded dashed line shows the quadratic regression line of the simulated points. Values for the other parameters are as in A. (C) Establishment probability as a function of ρ for s0 = 0.02 (top) and s0 = 0 (bottom). Other parameters are as in A.

Calculation of the expected establishment probability yields
E[p|s0]=2s(1+s0s1(1s)ρ).
(19)
Here, the expectation is taken over all realizations of the process conditioned on s0. The effect of parameters on the probability of establishment is essentially the same as described in Monotonic change. This illustrates that autocorrelation can have a significant effect on the probability of establishment by amplifying the effect of initial conditions. Figure 3, B and C, shows comparison of the analytical approximation (19) with simulations. Again, our approximation provides an accurate prediction for the probability of establishment. The fit is slightly less accurate if s0 is small and ρ is large. The reason is that sn is close to zero (or negative) for long periods early on (cf. Monotonic change).

Variation within generations

So far we have assumed that the numbers of offspring of mutant copies that live in the same generation are independent and identically distributed random variables. It is straightforward to relax the second assumption and extended our results to include variation in the offspring distributions within generations. Such a model accounts for heterogeneity in environmental conditions within a generation (e.g., Levene 1953). An example of the time-independent case can be found in Karlin and Taylor (1975). Here, we briefly discuss the extension to the time-dependent case.

We assume that each copy of the mutant allele has Poisson-distributed offspring. The means of the offspring distributions are drawn independently from an arbitrary distribution with mean μn=1+sn. Assuming weak selection, Appendix D shows that the offspring of a randomly drawn copy in generation n is again Poisson distributed and the expected number of offspring is μn. Consequently, the probability of establishment is given by (8) with sn=sn. As an example, consider normally distributed selection coefficients with mean sn=s0eκn+s(1eκn) and variance σ2. This model is another stochastic version of (13). Figure 4 shows comparison of analytical predictions with simulation results. The example in Figure 4 illustrates that a consistent trend in the average strength of selection has a significant impact on the probability of fixation, even when variance among reproductive successes is large relative to the change in mean selection intensity from one generation to the next.

Figure 4

Variation within generations. (A) Illustration of a branching process with variation in offspring distributions within generations. The solid curve shows the mean selection coefficient sn with κ = 0.01, σ = 0.1, and s0 = 0.02, and shaded dots show selection coefficients of mutant copies for a single realization. (B) The expected selection coefficient sn on a different scale. (C) Probability of establishment as a function of the initial selection coefficient s0. The solid curve shows the analytical approximation, solid dots shows results from simulations, and the whiskers show 99% confidence intervals. Other parameters are as in A.

Changing population size

We next assume that the mutants have a constant advantage s over the resident. Environmental variation arises from changes in the population size of residents. Then, the mean number of offspring of a resident in generation n is given by Nn+1/Nn and that of mutants by
μn=Nn+1Nn(1+s),
(20)
where Nn denotes the population size of the residents in generation n. As an example, consider the case of a logistically growing or shrinking population. The demographic dynamics of the resident are given by the solution of the Beverton–Holt equation
Nn=KN0N0+(KN0)ern
(21)
(Beverton and Holt 1957), where r is the growth rate of the population, K the carrying capacity, and N0 the initial population size.

A comparison of the analytical approximation for p and numerical results is shown in Figure 5. In growing populations (N0/K < 1), the approximation is very accurate if s and r are of the same order of magnitude. If sr, however, sn is large relative to s* in early generations, which causes the approximation to underestimate the true probability of fixation considerably (data not shown). A better approximation can be obtained by using a larger value for s* (see bottom curve in Figure 5). In shrinking populations (N0/K > 1), sn will be very small (or negative) during early generations. This causes the approximation to underestimate the probability of establishment if the initial population size is much larger than the carrying capacity. For a more thorough treatment of fixation probabilities in populations of changing size we refer to Otto and Whitlock (1997).

Figure 5

Growing and shrinking populations. Parameter values are s = 0.02, 0.01, 0.01, and 0.001 (from top to bottom) and r = 0.01. In the bottom curve (s = 0.001) we used s* = 0.01 to define the reference environment.

Discussion

The adaptation of a population to environmental challenges is a critical evolutionary process. Sources of environmental variation are manifold and changes can occur on all timescales, ranging from instantaneous changes to transitions on geological timescales. Nevertheless, most studies of the genetics of adaptation rely on a separation of timescales of evolution and ecology (see Orr 2010, for a review). In many cases, this might be a severe oversimplification. Indeed, recent developments on the establishment of beneficial mutations in changing environments reveal that this separation is not always appropriate and can lead to qualitatively different results (see Uecker and Hermisson 2011). Furthermore, the effects on the probability of establishment of trend and autocorrelation have been ignored largely in studies of stochastically changing environments.

This article studies the probability of establishment of beneficial mutations in changing environments. We used an approach that assumes that temporal fluctuations are small, but is general otherwise. The new idea here is to describe the difference of two distributions by the difference of their probability generating functions and perform a perturbation analysis in this function. This approach leads to surprisingly simple yet general results.

Our main result is the generalization of Haldane’s classical result on the fixation of beneficial mutations to arbitrarily changing selection coefficients. Under weak selection, all aspects of environmental variation collapse into a single quantity: the effective selection coefficient se. This coefficient is defined as the selection coefficient in a constant environment that leads to the same probability of establishment as the original time-dependent process. The effective selection coefficient se is a weighted average over the selection coefficients in each generation, where the weights decrease monotonically over time. Thus the establishment probability is determined by the order in which environments are experienced as well as by their average effect (see Tănase-Nicola and Nemenman 2011 for a discussion of similar findings in deterministic models). The weights are approximately inversely proportional to the expected number of mutants in generation n. This makes sense because the strength of drift will be negligible if the number of mutants reaches a certain threshold (∼1/s*). Consequently, even small changes in selection coefficients from one generation to the next (on the order of s*2) can have a considerable effect on the probability of establishment (cf. Uecker and Hermisson, 2011).

These results apply to beneficial mutations in populations that are large such that Ns*1. To derive analytical results, we assumed that temporal fluctuations in the offspring distributions are small. A straightforward interpretation of this assumption is difficult because the calculation quantifies the strength of fluctuations using the difference of two probability generating functions. Our results suggest that our approximations hold if fluctuations of selection coefficients are at most of the same order as the strength of selection in the reference environment. This is in agreement with Ohta (1972), who showed that the ratio of mean and variance of selection coefficients can have an important influence on fixation probabilities, especially if this ratio is small.

An important part of our approach is the choice of a suitable reference environment to quantify the fluctuations in reproductive success of mutants. The reference environment should be chosen such that the contributions of the perturbation functions are minimized. It is, however, difficult to give a general guideline for the optimal choice of the reference environment. Indeed, finding a rigorous way to derive the optimal reference environment would be an interesting but challenging subject for future studies. The time average of selection coefficients and variance in offspring seems generally to yield simple and accurate analytical results (see Figures 15). When selection is very weak and the environment changes very slowly (relative to the timescale of selection), other choices may yield better results (see s0 = 0 in Figure 1 and s = 0.001 in Figure 5). As a guideline in experimental settings, our results suggest that one should consider at least 1/s0 generations to estimate the probability of establishment in changing environments.

To illustrate how the analytical approximations can be used in particular scenarios, we applied them to several biologically interesting cases. In particular, we studied examples of selection changing in consistent, periodical, or random ways. In the latter case, we also studied variation in expected reproductive success among mutant copies within a generation. We also briefly studied mutants with constant selective advantage in populations of changing size. The main focus was on examples with sn > 0, which ensures that a positive solution of (3) exists. In some examples, however, negative selection coefficients were allowed (see Figures 35). These examples illustrate that our approximations can yield accurate results if selection coefficients are negative occasionally (see Figures 3 and 4). In general, however, our approximation underestimates the probability of establishment if sns* for long periods in early generations (see Figures 1 and 5).

Our results shed light on the interaction of environmental and evolutionary dynamics. If the environment changes slowly relative to the timescale of selection, we can separate the timescales of environmental change and evolution (e.g., Orr and Unckless 2008). Conversely, if environmental change occurs at a faster timescale than selection, the mean of the selection coefficients determines the probability of establishment (see Figures 13). If selection and temporal change operate on similar timescales, one might naively expect that the probability of establishment is intermediate between the two extreme cases of slow or fast temporal change. Our results illustrate, however, that the dependence on parameters of the effective selection coefficient can be more complicated in such cases (see 0.001 < α < 0.1 in Figure 2). Thus, our results show that separating the timescales of environmental change and evolutionary dynamics is not appropriate if selection and temporal change operate on similar timescales. This conclusion also holds in stochastic environments. Even if expected fitness does not vary over time, autocorrelation can amplify the effect of the initial conditions on the probability of establishment (see Figure 3, B and C). If there is variation in fitness among mutant copies within generations, consistent change in expected selection intensity has a significant effect on the probability of establishment (see Figure 4). This holds even when the variance in fitness within generations is large relative to the change in selection between generations.

The results also reveal implications of environmental change on the distribution of selection coefficients of successful mutations. The timescale of selection is faster for mutations with large effects than it is for mutations with small effects. As a consequence, a consistently degrading environment has a less severe effect on the probability of establishment of mutations with larger effects. This effect is reminiscent of “Haldane’s sieve” (Haldane 1924; Turner 1981), which describes the advantage of dominant beneficial mutations over recessive ones. When environmental stress increases over time, we expect that this “sieve” will be severe and yield an overrepresentation of large mutational effects among successful mutations. Indeed, bias toward large-effect mutations is observed in many cases of adaptation to new environments (Orr and Coyne 1992).

Recent evolution experiments in deteriorating environments (Bell and Gonzalez 2009) led to results consistent with analytical predictions on evolutionary rescue (Gomulkiewicz and Holt 1995; Bell 2008; Orr and Unckless 2008). In these experiments an automated liquid handling system manipulated the selective environment of many hundreds of microbial populations. Such a system could be used to validate a general theory of adaptation and extinction in changing environments. Hence, a general theoretical framework is necessary to formulate broad predictions that can then be tested in experiments. Together with other recent advances (e.g., Campos and Wahl 2010; Uecker and Hermisson 2011), our results illustrate that branching processes are a powerful framework to describe the process of adaptation in changing environments. We hope that our results encourage future work, both theoretically and experimentally, on this important subject.

In summary, we studied the probability of establishment of initially rare beneficial mutations and derived analytical results using a new approach based on the assumption of small temporal fluctuations. Under weak selection, our results are surprisingly simple and provide intuitive insights on the interaction of evolutionary and environmental processes. It would be highly interesting to develop our approach further, e.g., to study the effects of large temporal fluctuations, to calculate approximations for the time until fixation and to allow for correlation in reproductive success among mutant copies.

We close by noting that branching processes have been successfully employed to model many other natural phenomena than the one studied in this work. These include the spread of infectious diseases (e.g., Lloyd-Smith et al. 2005), the growth of tumor cells (e.g., Bozic et al. 2010), and polymerase chain reaction (e.g., Sun 1995). Thus, the ideas we put forward in this study could be useful in a variety of other scientific areas.

Acknowledgements

We thank S. Otto, M. Whitlock, J. Hermisson, and H. Uecker for useful discussions and two anonymous reviewers for helpful comments on the manuscript. This work was funded by National Science Foundation grant DEB-0819901 (to M.K.).

Literature Cited

Bartlett
M
,
1955
Stochastic Processes, with Special Reference to Methods and Applications
.
Cambridge University Press
,
Cambridge, UK/London/New York
.

Barton
N
,
1995
Linkage and the limits to natural selection
.
Genetics
 
140
:
821
841
.

Bell
G
,
2008
Selection: The Mechanism of Evolution
.
Oxford University Press
,
London/New York/Oxford.

Bell
G
,
Gonzalez
A
,
2009
Evolutionary rescue can prevent extinction following environmental change
.
Ecol. Lett.
 
12
:
942
948
.

Beverton
R
,
Holt
S
,
1957
On the Dynamics of Exploited Fish Populations (Fishery Investigations Series II)
,
Vol. XIX
.
Ministry of Agriculture, Fisheries and Food
,
London, UK
.

Bloomfield
P
,
Nychka
D
,
1992
Climate spectra and detecting climate change
.
Clim. Change
 
21
:
275
287
.

Bozic
I
,
Antal
T
,
Ohtsuki
H
,
Carter
H
,
Kim
D
 et al. ,
2010
Accumulation of driver and passenger mutations during tumor progression
.
Proc. Natl. Acad. Sci. USA
 
107
:
18545
18550
.

Campos
P
,
Wahl
L
,
2010
The adaptation rate of asexuals: deleterious mutations, clonal interference and population bottlenecks
.
Evolution
 
64
:
1973
1983
.

Ewens
W
,
1967
The probability of survival of a new mutant in a fluctuating environment
.
Heredity
 
22
:
438
443
.

Fisher
R
,
1922
On the mathematical foundations of theoretical statistics
.
Philos. Trans. R. Soc. Lond. A Contain. Pap. Math. Phys. Character
 
222
:
309
368
.

Gillespie
J
,
2000
Genetic drift in an infinite population: the pseudohitchhiking model
.
Genetics
 
155
:
909
919
.

Gomulkiewicz
R
,
Holt
R
,
1995
When does evolution by natural selection prevent extinction?
 
Evolution
 
49
:
201
207
.

Haccou
P
,
Jagers
P
,
Vatutin
V
,
2005
Branching Processes: Variation, Growth, and Extinction of Populations
.
Cambridge University Press
, Cambridge, UK/London/New York
.

Haldane
J
,
1924
A mathematical theory of natural and artificial selection. Part II: the influence of partial self-fertilisation, inbreeding, assortative mating, and selective fertilisation on the composition of Mendelian populations, and on natural selection
.
Biol. Rev. Camb. Philos. Soc.
 
1
:
158
163
.

Haldane
J
,
1927
A mathematical theory of natural and artificial selection. Part V: selection and mutation
.
Math. Proc. Camb. Philos. Soc.
 
3
:
607
615
.

Hartfield
M
,
Otto
S
,
2011
Recombination and hitchhiking of deleterious alleles
.
Evolution
 
65
:
2421
2434
.

Hay
S
,
Cox
J
,
Rogers
D
,
Randolph
S
,
Stern
D
 et al. ,
2002
Climate change and the resurgence of malaria in the east African highlands
.
Nature
 
415
:
905
909
.

Huillet
T
,
2011
On the Karlin-Kimura approaches to the Wright-Fisher diffusion with fluctuating selection
.
J. Stat. Mech.
 
2011
:
P02016
.

Jensen
L
,
1973
Random selective advantages of genes and their probabilities of fixation
.
Genet. Res.
 
21
:
215
219
.

Karlin
S
,
Levikson
B
,
1974
Temporal fluctuations in selection intensities: case of small population size
.
Theor. Popul. Biol.
 
6
:
383
412
.

Karlin
S
,
Taylor
H
,
1975
A First Course in Stochastic Processes
.
Academic Press
,
New York/London/San Diego
.

Kendall
D
,
1948
On the generalized “birth-and-death” process
.
Ann. Math. Stat.
 
19
:
1
15
.

Kimura
M
,
1962
On the probability of fixation of mutant genes in a population
.
Genetics
 
47
:
713
719
.

Kimura
M
,
Ohta
T
,
1970
Probability of fixation of a mutant gene in a finite population when selective advantage decreases with time
.
Genetics
 
65
:
525
534
.

Lambert
A
,
2006
Probability of fixation under weak selection: a branching process unifying approach
.
Theor. Popul. Biol.
 
69
:
419
441
.

Levene
H
,
1953
Genetic equilibrium when more than one ecological niche is available
.
Am. Nat.
 
87
:
331
333
.

Lloyd-Smith
J
,
Schreiber
S
,
Kopp
P
,
Getz
W
,
2005
Superspreading and the effect of individual variation on disease emergence
.
Nature
 
438
:
355
359
.

Mills
T
,
Markellos
R
,
2008
The Econometric Modelling of Financial Time Series
.
Cambridge University Press
,
Cambridge, UK/London/New York
.

Ohta
T
,
1972
Fixation probability of a mutant influenced by random fluctuation of selection intensity
.
Genet. Res.
 
19
:
33
38
.

Ohta
T
,
Kojima
K
,
1968
Survival probabilities of new inversions in large populations
.
Biometrics
 
24
:
501
516
.

Orr
H
,
2000
The rate of adaptation in asexuals
.
Genetics
 
155
:
961
968
.

Orr
H
,
2010
The population genetics of beneficial mutations
.
Philos. Trans. R. Soc. B Biol. Sci.
 
365
:
1195
1201
.

Orr
H
,
Coyne
J
,
1992
The genetics of adaptation: a reassessment
.
Am. Nat.
 
140
:
725
742
.

Orr
H
,
Unckless
R
,
2008
Population extinction and the genetics of adaptation
.
Am. Nat.
 
172
:
160
169
.

Otto
S
,
Whitlock
M
,
1997
The probability of fixation in populations of changing size
.
Genetics
 
146
:
723
733
.

Patwa
Z
,
Wahl
L
,
2008
The fixation probability of beneficial mutations
.
J. R. Soc. Interface
 
5
:
1279
1289
.

Pollak
E
,
1966
Some effects of fluctuating offspring distributions on the survival of a gene
.
Biometrika
 
53
:
391
396
.

Sun
F
,
1995
The polymerase chain reaction and branching processes
.
J. Comput. Biol.
 
2
:
63
86
.

Takahasi
K
,
Tajima
F
,
2005
Evolution of coadaptation in a two-locus epistatic system
.
Evolution
 
59
:
2324
2332
.

Takahata
N
,
Ishii
K
,
Matsuda
H
,
1975
Effect of temporal fluctuation of selection coefficient on gene frequency in a population
.
Proc. Natl. Acad. Sci. USA
 
72
:
4541
4545
.

Tănase-Nicola
S
,
Nemenman
I
,
2012
Fitness in time-dependent environments includes a geometric phase contribution
.
J. R. Soc. Interface
 
7
:
1354
1362
.

Turner
J
,
1981
Adaptation and evolution in heliconius: a defense of NeoDarwinism
.
Annu. Rev. Ecol. Syst.
 
12
:
99
121
.

Uecker
H
,
Hermisson
J
,
2011
On the fixation process of a beneficial mutation in a variable environment
.
Genetics
 
188
:
915
930
.

Wahl
L
,
2011
Fixation when N and s vary: classic approaches give elegant new results
.
Genetics
 
188
:
783
785
.

Wahl
L
,
Gerrish
P
,
2001
The probability that beneficial mutations are lost in populations with periodic bottlenecks
.
Evolution
 
55
:
2606
2610
.

Waxman
D
,
2011
A unified treatment of the probability of fixation when population size and the strength of selection change over time
.
Genetics
 
188
:
907
913
.

Whitlock
M
,
2003
Fixation probability and time in subdivided populations
.
Genetics
 
164
:
767
779
.

Whitlock
M
,
Gomulkiewicz
R
,
2005
Probability of fixation in a heterogeneous environment
.
Genetics
 
171
:
1407
1417
.

Appendices

Appendix A: Probability of Survival in a Time-Dependent Branching Process (Equation 3 in the Main Text)

We here derive an expression for the probability of survival in a time-dependent branching process. Let Xn, n ε ℕ+ be the number of copies of mutant alleles in generation n and ξn(i) be the number of offspring of the ith copy in generation n, i = 1, …, Xn. The sequence of random variables {Xn} is then given by
Xn=i=1Xn1ξn(i).
(A1)
We denote the PGF of ξn(i) by
φi,n(x)=k=0fn,k(i)xk,
(A2)
where fn,k(i)=P(ξn(i)=k). The PGF of Xn is denoted Φn. We assume that the ξn(i) are independent but not necessarily identically distributed random variables. It follows that
Φn+1(x)=k=0P(Xn+1=k)xk
(A3)
=k=0l=0P(Xn=l)P(i=1lξn(i)=k)xk
(A4)
=l=0P(Xn=l)k=0P(i=1lξn(i)=k)xk.
(A5)
Note that k=0P(i=1Iξn(i)=k)xk is the PGF of i=1lξn(i). Because the PGF of a sum of independent random variables is the product of the PGFs of the random variables (e.g., Karlin and Taylor 1975), it follows that
Φn+1(x)=l=0P(Xn=l)i=1lφi,n(x)
(A6)
=l=0P(Xn=l)(φn(x))l,
(A7)
where
φn(x):=(i=1Xnφi,n)1/Xn.
(A8)
Consequently,
Φn+1(x)=φn(φn)
(A9)
and, by setting X0 = 1,
Φn+1(x)=φ0(φ1(φn(x)))).
(A10)
This is a key relationship for calculating the probability of ultimate survival, which is given by 1 − limn→∞Φn(0). We note that φn is a random function and depends on the particular realization of the process. Hence, to calculate the probability of establishment, one needs to calculate the expectation of φn. In Appendix D, we calculate this expectation for Poisson-distributed offspring and weak selection.
A simpler expression can be obtained if there is no variation in offspring distributions within generations. Then, for each n, the ξn(i) are a family of independent identically distributed random variables. We denote the offspring of a mutant copy in generation n by ξn. Further, we denote the PGF of ξn by
φn(x)=k=0fn,kxk,
(A11)
where fn,k = P(ξn = k). By observing that φn=φn, we immediately get
Φn+1(x)=φ0(φ1(φn(x)))
(A12)
from Equation A10. The probability of ultimate survival, p, is then given by
1p=limnφ0(φ1(φn(0))),
(A13)
which is Equation 3 in the main text.

Appendix B: Derivation of Equation 5

Here, we derive an approximation for the probability of establishment if there is no variation in offspring distributions within generations.

Let qn = Φn(0) denote the probability of extinction by generation n. It is straightforward to see that {qn} is an increasing sequence that is bounded by 1. Intuitively, this can be seen by observing that the probability of extinction in generation n + 1 has to be larger than the probability of extinction in generation n. More formally, it follows because the φn are monotonically increasing functions of x. Consequently, q = limn→∞  qn exists and q ε [0, 1]. The probability of establishment is then given by p = 1 − q.

We introduce a time-independent branching process, denoted reference process, and define
δn(x):=φn(x)φ*(x),
(B1)
where φ* is the PGF of the offspring distribution in the reference process. Since both φn and φ* are PGFs, it follows that δn is a bounded and continuously differentiable function of x ε [0, 1] and that δn(1) = 0. Furthermore, δn can be written as δn(x)=k=0ak,nxk with k=0ak,n=0. In most cases, δn(x) will be negative if μn > μ* and positive if μn > μ*. We note, however, that one can construct examples where μn > μ does not imply δn(x) < 0 for all x ε [0, 1].
Ignoring second- and higher-order terms in δn and δn we can write (A12) as
Φn(x)=φ*(n)(x)+δ0(φ*(n1)(x))ω0+δ1(φ*(n2)(x))ω1+δn1(x)ωn1,
(B2)
where φ*(n) is the n-fold composition of φ* with itself,
ωk={i=1kφ*(φ*(ni)(x))ifk>01ifk=0,
(B3)
and φ*′ denotes the derivative of φ*. A rigorous proof of (B2) can be obtained by induction.
We define
q*:=1p*=limnφ*(n)(0)
(B4)
and note that
0<φ*(0)<<φ*(n)(0)<<limnφ*(n)(0)=q*<1.
(B5)
Because φ*′(x) < 1 for all x ε [0, q*], it follows that
1>ω1>ω2>>0.
(B6)
Let qj,n, j < n, denote the first j + 1 terms of Φn(0),
qj,n=φ*(n)(0)+δ0(φ*(n1)(0))ω0+δ1(φ*(n2)(0))ω1+δj1(φ*(nj)(0))ωj1.
(B7)
Then, up to quadratic and higher-order terms in δ and δ′,
q=limnφn(0)=limjlimnqj,n.
(B8)
Since the δn and φn are smooth functions, it follows that
limnqj,n=q*+δ0(q*)ω0+δ1(q*)ω1+δ2(q*)ω2++δj1(q*)ωj1,
(B9)
where
ωk:=limnωk
(B10)
=i=1kφ*(q*).
(B11)
Consequently,
q=q*+δ0(q*)ω0+δ1(q*)ω1+=q*+k=0δk(q*)ωk,
(B12)
which is equivalent to Equation 5 in the main text.

Appendix C: Derivation of Equations 8 and 9

We assume weak selection, i.e., that s*, sn ≪ 1. It follows that q* ≈ 1 − 2s*/V* and
ωφ*(12s*V*)
(C1)
φ*(1)φ*(1)2s*V*
(C2)
=1+s*[V*+(1+s*)2(1+s*)]2s*V*
(C3)
1s*.
(C4)
We here used that φ*′(1) = 1 + s* and φ*″(1) = V* + (1 + s*)2 − (1 + s*).
By ignoring second- and higher-order terms in (x − 1), we can approximate the perturbation functions δn by
δn(x)δn(1)+δn(1)(x1)
(C5)
=(sns*)(x1).
(C6)
Inserting this into Equation B12 yields
pp*+p*k=0(sks*)ωk
(C7)
=p*(1+k=0skωkk=0s*ωk)
(C8)
=p*(1+k=0skωk1)
(C9)
=p*k=0sk(1s*)k,
(C10)
which yields Equation 9 in the main text.
We define πk = (1 − s*)ks* and recall that P* ≈ 2s*/V*. Then the above calculation yields
p2k=0πkskV*,
(C11)
which is equivalent to Equations 7 and 8 in the main text.

Appendix D: Variation Within Generations

We consider variation in the distribution of offspring among copies of mutant alleles that live in the same generation. Equation A10 shows that the PGF of a randomly drawn mutant copy in generation n is given by φn (Equation A8). Note that φn is a random function and depends on the particular realization of the process. Hence, we need to calculate the expectation of φn. In the following, we assume that each copy has Poisson-distributed offspring. The means of these distributions are drawn independently from another distribution that changes over time. Assuming weak selection, we calculate the distribution of the number of offspring of a randomly drawn copy in a given generation.

The PGF of the ith copy in generation n is given by
φi,n(x)=eμi,n(x1),
(D1)
where μi,n = 1 + si,n is the mean of this copy’s offspring distribution. Then,
φn(x)=(i=1Xneμi,n(x1))1/Xn
(D2)
=e(1/Xn)i=1Xnμi,n(x1).
(D3)
If selection is weak, i.e., si,n ≪ 1,
φn(x)=e(x1)(1+(x1)1Xni=1Xnsi,n)+O(si,n2).
(D4)
Let E[μi,n]=μn=1+sn. To calculate the expectation of (D4) we can simply replace si,n by its expectation:
E[φn(x)]e(x1)(1+(x1)sn).
(D5)
Finally, we observe that (D5) is, up to quadratic and higher-order terms in s, equivalent to the PGF of a Poisson distribution with mean 1+sn.

Appendix E: Simulation Methods

We simulated the spread of a new mutation in a finite population of haploid individuals. Generations are discrete and nonoverlapping. Each individual contributes a Poisson-distributed number of offspring to the next generation. The mean number of offspring of the ith individual carrying the mutation is denoted μn,i and that of a resident is νn. Population regulation follows the Beverton–Holt equation. The expected number of surviving offspring of a resident in generation n + 1 is given by
νn+1=er1+Nn(er1)/K,
(E1)
where r is the growth rate, K the carrying capacity, and Nn the total number of individuals in generation n. At demographic equilibrium νn = 1. The expected offspring of mutant copy i in generation n is
μn,i=νn(1+sn,i),
(E2)
where sn,i is the selection coefficient of mutant copy i in generation n. We used a carrying capacity of K = 50,000 and a growth rate of r = ln(2) (unless stated otherwise, see Changing population size). Other parameters were chosen as described in the main text. At the beginning of each simulation, a single mutant is introduced at generation 0. We stopped when the number of mutants reached zero (extinction) or 500 (establishment). Using larger threshold values did not change the outcome of the simulations for the parameter values used here. Simulations were performed in R. The R scripts are available in File S1.

Footnotes

Communicating editor: L. M. Wahl

Author notes

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/journals/pages/open_access/funder_policies/chorus/standard_publication_model)

Supplementary data