Genetics, Vol. 163, 467-474, February 2003, Copyright © 2003

Probability of Fixation of an Advantageous Mutant in a Viral Quasispecies

Claus O. Wilkea
a Digital Life Laboratory, Caltech, Pasadena, California 91125

Corresponding author: Claus O. Wilke, Caltech, Pasadena, CA 91125., wilke{at}caltech.edu (E-mail)

Communicating editor: M. W. FELDMAN


*  ABSTRACT
*TOP
*ABSTRACT
*THEORETICAL FRAMEWORK
*FIXATION ON A NEUTRAL...
*DISCUSSION
*APPENDIX
*LITERATURE CITED

The probability that an advantageous mutant rises to fixation in a viral quasispecies is investigated in the framework of multitype branching processes. Whether fixation is possible depends on the overall growth rate of the quasispecies that will form if invasion is successful rather than on the individual fitness of the invading mutant. The exact fixation probability can be calculated only if the fitnesses of all potential members of the invading quasispecies are known. Quasispecies fixation has two important characteristics: First, a sequence with negative selection coefficient has a positive fixation probability as long as it has the potential to grow into a quasispecies with an overall growth rate that exceeds that of the established quasispecies. Second, the fixation probabilities of sequences with identical fitnesses can nevertheless vary over many orders of magnitudes. Two approximations for the probability of fixation are introduced. Both approximations require only partial knowledge about the potential members of the invading quasispecies. The performance of these two approximations is compared to the exact fixation probability on a network of RNA sequences with identical secondary structure.


ONE of the most remarkable aspects of the dynamics of RNA viruses is the high rate at which mutant variants are produced. At mutation rates close to one substitution per genome per generation (DRAKE 1993 Down; DRAKE and HOLLAND 1999 Down), a virus population forms a highly diverse cloud of mutants (DOMINGO et al. 1976 Down, DOMINGO et al. 1978 Down; HOLLAND et al. 1982 Down; STEINHAUER et al. 1989 Down; BIEBRICHER and LUCE 1993 Down; BURCH and CHAO 2000 Down), a so-called quasispecies (EIGEN and SCHUSTER 1979 Down; NOWAK 1992 Down; DOMINGO and HOLLAND 1997 Down; DOMINGO et al. 2001 Down). At the same time, the sequence space is so large that even for population sizes up to 1012, there is a constant stream of new mutants that have never existed before. Most of these mutants have impaired fitness, but occasionally, a new mutant will fare better than all currently existing virions, for example, by presenting an epitope that the immune system fails to recognize. With a certain probability, this mutant will rise to fixation, where fixation is understood in the sense that the mutant becomes the ancestor of a new quasispecies, which completely replaces the currently existing one.

The problem of the fixation of an advantageous mutant is an old one, with a long history of investigations in classical population genetics, reaching back to Haldane and Fisher (FISHER 1922 Down, FISHER 1930 Down; HALDANE 1927 Down; KIMURA 1957 Down, KIMURA 1964 Down, KIMURA 1970 Down; EWENS 1967 Down; KIMURA and KING 1979 Down; BARTON 1995 Down; BURGER and EWENS 1995 Down; OTTO and BARTON 1997 Down; POLLAK 2000 Down). However, these investigations differ from the quasispecies case in one important aspect: the mutation rates considered. In classical population genetics, the usual assumption is that mutations are rare events, such that an invading mutant will not mutate again while it is moving toward either fixation or extinction. In the quasispecies setting, on the other hand, most of the immediate offspring of a mutant will have further mutations, and their offspring will as well, and so on. As a consequence, the fitness of a prospective invading quasispecies is not given by the fitness of the initial mutant, but rather by the average fitness of the offspring mutant cloud that will form eventually. One of the more surprising results of these dynamics is that a mutant with the ability to replace the currently existing quasispecies may actually have a reduced replication rate, if at the same time its robustness against further mutations is increased (SCHUSTER and SWETINA 1988 Down; WILKE 2001B Down; WILKE et al. 2001 Down; KRAKAUER and PLOTKIN 2002 Down).

Quasispecies theory in its original formulation by EIGEN and SCHUSTER 1979 Down is based on deterministic differential equations and as such cannot deal with the fluctuations that are responsible for fixation or extinction of individual mutants. Within the more general mathematical framework of multitype branching processes, it is possible to describe both the deterministic aspects of large populations and the fluctuations inherent in the dynamics of small and very small populations (DEMETRIUS et al. 1985 Down; HOFBAUER and SIGMUND 1988 Down; HERMISSON et al. 2002 Down). An expression for the probability of fixation follows naturally from branching process theory. We discuss how this expression relates to the predictions of the deterministic quasispecies equations, as well as to the results of classical population genetics.

The remainder of the article is organized as follows. First, we derive a general expression for the probability of fixation in an arbitrary fitness landscape. Then, we discuss the special case of fixation on a neutral network, that is, the case in which all sequences of the invading quasispecies have the same fitness, and derive two approximations for the fixation probability that can be evaluated without the knowledge of the full-fitness landscape. To give a concrete example, we apply both the exact expression and the approximations to a known network of >50,000 RNA sequences. For this neutral network, we also discuss how the fixation probability changes if multiple sequences invade at the same time.


*  THEORETICAL FRAMEWORK
*TOP
*ABSTRACT
*THEORETICAL FRAMEWORK
*FIXATION ON A NEUTRAL...
*DISCUSSION
*APPENDIX
*LITERATURE CITED

For a population evolving under high mutational pressure, we have to understand fixation in the sense that a mutant is fixed once it has become a common ancestor of the whole population. The more traditional definition of fixation, which is to regard a mutation as fixed if all sequences in the population carry it, is not applicable: The mutational pressure constantly creates new deleterious mutants, which may not carry a particular mutation although their ancestors did so. If we understand fixation as the process by which a mutant becomes a common ancestor of the whole population, then the probability that a mutant is fixed is given by the probability that the cascade of further mutated offspring of the invading mutant does not come to a halt. We can calculate this probability from the theory of multitype branching processes.

The general setting to which our theory applies is as follows. Consider a viral quasispecies in mutation-selection balance, with an average fitness <w>. If generations are discrete and nonoverlapping, and the population size N is constant, then the probability that a virion i produces k offspring in one generation is given by Wright-Fisher sampling,

(1)

with , where wi is the fitness of virion i.

Assume that a rare mutation leads to the emergence of a virion with the potential to form a new quasispecies and to replace the already established one in the process. This new quasispecies (in the following also called the invading quasispecies) may consist of sequences of type 1, 2, ... , n, with replication rates wi. Let the probability that a sequence j produces an erroneous copy i be given by Qij. As long as the total abundance of the invading quasispecies is small compared to the established quasispecies, we can assume that <w> is not affected by the presence of the invading quasispecies. Then, the probability that a single sequence of type i generates (k1, ... , kn) offspring of types 1, ... , n can be expressed as

(2)

(see Appendix), with . The matrix elements Mij give the expected number of offspring of type j from sequences of type i in one generation. In the following, we assume that the population size is so large that we can approximate P(k1, ... , kn|i) by its limit for an infinitely large population. This limit is a multivariate Poisson distribution:

(3)

By using the theory of branching processes and by assuming an infinite population size in Equation 3, we restrict the applicability of our theory to certain scenarios. We can apply our theory only to those types of fixation events that increase the average fitness of the population. The situation of genetic drift, whereby a neutral or deleterious mutant is fixed because of stochastic fluctuations in a small population (KIMURA 1970 Down; KIMURA and KING 1979 Down), is not covered by our theory. This latter type of fixation event reduces the average fitness or leaves it unaltered.

Let xi be the probability that the offspring cascade spawned by a sequence i goes extinct after a finite number of generations. From the theory of multitype branching processes (HARRIS 1963 Down), we know that the vector of extinction probabilities x = (x1, ... , xn) satisfies x = f(x), where f(z) = (f1(z), ... , fn(z)) is the probability-generating function of the distribution of offspring probabilities P(k1, ... , kn|i). The probability-generating function is defined as

(4)

After inserting Equation 3 into Equation 4, we obtain . With the convention we can rewrite this expression as

(5)

Since the probability of fixation {pi}i of a sequence i is given by the probability that the offspring cascade spawned by i does not go extinct, we have . The vector of fixation probabilities satisfies therefore . With Equation 5, we find

(6)

This equation has exactly one solution with 0 < {pi}i < 1 for all i if the spectral radius {rho}M of M > 1 (HARRIS 1963 Down; for the matrices M we are considering here, the spectral radius coincides with the largest positive eigenvalue of M, by virtue of the Frobenius-Perron theorem). Otherwise, {pi}i = 0 for all i.

To compare Equation 6 to the result of HALDANE 1927 Down, we take the logarithm on both sides of Equation 6 and expand to second order:

(7)

With , this simplifies to

(8)

If si > 0 and all off-diagonal elements of M are 0, then Equation 8 reduces to Haldane's result {pi}i = 2si; that is, the fixation probability of a sequence is twice its selective advantage. If the off-diagonal elements are nonzero, then the fixation probability is increased, because the invading sequence gets support from its mutational neighbors. In particular, even if some si < 0, the corresponding {pi}i are positive as long as {rho}M > 1. This means that in quasispecies fixation, sequences that by themselves reproduce too slowly to outcompete the currently established quasispecies can nevertheless found a new quasispecies that grows fast enough to overtake the population.

For simplicity, we have considered only discrete, nonoverlapping generations. Generalization to continuous time is straightforward (see, e.g., HARRIS 1963 Down; HERMISSON et al. 2002 Down). In the continuous-time case, the vector of fixation probabilities {pi} is again determined by an equation of the form . However, the generating function f(z) is in general not given by Equation 5. Its functional form depends on the details of the continuous-time process that is being modeled. For example, if reproduction occurs through binary fission, f(z) will be quadratic in the variables z1, ... , zn.


*  FIXATION ON A NEUTRAL NETWORK
*TOP
*ABSTRACT
*THEORETICAL FRAMEWORK
*FIXATION ON A NEUTRAL...
*DISCUSSION
*APPENDIX
*LITERATURE CITED

Exact expressions and estimates:
So far, we have made no assumptions about the structure and fitness distribution of the invading quasispecies. This has led to a general equation for the vector of fixation probabilities {pi}, but not much further analysis is possible without a concrete model for the fitness landscape of the invading quasispecies (we do not have to make any further assumptions about the established quasispecies, since it enters the equations only through its average fitness <w>). The concrete fitness landscape we study is that of a neutral network (HUYNEN et al. 1996 Down; BORNBERG-BAUER 1997 Down) of related sequences with identical replication rate {sigma}. All sequences that are not part of the neutral network are assumed to have a vanishing replication rate. Mutations occur as random substitutions of single bases, and we allow for at most one substitution per replication event, similar to the approach of VAN NIMWEGEN et al. 1999 Down. The probability of a substitution is given by µ. The restriction to at most a single substitution is a technicality that simplifies the analysis. Generalization to more elaborate mutation schemes is possible along the lines of WILKE 2001A Down.

We denote the sequence length by L and the number of different bases by {kappa} ({kappa} = 4 for RNA/DNA). For the matrix M, we have to take into account only the sequences belonging to the neutral network. It is useful to introduce the connection graph G = (Gij). The elements Gij are 1 if and only if two sequences i and j are exactly one mutation apart. In all other cases, Gij = 0. We can express M in terms of G as

(9)

where , and 1 is the identity matrix. We restrict our analysis to primitive connection graphs, in which case the spectral radius {rho}G of G is given by the unique positive eigenvalue of largest modulus of G (VARGA 2000 Down). (Irreducibility, which is often assumed in similar contexts, is not sufficient, since complex eigenvalues of modulus {rho}G may exist if G is not primitive. Irreducible undirected connection graphs of the kind we are considering here are primitive if they contain at least one cycle of odd length.)

The spectral radius of M is given in terms of the spectral radius of the connection graph {rho}G as

(10)

This implies that fixation can occur as long as s is not smaller than -ß{rho}G.

In an experimental setting, we cannot expect to have knowledge of the complete connection graph G. Therefore, it is important to have approximations for the fixation probability {pi}i. We consider two alternative methods. Both are based on replacing the matrix M in Equation 6 by a suitable diagonal matrix. This replacement leads to a decoupling of the equations for different {pi}i.

The quantity that is easiest to obtain experimentally is the growth rate of the invading quasispecies relative to the established quasispecies, when initially both are present in large and equal amounts. From the definition of M, we see that this relative growth rate corresponds to the spectral radius {rho}M of M. If we assume that every mutant present in the invading quasispecies has an expectation of {rho}M offspring per generation, then we can replace M in Equation 6 with a matrix that has entries {rho}M on the diagonal, while all off-diagonal elements are zero. Then, Equation 6 simplifies to for all i. Clearly, this approximation will overestimate the {pi}i for some mutants (mostly those that produce on average <{rho}M offspring) and underestimate it for others (mostly those that produce on average >{rho}M offspring). In the following, we refer to this estimate as the deterministic growth estimate, because it is based on the assumption that the invading quasispecies grows according to the deterministic equations from the outset.

The alternative method of estimating {pi}i is as follows. It is reasonable to assume that the first couple of replication cycles mostly determine fixation or extinction for an invading sequence. During these initial generations, the subpopulation descending from the invading sequence cannot explore the full neutral network if the network is large. Therefore, the major contribution to the fixation probability comes from the connection matrix of the local genetic neighborhood of the invading sequence, and sequences farther away on the neutral network are relatively unimportant. The idea behind the second approximation is therefore to calculate the fixation probability on the basis of a small area of genotype space surrounding the invading sequence. In the simplest case, we consider only the invading sequence and its immediate mutational neighbors. Assume sequence i has {nu}i neutral neighbors, i.e., . Then the total expected number of offspring of sequence i is . Under the assumption that all offspring of i have the same expected number of further offspring, the probability of fixation satisfies the equation We call the solution to this equation the neutrality estimate. As in the case of the deterministic growth estimate, it will overestimate the true fixation probability for some sequences and underestimate it for others.

Fixation on an RNA neutral network:
We compared the two estimates to the exact fixation probabilities on a neutral network of RNA sequences. The network of 51,028 sequences of length L = 18 was found through exhaustive enumeration by VAN NIMWEGEN et al. 1999 Down. The spectral radius of the network's connection graph is {rho}G = 15.7. To calculate fixation probabilities on this neutral network, we have to make an assumption about the average fitness <w> of the established quasispecies. We assume , in which case the relative growth rate of the invading quasispecies (at macroscopic concentration) with respect to the established quasispecies follows from Equation 10 as {rho}M = {sigma}, independent of the mutation rate.

Fig 1 displays the exact fixation probabilities (obtained numerically from Equation 6) and the two estimates as functions of the mutation rate. We have shown the average fixation probability the minimum probability , and the maximum probability . Since we chose <w> such that {rho}M is independent of µ, the deterministic growth estimate is independent of µ. We observe that the deterministic growth estimate lies consistently above the average , but below the maximum {pi}max. The neutrality estimate underestimates the smallest fixation probabilities and overestimates the largest ones. Its average lies slightly below for small mutation rates and above for large mutation rates. A more detailed plot of the fixation probabilities at a fixed mutation rate of µ = 0.5 is given in Fig 2. There, we display the fixation probability vs. the neutrality (number of neutral neighbors) of the invading sequence. The spread in the fixation probabilities is remarkable. For sequences with a given neutrality, the fixation probabilities vary over up to seven orders of magnitude. This demonstrates the important influence of not only the nearest neighbors but also the wider genetic neighborhood on the fate of a single sequence in quasispecies evolution. The neutrality estimate substantially underestimates the fixation probabilities of those sequences that have only few immediate neutral neighbors, but are otherwise located in a region of the genotype space where the density of neutral sequences is high. In principle, we could improve the neutrality estimate by taking into account all neutral sequences up to some distance d, but in practice this method becomes quickly as unwieldy as calculating the exact fixation probabilities.



View larger version (19K):
In this window
In a new window
Download PPT slide
 
Figure 1. Fixation probability vs. mutation rate in a neutral network of 51,028 RNA sequences taken from VAN NIMWEGEN et al. 1999 Down. Solid lines correspond to the solution of the full equations, dashed lines correspond to the neutrality estimate, and the dotted line indicates the deterministic growth estimate. {sigma} = 1.05, L = 18, {rho}G = 15.7, <w> = 1 - µ[1 - {rho}G/(3L)].



View larger version (32K):
In this window
In a new window
Download PPT slide
 
Figure 2. Fixation probability vs. neutrality {nu} of the invading sequence in a neutral network of 51,028 RNA sequences taken from VAN NIMWEGEN et al. 1999 Down. The dots stem from the exact numerical solution, the dashed line corresponds to the neutrality estimate, and the dotted line indicates the deterministic growth estimate. The inset shows the distribution of neutralities in the network. µ = 0.5, {sigma} = 1.05, L = 18, {rho}G = 15.7, <w> = 1 - µ[1 - {rho}G/(3L)].

Multiple invading sequences:
The above considerations address only the case of a single invading sequence. The generalization to more than one invading sequence is straightforward. Assume that a set S of N sequences, with S = {i1, ... , iN}, invades an established quasispecies. The probability that this invasion is successful is given by 1 - {Pi}i{isin}S(1 - {pi}i), where {pi}i are the fixation probabilities of the individual sequences. The probability of successful invasion of N sequences can be used as an indicator for the population size at which the deterministic quasispecies equations capture the relevant dynamics of a finite population. The fluctuations distinguishing the stochastic process of a finite population from the deterministic description can be neglected if the invasion probability is close to one. In Fig 3, the fixation probability on the same neutral network of RNA sequences that we have used before is displayed against the size of the invading population. The individual data points are averaged over 1000 independent trials, where for each trial the N starting sequences were chosen at random. As before, <w> is chosen such that {sigma} is the average number of offspring of the invading quasispecies in the deterministic limit.



View larger version (20K):
In this window
In a new window
Download PPT slide
 
Figure 3. Fixation probability {pi} vs. size of the invading population N in a neutral network of 51,028 RNA sequences. The fixation probability is averaged over 1000 independent sets of invading sequences, chosen at random. The error bars indicate the standard deviation. Lines are meant as a guide to the eye. µ = 0.2, L = 18, {rho}G = 15.7, <w> = 1 - µ[1 - {rho}G/(3L)].

Fig 3 shows that the population need not cover the relevant sequence space to behave as predicted by the deterministic equations. On a neutral network of >50,000 sequences, a population of ~1000 behaves deterministically at an advantage in growth rate of only 1%. It is important to note that this advantage has been calculated under the assumption of an infinite population and that sufficiently small populations will grow substantially slower (VAN NIMWEGEN et al. 1999 Down). Apparently, here a population that covers only 2% of the neutral network is not sufficiently small to experience this reduction in growth rate.


*  DISCUSSION
*TOP
*ABSTRACT
*THEORETICAL FRAMEWORK
*FIXATION ON A NEUTRAL...
*DISCUSSION
*APPENDIX
*LITERATURE CITED

The exact expression for the probability of fixation in the quasispecies context is easy to evaluate numerically if the fitnesses of all relevant sequences are known. However, these data are normally not available for experimental systems, and approximations must be used. What is most easily available experimentally is the relative rate of growth of the two quasispecies at macroscopic concentrations, which is the basis of the deterministic growth estimate. Since this estimate gives only a single number, independently of the sequence actually seeding the invading quasispecies, it does not reflect local variations in the density of viable sequences around the invading sequence. The neutrality estimate does not suffer from this shortcoming. However, it requires the knowledge of the fitnesses of the immediate neighbors of the invading sequence. Although experimentally tedious, these fitnesses can be measured in principle. For example, ELENA and LENSKI 1997 Down generated 225 mutant strains of the bacterium Escherichia coli (each mutant differed from the wild type by one, two, or three mutations) and measured the relative fitnesses of the mutant strains to the wild type. The mutant neighborhood of an RNA virus can conceivably be measured in a similar manner.

The predictive power of both the deterministic growth estimate and the neutrality estimate depends strongly on the distribution of neutral sequences in sequence space. For example, both estimates become exact for the case of a uniform neutral lattice, in which all sequences have exactly the same neutrality. Furthermore, we expect the neutrality estimate to perform particularly well in networks in which a sequence's neutrality is strongly correlated to the neutralities of its immediate and more distant neutral neighbors. The deterministic growth estimate, on the other hand, will yield the best results if the neutral network does not decompose into areas that are substantially more densely or less densely connected than other areas. However, to what extent these conditions are met in natural systems is questionable. As we have seen in this article, the connection graph of a comparatively simple neutral network—consisting of RNA sequences that are only 18 bp long—is already so heterogeneous that both estimates fail to give an accurate prediction of the fixation probability for a substantial fraction of sequences on that network. It is reasonable to assume that the distribution of high-fitness sequences in sequence space for an RNA virus that consists of several thousand bases is at least as heterogeneous as the one in our toy RNA network, probably more so.

In this work, we have considered only the fate of a single invading quasispecies. However, while an invading quasispecies is moving toward fixation or extinction, another mutant, one that belongs to a quasispecies of even higher mean fitness, may appear. The fixation probability of the first invader will then be modulated by the dynamics of the second one and vice versa, an effect commonly referred to as "clonal interference" (GERRISH and LENSKI 1998 Down). Clonal interference has been reported in experiments with vesicular stomatitis virus (MIRALLES et al. 1999 Down, MIRALLES et al. 2000 Down) and with the bacterium E. coli (DE VISSER et al. 1999 Down). Currently, an accurate mathematical description of clonal interference for the quasispecies case is not available.

The approach we have followed in this work cannot be directly generalized to include clonal interference, because the assumption of a constant background average fitness <w> is not justified in the context of two (or more) competing branching processes. A second problem that we have to solve in a theory of quasispecies clonal interference is the identification of advantageous mutants. Throughout this article, we have used the definition that an advantageous mutant is one that can grow into a quasispecies with higher average fitness than that of the currently established quasispecies. To use this definition in the context of clonal interference, we need to have a priori knowledge about how to best subdivide the sequence space into independent quasispecies. Only with this knowledge can we decide whether a particular new mutant is part of the parent quasispecies or rather the founding member of a new quasispecies. A possible way to study clonal interference in future work will be to consider a particular fitness landscape—for example, a set of intertwined neutral networks at different fitness levels—for which the a priori separation into distinct quasispecies is possible. For such a landscape, numerical studies of clonal interference will be straightforward, and an analytic description should be possible as well. For landscapes that are a priori unknown, even the numerical investigation of clonal interference will remain difficult until a workable method for the identification of advantageous mutants has been found.

Recently, JENKINS et al. 2001 Down and HOLMES and MOYA 2002 Down expressed doubts regarding the relevancy of the quasispecies model for virology (but see DOMINGO 2002 Down). They argued that there is no unequivocal experimental evidence for the quasispecies nature of RNA viruses and that the deterministic quasispecies equations are potentially not applicable to viral evolution on theoretical grounds, due to the immense size of the sequence space. The results of this article show that the second concern is not entirely justified. A single sequence has a positive probability to rise to fixation if and only if the average fitness of the quasispecies that will form eventually exceeds the average fitness of the currently established quasispecies. The individual fitness of the invading sequence has some influence on the exact value of that probability, but does not affect whether fixation is possible at all. Moreover, when the population size reaches several hundred, with probability of almost one the population will, for reasonable choices of the parameters, behave as predicted by the deterministic equations. A similar result has been obtained by VAN NIMWEGEN et al. 1999 Down for flow reactor simulations, where, on the same neutral network of RNA sequences that we have studied here, quasispecies effects started to become important when the product of population size and mutation rate Nµ exceeded the value 10 (see Fig 3 of VAN NIMWEGEN et al. 1999 Down).

WILKE 2001B Down studied the probability of fixation for RNA sequences in a simulated flow reactor. The measured fixation probability was compared to an expression equivalent to the deterministic growth estimate of this work (since continuous time simulations were used to generate the data, the exact expressions differ from those given here). The analytic expression correctly predicted the parameter regions for which fixation was possible. In particular, the mutation rate at which a slower replicator with better mutational support could successfully invade a quasispecies consisting of sequences with higher individual fitnesses was determined accurately. However, the exact fixation probabilities seemed to be slightly overestimated. (Within the statistical accuracy of the data, a definite decision on this issue could not be made. While the data were in agreement with the model according to a {chi}2 test, they were not in agreement according to a nonparametric test based on how often the data points fell above or below the predicted value.)

The probability of fixation of advantageous mutants is obviously of tremendous importance for disease dynamics and vaccines. For example, live vaccinces of attenuated poliovirus can contain small amounts of virulent poliovirus variants (CHUMAKOV et al. 1991 Down), the reason being that attenuated and virulent virus variants are often separated by only one or a few mutations. In experiments, small amounts of highly virulent virus remain typically suppressed by the less virulent virus, but once a threshold concentration of the highly virulent virus variant is reached, infection occurs (DE LA TORRE and HOLLAND 1990 Down; CHUMAKOV et al. 1991 Down; TENG et al. 1996 Down). The apparent existence of such a threshold may well be a result of insufficient resolution of the experiments. Whether the highly virulent strain will grow is determined by stochastic fluctuations, and, as we have seen in Fig 3, the probability of fixation decays quickly with shrinking initial concentration of the virulent strain. If such a strain in a vaccine has a 1% chance to cause infection, then >>100 replicates of the appropriate assay are necessary to observe at least one infection with certainty. Probabilities of this magnitude or lower can easily be missed at low numbers of replicates, so that the virulent strain appears to be safely suppressed.


*  ACKNOWLEDGMENTS

I thank C. Adami for extensive discussions and encouragement and E. C. Holmes for commenting on an earlier version of this manuscript. Moreover, I thank M. Huynen for providing the neutral network data from VAN NIMWEGEN et al. 1999 Down. Financial support by the National Science Foundation under contract DEB-9981397 is gratefully acknowledged.

Manuscript received June 3, 2002; Accepted for publication November 1, 2002.


*  APPENDIX
*TOP
*ABSTRACT
*THEORETICAL FRAMEWORK
*FIXATION ON A NEUTRAL...
*DISCUSSION
*APPENDIX
*LITERATURE CITED

We consider a model with discrete, nonoverlapping generations and a constant population size N. Under the assumption that the reproductive success of a sequence i is proportional to its fitness wi, the probability that a randomly chosen sequence in the next generation is offspring of sequence i is given by , where <w> is the average fitness in the population. Since there are N sequences in the population, the probability that k of them are offspring of sequence i is binomial, Now consider a sequence of type r in the offspring generation. For the probability that the parent of sequence r is a particular sequence i of the previous generation, we find , because only a fraction Qri of the total offspring of i will be of type r. Following the previous argument, we find for the probability that sequence i leaves kr offspring of type r:

We can extend the above argument to sequences of two types, r and s. The probability that sequence i leaves kr offspring sequences of type r and ks offspring sequences of type s is the probability that kr offspring are of type r, {xi}krr, times the probability that ks offspring are of type s, {xi}kss, times the probability that the remaining offspring either are of different types or have different parent sequences, (1 - {xi}r - {xi}s)N-kr-ks, times the number of possible ways in which kr and ks sequences can be chosen out of the total of N sequences in the population. This latter number is a multinomial coefficient, N!/[kr!ks!(N - kr - ks)!]. Putting everything together, we find

(A1)

By repeating this argument for n different sequence types, and with the definition Mij := N{xi}j = wiQji/<w>, we arrive at Equation 3.


*  LITERATURE CITED
*TOP
*ABSTRACT
*THEORETICAL FRAMEWORK
*FIXATION ON A NEUTRAL...
*DISCUSSION
*APPENDIX
*LITERATURE CITED

BARTON, N. H., 1995  Linkage and the limits to natural selection. Genetics 140:821-841.[Abstract]

BIEBRICHER, C. K. and R. LUCE, 1993  Sequence analysis of RNA species synthesized by Qß replicase without template. Biochemistry 32:4848-4854.[Medline]

BORNBERG-BAUER, E., 1997  How are model protein structures distributed in sequence space? Biophys. J. 73:2393-2403.[Abstract/Free Full Text]

BURCH, C. L. and L. CHAO, 2000  Evolvability of an RNA virus is determined by its mutational neighbourhood. Nature 406:625-628.[Medline]

RGER, R. and W. J. EWENS, 1995  Fixation probabilities of additive alleles in diploid populations. J. Math. Biol. 33:557-575.

CHUMAKOV, K. M., L. B. POWERS, K. E. NOONAN, I. B. RONINSON, and I. S. LEVENBOOK, 1991  Correlation between amount of virus with altered nucleotide sequence and the monkey test for acceptability of oral poliovirus vaccine. Proc. Natl. Acad. Sci. USA 88:199-203.[Abstract/Free Full Text]

DE LA TORRE, J. C. and J. J. HOLLAND, 1990  RNA virus quasispecies populations can suppress vastly superior mutant progeny. J. Virol. 64:6278-6281.[Abstract/Free Full Text]

DE VISSER, J. A. G. M., C. W. ZEYL, P. J. GERRISH, J. L. BLANCHARD, and R. E. LENSKI, 1999  Diminishing returns from mutation supply rate in asexual populations. Science 283:404-406.[Abstract/Free Full Text]

DEMETRIUS, L., P. SCHUSTER, and K. SIGMUND, 1985  Polynucleotide evolution and branching processes. Bull. Math. Biol. 47:239-262.[Medline]

DOMINGO, E., 2002  Quasispecies theory in virology. J. Virol. 76:463-465.[Free Full Text]

DOMINGO, E. and J. J. HOLLAND, 1997  RNA virus mutations and fitness for survival. Annu. Rev. Microbiol. 51:151-178.[Medline]

DOMINGO, E., R. A. FLAVELL, and C. WEISSMANN, 1976  In vitro site directed mutagenesis: generation and properties of an infectious extracistronic mutant of bacteriophage Qß.. Gene 1:3-25.[Medline]

DOMINGO, E., D. SABO, T. TANIGUCHI, and C. WEISSMANN, 1978  Nucleotide sequence heterogeneity of an RNA phage population. Cell 13:735-744.[Medline]

DOMINGO, E., C. K. BIEBRICHER, M. EIGEN and J. J. HOLLAND, 2001 Quasispecies and RNA Virus Evolution: Principles and Consequences. Landes Bioscience, Georgetown, TX.

DRAKE, J. W., 1993  Rates of spontaneous mutation among RNA viruses. Proc. Natl. Acad. Sci. USA 90:4171-4175.[Abstract/Free Full Text]

DRAKE, J. W. and J. J. HOLLAND, 1999  Mutation rates among RNA viruses. Proc. Natl. Acad. Sci. USA 96:13910-13913.[Abstract/Free Full Text]

EIGEN, M., and P. SCHUSTER, 1979 The Hypercycle: A Principle of Natural Self-Organization. Springer-Verlag, Berlin.

ELENA, S. F. and R. E. LENSKI, 1997  Test of synergistic interactions among deleterious mutations in bacteria. Nature 390:395-398.[Medline]

EWENS, W. J., 1967  The probability of fixation of a mutant: the two-locus case. Evolution 21:532-540.

FISHER, R. A., 1922  On the dominance ratio. Proc. R. Soc. Edinb. 42:321-341.

FISHER, R. A., 1930  The distribution of gene ratios for rare mutations. Proc. R. Soc. Edinb. 50:204-219.

GERRISH, P. J. and R. E. LENSKI, 1998  The fate of competing beneficial mutations in an asexual population. Genetica 102(103):127-144.

HALDANE, J. B. S., 1927  A mathematical theory of natural and artificial selection. Part V: selection and mutation. Proc. Camp. Philos. Soc. 23:838-844.

HARRIS, T. E., 1963 The Theory of Branching Processes. Springer, Berlin.

HERMISSON, J., O. REDNER, H. WAGNER, and E. BAAKE, 2002  Mutation-selection balance: ancestry, load, and maximum principle. Theor. Popul. Biol. 62:9-46.[Medline]

HOFBAUER, J., and K. SIGMUND, 1988 The Theory of Evolution and Dynamical Systems. Cambridge University Press, Cambridge, UK.

HOLLAND, J., K. SPINDLER, F. HORODYSKI, E. GRABAU, and S. NICHOL et al., 1982  Rapid evolution of RNA genomes. Science 215:1577-1585.[Abstract/Free Full Text]

HOLMES, E. C. and A. MOYA, 2002  Is the quasispecies concept relevant to RNA viruses? J. Virol. 76:460-462.[Free Full Text]

HUYNEN, M. A., P. F. STADLER, and W. FONTANA, 1996  Smoothness within ruggedness: the role of neutrality in adaptation. Proc. Natl. Acad. Sci. USA 93:397-401.[Abstract/Free Full Text]

JENKINS, G. M., M. WOROBEY, C. H. WOELK, and E. C. HOLMES, 2001  Evidence for the non-quasispecies evolution of RNA viruses. Mol. Biol. Evol. 18:987-994.[Abstract/Free Full Text]

KIMURA, M., 1957  Some problems of stochastic processes in genetics. Ann. Math. Stat. 28:882-901.

KIMURA, M., 1964  Diffusion models in population genetics. J. Appl. Prob. 1:177-232.

KIMURA, M., 1970  The length of time required for a selectively neutral mutant to reach fixation through random frequency drift in a finite population. Genet. Res. 15:131-133.[Medline]

KIMURA, M. and J. L. KING, 1979  Fixation of a deleterious allele at one of two "duplicate" loci by mutation pressure and random drift. Proc. Natl. Acad. Sci. USA 76:2858-2861.[Abstract/Free Full Text]

KRAKAUER, D. C. and J. B. PLOTKIN, 2002  Redundancy, antiredundancy, and the robustness of genomes. Proc. Natl. Acad. Sci. USA 99:1405-1409.[Abstract/Free Full Text]

MIRALLES, R., P. J. GERRISH, A. MOYA, and S. F. ELENA, 1999  Clonal interference and the evolution of RNA viruses. Science 285:1745-1747.[Abstract/Free Full Text]

MIRALLES, R., A. MOYA, and S. F. ELENA, 2000  Diminishing returns of population size in the rate of RNA virus adaptation. J. Virol. 74:3566-3571.[Abstract/Free Full Text]

NOWAK, M. A., 1992  What is a quasispecies? TREE 7:118-121.

OTTO, S. P. and N. H. BARTON, 1997  The evolution of recombination: removing the limits to natural selection. Genetics 147:879-906.[Abstract]

POLLAK, E., 2000  Fixation probabilities when the population size undergoes cyclic fluctuations. Theor. Popul. Biol. 57:51-58.[Medline]

SCHUSTER, P. and J. SWETINA, 1988  Stationary mutant distributions and evolutionary optimization. Bull. Math. Biol. 50:635-660.[Medline]

STEINHAUER, D. A., J. C. DE LA TORRE, E. MEIER, and J. J. HOLLAND, 1989  Extreme heterogeneity in populations of vesicular stomatitis virus. J. Virol. 63:2072-2080.[Abstract/Free Full Text]

TENG, M. N., M. B. A. OLDSTONE, and J. C. DE LA TORRE, 1996  Suppression of lymphocytic choriomeningitis virus—induced growth hormone deficiency syndrome by disease-negative virus variants. Virology 223:113-119.[Medline]

VAN NIMWEGEN, E., J. P. CRUTCHFIELD, and M. HUYNEN, 1999  Neutral evolution of mutational robustness. Proc. Natl. Acad. Sci. USA 96:9716-9720.[Abstract/Free Full Text]

VARGA, R. S., 2000 Matrix Iterative Analysis, Ed. 2. Springer-Verlag, New York.

WILKE, C. O., 2001a  Adaptive evolution on neutral networks. Bull. Math. Biol. 63:715-730.[Medline]

WILKE, C. O., 2001b  Selection for fitness versus selection for robustness in RNA secondary structure folding. Evolution 55:2412-2420.[Medline]

WILKE, C. O., J. L. WANG, C. OFRIA, R. E. LENSKI, and C. ADAMI, 2001  Evolution of digital organisms at high mutation rate leads to survival of the flattest. Nature 412:331-333.[Medline]


*  NOTE ADDED IN PROOF

After acceptance of this manuscript, I became aware of a recent paper by T. JOHNSON and N. H. BARTON (2002, The effect of deleterious alleles on adaptation in asexual populations. Genetics 162: 395–411). Using the theory of multitype branching processes, Johnson and Barton studied in this article the probability of fixation of a mutant that suffers additional deleterious mutations while going to fixation.




This article has been cited by other articles:


Home page
GeneticsHome page
Z. Patwa and L. M. Wahl
Fixation Probability for Lytic Viruses: The Attachment-Lysis Model
Genetics, September 1, 2008; 180(1): 459 - 470.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
J. E. Hubbarde, G. Wild, and L. M. Wahl
Fixation Probabilities When Generation Times Are Variable: The Burst Death Model
Genetics, July 1, 2007; 176(3): 1703 - 1712.
[Abstract] [Full Text] [PDF]


Home page
J. Virol.Home page
J. R. Bailey, A. R. Sedaghat, T. Kieffer, T. Brennan, P. K. Lee, M. Wind-Rotolo, C. M. Haggerty, A. R. Kamireddi, Y. Liu, J. Lee, et al.
Residual Human Immunodeficiency Virus Type 1 Viremia in Some Patients on Antiretroviral Therapy Is Dominated by a Small Number of Invariant Clones Rarely Found in Circulating CD4+ T Cells.
J. Virol., July 1, 2006; 80(13): 6441 - 6457.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
C. O. Wilke
The Speed of Adaptation in Large Asexual Populations
Genetics, August 1, 2004; 167(4): 2045 - 2053.
[Abstract] [Full Text] [PDF]