Genetics, Vol. 154, 1403-1417, March 2000, Copyright © 2000

Understanding the Overdispersed Molecular Clock

David J. Cutlera
a Center for Population Biology, University of California, Davis, California 95616

Corresponding author: David J. Cutler, Rm. BRB 747B, Case Western Reserve University, 2109 Adelbert Rd., Cleveland, OH 44106-4955., djc14{at}cwru.edu (E-mail)

Communicating editor: G. B. GOLDING


*  ABSTRACT
*TOP
*ABSTRACT
*CALCULATION OF R(T)
*SLOWLY CHANGING ENVIRONMENT
*UNDERSTANDING MODELS WITH...
*INTERACTION BETWEEN SITES
*EXCHANGEABLE ALLELES
*HOUSE OF CARDS
*OPTIMUM MODEL
*SHIFT MODELS
*INFINITE ALLELE MODELS
*A DELETERIOUS MUTATIONS MODEL
*DISCUSSION
*LITERATURE CITED

Rates of molecular evolution at some protein-encoding loci are more irregular than expected under a simple neutral model of molecular evolution. This pattern of excessive irregularity in protein substitutions is often called the "overdispersed molecular clock" and is characterized by an index of dispersion, R(T) > 1. Assuming infinite sites, no recombination model of the gene R(T) is given for a general stationary model of molecular evolution. R(T) is shown to be affected by only three things: fluctuations that occur on a very slow time scale, advantageous or deleterious mutations, and interactions between mutations. In the absence of interactions, advantageous mutations are shown to lower R(T); deleterious mutations are shown to raise it. Previously described models for the overdispersed molecular clock are analyzed in terms of this work as are a few very simple new models. A model of deleterious mutations is shown to be sufficient to explain the observed values of R(T). Our current best estimates of R(T) suggest that either most mutations are deleterious or some key population parameter changes on a very slow time scale. No other interpretations seem plausible. Finally, a comment is made on how R(T) might be used to distinguish selective sweeps from background selection.


THE most simple version of the neutral theory of molecular evolution (OHTA and KIMURA 1971 Down; SAWYER 1977 Down; KELLY 1979 Down; KIMURA 1983 Down) predicts that the number of mutations that arise in a population in T generations, which ultimately become fixed in the population, will be Poisson distributed with mean uT, where u is the per sequence, per generation mutation rate. Therefore, the variance in the number of substitutions will equal the mean under this most simple neutral model. The ratio of the variance in the number of substitutions to the mean number is called the index of dispersion of molecular evolution. Under the most simple neutral theory, the index of dispersion, R(T), should equal 1.

The first article to demonstrate a deviation from a Poisson number of substitutions occurred early in the history of the neutral theory (OHTA and KIMURA 1971 Down). Ohta and Kimura examined three proteins in several pairwise comparisons in mammals. They showed that for two of the proteins in a few of the pairs, a Poisson substitution rate could be rejected. This result was hard to interpret, as no explicit phylogenetic hypothesis was made, and the effect of phylogeny went unconsidered.

The first attempts to use a phylogeny in an explicit manner came a few years later (LANGLEY and FITCH 1973 Down, LANGLEY and FITCH 1974 Down). Langley and Fitch examined four proteins in 18 species. The species were assumed to have a known phylogeny. The numbers of amino acid substitutions along all branches in the phylogeny were inferred. Next, the branch lengths and mutation rates were found by a maximum-likelihood method. Finally, a {chi}2 (LANGLEY and FITCH 1973 Down) and a likelihood-ratio test (LANGLEY and FITCH 1974 Down) were performed to ask whether the observed numbers of mutations on the branches were statistically different from the expected. The neutral Poisson model was rejected with high confidence. Two basic interpretations of Langley and Fitch have been offered. The first interpretation suggested that the rate of molecular evolution changed over time (LANGLEY and FITCH 1974 Down); i.e., the mean number of substitutions was not constant over time (the substitution process is not stationary). The second interpretation (GILLESPIE and LANGLEY 1979 Down) was that the mean number of substitutions remained constant over time (the substitution process was stationary), but that the variance in the number of substitutions was larger than would be produced by a Poisson process, i.e., the index of dispersion was larger than one.

In 1983, Kimura attempted to directly test whether or not the index of dispersion truly equaled one (KIMURA 1983 Down). Kimura considered four different proteins taken from six mammalian lineages. He assumed these six lineages came from a star phylogeny, and therefore the number of substitutions in each lineage was an independent sample, each with mean uT. He calculated R(T) for each of these four proteins and found that R(T) ranged between 1.7 and 3.3. Although R(T) was bigger than predicted, in only two of the proteins was it significantly larger than one.

In a series of articles, GILLESPIE 1984A Down, GILLESPIE 1986A Down, GILLESPIE 1986B Down extended Kimura's test to nine proteins (both nuclear and mitochondrial), but again assumed a mammalian star phylogeny. GILLESPIE 1986B Down found that R(T) ranged from 0.16 to 35.55, and he had more than enough observations to repeatedly reject the most simple neutral theory. Despite the rather clear rejection of the simple neutral model, this analysis was less than completely convincing for several reasons. First, a star phylogeny was assumed. If this assumption were false, individual lineages would have different T's, and the variance would be artificially inflated (GILLESPIE 1989 Down). Second, under the neutral theory, the substitution rate should only be constant per generation, so different length generations in different lineages should artificially inflate R(T) (GILLESPIE 1989 Down). Third, an overall increase or decrease in the mutation rate in only some of the lineages (perhaps due to a systemic change in metabolic rate or in DNA repair machinery, etc.) would lead to an artificial inflation of R(T) (GILLESPIE 1989 Down). Fourth, in certain circumstances use of a correction formula to estimate divergence distances could artificially inflate R(T) (BULMER 1989 Down; GILLESPIE 1989 Down). Gillespie solved the first three problems, collectively known as lineage effects, in 1989.

Gillespie's solution to lineage effects was to (1) restrict his analysis to three species at a time, thereby guaranteeing a single unrooted phylogeny, and (2) weight the number of substitutions in each lineage by one over the mean number for that lineage, where the mean is taken over all loci examined. This weighting process amounts to regressing out lineage effects from the data. Using these weightings, Gillespie showed that for replacement substitutions in 20 loci, R(T) ranged from 0.13 to 43.82 with a mean of 6.95 (GILLESPIE 1991 Down, p. 119). Silent sites at these same loci had an average R(T) of 4.64. Gillespie concluded that R(T) was clearly statistically significant for replacement sites, but was perhaps only marginally significant for silent sites, due to the bias introduced by use of correction formulas.

GOLDMAN 1994 Down quantified the extent of the error lineage effects might have introduced in the early estimates of R(T). He further noted that Gillespie's simulations of his weighting factor solution had not been as extensive as they could have been. Nielsen more than made up for this lack of simulations (NIELSEN 1997 Down) and further showed that the fourth problem due to correction formulas was not very large as long as the sequences were not too close to saturation.

By 1995 enough data had been gathered to examine 49 mammalian loci. Using Gillespie's weighting factor, OHTA 1995 Down showed that, averaged over all these loci, R(T) > 5 for both silent and replacement sites. These values were more than large enough to reject simple neutrality. So taken as a whole, the evidence from mammals is compelling. The observed index of dispersion is larger than one for both silent and replacement sites. This "overdispersion" is not due to lineage effects and is not an artifact of correction formulas. The conclusion is inescapable. For mammals, the most simple neutral theory of molecular evolution does not explain protein divergence data.

A recent study of Drosophila (ZENG et al. 1998 Down) has cast some doubt on whether conclusions drawn from mammalian data should necessarily be applied to all life. Zeng and colleagues examined 24 proteins from three species of Drosophila, Drosophila pseudoobscura, D. subobscura, and D. melanogaster. D. pseudoobscura and D. subobscura are relatively closely related, with D. melanogaster a more distantly related out-group. They found that, using Gillespie's weighting factor, averaged over these 24 loci, R(T) was 4.37 for silent sites but only 1.64 for replacement sites. The value obtained for silent sites was qualitatively in agreement with Ohta's value for mammals, but the replacement values were much smaller and not statistically different from 1. Unfortunately, interpretation of the replacement site results is confounded by extremely low replacement divergence between D. pseudoobscura and D. subobscura (see DISCUSSION below).

It now seems clear that mammalian loci are, on average, overdispersed at both silent and replacement sites. In Drosophila, it is likely that silent sites are overdispersed, but replacement sites might not be. In any case, because the most simple neutral theory can never produce an R(T) > 1, it is of interest to know which models of molecular evolution can produce a large index of dispersion. Several models have been suggested. They include episodic selection on a mutational landscape (GILLESPIE 1984A Down, GILLESPIE 1984B Down, GILLESPIE 1991 Down), the fluctuating neutral space model (TAKAHATA 1987 Down, TAKAHATA 1989 Down), and the house of cards (HOC) model of slightly deleterious mutations (OHTA and TACHIDA 1990 Down; TACHIDA 1991 Down; IWASA 1993 Down; GILLESPIE 1994B Down; TACHIDA 1996 Down; ARAKI and TACHIDA 1997 Down). In addition to these particular models, there is an extensive set of simulations by GILLESPIE 1993 Down, GILLESPIE 1994A Down, GILLESPIE 1994B Down, which tried to characterize R(T) for numerous models. With these simulations, Gillespie showed that fluctuating selection could account for a high index of dispersion, but only if the fluctuations occurred very slowly, roughly at the same rate fixations happened (GILLESPIE 1993 Down). In addition, he found that symmetric underdominance (GILLESPIE 1994A Down), optimizing selection, and the house of cards model (GILLESPIE 1994B Down) could all produce R(T) > 1, but only in a very narrow range of parameter values. He found that exponential and gamma shift models of deleterious sites produced R(T) {approx} 1 (GILLESPIE 1994B Down) (Table 1).


 
View this table:
In this window
In a new window

 
Table 1. Previously described models

In other simulations, GILLESPIE 1994A Down showed that rapidly fluctuating selection not only failed to explain a large index of dispersion, but actually produced an R(T) < 1. This result was particularly surprising. Put only a little bit facetiously, Gillespie took a neutral model, added some random fluctuations to it, and derived a process that was less random than the one he started with. Other models also produced R(T) < 1, including symmetric overdominance and normal shift models. Gillespie attempted to develop some insight concerning how a model might produce an R(T) < 1 (GILLESPIE 1993 Down), but his insight was built by considering an infinite allele, not an infinite site model. It is argued below that infinite site models behave substantially differently than infinite allele models, and his result is only applicable to the latter. The mechanism by which an infinite site model could ever produce an R(T) < 1 has not yet been suggested.

The goals of this article are threefold: first, to describe the mathematical machinery necessary to analyze the index of dispersion for an infinite site model of the gene and second, using this machinery, to describe which models will produce R(T) > 1, which will produce R(T) < 1, and which will produce R(T) {approx} 1. Finally, from our observation that R(T) appears to be >5 for mammalian data, this article attempts to discover what we can infer about the nature of mammalian evolution.


*  CALCULATION OF R(T)
*TOP
*ABSTRACT
*CALCULATION OF R(T)
*SLOWLY CHANGING ENVIRONMENT
*UNDERSTANDING MODELS WITH...
*INTERACTION BETWEEN SITES
*EXCHANGEABLE ALLELES
*HOUSE OF CARDS
*OPTIMUM MODEL
*SHIFT MODELS
*INFINITE ALLELE MODELS
*A DELETERIOUS MUTATIONS MODEL
*DISCUSSION
*LITERATURE CITED

A substitution is a mutation that ultimately fixes in the population. There are two different processes that might be called the substitution process. One process, the origination process (GILLESPIE 1994A Down), is the point process of the times of entry of those mutations that ultimately fix in the population. The other process, the fixation process (GILLESPIE 1994A Down), is the point process of the times when mutations, which ultimately fix, first reach frequency one. This article is concerned only with the origination process.

To derive the index of dispersion of the origination process begin by assuming that the gene contains an infinite number of sites and by assuming that there is no recombination between those sites (WATTERSON 1975 Down). Assume that time is discrete and that population size is constant and equal to N < {infty} haploid individuals. Let the population reproduce according to a discrete time Moran model (MORAN 1958 Down). The mutation process and site frequency dynamics are assumed to be stationary, so that translations of the time axis do not effect the origination process. Let Mt equal one if there is a mutation at time t, and equal zero otherwise. Let St equal one if there is an origination at t, and zero otherwise. It can be shown that ratio of the variance in the number of originations, divided by the mean number of originations in T time steps, R(T), is given by

(1)

(CUTLER 2000 Down), where {rho} is the origination rate, {rho} = E{St} = Pr{St = 1}, and h(t) is the conditional intensity function defined by

(2)

Equation 1 and Equation 2 are discrete time analogs of results given by COX and ISHAM 1980 Down(Equations 2.27 and 1.19). It can be shown that (2) can be rewritten (CUTLER 2000 Down) as

(3)

where E{Xt} is the expected frequency of a mutant t time steps after it enters the population, and (t) is the amount of interaction between sites separated by t time units, defined by

(4)

where p is the probability of fixation of a new mutant, p = E{St|Mt = 1}, and jt on i0 is the condition of a mutant arising at time t on a piece of DNA containing a mutant that arose at time 0.

If h(t) converges to {rho} sufficiently quickly, so that {Sigma}{infty}t=1t(h(t) - {rho}) < {infty}, then for large T, R(T) can be approximated by

(5)


(6)

where Ds = {Sigma}{infty}1(h(t) - {rho}). The approximation uses the fact that {rho} << 1. The sign of Ds determines whether the substitutional process is overdispersed (Ds > 0), underdispersed (Ds < 0), or indistinguishable from a neutral model (Ds = 0). Thus Ds can be thought of as the deviation in R(T) from a simple neutral model. It turns out that Ds can be calculated directly for a few simple models. Even when direct calculation of Ds is difficult, its sign and relative magnitude can often be estimated.

A few comments concerning the conditional intensity function should be made. It is defined to be the product of three terms, the probability there is a mutation at time 0, given a mutation at time t, Pr{M0 = 1|Mt = 1}, the expected frequency of a mutant t time units after it entered the population, E{Xt}, and the amount of interaction between mutants separated by t time steps, (t). The amount of interaction between mutants, (t), is defined to be the probability of fixation of a mutant, given that it occurred on a piece of DNA containing a mutation that entered the population t time steps earlier, Pr{St = 1|jt on i0}, divided by the unconditional probability of fixation of a mutant p. A more complete description of (t) is given below.


*  SLOWLY CHANGING ENVIRONMENT
*TOP
*ABSTRACT
*CALCULATION OF R(T)
*SLOWLY CHANGING ENVIRONMENT
*UNDERSTANDING MODELS WITH...
*INTERACTION BETWEEN SITES
*EXCHANGEABLE ALLELES
*HOUSE OF CARDS
*OPTIMUM MODEL
*SHIFT MODELS
*INFINITE ALLELE MODELS
*A DELETERIOUS MUTATIONS MODEL
*DISCUSSION
*LITERATURE CITED

Virtually any model containing a key parameter that changes on a sufficiently slow time scale can explain the observed index of dispersion. Other work has shown (CUTLER 2000 Down) that if either the mutation rate or the probability of fixation changes as slow as, or slower than, the average time between fixation of sites, then the index of dispersion can be elevated significantly above one. GILLESPIE's (1993) simulations confirm that models of a slowly fluctuating environment can produce large R(T)'s regardless of the details of the model.

Despite the fact that slowly changing parameters can cause R(T) to be large, simply invoking slow change appears to be an incomplete explanation of R(T). If one assumes that the time between substitutions is measured in millions of years, one must also assume that the environment changes on the time scale of millions of years. At first glance, it is not obvious that any environmental process has this property. Environmental processes that are often considered slow (for instance glaciation) are usually orders of magnitude faster than would be required here. To fully explain R(T), a mechanism would need to be suggested that could cause a key parameter to change so slowly. Without such a mechanism, a slowly changing environment appears a somewhat hollow explanation.

Takahata's fluctuating neutral space (FNS) model (TAKAHATA 1987 Down, TAKAHATA 1989 Down) could provide a possible mechanism. At the heart of the FNS model is the notion that each mutation changes the subsequent mutation rate for a given piece of DNA. This process is difficult to model exactly [see CUTLER 2000 Down for an attempt], so it is often approximated by a model where mutation rate changes with each substitution (TAKAHATA 1987 Down, TAKAHATA 1989 Down; CUTLER 2000 Down). Whether this change occurs at the moment a mutant first reaches frequency one (i.e., corresponding to events in the fixation process), or at the moment a mutant destined to fix first enters the population (i.e., corresponding to events in the origination process) is often left obscured by the coarseness of the approximations used in the analysis (TAKAHATA 1987 Down; CUTLER 2000 Down). Regardless of the modeling details, the FNS model has the property the mutation rate must change on the same time scale as molecular evolution. Therefore, the FNS model is capable of generating large R(T) values. Several results on the FNS model have been obtained.

First, for the FNS model to generate large values of R(T), there must be more than two possible mutation rates (CUTLER 2000 Down). Second, when new mutation rates are picked independently of previous rates, R(T) = 5 implies that sequences that differ by only a single site will have mutation rates that differ by an order of magnitude 2–5% of the time (depending on the details of the distribution of mutation rates; CUTLER 2000 Down). Processes where new mutation rates are not independent of previous rates are difficult to analyze (TAKAHATA 1989 Down), but can produce large values of R(T), if the process has a sufficient amount of time to evolve (TAKAHATA 1989 Down).


*  UNDERSTANDING MODELS WITH SELECTION
*TOP
*ABSTRACT
*CALCULATION OF R(T)
*SLOWLY CHANGING ENVIRONMENT
*UNDERSTANDING MODELS WITH...
*INTERACTION BETWEEN SITES
*EXCHANGEABLE ALLELES
*HOUSE OF CARDS
*OPTIMUM MODEL
*SHIFT MODELS
*INFINITE ALLELE MODELS
*A DELETERIOUS MUTATIONS MODEL
*DISCUSSION
*LITERATURE CITED

Many models make the assumption that the mutation process has a constant rate. If {nu}(t) = {nu}, then

(7)

If there is little interaction between sites ((t) {approx} 1), then (7) further reduces to

(8)

where Ds1 = 2{nu}{Sigma}{infty}t=1(E{Xt} - p) can be thought of as the deviation of R(T) due to selection in the absence of mutation interactions. In many cases, understanding Ds1 is the key to understanding selection's effect on the index of dispersion.

The expected frequency of a neutral mutation does not change over time; E{Xt1} = E{Xt2} = p for all t1 and t2. Nonneutral mutations do not necessarily have this property. Ds1 measures the effect a changing expected frequency has on the index of dispersion. A simple rule of thumb results. In the absence of site interactions, deleterious mutations cause R(T) > 1, and advantageous mutations cause R(T) < 1. The magnitude of the effect can be made quite large.

The overall sign of Ds1 is obviously determined by the sign of the E{Xt} - p. If the expected frequency of mutations does not change over time, then Ds1 = 0. If the expected frequency of sites monotonically declines over time, then Ds1 > 0 [because E{Xt} >= E{X{infty}} = p, E{Xt} - p >= 0]. Conversely, if the expected frequency monotonically increases, then Ds1 < 0. An interesting unsolved problem is to describe which models of molecular evolution have the property that the expected frequency of mutations is monotonic over time. A natural conjecture (and one that is consistent with the simulations performed here) is that any stationary model has this property.

Apart from a simple one-locus, two-allele Fisher-Wright world, there is some difficulty defining what is meant by a deleterious/advantageous mutation. For the purposes of this article, a particular mutation will be said to be deleterious/advantageous if its expected frequency decreases/increases over time. A model will be said to be a deleterious/advantageous mutation model if, averaged over all possible mutants, the expected frequency of mutants decreases/increases. Note that the definition of deleterious/advantageous mutation model is a property of the mutations, not the originations. Thus, a model will be called a deleterious site model if the majority of mutations decline in frequency, but this statement implies nothing at all about the fitness of the sites that actually fix. It is a statement about the average properties of mutants, not a statement on the properties of those rare mutants who eventually fix.

Thus, we arrive at the conclusion that, in the absence of site interactions, deleterious mutation models have an R(T) > 1, and advantageous mutation models have an R(T) < 1. Although this result is clear as stated, the intuition concerning why it's true may be less obvious.

Mutations do not necessarily fix one at a time. Ds1 can be thought of as measuring the effect the size and frequency of multiple fixations has on the index of dispersion. To see this, write Ds1 as:

(9)

Consider the piece of DNA that reproduces at time step 0. Call this piece of DNA i0. i0 may contain a mutation from time step -1. The probability that there was a mutation at time -1 is {nu}. The probability that i0 contains this mutation is E{X1} (BIRKY and WALSH 1988 Down; CUTLER 2000 Down). Because no more than one mutation can occur in a single time step, {nu}E{X1} is the expected number of mutations on i0 from time step -1. Similarly {nu}E{X2} is the expected number of mutations from time step -2. In general, the first sum in (9) is the expected number of mutations on i0.

In a neutral model, the expected frequency of a site does not change over time. So, for a neutral model E{Xt} = p for all t. Thus, the second sum in Equation 9 is what the expected number of mutations on i0 would be, if this were a neutral model with probability of fixation p. Therefore, Ds1/2 is equal to the expected number of mutations on i0 minus the expected number of mutations under a neutral model. For a deleterious site model E{Xt} > p, so that the first sum is bigger than the second, and, on average, there are"too many" mutations on i0, relative to a neutral model with the same probability of fixation. Conversely, in an advantageous model E{Xt} < p, so that there are "too few" mutations on i0 relative a neutral model with the same probability of fixation.

Finding Ds1 directly for any particular model is not trivial. Other than for the neutral case, it is not obvious that E{Xt} is ever easy to calculate. For models that may be approximated with a diffusion, finding E{Xt} amounts to solving a Kolmogorov backward equation. If a two-allele model is an adequate approximation, the problem can also be formulated as an ordinary differential equation (OHTA and KIMURA 1969 Down), but truncation of higher-order moments is often necessary. Despite these difficulties, Ds1 can be directly measured in a simulation.

To estimate Ds1 within a simulation, a single extra vector, call it DS[0 ... R], needs to be stored, where R is a number sufficiently large so that all sites are extremely likely to be fixed or lost within R generations (R = 1000N was used in the simulations for this article). Initialize the DS vector to 0. During the simulation, track the frequency of each site in all generations. For each mutation add its frequency t generations after it entered the population to the value stored in DS[t]. When the simulation is done, divide each element of DS by the total number of mutations. Estimate Ds1 by Ds1 = 2{nu}({Sigma}Rt=0DS[t] - DS[R]).

Finally, one might ask if there is any general intuition on the effects of the overall mutation rate and overall strength of selection on Ds1. As is obvious from Equation 8, Ds1 is independent of time. Also, Ds1 appears to be a linear function of the mutation rate. For small mutations rates, this may be roughly true, but for large 2{nu} the linear dependence must disappear. The reason for this is that E{Xt} and p must also be a function of 2{nu}, because 2{nu} effects, among other things, the overall heterozygosity of the population and its mean fitness. As a result, Ds1 is unlikely to depend on 2{nu} in a simple linear manner.

By analogy to a classical Fisher-Wright model, one can imagine changing the strength of selection. This can have two effects on Ds1. First, it can change the probability of fixation, p, thereby making p closer to/further from the initial frequency of a new mutant (1/N), thereby decreasing/increasing |Ds1|. Second, when the strength of selection changes, the time it takes for the expected frequency to reach p will also change. Increasing selection decreases the time, so that |Ds1| decreases. Decreasing selection increases the time, so that |Ds1| increases. Predicting the net effect is difficult. In all the simulations, increasing selection usually increased |Ds1|, and never significantly decreased it, but for very strong selection, Ds1 generally appeared to approach some asymptote.


*  INTERACTION BETWEEN SITES
*TOP
*ABSTRACT
*CALCULATION OF R(T)
*SLOWLY CHANGING ENVIRONMENT
*UNDERSTANDING MODELS WITH...
*INTERACTION BETWEEN SITES
*EXCHANGEABLE ALLELES
*HOUSE OF CARDS
*OPTIMUM MODEL
*SHIFT MODELS
*INFINITE ALLELE MODELS
*A DELETERIOUS MUTATIONS MODEL
*DISCUSSION
*LITERATURE CITED

Consider a mutant that enters the population at the current time step, t. Call this mutation jt. The probability that jt ultimately fixes is p. When jt entered the population, it arose on some piece of DNA. The piece of DNA might contain other mutations. Pr{St = 1|jt on i0} is the probability that jt fixes, given that the piece of DNA on which it arose contains another mutant, i0, which entered the population at time zero. If knowing that the piece of DNA contains an earlier mutation does not effect jt's chance of fixation, then Pr{St = 1|jt on i0} will equal p, and (t) = 1. When (t) = 1 we say there is no interaction between mutants. On the other hand, when the knowledge that jt arose on a piece of DNA containing i0 alters the probability that jt fixes, we say that there is interaction between mutants, and (t) != 1.

There are at least two fundamental ways in which mutants can interact. We call these two ways direct and indirect interactions. For many models of natural selection, the fitness of a piece of DNA is proportional to the number of mutations that it contains. For instance, in the negative gamma shift model (described below), when a new mutation enters the population, the fitness of the piece of DNA on which it arose is equal to its fitness prior to the mutation, minus a gamma-distributed random variable. When the fitness of a piece of DNA is a function of the number of mutations contained on the piece of DNA, we say that mutations directly interact with one another.

For other models of evolution, the fitness of a piece of DNA containing a new mutation is independent of the number of previous mutations. In the house of cards model, the fitness of a piece of DNA containing a new mutation is drawn independently from some fixed [often Gaussian (GILLESPIE 1994B Down; TACHIDA 1996 Down)] distribution. Thus, the fitness of a piece of DNA containing a new mutation is independent of the number of earlier mutations it contains. In this case, we say there is no direct interaction between mutations. Nevertheless, knowledge that a piece of DNA contains an earlier mutation can still indirectly effect the mutation's chance of fixation.

If one knows that jt arose on a piece of DNA containing i0, one has some information about the state of the population. In particular, one suspects that i0's expected frequency, conditional on jt arising on i0, is higher than its unconditional expected frequency. The knowledge that i0 is expected to be at higher frequency may, in turn, suggest something about the expected mean fitness of the population. The expected mean fitness of the population may, in turn, suggest something about the probability that jt will fix. In general, when i0 effects jt's probability of fixation through one or more intermediaries (like population mean fitness), we say that i0 and jt indirectly interact. Virtually all nonneutral models should have some form of indirect interactions, although we suspect that models that produce relatively constant population mean fitnesses might have relatively negligible indirect interactions.

A simple rule of thumb can be applied to site interactions. In general, direct interactions tend to move R(T) toward one; indirect interactions tend to move R(T) away from one. This can be seen by considering a few simplified cases.

Consider an advantageous mutation model where the fitness of a piece of DNA with k mutations is equal to 1 + ks, s > 0. This is, by definition, a model of direct interactions. If interactions were absent, R(T) would be <1. Direct calculation of (t) is hard, but it has to be >1. Because all mutations are advantageous, the probability of a site fixing, given that it arises on a piece of DNA containing another mutant, must be larger than its unconditional probability of fixation, because its fitness is higher. So, (t) > 1, which implies that when E{Xt} < p, (t)E{Xt} > E{Xt}. Thus, when mutations are beneficial, direct interactions move R(T) toward 1.

The converse is true for the deleterious model with direct interactions. If the fitness of a sequence with k mutations is 1 - ks, s > 0, then E{Xt} > p, and in the absence of interactions, R(T) would be >1. But, because each additional mutation lowers the fitness of a piece of DNA, Pr{St = 1|jt on i0} must be less than p, and (t) must be <1. Thus, this form of direct interaction must move R(T) toward 1.

Indirect interactions often have the opposite effect. Consider a mutation jt, which enters the population at time t, on a piece of DNA containing an earlier mutation i0. Conditional on jt landing on a piece of DNA containing i0, i0's expected frequency is likely to be higher than its unconditional expected frequency. If this is a deleterious mutation model, i0's higher conditional expected frequency suggests that the conditional expected population mean fitness is likely to be lower than the unconditional expectation. Given that jt arose at a time when the conditional population mean fitness is expected to be lower than the unconditional expected fitness, jt's probability of fixation is likely to be higher, thereby making (t) > 1. Conversely, if this is an advantageous mutation model, jt arising on i0 suggests that the conditional expected population mean fitness may be higher than the unconditional average, making it likely that jt's probability of fixation is lower than the unconditional average, so that (t) < 1. Therefore, indirect interactions are likely to increase R(T) for deleterious site models and decrease R(T) for advantageous ones.


*  EXCHANGEABLE ALLELES
*TOP
*ABSTRACT
*CALCULATION OF R(T)
*SLOWLY CHANGING ENVIRONMENT
*UNDERSTANDING MODELS WITH...
*INTERACTION BETWEEN SITES
*EXCHANGEABLE ALLELES
*HOUSE OF CARDS
*OPTIMUM MODEL
*SHIFT MODELS
*INFINITE ALLELE MODELS
*A DELETERIOUS MUTATIONS MODEL
*DISCUSSION
*LITERATURE CITED

GILLESPIE 1994A Down performed an extensive set of simulations of exchangeable allele models. His results can be summarized as follows: symmetrical overdominance, TIM (Takahata, Ishii, and Matsuda model; TAKAHATA et al. 1975 Down), and SAS-CFF (GILLESPIE 1978 Down) all produced R(T) < 1, and symmetrical underdominance produced R(T) > 1. These results should be expected.

Symmetrical over-/underdominance is characterized by individuals who are homozygous for all sites of the locus having fitness 1. Individuals who are heterozygous for even a single site have fitness 1 + s, where s is fixed and greater than zero for overdominance and less than zero for underdominance. The mutational model is assumed to be Poisson, with constant rate {nu}.

If mutation interactions were absent, the overdominance model would produce an R(T) < 1, and underdominance would lead to R(T) > 1. When a mutation enters the population, the piece of DNA on which it arose will be in heterozygotes for at least its first few generations. Therefore, this piece of DNA will have higher than average fitness during this time, and E{Xt} will be an increasing function for this time. Similarly, a new underdominant mutant will have a lower than average fitness, and E{Xt} will be a decreasing function at first. Whether this pattern continues (overdominance mutants increase in expected frequency; underdominance mutants decline) for the entire time a mutant segregates in the population remains an open analytical question. It is clear from GILLESPIE's (1994a) simulations, and the ones done here, that E{X{infty}} > 1/N for overdominance models, and E{X{infty}} < 1/N for underdominance models. Thus, one suspects that this pattern of expected frequency change may hold the entire time a mutant segregates. It is certain, for the parameter values examined in this study, in every simulation E{Xt} <= E{Xt+1} for all overdominance models, and E{Xt} >= E{Xt+1} for all underdominance ones.

There is no direct interaction between sites in the over-/underdominance model, because each new mutation makes the piece of DNA distinguishable from all other alleles, regardless of the number of previous mutations. Because our intuition suggests that indirect interactions will be often accomplished through changes in population mean fitness, there are likely to be only small amounts of indirect interactions in the overdominance model, because Gillespie has shown that the homozygosity, and as a result mean fitness, changes very little over time. There is likely to be a great deal more indirect interaction in the underdominance model, because this model does not maintain polymorphism, and there are significant changes in mean fitness as sites go to fixation. Indirect interactions should reinforce the effects of advantageous mutants, making R(T) < 1 + Ds1 for the overdominance model (but only slightly because strong interactions are unlikely), and making R(T) > 1 + Ds1 for the underdominance model.

Even though direct calculation of R(T) is difficult for the over-/underdominance model, Ds1 can be estimated from simulation. The basic simulation procedure is described in GILLESPIE 1994A Down, and Ds1 is estimated from the simulation as described above. Several conclusions result. First, 1 + Ds1 does an extraordinarily accurate job of predicting R(T) for the overdominance model, suggesting that indirect interactions are, in fact, very small. Second, it correctly predicts that the underdominance model has R(T) > 1, but underestimates the magnitude of R(T), but this underestimate is in the expected direction, given that indirect interactions should exist (Fig 1).



View larger version (13K):
In this window
In a new window
Download PPT slide
 
Figure 1. The symmetric overdominance and underdominance models. For all simulations, population size is 100 diploids. Mutation rate is 0.005. For each parameter value, the simulation was run for 2000 substitutions before any records were kept. After these initial 2000 substitutions were "burnt off," the simulation was tracked until 300,000 substitutions had occurred. All mutations were followed for 100,000 generations to estimate Ds1. R(T) was estimated from C0 in the simulation (see GILLESPIE 1993 Down). For a renewal process, R(T) = C0. For origination processes that do not form a renewal process (for instance the underdominance model), R(T) = C0 + 2 {Sigma}{infty}i=1 Ci. Nevertheless, C0 is still used as an estimator for R(T) for all figures (except Fig 14), because C0 {approx} C0 + {Sigma}100i=1 Ci. This suggests that for most processes examined, whether or not the process is strictly renewal, the higher-order covariances do not significantly contribute to R(T). Heterozygotes have fitness 1 + s. Homozygotes have fitness 1.

The underdominance model can probably account for R(T) > 5, but this is difficult to show in simulation, because the origination rate goes to zero very rapidly as Ns gets below -4. Interpolating from the graph, there appears to be a narrow range of Ns, perhaps -8 < Ns < -4, with a large, but not astronomical, R(T) that could account for the observed values, but with an overall rate of evolution that is much lower than the neutral rate.

TIM and SAS-CFF are both models of a rapidly fluctuating environment, and understanding their behavior requires slightly more conjecture. There are no direct interactions between sites in either of these models, but the magnitude of indirect interactions is difficult to predict. Gillespie has shown in simulation that E{X{infty}} > E{X0}, which is consistent with expected site frequencies increasing over time. Gillespie also found that R(T) < 1 for all these simulations, which is also consistent with expected site frequencies increasing over time. Nevertheless, actually demonstrating that expected site frequencies increase over time is a formidable problem. One is, however, fairly convinced of this by comparing simulated R(T) with 1 + Ds1, as estimated in these simulations. Simulation details can be found in GILLESPIE 1994A Down. Results are similar to the over-/underdominance case. For the TIM model, which will not maintain polymorphism in an infinite population and is therefore more likely to experience significant mean fitness fluctuation and as a result more indirect interactions, 1 + Ds1 does a qualitatively good job of predicting R(T), but there is some room for quantitative improvement. For the SAS-CFF models with a balancing component to selection (B > 1; note that this also implies a stationary frequency distribution for the finite allele diffusion and therefore is less likely to have a significant indirect interaction component), 1 + Ds1 is an extremely accurate predictor of R(T) (Fig 2 Fig 3 Fig 4 Fig 5 Fig 6).



View larger version (11K):
In this window
In a new window
Download PPT slide
 
Figure 2. TIM model (TAKAHATA et al. 1975 Down). A model of a fluctuating environment. Selection coefficients are drawn from a normal distribution with variance {sigma}2. No balancing component to selection.



View larger version (12K):
In this window
In a new window
Download PPT slide
 
Figure 3. SAS-CFF model, B = 2 (GILLESPIE 1978 Down). A model of a fluctuating environment. Selection coefficients are drawn from a normal distribution with variance {sigma}2. Higher values of B indicate a stronger balancing component to selection.



View larger version (11K):
In this window
In a new window
Download PPT slide
 
Figure 4. SAS-CFF model, B = 5 (GILLESPIE 1978 Down). A model of a fluctuating environment. Selection coefficients are drawn from a normal distribution with variance {sigma}2. Higher values of B indicate a stronger balancing component to selection.



View larger version (11K):
In this window
In a new window
Download PPT slide
 
Figure 5. SAS-CFF model, B = 10 (GILLESPIE 1978 Down). A model of a fluctuating environment. Selection coefficients are drawn from a normal distribution with variance {sigma}2. Higher values of B indicate a stronger balancing component to selection.



View larger version (12K):
In this window
In a new window
Download PPT slide
 
Figure 6. SAS-CFF model, B = 20 (GILLESPIE 1978 Down). A model of a fluctuating environment. Selection coefficients are drawn from a normal distribution with variance {sigma}2. Higher values of B indicate a stronger balancing component to selection.


*  HOUSE OF CARDS
*TOP
*ABSTRACT
*CALCULATION OF R(T)
*SLOWLY CHANGING ENVIRONMENT
*UNDERSTANDING MODELS WITH...
*INTERACTION BETWEEN SITES
*EXCHANGEABLE ALLELES
*HOUSE OF CARDS
*OPTIMUM MODEL
*SHIFT MODELS
*INFINITE ALLELE MODELS
*A DELETERIOUS MUTATIONS MODEL
*DISCUSSION
*LITERATURE CITED

The house of cards model of molecular evolution is the most thoroughly analyzed (OHTA and TACHIDA 1990 Down; TACHIDA 1991 Down, TACHIDA 1996 Down; IWASA 1993 Down; GILLESPIE 1994B Down; ARAKI and TACHIDA 1997 Down) and widely accepted (NACHMAN et al. 1994 Down; MORAN 1996 Down; OHTA and GILLESPIE 1996 Down) model of molecular evolution with the possibility of accounting for a large index of dispersion. The HOC model achieves a high index of dispersion through deleterious sites and an enormous indirect interaction component. There is no direct interaction in this model.

In its most common form, the HOC model assumes that the fitness of any mutation is picked from a normal distribution with mean 0 and variance {sigma}2. Under a small mutation rate assumption, it has been shown that the fitnesses of the most recently fixed sites can be thought of as a Markov process with a stationary distribution that is approximately Gaussian with mean 2N{sigma}2 and variance {sigma}2 (GILLESPIE 1994B Down; TACHIDA 1996 Down). Because the fitness of the most recently fixed site has mean 2N{sigma}2, and new mutations have mean fitness 0, we can think of the relative fitness of new mutations as a normally distributed random variable with mean -2N{sigma}2. Therefore, the vast majority of mutations have to be deleterious, so Ds1 > 0, and in the absence of interaction, R(T) > 1.

In fact, indirect interactions can lead R(T) to be vastly largely than one. To a first approximation, the mean fitness of the population is equal to 1 + s*, where s* is the selection coefficient of the most recently fixed site. Therefore, the population mean fitness must fluctuate, and these fluctuations must occur slowly (in fact, on the exact same time scale as molecular evolution). Because mean fitness fluctuates, indirect interactions are expected. Because mean fitness fluctuates on the same time scale as molecular evolution, the indirect interaction component must be large. Putting together deleterious sites with large indirect interactions leads to the prediction that R(T) > 1, and perhaps much greater (Fig 7). As expected, 1 + Ds1 > 1, but not much greater. 1 + Ds1 never rises above 1.5, despite R(T) growing to nearly 500. The indirect interaction component is enormous, though.



View larger version (10K):
In this window
In a new window
Download PPT slide
 
Figure 7. House of cards model (OHTA and TACHIDA 1990 Down). The fitness of a piece of DNA with a new mutation is drawn from a normal distribution with mean 0 and variance {sigma}2.

The house of cards model can account for an index of dispersion >5, but only when 0.5 < N{sigma} < 2. This is an incredible parameter sensitivity. For N{sigma} < 0.5, the house of cards is essentially a neutral model. For N{sigma} = 2, the index of dispersion is well into the hundreds. It is difficult to simulate N{sigma} > 3, because the origination rate is so slow.


*  OPTIMUM MODEL
*TOP
*ABSTRACT
*CALCULATION OF R(T)
*SLOWLY CHANGING ENVIRONMENT
*UNDERSTANDING MODELS WITH...
*INTERACTION BETWEEN SITES
*EXCHANGEABLE ALLELES
*HOUSE OF CARDS
*OPTIMUM MODEL
*SHIFT MODELS
*INFINITE ALLELE MODELS
*A DELETERIOUS MUTATIONS MODEL
*DISCUSSION
*LITERATURE CITED

The optimum model is a simple model of purifying selection. All mutations are assigned a phenotype drawn from a zero mean, unit variance normal distribution. The fitness function is quadratic with a maximum at zero. A single parameter {sigma} measures the width of the fitness function [see GILLESPIE 1994B Down for further details on this model]. It is obvious that most mutations are deleterious; therefore, in the absence of mutation interaction, one expects R(T) > 1 and Ds1 > 0. It is also clear that, like the HOC model, there are no direct interactions, but there are indirect interactions caused by the mean fitness of the population changing with each fixation (Fig 8). One sees roughly equal contributions from Ds1 and indirect interactions. Also, note that R(T) grows very slowly with increasing selection. As selection increased by a factor of 2 from {sigma} = 0.05 to {sigma} = 0.1, R(T) rose by only 1%. Over a wide range of parameter values (results not shown), the optimum model always has difficulty producing an R(T) as large as 5.



View larger version (12K):
In this window
In a new window
Download PPT slide
 
Figure 8. Optimum model (GILLESPIE 1994B Down). All mutations are assigned a phenotype drawn from a zero mean, unit variance normal distribution. The fitness function is quadratic with a maximum at zero. {sigma} measures the width of the fitness function.


*  SHIFT MODELS
*TOP
*ABSTRACT
*CALCULATION OF R(T)
*SLOWLY CHANGING ENVIRONMENT
*UNDERSTANDING MODELS WITH...
*INTERACTION BETWEEN SITES
*EXCHANGEABLE ALLELES
*HOUSE OF CARDS
*OPTIMUM MODEL
*SHIFT MODELS
*INFINITE ALLELE MODELS
*A DELETERIOUS MUTATIONS MODEL
*DISCUSSION
*LITERATURE CITED

Shift models are a perfect example of why performing simulations in the absence of theory can lead to entirely uninterpretable results. At first glance, the shift story looks simple. Gamma and exponential shifts yield R(T) {approx} 1; normal shifts lead to R(T) < 1 (GILLESPIE 1994B Down). Without theory, one might be tempted to say that the gamma and exponential shift models are "neutral" like, and therefore might be easy to analyze. Nothing could be further from the truth. Shift models are quite complicated and provide the clearest example of how direct interactions move R(T) toward 1.

The gamma and exponential shift models can be further subdivided into positive and negative shifts. In the negative-shift models (the only kind that has received significant theoretical attention; OHTA 1977 Down; KIMURA 1979 Down; GILLESPIE 1987 Down, GILLESPIE 1994B Down) the fitness of a new mutation is equal to the fitness of its parent's sequence, minus a gamma or exponentially distributed random variable (gamma distribution is the gamma shift, exponential is the exponential). In the positive shifts, the random variable is added, not subtracted from the fitness.

Qualitatively analyzing Ds1, in the absence of site interactions, for gamma or exponential shifts is easy. For negative shifts, each new mutation has, on average, a fitness lower than the mean fitness of the population; hence mutations are on average deleterious and Ds1 > 0. Similarly, positive shifts are advantageous and Ds1 < 0. Direct interactions qualitatively change this picture.

Shift models fundamentally differ in their mode of site interaction from all other models of evolution that we have so far considered. In all other models, the fitness of a sequence is essentially independent of the number of mutations it contains. In shift models, the fitness of a piece of DNA is directly proportional to the number of mutations it contains. Thus, sites directly interact with one another, so R(T) should be closer to one.

One can attempt to crudely estimate this effect. Consider, (t) = Pr{St = 1|jt on i0}/p. In words, (t) is the probability a mutant, jt, which entered the population at time t, fixes given it arose on a piece of DNA containing a mutant, i0, that entered at time 0, divided by the probability that jt fixes. The analysis is done by considering two cases.

In the first case jt arises on a piece of DNA containing i0, and i0 is still segregating in the population. In the second case jt arises only after i0 has been fixed (because jt is on i0, i0 cannot have been lost before time t). Therefore,

The first approximation is to assume that Pr{j fixes|i fixed before t} {approx} p. In other words, if i0 has been fixed before jt enters the population, then i0 has little effect on jt. Because this is an attempt to capture direct interactions, the approximation essentially amounts to assuming that if i0 is fixed, it contributes equally to the fitness of all alleles. Using this approximation,

Consider the ratio of probabilities in the first term. This term is the probability that jt fixes, given that it arose on a piece of DNA containing another segregating mutant, i0, divided by the probability that jt fixes. Thus, it is the ratio of the probability of fixation of a piece of DNA with at least two segregating mutants divided by the probability of fixation of a piece of DNA with at least one segregating mutant. Loosely, it is the probability of fixation of a piece of DNA with two mutants divided by the probability of fixation of a piece of DNA with one mutant. The question is, How does the extra mutant effect jt's probability of fixation? Direct calculation appears very difficult, but by considering a two-allele diffusion, an approximation may be found.

Consider a simple two-allele diffusion, where the fitnesses of the genotypes A1A1, A1A2, and A2A2 are 1, 1 + s/2, and 1 + s, respectively. The probability of fixation of a new mutant A2 is given by EWENS 1979 Down(p. 147):

(10)

Under a model of direct interactions, think of the fitness of a piece of DNA with only jt on it as 1 + s. Think of the fitness of a piece of DNA with both jt and i0 on it as 1 + 2s. Therefore, by analogy to the two-allele diffusion, an approximation for Pr{j fixes|jt on i0, i segs at t}/p might be {pi}(2s)/{pi}(s). For small values of Ns, {pi}(2s)/{pi}(s) may be further approximated by 2N{pi}(s) (see Fig 9). Noting that Equation 1 was derived for a haploid model with population size N, this suggests approximately Pr{j fixes|jt on i0, i segs at t}/p with the very simple Np. Plugging this approximation into (7),

(11)

where Ds2 = 2{nu}{Sigma}{infty}t=1(E{Xt} - Pr{Xt} = 1})(Np - 1). Quick examination shows that Ds2, at least qualitatively, captures the effect of direct interactions. The sign of Ds2 is determined by the sign of Np - 1, because E{Xt} >= Pr{Xt = 1}. If most mutations are advantageous, then Ds1 < 0, but Np > 1, so that Ds2 > 0, and R(T) is re-stored toward 1. If most mutations are deleterious, then Ds1 > 0, but Np < 1, so that Ds2 < 0, and R(T) is again restored toward 1. So, qualitatively Ds2 behaves as it should. Nevertheless, Ds2 contains two approximations that may effect its quantitative agreement with simulation.



View larger version (9K):
In this window
In a new window
Download PPT slide
 
Figure 9. 2N{pi}(s) is larger than {pi}(2s)/{pi}(s), but for small values of Ns, it is a reasonable approximation.

Ds2, much like Ds1, contains a term, Pr{Xt = 1}, that is difficult to find analytically. However, this term is easy to obtain from simulation. Hence, Fig 10 was produced using simulation to estimate both Ds1 and Ds2. These simulations behave qualitatively as expected. 1 + Ds1 + Ds2 does a much better job of predicting R(T) than does 1 + Ds1 alone, but there is still much room for improvement. Moreover, because R(T) is so nearly equal to 1.0 for the negative gamma shift, one wonders if a much better approximation than Ds2 is easily available.



View larger version (19K):
In this window
In a new window
Download PPT slide
 
Figure 10. Gamma shift model (GILLESPIE 1994B Down). Fitness of a piece of DNA with a new mutation is equal to its fitness before the mutation plus (or minus) a gamma-distributed random variable with mean {sigma}.

Normal shifts are similar in structure to gamma shifts. The fitness of a sequence with a newly arising mutation is its parents' fitness plus a normally distributed random variable with mean 0 and variance {sigma}2 (instead of a gamma-distributed random variable). Unlike the gamma shifts, where all sequences with new mutations were either uniformly worse than their parents (negative) or uniformly better (positive), mutants under a normal shift have a 50% chance of having higher fitness than their parents and a 50% chance of having a lower fitness. It is a little difficult to predict a priori that under this model, mutations on average increase in frequency, but this is not altogether surprising. New mutants have a higher than average fitness half the time. Thus, one expects new mutants to increase in frequency roughly half the time. Because there is a lot more "space" above 1/N than there is below it, it is not surprising that mutants on average increase in frequency. In any case, from simulation it is clear that mutants do, in fact, increase in frequency, on average, and as a result Ds1 < 0. Given that Ds1 < 0, one expects that Ds2 > 0 because of direct interactions. It is clear from simulation (Fig 11) that 1 + Ds1 + Ds2 does a reasonable job of predicting R(T), but once again there is still considerable room for improvement, particularly for weak selection.



View larger version (14K):
In this window
In a new window
Download PPT slide
 
Figure 11. Normal shift model (GILLESPIE 1994B Down). Fitness of a piece of DNA with a new mutation is equal to its fitness before the mutation plus a normally distributed random variable with mean zero and variance {sigma}2.


*  INFINITE ALLELE MODELS
*TOP
*ABSTRACT
*CALCULATION OF R(T)
*SLOWLY CHANGING ENVIRONMENT
*UNDERSTANDING MODELS WITH...
*INTERACTION BETWEEN SITES
*EXCHANGEABLE ALLELES
*HOUSE OF CARDS
*OPTIMUM MODEL
*SHIFT MODELS
*INFINITE ALLELE MODELS
*A DELETERIOUS MUTATIONS MODEL
*DISCUSSION
*LITERATURE CITED

GILLESPIE 1993 Down presents a proof that a two-allele diffusion with a reflecting barrier below and an absorbing barrier above has an index of dispersion less than one. The proof is formulated as a waiting time problem in a diffusion and is technical. The intuition is as follows. Consider an infinite allele model of the gene. Origination processes in infinite allele models differ from those in infinite site models in at least one fundamental way. Under infinite alleles, an origination occurs only when every individual in the population has exactly the same allele at the locus. Therefore, at the instant when an allele fixes, there must not be any mutations in the entire population's coalescent. With this in mind, one can think of the time between fixations as being composed of two pieces Tb + Tcf. Tb is the time the population waits until a mutation occurs that will eventually fix, and Tcf is time between when a mutant destined to fix arises in the population and when it actually fixes. Gillespie argues that as the mutation rate gets small, Tb looks increasingly like an exponential waiting time, under lots of models. He notes from his simulations that the variance to mean ratio of Tcf looks no more erratic than an exponential wait; therefore he concludes that Tb + Tcf is more regular than an exponential waiting time. Recall that the sum of two exponentials is more regular than a single exponential.

Gillespie's argument can be understood in the terms presented here as well. From (2) and (5), the index of dispersion can be written as

(12)

Under the assumption that the mutation process has a constant rate, (12) can be written as

(13)

Consider Pr{St = 1|S0 = 1, Mt = 1}. This is the probability that a mutation that enters the population at the t fixes, given that a mutation that entered the population at time 0 also fixes. Suppose the mutation from time 0 first reaches frequency 1 at time t*. Because of the structure of an infinite allele model, at time t* there cannot be any segregating sites at this locus. Thus, Pr{St = 1|S0 = 1, Mt = 1} = 0 for all values of t, such that t* - N < t < t*. In a Moran model it takes at least N time steps for a mutation to reach frequency 1; therefore any mutant that enters the population in the N time steps before t* is destined to be lost. For values of t slightly smaller than t* - N, Pr{St = 1|S0 = 1, Mt = 1} will be nearly 0. Thus, for all values of t slightly less than t*, Pr{St = 1|S0 = 1, Mt = 1} - p < 0. This suggests that infinite allele models will generally have an R(T) < 1. Note that this conclusion is a direct consequence of the infinite allele assumption, and it is hard to imagine that this result sheds any additional light on infinite site models.


*  A DELETERIOUS MUTATIONS MODEL
*TOP
*ABSTRACT
*CALCULATION OF R(T)
*SLOWLY CHANGING ENVIRONMENT
*UNDERSTANDING MODELS WITH...
*INTERACTION BETWEEN SITES
*EXCHANGEABLE ALLELES
*HOUSE OF CARDS
*OPTIMUM MODEL
*SHIFT MODELS
*INFINITE ALLELE MODELS
*A DELETERIOUS MUTATIONS MODEL
*DISCUSSION
*LITERATURE CITED

With an understanding of Ds1 and mutation interactions in hand, it is easy to construct a model that ought to produce large values of R(T). Deleterious sites cause Ds1 > 0, so the model must have mostly deleterious mutants. Direct interactions can negate this effect, so there must be no direct interactions between sites. Any model with these two properties ought to produce an R(T) > 1. The following extremely simple model (IWASA 1993 Down) should produce a large R(T).

Consider a two-allele model with alleles A1 and A2 with fitnesses 1 and 1 - {sigma}, {sigma} > 0, respectively. When an A1 allele mutates it becomes A2 with probability 1. When an A2 allele mutates it becomes A1 with probability q, q << 1, and stays A2 with probability 1 - q. A1 should be nearly fixed most of the time, and therefore almost all mutations will be deleterious. Even when A2 is nearly fixed, most mutations are neutral, so that Ds1 should be large.

To finish off the model, assume that q = 0.001 and this is an additive diploid population structure, so the ith sequence, with fitness wi {isin} {1,1 - {sigma}} and frequency Xi(t) in generation t, has deterministic frequency change

where

is the marginal fitness of the ith sequence, Ch(t) is the number of distinct sequences segregating in the population, and

is the population mean fitness.

This model should produce indirect interactions, because whenever polymorphism is unusually high, it is extremely likely that at least one A2 allele is at high frequency, which suggests that population mean fitness is unusually low, and subsequent fixations of other A2 alleles are unusually easy ((t) > 1). Simulations reflect this intuition (Fig 12). Under a weak mutation approximation, IWASA 1993 Down showed that a similar model with more alleles could also produce a large R(T).



View larger version (13K):
In this window
In a new window
Download PPT slide
 
Figure 12. Simple deleterious model, 2N{nu} = 2. A1 alleles have fitness 1. A2 alleles have fitness 1 - {sigma}.

The deleterious mutant model behaves exactly as expected. For 2N{nu} = 2, R(T) does not quite reach five before the origination rate falls significantly below the neutral level, but because the leading term in Ds1 is {nu}, elevating 2N{nu} ought to increase R(T). It does (Fig 13). This model can create an R(T) as large as one likes, while still maintaining an origination rate within an order of magnitude of the neutral rate, but only for a narrow range of N{sigma} values. R(T) can be further elevated (but only slightly) by making the A2 allele recessive (results not shown).



View larger version (13K):
In this window
In a new window
Download PPT slide
 
Figure 13. Simple deleterious model, 2N{nu} = 8. Increasing the mutation rate increases R(T).

These results are not crucially dependent on choice of q. As long as q is small the conclusions hold. Fig 14 shows this. As long as q stays below 0.1, R(T) remains quite high. Interestingly, and perhaps not surprisingly, large q versions of this model are the only example in this article of a model without a renewal-like appearance. For q > 0.01, one can show that R(T) is statistically different from what its value would be had the model been a renewal process. As a q = 0 limiting case, a slight variant of this model was considered. In this model, A1 mutates to A2 with probability one, and A2 mutates to A2 with probability one, but when a site fixes, the sequence on which that site arose instantaneously becomes an A1 allele. This model can be thought of as a deleterious shift model, analogous to the gamma shift, but with direct site interactions removed.



View larger version (13K):
In this window
In a new window
Download PPT slide
 
Figure 14. Simple deleterious model, 2N{nu} = 2. Relative insensitivity to rare advantageous mutations. q is the probability that a mutation changes a deleterious allele, A2, to an advantageous one, A1.


*  DISCUSSION
*TOP
*ABSTRACT
*CALCULATION OF R(T)
*SLOWLY CHANGING ENVIRONMENT
*UNDERSTANDING MODELS WITH...
*INTERACTION BETWEEN SITES
*EXCHANGEABLE ALLELES
*HOUSE OF CARDS
*OPTIMUM MODEL
*SHIFT MODELS
*INFINITE ALLELE MODELS
*A DELETERIOUS MUTATIONS MODEL
*DISCUSSION
*LITERATURE CITED

In the absence of site interactions, deleterious mutations cause R(T) to be >1, and advantageous mutations cause R(T) to be <1. Advantageous mutations are shown in simulation to nearly completely explain all previous models that produced an R(T) < 1 (Fig 1 Fig 2 Fig 3 Fig 4 Fig 5 Fig 6). Direct interactions (a sequence's fitness is directly proportional to the number of mutations contained in the sequence) tend to make R(T) closer to 1 than it would otherwise be. Indirect interactions (sites interact through an intermediary, usually population mean fitness) generally have the opposite effect.

For mammalian species, the observed index of dispersion of protein-encoding loci is >>1, and our current best estimate suggests that it is >5 (OHTA 1995 Down), for both silent and replacement sites. In Drosophila a somewhat different picture emerged (ZENG et al. 1998 Down). R(T) for silent sites is statistically >1 (an average of 4.37) and not too dissimilar from the mammalian value. Replacement sites, on the other hand, showed an R(T) (1.64) that is much lower than mammals and not distinguishable from one. This raises the immediate possibility that whatever model of evolution is correct in mammals, something quite different may be happening in flies (ZENG et al. 1998 Down). Unfortunately, interpretation of the Drosophila results is difficult because of the extremely low divergences between D. pseudoobscura and D. subobscura.

In all the simulations done here, the long-run behavior of R(T) is reported. The reason for this is that virtually any stationary, orderly (no more than one mutation per time step) model of molecular evolution has the property that R(T) will be an increasing function of time. For Equation 1, it is clear that so long as h(t) converges monotonically to {rho}, the longer one allows the process to evolve, the larger R(T) will be. For every model simulated here, R(T) was an increasing function of T, at least for small T. It is also clear from Equation 1 that if one observes a process for only a single time step, R(T) will be exactly equal to 1.0 - {rho}. Thus, as a population evolves, R(T) will start at ~1 and continue to rise, until some long-term asymptotic value is reached. The length of time it takes to approach this asymptote is crucially dependent on the details of the model, but some intuition is possible.

Consider an attempt to estimate R(T) in a highly simplified situation. Suppose there are X1 substitutions in lineage one and X2 substitutions in lineage two. A simple estimate of R(T) might be (X2 - X1)2/(X1 + X2). Because the number of substitutions can never be negative, R(T) will never be larger than the maximum of {X1, X2}. Thus, the estimate of R(T) can never be larger than the maximum number of substitutions observed in the two lineages. So, unless there are at least 5 substitutions in one of the lineages, the estimate of R(T) has to be <5, no matter what the long run value is. Thus, when the level of divergence is very low, there is no possibility of estimating a large value of R(T).

In 12 of the 24 proteins examined by ZENG et al. 1998 Down, both D. pseudoobscura and D. subobscura show less than five substitutions. For eight loci, neither species has more than a single substitution. Now, this situation is complicated by the existence of a third species, and the use of Gillespie's weighting factor, but the above intuition suggests that one should be very careful in drawing any inference from these replacement sites. One has to worry that, because of the incredibly low replacement site divergences, this study is simply incapable of estimating the asymptotic behavior of R(T). In contrast, for silent sites only a single locus shows fewer than five substitutions in both lineages, and silent sites exhibit a value of R(T) quite similar to mammals. Thus, the principal difference between Zeng et al.'s and Ohta's results may be that Ohta examined deeper divergences and came closer to observing the long-run behavior of R(T).

In any case, it is quite certain that our best estimate of R(T) for mammals is >5 for both silent and replacement sites. The question remains, What evolutionary processes can generate an index of dispersion so large? This work shows that there are fundamentally only two ways to create an R(T) as large as 5. The first way is to force most mutations to be deleterious (create a large Ds1 and indirect interactions). The second way is to create fluctuations (in mutation rate or probability of fixation or both) that occur on a very slow timescale. There appear to be no other ways to create a large R(T).

Virtually any model of molecular evolution with mostly deleterious mutations and no direct site interactions can explain an R(T) as large as 5, but only with sufficiently high mutation rates and selection coefficients. A very simple model of deleterious alleles is examined and shown to easily explain R(T). This model is shown to be reasonably insensitive to rare advantageous mutations.

Although the neutral theory does a poor job of explaining the index of dispersion, it nevertheless does a passable job of predicting the mean rate of originations in a lineage. We know that the per-site origination rate in many proteins in most organisms is not too different from the per-site, per-generation mutation rate for these proteins (KIMURA 1983 Down). These sorts of mutation rates are extremely hard to measure accurately, but it is certain that the origination rates are not, say, four orders of magnitude higher or lower than the mutation rates for most proteins. Therefore, any model that attempts to explain R(T) must also have the feature that the overall origination rates are at least somewhat similar to the mutation rates.

All the current models that can explain a large R(T) suffer from a parameter sensitivity problem. Models, which create a large R(T) by changing some extrinsic parameter on the same timescale as molecular evolution (GILLESPIE 1984A Down; ARAKI and TACHIDA 1997 Down), essentially are asking one to believe that there exists some key parameter that just happens to change around every 1/{nu} generations (or slower), where {nu} is the mutation rate. Now, this just may be the way the world is, but explaining it as such really just pushes the logical question back one level. The new question now becomes, How is it that some environmental parameter just happens to change at a rate keyed to the mutation rate?

Other models suffer from similar parameter problems. Takahata's fluctuating neutral space model (TAKAHATA 1987 Down) can explain a large value of R(T), but if one assumes that new mutation rates are independent of previous rates, sequences that differ by only a single nucleotide will sometimes (2–5% of the time) have mutation rates that differ by an order of magnitude (CUTLER 2000 Down). To some, this feature may not seem biologically realistic.

Deleterious site models can often explain large values of R(T), but only when Ns {approx} 2 - 10. If Ns > 10 evolution stops, and if Ns << 1, the world is neutral. Again these models just push the logical question back a level. If Ns = 5 explains the index of dispersion, how is it that organisms with N's that vary over many orders of magnitude just happen to have s's that match?

This last question is not entirely rhetorical. The notion that the average selective coefficient of a mutation might evolve has been suggested before (HARTL et al. 1985 Down) and has been given a recent serious treatment (CHERRY 1998 Down). Moreover, allowing the selective coefficient to evolve might also explain the similarity between R(T) for silent and replacement sites in mammals. If Ns evolves to approximately the same place for both silent and replacement sites, then their respective R(T)'s ought to be approximately the same. Thus, models that allow levels of selection to evolve might explain the parameter sensitivity problem.

Finally, much of population genetics theory over the last 15 years has been devoted to understanding within-population sequence variation. In particular, there has been a great deal of work trying to understand the observed levels of nucleotide polymorphism in genomic regions of low recombination (KAPLAN et al. 1989 Down; CHARLESWORTH et al. 1993 Down). Two major theories have been proposed: selective sweeps and background selection. Selective sweeps suggest that advantageous mutations go to fixation regularly and sweep out linked neutral variation in the process. Background selection suggests that recurrent deleterious mutations are being driven out by selection, thereby lowering the effective population size. Put simply, selective sweeps are inconsistent with an R(T) > 1, unless one further contends that something forces these sweeps to occur irregularly (and on a timescale tied to the mutation rate). Background selection, on the other hand, predicts that R(T) ought to be >1. An obvious question is whether there is a model that can simultaneously account for the index of dispersion and levels of polymorphism in natural populations.


*  ACKNOWLEDGMENTS

I thank John Gillespie, Hiroshi Akashi, Mark Grote, Michael Turelli, and two anonymous reviewers for much advance and many helpful suggestions.

Manuscript received April 23, 1999; Accepted for publication December 2, 1999.


*  LITERATURE CITED
*TOP
*ABSTRACT
*CALCULATION OF R(T)
*SLOWLY CHANGING ENVIRONMENT
*UNDERSTANDING MODELS WITH...
*INTERACTION BETWEEN SITES
*EXCHANGEABLE ALLELES
*HOUSE OF CARDS
*OPTIMUM MODEL
*SHIFT MODELS
*INFINITE ALLELE MODELS
*A DELETERIOUS MUTATIONS MODEL
*DISCUSSION
*LITERATURE CITED

ARAKI, H. and H. TACHIDA, 1997  Bottleneck effect on evolutionary rate in the nearly neutral mutation model. Genetics 147:907-914[Abstract].

BIRKY, C. W. and J. B. WALSH, 1988  Effects of linkage on rates of molecular evolution. Proc. Natl. Acad. Sci. USA 85:6414-6418[Abstract/Free Full Text].

BULMER, M., 1989  Estimating the variability of substitution rates. Genetics 123:615-619[Abstract/Free Full Text].

CHARLESWORTH, B., M. T. MORGAN, and D. CHARLESWORTH, 1993  The effect of deleterious mutations on neutral molecular variation. Genetics 134:1289-1303[Abstract].

CHERRY, J. L., 1998  Should we expect substitution rate to depend on population size. Genetics 150:911-919[Abstract/Free Full Text].

COX, D. R., and V. ISHAM, 1980 Point Process. Chapman and Hall, London.

CUTLER, D. J., 2000  The index of dispersion of molecular evolution: slow fluctuations. Theor. Popul. Biol. in press.

EWENS, W. J., 1979 Mathematical Population Genetics. Springer-Verlag, Berlin.

GILLESPIE, J. H., 1978  A general model to account for enzyme variation in natural populations. V. The SAS-CFF model. Theor. Popul. Biol. 14:1-45[Medline].

GILLESPIE, J. H., 1984a  The molecular clock may be an episodic clock. Proc. Natl. Acad. Sci. USA 81:8009-8013[Abstract/Free Full Text].

GILLESPIE, J. H., 1984b  Molecular evolution over the mutational landscape. Evolution 38:1116-1129.

GILLESPIE, J. H., 1986a  Natural selection and the molecular clock. Mol. Biol. Evol. 3:138-155[Abstract].

GILLESPIE, J. H., 1986b  Variability of evolutionary rates of DNA. Genetics 113:1077-1091[Abstract/Free Full Text].

GILLESPIE, J. H., 1987  Molecular evolution and the neutral allele theory. Oxf. Surv. Evol. Biol. 4:10-37.

GILLESPIE, J. H., 1989  Lineage effects and the index of dispersion of molecular evolution. Mol. Biol. Evol. 6:636-647[Abstract].

GILLESPIE, J. H., 1991 The Causes of Molecular Evolution. Oxford University Press, New York.

GILLESPIE, J. H., 1993  Substitution processes in molecular evolution. I. Uniform and clustered substitutions in a haploid model. Genetics 134:971-981[Abstract].

GILLESPIE, J. H., 1994a  Substitution processes in molecular evolution. II. Exchangeable models from population genetics. Evolution 48:1101-1113.

GILLESPIE, J. H., 1994b  Substitution processes in molecular evolution. III. Deleterious alleles. Genetics 138:943-952[Abstract].

GILLESPIE, J. H. and C. H. LANGLEY, 1979  Are evolutionary rates really variable? J. Mol. Evol. 13:27-34[Medline].

GOLDMAN, N., 1994  Variance to mean ratio, r(t), for poisson processes on phylogenetic trees. Mol. Phylogenet. Evol. 3:230-239[Medline].

HARTL, D. L., D. D. DYKHUIZEN, and A. M. DEAN, 1985  Limits of adaptation: the evolution of selective neutrality. Genetics 111:655-674[Abstract/Free Full Text].

IWASA, Y., 1993  Overdispersed molecular evolution in constant environments. J. Theor. Biol. 164:373-393[Medline].

KAPLAN, N. L., R. R. HUDSON, and C. H. LANGLEY, 1989  The hitchhiking effect revisited. Genetics 123:887-899[Abstract/Free Full Text].

KELLY, F. P., 1979 Reversibility and Stochastic Networks. John Wiley & Sons, Chichester, United Kingdom.

KIMURA, M., 1979  Model of effectively neutral mutations in which selective constraint is incorporated. Proc. Natl. Acad. Sci. USA 76:3440-3444[Abstract/Free Full Text].

KIMURA, M., 1983 The Neutral Allele Theory of Molecular Evolution. Cambridge University Press, Cambridge.

LANGLEY, C. H., and W. M. FITCH, 1973 The constancy of evolution: a statistical analysis of the {alpha} and ß haemoglobins, cytochrome c, and fibrinopeptide A, pp. 246–262 in Genetic Structure of Populations, edited by N. E. MORTON. Univ. of Hawaii Press, Honolulu.

LANGLEY, C. H. and W. M. FITCH, 1974  An estimation of the constancy of the rate of molecular evolution. J. Mol. Evol. 3:161-177[Medline].

MORAN, N. A., 1996  Accelerated evolution and Muller's rachet in endosymbiotic bacteria. Proc. Natl. Acad. Sci. USA 93:2873-2878[Abstract/Free Full Text].

MORAN, P. A. P., 1958  Random processes in genetics. Proc. Camb. Philos. Soc. 54:60-72.

NACHMAN, M. W., S. N. BOYER, and C. F. AQUADRO, 1994  Nonneutral evolution at the mitochondrial NADH dehydrogenase subunit 3 gene in mice. Proc. Natl. Acad. Sci. USA 91:6364-6368[Abstract/Free Full Text].

NIELSEN, R., 1997  Robustness of the estimator of the index of dispersion for DNA sequences. Mol. Phylogenet. Evol. 7:346-351[Medline].

OHTA, T., 1977 Extension of the neutral mutation drift hypothesis, pp. 148–167 in Molecular Evolution and Polymorphism, edited by M. KIMURA. National Institute of Genetics, Mishima, Japan.

OHTA, T., 1995  Synonymous and nonsynonymous substitutions in mammalian genes and the nearly neutral theory. J. Mol. Evol. 40:56-63[Medline].

OHTA, T. and J. H. GILLESPIE, 1996  Development of the neutral and nearly neutral theories. Theor. Popul. Biol. 49:128-142[Medline].

OHTA, T. and M. KIMURA, 1969  Linkage disequilibrium at steady state determined by random genetic drift and recurrent mutation. Genet. Res. 16:165-177.

OHTA, T. and M. KIMURA, 1971  On the constancy of the evolutionary rate of cistrons. J. Mol. Evol. 1:18-25[Medline].

OHTA, T. and H. TACHIDA, 1990  Theoretical study of near neutrality. I. Heterozygosity and rate of mutant substitution. Genetics 126:219-229[Abstract].

SAWYER, S., 1977  On the past history of an allele now known to have frequency p.. J. Appl. Prob. 14:439-450.

TACHIDA, H., 1991  A study on a nearly neutral mutation model in finite populations. Genetics 128:183-192[Abstract].

TACHIDA, H., 1996  Effects of the shape of distribution of mutant effect in nearly neutral mutation models. J. Genet. 75:33-48.

TAKAHATA, N., 1987  On the overdispersed molecular clock. Genetics 116:169-179[Abstract/Free Full Text].

TAKAHATA, N., 1989  Statistical models of the overdispersed molecular clock. Theor. Popul. Biol. 39:329-344.

TAKAHATA, N., K. ISHII, and H. MATSUDA, 1975  Effect of temporal fluctuation of selection coefficient on gene frequency in a population. Proc. Natl. Acad. Sci. USA 72:4541-4545[Abstract/Free Full Text].

WATTERSON, G. A., 1975  On the number of segregating sites in genetic models without recombination. Theor. Popul. Biol. 7:256-276[Medline].

ZENG, L.-W., J. M. COMERON, B. CHEN, and M. KREITMAN, 1998  The molecular clock revisited: the rate of synonymous vs. replacement change in drosophila.. Genetica 102(103):369-382.




This article has been cited by other articles:


Home page
Syst BiolHome page
S. Y. W. Ho and M. J. Phillips
Accounting for Calibration Uncertainty in Phylogenetic Estimation of Evolutionary Divergence Times
Syst Biol, July 3, 2009; (2009) syp035v1.
[Full Text] [PDF]


Home page
Nucleic Acids ResHome page
E. V. Koonin
Darwinian evolution in the light of genomics
Nucleic Acids Res., March 1, 2009; 37(4): 1011 - 1034.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
T. Bedford and D. L. Hartl
Overdispersion of the Molecular Clock: Temporal Variation of Gene-Specific Substitution Rates in Drosophila
Mol. Biol. Evol., August 1, 2008; 25(8): 1631 - 1638.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
T. Bedford, I. Wapinski, and D. L. Hartl
Overdispersion of the Molecular Clock Varies Between Yeast, Drosophila and Mammals
Genetics, June 1, 2008; 179(2): 977 - 984.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
S.-H. Kim and S. V. Yi
Mammalian Nonsynonymous Sites Are Not Overdispersed: Comparative Genomic Analysis of Index of Dispersion of Mammalian Proteins
Mol. Biol. Evol., April 1, 2008; 25(4): 634 - 642.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
C. S. McBride, J. R. Arguello, and B. C. O'Meara
Five Drosophila Genomes Reveal Nonneutral Evolution and the Signature of Host Specialization in the Chemoreceptor Superfamily
Genetics, November 1, 2007; 177(3): 1395 - 1416.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
R. D. Hernandez, S. H. Williamson, and C. D. Bustamante
Context Dependence, Ancestral Misidentification, and Spurious Signatures of Natural Selection
Mol. Biol. Evol., August 1, 2007; 24(8): 1792 - 1800.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
N. Takahata
Molecular Clock: An Anti-neo-Darwinian Legacy
Genetics, May 1, 2007; 176(1): 1 - 6.
[Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
V. Mustonen and M. Lassig
Adaptations to fluctuating selection in Drosophila
PNAS, February 13, 2007; 104(7): 2277 - 2282.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
J. D. Bloom, A. Raval, and C. O. Wilke
Thermodynamics of Neutral Protein Evolution
Genetics, January 1, 2007; 175(1): 255 - 266.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
H. Akashi, W.-Y. Ko, S. Piao, A. John, P. Goel, C.-F. Lin, and A. P. Vitins
Molecular Evolution in the Drosophila melanogaster Species Subgroup: Frequent Parameter Fluctuations on the Timescale of Molecular Divergence
Genetics, March 1, 2006; 172(3): 1711 - 1726.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
P. S. Novichkov, M. V. Omelchenko, M. S. Gelfand, A. A. Mironov, Y. I. Wolf, and E. V. Koonin
Genome-Wide Molecular Clock and Horizontal Gene Transfer in Bacterial Evolution
J. Bacteriol., October 1, 2004; 186(19): 6575 - 6585.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
A. D. Kern, C. D. Jones, and D. J. Begun
Molecular Population Genetics of Male Accessory Gland Proteins in the Drosophila simulans Complex
Genetics, June 1, 2004; 167(2): 725 - 735.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
N. G. C. Smith and A. Eyre-Walker
Partitioning the Variation in Mammalian Substitution Rates
Mol. Biol. Evol., January 1, 2003; 20(1): 10 - 17.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
T. Johnson and N. H. Barton
The Effect of Deleterious Alleles on Adaptation in Asexual Populations
Genetics, September 1, 2002; 162(1): 395 - 411.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
J. Kusumi, Y. Tsumura, H. Yoshimaru, and H. Tachida
Molecular Evolution of Nuclear Genes in Cupressacea, a Group of Conifer Trees
Mol. Biol. Evol., May 1, 2002; 19(5): 736 - 747.
[Abstract] [Full Text] [PDF]