- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Waxman, D.
- Articles by Peck, J. R.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Waxman, D.
- Articles by Peck, J. R.
The Anomalous Effects of Biased Mutation
D. Waxmana and J. R. Peckaa Centre for the Study of Evolution, School of Biological Sciences, University of Sussex, Brighton BN1 9QG, Sussex, United Kingdom
Corresponding author: D. Waxman, University of Sussex, Brighton BN1 9QG, Sussex, United Kingdom., d.waxman{at}sussex.ac.uk (E-mail)
Communicating editor: M. W. FELDMAN
| ABSTRACT |
|---|
A model is presented in which alleles at a number of loci combine to influence the value of a quantitative trait that is subject to stabilizing selection. Mutations can occur to alleles at the loci under consideration. Some of these mutations will tend to increase the value of the trait, while others will tend to decrease it. In contrast to most previous models, we allow the mean effect of mutations to be nonzero. This means that, on average, mutations can have a bias, such that they tend to either increase or decrease the value of the trait. We find, unsurprisingly, that biased mutation moves the equilibrium mean value of the quantitative trait in the direction of the bias. What is more surprising is the behavior of the deviation of the equilibrium mean value of the trait from its optimal value. This has a nonmonotonic dependence on the degree of bias, so that increasing the degree of bias can actually bring the mean phenotype closer to the optimal phenotype. Furthermore, there is a definite maximum to the extent to which biased mutation can cause a difference between the mean phenotype and the optimum. For plausible parameter values, this maximum-possible difference is small. Typically, quantitative-genetics models assume an unconstrained model of mutation, where the expected difference in effect between a parental allele and a mutant allele is independent of the current state of the parental allele. Our results show that models of this sort can easily lead to biologically implausible consequences when mutations are biased. In particular, unconstrained mutation typically leads to a continual increase or decrease in the mean allelic effects at all trait-controlling loci. Thus at each of these loci, the mean allelic effect eventually becomes extreme. This suggests that some of the models of mutation most commonly used in quantitative genetics should be modified so as to introduce genetic constraints.
MANY mutations affect continuously distributed traits such as height and weight (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
In this study, we have adopted a standard model of stabilizing selection, where an optimal value of the trait exists. We find that the dependence of the population's mean phenotypic value on the degree of mutational bias is nonmonotonic. As such, under some conditions, increasing the extent of mutational bias can actually lead to a reduction in the deviation of the population's mean phenotypic value from its optimal value.
We use a modified version of the model of mutation that was originally introduced by ![]()
![]()
![]()
![]()
![]()
![]()
![]()
Consideration of biased mutation is common when the trait under consideration is fitness, as most researchers believe that the vast majority of fitness-altering mutations cause a decline in fitness (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
| MODEL |
|---|
Consider a randomly mating population of dioecious sexual organisms, with no sexual dimorphism. The population size is assumed to be sufficiently large such that stochastic effects (genetic drift) can be ignored. Individuals are subject to selection on the value of a single phenotypic trait. The phenotype of a particular individual is assumed to depend on the individual's "genotypic value," G, plus a normally distributed environmental noise component,
. Using z to represent an individual's phenotypic value, we have z = G +
. The distribution of
is assumed to be independent of G and has a mean of zero and a standard deviation of Ve. Following convention and without loss of generality, we scale all variables so that Ve is set to unity.
Individuals are diploid with n freely recombining loci that additively affect the genotypic value. These loci are labeled 1, 2, ... , n. The DNA sequence of an allele determines its effect on genotypic value, and the effects of the maternally and paternally inherited alleles at locus i are denoted by xi and yi, respectively.
Following ![]()
> xi, yi > -
. Additivity in the determination of the genotypic value leads to
. Apart, possibly, from the initial generation, maternally and paternally inherited alleles have identical distributions. Because of this, we need to refer only to the distribution of alleles of maternal origin.
Generations are discrete, with all parents dying soon after the birth of offspring. Some offspring die before reaching reproductive maturity due to stabilizing viability selection. We confine ourselves to parameter ranges for which selection is weak at the level of the trait (see below). We can therefore employ a quadratic function to describe stabilizing selection, following the example of many authors including ![]()
![]()
0). The value of s* is a measure of the strength of stabilizing selection on phenotypes and individuals of optimal phenotype have z = zopt. Note that the probability of surviving should be set to zero for values of z yielding 1 - s*(z - zopt)2 < 0; however, for the parameter values considered in this work, the probability with which this occurs is of order 10-9 and thus negligible for practical purposes. Thus, the simple quadratic viability function is taken to hold without restriction on z.
We focus, in this study, on the impact of relatively weak selection: s* << 1. Under weak selection, a quadratic selection function gives results that are very close to those produced by a Gaussian selection function; however, a quadratic function is, mathematically, more amenable to analysis.
The preceding assumptions allow us to derive the effect of selection on the distribution of genotypic values. By averaging over environmental effects, it can be shown that the probability of survival for an individual with genotypic value G is proportional to
![]() |
(1) |
Here s = s*/(1 - s*) is a measure of the strength of selection on genotypic values.
Gamete formation involves standard Mendelian segregation and free recombination. The population members that have survived viability selectiontermed adultsundergo random mating and proceed to produce new offspring.
Each of an individual's 2n alleles is a copy of an allele present in one or the other of the individual's parents. The effect of an allele in an offspring is identical to that of the parental allele, of which it is a copy, unless a mutation occurred during its production. The per-allele rate of mutation at locus i is denoted by µi, where 1
µi
0. The expected number of new mutations that affect the trait, per individual, per generation, U, is given by
.
Let us now specify the mutation function. We have chosen a relatively general formulation that encompasses a number of previous approaches. In particular, we make the usual assumptions that mutations to different alleles occur independently and that mutant allelic effects are continuously distributed (![]()
![]()
![]()
![]()
![]()
Let x represent the effect of a particular allele at locus i in a particular offspring. Let x* represent the effect of the parental allele from which the offspring allele was copied. If no mutation of the allele occurred in the production of the offspring then x = x*. If a mutation did occur, then the value of x is chosen from a normal distribution with variance mi2 and mean
x* + bi, where 1
0. In other words, at locus i, the probability density function for the allelic effects of new mutations is given by
![]() |
(2) |
Let us consider the implications of this formula. If
= 1 and bi = 0 for all values of i, then we have the model of mutation most commonly used in quantitative genetics (e.g., ![]()
![]()
Next, consider the case where
= 1 and bi
0. In this case the distribution of mutants has a mean of x* + bi and we can interpret bi as the bias introduced by mutation: on average, mutations at locus i change allelic effects by an amount bi. We note that while ![]()
![]()
Consider now the case
< 1 and bi = 0. This model of mutation was initially formulated by ![]()
x*. The model incorporates the idea of genetic constraints, so that very extreme allelic effects are unlikely to arise as a cumulative consequence of mutation. Thus, even when selection is absent (s = 0), alleles with very extreme effects will not become common in the population as a result of mutation. Instead, allelic effects will remain clustered around zero. It does seem reasonable to incorporate some sort of genetic constraint. Otherwise one gets biologically implausible implications such as large populations yielding extremely large amounts of phenotypic variation on traits that are not under selection. Note that a special example of
< 1 is the case
= 0 and this corresponds to the house-of-cards model of mutation (![]()
Finally, let us consider the case where
< 1 and bi
0. This model is a combination of a Gaussian mutation model and ZENG and COCKERHAM's (1993) regression model. In this case the distribution of mutants has a mean of
x* + bi. Thus bi can be interpreted as the mean deviation of mutant alleles from
x*. The model allows us to consider mutational bias in situations where very extreme allelic effects are unlikely to arise. Thus, of the models discussed here, this combined mutation model is the most realistic.
In what follows, all summary statistics that describe the population (phenotypic values, genetic variance, etc.) are measured immediately after the birth of the offspring and before any selection has taken place.
| RESULTS |
|---|
It has been possible to produce analytical approximations of the model in a number of relevant cases and these have been supplemented with numerical studies. The appropriate analytical approximations depend on the combination of parameter values. The standard reference on this subject (![]()
- The expected number of new mutations per generation per individual that affect the trait,
, satisfies U << 0.05. - The variance of mutant effects at any locus, mi2, satisfies mi2 << 1 (
LANDE 1975 ;
TURELLI 1984 ).
- The strength of selection on genotypic values, s, satisfies s << 1 (
GARCIA-DORADO and MARIN 1998 ).
- The strength of selection acting on allelic effects at any locus, i, is much stronger than the effects of mutation at the locus, such that µi/(smi2) << 1 (
TURELLI 1984 ).
We assume that conditions iiv hold. In addition, we assume that bias is not large compared with the strength of selection, in the sense sb2 < 0.05. As we shall see, all behavior of interest occurs when b is substantially smaller than the requirements of this inequality. Therefore this assumption does not place any important limitation on the scope of this work.
The results presented below apply when the preceding assumptions are met. See Table 1 for notation.
|
Results for equivalent loci with
= 1:
One case where considerable analytic progress is possible is where
= 1 and all loci have identical parameter values governing mutation. This is the case where the parameters µi, mi, and bi do not have any variation in value across loci. We refer to these universal values as µ, m, and b, respectively. It would be surprising to find a case of exactly equivalent loci in nature; however, as we shall see, results for the case of equivalent loci are helpful in predicting the outcome of other, more realistic, cases. In addition, in one case (albeit a degenerate one) a lack of variation in parameter values among loci automatically arises. This is where only a single locus is under selection (n = 1). In the next section we consider equivalent loci with
< 1 and again the value of the analysis is for the insight gained for more realistic cases.
For equivalent loci, with
= 1, we can produce estimates of summary statistics using the analysis presented in Equivalent loci with
= 1 in the Appendix In particular, we estimate the equilibrium variance in genotypic values, G, among offspring, VG, and also estimate the equilibrium genetic load,
, where w(G) is given in Equation 1,
is its mean equilibrium value, and L is proportional to the fraction of the population that fails to survive viability selection. The estimates given in the Appendix for VG and L are, to first order in µ, unaffected by the degree of bias, b, and thus, to this order, identical to well-known approximations that have appeared in the literature (![]()
2nµ/s and
.
We turn now to the behavior of the mean phenotypic value. Our estimate of the equilibrium mean phenotypic value among offspring, denoted
, is given by
![]() |
(3) |
where
![]() |
(4) |
and corrections to
in Equation 3 are O(µ2). Note that D(ß) can be written in terms of special functions; however, we have found the form given in Equation 4 to be most useful since all of its most important properties are readily derivable from this expression.
To obtain an estimate of the error of the expressions given above for VG, L, and
, we have compared the analytical results, given above, with highly accurate numerical results. The latter followed from numerical solution of the equation of the Appendix governing the distribution of allelic effects, Equation A1, using the method of ![]()
is independent of the number of loci, n. As a consequence the estimates of errors are effectively on one-locus quantities and hence independent of n. With the plausible parameter values µ = 10-5, m = 0.2, and s = 0.025 (![]()
b/m
0, we have found that for VG, L, and
the difference between numerical and analytical results is <3%.
Note that an implicit assumption underlies the results for the summary statistics presented above, for equivalent loci. This is that the distribution of allelic effects in gametes (a function of the allelic effects at the n loci that affect the trait) eventually comes to equilibrium. We have used numerical methods to test this assumption and the results are in accord with those from previous studies (![]()
Let
i be the mean equilibrium effect of alleles of maternal origin, at locus i, in newborn offspring. It is identical to the corresponding quantity of paternal origin. With
the mean equilibrium phenotypic value, as given by Equation 3, it follows that any set of n mean allelic effects satisfying
is a possible end point of the dynamics (an equilibrium). In this sense, the equilibrium is "neutral." However, analysis in the Appendix shows that given that an equilibrium is achieved, and given mutationally equivalent loci, the shapes of the distributions of allelic effects at all loci (i.e., the marginal distributions of allelic effects) are identical. Thus, while the second and all higher central moments of the allelic-effect distributions are the same for all loci, the means of these distributions are generally different.
Let us now consider the effects of bias, ignoring the case b = 0, where mutation is unbiased and
. From Equation 3 it follows that a finite positive b yields a mean phenotypic value that is larger than the optimal phenotypic value, i.e.,
> zopt. When b is increased from zero, initially
- zopt increases approximately linearly with b. However, as b becomes larger the rate of increase in
- zopt declines until b reaches a critical value, and any further increase in b produces a decrease in
- zopt. For sufficiently large values of b, the value of
- zopt is, to a good approximation, proportional to b-1. An example of this sort of nonmonotonic behavior is given in Fig 1.
|
Results for b < 0 closely parallel those for b > 0 and can be determined from the latter from the relation
, which follows directly from the property D(-ß) = -D(ß). This is apparent in Fig 1.
Note that the maximum value of the function D(ß), of Equation 4, is given by Dmax
0.77, a value that is independent of all parameters. The maximum of D(ß) occurs when ßD(ß) = 1, i.e., when ß
1.31. The existence of a maximum of D(ß) implies a definite limit to the degree of deviation that biased mutation can cause in the equilibrium mean phenotype among offspring,
, from the optimal phenotype, zopt. For the parameter ranges specified previously, the absolute value of the maximum deviation, in terms of phenotypic standard deviations, is given by
![]() |
(5) |
The above equation indicates that the maximum possible absolute deviation of
from the optimum, not surprisingly, becomes larger when µ is increased or s is decreased, since both of these changes enhance the role of mutation relative to selection. The maximum deviation decreases as n increases because the equilibrium phenotypic variation increases with the number of loci controlling the trait, while |
- zopt|max is found to be independent of n. Thus increasing n decreases the effect of bias, when measured in phenotypic standard deviations. More intriguing is the effect of the standard deviation of mutant effects, m. The maximum possible deviation of
from zopt is highest when m is small. Furthermore, since D(b/m) reaches a maximum when b
1.3m, we find that the maximum deviation from the optimum occurs when both the degree of bias and the standard deviation of mutant effects are small. This intuitively makes sense since when m and b are small, many mutant offspring born to parents with nearly optimal genotypes will also have nearly optimal genotypes. This allows for the survival of a large proportion of the mutants and for the accumulation of biased mutations over the course of many generations.
Results for equivalent loci when
< 1:
When all the selected loci are equivalent, but
< 1, the preceding results are modified and analysis covering this case is contained in Equivalent loci with
< 1 in the Appendix In particular, numerical investigation indicates that the distribution of genotypes no longer simply comes to a neutral equilibrium. Instead, the distribution comes, after some time, to a unique and stable equilibrium where mean allelic effects,
i, at all loci are uniquely determined. The equilibrium distribution is thus independent of the initial distribution and equivalence of loci results in the
i having identical values for all loci:
. A unique equilibrium arises since a regression parameter,
< 1, corresponds to an additional evolutionary force in the system that directly couples to the allelic effects. It destroys the property of the case with
= 1 that, at equilibrium, any sets of mean allelic effects,
i, that lead to the equilibrium value of
are equally good candidates for an equilibrium.
As far as summary statistics are concerned, we find that under the same analytical approximations used for the case
= 1, and hence the same accuracy, the genetic variance, VG, and genetic load, L, are, to leading order in the allelic mutation rate µ, unaffected by the degree of bias, b, and again given by VG
2nµ/s and
. Furthermore, the mean phenotypic value,
, is now given by
![]() |
(6) |
where D(ß) is given in Equation 4. Thus, there is still a nonmonotonic dependence of
upon the value of bias, b. However, when
< 1, it is no longer the case that
. This is illustrated by the dashed curve in Fig 1.
Results for nonequivalent loci for
= 1:
We have, so far, confined ourselves to consideration of cases where all the mutational parameters (µi, mi, and bi) are the same at all loci. Let us now relax this assumption by allowing variation among loci in the mutational parameters for the case
= 1. Analysis covering this case is contained in Nonequivalent loci with
= 1 in the Appendix In particular, it is shown in the Appendix that in this more general case, we have an intriguing result: the assumption that allelic distributions at all loci come to an equilibrium generally leads to a mathematically inconsistent set of equations. Thus, in general, the population cannot equilibrate when loci are nonequivalent and
= 1.
Given this lack of attainment of equilibrium, what is the long-time behavior? Numerical investigation for n > 1 mutationally nonequivalent loci shows the reason for the inconsistency mentioned above. Nonequivalent loci typically lead to a situation where the allelic distributions at every locus continue to change indefinitely. More specifically, the typical long-term behavior is that at every locus there is a roughly linear change in the value of the mean allelic value,
i, with time. This appears to generally occur at a rate smaller than the mutation rate. This is caused by a continual turnover of alleles at every locus, such that common alleles become rare and new mutations multiply and become more common (see Fig 2).
|
Despite the turnover in alleles, when the inequalities given above in the first paragraph of RESULTS apply, the numerical studies indicate that the genetic variance and genetic load are reasonably close to the results that apply in the absence of bias:
and
. As an example, for the parameter values used in Fig 2 (see Fig 2 legend), the differences between the analytical predictions for VG and L and the numerical results are <4%.
The result given in Equation 3 for the equilibrium mean phenotypic effect,
, for equivalent loci with
= 1 may be used to estimate the corresponding quantity when loci are not equivalent. The most straightforward procedure is to evaluate Equation 3 at the mean values of the mutational parameters µi, mi, and bi. As an illustration of this, we have compared the long-time value of
of Fig 2 (which is the outcome of numerical solution) with Equation 3, evaluated at the mean values of the mutational parameters used in Fig 2 (see Fig 2 legend). The difference between the two values of
is found to be <1%.
Results for nonequivalent loci when
< 1:
What are the consequences of genetic constraint (
< 1) when loci are nonequivalent? This question is very difficult to fully address although some progress is made in Nonequivalent loci with
< 1 in the Appendix In particular, extensive numerical study strongly suggests that when
< 1, the population comes to a unique and stable equilibrium. Thus when
< 1 we do not see the perpetual turnover of alleles that occurs when
= 1 since mutational regression is evidently sufficient to stop turnover. We note that under the same approximations used in previous cases possessing an equilibrium, it is possible to analytically determine that the genetic variance and genetic load are, to O(µ), unaffected by mutational bias, so
and
. Furthermore, at each locus a distribution of allelic effects, characteristic of that locus, always becomes established, and this occurs regardless of the initial genotypic distribution. For the mean phenotypic value, we are able to determine the approximate bound
![]() |
(7) |
which is independent of the value of
and again indicates the highly limited extent to which mutational bias can affect the mean phenotypic value.
Let
denote the average of bi over all loci, weighted by the mutation rate
![]() |
(8) |
Then the dependence of
on
is qualitatively similar to the nonmonotonic dependence of
on b that was found for equivalent loci with
< 1; see Equation 6. As an example, we have considered n = 4 loci, with a substantial level of constraint, namely
= 0.2 and with mutation rates, µi, mutational standard deviations, mi, and an optimal phenotypic value, zopt, that are identical to those used in Fig 2 (see Fig 2 legend). We have produced a number of sets of randomly generated mutational biases, [b1, b2, b3, b4] and plotted the long-term values of
, in Fig 3, against the weighted average of bias,
. For comparison, we have also plotted an estimate of
motivated by Equation 6: namely
zopt + (µ/(sm))D(
/m - (1 -
)zopt/(2nm)), where the values of µ and m are taken as arithmetic mean values of the corresponding mutational parameters across loci. As is evident from Fig 3, the form for
given for equivalent loci with
< 1 (Equation 6) provides a useful estimate of the results for loci that are genetically constrained and mutationally nonequivalent.
|
| DISCUSSION |
|---|
In this study we have considered the evolutionary consequences of a biased-mutation process that affects a trait that is undergoing stabilizing selection. Stabilizing selection tends to bring the mean phenotypic value closer to the optimal phenotypic value (zopt). However, when mutation is biased, the new mutations arising during every generation tend to change the value of the trait. The overall degree of bias in mutations (taking all pertinent loci into account) can be characterized by
, which is given in Equation 8 and is the weighted average of the degree of bias at each locus, with the weighting determined by the mutation rate at each locus. If |
|, the absolute value of
, is large, then, on average, mutations tend to cause substantial directional changes in trait values. If |
| is small, then the average effect of mutations on trait values is also small.
We have assumed that the values of the parameters that describe the mutation process are within bounds that are currently believed to be biologically realistic. We have also assumed that, at every locus, the degree of bias, |bi|, is not very large compared with the standard deviation of mutational effects, mi. Under these assumptions, phenotypic variance and mean fitness are almost unaffected by the value of
. On the other hand, the deviation of the mean phenotype from the optimum, |
- zopt|, is sensitive to the value of
. This dependency turns out, however, to be nonmonotonic. While a small amount of bias (e.g.,
slightly in excess of zero) tends to move the value of
in the direction of the bias, a point is always reached where any further increase in the degree of bias will actually bring the value of
closer to zopt (see Fig 1 for the special case of equivalent loci). Furthermore, for plausible parameter-value choices, and for all models of mutation studied here, the maximum-possible deviation of
from zopt is quite small: <1% of a phenotypic standard deviation. The small effect of bias depends on the existence of just one optimal phenotype, as assumed throughout this article. If multiple optima exist, then a small amount of bias may have very large long-term evolutionary effects (![]()
The reason for the nonmonotonic response of
to mutational bias is easiest to understand if one relaxes one of the assumptions of the model and considers the fate of mutations when the degree of bias is large at every locus (|bi| very large at all loci). The extreme degree of bias leads to an equilibrium mean phenotype,
, that is very close to zopt, at least among adults. In this case, the reason is very clear. Nearly every mutation causes such a large deviation from the optimum that it induces death before maturation. Thus, only nonmutant offspring tend to survive, and so the adults have a mean phenotype that is very close to the optimum. When the values of the bias parameters are smaller, not every mutation induces fatality, and so mutant effects can accumulate each generation. Thus, the effect of mutational bias upon the equilibrium mean phenotype is largest when the degree of bias takes on an intermediate value.
Our claim that mutational bias cannot cause a large deviation of
from zopt depends on our assumption that the standard deviation of the mutant allelic effects at locus i, namely mi, is not very small. This is embodied in the assumption m2i >> µi/s. While the assumption of a substantial value of mi is consistent with much of the data on mutation, it should be recognized that very-small-effect mutations are hard to identify, and so current estimates of mi may be much too large (![]()
![]()
![]()
(see Equation 3, Equation 6, and Equation 7, but recall that these results have been derived assuming the mi are not small).
Another point to keep in mind is that our analysis applies to very large (effectively infinite) populations. It is possible that the long-term impact of mutational bias upon phenotype is greatly enhanced when population size is small. This is because of the action of genetic drift. In a finite population mutations have dynamics that are similar to those of strictly neutral alleles if their effect on fitness is small in comparison to the reciprocal of the effective population size (![]()
In the past, models of quantitative genetics have typically assumed that the mutation process is not affected by any kind of genetic constraint. This means, for example, that the probability that a mutation will increase the effect of an allele upon a trait is independent of the premutation effect of the allele (this is the
= 1 case of our model). Our results show that, when mutation is biased, a lack of genetic constraint typically (i.e., with nonequivalent loci) leads to the evolution of ever-more-extreme allelic values at every locus that affects the trait. This is so even if the degree of bias is very small and even if mutation is biased at only one locus, with mutations at all other loci being unbiased. The reasons for this continuous change in allelic effects lie within the nonlinear mathematics describing the problem. However, we note that the neutral equilibria, described in Equivalent loci with
= 1 in RESULTS, lie at the heart of the phenomenon. It is evidently the case that for nonequivalent loci with
= 1, the neutral equilibria acquire dynamical behavior at the level of the alleles, which themselves are not directly under selection but which underlie the trait. However, there is no manifestation of this dynamical behavior at the level of the traitwhich is directly under selection.
Further insight into the processes responsible for the various results can be obtained by considering the details of the analysis, as presented in the Appendix. We note, however, that biased mutation generally induces skew into the distributions of allelic effects, with each distribution generally asymmetric about its mean. One exercise that is particularly instructive is to try to simplify the analysis, for the case
= 1, by ignoring skew in the distributions of allelic effects that become established at long times. With the neglect of skew, the equilibrium value of
- zopt
- zopt is the outcome of a dynamical balance between a term proportional to the additional selection coefficient that is induced because
- zopt
0, namely 2s x VG x (
- zopt), and the mutational input into the mean phenotypic effect per generation,
. Thus, under the neglect of skew,
- zopt is given by
. Combining this with our (numerically verified) finding that moderate mutational bias causes negligible change in the genetic variance indicates that the mean phenotypic value at equilibrium,
, no longer behaves nonmonotonically as the degree of bias is increased. Thus, ignoring skew leads to incorrect results. The behavior of the skew of the distributions of allelic effects is also strongly implicated in the continuous change in allelic effects exhibited when there are nonequivalent loci with
= 1. Thus, some of the more intriguing behavior produced by the model depends on the skew in allelic distributions that is induced by mutational bias.
Of course, a model that leads to ever-increasing (or ever-decreasing) allelic effects is obviously not biologically reasonable. There must be some constraint on the effect that alleles, at any given locus, can have on a trait. Furthermore, biased mutation is known to occur in a variety of cases (![]()
![]()
![]()
![]()
In this study, we incorporated a simple model of genetic constraint that was suggested, in the absence of bias, by ![]()
. If
= 1, then there is no constraint, while
slightly <1 means that genetic effects can become quite extreme before genetic constraint has much effect on the distribution of mutants. If
= 0, then the degree of constraint is maximal, and parental allelic values have no effect on mutant allelic values (this is the house-of-cards model of ![]()
We found that, whenever
< 1, allelic effects tend to come to an equilibrium. The equilibrium distribution of allelic effects at each locus appears to be independent of initial conditions. However, if the degree of constraint is not very large (
slightly <1), then, at equilibrium, when there are more than one nonequivalent loci, the allelic effects at each locus tend, generally, to be rather extreme. This is because extreme effects at one locus tend to be compensated by extreme, but opposing, effects at other loci. Despite this, the equilibrium values of the mean phenotype and the genetic variance are, typically, not much affected by mutational bias.
The results raise some intriguing possibilities in the realm of molecular evolution. When mutations are constrained (
< 1) the equilibrium distributions of allelic effects, at the various loci, depend on the parameters of the model. These parameters can, quite plausibly, change over time. For example, a change in temperature might affect mutation rate (![]()
< 1, small changes in the parameters that govern selection or mutation may cause very large changes in the distribution of alleles at two or more of the n loci that control the trait. This effect is most dramatic when
is close to unity.
Because changes at the two loci compensate each other, even very large changes in mean allelic effects are virtually invisible at the phenotypic level. However, the large changes in mean allelic effects imply that a great deal of molecular evolution has occurred at all loci involved. This molecular evolution can take a long time, as the time-to-equilibration seems to roughly scale as µ-1, which is as one would expect. Thus, if the per-allele mutation rate is on the order of 10-5, then, after a small environmental change, it may take many thousands of generations for loci to undergo large mutually compensatory changes before approaching equilibrium. We intend to explore these phenomena further in future studies. We also intend to explore the possible implications of this sort of "quasi-neutral" genetic change for speciation in a model of the sort studied by ![]()
![]()
![]()
To sum up, it seems that biased mutation may hold the key to understanding some of the phenomena that fascinate evolutionary biologists today. In a single large population, biased mutation seems to have little effect on phenotypes, but it might have a substantial long-term impact on molecular evolution. In a subdivided population biased mutation has the potential to magnify small environmental differences between the habitats of different subpopulations, and so it might lead to speciation and to related behavioral mechanisms of reproductive isolation. In the light of this, it would not be surprising if further investigations implicate biased mutation as a prime mover of evolution in additional areas that have not been anticipated here.
| ACKNOWLEDGMENTS |
|---|
It is a pleasure to thank Nick Barton and John Welch for some very helpful discussions. We also thank two anonymous reviewers for suggestions that have significantly improved the manuscript.
Manuscript received March 6, 2003; Accepted for publication April 5, 2003.
| APPENDIX |
|---|
Here we provide the theoretical background to the results presented in the main body of the article. We adopt the convention that unless specified to the contrary, all integrals range from -
to
.
In the model (of which a full description is given in the main text), viability of individuals of genotypic value G, namely w(G), has been taken as a quadratic function of G rather than as alternatives such as a Gaussian function. Nevertheless, if the strength of selection is small enough to satisfy sm2i < 0.05, then there is not a substantial difference between results calculated from a Gaussian w(G) and those calculated from its quadratic approximation. In particular, we have considered the typical magnitude of the quartic term that is omitted in a quadratic approximation to a Gaussian w(G). We estimate that summary statistics, such as the mean phenotypic value, the genetic variance, and the genetic load, differ by <10% between the results calculated from a Gaussian w(G) and its quadratic approximation.
In all of the analysis of this work, we have followed the classic treatments by making the approximation of global linkage equilibrium (![]()
![]()
10-5, m1,2
0.2, s = 0.025, b1,2
m1,2,
0.9) and iterated the full dynamical equation for >106 generations. The levels of linkage disequilibria observed, as measured by the correlation
-
, were <10-3 of the allelic variances,
2, at either locus in either the presence or the absence of bias. This is in accordance with what we should theoretically expect: the level of linkage disequilibrium generated by selection between any two unlinked loci depends on the product of the variances of allelic effects of the two loci (see, e.g., Equation 32 of ![]()
Let us proceed with the analysis, under the assumption that the distribution of allelic effects in gametes approaches an equilibrium solution, which we write as
(x). Numerical results are in accordance with this, except in the case of nonequivalent loci with
= 1 and for this case we provide, below, a nonequilibrium analysis.
The approximation of global linkage equilibrium (![]()
![]()
(x), being given by
. Here
j(xj) is the equilibrium distribution of allelic effects of maternal origin at locus j in offspring (it is identical to the distribution of allelic effects of paternal origin at the same locus) and is a nonnegative and normalized function:
j(xj)
0, 
j(x)dx = 1. The distribution
j(xj) obeys an effective one-locus haploid equation that arises by integrating the equilibrium equation for
(x) over allelic effects of all loci, with the exception locus i. The effective haploid equation reads
![]() |
(A1) |
Here an overbar denotes an average of the respective quantity over the distribution in zygotes, for example,
. The quantity
-
i appearing in Equation A1 represents a genetic background contribution that arises from all alleles except one of the alleles at locus i. The quantity µi is the allelic mutation rate of locus i and fi(xi -
u - bi) is the distribution of mutant allelic effects given in Equation 2 of the main text.
Equation A1 coincides, in form, with the equilibrium equations of ![]()
![]()
![]()
We can write Equation A1 in the useful form
![]() |
(A2) |
where
. Noting that fi(x)
fi(0) it follows, from Equation A2 and normalization of
i(xi), that
i
x µi/(mis). It is the smallness of
i, compared with mi, that lies at the heart of the house-of-cards approximation (![]()
i << mi). When this condition applies, the variance of mutant allelic effects is large compared with the equilibrium variance of allelic effects. Thus the allelic effect of a mutant is virtually unrelated to the parental allelic effect, and this is very similar to the exact behavior of the house-of-cards mutational model of ![]()
f(xi -
u - bi)
i(u)du in Equation A2 by f(xi - 
i - bi), leading to
![]() |
(A3) |
and the requirement of normalization determines the value of
i.
We now investigate Equation A3 for some particular cases.
Equivalent loci with
= 1:
Consider the case where
= 1 and all loci are mutationally equivalent; i.e., at loci affecting the trait, allelic mutation rates, mutational variances, and mutational biases are all given by µ, m, and b, respectively. Then Equation A3 takes the form
![]() |
(A4) |
(we omit the subscript i, on
, in the case of equivalent loci). We proceed by multiplying Equation A4 by (xi +
- zopt -
i) and integrating over all xi. Using the substitution
. Combining the integral with the integral with y
-y yields
![]() |
(A5) |
In this form, we make two additional, but well-controlled approximations that lead to errors in
- zopt of O(µ2). The first approximation is to neglect
2 within the integral. The rationale is that when |y| >>
, the presence of
is irrelevant. Furthermore, when y
we can estimate, by Taylor expanding f(y -
+ zopt - b) - f(y +
- zopt + b) to linear order in y, that neglecting
2 results in an error of O(µ
) = O(µ2). Thus neglect of
2 is well justified in Equation A5. The second approximation is to note that
- zopt is, by Equation A5, O(µ) and hence neglecting
- zopt where it appears on the right-hand side of Equation A5 again leads to errors in
- zopt, on the left-hand side, of O(µ2). As a consequence, the mean phenotypic value,
, which coincides with the mean genotypic value,
, is given by
- zopt
µ(2s)-1
y-1 [f(y - b) - f(y + b)]dy. Manipulations of this integral show
![]() |
(A6) |
where D(ß) is given by Equation 4 of the main text. In this way we arrive at Equation 3 of the main text.
An additional result follows by multiplying Equation A4 by (xi +
- zopt -
i)2, integrating over all xi, and neglecting
2, as justified above. This shows that, to leading order in µ, the allelic variance at any locus,
, is given by
µ/s, i.e., independent of locus label, i, and unchanged from its unbiased (b = 0) value. As a consequence, the genetic variance and genetic load are, to leading order in µ, unchanged from their unbiased values: VG
2nµ/s and L = 1 -
2nµ.
Note that in the above calculations, the mean allelic effects,
i, do not appear in the final results and as a consequence are not determined by the equilibrium calculations. This is an exact property of Equation A1 for equivalent loci and
= 1. It may be seen to follow from the change of variable
, which eliminates
i from the equation. This property is a manifestation of the fact that in a dynamical calculation, the constant values the
i achieve at long times are dependent on initial data and numerical solution of the dynamical equations exhibits this feature.
Equivalent loci with
< 1:
When
= 1, and all loci are mutationally equivalent, the approximate distribution
i(xi), following from (A3), takes the form
i(xi)
(µ/s)f(xi - 
i - b)[(xi +
- zopt -
i)2 +
2]-1. Following closely the analysis for the case of equivalent loci with
= 1, we find
![]() |
(A7) |
and a variance in allelic effects of
. Thus genetic variance and genetic load are, to leading order in µ, unchanged from their unbiased values: VG
2nµ/s and
.
Note that for equivalent loci but with
< 1, the
i cannot be eliminated from the equations by a coordinate transformation (unlike the case where
= 1). In particular,
i appears explicitly in Equation A7 and numerical results verify that all
i are uniquely specified at equilibrium.
Equation A7 holds for i = 1, 2, ... , n and it is plausible that the equilibrium mean allelic effects,
i, are equal at all loci and given by
/(2n) and this is numerically confirmed in all cases considered.
Since Equation A7 yields
we can replace
on the right-hand side of Equation A7) by
. This introduces only errors of O(µ2) in
- zopt and using Equation 4 we obtain Equation 6 of the main textwhich is simply Equation 3 of the main text with the "regression correction" b
b - (1 -
)zopt/(2n).
Nonequivalent loci with
< 1:
In the case where
< 1, and loci are not mutationally equivalent, the approximate distribution
i(xi), following from (A3), takes the form
i(xi)
(µi/s)fi(xi - 
i - bi)[(xi +
- zopt -
i)2 +
2i]-1. Proceeding as previously, we find numerically that the
i are not indeterminate and making the same approximations as previously, we find
so
, and
![]() |
(A8) |
where D(ß) is given in Equation 4. While similar in form to the result for equivalent loci with
< 1 (Equation 6), we note that Equation A8 is substantially more complicated since there is no simple approximation for the
i. To determine the
i, it is necessary to simultaneously solve the set of equations, Equation A8 for i = 1, 2, ... , n, supplemented with
. This is generally nontrivial because D(ß) and hence the set of equations are nonlinear. Despite this, it is possible to draw a general conclusion from Equation A8. Noting








, where 


(






