Genetics, Vol. 154, 1193-1201, March 2000, Copyright © 2000

Multigeneration Maximum-Likelihood Analysis Applied to Mutation-Accumulation Experiments in Caenorhabditis elegans

Peter D. Keightleya and Thomas M. Bataillonb
a Institute of Cell, Animal and Population Biology, University of Edinburgh, Edinburgh EH9 3JT, Scotland
b Institut National de la Recherche Agronomique-Station de Génétique et Amélioration des Plantes, Montpellier, Domaine de Melgueil, F 34130 Maugio, France

Corresponding author: Peter D. Keightley, Institute of Cell, Animal and Population Biology, University of Edinburgh, W. Mains Rd., Edinburgh EH9 3JT, Scotland., p.keightley{at}ed.ac.uk (E-mail)

Communicating editor: A. G. CLARK


*  ABSTRACT
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

We develop a maximum-likelihood (ML) approach to estimate genomic mutation rates (U) and average homozygous mutation effects (s) from mutation-accumulation (MA) experiments in which phenotypic assays are carried out in several generations. We use simulations to compare the procedure's performance with the method of moments traditionally used to analyze MA data. Similar precision is obtained if mutation effects are small relative to the environmental standard deviation, but ML can give estimates of mutation parameters that have lower sampling variances than those obtained by the method of moments if mutations with large effects have accumulated. The inclusion of data from intermediate generations may improve the precision. We analyze life-history trait data from two Caenorhabditis elegans MA experiments. Under a model with equal mutation effects, the two experiments provide similar estimates for U of ~0.005 per haploid, averaged over traits. Estimates of s are more divergent and average at -0.51 and -0.13 in the two studies. Detailed analysis shows that changes of mean and variance of genetic values of MA lines in both C. elegans experiments are dominated by mutations with large effects, but the analysis does not rule out the presence of a large class of deleterious mutations with very small effects.


EXPERIMENTAL estimates of rates at which mutations occur in the genome and properties of distributions of mutation effects for fitness and other life-history traits are important for several questions in population and evolutionary biology, but have proved to be extremely difficult to obtain (GARCIA-DORADO et al. 1999 Down; KEIGHTLEY and EYRE-WALKER 1999 Down; LYNCH et al. 1999 Down for recent reviews). One experimental approach to obtain information on mutation parameters is mutation accumulation (MA) in the laboratory. This involves the random accumulation of spontaneous mutations in replicated chromosomes or inbred sublines, usually over several tens of generations, followed by large-scale life-history trait assays of MA and control lines. Due to the need for a high degree of replication, the experiments tend to be both tedious and time-consuming. An important issue, therefore, is the method of parameter estimation, as an experimentalist will wish to extract the maximum amount of information from the available experimental data.

The traditional way to analyze MA experimental data is the Bateman-Mukai (BM) method of moments (BATEMAN 1959 Down; MUKAI 1964 Down). Under the assumption that mutations have equal effects, an estimate of the genomic deleterious mutation rate (U) per haploid genome per generation is

(1)

where {Delta}M is the rate of change of the mean trait value per generation and Vm is the mutational variance. This is estimated as one-half of the rate of increase in MA among line variance per generation, VL (LYNCH and HILL 1986 Down), usually obtained from an analysis of variance (ANOVA). The average homozygous mutation effect (s) is estimated from

(2)

If mutations have variable effects, (1) underestimates U and (2) overestimates s. The BM method uses only information obtained from the means and variances of the MA and control lines, but the data may contain additional information that could be used for estimating the mutation parameters. A more recent maximum-likelihood procedure (KEIGHTLEY 1994 Down, KEIGHTLEY 1998 Down) can, in principle, extract a greater amount of information from MA data and can allow the comparison of the fit to the data of different distributions of mutation effects. Monte Carlo simulations have previously suggested that the ML approach can give higher precision than the BM approach, particularly if mutation effects are large relative to the error variance (KEIGHTLEY 1998 Down).

A drawback with the ML method is that it is quite complex, and has to date been implemented only for cases with a single MA generation plus a control line. In this article we extend the ML approach to analyze data from experiments with an arbitrary number of generations, and thereby make use of all the available information including covariances between phenotypic values for the same lines at different generations. At present, our multigeneration ML method is restricted to the case of equal mutation effects, i.e., the same model as assumed by the BM method. However, the method allows the comparison of results for different experiments under the same model. We investigate the properties of the method by Monte Carlo simulation and compare its precision to the BM approach. Finally, we use the multigeneration ML procedure to analyze data on life-history traits from two recently published MA experiments with the wild-type N2 strain of Caenorhabditis elegans (KEIGHTLEY and CABALLERO 1997 Down; VASSILIEVA and LYNCH 1999 Down).


*  MATERIALS AND METHODS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Likelihood framework for several generations:
In this section we derive the likelihood function, which is appropriate when data from several generations are jointly used to estimate genome-wide mutation parameters, by extending a previously developed method (KEIGHTLEY 1994 Down). We also allow for the joint estimation of fixed effects such as assay effects that can occur if experimental assays are carried out sequentially or in blocks.

Let Zk,tj denote the phenotypic value of line k assayed after tj generations of mutation accumulation. We assume that the number of mutations fixed in the homozygous state in the line is Poisson distributed with mean {lambda}j = Utj. The mutations are assumed to have a constant additive effect denoted by s. We assume that environmental effects are normally distributed with variance Ve. The phenotypic value is, therefore,

(3)

where M is the ancestral mean, x is a Poisson deviate with parameter {lambda}j, and e is a Gaussian deviate with mean zero and variance Ve. The likelihood associated with a single line k observed at generations, say, t1 and t1 + t2 is

(4)

where ptx(i) denotes the (Poisson) probability that the line has accumulated i new homozygous mutation(s) during the course of tx generations and f is the Gaussian probability density function with mean M and variance Ve. The overall likelihood is then obtained as = {prod}Nk=1k, where N is the number of lines. Control line data can be included in the analysis by including appropriate terms in (4) with U set to zero.

This likelihood equation can be generalized to incorporate an arbitrary number of points in time where the lines are assayed. Suppose that the set of MA lines are assayed at generation t1, t1 + t2, ... , t1 + t2 + ...tT; then the likelihood associated with a line k is

(5)

If fixed effects are to be fitted, such as assay or block effects for each point in time (t1, t1 + t2, ... ), likelihood Equation 4 is modified as:

(6)

where bt1, bt1+t2 are fixed effects to be estimated jointly with U and s. Note recorder effects could also be included by replacing the bt1, bt1+t2 terms in (6) with terms such as r1, r2 ... for each recorder. If several replicates are assayed to estimate the genetic value of a MA line the likelihood equations can be modified easily to incorporate this detail (FRY et al. 1999 Down), although we restrict our analysis of simulated and real data to line means (see below).

Simulation protocol:
To compare the precision of the BM estimator and the ML multigeneration estimators, we carried out Monte Carlo simulations of MA experiments. To a good approximation, the size of a MA experiment can be characterized by its "heritability" at the line level h2L = + VL (GARCIA-DORADO 1997 Down), where VL represents the between-line variance after t generations of mutation accumulation and VEL represents the error variance of line averages. For the majority of cases, we assumed that h2L was 5/6. This value is somewhat lower than the precision of spontaneous MA and EMS mutagenesis experiments carried out in Drosophila with balancer chromosomes (MUKAI et al. 1972 Down; OHNISHI 1977 Down) and somewhat higher than the precision of the two C. elegans experiments analyzed here. We assumed 80 generations of mutation accumulation, 100 MA and control lines, and chose several widely varying values for U and s, and adjusted VEL to obtain the desired heritability level. For each combination of parameters, 100–1000 simulations were performed. We carried out the above ML analysis and also obtained BM estimates from Equation 1 and Equation 2 by estimating {Delta}M and Vm by regression of phenotypic means and variances on generation number.

In addition, we investigated the performance of the BM and ML procedures with data in which mutation effects are exponentially distributed; thus data are analyzed under the "wrong" model of equal effects. For each U, s combination simulated under the constant mutation effects model, we performed simulations with the same U and mean mutation effect = s, assuming an exponential distribution, and with VEL adjusted to achieve the same heritability at the line level.

Likelihood maximization:
ML maximization was carried out using the simplex algorithm (NELDER and MEAD 1965 Down). For the analysis of simulated data, starting values for likelihood maximization were obtained from the data. A linear search strategy was employed in which a set of initial maximizations was performed over a broad range of 10 fixed values of U, varying by a factor of 2–5 about its expected value. Within each of these runs, starting values for s were obtained by fitting the observed change in mean phenotypic value of the lines in the final generation, and starting values of M and VE were computed from the control line data. The simplex was then restarted with the U, s, M, VE combination that gave highest likelihood during the initial linear search through the parameter space. Spurious convergence was checked by restarting the routine after convergence until no further improvement in fit occurred. In the analysis of the C. elegans data, starting values for assay effects or other fixed effects and M and VE were calculated from the control line data. To verify that the global ML had been reached, several sets of starting values for U, s, M, VE, and fixed effects were investigated. Checks were also performed using the grid search strategy described above. Approximated standard errors were obtained from the curvature of profile likelihoods about their maxima (WEIR 1996 Down, ch. 2). A C computer program to carry out the likelihood calculations is available on request.

C. elegans data sets:
KEIGHTLEY and CABALLERO 1997 Down and VASSILIEVA and LYNCH 1999 Down carried out MA experiments with the C. elegans strain N2 for 60 and 49 generations, respectively, and with 50 and 100 MA lines, respectively, and measured several life history traits. Reproductive output is lifetime output of viable progeny (KEIGHTLEY and CABALLERO 1997 Down) or output for the first 4 days of reproduction (VASSILIEVA and LYNCH 1999 Down). This fitness measure includes the viability of the parents. Longevity was assayed by similar methods in the two experiments, although there were slight differences in the criteria used to score a worm as alive or dead. Daily reproductive output and numbers of parental worms alive were used by VASSILIEVA and LYNCH 1999 Down to calculate replicate-specific intrinsic growth rate, r, by methods described by CHARLESWORTH 1994 Down. Here, we performed similar calculations to obtain estimates of r for the KEIGHTLEY and CABALLERO 1997 Down data. VASSILIEVA and LYNCH 1999 Down include several other derived traits, related to r, that are highly correlated with it. We carried out ML analysis with untransformed data for line means, or the Box-Cox procedure was used to select the power transformation that best achieved normality, and the data for individual values were transformed accordingly (SOKAL and ROHLF 1995 Down, ch. 13), and then line means were calculated. In the cases of r and longevity, for which between-control line variance is nonsignificant (VASSILIEVA and LYNCH 1999 Down), tests for nonnormality were carried out using all available control line data, after correction for assay effects, estimated by a REML analysis (GENSTAT 5 COMMITTEE 1993). In the case of productivity, which shows evidence of control line variance (VASSILIEVA and LYNCH 1999 Down), the test was carried out on residuals after correction for line means and assay effects. The distribution of the VASSILIEVA and LYNCH 1999 Down control data for r is significantly negatively skewed: under the Box-Cox power transformation, the value of the exponent is 3.85 [P < 0.001; see SOKAL and ROHLF 1995 Down, p. 417], so these data were transformed. The effect of the transformation is to increase s and decrease Û. Estimates of the Box-Cox exponent for productivity and longevity are nonsignificantly different from 1.

Analysis under a variable mutation-effects model:
A multigeneration ML procedure along the lines described above to estimate parameters of models with variable mutation effects was found to be computationally intractable at the present time. To test for evidence of variability among mutation effects in the C. elegans data, therefore, we analyzed the last generation of MA line data plus control data (line means) with a single-generation ML procedure (KEIGHTLEY 1998 Down). The fit to the data of several gamma distributions (shape parameter ß, scale {alpha}) ranging from equal mutation effects (ß -> {infty}) to more leptokurtic distributions (ß < 1) was compared.


*  RESULTS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

ML analysis of simulated MA experiments with equal mutation effects:
Simulation results are summarized in Table 1 for values of U and corresponding values of s such that VL remains constant and = 5 (see MATERIALS AND METHODS). Over the range of parameter values simulated, BM and ML give mean estimates close to the simulated values. As expected, if the effect of each mutation is small compared to the environmental standard deviation of line means ({sigma}eL, 0.0566 in this case), ML and BM estimators perform similarly. However, if the effects of mutations are relatively large (s > {sigma}eL), the ML estimator makes more efficient use of the information available and can have a much smaller variance than the BM estimator (see variance of the estimators empirically determined through Monte Carlo simulations in Table 1). The difference in precision becomes smaller if the experiment is noiser (e.g., = 2; data not shown). Both ML and BM estimators can become unstable and give infinite variance among estimates if VL/VEL falls below ~1.


 
View this table:
In this window
In a new window

 
Table 1. Simulation results: equal mutation effects

The variances of the estimates shown in Table 1 also suggest that an increase in precision can be obtained with the ML estimator (but much less so with the BM procedure) by including extra generations in the analysis. Again this suggests that ML makes more efficient use of the information available. The effect can increase dramatically as the number of generations assayed and/or the magnitude of mutation effect increases. DENG and FU 1998 Down and DENG et al. 1999 Down in their simulation studies concluded that adding extra generations does not reduce estimation variance, but this only applies to the BM procedure (see Table 1, variance of BM estimates for U and s).

ML analysis of simulated MA experiments with variable mutation effects:
Simulation results are reported in Table 2. If data simulated under an exponential distribution of mutation effects model are analyzed under the assumption of constant effects, both methods give estimates of U (s) that are biased downward (upward) by a factor B. Empirically we find that B {approx} 1 + . This was already known for the BM estimator (see, e.g., CROW and SIMMONS 1983 Down; LYNCH and WALSH 1998 Down, p. 341). As in the simulations with equal mutation effects (Table 1), if mutation effects are small relative to the environmental standard deviation, both estimators perform essentially identically [see variance (Var) or mean square error (MSE) of estimates, Table 2]. In cases where the mean mutation effect is very large, ML is slightly less biased than BM, and this leads to an increase in precision in terms of the MSE. The improvement in precision is more apparent for estimates of s than for U.


 
View this table:
In this window
In a new window

 
Table 2. Simulation results: exponentially distributed mutation effects

Multigeneration ML analysis of C. elegans MA experiments:
Data for productivity, r, and longevity of the VASSILIEVA and LYNCH 1999 Down and KEIGHTLEY and CABALLERO 1997 Down MA experiments were analyzed by the multigeneration ML method (Table 3 and Table 4). Both experiments employed control lines that had been kept frozen, and data from these were included. KEIGHTLEY and CABALLERO 1997 Down carried out repeat assays of generation 32 and 60 MA plus control lines. VASSILIEVA and LYNCH 1999 Down carried out MA line assays contemporaneously with the control at 4 different generations (7, 20, 30, and 49) and at generation 0 (with the exception of longevity), so we include assay effects along with the other parameters. Each generation was therefore allowed to have a different mean that was common to control and MA lines. We also investigated models where, in addition, an effect was included to allow MA and control lines to have different means. In the likelihood evaluation, the variance of a line mean was inversely proportional to the number of worms assayed.


 
View this table:
In this window
In a new window

 
Table 3. ML and BM estimates of U and s from data of VASSILIEVA and LYNCH 1999 Down


 
View this table:
In this window
In a new window

 
Table 4. ML and BM estimates of U and s from data of KEIGHTLEY and CABALLERO 1997 Down

ML and BM estimates of genome-wide mutation rates and average mutation effects for three traits in VASSILIEVA and LYNCH's (1999) experiment are shown in Table 3. Estimates of s are scaled relative to the control population mean. Standard errors (SEs) of estimates are much smaller under ML than BM, a result we also obtained in the Monte Carlo simulation experiments, although the improvement in precision is larger than we expected on the basis of the simulations. Defining precision as the squared coefficient of variation of an estimate, ML is 24 times and 69 times more precise than BM, on average, for U and s, respectively [ VASSILIEVA and LYNCH 1999 Down data]. The inclusion of assay effects in the model may explain a large part of the improvement in precision, because the increase in log likelihood of the model containing these effects was very large (Table 5). The standard errors for the estimates were obtained from profile likelihood curves (Fig 1). In the case of s for all traits, and U for productivity and r, these curves are of quadratic form and reasonably symmetrical, and support limits based on a drop in log likelihood of 2 from their maxima are ~ ±2 SEs from the maximum-likelihood estimates. The profile likelihood for U in the case of longevity is strongly asymmetrical, and +2 SE gives a poor estimate of the upper support limit of ~0.03.



View larger version (20K):
In this window
In a new window
Download PPT slide
 
Figure 1. Profile likelihoods for the three traits analyzed (data from VASSILIEVA and LYNCH 1999 Down) as functions of U (a) and s (b).


 
View this table:
In this window
In a new window

 
Table 5. ML estimates of assay effects for VASSILIEVA and LYNCH 1999 Down data

An effect of freezing worms could influence the VASSILIEVA and LYNCH 1999 Down results, because control lines were maintained frozen and MA lines were never frozen [both control and MA lines were cryopreserved in KEIGHTLEY and CABALLERO 1997 Down]. To investigate this effect, ML analysis was carried out with an effect for freezing included. Although the effect is significant for all three traits (P < 0.001), the parameter estimates are hardly affected: for example, in the case of r, Û = 0.0053 and s = -0.55.

Models with variable mutation effects:
The line mean data from the last MA generation along with the control data were analyzed by ML under the assumption that mutations effects are gamma distributed with scale and shape parameters {alpha} and ß, respectively. In the analysis of the VASSILIEVA and LYNCH 1999 Down data (generation 49), all the control data were included, and effects specific to each block of control assays were estimated. Because there was also a significant effect for freezing (see above), an MA/control line effect was also estimated. Estimates of U and the mean mutation effect = from such analyses are shown in Table 6. To simplify the interpretation of the results, the analysis was carried out for several ß models including the equal effects model (ß -> {infty}).


 
View this table:
In this window
In a new window

 
Table 6. ML estimates of U and the mean mutation effect assuming gamma distributions of mutation effects

For productivity, somewhat surprisingly, in both experiments log likelihood decreases as the kurtosis of the assumed gamma distribution increases, i.e., the best-fitting gamma distribution is the limiting case of equal mutation effects with ß -> {infty}. Distributions much more leptokurtic than a gamma distribution with shape parameter 1 (i.e., an exponential distribution) are inconsistent with the data on the basis of likelihood-ratio tests. For longevity and r, log likelihood for different ß models changes nonsignificantly, so these data contain little information that can allow different distributions of mutation effects to be distinguished.


*  DISCUSSION
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Simulation experiments:
Over the range of parameter values studied, if the data conform to the model assumed, the ML and BM procedures give mean parameter estimates close to the simulated parameter values. If mutation effects are relatively small there is little benefit from using ML over the BM method of moments. However, if an appreciable fraction of the genetic variance is contributed by mutations with relatively large effects, ML can produce estimates with substantially lower variances than BM. Presumably, the distribution of MA line values will depart from a normal distribution, and replicates within lines may consistently deviate, so there is information to be extracted from the line value distribution in addition to the first and second moments. Furthermore, individual lines will show "jumps" between generations, and again the ML procedure will use this information. Hence there is a benefit from including additional intermediate generations. The opposite conclusion was drawn by DENG and FU 1998 Down and DENG et al. 1999 Down, on the basis of analysis restricted to the BM procedure. It remains to be investigated if, for a fixed number of experimental units available, the best design would involve the assay of several MA generations or rather a single generation assay involving greater replication. This trade-off may be particularly important for organisms where a single large common garden experiment would be the rule (such as plants).

We also explored the robustness of the constant mutation effects model by simulating data sets in which effects of mutation are exponentially distributed. As with the case of data simulated with equal mutation effects, the BM and ML procedures perform similarly if mutation effects are small relative to the environmental standard deviation. Both methods also show similar levels of bias in this situation. If average effects of mutations are large, ML tends to be less biased and shows a higher level of precision than BM, as measured either in terms of the among-estimate variance or mean square error. As with the case of equal mutation effects, the difference in precision between ML and BM increases as the number of intermediate generations included in the analysis increases, and the effect is more apparent for the mean mutation effect than for U.

Improvement of precision under ML in analysis of C. elegans data:
In both C. elegans MA experiments, ML estimates have considerably smaller sampling variances than corresponding BM estimates. The simulation studies suggest that an improvement in precision is to be expected in general (Table 1 and Table 2), due to a more efficient used of information, but the improvement turned out to be larger than we expected on the basis of the simulations. There are three factors that may account for this:

  1. In the experiments, there were lines that deviated by several standard deviations from the control means and probably carried mutations with large effects. Data of this sort lead to the greatest improvement of ML over BM.

  2. The fitting of assay effects, which are large and significant (Table 5), removes much of the noise that clouds the results from the regression analysis. This is probably the most important factor.

  3. The model of equal mutation effects appears to give a good fit to the data, at least in explaining the major effect mutations (Table 6), and the improvement in precision is expected to be greatest in this case.

C. elegans MA experiments:
The negative estimates of s for productivity are in line with expectation and are in accord with the negative estimates for the mean mutation effect on r, a highly correlated trait. For longevity, VASSILIEVA and LYNCH 1999 Down obtained a significant erosion of the mean ({Delta}M) and estimates for U and s of 0.064 and -0.048, respectively. However, due to a discrepancy caused by a single data point (generation 49 for the MA lines), our regression estimate of {Delta}M is about two-thirds of VASSILIEVA and LYNCH's (1999), and our BM estimates of U and s are consequently about 2 times smaller and 1.5 times greater, respectively (Table 2). The data file provided to us contains the most meaningful measure of longevity, and furthermore the level of mutational decay for longevity observed up to generation 49 has not been seen in later generations (M. LYNCH and L. VASSILIEVA, personal communication). ML estimates of U and s for longevity are 0.0040 and -0.26, respectively. In terms of mutational target sizes, the conclusion from the two MA experiments taken together is r ~= productivity > longevity.

Overall, the ML estimates for the two C. elegans MA experiments agree with one another reasonably well (Table 3 and Table 4). Taking an average over traits, estimates of U per haploid genome are 0.0041 (VASSILIEVA and LYNCH 1999 Down) and 0.0057 (KEIGHTLEY and CABALLERO 1997 Down). Average estimates of s are -0.51 VASSILIEVA and LYNCH 1999 Down and -0.13 (KEIGHTLEY and CABALLERO 1997 Down), so VASSILIEVA and LYNCH 1999 Down accumulated mutations with significantly larger effects, particularly for productivity and r. VASSILIEVA and LYNCH 1999 Down discuss the possibility that natural selection strongly affected the KEIGHTLEY and CABALLERO 1997 Down results. However, the estimates of mean mutation effects rather than numbers of mutations detected differ more strongly. An alternative explanation for this difference is a difference in environmental conditions, because experimental conditions resulted in lower productivity than is typical for the N2 strain in the case of VASSILIEVA and LYNCH 1999 Down, e.g., JOHNSON and HUTCHINSON 1993 Down. Furthermore, natural selection is ineffective in eliminating all but strongly deleterious mutations if MA lines are propagated by transferring individuals each generation (KIBOTA and LYNCH 1996 Down; KEIGHTLEY and CABALLERO 1997 Down).

The ML estimates of the mutation effect parameter are surprisingly high, particularly for productivity and r in the case of VASSILIEVA and LYNCH's (1999) data set, and may be influenced by extreme lines with low mean fitness. Visual inspection of the data suggested this to be the case: a minority of lines had consistently low fitness across several generations. A scatter plot of the rank of the line means for productivity at the last two generations (30 and 49) gives an indication of the extent of contribution of such extreme lines (Fig 2). Over most of the plot, points seem to be distributed at random, suggesting little covariance between rank across generations, but there is a deficit of points at the left-hand and lower edges along with the suggestion of an excess of lines that rank low at both generations. We further investigated the contribution of the low-ranking lines to the U and s estimates by excluding subsets of extreme lines with mean phenotype <50% of the control population mean. [cf. MUKAI et al. 1972 Down, who performed a similar procedure]. The effect of excluding extreme lines is to somewhat reduce estimates of both U and s, but the latter estimates are still surprisingly high (e.g., for productivity, s ~ -0.5, data not shown).



View larger version (17K):
In this window
In a new window
Download PPT slide
 
Figure 2. Scatter plot of rank of line averages for productivity in generations 30 and 49 (VASSILIEVA and LYNCH 1999 Down).

Nature of mutational variability for life-history traits in C. elegans:
Taking an average over traits, the ML analysis of VASSILIEVA and LYNCH's (1999) data provides an estimate for U more than five times smaller than the corresponding average BM estimate. By the criterion of comparing Drosophila melanogaster and C. elegans on the basis of the sizes of their genomes (measured by the number of base pairs), and taking into account the lower number of germ line cell divisions in C. elegans than D. melanogaster, VASSILIEVA and LYNCH 1999 Down argue that their average estimate of U (per haploid) of 0.025 is only ~2-fold smaller than an average U estimate for D. melanogaster from MA experiments with balancer chromosomes [~0.3 per haploid genome; see SIMMONS and CROW 1977 Down]. However, the average ML estimate of U (~0.005 per haploid) is five times smaller again, and leads us to conclude that the mutation rates differ ~10-fold. If extreme lines are excluded from the ML analysis, this difference increases. Furthermore, comparing two genomes on the basis of numbers of base pairs may be inappropriate, because the more compact C. elegans may contain less redundant DNA than D. melanogaster, and current estimates of gene number in C. elegans and D. melanogaster are similar (SIMMEN et al. 1998 Down; ASHBURNER et al. 1999 Down). The estimates of mean mutation effects from the two C. elegans studies and the D. melanogaster experiments involving balancer chromosomes of MUKAI 1964 Down, MUKAI et al. 1972 Down, and OHNISHI 1977 Down evaluated under the same model of equal mutation effects point to a qualitative difference in mutation spectra between these organisms. The D. melanogaster balancer experiments are characterized by a mutational erosion for a major fitness component (competitive viability) caused by a high rate of mutations with effects of only a few percent. This is manifest in a rapid decline in viability of "quasinormal" chromosomes. In contrast, average effects of mutations for the primary fitness traits (productivity and r) in C. elegans are estimated to be -0.62 (VASSILIEVA and LYNCH 1999 Down) and -0.15 (KEIGHTLEY and CABALLERO 1997 Down), with small ML standard errors. Thus, we argue that the decay in life-history trait mean and increase in genetic variance in C. elegans seem to be mostly attributable to mutations with relatively large effects. This can be clearly seen in the frequency distributions of control and MA lines (KEIGHTLEY and CABALLERO 1997 Down, Figure 1; VASSILIEVA and LYNCH 1999 Down, Figure 2), which show little appreciable loss in fitness of quasinormal lines. A possible explanation for this difference in behavior is that active systems of transposable elements, present in D. melanogaster but absent from C. elegans strain N2 (EIDE and ANDERSON 1985 Down), could generate a different distribution of mutational effects than non-TE mutations, although other explanations have also been proposed (KEIGHTLEY 1996 Down; GARCIA-DORADO 1997 Down; FRY et al. 1999 Down).

The estimates for U we have obtained for C. elegans are extremely small. However, most of the analysis here has assumed that mutations have equal effects and produces an estimate of an "effective" number of mutations similar to the effective number of loci influencing a quantitative trait that can be estimated from line crosses (FALCONER and MACKAY 1996 Down). The estimates presented here from the equal-effects ML analysis can therefore be taken only as an index of the number of mutations that make a large change to life-history trait values in the conditions assayed. However, the results of the two C. elegans MA experiments suggest that mutations with small effects make only a very small contribution to a mutational decay of life-history traits in the laboratory, a conclusion supported by the analysis of a recent ethylmethane sulfonate mutagenesis experiment in C. elegans (DAVIES et al. 1999 Down). The question of the generality of mutational theories of population extinction (LANDE 1995 Down; LYNCH et al. 1995 Down), which depend critically on the distribution of mutation effects, will therefore depend on further work in a variety of species.


*  ACKNOWLEDGMENTS

We thank Mike Lynch and Larissa Vassilieva for kindly providing the data from their mutation-accumulation experiment. We are grateful to Deborah Charlesworth, Esther Davies, Bruno Goffinet, Bill Hill, Armando Caballero, Mark Kirkpatrick, Brian Charlesworth, Ruth Shaw, Dan Schoen, Mike Lynch, and an anonymous referee for helpful comments on the manuscript. T.B. acknowledges support from the European Science Foundation (Plant Adaptation Program) and the French Institut National de la Recherche Agronomique. P.K. acknowledges support from the Royal Society of London.

Manuscript received February 22, 1999; Accepted for publication November 23, 1999.


*  LITERATURE CITED
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

ASHBURNER, M., S. MISRA, J. ROOTE, S. E. LEWIS, and R. BLAZEJ et al., 1999  An exploration of the sequence of a 2.9-Mb region of the genome of Drosophila melanogaster: the Adh region. Genetics 153:179-219[Abstract/Free Full Text].

BATEMAN, A. J., 1959  The viability of near-normal irradiated chromosomes. Int. J. Radiat. Biol. 1:170-180.

CHARLESWORTH, B., 1994 Evolution in Age Structured Populations, Ed. 2. Cambridge University Press, Cambridge, United Kingdom.

CROW, J. F., and M. J. SIMMONS, 1983 The mutation load in Drosophila, pp. 1–35 in The Genetics and Biology of Drosophila, Vol. 3C, edited by M. ASHBURNER, H. L. CARSON and J. N. THOMPSON. Academic Press, London.

DAVIES, E. K., A. D. PETERS, and P. D. KEIGHTLEY, 1999  High frequency of cryptic deleterious mutations in Caenorhadbitis elegans.. Science 285:1748-1751[Abstract/Free Full Text].

DENG, H.-W. and Y.-X. FU, 1998  On the three methods for estimating deleterious genomic mutation parameters. Genet. Res. 71:223-236[Medline].

DENG, H.-W., J. LI, and J.-L. LI, 1999  On the experimental design and data analysis of mutation accumulation experiments. Genet. Res. 73:147-164[Medline].

EIDE, D. and P. ANDERSON, 1985  The gene structures of spontaneous mutations affecting a Caenorhabditis elegans myosin heavy-chain gene. Genetics 109:67-79[Abstract/Free Full Text].

FALCONER, D. S., and T. F. C. MACKAY, 1996 Introduction to Quantitative Genetics, Ed. 4. Longman Scientific and Technical, Harlow, Essex, United Kingdom.

FRY, J. D., P. D. KEIGHTLEY, S. L. HEINSOHN, and S. V. NUZHDIN, 1999  New estimates of the rates and effects of mildly deleterious mutation in Drosophila melanogaster.. Proc. Natl. Acad. Sci. USA 96:574-579[Abstract/Free Full Text].

GARCIA-DORADO, A., 1997  The rate and effects distribution of viability mutation in Drosophila: minimum distance estimation. Evolution 51:1130-1139.

GARCIA-DORADO, A., C. LOPEZ-FANJUL, and A. CABALLERO, 1999  Properties of spontaneous mutations affecting quantitative traits. Genet. Res. in press.

GENSTAT 5 COMMITTEE, 1993 Genstat 5 Release 3 Reference Manual. Clarendon Press, Oxford.

JOHNSON, T. E. and E. W. HUTCHINSON, 1993  Absence of strong heterosis for life span and other life history traits in Caenorhabditis elegans.. Genetics 134:465-474[Abstract].

KEIGHTLEY, P. D., 1994  The distribution of mutation effects on viability in Drosophila melanogaster.. Genetics 138:1315-1322[Abstract].

KEIGHTLEY, P. D., 1996  Nature of deleterious mutation load in Drosophila.. Genetics 144:1993-1999[Abstract].

KEIGHTLEY, P. D., 1998  Inference of genome-wide mutation rates and distributions of mutation effects for fitness traits: a simulation study. Genetics 150:1283-1293[Abstract/Free Full Text].

KEIGHTLEY, P. D. and A. CABALLERO, 1997  Genomic mutation rates for lifetime reproductive output and lifespan in Caenorhabditis elegans.. Proc. Natl. Acad. Sci. USA 94:3823-3827[Abstract/Free Full Text].

KEIGHTLEY, P. D. and A. EYRE-WALKER, 1999  Terumi Mukai and the riddle of deleterious mutation rates. Genetics 153:515-523[Abstract/Free Full Text].

KIBOTA, T. T. and M. LYNCH, 1996  Estimate of the genomic mutation rate deleterious to overall fitness in E. coli.. Nature 381:694-696[Medline].

LANDE, R., 1995  Mutation and conservation. Conserv. Biol. 9:782-791.

LYNCH, M. and W. G. HILL, 1986  Phenotypic evolution and neutral mutation. Evolution 49:915-935.

LYNCH, M., and B. WALSH, 1998 Genetics and Analysis of Quantitative Traits. Sinauer Associates, Sunderland, MA.

LYNCH, M., J. CONERY, and R. BURGER, 1995  Mutation accumulation and the extinction of small populations. Am. Nat. 146:489-581.

LYNCH, M., J. BLANCHARD, D. HOULE, T. KIBOTA, and S. SCHULTZ et al., 1999  Perspective: spontaneous deleterious mutation. Evolution 53:645-663.

MUKAI, T., 1964  The genetic structure of natural populations of Drosophila melanogaster. I. Spontaneous mutation rate of polygenes controlling viability. Genetics 50:1-19[Free Full Text].

MUKAI, T., S. I. CHIGUSA, L. E. METTLER, and J. F. CROW, 1972  Mutation rate and dominance of genes affecting viability in Drosophila melanogaster.. Genetics 72:333-355.

NELDER, J. A. and R. MEAD, 1965  A simplex method for function minimization. Comput. J. 7:308-313.

OHNISHI, O., 1977  Spontaneous and ethyl methanesulfonate-induced mutations controlling viability in Drosophila melanogaster. II. Homozygous effect of polygenic mutations. Genetics 87:529-545[Abstract/Free Full Text].

SIMMEN, M. W., S. LEITGEB, V. H. CLARK, S. J. M. JONES, and A. BIRD, 1998  Gene number in an invertebrate chordate, Ciona intestinalis.. Proc. Natl. Acad. Sci. USA 95:4437-4440[Abstract/Free Full Text].

SIMMONS, M. J. and J. F. CROW, 1977  Mutations affecting fitness in Drosophila populations. Annu. Rev. Genet. 11:49-78[Medline].

SOKAL, R. R., and F. J. ROHLF, 1995 Biometry, Ed. 3. W. H. Freeman, New York.

VASSILIEVA, L. and M. LYNCH, 1999  The rate of spontaneous mutation for life-history traits in Caenorhabditis elegans.. Genetics 151:119-129[Abstract/Free Full Text].

WEIR, B. S., 1996 Genetic Data Analysis II. Sinauer, Sunderland, MA.




This article has been cited by other articles:


Home page
GeneticsHome page
D. Ostrow, N. Phillips, A. Avalos, D. Blanton, A. Boggs, T. Keller, L. Levy, J. Rosenbloom, and C. F. Baer
Mutational Bias for Body Size in Rhabditid Nematodes
Genetics, July 1, 2007; 176(3): 1653 - 1661.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
R. Sanjuan, A. Moya, and S. F. Elena
The distribution of fitness effects caused by single-nucleotide substitutions in an RNA virus
PNAS, June 1, 2004; 101(22): 8396 - 8401.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
A. Garcia-Dorado and A. Gallego
Comparing Analysis Methods for Mutation-Accumulation Data: A Simulation Study
Genetics, June 1, 2003; 164(2): 807 - 819.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
A. D. Cutter and B. A. Payseur
Selection at Linked Sites in the Partial Selfer Caenorhabditis elegans
Mol. Biol. Evol., May 1, 2003; 20(5): 665 - 673.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
R. B. R. Azevedo, P. D. Keightley, C. Lauren-Maatta, L. L. Vassilieva, M. Lynch, and A. M. Leroi
Spontaneous Mutational Variation for Body Size in Caenorhabditis elegans
Genetics, October 1, 2002; 162(2): 755 - 765.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
D. Chavarrias, C. Lopez-Fanjul, and A. Garcia-Dorado
The Rate of Mutation and the Homozygous and Heterozygous Mutational Effects for Competitive Viability: A Long-Term Experiment With Drosophila melanogaster
Genetics, June 1, 2001; 158(2): 681 - 693.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
A. D. Peters and P. D. Keightley
A Test for Epistasis Among Induced Mutations in Caenorhabditis elegans
Genetics, December 1, 2000; 156(4): 1635 - 1647.
[Abstract] [Full Text]


Home page
GeneticsHome page
P. D. Keightley, E. K. Davies, A. D. Peters, and R. G. Shaw
Properties of Ethylmethane Sulfonate-Induced Mutations Affecting Life-History Traits in Caenorhabditis elegans and Inferences About Bivariate Distributions of Mutation Effects
Genetics, September 1, 2000; 156(1): 143 - 154.
[Abstract] [Full Text]