- THIS ARTICLE
-
Abstract
- Full Text (PDF)
-
All Versions of this Article:
genetics.104.035774v1
171/2/861 most recent - Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Email this article to a friend
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Zheng, Q.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Zheng, Q.
Originally published as Genetics Published Articles Ahead of Print on July 14, 2005.
Genetics, Vol. 171, 861-864, October 2005, Copyright © 2005
doi:10.1534/genetics.104.035774
Update on Estimation of Mutation Rates Using Data From Fluctuation Experiments
Qi Zheng1
Department of Epidemiology and Biostatistics, School of Rural Public Health, Texas A&M University System Health Science Center, Bryan, Texas 77802
1 Address for correspondence: Department of Epidemiology and Biostatistics, School of Rural Public Health, Texas A&M University System Health Science Center, 3000 Briarcrest, Suite 310, Bryan, Texas 77802.
E-mail: qzheng{at}srph.tamhsc.edu
This note discusses a minor mathematical error and a problematic mathematical assumption in Luria and Delbrück's (1943) classic article on fluctuation analysis. In addition to suggesting remedial measures, the note provides information on the latest development of techniques for estimating mutation rates using data from fluctuation experiments.
THE fluctuation test protocol devised by LURIA and DELBRÜCK (1943) still serves as the basis for estimating microbial mutation rates today, although later developments have resulted in much improved methods. Among the later contributions that enhanced our ability to analyze fluctuation experiments are those made by LEA and COULSON (1949), ARMITAGE (1952), CRUMP and HOEL (1974), MANDELBROT (1974), KOCH (1982), STEWART et al. (1990), MA et al. (1992), JONES et al. (1994), and many others found in a recent review (ZHENG 1999). ROSCHE and FOSTER's (2000) critical comparison of the then existing methods is a useful guide for biologists. One goal of this note is to notify the reader of the latest developments that can further help biologists improve their ability to measure mutation rates. Another goal is to discuss a minor mathematical error and a problematic mathematical assumption in LURIA and DELBRÜCK's (1943) article that have caused lingering confusion. Previous attempts to clarify the confusion were scarce and to a large extent failed to resolve some relevant practical issues. As a result, the genetics literature is increasingly fraught with mutation rates that were computed using either incorrect or unreliable methods. It appears helpful that the minor error and the problematic assumption are explained and remedial measures are provided. I begin with a paradox that has puzzled many.
In a fluctuation experiment, each of n parallel cultures is seeded at time zero with N0 nonmutant cells for incubation. At a later time T each culture has about NT nonmutant cells and the contents of each culture are plated to facilitate counting of mutants existing at time T in the n cultures. This process results in experimental data in the form of X1, X2,..., Xn, the numbers of mutants existing in the n cultures immediately before plating. If z of the n cultures still remain devoid of mutant cells at time T, LURIA and DELBRÜCK's (1943) P0 method estimates the mutation rate by
![]() | (1) |
. Although Luria and Delbrück did not give the above equation, their numerical example on page 507 clearly indicates that they used (1) to estimate mutation rates defined as "mutations per bacterium per division cycle." This definition of mutation rates has been widely accepted, and throughout this note the term "mutation rate" is used in that sense. In other words, a mutation rate is the probability that a cell undergoes a mutation during the cell's life cycle. Using the same definition, LEDERBERG (1951, p. 99) argued that mutation rates should be estimated by
![]() | (2) |
![]() |
yields an estimator of µ0 in the form of (2). Thus Equations 1 and 2, differing by a factor of log 2, aim at estimating the same quantity.
This paradox has led some to seek justifications of (1) (e.g., HAYES 1968, p. 194, and DRAKE 1970, p. 49) and others to cast doubt on it (e.g., LEA and COULSON 1949, p. 266, and KONDO 1972). No consensus has emerged. A helpful approach to clarifying this issue is to present an argument that not only reinforces the validity of (2) but also highlights what was overlooked by Luria and Delbrück in arriving at (1). Note that Luria and Delbrück used average doubling time divided by log 2 to measure time, rendering their derivation of (1) unnecessarily difficult to understand. I use clock time by introducing an explicit cell growth parameter ß. Specifically, the nonmutant population size at time t, denoted by N(t), is modeled by an exponential function N(t) = N0eßt. Thus, occurrence of mutation is assumed to be governed by a Poisson process having intensity function µN0eßt. Because the probability of a mutation occurring in a small time interval (t, t +
t) is approximately µN0eßt
t, the parameter µ is often called the probability of mutation per cell per unit time. However, except for time points t = 0 and t = tk with tk = ß1 log(k/N0) for k = 1, 2..., N(t) = N0eßt does not represent the actual population size at time t, because N(t) is a positive integer if and only if t = 0 or t = tk for some k. A literal interpretation of µ as "mutation per cell per unit time" out of the intended context can lead to unexpected results, as the following analysis demonstrates.
Consider a time interval (tk, tk+1] for an arbitrary positive integer k. As hinted above, tk+1 tk can be viewed as an interdivision time under the assumption N(t) = N0eßt. Extending an argument of KONDO (1972), I express the probability of one or more mutations occurring in that time interval is
![]() | (3) |
![]() | (4) |
![]() | (5) |
leads to an estimator identical to that given in (2). On the other hand, Luria and Delbrück's reasoning leading to (1) was based on their interpretation of µ as "a fixed small chance per unit time for each bacterium to undergo a mutation" (LURIA and DELBRÜCK 1943, p. 494). Because a cell's life cycle is considered to be (log 2)/ß under the assumption N(t) = N0eßt,
![]() | (6) |
The above discussion indicates that, from a theoretical point of view, one should choose µß as a definition of the mutation rate and use (2) instead of (1) as an estimator thereof. However, the discussion does not answer the question of whether µß agrees with the definition of a mutation rate from a practical point of view. To address this issue I ran several simulations, one of which I now report. I first simulated the numbers of mutants for 30,000 cultures. To maintain a degree of independence between my simulation and the mathematical model that gives rise to (5), I used a discrete-time cell division model based on the five assumptions given by ANGERER (2001, p. 149). Specifically, each culture starts with one nonmutant cell. At each step one cell from the culture is chosen to divide. If the culture already has x mutants and y nonmutants, then the probability that a mutant is chosen to divide is x/(x + y), and the probability that a nonmutant is chosen to divide is y/(x + y). When a mutant divides, it splits into two mutant daughter cells; when a nonmutant divides, it splits into one mutant and one nonmutant with probability p, or it splits into two nonmutants with probability 1 p. (Clearly, p thus defined agrees with the definition of a mutation rate as is commonly understood.) For each culture, this procedure is repeated until x + y = NT. In the simulation I chose p = 5 x 108 and NT = 108. I then considered the first 30 cultures as coming from the first experiment, the next 30 cultures as coming from the second experiment, and so on. For each simulated experiment, I used SALVADOR (ZHENG 2002, 2005) to find the maximum-likelihood estimate of m(T). Finally, I used the relation µß
m(T)/NT suggested by (5) to compute mutation rates. The average of these 1000 estimated mutation rates is 5.036 x 108 and the median is 4.988 x 108, indicating that µß, and hence not (log 2)µß, coincides with p. The distribution of these 1000 estimates of the mutation rate is summarized in Figure 1.
|
Another issue that has also caused lingering confusion is the use of the sample mean
in estimating mutation rates. Because the sample mean
has too large a variance to be useful in statistical inference, Luria and Delbrück introduced the concept of a "likely average" to alleviate this problem. Theoretically, the expected number of mutants in a culture can be found by solving LURIA and DELBRÜCK's (1943) Equation 6,
![]() | (7) |
(0) = 0. From standard differential equation theory it follows that
![]() | (8) |
![]() | (9) |
![]() | (10) |
![]() | (11) |
; as a result, (11) reduces to
![]() | (12) |
gives ß(T t0) = log(NT/N(t0)). Because N(t0) = 1/(nµß) from (12), it then follows that
![]() | (13) |
![]() | (14) |
![]() | (15) |
. Recently Lederberg's caution began to be appreciated, and (15) is no longer in common use.
But the idea of a likely average has become entrenched in the literature due to a popular modification of (15). Setting n = 1 and replacing the sample mean
with the sample median
yields an estimating equation
![]() | (16) |
30, a rare experimental scenario. The inadequate performance of (16) casts doubt on the usefulness of the concept of a likely average. In assessing the usefulness of a likely average, one might note that (10) is a solution of (7) satisfying the initial condition
(t0) = 0. Therefore, the likely average
in (14) can be regarded as the mean number of mutants in a culture under the assumption that cells were prevented from mutating before an arbitrarily chosen time t0. The validity of this assumption seems problematic. The concept of a "likely average" has neither theoretical nor empirical bases, and hence its usefulness in estimating mutation rates is dubious.
Now I suggest the following guidelines for mutation rate estimation. First, in the context of fluctuation experiments, the term mutation rate should be reserved for µß, because this quantity agrees with the accepted definition of a mutation rate as the probability that a cell undergoes a mutation during its life cycle. The parameter µ is a necessary mathematical device, but the term mutation rate per cell per unit time can be avoided in most biological contexts to avoid confusion. Second, published mutation rates computed using (1) should be divided by log 2; the P0 method is still needed when the number of mutants in a culture is difficult to ascertain, but the presence or absence of mutants in a culture can be determined. In applying the P0 method one should use (2). Third, the concept of a likely average is obsolete, and so are methods based on that concept, e.g., Equations 15 and 16. If a published mutation rate was computed using (16), one can use the same equation to recover the sample median
(provided NT is known); for comparison purposes two additional estimates of the mutation rate can then be obtained by using LEA and COULSON's (1949) method of the median and by applying Equation 6 of JONES et al. (1994),
![]() |
. The mutation rate µß can then be extracted from the relation µß = m(T)/(NT N0) as given in (5). Finally, as ROSCHE and FOSTER (2000) emphasized, if all n observations X1,..., Xn from a fluctuation experiment are available, the best approach for estimating a mutation rate is to use the maximum-likelihood method to estimate m(T). Use of the maximum-likelihood method was not common in the past, partly due to lack of convenient and efficient computer software written specifically for fluctuation analysis. This situation has been ameliorated by the appearance of SALVADOR, which includes most of the existing methods for fluctuation analysis. In particular, SALVADOR provides methods for analyzing experiments where mutants and nonmutants grow at different rates. Moreover, recent theoretical developments (ZHENG 2002, 2005) have made it possible to construct asymptotic confidence intervals for mutation rates. These interval estimation methods can be readily applied via SALVADOR, which is available at http://library.wolfram.com/infocenter/MathSource/5556.
ANGERER, W. P., 2001 An explicit representation of the Luria-Delbrück distribution. J. Math. Biol. 42: 145174.[CrossRef][Medline]
ARMITAGE, P., 1952 The statistical theory of bacterial populations subject to mutation. J. R. Stat. Soc. B 14: 133.
CRUMP, K. S., and D. G. HOEL, 1974 Mathematical models for estimating mutation rates in cell populations. Biometrika 61: 237252.
DRAKE, J. W., 1970 The Molecular Basis of Mutation. Holden-Day, San Francisco.
HAYES, W., 1968 The Genetics of Bacteria and Their Viruses: Studies in Basic Genetics and Molecular Biology, Ed. 2. Wiley, New York.
JONES, M. E., S. M. THOMAS and A. ROGERS, 1994 Luria-Delbrück fluctuation experiments: design and analysis. Genetics 136: 12091216.[Abstract]
KOCH, A. L., 1982 Mutation and growth rates from Luria-Delbrück fluctuation tests. Mutat. Res. 95: 129143.[CrossRef]
KONDO, S., 1972 A theoretical study on spontaneous mutation rate. Mutat. Res. 14: 365374.[Medline]
LEA, E. A., and C. A. COULSON, 1949 The distribution of the numbers of mutants in bacterial populations. J. Genet. 49: 264285.
LEDERBERG, J., 1951 Inheritance, variation, and adaptation, pp. 67100 in Bacterial Physiology, edited by C. H. WERKMAN and P. W. WILSON. Academic Press, New York.
LURIA, S. E., and M. DELBRÜCK, 1943 Mutations of bacteria from virus sensitivity to virus resistance. Genetics 28: 491511.
MA, W. T., G. VH. SANDRI and S. SARKAR, 1992 Analysis of the Luria and Delbrück distribution using discrete convolution powers. J. Appl. Probab. 29: 255267.
MANDELBROT, B., 1974 A population birth-and-mutation process, I: explicit distributions for the number of mutants in an old culture of bacteria. J. Appl. Prob. 11: 437444.[CrossRef]
ROSCHE, W. A., and P. L. FOSTER, 2000 Determining mutation rates in bacterial populations. Methods 20: 417.[CrossRef][Medline]
STEWART, F. M., D. M. GORDON and B. R. LEVIN, 1990 Fluctuation analysis: the probability distribution of the number of mutants under different conditions. Genetics 124: 175185.[Abstract]
ZHENG, Q., 1999 Progress of a half century in the study of the Luria-Delbrück distribution. Math. Biosci. 162: 132.[CrossRef][Medline]
ZHENG, Q., 2002 Statistical and algorithmic methods for fluctuation analysis with SALVADOR as an implementation. Math. Biosci. 176: 237252.[CrossRef][Medline]
ZHENG, Q., 2005 New algorithms for Luria-Delbrück fluctuation analysis. Math. Biosci. 196: 198214.[CrossRef][Medline]
Communicating editor: J. WAKELEY
This article has been cited by other articles:
![]() |
J. M. Hinz, P. B. Nham, S. S. Urbin, I. M. Jones, and L. H. Thompson Disparate contributions of the Fanconi anemia pathway and homologous recombination in preventing spontaneous mutagenesis Nucleic Acids Res., June 28, 2007; 35(11): 3733 - 3740. [Abstract] [Full Text] [PDF] |
||||
- THIS ARTICLE
-
Abstract
- Full Text (PDF)
-
All Versions of this Article:
genetics.104.035774v1
171/2/861 most recent - Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Email this article to a friend
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Zheng, Q.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Zheng, Q.



















