- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Schneider, S.
- Articles by Excoffier, L.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Schneider, S.
- Articles by Excoffier, L.
Estimation of Past Demographic Parameters From the Distribution of Pairwise Differences When the Mutation Rates Vary Among Sites: Application to Human Mitochondrial DNA
Stefan Schneidera and Laurent Excoffieraa Genetics and Biometry Laboratory, Department of Anthropology and Ecology, University of Geneva, CP 511 1211 Geneva 24, Switzerland
Corresponding author: Laurent Excoffier, Genetics and Biometry Lab, Department of Anthropology and Ecology, University of Geneva, CP 511 1211 Geneva 24, Switzerland., laurent.excoffier{at}anthro.unige.ch (E-mail)
Communicating editor: G. B. GOLDING
| ABSTRACT |
|---|
Distributions of pairwise differences often called "mismatch distributions" have been extensively used to estimate the demographic parameters of past population expansions. However, these estimations relied on the assumption that all mutations occurring in the ancestry of a pair of genes lead to observable differences (the infinite-sites model). This mutation model may not be very realistic, especially in the case of the control region of mitochondrial DNA, where this methodology has been mostly applied. In this article, we show how to infer past demographic parameters by explicitly taking into account a finite-sites model with heterogeneity of mutation rates. We also propose an alternative way to derive confidence intervals around the estimated parameters, based on a bootstrap approach. By checking the validity of these confidence intervals by simulations, we find that only those associated with the timing of the expansion are approximately correctly estimated, while those around the population sizes are overly large. We also propose a test of the validity of the estimated demographic expansion scenario, whose proper behavior is verified by simulation. We illustrate our method with human mitochondrial DNA, where estimates of expansion times are found to be 1020% larger when taking into account heterogeneity of mutation rates than under the infinite-sites model.
WITH the advent of the coalescent theory (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
When exposed to the evidence of a past demographic expansion, one might want to estimate the parameters of the expansion, such as the time at which it occurred and its magnitude, but the choice of parameters to be estimated depends on a particular scenario of population growth one might choose, such as exponential growth, logistic growth, or an instantaneous stepwise population size change. These three models are obviously related but have rarely been compared (but see ![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
We thus propose in this article to extend the model of ![]()
| THEORY AND METHODS |
|---|
The mismatch distribution under the infinite-sites model:
We assume that t generations ago, a population at equilibrium of size N0 entered a demographic expansion phase to instantaneously reach a new size N1 and that it remained at that size ever since. Under this demographic scenario described in Figure 1 and assuming that every new mutation occurs at a new site [the infinite-sites mutation model of ![]()
![]()
![]() |
(1) |
where
0 = 2N0u,
1 = 2N1u,
= 2ut, and u is the total mutation rate per generation per gene. Here, Fi(
) is also the probability of observing i differences between two genes in an equilibrium population as (![]()
![]() |
(2) |
|
![]()
0,
1, as well as the expansion time
directly from this mismatch distribution.
The mismatch distribution under a finite-sites model:
Under the finite-sites model, F
i(
1,
0,
) provides the distribution of the number of mutations having occurred during the evolution of a random pair of genes. Note that this number can be equal to or larger than the number of observed differences, depending on whether the same site has been hit several times by mutations or not. In this case, the expected mismatch probability distribution noted by Fmi(
1,
0,
) can be obtained by taking into account those cases where j mutations (j
i) have led to exactly i differences, as
![]() |
(3) |
where Hm(i, j) is the conditional probability of observing i differences given j mutations have occurred in the ancestry of two sequences of length m. We now describe how to obtain these conditional probabilities Hm(i, j) starting with m = 1 and extending it to a sequence of arbitrary length.
One-site case:
We first solve the problem for one site (m = 1) assuming Kimura's two-parameters model of mutation (![]()

where the elements of each row add up to one. The jth power of the matrix M can be used to describe the impact of j mutations at that site. A closed-form expression for Mj can be conveniently obtained by a diagonalization of M, as M = VDV-1, where D = diag {-s, -s, v + s, -v + s} and V is a matrix where the columns are the eigenvectors of M.
We thus obtain

where
j =
(1 + 2(-s)j + (s - v)j), ßj =
(1 - (s - v)j), and
j =
(1 - 2(-v)j + (s - v)j). The diagonal terms of Mj, here all equal to
j, represent the probability of returning to the initial state after j mutations. It thus follows that
![]() |
(4) |
Multisite case, homogeneous mutation rates:
Instead of deriving an explicit equation for Hm(i, j), when m > 1 we can compute these probabilities numerically using a recurrence equation, as shown below. Let us suppose that we have already derived the probability Hm-1(i, j) and that we want to study the case for an additional site and thus derive Hm(i, j). Suppose that l mutations have occurred at the mth site and that the (j - l) remaining mutations have occurred at the m - 1 other sites. The probability of observing overall i differences will depend upon whether we observe one or no difference at the mth site. With probability P1, one difference will be observed at the mth site and (i - 1) at the (m - 1) other sites, and with probability P2, all i differences will be observed among the (m - 1) other sites. Therefore,
![]() |
(5) |
where we admit that Hm-1(-1, j - l) = 0. Summing over all possible l values and multiplying by the probability that l mutations occur at the additional site, we finally have
![]() |
(6) |
where b(l, j, p) = (j!/l!(j - l)!) pl(1 - p)j-l is the binomial probability with parameter p = 1/m.
The mismatch distribution under a two-rates finite-sites model:
Mutation rate heterogeneity arises when the mutation rates are not equal for all nucleotide sites. The simplest form of heterogeneity to be considered is a two-rates mutation model, where we make the distinction between fast and slow sites. As most mutations accumulate at a small number of fast sites, convergent or reverse mutations can become quite common. The consequence of this type of heterogeneity on the pattern of diversity has been studied in the case of the control region of human mitochondrial DNA (![]()
![]()
![]()
![]()
![]()
![]()
![]() |
(7) |
where b(k, j, p) is the same binomial probability function as in Equation 6 but with parameter p = m1r/(m1r + m2) being the conditional probability that a mutation will hit one of the m1 fast mutating sites, and r is here the ratio of fast and slow mutation rates. Note that Equation 7 is the equivalent to YANG's (1996) Equation 39, derived for the case of a stationary population. Yang used an infinite-alleles mutation model, stipulating that once a site is hit by one or more mutations, we observe one difference. Due to the high transition bias, this model also tends to overestimate the number of differences because it does not allow back mutations at nonsegregating sites.
Multisite case, m-rates mutation model:
Suppose that we have a sequence of length m and that each nucleotide has a potentially different probability pi (i = 1 ... m) of being hit by a mutation, subject to the condition
mi pi = 1. Under this m-rates model, Hm(i, j), noted here HmR(i, j), can be also obtained by the recurrence relation given in (6) except that the parameter p is now equal to
pi. Here n is the index of the recursion step (1
n
m), and pn thus changes at every step of the recursion. It is important to note that, unlike the homogenous mutation rate case, the intermediate recurrence matrices are here meaningless, and only the last matrix obtained by this recurrence is correct for the required heterogeneity pattern. Under this m-rates mutation model, the expected mismatch distribution is not given in a more complex form than in the constant mutation rate case, as all the additional complexity of the model is embedded in the term HmR(i, j) to give
![]() |
(8) |
Note that the two-rates model is a special case of the present model and that it could be treated similarly. In that case, the double summation of Equation 7 could be condensed into the last term of Equation 8.
The mismatch distribution for gamma-distributed mutation rates:
A gamma distribution of mutation rates can be seen as a special case of the m-rates mutation model. Such a distribution has been hypothesized for explaining the pattern of diversity in the control region of the mitochondrial genome (e.g., ![]()
![]()
![]()
![]()

where
is the shape parameter of the gamma distribution equal to V
, the inverse of the square of the coefficient of variation of mutation rates. We can discretize the gamma distribution over the m nucleotides as follows. We draw an arbitrarily large number of continuous gamma-distributed variates (say 1 million) with mean and variance equal to the shape parameter
(![]()
i. The relative probability of being hit by a mutation pi is then obtained by setting pi =
i. Those pi's can then be used directly in recursion Equation 6 to get the probabilities HmR(i, j) required in Equation 8.
In the present article, we used the values of the shape parameters
computed by S. Meyer (![]()
![]()
= 0.26 for HV1 and
= 0.13 for HV2.
Estimation of past demographic parameters using a least-squares approach:
We estimated the demographic parameters
0,
1, and
from the mismatch distribution, using a nonlinear least-squares approach. We use the Hooke and Jeeves algorithm (![]()
![]() |
(9) |
Depending on which mutation model we consider, we replaced {Fiexp} by the quantities defined in Equation 3, Equation 7, or 8. The Hooke-Jeeves algorithm starts from an arbitrary initial set of parameters and converges by an iterative process to a local minimum. This minimization procedure was mainly chosen for its robustness and its ability to converge under nontrivial conditions.
Bootstrap confidence intervals:
We followed a parametric bootstrap approach to generate percentile confidence intervals around the estimated parameters
1,
0, and
(see, e.g., ![]()
![]()
1,
0, and
. For each of the B simulated data sets, we applied our estimation procedure according to Equation 3, Equation 7, or 8 to evaluate B bootstrapped values
0*,
1*, and
*. For a given confidence level
, the approximate limits of the confidence interval were obtained as the
/2 and 1 -
/2 percentile values (![]()
Testing the validity of the sudden expansion model:
We tested the hypothesis that the observed data fitted the sudden expansion model defined by the estimated parameters using the same parametric bootstrap approach as described above. We used here SSD defined in Equation 9 as a test statistic. We obtained its distribution under the hypothesis that the estimated parameters are the true ones by simulating B samples around the estimated parameters. As before, we reestimated each time new parameters
0*,
1*, and
* and computed their associated sums of squares SSDsim. The P value of the test is therefore approximated by

To check the accuracy of this procedure, we generated 1000 random data sets for the parameters
0 = 1,
1 = 1000, and
= 3 under a two-rates model with 270 slow sites and 30 fast sites mutating 20 times faster than the slow sites, which corresponds to the simulation conditions of the top of Table 1. The simulated distribution of the P values for these parameters was almost uniform between 0 and 1 (data not shown), suggesting that the SSD statistic provides a valid test of the sudden expansion model.
|
| RESULTS |
|---|
We show in Figure 2 the theoretical mismatch distributions and the demographic parameters estimated using different methods and mutation models for the two hypervariable segments of Mandenka mtDNA control region (![]()
![]()
, which shows larger values for finite-sites models than for the infinite-sites model. This is of course due to the fact that several mutations can accumulate at a given site in finite-sites models and that a longer evolutionary time is necessary to lead to the same number of observed differences. The magnitude of the expansion is also found to be smaller in finite-sites models, in agreement with previous simulation results (![]()
|
In Figure 3, we show the expected mismatch distributions fitted for the Turkana sample (![]()
|
To check if these confidence intervals have good coverage properties (i.e., the true parameters should be included in the confidence interval with a probability 1 -
), we performed a series of simulations for a set of predefined parameters. For a given set of parameters
0,
1, and
, we simulated 1000 data sets from which we estimated the parameters
0,
1, and
. For each set of estimated parameters, we simulated 1000 additional data sets from which new values
0*,
1*, and
* were estimated. The distribution of these 1000 bootstrap values was used to evaluate the lower and upper limits of a 100(1 -
)% confidence interval around the
0,
1, and
values as the
/2 and 1 -
/2 percentiles of the distribution, respectively. The results of these analyses are shown in Table 1 and Table 2 for different types of mutation rate heterogeneity. It can be seen that the only parameter for which the bootstrap confidence interval has a good coverage is
, as the proportion of the times the true value is outside the confidence interval is approximately equal to the significance level
. Note, however, that the confidence interval is not well centered, as the true values outside the confidence interval are always found on the left of the distribution. The bootstrap confidence intervals for
0 and
1 are much too broad (the true value of the parameter is found too often within the empirical confidence interval). In Figure 4, we plot the distributions of the two statistics x -
and
- x* for x =
and for x =
0. To generate true confidence intervals, the bootstrap percentile method requires that these two distributions be identical (see, e.g., ![]()
but not for
0 or for
1 (data not shown for
1). Moreover, the estimations of
and
0 appear much less biased than that of
1. Note that a possible explanation of the bias for
1 is mentioned in ![]()
1. Interestingly, the parameters of old expansions (
= 9) seem more precisely recovered than those of relatively recent expansions (
= 3; (Table 1 and Table 2). These results confirm several points. First, the time of a sudden expansion (
) can be adequately recovered from the data with approximately valid confidence intervals. Second, the estimate of the initial population size appears quite well recovered, but with an overly conservative confidence interval due to a too large upper bound (see Figure 4). Finally, both the estimate of the population size after the expansion and its confidence interval cannot be adequately recovered from the mismatch distribution. However, as the bias in
1 appears mainly due to an overly large upper bound, the lower bound for
1 could be useful even though still underestimated.
|
|
For illustration purposes, we present in Figure 5 the expected mismatch distributions of a few human samples analyzed for HV1 or HV2, as well as the limits of a 95% confidence interval around the mismatch distributions. Despite an obvious lack of goodness-of-fit for some distributions, the adequacy of the sudden expansion model could only be rejected for the Ngoebe HV2 sample (SSD P value = 0.007). For the other samples, random mismatch distributions generated by simulations lead to SSD values larger than the observation in >5% of the cases, making the observed mismatch distributions compatible with the estimated parameters.
|
| DISCUSSION |
|---|
In this study, we extend the model of ![]()
![]()
![]()
) shown in Figure 2 for the Mandenka population are found, respectively, 9 and 20% larger for HV1 and HV2 when using a model with gamma-distributed mutation rates than for the infinite-sites model. Even though our methodology appears computationally more intensive, it thus seems justified to take into account the known departures from the infinite sites model to estimate the parameters of the stepwise demographic model. The present approach does not allow us to retrieve all the parameters of a demographic expansion with the same efficiency. As shown in Table 1 and Table 2, the expansion time (
) and the initial population size (
0) are the only parameters that can be estimated without much bias and with reasonable precision, while the estimation of
1 is clearly biased upward. The confidence intervals obtained from the parametric bootstrap approach are fairly estimated only for the expansion time
, while those for the population sizes are clearly too large and thus overly conservative. This implies that the magnitude of the expansion cannot be precisely recovered by the present approach. This is understandable because once the expansion is sufficiently large, very few coalescent events (if any) will have occurred between the present time and the beginning of the expansion. As it is the accumulation of those coalescent events that can provide some information on the present population size, there will often be too few of them to get a reliable estimate of the present size, which will also tend to be overestimated. The present parametric bootstrap approach for defining the confidence intervals differs somewhat from that described in previous studies (![]()
![]()
0,
1, and
was declared compatible if the goodness-of-fit statistic fell within a 95% confidence interval obtained by simulation. While this approach seems valid, it requires much heavier computations than ours if one wants to adequately explore the space of possible parameters, as a series of simulations needs to be carried out for each set of parameters. Moreover, the potential impact of the chosen goodness-of-fit statistic on the results and the reliability of the confidence intervals has not been addressed. The fact that the effective population sizes are not well recovered from the mismatch distribution would suggest that this previous approach may suffer from the same problems as the simple parametric bootstrap procedure and thus also lead to overly large confidence intervals.
A recent study has shown that time-dependent demographic models (including the present stepwise expansion model) were unstable with respect to the estimation of the demographic parameters describing the population sizes (![]()
![]()
depends essentially on the mean of the mismatch distribution (![]()
0 and
1 depend on higher moments of the distribution, those latter two parameters are more likely to be affected by the stochasticity of the genealogical process than
. This is in keeping with our simulations, which show that the expansion time is usually quite well recovered from the mismatch distribution (Table 1 and Table 2).
Even though we have refined the mutation model for mtDNA sequences, one can see that the theoretical mismatch distributions do not always perfectly fit with the observed distributions (Figure 2, Figure 3, and Figure 5). We can see two reasons explaining this discrepancy.
First, the single stepwise expansion model may be inadequate for some populations. Alternative population expansion models such as exponential growth or logistic growth could be more realistic that the stepwise growth used in this study (![]()
![]()
![]()
![]()
![]()
Second, the probabilities derived in Equation 1 and Equation 2 and their derivatives apply to a pair of genes chosen at random from the population, while they are applied here to a random pair chosen from the sample. However, pairs drawn from the sample are not independent due to the shared portions of their gene genealogy. In populations having gone through a recent and large expansion, the internal branches are very short due to the star-like structure of the tree (![]()
![]()
Although mismatch distributions carry some information on the shape of the underlying gene genealogy and coalescent process, other aspects of molecular diversity are not explicitly taken into account by this approach. It has been shown that demographic parameters recovered from the mismatch distribution did not allow the correct prediction of the number of observed polymorphic sites (![]()
![]()
![]()
To get absolute values for the demographic parameters inferred using the present approach, one should get an estimation of the substitution rate at the nucleotide level. The real value of mutation rate in humans has recently been the subject of an intense debate between those advocating the use of a phylogenetic mutation rate (~3 x 10-6 substitutions per site per generation of 20 yr) calibrated by the divergence between humans and chimpanzees (![]()
![]()
![]()
![]()
Even if the present approach is an improvement over previous methods, it seems that the use of the mismatch distribution as a summary statistic may not exploit the full potential of molecular data and that maximum-likelihood methods that take into account phylogenetic relationships between DNA sequences (e.g., ![]()
![]()
![]()
![]()
![]()
| ACKNOWLEDGMENTS |
|---|
We are grateful to Ziheng Yang, Monty Slatkin, and two anonymous reviewers for their helpful comments on the manuscript and to Simon Tavaré for some useful advice. We thank Alan Rogers for providing us with C source code for generating gamma-distributed random variates. We are grateful to André Langaney for his support throughout this work. This study was made possible by Swiss National Fund grant nos. 32-047053.96 and 31-039847.93. A program for computing the estimated demographic parameters from the mismatch distribution and performing the tests described in this article is available from S.S. upon request. These programs are also integrated into the Arlequin software, available on http://anthropologie.unige.ch/arlequin/.
Manuscript received July 30, 1998; Accepted for publication March 19, 1999.
| LITERATURE CITED |
|---|
AHRENS, J. H. and U. DIETER, 1974 Computer methods for sampling from gamma, beta, Poisson, and binomial distributions. Computing 12:223-246.
ARIS-BROSOU, S. and L. EXCOFFIER, 1996 The impact of population expansion and mutation rate heterogeneity on DNA sequence polymorphism. Mol. Biol. Evol. 13:494-504[Abstract].
BERTORELLE, G. and M. SLATKIN, 1995 The number of segregating sites in expanding human populations, with implications for estimates of demographic parameters. Mol. Biol. Evol. 12:887-892[Abstract].
CORNUET, J. M. and G. LUIKART, 1996 Description and power analysis of two tests for detecting recent population bottlenecks from allele frequency data. Genetics 144:2001-2014[Abstract].
DONNELLY, P., and S. TAVARÉ, 1997 Progress in Population Genetics and Human Evolution. Springer-Verlag, New York.
EFRON, B., and R. J. TIBSHIRANI, 1993 An Introduction to the Bootstrap. Chapman and Hall, London.
FU, Y.-X., 1997 Statistical tests of neutrality of mutations against population growth, hitchhiking and background selection. Genetics 147:915-925[Abstract].
FU, Y. X. and W. H. LI, 1993 Statistical tests of neutrality of mutations. Genetics 133:693-709[Abstract].
GRAVEN, L., G. PASSARINO, O. SEMINO, P. BOURSOT, and A. S. SANTACHIARA-BENERECETTI et al., 1995 Evolutionary correlation between control region sequence and RFLP diversity pattern in the mitochondrial genome of a Senegalese sample. Mol. Biol. Evol. 12:334-345[Abstract].
GRIFFITHS, R. C. and S. TAVARÉ, 1994 Ancestral inference in population genetics. Stat. Sci. 9:307-319.
HANDT, O., S. MEYER, and A. VON HAESELER, 1998 Compilation of human mtDNA control region sequences. Nucleic Acids Res. 26:126-129
HARPENDING, H. C., M. A. BATZER, M. GURVEN, L. B. JORDE, and A. R. ROGERS et al., 1998 Genetic traces of ancient demography. Proc. Natl. Acad. Sci. USA 95:1961-1967
HASEGAWA, M., A. DIRIENZO, T. D. KOCHER, and A. C. WILSON, 1993 Toward a more accurate time scale for the human mitochondrial DNA tree. J. Mol. Evol. 37:347-354[Medline].
HOOKE, R. and T. A. JEEVES, 1963 "Direct search" solution of numerical and statistical problem. J. Assoc. Comput. Machinery 8:212-229.
HOWELL, N., I. KUBACHA, and D. A. MACKEY, 1996 How rapidly does the human mitochondrial genome evolve? Am. J. Hum. Genet. 59:501-509[Medline].
HUDSON, R. R., 1990 Gene genealogies and the coalescent process, pp. 144 in Oxford Surveys in Evolutionary Biology, edited by D. J. FUTUYMA and J. D. ANTONOVICS. Oxford University Press, New York.
JAZIN, E., H. SOODYALL, P. JALONEN, E. LINDHOLM, and M. STONEKING et al., 1998 Mitochondrial mutation rate revisited: hot spots and polymorphism. Nat. Genet. 18:109-110[Medline].
KIMURA, M., 1969 The number of heterozygous nucleotide sites maintained in a finite population due to the steady flux of mutations. Genetics 61:893-903
KIMURA, M., 1980 A simple method for estimating evolutionary rate of base substitution through comparative studies of nucleotide sequences. J. Mol. Evol. 16:111-120[Medline].
KINGMAN, J. F. C., 1982 The coalescent. Stochastic Process. Appl. 13:235-248.
KOCHER, T. D., and A. C. WILSON, 1991 Sequence evolution of mitochondrial DNA in humans and chimpanzees: control region and protein-coding region, pp. 391413 in Evolution of Life: Fossils, Molecules and Culture, edited by S. OSAWA and T. HONJO. Springer-Verlag, Tokyo.
KOLMAN, C. J., E. BERMINGHAM, R. COOKE, R. H. WARD, and T. D. ARIAS et al., 1995 Reduced mtDNA diversity in the Ngobe Amerinds of Panama. Genetics 140:275-283[Abstract].
KUHNER, M. K., J. YAMATO, and J. FELSENSTEIN, 1995 Estimating effective population size and mutation rate from sequence data using Metropolis-Hastings sampling. Genetics 140:1421-1430[Abstract].
KUHNER, M. K., J. YAMATO, and J. FELSENSTEIN, 1998 Maximum-likelihood estimation of population growth rates based on the coalescent. Genetics 149:429-434
LI, W. H., 1977 Distribution of nucleotide differences between two randomly chosen cistrons in a finite population. Genetics 85:331-337
LUNDSTROM, R., S. TAVARÉ, and R. H. WARD, 1992 Modeling the evolution of the human mitochondrial genome. Math. Biosci. 112:319-335[Medline].
MEYER, S., G. WEISS, and A. VON HAESELER, 1999 Pattern of nucleotide substitution and rate heterogeneity in the hypervariable regions I and II of human mtDNA. Genetics 152:1103-1110
PARSONS, T. J. and M. M. HOLLAND, 1998 Reply to Jazin et al. Nat. Genet. 18:110.
PARSONS, T. J., D. S. MUNIEC, K. SULLIVAN, N. WOODYATT, and R. ALLISTON-GREINER et al., 1997 A high observed substitution rate in the human mitochondrial DNA control region. Nat. Genet. 15:363-368[Medline].
PIERCY, R., K. M. SULLIVAN, N. BENSON, and P. GILL, 1995 The application of mitochondrial DNA typing to the study of white Caucasian genetic identification. Int. J. Legal. Med. 106:85-90.
POLANSKI, A., M. KIMMEL, and R. CHAKRABORTY, 1998 Application of a time-dependent coalescence process for inferring the history of population size changes from DNA sequence data. Proc. Natl. Acad. Sci. USA 95:5456-5461
RICE, J. A., 1995 Mathematical Statistics and Data Analysis. Duxburry Press, Belmont, CA.
ROGERS, A., 1992 Error introduced by the infinite-sites model. Mol. Biol. Evol. 9:1181-1184[Medline].
ROGERS, A., 1995 Genetic evidence for a Pleistocene population explosion. Evolution 49:608-615.
ROGERS, A. R., 1997 Population structure and modern humans origins, pp. 5579 in Progress in Population Genetics and Human Evolution, edited by P. DONNELLY and S. TAVARÉ. Springer-Verlag, New York.
ROGERS, A. R. and H. HARPENDING, 1992 Population growth makes waves in the distribution of pairwise genetic differences. Mol. Biol. Evol. 9:552-569[Abstract].
ROGERS, A. R., A. E. FRALEY, M. J. BAMSHAD, W. S. WATKINS, and L. B. JORDE, 1996 Mitochondrial mismatch analysis is insensitive to the mutational process. Mol. Biol. Evol. 13:895-902[Abstract].
SLATKIN, M., 1994 Linkage disequilibrium in growing and stable populations. Genetics 137:331-336[Abstract].
SLATKIN, M. and R. R. HUDSON, 1991 Pairwise comparisons of mitochondrial DNA sequences in stable and exponentially growing populations. Genetics 129:555-562[Abstract].
TAJIMA, F., 1989 The effect of change in population size on DNA polymorphism. Genetics 123:597-601
TAJIMA, F., 1993 Measurement of DNA polymorphism, pp. 3759 in Mechanisms of Molecular Evolution. Introduction to Molecular Paleopopulation Biology, edited by N. TAKAHATA and A. G. CLARK. Japan Scientific Societies Press, Sinauer Associates, Tokyo/Sunderland, MA.
TAJIMA, F., 1996 The amount of DNA polymorphism maintained in a finite population when the neutral mutation rate varies among sites. Genetics 143:1457-1465[Abstract].
TAVARÉ, S., D. BALDING, R. C. GRIFFITHS, and P. DONNELY, 1997 Inferring coalescence times from DNA sequence data. Genetics 145:505-518[Abstract].
VIGILANT, L., M. STONEKING, H. HARPENDING, K. HAWKES, and A. C. WILSON, 1991 African populations and the evolution of mitochondrial DNA. Science 253:1503-1507
WAKELEY, J., 1993 Substitution rate variation among sites in hypervariable region I of human mitochondrial DNA. J. Mol. Evol. 37:613-623[Medline].
WAKELEY, J. and J. HEY, 1997 Estimating ancestral population parameters. Genetics 145:847-855[Abstract].
WATSON, E., K. BAUER, R. AMAN, G. WEISS, and A. VON HAESELER et al., 1996 mtDNA sequence diversity in Africa. Am. J. Hum. Genet. 59:437-444[Medline].
WATTERSON, G., 1975 On the number of segregating sites in genetical models without recombination. Theor. Popul. Biol. 7:256-276[Medline].
WEISS, G. and A. VON HAESELER, 1998 Inference of population history using a likelihood approach. Genetics 149:1539-1546
YANG, Z., 1994 Maximum likelihood phylogenetic estimation from DNA sequences with variable rates over sites: approximate methods. J. Mol. Evol. 39:306-341[Medline].
YANG, Z., 1996 Statistical properties of a DNA sample under the finite-sites model. Genetics 144:1941-1950[Abstract].
YANG, Z., S. KUMAR, and M. NEI, 1995 A new method of inference of ancestral nucleotide and amino acid sequences. Genetics 141:1641-1650[Abstract].
This article has been cited by other articles:
![]() |
M. M. Pilkington, J. A. Wilder, F. L. Mendez, M. P. Cox, A. Woerner, T. Angui, S. Kingan, Z. Mobasher, C. Batini, G. Destro-Bisol, et al. Contrasting Signatures of Population Growth for Mitochondrial DNA and Y Chromosomes among Human Populations in Africa Mol. Biol. Evol., March 1, 2008; 25(3): 517 - 525. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Wirth, G. Morelli, B. Kusecek, A. van Belkum, C. van der Schee, A. Meyer, and M. Achtman The rise and spread of a new pathogen: Seroresistant Moraxella catarrhalis Genome Res., November 1, 2007; 17(11): 1647 - 1656. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Muir and D. Filatov A Selective Sweep in the Chloroplast DNA of Dioecious Silene (Section Elisanthe) Genetics, October 1, 2007; 177(2): 1239 - 1247. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. R. S. Meadows, I. Cemal, O. Karaca, E. Gootwine, and J. W. Kijas Five Ovine Mitochondrial Lineages Identified From Sheep Breeds of the Near East Genetics, March 1, 2007; 175(3): 1371 - 1379. [Abstract] [Full Text] [PDF] |
||||
![]() |

















