- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Pluzhnikov, A.
- Articles by Hudson, R. R.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Pluzhnikov, A.
- Articles by Hudson, R. R.
Inferences About Human Demography Based on Multilocus Analyses of Noncoding Sequences
Anna Pluzhnikova, Anna Di Rienzoa, and Richard R. Hudsonba Department of Human Genetics, University of Chicago, Chicago, Illinois 60637
b Department of Ecology and Evolution, University of Chicago, Chicago, Illinois 60637
Corresponding author: Richard R. Hudson, 1101 E. 57th St., University of Chicago, Chicago, IL 60637., rr-hudson{at}uchicago.edu (E-mail)
Communicating editor: N. TAKAHATA
| ABSTRACT |
|---|
Data from 10 unlinked autosomal noncoding regions, resequenced in 15 individuals from each of three populations, were used in a multilocus analysis to test models of human demography. Each of the 10 regions consisted of
2500 bp. The multilocus analysis, based on summary statistics (average and variance of Tajima's D and Fu and Li's D*), was used to test a family of models with recent population expansion. The African sample (Hausa of Cameroon) is compatible with a constant population size model and a range of models with recent expansion. For this population sample, we estimated confidence sets that showed the limited range of parameter values compatible with growth. For an exponential growth rate as low as 1 x 10-3/generation, population growth is unlikely to have started prior to 50,000 years ago. For higher growth rates, the onset of growth must be more recent. On the basis of the average value of Tajima's D, our sample from an Italian population was found to be incompatible with a constant population size model or any simple expansion model. In the Chinese sample, the variance of Tajima's D was too large to be compatible with the constant population size model or any simple expansion model.
ELUCIDATING the history of the human population size is an important part of reconstructing human evolution and understanding patterns of human variation. Changes in population size are thought to mark important events in the history of a species, e.g., geographic range expansions, development of technological innovations, and climatic changes. In addition, the estimation of the time since the most recent common ancestor (TMRCA), which has important implications for human evolution, relies critically on assumptions about human demography (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Similarly, studies of nuclear sequence variation lead to somewhat contrasting conclusions. In general, nuclear loci show more ancient coalescence times compared to mtDNA (as might be expected on the basis of the different effective population sizes), but no evidence for a star-shaped genealogy. The latter observation suggests that rapid growth from a small initial size is not compatible with the data and that, if ancient population growth occurred, it started from a population of nontrivial size. Unfortunately, studies of nuclear sequence variation also vary greatly with regard to the scheme for sampling populations (from population-based to grid sampling), the type of genomic regions studied (from coding to noncoding), and the method of variation detection (![]()
![]()
More recently, 10 noncoding regions were surveyed in three population samples to characterize the decay of linkage disequilibrium and obtain population-based estimates of the crossing-over and gene conversion rates (![]()
![]()
Here, we reanalyze the noncoding sequence data in ![]()
![]()
![]()
![]()
In agreement with ![]()
50,000 years ago. For higherpossibly more realisticgrowth rates, the onset of growth must be more recent. Both non-African samples are incompatible with the constant population size model or any version of our growth model.
| MATERIALS AND METHODS |
|---|
Data collection:
A new scheme for data collection was developed to survey simultaneously and efficiently sequence variation and linkage disequilibrium (LD). This consisted of resequencing two segments of
1 kb separated by
8 kb in all individuals from three population samples. Each of these two-segment units is referred to as a "locus pair." The data set analyzed here consists of 10 such locus pairs that are unlinked to each other. The genomic regions were chosen according to a fixed set of criteria. These criteria were determined by the need to pool data from different locus pairs in the analysis and, thus, to select locus pairs with similar recombination and mutation rates. In addition, because the main goal of this analysis is to reconstruct demographic histories, it was necessary to reduce the probability that the surveyed genomic regions were affected by natural selection. This was achieved by choosing regions that do not contain or flank known or strongly predicted coding regions (the minimum distance between the regions surveyed and the closest known or strongly predicted gene was >25 kb). The details of the procedure for selecting genomic regions that fulfilled these criteria are described in ![]()
|
Demographic history models in the coalescent framework:
The basic model of population demography is the Wright-Fisher model, which assumes a panmictic population with nonoverlapping generations. We assume the diploid population size in the distant past was constant at size NA. At a time, tonset generations in the past, the population began exponentially growing until the present. Thus, measuring time in units of generations before present we assume the population size, N(t), is
![]() |
(1) |
(see ![]()
![]()
, is denoted N0. The primary goal of this article is to determine the values of
and tonset for which the model is compatible with the data. We assume a generation time of 20 years. Only positive values of
are considered.
Coalescent simulations with recombination were used to generate samples under this model. These simulations used standard methodology (![]()
![]()
Parameter values:
Since the goal of this study is making inferences about various demographic scenarios, the parameters not directly associated with demography, such as the mutation rate, µ, and the recombination rate, c, can be thought of as nuisance parameters. Elimination of these nuisance parameters is easily achieved by adopting the Bayesian approach, namely by viewing them as random quantities and subsequently integrating them out (![]()
and
of these distributions correspond to genome-wide estimates for these parameters, namely an average mutation rate of 2 x 10-8/site/generation and a recombination rate between adjacent base pairs of 1 x 10-8/generation. Hence, we set
and
. The central 90% intervals for these distributions are (0.36 x 10-8, 4.74 x 10-8) and (0.18 x 10-8, 2.37 x 10-8) for µ and c, respectively. Recent findings point to 10- to 1000-fold variability in recombination rate over 12 kb in the MHC region (![]()
Estimates of the neutral mutation rate are based on observed levels of sequence divergence from a great ape outgroup from a number of surveys (![]()
![]()
![]()
![]()
![]()
We also treat NA, the ancestral effective population size (which is identical to N0 in the constant population size model), as a random variable independent of all other parameters. In particular, we let NA be randomly distributed as Gamma(4, ßA). We note that, with the other parameters fixed (
, tonset, and ßµ), the value of NA determines the mean number of polymorphic sites in samples. To incorporate prior information about observed levels of polymorphism in earlier studies, we chose the value of ßA (= ENA/4) so that the expected number of segregating sites per kilobase in a sample of 30 chromosomes is 4. This choice for ßA is based on the following observations. In a large number of studies, Watterson's estimate of
(= 4Neµ) is on average
0.001 (somewhat larger in African populations and somewhat smaller in non-African populations). This is also the average estimated value of
in the 10 locus pairs analyzed here. In a sample of 30 chromosomes, this value of
leads to an expected number of polymorphic sites of 4/kb under the neutral constant population size model. Thus, for the constant population size model we chose ßA so that
. For models with population growth, we also set the value of ßA so that the expected number of polymorphic sites is 4/kb. In this case a simple formula is not available but the appropriate value of ßA or ENA can be obtained numerically for any specified value of tonset and
, as shown in the Appendix
To complete the model specification under a growth scenario, the remaining two parameters, tonset and
, are allowed to vary over a grid of fixed values tonset = 1K, 2K, ... , 8K generations,
= 0.5 x 10-3, 1 x 10-3, ... , 10 x 10-3. Note that some combinations of tonset,
, and ENA would be omitted from consideration since to attain the specified mean number of polymorphic sites the corresponding values of EN0 would have greatly exceeded the current size of the entire world population. Smaller values of
were not considered because they would result in models virtually indistinguishable from the equilibrium model.
Simulation procedure:
The polymorphism data are simulated using methods of ![]()
![]()
(including tonset = 0.0 and
= 0.0 corresponding to the constant population size), and a fixed sample size of n chromosomes, we generate independent random realizations following these steps:
- Step 1. Simulate the parameter NA as described above, and compute the current effective population size N0.
- Step 2. Simulate the parameters µ and c, and compute the scaled mutation rate
and recombination rate
, where L denotes the length of the sequence. - Step 3. Simulate the genealogical history with recombination events as described by
HUDSON 1983 .
- Step 4. Simulate mutations on the genealogy assuming an infinite sites model and rates obtained in step 2.
For our preliminary investigations of the distribution of the statistics, we considered simulated samples of sequences with L = 10,000 bp, but only the mutations that fell in the two 1-kb flanking segments were considered. The polymorphic sites in the middle 8000 bp were ignored to match the structure of the locus pair data. This is referred to as simulations of the simplified data. For testing the models, a similar scheme was used except that the sequence length and distances between the sequenced segments were adjusted to match exactly the data. This is referred to as simulations of the real data. To generate samples of several genetically independent regions, for a single realization of NA steps 24 were repeated for all unlinked loci in question, keeping the value of NA the same while allowing other parameters to vary randomly from locus to locus. This effectively accounts for the mutation and recombination rate heterogeneity between loci. In addition, all realizations without polymorphic sites were discarded.
Summary statistics and hypotheses testing:
Methods for using full data likelihoods are not available or feasible for the models tested here with recombination. Hence, we investigated the power of each of the following summary statistics at single-locus pairs to detect recent population growth: the mean pairwise nucleotide differences,
, the sample standard deviation of the pairwise nucleotide difference,

(![]()
, Tajima's D statistic (![]()
![]()
![]()
![]()
![]()
In a preliminary investigation, 100,000 independent samples of locus pair sequences were generated using the parameters and procedures described above (simulations of the simplified data). The behavior of the summary statistics under different demographic scenarios was compared to choose the most informative one for detecting recent population expansion.
First, we obtained empirical distributions for the statistics under the null and alternative hypotheses and observed that, for a fixed
, they form a stochastically monotone family of distributions decreasing as tonset increased, i.e., went farther away into the past, until
20K generations, after which the direction of the change reversed (data not shown). We are interested in testing the hypothesis of a relatively recent population expansion (tonset < 5K generations); thus we limited our investigation to the time interval where monotonicity applies. Fig 1 illustrates this observation for two such families of empirical distributionsthose of Tajima's D and Fu and Li's D* statistics. Note that the monotonicity of these families of distributions implies monotonicity of power functions (![]()
|
For all test statistics, we obtained empirical cutoff points for the average value of the statistic over 10 locus pairs corresponding to the 5% significance level. Likewise, the critical values of the variance of Tajima's D over 10 locus pairs were estimated. The power of each test was assessed as a function of the parameter tonset for a range of fixed values of growth rate
. The empirical density functions can also be used to obtain P values for the experimental data.
The power of a test based on a summary statistic was estimated by simulating 100,000 realizations from an alternative model in question and counting the number of times the null hypothesis was rejected. The corresponding empirical power functions are shown in Fig 2. The plots clearly indicate that tests based on Tajima's D and Fu and Li's D* statistics are by far more powerful than all other tests considered. These two statistics were used for testing the growth models. In addition, we carried out a test of the equilibrium model on the basis of the sample variance of Tajima's D; the critical values for this statistic were estimated on the basis of the same set of simulations described above. The use of this test was motivated by previous results of a similar test, which suggested a significantly large variance of Tajima's D in the Chinese sample. However, the test carried out here properly takes into account the structure of the data and incorporates the effect of recombination.
|
For each value of
and tonset, a multilocus P value for the data was estimated as twice (i.e., two-tailed test) the proportion of computer-generated samples with a more extreme average value of the test statistics than observed. The set of values of
and tonset for which the P value was greater than a specified value constituted our estimated confidence set. Only values of
and tonset such that N0 is <4 x 109 are considered. All multilocus P values were estimated on the basis of simulations of the real data. Note, however, that the power functions were calculated for a one-tailed test.
| RESULTS |
|---|
The results of testing the constant population size model are shown in Table 2. Based on the average value of Tajima's D and Fu and Li's D*, the Hausa sample is compatible with the constant population size model. In addition, it is compatible with a set of models with recent population growth. The confidence regions for the parameters (
and tonset) defining the growth model are shown in Fig 3. These are the set of parameter values for which the estimated P value is greater than the specified values 0.1, 0.05, 0.02, and 0.01. Such confidence sets show that, for an exponential growth rate as low as 1 x 10-3/generation, the growth phase is unlikely to have started earlier than
50,000 years ago.
|
|
As shown in Fig 3, the parameters of the growth model are interdependent: high growth rates are compatible with the data only for small tonset, and models with large tonset are accepted only for small growth rates. It should be noted that the expectation of NA is varied across the confidence region plot with varying values of
and tonset in such a way that the expected number of polymorphic sites is 4/kb (see MATERIALS AND METHODS). This variation of ENA across combinations of
and tonset values is exemplified in Table 3 for six points (labeled AF) on the boundary of the 95% confidence set. These values range from 7800 to 10,300, depending on the growth rate and the test statistic applied. For any
value, ENA decreases with decreasing tonset. Hence, for all points in the 95% confidence set with
> 1 x 10-3, ENA must be >7800. As evident in the plot, the boundaries of the confidence sets sharply increase as
approaches zero. This implies that although a more ancient onset of growth is compatible with the data, the growth rate must be so small as to be essentially indistinguishable from the constant population size model. The dashed line in Fig 3 corresponds to points with ENA = 10,000, a value that is often reported in human variation studies. This widely reported effective population size estimate is based on the implicit assumption of an equilibrium population and, as such, is not equivalent to an estimate of the ancestral population size in a model that incorporates growth.
|
In contrast, the Italian data show large positive average values of Tajima's D and Fu and Li's D* across loci. The simulations showed that the observed average value of Tajima's D is too large to be compatible with the constant population size model. As expected, recent population growth shifts the distribution of these statistics toward smaller values (Fig 1). Thus, it follows that the Italian sample is also incompatible with the family of growth models tested here (for any positive growth rate). Likewise, the variance of Tajima's D across loci in the Chinese is significantly too large compared to the expectations for the constant population size model. Since we showed that the distribution of the variance of Tajima's D decreases monotonically with increasing time of onset of growth (Fig 1), these results allow us to rule out both the equilibrium model and the family of growth models tested here. These findings are consistent with the results in ![]()
| DISCUSSION |
|---|
Our multilocus analysis of noncoding sequences showed that the Hausa sample is compatible with a constant population size model as well as a model with recent population growth if the growth parameter and the time of initiation of growth are in a constrained range as shown in Fig 3. This figure shows a 95% confidence region based on Tajima's D and Fu and Li's D*. For an exponential growth rate of 10-3/generation, the earliest onset compatible with the observed Fu and Li's D* in the Hausa sample is
50,000 years ago. It should be noted that this growth rate is rather small, resulting in population size increasing only by a factor of 12 over 50,000 years. If the growth parameter is larger, the onset of growth must be more recent than 50,000 years ago.
Conversely, the equilibrium model or models with population growth from a population of nontrivial size are not compatible with the observed average value and variance of Tajima's D in the Italian and Chinese samples, respectively. More complex demographic models with population bottlenecks and/or with some degree of geographic substructure in the past may account for the non-African data.
These conclusions are consistent with the results of ![]()
![]()
![]()
![]()
![]()
Despite many attempts to infer the history of population size in humans, a coherent picture has yet to emerge. A synthesis of the available evidence is complicated by the heterogeneity of data used, including unlinked autosomal loci as well as nonrecombining uniparentally inherited loci such as those in the mtDNA genome and in the nonrecombining portion of the Y chromosome. A further level of heterogeneity results from the analysis of loci experiencing different mutation processes, namely nucleotide substitution and insertion/deletion (i.e., microsatellites). Finally, the methods of analysis, the specific models tested, and the populations sampled vary greatly across studies.
Most human population samples show patterns of mtDNA variation consistent with rapid population growth. These data were used to estimate the time of onset of growth to an interval that largely overlaps with our estimates for the Hausa sample (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Some interesting parallels exist between the mtDNA and the Y chromosome findings. Like mtDNA, Y chromosome loci show patterns consistent with rapid growth in most human populations (![]()
![]()
![]()
![]()
![]()
![]()
![]()
Overall, the data from uniparentally inherited nonrecombining loci differ markedly from our results in two main respects: the smaller-than-expected ancestral population size and the signal of growth in non-African samples. While the latter discrepancy might be reconciled by more complex demographic models, including a population size reduction before expansion (![]()
![]()
![]()
Although a number of autosomal microsatellite data sets agree in showing evidence for some population growth, many aspects of the results are incongruous, thus hindering any comparison to the locus pair data. Under the assumption of a more general stepwise mutation model, ![]()
![]()
![]()
Probably due to the similar type of data and demographic models tested, our results are most consistent with those of ![]()
A recent survey of sequence variation in 313 human genes showed a marked skew toward negative Tajima's D values (![]()
Our data and methods of analyses have several advantages over those of earlier studies. The use of single-nucleotide substitution rather than microsatellite data implies better estimates of the mutation rate at each locus and hence more reliable estimates of population parameters. Furthermore, avoiding coding regions reduces the probability that patterns of variation were shaped by natural selection rather than demography. The availability of sequence data from several independent loci in exactly the same population samples also eliminates the possibility that the observed interlocus variability is due to the different histories of the populations surveyed at different loci. Since evolutionary processes are highly stochastic, demographic inferences must of necessity rely on the analysis of many independent loci. Unless natural selection is thought to act on a specific subset of the loci, any demographic model should account for the data at all loci. Thus, a simultaneous analysis of multiple independent loci will lead to better estimates and more powerful tests. Accordingly, our multilocus analysis led to narrower confidence intervals and more easily interpretable results compared to those in ![]()
![]()
![]()
![]()
![]()
It should be noted that, although we have rejected simple growth models for the non-African samples, the data may be consistent with other scenarios that include a growth phase as part of a more complex model. In this regard, it is interesting to note that a recent analysis of ascertained single nucleotide polymorphisms in humans supported a model that included growth in effective population size in the context of a subdivided population. However, when population subdivision was removed from the model, a simple equilibrium model could not be rejected (![]()
| ACKNOWLEDGMENTS |
|---|
We thank M. Przeworski, J. Pritchard, P. Donnelly, and S. Zoellner for comments on the manuscript. This work was supported by a National Institutes of Health grant (HG02098) to A.D.
Manuscript received December 27, 2001; Accepted for publication March 19, 2002.
| APPENDIX |
|---|
Here, we compute the expected value of NA conditional on the expected number of polymorphisms Sn in the sample. We begin by deriving the expression for the expected number ESn of polymorphic sites in a sample of haploid size n as a function of NA and the parameters of the evolution model. First, assume that NA and the mutation rate µ are constants rather than random variables. This assumption is not required for the recombination rate since the only property it affects is the variance of Sn.
In the infinite sites model of mutation under standard coalescent theory, Sn is identical to the number of mutations on a coalescent tree since the most recent common ancestor of the sample and is given by
![]() |
(A1) |
where
is the scaled mutation rate per sequence of length l, and E
n is the expected length of the ancestral tree (i.e., the total length of all branches). This relationship holds for all models of demographic history of the population, in particular, for the exponential growth model in which
.
To find the expression for E
n in terms of NA and other parameters, we note that
n can be partitioned as
![]() |
(A2) |
where
cn and
gn are the parts of the ancestral tree corresponding to the constant and growth phase, respectively. We observe that
![]() |
(A3) |
where An(t) is the number of ancestors of the sample at time t in the past. For a sample from an exponentially growing panmictic population,
![]() |
(A4) |
and
![]() |
(A5) |
(![]()
, and
. Hence, similarly to ![]()
![]() |
(A6) |
where E1(·) is the exponential integral (![]()
By the Markov property of the ancestral process An(t),
![]() |
(A7) |
where Agn(t) and Acn(t) refer to the ancestral process during the growth and constant periods, respectively, and the probability distribution of Agn(tonset) (![]()
![]() |
(A8) |
The factor of 1/G in expression (A7) is due to different timescales for Agn(t) and Acn(t).
For relatively small sample sizes (n < 25), expressions (A6) and (A7) can be easily evaluated by means of, for instance, the Numerical Recipes software package (![]()
, and tonset. For larger n, numerical evaluation of (A7) becomes increasingly unstable due to a so-called "catastrophic cancellation" (![]()
![]() |
(A9) |
which gives rise to (A8). A Fortran 95 program that implements both of these approaches is available from the authors.
Finally, solving Equation A1 numerically with respect to NA yields the desired result. Note that for a random, rather than fixed, NA this gives only an approximation for ENA; however, the approximation is sufficiently accurate due to the smoothness of ESn as a function of NA.
| LITERATURE CITED |
|---|
ABRAMOVITZ, M., and I. A. STEGUN, 1964 Handbook of Mathematical Functions. Dover, New York.
BROOKFIELD, J. F., 1997 Importance of ancestral DNA ages. Nature 388:134.[Medline]
CASELLA, G., and R. L. BERGER, 1990 Statistical Inference. Warsworth & Brooks/Cole, Pacific Grove, CA.
CHEN, F. C. and W. H. LI, 2001 Genomic divergences between humans and other hominoids and the effective population size of the common ancestor of humans and chimpanzees. Am. J. Hum. Genet. 68:444-456.[Medline]
DI RIENZO, A. and A. C. WILSON, 1991 Branching pattern in the evolutionary tree for human mitochondrial DNA. Proc. Natl. Acad. Sci. USA 88:1597-1601.
DI RIENZO, A., P. DONNELLY, C. TOOMAJIAN, B. SISK, and A. HILL et al., 1998 Heterogeneity of microsatellite mutations within and between loci, and implications for human demographic histories. Genetics 148:1269-1284.
FAY, J. C. and C.-I WU, 1999 A human population bottleneck can account for the discordance between patterns of mitochondrial versus nuclear DNA variation. Mol. Biol. Evol. 16:1003-1005.[Medline]
FAY, J. C. and C.-I WU, 2000 Hitchhiking under positive Darwinian selection. Genetics 155:1405-1413.
FORSYTHE, G. E., M. A. MALKOLM and C. B. MOLER, 1977 Computer Methods for Mathematical Computations. Prentice-Hall, Englewood Cliffs, NJ.
FRISSE, L., R. R. HUDSON, A. BARTOSZEWICZ, J. D. WALL, and J. DONFACK et al., 2001 Gene conversion and different population histories may explain the contrast between polymorphism and linkage disequilibrium levels. Am. J. Hum. Genet. 69:831-843.[Medline]
FU, Y.-X. and W.-H. LI, 1993 Statistical tests of neutrality of mutations. Genetics 133:693-709.[Abstract]
GONSER, R., P. DONNELLY, G. NICHOLSON, and A. DI RIENZO, 2000 Microsatellite mutations and inferences about human demography. Genetics 154:1793-1807.
GRIFFITHS, R. C. and S. TAVARÉ, 1998 The age of a mutation in a general coalescent tree. Commun. Stat. Stoch. Models 14:273-295.
HAMBLIN, M. T., E. E. THOMPSON, and A. DI RIENZO, 2002 Complex signatures of natural selection at the Duffy blood group locus. Am. J. Hum. Genet. 70:369-373.[Medline]
HUDSON, R. R., 1983 Properties of a neutral allele model with intragenic recombination. Theor. Popul. Biol. 23:183-201.[Medline]
HUDSON, R. R., 1987 Estimating the recombination parameter of a finite population model without selection. Genet. Res. 50:245-250.[Medline]
HUDSON, R. R., 1990 Gene genealogies and the coalescent process. Oxf. Surv. Evol. Biol. 7:1-44.
INGMAN, M., H. KAESSMANN, S. PAABO, and U. GYLLENSTEN, 2000 Mitochondrial genome variation and the origin of modern humans. Nature 408:708-713.[Medline]
JEFFREYS, A. J., L. KAUPPI, and R. NEUMANN, 2001 Intensely punctate meiotic recombination in the class II region of the major histocompatibility complex. Nat. Genet. 29:217-222.[Medline]
KIMMEL, M., R. CHAKRABORTY, J. P. KING, M. BAMSHAD, and W. S. WATKINS et al., 1998 Signatures of population expansion in microsatellite repeat data. Genetics 148:1921-1930.
MARJORAM, P. and P. DONNELLY, 1994 Pairwise comparisons of mitochondrial DNA sequences in subdivided populations and implications for early human evolution. Genetics 136:673-683.[Abstract]
NACHMAN, M. W. and S. L. CROWELL, 2000 Estimate of the mutation rate per nucleotide in humans. Genetics 156:297-304.
PRESS, W. H., S. A. TEUKOLSKY, W. T. VETTERLING and B. P. FLANNERY, 1996 Numerical Recipes in Fortran 90: The Art of Parallel Scientific Computing. Cambridge University Press, Cambridge, UK.
PRITCHARD, J. K., M. T. SEIELSTAD, A. PEREZ-LEZAUN, and M. W. FELDMAN, 1999 Population growth of human Y chromosomes: a study of Y chromosome microsatellites. Mol. Biol. Evol. 16:1791-1798.[Abstract]
PRZEWORSKI, M., 2002 The signature of natural selection at randomly chosen loci. Genetics 160:1179-1189.
PRZEWORSKI, M., R. R. HUDSON, and A. DI RIENZO, 2000 Adjusting the focus on human variation. Trends Genet. 16:296-302.[Medline]
REICH, D. and D. GOLDSTEIN, 1998 Genetic evidence for a Paleolithic human population expansion in Africa. Proc. Natl. Acad. Sci. USA 95:8119-8123.
REICH, D. E. and E. S. LANDER, 2001 On the allelic spectrum of human disease. Trends Genet. 17:502-510.[Medline]
ROGERS, A. R. and H. HARPENDING, 1992 Population growth makes waves in the distribution of pairwise genetic differences. Mol. Biol. Evol. 9:552-569.[Abstract]
SEVERINI, T. A., 1999 On the relationship between Bayesian and non-Bayesian elimination of nuisance parameters. Stat. Sinica 9:713-724.
SHERRY, S. T., A. R. ROGERS, H. HARPENDING, H. SOODYALL, and T. JENKINS et al., 1994 Mismatch distributions of mtDNA reveal recent human population expansions. Hum. Biol. 66:761-775.[Medline]
SIMONSEN, K. L., G. A. CHURCHILL, and C. F. AQUADRO, 1995 Properties of statistical tests of neutrality for DNA polymorphism data. Genetics 141:413-429.[Abstract]
SLATKIN, M. and R. R. HUDSON, 1991 Pairwise comparisons of mitochondrial DNA sequences in stable and exponentially growing populations. Genetics 129:555-562.[Abstract]
STEPHENS, J. C., J. A. SCHNEIDER, D. A. TANGUAY, J. CHOI, and T. ACHARYA et al., 2001 Haplotype variation and linkage disequilibrium in 313 human genes. Science 293:489-493.
STEPHENS, M. and P. DONNELLY, 2000 Inference in molecular population genetics. J. R. Stat. Soc. B 62:605-635.
TAJIMA, F., 1989 Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 123:585-595.
THOMSON, R., J. K. PRITCHARD, P. SHEN, P. J. OEFNER, and M. W. FELDMAN, 2000 Recent common ancestry of human Y chromosomes: evidence from DNA sequence data. Proc. Natl. Acad. Sci. USA 97:7360-7365.
WAKELEY, J., R. NIELSEN, S. N. LIU-CORDERO, and K. ARDLIE, 2001 The discovery of single-nucleotide polymorphismsand inferences about human demographic history. Am. J. Hum. Genet. 69:1332-1347.[Medline]
WALL, J. D. and M. PRZEWORSKI, 2000 When did the human population size start increasing? Genetics 155:1865-1874.
WATSON, E., K. BAUER, R. AMAN, G. WEISS, and A. VON HAESELER et al., 1996 mtDNA sequence diversity in Africa. Am. J. Hum. Genet. 59:437-444.[Medline]
WEISS, G. and A. VON HAESELER, 1998 Inference of population history using a likelihood approach. Genetics 149:1539-1546.
YU, A., C. ZHAO, Y. FAN, W. JANG, and A. J. MUNGALL et al., 2001 Comparison of human genetic and sequence-based physical maps. Nature 409:951-953.[Medline]
ZHIVOTOVSKY, L. A., L. BENNETT, A. M. BOWCOCK, and M. W. FELDMAN, 2000 Human population expansion and microsatellite variation. Mol. Biol. Evol. 17:757-767.
This article has been cited by other articles:
![]() |
N. J. R. Fagundes, N. Ray, M. Beaumont, S. Neuenschwander, F. M. Salzano, S. L. Bonatto, and L. Excoffier Statistical evaluation of alternative models of human evolution PNAS, November 6, 2007; 104(45): 17614 - 17619. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Pyhajarvi, M. R. Garcia-Gil, T. Knurr, M. Mikkonen, W. Wachowiak, and O. Savolainen Demographic History Has Influenced Nucleotide Diversity in European Pinus sylvestris Populations Genetics, November 1, 2007; 177(3): 1713 - 1724. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. J. Clark, S. E. Ptak, I. Tiemann, Y. Qian, G. Coop, A. C. Stone, M. Przeworski, N. Arnheim, and A. D. Rienzo Combining Sperm Typing and Linkage Disequilibrium Analyses Reveals Differences in Selective Pressures or Recombination Rates Across Human Populations Genetics, February 1, 2007; 175(2): 795 - 804. [Abstract] [Full Text] [PDF] |
||||
![]() |
O Thalmann, A Fischer, F Lankester, S Paabo, and L Vigilant The Complex Evolutionary History of Gorillas: Insights from Genomic Data Mol. Biol. Evol., January 1, 2007; 24(1): 146 - 158. [Abstract] [Full Text] [PDF] |
||||
![]() |
|


and (B) 2.0 x 10-3.












