- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Hammer, M. F.
- Articles by Wilder, J. A.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Hammer, M. F.
- Articles by Wilder, J. A.
Human Population Structure and Its Effects on Sampling Y Chromosome Sequence Variation
Michael F. Hammera,b, Felisa Blackmera, Dan Garriganc, Michael W. Nachmanb, and Jason A. Wilderaa Genomic Analysis and Technology Core, Division of Biotechnology
b Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721
c Department of Biology, Arizona State University, Tempe, Arizona 85287
Corresponding author: Michael F. Hammer, Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721., mhammer{at}u.arizona.edu (E-mail)
Communicating editor: M. AGUADÉ
| ABSTRACT |
|---|
The excess of rare variants in global sequencing studies of the nonrecombining portion of the Y chromosome (NRY) has been interpreted as evidence for the effects of human demographic expansion. However, many NRY polymorphisms are geographically localized and the effect of different geographical sampling on patterns of NRY variation is unknown. We use two sampling designs to detect population structure and its effects on patterns of human NRY polymorphism. First, we sequence 26.5 kb of noncoding Y chromosome DNA from 92 globally distributed males representing 35 populations. We find that the number of polymorphisms with singleton variants is positively correlated with the number of populations sampled and that there is a significant negative correlation of Tajima's D (TD) and Fu and Li's D (FD) statistics with the number of pooled populations. We then sequence the same region in a total of 73 males sampled from 3 distinct populations and find that TD and FD values for the 3 pooled and individual population samples were much less negative than those in the aforementioned global sample. Coalescent simulations show that a simple splitting model of population structure, with no changes in population size, is sufficient to produce the negative values of TD seen in our pooled samples. These empirical and simulation results suggest that observed levels of NRY population structure may lead to an upward bias in the number of singleton variants in global surveys and call into question inferences of population expansion based on global sampling strategies.
PATTERNS of genetic variation within and among human populations contain information about the origin and demographic history of our species. The bulk of the evidence from nuclear and mitochondrial DNA studies has been claimed to support a recent African origin of anatomically modern humans (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Two standard test statistics, Tajima's D (TD; ![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
One of the problems associated with interpreting these patterns of human DNA variability concerns the sampling strategies employed by different investigators. Current sampling schemes vary between two extremes: global sampling, in which a small number of individuals from many different populations are used in sequencing surveys, and population-based sampling, in which larger numbers of individuals from fewer populations are sequenced. It is important to ask to what degree different sampling designs lead to increased variance in observed patterns among loci. Recently, ![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Our purpose here is to examine sequence variation on the NRY using both global and population-based sampling designs to assess the effects of population structure and/or population growth on patterns of NRY diversity. First, we sequence 26.5 kb of noncoding DNA from the Y chromosomes of 92 globally distributed males, and then we sequence the same region from a sample of 73 males drawn from three distinct populations: the Khoisan from Africa (n = 25), the Khalkhs from Mongolia (n = 24), and Papua New Guinean highlanders (n = 24). The results are consistent with substantial levels of population structure and suggest that global sampling designs may bias NRY sequence surveys toward an excess of polymorphisms with singleton variants.
| SUBJECTS AND METHODS |
|---|
Subjects:
DNA sequences were screened for polymorphism in two sampling panels. Panel A included a sample of 92 human males, including 28 from Africa (10 Khoisan, 6 East Bantu, 5 West Bantu, 3 Mbuti, 2 Biaka, 1 Ethiopian, and 1 Egyptian), 13 from the Americas (3 Southwest Amerinds, 3 Surui, 2 Karatiana, 2 Mayans, 1 Navajo, 1 Porch Creek, and 1 Tohono O'Odham), 20 from Europe/Middle East (4 Russians, 5 Ashkenazi Jews, 4 Adygeans, 4 Germans, 1 Turk, 1 Englishman, and 1 Yemenite Jew), 23 from Asia (5 Yakuts, 4 Japanese, 5 Han Chinese, 3 Pakistanis, 1 Khmer Cambodian, 1 Buryat, 1 Forest Nentsi, 1 Khant, 1 Selkup, and 1 Sinhalese), and 8 from Oceania (4 Papua New Guineans, 2 Australian Aboriginal People, and 2 Nasioi) (Fig 1). Seventy-two of these samples were from the Y Chromosome Consortium (YCC) cell line repository (![]()
![]()
![]()
![]()
![]()
|
PCR amplification and mutation detection:
Panels A and B were screened for NRY polymorphism using two different approaches. Denaturing high-performance liquid chromatography (DHPLC) was used to screen for polymorphisms in the following regions on the NRY: (1) 11.3 kb of the arylsulfatase D pseudogene (ARSDP; GenBank accession no.
AC002992), (2) two Y
5 Alu elements within the 16E4 and 486,O,2 clones (GenBank accession nos.
AC003094 and
AC002531, respectively), and (3) four noncoding regions originally identified in anonymous clones used as probes to detect restriction fragment length polymorphism variation on the NRY of humans and great apes (![]()
10.6 kb of anonymous clone region sequence was analyzed. Sequence information from these regions was used to design primers to amplify shorter fragments for DHPLC analysis. Internal primers (available from authors upon request) were used to generate overlapping products for DHPLC analysis and for automated DNA sequencing.
All PCR products producing chromatograms with profiles differing from those of the homoduplex controls were subjected to DNA sequencing. DNA sequencing was performed by standard procedures to identify mutations that altered mobility in DHPLC chromatograms. We were extremely conservative in choosing products for DNA sequencing so as to identify and confirm all possible polymorphisms. The entire 11.3-kb ARSDP and 10.6-kb anonymous clone regions were sequenced from several individuals to assess the error rate in DHPLC. No additional polymorphisms were discovered by sequencing that were not found by DHPLC. The following regions were amplified and subjected directly to DNA sequencing: (1) 2.7 kb of the YAP region (![]()
![]()
Data analysis:
Nucleotide diversity,
(![]()
(![]()
![]()
and
estimate the neutral parameter 2Neµ for the NRY, where Ne is the male effective population size and µ is the neutral mutation rate per nucleotide site. Tajima's D (![]()
![]()
![]()
![]()
Computer simulations:
To distinguish the expected contributions of population structure, growth, and sampling design on patterns of NRY variation, we simulated samples according to three different models of the neutral coalescent process. To begin, we examined the effect on TD of the sample size taken from an exponentially growing panmictic population, for sample sizes up to 600 chromosomes. Second, we examined the effect on TD of the number of demes sampled from a finite island model of population structure (![]()
For each model of population structure, the number of sampled demes varied between 5 and 35. No unsampled demes were included in the model. For each of these sampling schemes, an additional level of sampling effect was examined, which includes (1) sampling only 2 haploid individuals from each deme and (2) sampling 20 haploid individuals per deme. A two-phase model of population growth was implemented by assuming that the onset of growth occurred 103 generations in the past (tg = 103). Before time tg, the population is assumed to be stationary in size, at N = 103 (N is the effective number of haploid individuals). Then, at time tg, the population grows exponentially to N = 105 in the current generation. Gene genealogies were constructed according to the coalescent probabilities given by ![]()
For the island model of population structure, both strong (Nm = 1.0, m is the rate of migration per deme per generation) and weak (Nm = 10-3) migration were considered. In this implementation of the island model, m is held constant each generation and migration occurs symmetrically between all demes (![]()
![]()
generations in the past. The prior probability for this time was characterized by a gamma distribution with E(
) = 5 x 103 generations ago. The prior distribution of subsequent population splitting times is jointly uniform over the interval (0,
). Additionally, each deme was assumed to constitute an equal proportion of the total population size and these proportions were described by a Dirichlet prior distribution. Each simulation bout consisted of 1000 replicates.
| RESULTS |
|---|
Nucleotide diversity in global samples:
A total of 26.5 kb of NRY DNA was screened for polymorphism in a sample of 92 globally distributed Y chromosomes. The screened regions11.3 kb of the ARSDP, 2.7 kb of the YAP region (![]()
![]()
5 Alu elements, and 10.5 kb of anonymous DNA (![]()
![]()
Nucleotide diversity (
± SD) values varied slightly across the five noncoding regions (ARSDP, 0.009 ± 0.001%; YAP, 0.018 ± 0.004%; SRY, 0.015 ± 0.005%; anonymous DNA, 0.010 ± 0.001%), with the highest
value observed for the Y
5 Alu elements (0.088 ± 0.009%). Nucleotide diversity for the entire 26.5-kb region was 0.014 ± 0.001%, a value very similar to estimates based on other global NRY polymorphism surveys (Table 1). This suggests that the methods used to detect NRY polymorphisms (DHPLC and direct DNA sequencing of PCR products) are comparable. For example, if DHPLC were actually less efficient at detecting polymorphism, we would expect studies based on this method to yield a lower proportion of singletons. In fact, the studies that used DNA sequencing did not find a higher proportion of polymorphisms with singleton variants than the studies based on DHPLC. This supports earlier studies demonstrating the high sensitivity and low error rate of DHPLC (![]()
|
Levels of nucleotide variability were generally higher in African than in non-African populations (Table 2), consistent with other studies of NRY SNP variation (![]()
![]()
![]()
![]()
![]()
values, Africans were about two to four times more diverse than non-Africans. This discrepancy was not as apparent when considering
: non-Africans as a whole had higher
values than Africans; however, no single non-African continental region had a higher
than Africans (Table 2).
|
Patterns of nucleotide diversity in global samples:
The 26.5-kb noncoding region examined here exhibited statistically significant negative TD and FD values (P < 0.05) for the sample of 92 Y chromosomes (Table 1), reflecting a more than twofold excess of singletons over neutral expectations (Fig 2A). A similar observation was made in previous studies of NRY variation using a global sampling strategy (![]()
![]()
![]()
![]()
|
To further address the relationship between the frequency distribution and the number of populations surveyed, we measured variation within each continental region separately for the 92 Y chromosomes in this survey (Table 2), as well as in the published survey of ![]()
![]()
0.01), while the correlations for the other two regions (DFFRY and UTY) were marginally statistically significant (e.g., P = 0.05 and P = 0.06, respectively; data not shown). When all five regions were combined (Fig 4), the correlation between FD and number of populations sampled was highly statistically significant (r = -0.756, r2 = 0.577, P < 0.0001). A nonparametric Spearman rank correlation test also found a statistically significant monotonic decrease between FD and the number of populations sampled (r = -0.8005, T = -7.56, P < 0.0001). Despite these clear patterns, it is important to point out that not all the data points in Fig 3 and Fig 4 are independent since some of the small samples are subsets of the larger samples.
|
|
Patterns of nucleotide diversity in population samples:
To discern the effects of sampling on patterns of human NRY nucleotide diversity, we sequenced the same 26.5-kb noncoding region on the Y chromosomes of 73 males representing three distinct populations: the Khoisan of Namibia (n = 25), Papua New Guinean highlanders (n = 24), and Mongolian Khalks (n = 24). In the total sample, we identified 30 SNPs and 3 indels (Fig 5). The nucleotide diversity in this sample (
= 0.014 ± 0.001%) was almost identical to the
for the global survey of 92 chromosomes. However, singletons (s = 10) made up a smaller proportion (33.3%) of the segregating sites within populations than within the global sample (s = 24 or 47.1%). This was reflected in less negative and statistically nonsignificant TD and FD values in the combined sample of 73 chromosomes (P > 0.10; Table 3). The three populations each exhibited different frequency distributions (Fig 2, BD). The Khoisan had no singletons and several intermediate-frequency sites, while 7 of 11 of the Mongolian polymorphisms were singletons and only 2 were at intermediate frequency. The Papua New Guinean (PNG) frequency distribution was characterized by a lower percentage of singletons (4 of 9 polymorphisms), 2 intermediate-frequency sites, and 1 high-frequency polymorphism. All TD values were slightly or moderately negative, ranging from -0.007 (P > 0.10) in the Khoisan to -0.701 (P > 0.10) in the Mongolians, but none was statistically significant (Table 3). In contrast, the FD value for the Khoisan was positive and statistically significant (FD = 1.51, P < 0.02), signifying a deficiency of singletons.
|
|
Population structure:
In the population-based sample, only 4 of the 23 nonsingleton polymorphisms and a single NRY haplogroup were shared among the populations (Fig 5). Two polymorphisms were shared among all three populations, 1 SNP was shared between PNG and Mongolians, and 1 indel (YAP) was shared between the Khoisan and Mongolians. While this low number of shared polymorphisms/haplogroups relative to exclusive polymorphisms is suggestive of NRY population structure, two additional analyses supported the hypothesis of strong population structure. AMOVA revealed that 43% of the total variance was partitioned among populations (
ST = 0.43; P < 0.00001) and population differentiation tests showed that all populations were significantly differentiated (P < 0.00001; data not shown). The
ST value for the 92 global samples grouped by continent was lower (
ST = 0.32), but still statistically significant (P < 0.00001).
Computer simulations:
Coalescent simulation of samples drawn from a panmictic population experiencing exponential growth showed a dependence of TD on sample size (Fig 6). When the onset of growth occurred at tg = 103 generations in the past, TD was less negative compared with tg = 5 x 103 generations ago. The rate of decrease of TD is initially high, and then TD begins to asymptote for sample sizes >100 chromosomes. When samples are divided into varying numbers of demes, the results systematically differ between the two models of population structure. Pooling samples generated by coalescent simulation of the island model of population structure did not produce the effect of an increasingly negative TD, either for 2 sampled chromosomes per deme (Fig 7A and Fig B) or for 20 chromosomes per deme (Fig 7C and Fig D). When migration in the island model is strong (Nm = 1.0), TD never significantly deviates from zero, even when the population experiences growth. Weak migration (Nm = 10-3) in the island model produced simulated samples with positive TD values for each sampling protocol examined, thus countering the tendency of population growth to make TD negative. Although TD decreased as a function of the number of demes sampled when migration is weak (Fig 7C and Fig D), the values never became negative.
|
|
The population bifurcation model did produce TD values that became increasingly negative as samples were pooled from an increasing number of demes. When only two samples per deme were simulated, the mean TD was almost always negative and it became significantly negative when the number of demes pooled was >15 (Fig 8A). The only effect of population growth in this case was to decrease the values of TD slightly. When 20 chromosomes per deme were sampled, TD still became more negative as a greater number of demes were pooled (Fig 8B). When the total population size remained constant, TD never significantly differed from zero. However, when growth occurred, the values of TD from a sample of 20 chromosomes per deme were nearly identical to the values when 2 chromosomes per deme were sampled.
|
| DISCUSSION |
|---|
This study was motivated, in part, by the observation that patterns of NRY sequence variation differ according to sampling strategy. For example, early studies of Y chromosome polymorphism, based on sample sizes ranging from 5 (![]()
![]()
![]()
![]()
Under the standard neutral model, the average value of TD or FD does not depend on sample size and has an expectation of approximately zero (![]()
![]()
![]()
![]()
![]()
![]()
![]()
Population structure:
We posit that a violation of the assumption of random mating could give rise to the observed pattern on the Y chromosome, if a small number of individuals (e.g., 13) are sampled from many locally differentiated populations. Recently, ![]()
![]()
![]()
![]()
ST values. The results of analyzing subsets (i.e., by continent) of global samples [using both our present data and those of ![]()
To further explore the effects of sampling on the frequency distribution, we sequenced the same 26.5-kb noncoding region in a population-based sample of 24 or 25 chromosomes each from three human populations. We found more positive overall TD and FD values for the 73 sampled chromosomes from three populations than for the globally sampled set of 92 Y chromosomes (Table 3). The effect of accumulating singletons in the globally based sample of 92 Y chromosomes was shown by the stronger negative correlation using FD vs. TD as the number of subpopulations sampled increased. This difference is due to the stronger sensitivity of FD than of TD to singletons (![]()
Computer simulations were also employed simply to establish that some models of population structure could lead to the observed relationship between TD and the number of populations sampled. We simulated samples under both the finite island model of population structure and a model of population bifurcation with no migration between demes. The island model of population structure, with weak migration, primarily produced samples with positive values of TD. ![]()
![]()
The population bifurcation model of structure did produce samples with TD values that are compatible with our NRY data. Simulations of the global sampling strategy under this model of population structure yielded a negative correlation between TD and the number of pooled demes, whether or not growth was implemented. Analytical work examining the predictions of this type of population structure model is scant compared with the island model [although see ![]()
![]()
Population growth:
Undoubtedly, human populations have experienced both population growth and population structure at some time in the past. The question is to what extent either or both of these aspects of population history left a signature on patterns of variation. Under a growth model there is a dependence of TD and FD on sample size (Fig 6; ![]()
|
As previously mentioned, simulation of the population bifurcation model shows that population growth is not necessary to create a negative correlation between TD and the number of pooled demes. Indeed, the addition of growth into the two models of population structure that we tested did not greatly influence the results of our simulations. In the island model, growth has only a negligible influence on the simulated values of TD, as can be seen by comparing Fig 7A and Fig C, with Fig 7B and Fig D, respectively. Likewise, in the population bifurcation model of structure, when only two chromosomes per deme are sampled, the influence of growth on TD is weak (Fig 8A). However, when 20 chromosomes per deme are sampled, the effect of growth is more pronounced, making TD more negative (Fig 8B). This suggests that if one wishes to distinguish the effects of population subdivision from population growth in a global sample, one must sample thoroughly within demes to obtain a robust estimate of the frequency distribution of mutations.
Implications for NRY studies:
The observation of an excess of rare variants (i.e., over those expected under a neutral, equilibrium model) in global NRY data sets has played a key role in supporting the hypothesis of a human Pleistocene population explosion (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
It should be pointed out that the differences in frequency spectra patterns observed among the populations sampled here may be exacerbated by local population growth, decline, or selection. For example, the Khoisan are thought to have experienced a population contraction over the past several thousand years as a result of encroachment by expanding Bantu-speaking populations (![]()
![]()
![]()
![]()
Furthermore, the scale of the observed pattern of NRY structure is not clear. Global surveys of NRY SNP variation show statistically significant structure both among continents and among populations within continents, as well as isolation by distance at some regional scales (![]()
Finally, the results presented here suggest that estimates of the time of onset of population growth and the time to the most recent common ancestor (TMRCA) that are based on global sampling strategies and the assumption of panmixia should be considered with caution (![]()
![]()
![]()
Conclusions:
While the NRY is more susceptible to genetic drift and the effects of social processes (e.g., patrilocality, polygyny, and/or kin-structured migration) that tend to increase the proportion of among-group variation, there is accumulating evidence of population structure affecting other regions of the genome. In the largest survey of sequence variation from a single panel of humans performed to date, ![]()
![]()
![]()
![]()
![]()
| FOOTNOTES |
|---|
This article is dedicated to the memory of David C. Rowe. ![]()
| ACKNOWLEDGMENTS |
|---|
We thank Ian J. Wilson for sharing computer code to help perform the Bayesian coalescent simulations of the population bifurcation model, Amit Indap for writing Perl scripts, and Himla Soodyall and Trefor Jenkins for providing Khoisan DNA samples. We also thank two anonymous reviewers for helpful suggestions. Publication of this article was made possible by grant GM-53566 from the National Institute of General Medical Sciences (to M.F.H.). Its contents are solely the responsibility of the authors and do not necessarily represent the official views of the NIH.
Manuscript received October 21, 2002; Accepted for publication April 10, 2003.
| LITERATURE CITED |
|---|
ALLEN, B. S. and H. OSTRER, 1994 Conservation of human Y chromosome sequences among male great apes: implications for the evolution of Y chromosomes. J. Mol. Evol. 39:13-21.[Medline]
ALONSO, S. and J. A. ARMOUR, 2001 A highly variable segment of human subterminal 16p reveals a history of population growth for modern humans outstide Africa. Proc. Natl. Acad. Sci. USA 98:864-869.
BATZER, M. A., M. STONEKING, M. ALEGRIA-HARTMAN, H. BAZAN, and D. H. KASS et al., 1994 African origin of human-specific polymorphic Alu insertions. Proc. Natl. Acad. Sci. USA 91:12288-12292.
BRAVERMAN, J. M., R. R. HUDSON, N. L. KAPLAN, C. H. LANGLEY, and W. STEPHAN, 1995 The hitchhiking effect on the site frequency spectrum of DNA polymorphisms. Genetics 140:783-796.[Abstract]
CANN, R. L., M. STONEKING, and A. C. WILSON, 1987 Mitochondrial DNA and human evolution. Nature 325:31-36.
CARGILL, M., D. ALTSHULER, J. IRELAND, P. SKLAR, and K. ARDLIE et al., 1999 Characterization of single-nucleotide polymorphisms in coding regions of human genes. Nat. Genet. 22:231-238.[Medline]
CAVALLI-SFORZA, L. L., P. MENOZZI and A. PIAZZA, 1994 The History and Geography of Human Genes. Princeton University Press, Princeton, NJ.
DIRIENZO, A. and A. C. WILSON, 1991 Branching pattern in the evolutionary tree for human mitochondrial DNA. Proc. Natl. Acad. Sci. USA 88:1597-1601.
EXCOFFIER, L. and S. SCHNEIDER, 1999 Why hunter-gatherer populations do not show signs of pleistocene demographic expansions. Proc. Natl. Acad. Sci. USA 96:10597-10602.
EXCOFFIER, L., B. PELLEGRINI, A. SANCHEZ-MAZAS, C. SIMON, and A. LANGANEY, 1987 Genetics and history of sub-Saharan Africa. Yearb. Phys. Anthropol. 30:151-194.
FAY, J. and C.-I WU, 1999 A human population bottleneck can account for the discordance between patterns of mitochondrial vs. nuclear DNA variation. Mol. Biol. Evol. 16:1003-1005.[Medline]
FRISSE, L., R. R. HUDSON, A. BARTOSZEWICZ, J. D. WALL, and J. DONFACK et al., 2001 Gene conversion and different population histories may explain the contrast between polymorphism and linkage disequilibrium levels. Am. J. Hum. Genet. 69:831-843.[Medline]
FU, Y. X., 1996 New statistical tests of neutrality for DNA samples from a population. Genetics 143:557-570.[Abstract]
FU, Y. X. and W.-H. LI, 1993 Statistical tests of neutrality of mutations. Genetics 133:693-709.[Abstract]
HALUSHKA, M. K., J. B. FAN, K. BENTLEY, L. HSIE, and N. SHEN et al., 1999 Patterns of single-nucleotide polymorphisms in candidate genes for blood-pressure homeostasis. Nat. Genet. 22:239-247.[Medline]
HAMMER, M. F., 1994 A recent insertion of an Alu element on the Y chromosome is a useful marker for human population studies. Mol. Biol. Evol. 11:749-761.[Abstract]
HAMMER, M. F., 1995 A recent common ancestry for human Y chromosomes. Nature 378:376-378.[Medline]
HAMMER, M. F., A. B. SPURDLE, T. KARAFET, M. R. BONNER, and E. T. WOOD et al., 1997 The geographic distribution of human Y chromosome variation. Genetics 145:787-805.[Abstract]
HAMMER, M. F., T. KARAFET, A. RASANAYAGAM, E. T. WOOD, and T. K. ALTHEIDE et al., 1998 Out of Africa and back again: nested cladistic analysis of human Y chromosome variation. Mol. Biol. Evol. 15:427-441.[Abstract]
HAMMER, M. F., T. M. KARAFET, A. J. REDD, H. JARJANAZI, and S. SANTACHIARA-BENERECETTI et al., 2001 Hierarchical patterns of global human Y-chromosome diversity. Mol. Biol. Evol. 18:1189-1203.
HARDING, R. M., S. M. FULLERTON, R. C. GRIFFITHS, J. BOND, and M. J. COX et al., 1997 Archaic African and Asian lineages in the genetic ancestry of modern humans. Am. J. Hum. Genet. 60:772-789.[Medline]
HARPENDING, H. and A. ROGERS, 2000 Genetic perspectives on human origins and differentiation. Annu. Rev. Genomics Hum. Genet. 1:361-385.[Medline]
HARPENDING, H. C., 1994 Signature of ancient population growth in a low-resolution mitochondrial DNA mismatch distribution. Hum. Biol. 66:591-600.[Medline]
HARPENDING, H. C., M. A. BATZER, M. GURVEN, L. B. JORDE, and A. R. ROGERS et al., 1998 Genetic traces of ancient demography. Proc. Natl. Acad. Sci. USA 95:1961-1967.
HARRIS, E. E. and J. HEY, 1999 X chromosome evidence for ancient human histories. Proc. Natl. Acad. Sci. USA 96:3320-3324.
HEY, J., 1997 Mitochondrial and nuclear genes present conflicting portraits of human origins. Mol. Biol. Evol. 14:166-172.[Abstract]
HUDSON, R., 1990 Gene genealogies and the coalescent process. Oxf. Surv. Evol. Biol. 7:1-44.
A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms. (2001) Nature 409:928-933.[Medline]
JOBLING, M. A. and C. TYLER-SMITH, 1995 Fathers and sons: the Y chromosome and human evolution. Trends Genet. 11:449-456.[Medline]
KAESSMANN, H., F. HEISSIG, A. VON HAESELER, and S. PAABO, 1999 DNA sequence variation in a non-coding region of low recombination on the human X chromosome. Nat. Genet. 22:78-81.[Medline]
KODA, Y., H. TACHIDA, H. PANG, Y. H. LIU, and M. SOEJIMA et al., 2001 Contrasting patterns of polymorphisms at the ABO-secretor gene (FUT2) and plasma
(1,3)fucosyltransferase gene (FUT6) in human populations. Genetics 158:747-756.
MARTH, G., G. SCHULER, R. YEH, R. DAVENPORT, and R. AGARWALA et al., 2003 Sequence variations in the public human genome data reflect a bottlenecked population history. Proc. Natl. Acad. Sci. USA 100:376-381.
MARTINEZ-ARIAS, R., F. CALAFELL, E. MATEU, D. COMAS, and A. ANDRES et al., 2001 Sequence variability of a human pseudogene. Genome Res. 11:1071-1085.
MARUYAMA, T., 1970 Effective number of alleles in a subdivided population. Theor. Popul. Biol. 1:273-306.[Medline]
NACHMAN, M. W., 2001 Single nucleotide polymorphisms and recombination rate in humans. Trends Genet. 17:481-485.[Medline]
NACHMAN, M. W. and S. L. CROWELL, 2000 Contrasting evolutionary histories of two introns of the duchenne muscular dystrophy gene, Dmd, in humans. Genetics 155:1855-1864.
NEI, M. and W.-H. LI, 1979 Mathematical model for studying genetic variation in terms of restriction endonucleases. Proc. Natl. Acad. Sci. USA 76:5269-5273.
O'DONOVAN, M. C., P. J. OEFNER, S. C. ROBERTS, J. AUSTIN, and B. HOOGENDOORN et al., 1998 Blind analysis of denaturing high-performance







