| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Corresponding author: Michael W. Nachman, Biosciences West Bldg., University of Arizona, Tucson, AZ 85721., nachman{at}u.arizona.edu (E-mail)
Communicating editor: M. A. F. NOOR
| ABSTRACT |
|---|
The centromeric region of the X chromosome in humans experiences low rates of recombination over a considerable physical distance. In such a region, the effects of selection may extend to linked sites that are far away. To investigate the effects of this recombinational environment on patterns of nucleotide variability, we sequenced 4581 bp at Msn and 4697 bp at Alas2, two genes situated on either side of the X chromosome centromere, in a worldwide sample of 41 men, as well as in one common chimpanzee and one orangutan. To investigate patterns of linkage disequilibrium (LD) across the centromere, we also genotyped several informative sites from each gene in 120 men from sub-Saharan Africa. By studying X-linked loci in males, we were able to recover haplotypes and study long-range patterns of LD directly. Overall patterns of variability were remarkably similar at these two loci. Both loci exhibited (i) very low levels of nucleotide diversity (among the lowest seen in the human genome); (ii) a strong skew in the distribution of allele frequencies, with an excess of both very-low and very-high-frequency derived alleles in non-African populations; (iii) much less variation in the non-African than in the African samples; (iv) very high levels of population differentiation; and (v) complete LD among all sites within loci. We also observed significant LD between Msn and Alas2 in Africa, despite the fact that they are separated by
10 Mb. These observations are difficult to reconcile with a simple demographic model but may be consistent with positive and/or purifying selection acting on loci within this large region of low recombination.
THE amount and distribution of genetic variation in human populations is a central issue in population genetics. With the completion of the human genome sequence (![]()
![]()
Considerable work over the past decade has documented DNA sequence variation in humans. Early studies focused primarily on mitochondrial DNA (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
= 0.1%), and subsequent work has largely confirmed this result (![]()
= 0.036%; ![]()
= 0.46%; ![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
These and other patterns of DNA sequence variation are context dependent in at least two important ways. First, the distribution of genetic variation is a property of populations and, as such, is expected to vary among populations with different histories. For example, ![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
A second way in which context is important is in the genomic position of genes. Different regions of the genome differ in many important attributes, including gene density, local rate of recombination, mutation rate, and base composition. With the human genome sequence in hand, we can now begin to quantify some of these parameters more precisely and ask how they influence patterns of genetic variation. For example, nucleotide heterozygosity is positively correlated with recombination rate and negatively correlated with gene density (![]()
![]()
![]()
![]()
10 times higher than the average as a consequence of deamination of 5-methylcytosine (![]()
![]()
![]()
![]()
![]()
![]()
![]()
To understand the determinants of nucleotide variation in humans, we have initiated a long-term study of DNA sequence polymorphism in different regions of the human genome in a common set of samples (![]()
![]()
![]()
|
| SUBJECTS AND METHODS |
|---|
Samples:
Forty-one men were chosen for the initial sequencing of Msn and Alas2 (see below), including 10 from Africa, 10 from Europe, 11 from Asia (including 1 from Melanesia), and 10 from the Americas. Human genomic DNAs were isolated from lymphoblastoid cell lines established by the Y ![]()
750 bp from each gene to capture several informative sites in 110 men from sub-Saharan Africa (29 South African Bantu speakers, 1 Biaka, 13 Cameroonians, 21 Gambians, 32 Khoisan, 1 Mbuti, and 13 Tanzanians). All sampling protocols were according to procedures approved by the New York Blood Center and University of Arizona Human Subjects Committees. A single male common chimpanzee (Pan troglodytes) and a single male orangutan (Pongo pygmaeus) were also surveyed from DNAs provided by O. A. Ryder.
PCR amplification and sequencing of Msn and Alas2:
A map of the centromeric region of the human X chromosome is shown in Fig 1. Msn (moesin, membrane-organizing extension spike protein) and Alas2 (aminolevulinate, delta-, synthase 2) are separated by
10 Mb of DNA in the assembled sequence of the human genome (![]()
![]()
![]()
![]()
Data analysis:
Sequences were aligned by eye, and the numbers and frequencies of all polymorphisms were counted. Two measures of nucleotide variability were calculated:
(![]()
(![]()
, is based on the average number of nucleotide differences between two sequences randomly drawn from a sample, and
is based on the proportion of segregating sites in a sample. Under neutral equilibrium conditions, both
and
estimate the parameter 3Neµ for X-linked loci, where Ne is the effective population size and µ is the neutral mutation rate. Departures from a neutral steady-state frequency distribution of polymorphisms were evaluated using three approaches (![]()
![]()
![]()
) with the proportion of polymorphic sites (
) in a sample; FU and LI's (1993) test is based on the number of singletons in a sample; and FAY and WU's (2000) test is based on the number of high-frequency-derived polymorphic nucleotides in a sample. Both TAJIMA's (1989) and FU and LI's (1993) tests may reject the null model because of selection or because of demographic processes (such as a population bottleneck); however, FAY and WU's (2000) test is unlikely to reject the null model except in cases where selection is operating (but see also ![]()
![]()
![]()
![]()
![]()
![]()
| RESULTS |
|---|
Levels of polymorphism and divergence:
A total of
9.6 kb was sequenced from Msn and Alas2 in a sample of 41 globally dispersed humans (Fig 1). Because all of the sequence from Msn is from introns, we have excluded the short exon sequences from Alas2 in all the analyses that follow. This increases the likelihood that all comparisons are among genomic regions experiencing similar levels of selective constraint. Thus, most analyses and discussion refer only to the 9281 bp of intron sequences (Table 1). Polymorphic sites for Msn and Alas2 introns are shown in Table 2. Numbers of segregating sites, nucleotide diversity, measures of the distribution of allele frequencies, and levels of divergence are summarized in Table 1 for the complete data set of 41 individuals. Nine segregating sites were observed in Msn, while 7 segregating sites and two single-base insertion-deletion polymorphisms were observed in Alas2. Msn also had a variable poly(A) tract ranging in length from 17 to 25 bp. Nucleotide diversity was low at both Msn (
= 0.00035) and Alas2 (
= 0.00015), as was Watterson's
(0.00046 and 0.00035, respectively).
|
|
Divergence between humans and chimpanzee for both Msn (0.0092) and Alas2 (0.0055) was comparable to results from previous studies of X-linked introns (average divergence for seven loci = 0.0072; ![]()
2 x 108/site/generation; ![]()
We compared levels of polymorphism and divergence at Msn and Alas2 to levels of polymorphism and divergence at two other X-linked loci using the HKA test (![]()
![]()
![]()
![]()
|
Frequency distribution of polymorphisms:
The frequency distribution of all polymorphisms is plotted in Fig 2. The ancestral state of each polymorphic site was inferred by comparison with the chimpanzee and orangutan sequences, and the frequency of the derived state is shown for each polymorphism. The distribution is characterized by a large number of both low-frequency- and high-frequency-derived polymorphisms. We compared the observed distribution with the distribution expected under the standard neutral model using Tajima's D, Fu and Li's D, and Fay and Wu's H tests (Table 1 and Table 4). In the total sample, all three of these tests take on negative values for Msn alone, Alas2 alone, and for the combined data (Msn + Alas2), and most of these values are either significant or marginally significant (Table 1). When different geographic regions are considered separately, all three test statistics are
0 in Africa, consistent with a neutral model, but are strongly negative in non-African populations (Table 4). Thus, much of the deviation observed in the total sample appears to be due to deviations largely in the non-African populations. The direction of the deviation is consistent with positive selection, a population expansion, background selection (depending on the strength of selection), and/or some form of population structure (see DISCUSSION).
|
|
Linkage disequilibrium:
Complete linkage disequilibrium was observed among all sites within Msn and among all sites within Alas2. For example, when all pairwise comparisons were made among nonsingleton segregating sites, none of the comparisons between pairs of sites in Msn (N = 10) or Alas2 (N = 5) contained all four gametic types (i.e., D' = 1 in all cases). LD was also observed between Msn and Alas2; none of the 20 comparisons between pairs of sites across Msn and Alas2 contained all four gametic types. We tested the significance of LD by comparing pairs of sites in order along the chromosome; this provides a set of statistically independent comparisons for tests of significance (![]()
We were surprised to find LD between Msn and Alas2 in Africa because these genes are separated by
10 Mb. For example, in a study of 19 genomic regions using an African sample from Nigeria, ![]()
750 bp from each gene. Table 5 shows all of the polymorphisms among these 120 Africans (the original 10 plus 110 new individuals). Three observations are noteworthy. First, D' = 1 between sites within each gene in the total sample of 120 Africans. Second, in comparisons between Msn and Alas2, we observed all four gametic types at appreciable frequencies, suggesting that the absence of some haplotypes in the smaller set of 10 individuals (Table 2) was simply a consequence of the small sample size. Third, despite the presence of these new haplotypes in the larger sample, we observed significant LD between sites at Msn and sites at Alas2 (Msn 1046Alas2 3203, FET P = 0.01, D' = 0.58; Msn 1046Alas2 3416, FET P = 0.004, D' = 0.86). The difference in D' between our sample of 10 (D' = 1) and our sample of 120 (D' = 0.58) in comparisons between Msn 1046 and Alas2 3203 highlights the importance of using large samples to make inferences concerning LD.
|
The LD in this data set can also be seen in the phylogenetic analysis. Using parsimony, the 18 human polymorphisms in Table 2 were mapped onto a single shortest tree of length 19 (Msn site 245 includes two mutations resulting in three segregating nucleotides; Fig 3A). In this tree, there are two equally parsimonious placements of the root. A haplotype network of the 120 Africans based on subregions of Msn and Alas2 is shown in Fig 3B. The reticulation in Fig 3B is consistent with the recombinant haplotypes in Table 5. In this network, there is a single most parsimonious placement of the root between haplotypes G and H. Two alternative evolutionary hypotheses for the evolution of the six African haplogroups (CH) are shown in Fig 4; both hypotheses involve four mutations and one recombination event.
|
|
Geographic variation:
The geographic distribution of nucleotide variation at Msn and Alas2 is shown in Table 4. For both genes, nucleotide diversity is substantially lower in the non-African than in the African samples. The distribution of haplotypes in the combined Msn and Alas2 data (N = 41, Table 2 and Fig 3A) is quite different in the African and non-African samples. In Africa, haplotypes are present in only one or two individuals; in the non-African sample, a single very common haplotype (A1) is shared among 25 of 31 individuals (Fig 3A). This haplotype, represented by the consensus sequence in Table 2, is present in each of the non-African continents surveyed. The tree in Fig 3A is largely split into an African clade and a non-African clade. One exception to this pattern is a single Native American [Y Chromosome Consortium (YCC) 27] with a Msn haplotype otherwise found only in Africa. This same individual also contained a polymorphism at DmdI7 otherwise found only in Africa (![]()
![]()
The differentiation between Africans and non-Africans was reflected in patterns of within- and between-group variation in an AMOVA. When the four population samples (i.e., from each continent) were clustered in a single group,
ST was 0.45 (Table 6). When populations were divided into Africans and non-Africans, the
ST value increased to 0.63. Interestingly, 100% of this between-group variation was partitioned between Africans and non-Africans (e.g.,
CT = 0.65 and
SC = 0.06; Table 6). The difference between Africans and non-Africans at Msn + Alas2 is greater than the level of differentiation observed for other loci on the X or Y chromosomes sampled in these same individuals (Table 6).
|
The network in Fig 3A contains three major haplogroups (CE) in sub-Saharan Africa. Haplogroup C was found in a single Pygmy, haplogroup D (D1D5) was found in South African Bantu speakers and Pygmies (as well as in the Poarch Creek; see above), and haplogroup E (E1E3) was found in Khoisan and in Pygmies. Thus, Pygmies exhibit the highest level of diversity in this small sample of sub-Saharan Africans.
Comparisons with the chimpanzee sequence place the root of the tree in Fig 3A either on the branch between haplogroups C and D or on the branch between haplogroups C and E; these alternative roots are equally parsimonious in our sample of 41 individuals. The larger sample of 120 African individuals places the root between haplogroups G and H (Fig 3B). In both cases, African samples occur on both sides of the deepest node in the tree. Thus, Africa is the most likely location of the ancestral Msn-Alas2 sequence. We used two approaches to estimate the time to the most recent common ancestral Msn-Alas2 sequence. First, we calculated the average number of differences between the root of the tree in Fig 3A and each haplotype and multiplied this by two. This reflects the average distance between haplotypes through the root of the tree. We assumed a human-chimpanzee divergence of 6 million years and calculated the ratio of the average distance between haplotypes through the root of the human tree to the human-chimpanzee Msn-Alas2 divergence. The average number of mutations between sequences across the base of the human gene tree was 6.2, while divergence between the human consensus and the chimpanzee sequences was 73. This leads to an estimated time of human sequence divergence of 510,000 years. This time estimate is slightly larger than the one generated from the TMRCA obtained from maximum-likelihood simulations (![]()
![]()
| DISCUSSION |
|---|
We investigated the levels and patterns of nucleotide variation in noncoding sequences from two genes mapping on either side of the X chromosome centromere in a worldwide sample of 41 humans. These genes are located
10 Mb apart and both lie in genomic regions with low rates of recombination and low gene density. We were interested in exploring the effects of this "genomic context" on patterns of variation at these genes. Two major results emerge from this study. First, we found significant linkage disequilibrium across the X chromosome centromere. Second, levels and patterns of variation at both genes show significant departures from a standard neutral model of evolution. We discuss each of these in turn.
Linkage disequilibrium across the centromeric region of the X chromosome:
We observed significant LD within Msn, within Alas2, as well as significant LD between these genes. LD was seen in our worldwide sample of 41 individuals (containing only 10 Africans), and it was also seen in our sample of 120 African individuals.
It is instructive to compare the LD seen in this study to the LD observed at Dmd and G6pd, two other X-linked loci that have been surveyed in these same 41 individuals. For example, at DmdI44, recombinants (i.e., all four gametic types) are seen between nucleotides separated by <200 bp. Dmd lies in a genomic region with high rates of recombination (>2 cM/Mb), and this likely contributes to the difference in LD seen between Dmd and Msn + Alas2. G6pd lies in Xq28 in a region of moderate recombination (
12 cM/Mb). Several mutations at G6pd are known to confer resistance to malaria and thus are under selection in regions of the world where malaria is common. In Africa, the G6pd A allele is in linkage disequilibrium with mutations at L1cam, a locus situated
500 kb from G6pd, and this long-range LD is likely caused by selection acting on the G6pd A allele (![]()
![]()
10 times greater than the LD seen near G6pd, a locus known to have unusually high LD as a consequence of selection. Because these estimates come from the same sample of individuals, the differences in LD among loci are unlikely to be due to population-level effects or to sampling strategy.
It is also useful to compare the LD seen in this study to the LD observed in other samples. ![]()
![]()
10 Mb.
What is the cause of the high LD near the centromere of the X chromosome? Because this amount of LD is not seen at other loci sampled in the same individuals, nor in other samples from the same general geographic regions, it is unlikely to be due to population-level effects such as a bottleneck or admixture. Two other factors may contribute to the observed LD. The first is simply that the centromeric region of the X chromosome experiences low rates of recombination. Estimates of recombination rate in this region are on the order of 0.10.6 cM/Mb (![]()
![]()
![]()
![]()
![]()
![]()
Rejection of the standard neutral model:
Several observations suggest that patterns of variation at both Msn and Alas2 are not consistent with the standard neutral model. First, there are low levels of variability at both loci despite typical levels of divergence. In the total data set (Msn + Alas2, N = 41), the HKA test rejects the null model in comparison with two other X-linked loci, DmdI44 and Pdha1, chosen because they reside in genomic regions with above-average rates of recombination and thus are likely to be free of the effects of selection at linked sites (Table 3). The inference of significantly lower variation at Msn + Alas2 comes with two caveats. One is that we have performed multiple HKA tests but have not corrected the significance level for multiple comparisons; thus the reduction should be interpreted as modest. The other is that the significance of the HKA test, and the subsequent inference of selection on particular loci, depends on the choice of reference loci. For example, compared with other loci showing little variation (such as Xq13.3, ![]()
![]()
, Alas2 is the least variable locus, and Msn is the third least variable locus. Most other genes that show low variability, such as F9 (![]()
![]()
![]()
![]()
|
In addition to the reduction in variability at Msn and Alas2, we observed a significant skew in the distribution of allele frequencies, particularly in non-African populations. This is seen in the negative values for Tajima's D, Fu and Li's D, and Fay and Wu's H statistics (Table 1 and Table 4). These observations are certainly consistent with selection, but may also be consistent with some demographic explanations. For example, a population expansion is expected to lead to negative values of Tajima's D and Fu and Li's D and may help account for the values in Table 4. Fay and Wu's H, which is based on the frequency of derived polymorphic nucleotides, is not expected to reject the null model under a simple population expansion (![]()
![]()
![]()
![]()
A third unexpected observation is the long-range LD seen between Msn and Alas2. As discussed above, it is not easy to account for this by any simple model of population structure, since LD is seen in the total data set and in the African sample alone. It is also not easy to account for this by the reduced recombination rate in this genomic region, since the total genetic distance between Msn and Alas2 is 16 cM.
Finally, we observed a very high level of population structure in our data, mostly driven by differences between African and non-African samples. Two sites show a nearly fixed difference between Africa and the rest of the world (Msn 1414 and Msn 4325), and the resulting
ST for the combined data is 0.45, a value greater than that for other loci surveyed in these individuals (Table 6). ![]()
6% of the markers had FST
0.40. Thus Msn + Alas2 show more population structure than most loci in the genome. However, autosomal loci, which constitute most of the data in ![]()
The standard neutral model is based on a population of constant size at mutation-drift equilibrium and in principle may be rejected because of selection, population processes, or both. At Msn and Alas2 we observe low variability, a skew in the frequency spectrum, high LD, and high
ST. Can we distinguish selection from demography as the cause of these patterns? One standard approach for distinguishing population-level processes (or artifacts of sampling) from locus-specific effects is to compare multiple loci, ideally sampled in the same set of individuals. Viewed in the context of other X-linked loci (Table 6 and Table 7), Msn and Alas2 are unusual in many but not all respects. Msn and Alas2 show less variability than most loci and greater LD than virtually all loci. They show a strong skew in the frequency distribution with an excess of rare variants, but this is also seen at a handful of other loci. Likewise, they show considerable population structure, but this too is seen at some other loci. It appears difficult to reconcile a single demographic model with this combination of results. For example, while a population expansion out of Africa coupled with subsequent migration might explain the negative values for Tajima's D, Fu and Li's D, and Fay and Wu's H, it does not account for the unusually high levels of LD both in the total sample and in Africa nor does it account for the significantly lower variability at Msn and Alas2 but not in other genes sampled in these same individuals.
On the other hand, many of our observations for Msn and Alas2 are consistent with the action of selection at linked sites near the centromere of the X chromosome. The combination of low variability, a skew in the frequency spectrum, high LD, and high
ST could be explained by background selection, positive directional selection, or some combination of these processes.
Background selection (![]()
![]()
Likewise, genetic hitchhiking (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Two recent genomic scans for selection, in each case based on different data and approaches, suggest that positive selection has acted recently near the X chromosome centromere (![]()
![]()
![]()
![]()
The observations presented here are difficult to reconcile with a simple demographic model. However, numerous aspects of our data seem consistent with both background selection and hitchhiking models, and we emphasize that both processes may be important. In principle, it might be possible to distinguish between them by surveying microsatellite variation in this region of the X chromosome. Background selection predicts a reduction in levels of variability, even for markers with high mutation rates such as microsatellites, while genetic hitchhiking only predicts reduced variation at microsatellites for very recent selective sweeps (![]()
![]()
![]()
| ACKNOWLEDGMENTS |
|---|
We thank the members of the Nachman and Hammer labs for useful discussions and comments on the manuscript. We also thank M. Noor and two anonymous reviewers for comments. This work was supported by the National Science Foundation.
Manuscript received December 25, 2003; Accepted for publication January 29, 2004.
| LITERATURE CITED |
|---|
AKEY, J. M., G. ZHANG, K. ZHANG, L. JIN, and M. D. SHRIVER, 2002 Interrogating a high-density SNP map for signatures of natural selection. Genome Res. 12:1805-1814.
ALONSO, S. and J. A. ARMOUR, 2001 A highly variable segment of human subterminal 16p reveals a history of population growth for modern humans outside Africa. Proc. Natl. Acad. Sci. USA 98:864-869.
BAHLO, M. and R. C. GRIFFITHS, 2000 Inference from gene trees in a subdivided population. Theor. Popul. Biol. 57:79-95.[CrossRef][Medline]
BAMSHAD, M. J., S. MUMMIDI, E. GONZALEZ, S. S. AHUJA, and D. M. DUNN et al., 2002 A strong signature of balancing selection in the 5' cis-regulatory region of CCR5. Proc. Natl. Acad. Sci. USA 99:10539-10544.
BRAVERMAN, J. M., R. R. HUDSON, N. L. KAPLAN, C. H. LANGLEY, and W. STEPHAN, 1995 The hitchhiking effect on the site frequency spectrum of DNA polymorphims. Genetics 140:783-796.[Abstract]
CHARLESWORTH, B., M. T. MORGAN, and D. CHARLESWORTH, 1993 The effect of deleterious mutations on neutral molecular variation. Genetics 134:1289-1303.[Abstract]
CLARK, A. G., K. M. WEISS, D. A. NICKERSON, S. L. TAYLOR, and A. BUCHANAN et al., 1998 Haplotype structure and population genetic inferences from nucleotide-sequence variation in human lipoprotein lipase. Am. J. Hum. Genet. 63:595-612.[CrossRef][Medline]
COOPER, D. N., and M. KRAWCZAK, 1993 Human Gene Mutation. Bios Scientific, Oxford.
ENARD, W., M. PRZEWORSKI, S. E. FISHER, C. S. LAI, and V. WIEBE et al., 2002 Molecular evolution of FOXP2, a gene involved in speech and language. Nature 418:869-872.[CrossRef][Medline]
EXCOFFIER, L., P. E. SMOUSE, and J. M. QUATTRO, 1992 Analysis of molecular variance inferred from metric distances among DNA haplotypes: application to human mitochondrial DNA restriction data. Genetics 131:479-491.[Abstract]
FAY, J. C. and C.-I WU, 2000 Hitchhiking under positive Darwinian selection. Genetics 155:1405-1413.
FRISSE, L., R. R. HUDSON, A. BARTOSZEWICZ, J. D. WALL, and J. DONFACK et al., 2001 Gene conversion and different population histories may explain the contrast between polymorphism and linkage disequilibrium levels. Am. J. Hum. Genet. 69:831-843.[CrossRef][Medline]
FU, Y. X. and W. H. LI, 1993 Statistical tests of neutrality of mutations. Genetics 133:693-709.[Abstract]
FULLERTON, S. M., A. G. CLARK, K. M. WEISS, D. A. NICKERSON, and S. L. TAYLOR et al., 2000 Apolipoprotein E variation at the sequence haplotype level: implications for the origin and maintenance of a major human polymorphism. Am. J. Hum. Genet. 67:881-900.[CrossRef][Medline]
GALTIER, N., F. DEPAULIS, and N. H. BARTON, 2000 Detecting bottlenecks and selective sweeps from DNA sequence polymorphism. Genetics 155:981-987.
GILAD, Y., S. ROSENBERG, M. PRZEWORSKI, D. LANCET, and K. SKORECKI, 2002 Evidence for positive selection and population structure at the human MAO-A gene. Proc. Natl. Acad. Sci. USA 99:862-867.
HAMBLIN, M. T. and A. DI RIENZO, 2000 Detection of the signature of natural selection in humans: evidence from the duffy blood group locus. Am. J. Hum. Genet. 66:1669-1679.[CrossRef][Medline]
HAMMER, M. F., 1995 A recent common ancestry for human Y chromosomes. Nature 378:376-378.[CrossRef][Medline]
HAMMER, M. F., F. BLACKMER, D. GARRIGAN, M. W. NACHMAN, and J. A. WILDER, 2003 Human population structure and its effects on sampling Y chromosome sequence variation. Genetics 164:1495-1509.
HARDING, R. M., S. M. FULLERTON, R. C. GRIFFITHS, J. BOND, and M. J. COX et al., 1997 Archaic African and Asian lineages in the genetic ancestry of modern humans. Am. J. Hum. Genet. 60:772-789.[Medline]
HARDING, R. M., E. HEALY, A. J. RAY, N. S. ELLIS, and N. FLANAGAN et al., 2000 Evidence for variable selective pressures at MC1R. Am. J. Hum. Genet. 66:1351-1361.[CrossRef][Medline]
HARDISON, R. C., K. M. ROSKIN, S. YANG, M. DIEKHANS, and W. J. KENT et al., 2003 Covariation in frequencies of substitution, deletion, transposition, and recombination during eutherian evolution. Genome Res. 13:13-26.
HARRIS, E. E. and J. HEY, 1999 X chromosome evidence for ancient human histories. Proc. Natl. Acad. Sci. USA 96:3320-3324.
HARRIS, E. E. and J. HEY, 2001 Human populations show reduced DNA sequence variation at the Factor IX locus. Curr. Biol. 11:774-778.[CrossRef][Medline]
HELLMANN, I., I. EBERSBERGER, S. E. PTAK, S. PAABO, and M. PRZEWORSKI, 2003 A neutral explanation for the correlation of diversity with recombination rates in humans. Am. J. Hum. Genet. 72:1527-1535.[CrossRef][Medline]
HEY, J., 1997 Mitochondrial and nuclear genes present conflicting portraits of human origins. Mol. Biol. Evol. 14:166-172.[Abstract]
HUDSON, R. R., M. KREITMAN, and M. AGUADE, 1987 A test of neutral molecular evolution based on nucleotide data. Genetics 116:153-159.
HUTTLEY, G. A., M. W. SMITH, M. CARRINGTON, and S. J. O'BRIEN, 1999 A scan for linkage disequilibrium across the human genome. Genetics 152:1711-1722.