IDT. Quality oligos. Every time.

Genetics, Vol. 155, 1855-1864, August 2000, Copyright © 2000

Contrasting Evolutionary Histories of Two Introns of the Duchenne Muscular Dystrophy Gene, Dmd, in Humans

Michael W. Nachmana and Susan L. Crowella
a Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721

Corresponding author: Michael W. Nachman, Department of Ecology and Evolutionary Biology, Biosciences West Bldg., University of Arizona, Tucson, AZ 85721., nachman{at}u.arizona.edu (E-mail)

Communicating editor: W. STEPHAN


*  ABSTRACT
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

The Duchenne muscular dystrophy (Dmd) locus lies in a region of the X chromosome that experiences a high rate of recombination and is thus expected to be relatively unaffected by the effects of selection on nearby genes. To provide a picture of nucleotide variability at a high-recombination locus in humans, we sequenced 5.4 kb from two introns of Dmd in a worldwide sample of 41 alleles from Africa, Asia, Europe, and the Americas. These same regions were also sequenced in one common chimpanzee and one orangutan. Dramatically different patterns of genetic variation were observed at these two introns, which are separated by >500 kb of DNA. Nucleotide diversity at intron 44 ({pi} = 0.141%) was more than four times higher than nucleotide diversity at intron 7 ({pi} = 0.034%) despite similar levels of divergence for these two regions. Intron 7 exhibited significant linkage disequilibrium extending over 10 kb and also showed a significant excess of rare polymorphisms. In contrast, intron 44 exhibited little linkage disequilibrium and no skew in the frequency distribution of segregating sites. Intron 7 was much more variable in Africa than in other continents, while intron 44 displayed similar levels of variability in different geographic regions. Comparison of intraspecific polymorphism to interspecific divergence using the HKA test revealed a significant reduction in variability at intron 7 relative to intron 44, and this effect was most pronounced in the non-African samples. These results are best explained by positive directional selection acting at or near intron 7 and demonstrate that even genes in regions of high recombination may be influenced by selection at linked sites.


IDENTIFYING the forces shaping genetic variation in natural populations remains a key problem in population genetics. Surprisingly, our understanding of the amount and structure of genetic variation at the nucleotide level in humans is still in its early stages. Mutation, migration, drift, recombination, selection at individual loci, the effects of selection at linked sites, and demographic history undoubtedly all play a role in shaping patterns of human genetic variation, although the relative importance of these different factors is not yet clear. Significant progress into this problem has been made with recent studies of nucleotide variation at ß-globin (HARDING et al. 1997 Down), dystrophin (ZIETKIEWICZ et al. 1997 Down, ZIETKIEWICZ et al. 1998 Down), lipoprotein lipase (CLARK et al. 1998 Down), introns of seven X-linked genes (NACHMAN et al. 1998 Down), pyruvate dehydrogenase E1 {alpha} subunit (HARRIS and HEY 1999 Down), angiotensin converting enzyme (RIEDER et al. 1999 Down), a noncoding region at Xq13.3 (KAESSMANN et al. 1999 Down), the X-chromosome-specific zinc-finger protein (JARUZELSKA et al. 1999 Down), and the melanocortin 1 receptor (RANA et al. 1999 Down; HARDING et al. 2000 Down). Collectively, these studies have shown that the average level of nucleotide diversity in humans is quite low, largely confirming a result first obtained by LI and SADLER 1991 Down. However, there is also substantial heterogeneity in levels and patterns of genetic variation among loci, and a central challenge now is to explain these differences.

Theoretical studies show that the interaction of selection and recombination can have a dramatic effect on levels of nucleotide variability, either through the fixation of advantageous mutations (i.e., genetic hitchhiking; MAYNARD SMITH and HAIGH 1974 Down) or the removal of deleterious mutations (i.e., background selection; CHARLESWORTH et al. 1993 Down). Both processes are expected to reduce levels of neutral genetic variation in genomic regions with low rates of recombination. Estimates of recombination rate for different genomic regions can be obtained by comparing the genetic and physical locations of markers. In humans, there is evidence for both local and large-scale variation in the recombinational landscape. For example, several studies have revealed recombinational hotspots, suggesting that recombination rates may vary substantially over a scale of several kilobases (e.g., OUDET et al. 1992 Down; HARDING et al. 1997 Down). A common large-scale pattern is the suppression of recombination near centromeres of metacentric chromosomes (e.g., NAGARAJA et al. 1997 Down). Variation at both of these scales is likely to be important in determining the effects of selection at linked sites. There is good evidence for a positive correlation between regional recombination rate and levels of nucleotide heterozygosity in Drosophila melanogaster (BEGUN and AQUADRO 1992 Down; AQUADRO et al. 1994 Down; MORIYAMA and POWELL 1996 Down) and weaker evidence for a positive association between recombination rate and levels of nucleotide variability in a number of other organisms (NACHMAN 1997 Down; DVORAK et al. 1998 Down; KRAFT et al. 1998 Down; STEPHAN and LANGLEY 1998 Down), including humans (NACHMAN et al. 1998 Down; PRZEWORSKI et al. 2000 Down).

Motivated by theoretical expectations concerning the effects of selection on linked neutral variation and the empirical evidence suggesting that such effects may be common, we were interested in documenting patterns of nucleotide variability at a gene that experiences a very high rate of recombination in humans. In principle, high-recombination genes are least likely to be affected by selection at linked sites and are thus more likely to reflect neutral, equilibrium conditions.

Dystrophin is the protein product of the Duchenne muscular dystrophy (Dmd) locus. Duchenne muscular dystrophy is a common inherited disease with an incidence worldwide of 1 in 3500 births, many of which arise from new mutations. The Dmd locus is ~2.4 Mb long and consists of 79 exons that encode a 14-kb transcript. This mRNA codes for a 3685-amino-acid protein of 427 kD that shows similarity to several cytoskeletal proteins. Dmd is X-linked and lies in a genomic region experiencing high rates of recombination. Fine scale mapping of this region reveals overall recombination frequencies of 12 cM across 2 Mb of DNA (ABBS et al. 1990 Down; OUDET et al. 1992 Down). This overall rate, 6 cM/Mb, is about six times the average value of ~1 cM/Mb across the human genome. OUDET et al. 1992 Down documented considerable heterogeneity in recombination frequencies in different regions of the Dmd gene and found that some regions experience recombination rates >10 cM/Mb (Fig 1). Previous studies have surveyed worldwide variation in intron 44 of Dmd using single-strand conformation polymorphisms (SSCPs) to detect mutations (ZIETKIEWICZ et al. 1997 Down, ZIETKIEWICZ et al. 1998 Down) or through direct DNA sequencing (NACHMAN et al. 1998 Down). ZIETKIEWICZ et al. 1997 Down, ZIETKIEWICZ et al. 1998 Down screened 7622 bp in a worldwide sample of 250 chromosomes but may not have uncovered all of the underlying variation since their study was based on mutation detection using SSCP. NACHMAN et al. 1998 Down surveyed 1537 bp in 10 individuals by sequencing all sites.



View larger version (23K):
In this window
In a new window
Download PPT slide
 
Figure 1. Map of Dmd. (A) Recombination rate estimates for different portions of the Dmd locus (data from OUDET et al. 1992 Down). (B) Physical map of Dmd. (C) Position of exons. (D) Amplified regions of introns 7 and 44 in this study. Arrows denote PCR primer positions.

Here, we further investigate patterns of genetic variation at two introns (7 and 44) of Dmd in a global sample of 41 alleles and find strikingly different patterns of genetic variation in each region. Both of these introns experience recombination rates well above the genomic average and are expected to be relatively free of the effects of selection at linked sites. Nonetheless, the contrasting patterns of variation at these two introns suggest that recent directional selection has acted at or near intron 7 of Dmd.


*  MATERIALS AND METHODS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Samples:
Forty-one men were sampled, including 10 from Africa, 10 from Europe, 11 from Asia (including one from Melanesia), and 10 from the Americas. Human genomic DNAs were provided by Dr. M. F. Hammer from the Y chromosome consortium (YCC) DNA repository. A single male common chimpanzee (Pan troglodytes) and a single male orangutan (Pongo pygmaeus) were also surveyed from DNAs provided by Dr. O. A. Ryder. By sequencing X chromosomes in males, we were able to amplify by PCR and sequence a single allele per individual and thus avoid problems associated with sequencing and scoring heterozygous sites. We were also able to recover haplotypes directly and thereby look at patterns of linkage disequilibrium among all sites in the sample.

PCR amplification and sequencing of Dmd:
A map of the Dmd locus is shown in Fig 1. Additional detailed information about the structure of this locus can be found at http://www.dmd.nl. Intron 7 and intron 44 are separated by >500 kb of DNA. Both introns lie in genomic regions experiencing high rates of recombination (>4 cM/Mb), although the intervening introns experience considerably lower rates of recombination (<1 cM/Mb). DNA was PCR amplified (SAIKI et al. 1988 Down) in 25-µl volumes with 40 cycles of 94° 1 min, 55° 1 min, and 72° 2 min. Amplification primers were designed from published sequence for intron 7 (GenBank accession no. U60822) and intron 44 (GenBank accession no. M86524) and are listed in Table 1. Products were cycle-sequenced and run on an ABI 377 automated sequencer or sequenced manually as in NACHMAN et al. 1998 Down. A total of 2389 bp from intron 7 and 3000 bp from intron 44 were sequenced. The 3000 bp in intron 44 are contiguous but the 2389 bp in intron 7 consist of two fragments (1388 bp and 1001 bp) separated by 8820 bp of largely repetitive DNA, which we found difficult to sequence (Fig 1). The 3000-bp portion of intron 44 includes and extends the smaller region (1537 bp) sequenced in 10 individuals in NACHMAN et al. 1998 Down. Sequences have been submitted to GenBank under accession nos. AF279921, AF280049.


 
View this table:
In this window
In a new window

 
Table 1. Amplification primers used in this study

Data analysis:
Sequences were aligned by eye, and the numbers and frequencies of all polymorphisms were counted. Two measures of nucleotide variability, {pi} (NEI and LI 1979 Down) and {theta} (WATERSON 1975 Down), were calculated. Nucleotide diversity, {pi}, is based on the average number of nucleotide differences between two sequences randomly drawn from a sample, and {theta} is based on the proportion of segregating sites in a sample. Under neutral, equilibrium conditions, both {pi} and {theta} estimate the parameter 3Neµ for X-linked loci, where Ne is the effective population size and µ is the neutral mutation rate. Departures from a neutral equilibrium frequency distribution of polymorphisms were evaluated using two approaches (TAJIMA 1989 Down; FU and LI 1993 Down). Linkage disequilibrium (D') was calculated for a set of independent pairwise comparisons between non-unique polymorphic sites (LEWONTIN 1964 Down, LEWONTIN 1995 Down), and the significance of D' was assessed using Fisher's exact tests. Ratios of polymorphism within humans to divergence between human and chimpanzee or human and orangutan were compared with expectations under a neutral model using the Hudson, Kreitman and Aguadé (HKA) test (HUDSON et al. 1987 Down). Polymorphism was based on variation segregating among the 41 human alleles and divergence was based on a single randomly chosen human allele and the single chimpanzee or orangutan allele.


*  RESULTS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Polymorphic sites for introns 7 and 44 are shown in Table 2 and Table 3, respectively. Numbers of segregating sites, nucleotide diversity, measures of the frequency distribution, and levels of divergence are summarized in Table 4 for both introns. Nine segregating sites were observed in intron 7, and 19 segregating sites were observed in intron 44. Intron 7 had three insertion-deletion polymorphisms; two consisted of a single nucleotide and one consisted of 5 bp. Intron 44 contained a complicated compound microsatellite consisting of several different dinucleotide repeats (Table 3). Nucleotide diversity at intron 44 ({pi} = 0.141%) was more than four times greater than nucleotide diversity at intron 7 ({pi} = 0.034%). Waterson's {theta}, which is based on the number of segregating sites, was less than twice as large in intron 44 ({theta} = 0.148) as in intron 7 ({theta} = 0.088). The relative similarity in {theta} despite the difference in {pi} between the two introns is due in large part to the difference in the number of singletons in each intron. Seven out of 9 (78%) polymorphic sites in intron 7 are singletons, while 6 out of 19 (32%) polymorphic sites in intron 44 are singletons. The frequency distribution of polymorphisms is consistent with neutral expectations for intron 44, but there is an excess of rare polymorphisms in intron 7, reflected in the significantly negative values of Tajima's D and Fu and Li's D statistics (Table 4).


 
View this table:
In this window
In a new window

 
Table 2. Individuals sampled and polymorphic sites at Dmd intron 7


 
View this table:
In this window
In a new window

 
Table 3. Polymorphic sites at Dmd intron 44


 
View this table:
In this window
In a new window

 
Table 4. Nucleotide polymorphism and divergence at Dmd

Divergence was significantly higher at intron 7 than at intron 44 in comparisons between human and chimpanzee (t = 2.30, P < 0.05). In comparisons between human and orangutan, divergence was only slightly and not significantly higher at intron 7 than at intron 44 (Table 4).

We investigated patterns of linkage disequilibrium by comparing pairs of sites in order along the chromosome; this provides a set of independent comparisons for tests of significance (LEWONTIN 1995 Down). Sites containing singletons were excluded from this analysis. In intron 7, comparisons were made among four sites (section one, 268, 551; section two, 481, 540). Significant linkage disequilibrium was observed in each of the three sequential comparisons involving these sites (Fisher's exact test, P < 0.001 for each, after Bonferroni correction for multiple tests). Sites 268 and 551 in section one of intron 7 are ~10 kb away from sites 481 and 540 in section two of intron 7. The high level of linkage disequilibrium in intron 7 results in two major haplotypes (represented by YCC individuals 32 and 8; Table 2). In intron 44, 13 sequential pairwise comparisons were made among 14 sites (1, 191, 681, 814, 834, 1160, 1224, 1353, 1830, 1858, 2374, 2423, 2532, and 2691). Significant linkage disequilibrium was observed in three of these comparisons (Fisher's exact test, P < 0.0001 for sites 834–1160 and 1160–1224; P < 0.05 for sites 2374–2423, after Bonferroni correction).

None of the 3 comparisons between pairs of sites in intron 7 contained all four gametic types, while 6 of the 13 comparisons between pairs of sites in intron 44 contained all four gametic types. Thus, more recombination is observed among the sequences in intron 44 than in intron 7, consistent with the mapping data in Fig 1. We also calculated the neutral recombination parameter, {gamma}, from the polymorphism data at intron 44 using the method of HEY and WAKELEY 1997 Down. This provided an estimate of the per-site population recombination rate, 2Ncf = 5.58 x 10-3, which corresponds to a rate of 27.9 cM/Mb (where cf is the female recombination rate assuming N = 104; e.g., HAMMER 1995 Down). This value is substantially larger than the estimate of 15 cM/Mb obtained from mapping data (Fig 1), although the variance associated with both of these estimates is large. If population size is closer to 20,000–30,000 (e.g., NACHMAN et al. 1998 Down; HARRIS and HEY 1999 Down), then the inferred recombination rate from the sequence data (9.3–14.0 cM/Mb) is in better agreement with the estimate from mapping data. {gamma} could not be calculated for either portion of intron 7 because there are no incongruent pairs of sites in these data; the maximum-likelihood estimate of {gamma} in this case is zero (HEY and WAKELEY 1997 Down).

The geographic distribution of nucleotide variation at each intron is shown in Table 5. For intron 7, nucleotide diversity is substantially lower in the non-African samples ({pi} ranges from 0 to 0.025%) than in the African sample ({pi} = 0.08%). The two major haplotypes at intron 7 are both present in Africa, but only one is present out of Africa. For intron 44, nucleotide diversity in the non-African samples ({pi} ranges from 0.111 to 0.144%) is more than half the value observed in the African sample ({pi} = 0.173%). Surprisingly, for both introns, the Asian sample is the least variable and is even slightly, though not significantly, less variable than the sample from the Americas. Average FST calculated across all populations was six times higher for intron 7 (FST = 0.176) than for intron 44 (FST = 0.028). This overall difference in FST is attributable to the differences between the two introns in the partitioning of genetic variation between African and non-African populations, as can be seen from the distribution of variation in Table 2 and Table 3. Average FST calculated across all non-African populations was zero for intron 7 and was very small for intron 44 (FST = 0.013).


 
View this table:
In this window
In a new window

 
Table 5. Amount and distribution of polymorphisms at Dmd introns 7 and 44 by geographic region

HKA comparisons involving polymorphism and Homo-Pan divergence between intron 7 and intron 44 are shown in Table 6. When all the data are considered, there is only a marginally significant rejection of the null model (P = 0.08). However, when the non-African populations are considered collectively, there is a significant reduction in the ratio of polymorphism to divergence at intron 7 relative to intron 44 (P < 0.05). A significant reduction is also seen in Europe and in Asia, but not in Africa or the Americas. HKA tests involving Homo-Pongo comparisons yield similar results: a significant or marginally significant rejection of the null model is obtained in comparisons involving Asia (P < 0.05), Europe (P = 0.06), or all non-African populations (P = 0.08), but not in comparisons involving the total sample, Africa, or the Americas (P > 0.10). We also performed HKA tests comparing Dmd intron 7 and Dmd intron 44 to another X-linked gene, Pdha1 (HARRIS and HEY 1999 Down). The Pdha1 data consist of 4200 bp surveyed in a worldwide sample of 35 chromosomes. In comparisons using the entire sample, the ratio of polymorphism to divergence is lower at Dmd intron 7 than at Pdha1 (HKA {chi}2 = 3.41, P = 0.06), but is nearly identical at Dmd intron 44 and at Pdha1 (HKA {chi}2 = 0.01, P > 0.5).


 
View this table:
In this window
In a new window

 
Table 6. HKA tests comparing Dmd intron 7 vs. intron 44, Homo vs. Pan


*  DISCUSSION
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

We investigated the amount and structure of DNA sequence variation at two introns of Dmd in a worldwide sample of 41 humans and found that these two introns have strikingly different patterns of genetic variation. In general, intron 44 had a high level of nucleotide diversity, little linkage disequilibrium, no skew in the frequency distribution of polymorphisms, and revealed similar patterns of variation in and out of Africa. Patterns of variation at intron 44 are entirely consistent with a neutral model of molecular evolution. In contrast, intron 7 had a low level of nucleotide diversity, displayed significant linkage disequilibrium extending over 10 kb, a significant excess of rare polymorphisms, and very different patterns of variation in and out of Africa. Jointly, the patterns of variation observed at these two introns are inconsistent with a standard, neutral equilibrium model. The statistical evidence against this model derives from the significantly negative values of TAJIMA's (1989) D and FU and LI's (1993) D for intron 7 (Table 4) and from the significant HKA tests showing reduced variability at intron 7 in non-African populations (Table 6). These patterns are difficult to reconcile with nonequilibrium population-level effects, such as migration or changes in population size, since such effects are expected to affect all loci in a roughly proportional fashion. On the other hand, all of our observations are consistent with positive directional selection acting recently at or near intron 7. Positive directional selection can reduce levels of linked neutral variability, increase levels of linkage disequilibrium, and produce a skew in the frequency distribution toward an excess of rare sites (MAYNARD SMITH and HAIGH 1974 Down; KAPLAN et al. 1989 Down; TAJIMA 1989 Down; BRAVERMAN et al. 1995 Down). Moreover, if selection does not act equally in all geographic regions, it may also lead to increased levels of population differentiation (e.g., STEPHAN 1994 Down; STEPHAN et al. 1998 Down).

The exact nature of selection is difficult to determine from the observed distribution of variation. There are two major haplotypes at intron 7 (represented by YCC individuals 32 and 8) and these haplotypes are three (YCC 32) and one (YCC 8) mutational steps derived from the ancestral human haplotype, inferred from parsimony using the chimpanzee and orangutan sequences as outgroups. Both of the major haplotypes are present in Africa but only one is present out of Africa. All other haplotypes in our sample are one mutational or recombinational step derived from one of these two major haplotypes. One straightforward explanation for the differing patterns of variation at intron 44 and at intron 7 is a partial selective sweep of the more common haplotype (YCC 32) at intron 7, especially in non-African populations. The fact that variation is reduced primarily in non-African populations suggests that a selective sweep may have occurred concomitant with or following the movement of anatomically modern humans out of Africa. It should be noted that despite the presence of two major alleles, there is no evidence for an excess of variation or for polymorphisms at intermediate frequency as might be expected under prolonged balancing selection (e.g., HUDSON et al. 1987 Down; KREITMAN and HUDSON 1991 Down).

The likelihood that selection has acted at or near intron 7 raises the question of which site or sites are the direct targets of selection. The genomic distance in base pairs (d) over which selection is likely to exert a strong effect on levels of linked neutral variability is a function of the strength of selection, s, and the recombination rate per nucleotide, c, and is approximated by d = (0.01) s/c (KAPLAN et al. 1989 Down, p. 896). For example, WANG et al. 1999 Down observed a reduction in neutral variation over a region of only a few kilobases in the vicinity of the 5' promoter region of the teosinte-branched 1 locus in maize, a gene that has been a target of strong artificial selection during the domestication of maize. The recombination rate over the entire Dmd locus is ~6 cM/Mb, but it may be closer to 4 cM/Mb in intron 7 (OUDET et al. 1992 Down; Fig 1), corresponding to c = 4 x 10-8 per nucleotide. Mutations with selection coefficients <10-4 are unlikely to have been affected by deterministic processes in ancestral human populations, given most estimates of effective population size (e.g., HAMMER 1995 Down; ZIETKIEWICZ et al. 1998 Down). If we consider a range of selection coefficients, 0.001 < s < 0.1, then linked neutral variability is expected to be reduced over a genomic distance ranging from 500 bp to 50 kb. Nucleotide variability at intron 7 is low in both of the segments we sequenced, and these are separated by ~9 kb. This implies that selection coefficients may be >10-3, assuming a simple model of genetic hitchhiking (KAPLAN et al. 1989 Down). Nonetheless, the large size of the window of reduced variation points to the difficulty of identifying the specific site or sites that have been under selection. This stands in contrast to the narrow window of reduced variability in maize at the tb1 locus (WANG et al. 1999 Down) or the narrow window of elevated variability at Adh in Drosophila (KREITMAN and HUDSON 1991 Down). Because average recombination rates in humans (10-8 per site) are roughly threefold lower than in Drosophila (2–3 x 10-8 per site; NACHMAN and CHURCHILL 1996 Down), the size of windows affected by selection on linked sites in humans may, on average, be larger than in flies (assuming equivalent selection coefficients).

The effects of selection are expected to be easiest to detect in genomic regions experiencing low rates of recombination because these regions will contain more potential targets of selection for a given genetic distance. Indeed, our study was motivated by an interest in depicting patterns of variation at a high-recombination gene to capture the distribution of variation that may be closest to neutral, equilibrium values. However, the observed patterns of variation strongly suggest that selection has acted in this region, and these observations raise the possibility that the signature of selection at the molecular level may be common in the human genome. Moreover, the differences in patterns of variation seen at intron 7 and intron 44 highlight that a single functional gene may contain segments with dramatically different evolutionary histories.

Overall, the level of variation we observed in intron 44 is in general agreement with previous surveys of nucleotide variability in this intron (NACHMAN et al. 1998 Down; ZIETKIEWICZ et al. 1998 Down). In a sample of 10 alleles surveyed over 1537 bp, NACHMAN et al. 1998 Down reported nucleotide diversity of 0.187%, a value that is not significantly different from the value reported here. ZIETKIEWICZ et al. 1997 Down, ZIETKIEWICZ et al. 1998 Down surveyed 7622 bp in a worldwide sample of 250 alleles and reported nucleotide diversity of 0.101%. The slightly lower value obtained by ZIETKIEWICZ et al. 1998 Down may reflect that their survey was based on polymorphism detection using SSCP.

The level of nucleotide variability observed at each intron can be used to estimate the effective population size under the neutral expectation for X-linked genes, {pi} = 3Neµ, assuming a sex ratio of 1. Using the human-chimpanzee divergence values in Table 4, the estimated mutation rates are µ = 3.26 x 10-8 for intron 7 and µ = 1.8 x 10-8 for intron 44 assuming a divergence time of 5 mya and a generation time of 20 years. The estimated population sizes are ~Ne = 3500 for intron 7 and Ne = 26,000 for intron 44. The corresponding coalescence times are ~210,000 years for intron 7 and 1,560,000 years for intron 44. Despite the large variance associated with each of these estimates, these differences underscore the fact that different regions of the genome, and even of the same gene, may provide quite different estimates of parameters that are important for understanding human evolution. Genomic regions that have been influenced by selection at linked sites may provide substantial underestimates of the long-term effective population size for humans. The larger value of Ne obtained from intron 44 is likely to better reflect equilibrium conditions and suggests that a long-term effective population size for humans may be on the order of 30,000 rather than 10,000 (e.g., HAMMER 1995 Down).

The geographic patterns reported here are in general agreement with other studies of nucleotide variability in humans in revealing more variation in Africa than in other continental regions (e.g., HARDING et al. 1997 Down; ZIETKIEWICZ et al. 1998 Down; HARRIS and HEY 1999 Down; KAESSMANN et al. 1999 Down). This observation is often interpreted as evidence that modern humans throughout the world derived recently from an ancestral African population (e.g., KAESSMANN et al. 1999 Down), although it has also been pointed out that some of the African diversity may derive from human migration back to Africa (e.g., HARDING et al. 1997 Down; HAMMER et al. 1998 Down). One surprising observation in our data is the greater variability in the Americas than in Asia. The Americas are typically thought to have been colonized from Asia, and samples from the Americas typically reveal lower levels of genetic variability than samples from Asia (e.g., KARAFET et al. 1999 Down). At both intron 7 and intron 44, we observe higher levels of variability in the Americas than in Asia, though in neither case is this difference significant. Surveys of nucleotide variability at additional unlinked loci from multiple populations will be essential for disentangling the effects of population-level processes from selection in shaping variation in different geographic regions.


*  ACKNOWLEDGMENTS

We thank Mike Hammer for discussions, Isaac Jones for help with sequencing, and Wolfgang Stephan and two anonymous reviewers for helpful comments on the manuscript. This work was supported by the National Science Foundation.

Manuscript received September 13, 1999; Accepted for publication April 17, 2000.


*  LITERATURE CITED
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

ABBS, S., R. G. ROBERTS, C. G. MATHEW, D. R. BENTLEY, and M. BOBROW, 1990  Accurate assessment of intragenic recombination frequency within the Duchenne muscular dystrophy gene. Genomics 7:602-606[Medline].

AQUADRO, C. F., D. J. BEGUN and E. C. KINDAHL, 1994 Selection, recombination, and DNA polymorphism in Drosophila, pp. 46–56 in Non-neutral Evolution: Theories and Molecular Data, edited by B. GOLDING. Chapman & Hall, New York.

BEGUN, D. J. and C. F. AQUADRO, 1992  Levels of naturally occurring DNA polymorphism correlate with recombination rates in D. melanogaster.. Nature 356:519-520[Medline].

BRAVERMAN, J. M., R. R. HUDSON, N. L. KAPLAN, C. H. LANGLEY, and W. STEPHAN, 1995  The hitchhiking effect on the site frequency spectrum of DNA polymorphism. Genetics 140:783-796[Abstract].

CHARLESWORTH, B., M. T. MORGAN, and D. CHARLESWORTH, 1993  The effect of deleterious mutations on neutral molecular variation. Genetics 134:1289-1303[Abstract].

CLARK, A. G., K. M. WEISS, D. A. NICKERSON, S. L. TAYLOR, and A. BUCHANAN et al., 1998  Haplotype structure and population genetic inferences from nucleotide sequence variation in human lipoprotein lipase. Am. J. Hum. Genet. 63:595-612[Medline].

DVORÁK, J., M. C. LUO, and Z. L. YANG, 1998  Restriction fragment length polymorphism and divergence in the genomic regions of high and low recombination in self-fertilizing and cross-fertilizing Aegilops species. Genetics 148:423-434[Abstract/Free Full Text].

FU, Y. X. and W. H. LI, 1993  Statistical tests of neutrality of mutations. Genetics 133:693-709[Abstract].

HAMMER, M., 1995  A recent common ancestry for human Y chromosomes. Nature 378:376-378[Medline].

HAMMER, M. F., T. KARAFET, A. RASANAYAGAM, E. T. WOOD, and T. K. ALTHEIDE et al., 1998  Out of Africa and back again: nested cladistic analysis of human Y chromosome variation. Mol. Biol. Evol. 15:427-441[Abstract].

HARDING, R. M., S. M. FULLERTON, R. C. GRIFFITHS, J. BOND, and M. J. COX et al., 1997  Archaic African and Asian lineages in the genetic ancestry of modern humans. Am. J. Hum. Genet. 60:772-789[Medline].

HARDING, R. M., E. HEALY, A. J. RAY, N. S. ELLIS, and N. FLANAGAN et al., 2000  Evidence for variable selective pressures at Mc1r. Am. J. Hum. Genet. 66:1351-1361[Medline].

HARRIS, E. E. and J. HEY, 1999  X chromosome evidence for ancient human histories. Proc. Natl. Acad. Sci. USA 96:3320-3324[Abstract/Free Full Text].

HEY, J. and J. WAKELEY, 1997  A coalescent estimator of the population recombination rate. Genetics 145:833-846[Abstract].

HUDSON, R. R., M. KREITMAN, and M. AGUADÉ, 1987  A test of neutral molecular evolution based on nucleotide data. Genetics 116:153-159[Abstract/Free Full Text].

JARUZELSKA, J., E. ZIETKIEWICZ, M. BATZER, D. E. C. COLE, and J. P. MOISAN et al., 1999  Spatial and temporal distribution of the neutral polymorphisms in the last Zfx intron: analysis of haplotype structure and genealogy. Genetics 152:1091-1101[Abstract/Free Full Text].

KAESSMANN, H., F. HEI&SZLIG;IG, A. VON HAESELER, and S. PÄÄBO, 1999  DNA sequence variation in a non-coding region of low recombination on the human X chromosome. Nat. Genet. 22:78-81[Medline].

KAPLAN, N. L., R. R. HUDSON, and C. H. LANGLEY, 1989  The "hitchhiking effect" revisited. Genetics 123:887-899[Abstract/Free Full Text].

KARAFET, T. M., S. L. ZEGURA, O. POSUKH, L. OSIPOVA, and A. BERGEN et al., 1999  Ancestral Asian source(s) of New World Y-chromosome founder haplotypes. Am. J. Hum. Genet. 64:817-831[Medline].

KRAFT, T., T. SALL, I. MAGNUSSON-RADING, N. O. NILSSON, and C. HALLDEN, 1998  Positive correlation between recombination rates and levels of genetic variation in natural populations of sea beet (Beta vulgaris subsp. maritima). Genetics 150:1239-1244[Abstract/Free Full Text].

KREITMAN, M. and R. R. HUDSON, 1991  Inferring the evolutionary histories of the Adh and Adh-dup loci in Drosophila melanogaster from patterns of polymorphism and divergence. Genetics 127:565-582[Abstract].

LEWONTIN, R. C., 1964  The interaction of selection and linkage. I. General considerations; heterotic models. Genetics 43:419-434.

LEWONTIN, R. C., 1995  The detection of linkage disequilibrium in molecular sequence data. Genetics 140:377-388[Abstract].

LI, W. H. and L. A. SADLER, 1991  Low nucleotide diversity in man. Genetics 129:513-523[Abstract].

MAYNARD SMITH, J. and J. HAIGH, 1974  The hitch-hiking effect of a favourable gene. Genet. Res. 23:23-35[Medline].

MORIYAMA, E. N. and J. R. POWELL, 1996  Intraspecific nuclear DNA variation in Drosophila. Mol. Biol. Evol. 13:261-277[Abstract].

NACHMAN, M. W., 1997  Patterns of DNA variability at X-linked loci in Mus domesticus.. Genetics 147:1303-1316[Abstract].

NACHMAN, M. W. and G. A. CHURCHILL, 1996  Heterogeneity in rates of recombination across the mouse genome. Genetics 142:537-548[Abstract].

NACHMAN, M. W., V. L. BAUER, S. L. CROWELL, and C. F. AQUADRO, 1998  DNA variability and recombination rates at X-linked loci in humans. Genetics 150:1133-1141[Abstract/Free Full Text].

NAGARAJA, R., S. MACMILLAN, J. KERE, C. JONES, and S. GRIFFIN et al., 1997  X chromosome map at 75-kb STS resolution, revealing extremes of recombination and GC content. Genome Res. 7:210-222[Abstract/Free Full Text].

NEI, M. and W.-H. LI, 1979  Mathematical model for studying genetic variation in terms of restriction endonucleases. Proc. Natl. Acad. Sci. USA 76:5269-5273[Abstract/Free Full Text].

OUDET, C., A. HANAUER, P. CLEMENS, T. CASKEY, and J. L. MANDEL, 1992  Two hot spots of recombination in the Dmd gene correlate with the deletion prone regions. Hum. Mol. Genet. 1:599-603[Abstract/Free Full Text].

PRZEWORSKI, M., R. R. HUDSON, and A. DIRIENZO, 2000  Adjusting the focus on human variation. Trends Genet. in press.

RANA, B. K., D. HEWETT-EMMETT, L. JIN, B. H. J. CHANG, and N. SAMBUUGHIN et al., 1999  High polymorphism at the human melanocortin 1 receptor locus. Genetics 151:1547-1557[Abstract/Free Full Text].

RIEDER, M. J., S. L. TAYLOR, A. G. CLARK, and D. A. NICKERSON, 1999  Sequence variation in the human angiotensin converting enzyme. Nat. Genet. 22:59-62[Medline].

SAIKI, R. K., D. H. GELFAND, S. STOFFEL, S. J. SCHARF, and R. HIGUCHI et al., 1988  Primer-directed enzymatic amplification of DNA with a thermostable DNA polymerase. Science 239:487-491[Abstract/Free Full Text].

STEPHAN, W., 1994 Effects of genetic recombination and population subdivision on nucleotide sequence variation in Drosophila ananassae, pp. 57–66 in Non-neutral Evolution: Theories and Molecular Data, edited by B. GOLDING. Chapman & Hall, New York.

STEPHAN, W. and C. H. LANGLEY, 1998  DNA polymorphism in Lycopersicon and crossing-over per physical length. Genetics 150:1585-1593[Abstract/Free Full Text].

STEPHAN, W., L. XING, D. A. KIRBY, and J. M. BRAVERMAN, 1998  A test of the background selection hypothesis based on nucleotide data from Drosophila ananassae.. Proc. Natl. Acad. Sci. USA 95:5649-5654[Abstract/Free Full Text].

TAJIMA, F., 1989  Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 123:585-595[Abstract/Free Full Text].

WANG, R.-L., A. STEC, J. HEY, L. LUKENS, and J. DOEBLEY, 1999  The limits of selection during maize domestication. Nature 398:236-239[Medline].

WATERSON, G. A., 1975  On the number of segregating sites in genetical models without recombination. Theor. Popul. Biol. 7:256-276[Medline].

ZIETKIEWICZ, E., V. YOTOVA, M. JARNIK, M. KORAB-LASKOWSKA, and K. K. KIDD et al., 1997  Nuclear DNA diversity in worldwide distributed human populations. Gene 205:161-171[Medline].

ZIETKIEWICZ, E., V. YOTOVA, M. JARNIK, M. KORAB-LASKOWSKA, and K. K. KIDD et al., 1998  Genetic structure of the ancestral population of modern humans. J. Mol. Evol. 47:146-155[Medline].




This article has been cited by other articles:


Home page
Mol Biol EvolHome page
A. R. Rogers, S. Wooding, C. D. Huff, M. A. Batzer, and L. B. Jorde
Ancestral Alleles and Population Origins: Inferences Depend on Mutation Rate
Mol. Biol. Evol., April 1, 2007; 24(4): 990 - 997.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
M. K. Shimada, K. Panchapakesan, S. A. Tishkoff, A. Q. Nato Jr, and J. Hey
Divergent Haplotypes and Human History as Revealed in a Worldwide Survey of X-Linked DNA Sequence Variation
Mol. Biol. Evol., March 1, 2007; 24(3): 687 - 698.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
N. Elango, J. W. Thomas, NISC Comparative Sequencing Program, and S. V. Yi
Variable molecular clocks in hominoids
PNAS, January 31, 2006; 103(5): 1370 - 1375.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
E. Santiago and A. Caballero
Variation After a Selective Sweep in a Subdivided Population
Genetics, January 1, 2005; 169(1): 475 - 483.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
C. D. Meiklejohn, Y. Kim, D. L. Hartl, and J. Parsch
Identification of a Locus Under Complex Positive Selection in Drosophila simulans by Haplotype Mapping and Composite-Likelihood Estimation
Genetics, September 1, 2004; 168(1): 265 - 279.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
J. F. Storz, B. A. Payseur, and M. W. Nachman
Genome Scans of DNA Variability in Humans Reveal Evidence for Selective Sweeps Outside of Africa
Mol. Biol. Evol., September 1, 2004; 21(9): 1800 - 1811.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
M. F. Hammer, D. Garrigan, E. Wood, J. A. Wilder, Z. Mobasher, A. Bigham, J. G. Krenz, and M. W. Nachman
Heterogeneous Patterns of Variation Among Multiple Human X-Linked Loci: The Possible Role of Diversity-Reducing Selection in Non-Africans
Genetics, August 1, 2004; 167(4): 1841 - 1853.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
M. W. Nachman, S. L. D'Agostino, C. R. Tillquist, Z. Mobasher, and M. F. Hammer
Nucleotide Variation at Msn and Alas2, Two Genes Flanking the Centromere of the X Chromosome in Humans
Genetics, May 1, 2004; 167(1): 423 - 437.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
G. Zhou, Y. Zhai, X. Dong, X. Zhang, F. He, K. Zhou, Y. Zhu, H. Wei, Z. Yao, S. Zhong, et al.
Haplotype Structure and Evidence for Positive Selection at the Human IL13 Locus
Mol. Biol. Evol., January 1, 2004; 21(1): 29 - 36.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
M. F. Hammer, F. Blackmer, D. Garrigan, M. W. Nachman, and J. A. Wilder
Human Population Structure and Its Effects on Sampling Y Chromosome Sequence Variation
Genetics, August 1, 2003; 164(4): 1495 - 1509.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
T. Kitano, C. Schwarz, B. Nickel, and S. Paabo
Gene Diversity Patterns at 10 X-Chromosomal Loci in Humans and Chimpanzees
Mol. Biol. Evol., August 1, 2003; 20(8): 1281 - 1289.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
M. A. Saunders, M. F. Hammer, and M. W. Nachman
Nucleotide Variability at G6pd and the Signature of Malarial Selection in Humans
Genetics, December 1, 2002; 162(4): 1849 - 1861.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
N. Yu, F.-C. Chen, S. Ota, L. B. Jorde, P. Pamilo, L. Patthy, M. Ramsay, T. Jenkins, S.-K. Shyue, and W.-H. Li
Larger Genetic Differences Within Africans Than Between Africans and Eurasians
Genetics, May 1, 2002; 161(1): 269 - 274.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
M. Przeworski
The Signature of Positive Selection at Randomly Chosen Loci
Genetics, March 1, 2002; 160(3): 1179 - 1189.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
B. A. Payseur and M. W. Nachman
Gene Density and Human Nucleotide Polymorphism
Mol. Biol. Evol., March 1, 2002; 19(3): 336 - 340.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
Y. Kim and W. Stephan
Detecting a Local Signature of Genetic Hitchhiking Along a Recombining Chromosome
Genetics, February 1, 2002; 160(2): 765 - 777.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
Y. Gilad, S. Rosenberg, M. Przeworski, D. Lancet, and K. Skorecki
Evidence for positive selection and population structure at the human MAO-A gene
PNAS, January 22, 2002; 99(2): 862 - 867.
[Abstract] [Full Text] [PDF]


Home page
Hum Mol GenetHome page
L.B. Jorde, W.S. Watkins, and M.J. Bamshad
Population genomics: a bridge from evolutionary history to genetic medicine
Hum. Mol. Genet., October 1, 2001; 10(20): 2199 - 2207.
[Abstract] [Full Text] [PDF]


Home page
J. Immunol.Home page
E. J. Adams, S. Cooper, and P. Parham
A Novel, Nonclassical MHC Class I Molecule Specific to the Common Chimpanzee
J. Immunol., October 1, 2001; 167(7): 3858 - 3869.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
I. Smirnova, M. T. Hamblin, C. McBride, B. Beutler, and A. Di Rienzo
Excess of Rare Amino Acid Polymorphisms in the Toll-like Receptor 4 in Humans
Genetics, August 1, 2001; 158(4): 1657 - 1664.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
P. Andolfatto and M. Przeworski
Regions of Lower Crossing Over Harbor More Rare Variants in African Populations of Drosophila melanogaster
Genetics, June 1, 2001; 158(2): 657 - 665.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
A. R. Rogers
Order emerging from chaos in human evolutionary genetics
PNAS, January 30, 2001; 98(3): 779 - 780.
[Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
S. Alonso and J. A. L. Armour
A highly variable segment of human subterminal 16p reveals a history of population growth for modern humans outside Africa
PNAS, December 14, 2000; (2000) 11244998.
[Abstract] [Full Text]


Home page
Proc. Natl. Acad. Sci. USAHome page
S. Alonso and J. A. L. Armour
A highly variable segment of human subterminal 16p reveals a history of population growth for modern humans outside Africa
PNAS, January 30, 2001; 98(3): 864 - 869.
[Abstract] [Full Text] [PDF]