| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
,1




* Genomic Analysis and Technology Core, Division of Biotechnology, University of Arizona, Tucson, Arizona 85721
Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721
1 Corresponding author: Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721.
E-mail: mfh{at}u.arizona.edu
| ABSTRACT |
|---|
|
|
|---|
Considerable work over the past decade has documented DNA sequence variation in humans. Early studies focused primarily on mitochondrial DNA (VIGILANT et al. 1991) and the Y chromosome (HAMMER 1995; WHITFIELD et al. 1995), while more recent single-locus studies have focused on the X chromosome (NACHMAN et al. 1998; HARRIS and HEY 1999; KAESSMANN et al. 1999; NACHMAN and CROWELL 2000; GILAD et al. 2002; SAUNDERS et al. 2002; YU et al. 2002b) and on the autosomes (reviewed in PRZEWORSKI et al. 2000; EXCOFFIER 2002). Two major features to emerge from this body of work are (1) substantial heterogeneity among genes in overall patterns of variation, including differences in the level of nucleotide diversity, the amount of linkage disequilibrium, and the distribution of allele frequencies, and (2) clear differences in levels and patterns of variation among populations. For example, there is mounting evidence that African populations have more genetic variation (VIGILANT et al. 1991; TISHKOFF et al. 1996; PRZEWORSKI et al. 2000; HAMMER et al. 2001), harbor more rare alleles (WALL and PRZEWORSKI 2000), and have lower levels of linkage disequilibrium (REICH et al. 2001) than non-African populations.
These genetic patterns have led to contrasting inferences of human demographic history. Results from loci with an excess of rare polymorphisms (i.e., with large negative Tajima's D values; TAJIMA 1989) have been used to support models in which humans expanded dramatically from small initial size (HARPENDING and ROGERS 2000; SHEN et al. 2000; WOODING and ROGERS 2000; ALONSO and ARMOUR 2001; ROGERS 2001). On the other hand, many nuclear loci have positive Tajima's D values so they do not provide evidence of population growth (HARDING et al. 1997; HEY 1997; ZIETKIEWICZ et al. 1997; PRZEWORSKI et al. 2000). Using data from the 12 nuclear loci then available, WALL and PRZEWORSKI (2000) tested several simple models of population growth and found that the different patterns among loci were not compatible with any of their models. This led to the suggestion that various forms of selection have influenced a subset of loci (WALL and PRZEWORSKI 2000; EXCOFFIER 2002). It is also becoming clear that more complex models of human demography must be considered, such as those incorporating geographic structure and changes in population size (PLUZHNIKOV et al. 2002; PTAK and PRZEWORSKI 2002).
One of the major challenges for interpreting the contrasting patterns observed among human loci comes from the sampling strategies used by different investigators. Studies of nuclear sequence variation vary greatly with regard to the scheme for sampling populations (from population-based to global "grid" sampling), the type of genomic regions studied (from coding to noncoding), and the molecular methods of variation detection employed (PRZEWORSKI et al. 2000; PTAK and PRZEWORSKI 2002). This diverse array of strategies has made it difficult to compare results across studies. Recently, several surveys have sampled DNA sequences from multiple loci in a common set of individuals (FRISSE et al. 2001; HARRIS and HEY 2001; STEPHENS et al. 2001; YU et al. 2002a,b; CARLSON et al. 2003). These studies have typically focused on multiple autosomal loci from either noncoding regions exclusively (FRISSE et al. 2001) or regions encompassing exons (STEPHENS et al. 2001). Studies of the X chromosome have typically focused on coding regions and/or only a few loci (HARRIS and HEY 1999, 2001; STEPHENS et al. 2001; KITANO et al. 2003). These studies also vary in the way humans are sampled, ranging from panels of individuals from the United States (STEPHENS et al. 2001), to panels containing many globally dispersed samples (HARRIS and HEY 1999, 2001), to panels with multiple individuals from a limited number of human populations (FRISSE et al. 2001).
Here we build on a continuing study of noncoding regions on the X chromosome (Figure 1) in a panel of 41 globally sampled humans representing African and non-African populations by examining patterns of variation at four loci (APXL, AMELX, TNFSF5, and RRM2P4) and comparing these patterns with those previously reported at DMD (i.e., introns 7 and 44; NACHMAN and CROWELL 2000), G6PD and L1CAM (SAUNDERS et al. 2002), and MSN and ALAS2 (NACHMAN et al. 2004). We also include comparisons with patterns of noncoding variation seen at five additional X-linked loci that were sequenced in similar global panels: PDHA1 (HARRIS and HEY 1999), Xq13.3 (KAESSMANN et al. 1999), FIX (HARRIS and HEY 2001), MAO-A (GILAD et al. 2002), and Xq21.3 (YU et al. 2002b). Because we studied X-linked loci only, we were able to avoid some of the complications that arise when comparing loci with different modes of inheritance and effective population sizes, such as those associated with the Y chromosome, autosomes, or the mitochondrial genome (FAY and WU 1999; PRZEWORSKI et al. 2000; HELLMANN et al. 2003). Our results indicate that, despite a common sampling strategy, there is still substantial heterogeneity in patterns of variation among loci on the human X chromosome. This degree of heterogeneity does not appear to be compatible with a simple demographic model and may reflect the effects of recent diversity-reducing selection acting on a subset of loci.
|
|
| SUBJECTS AND METHODS |
|---|
|
|
|---|
Choice of loci:
We chose to sequence APXL (apical protein-like Xenopus laevis), AMELX (amelogenin, X-linked), RRM2P4 (ribonucleotide reductase M2 polypeptide pseudogene 4), and TNFSF5 (tumor necrosis factor ligand superfamily, member 5) because they map to telomeric regions with moderate to high rates of recombination. These loci complement our existing database of six other genes (sequenced in the same global panel) mapping to X chromosome regions with a range of recombination rates (Figure 1): DMD intron 7, DMD intron 44 (NACHMAN and CROWELL 2000), G6PD and L1CAM (SAUNDERS et al. 2002), and MSN and ALAS2 (NACHMAN et al. 2004). Approximately 5 kb of intron from each gene was sequenced. With the exception of G6PD in Africa (SAUNDERS et al. 2002), none of these loci was a priori believed to be influenced by selective forces. In addition, five published X chromosome DNA sequence data sets were used for comparisons with the 10 loci examined in our global panel. These loci included PDHA1 (HARRIS and HEY 1999), FIX (HARRIS and HEY 2001), Xq13.3 (KAESSMANN et al. 1999; referred to here as P2Y10), Xq21.1 (YU et al. 2002b; referred to here as DACH2), and MAO-A (GILAD et al. 2002). We did not include ZFX (JARUZELSKA et al. 1999) or DYS44 (ZIETKIEWICZ et al. 1997) because the polymorphism data were ascertained mainly by single-strand conformation polymorphism rather than by DNA sequencing.
PCR amplification and sequencing:
DNA was PCR amplified in 25-µl volumes with 40 cycles. Conditions for each of the fragments described below varied slightly and are available from the authors upon request. Amplification primers were designed from published sequences for APXL (AC002365), AMELX (AY040206), RRM2P4 (NG_000871; HSJ169P22), and TNFSF5 (NT_011786) and are available upon request. Internal primers (also available upon request) were used to generate overlapping sequence runs on an ABI3730 automated sequencer. Contiguous sequence that included coding and noncoding regions (4885, 5331, 5240, and 2385 bp for APXL, AMELX, TNFSF5, and RRM2P4, respectively) was assembled for each individual and aligned using the computer program Sequencher (GeneCodes). Sequences have been submitted to GenBank under accession nos. AY694820, AY694987.
Data analysis:
Nucleotide diversity,
(NEI and LI 1979), WATTERSON's (1975) estimator of
, and FST (HUDSON et al. 1992) were calculated using the program DNAsp 3.99 (ROZAS and ROZAS 1999), excluding insertion-deletion polymorphisms. Under neutral equilibrium conditions both
and
estimate the neutral parameter 3Neµ for X-linked loci, where Ne is the effective population size and µ is the neutral mutation rate. To test for deviations from a neutral equilibrium frequency distribution, Tajima's D (TAJIMA 1989), Fu and Li's D with an outgroup (FU and LI 1993), and Fay and Wu's H (FAY and WU 2000) were also calculated using DNAsp 3.99 (P values were determined by 1000 replicates of Monte Carlo simulation of the coalescent process under a neutral panmictic model with no recombination). Ratios of polymorphism to divergence were compared with the expectations under a neutral model using a multilocus Hudson-Kreitman-Aguadé (HKA) test (HUDSON et al. 1987) with the software "HKA" (J. Hey; http://lifesci.rutgers.edu/heylab/). This program does not take account of intragenic recombination and therefore the resulting P values are slightly inflated (FRISSE et al. 2001). Divergence data were derived for each of these loci by estimating the net divergence (DA; NEI 1987) between homologous sequences from a chimpanzee and all 41 human sequences. The times to most recent common ancestors (TMRCAs) among sampled sequences were estimated by dividing Watterson's estimator of 3Neµ (WATTERSON 1975) by the locus-specific rates of neutral mutation, estimated from the interspecific divergence. We assumed a human-chimpanzee divergence of 6 million years and a 20-year human generation time. Female estimated recombination rates were taken from the University of California, Santa Cruz, web site (http://www.genome.ucsc.edu) using the July 2003 freeze of the Human Genome Project Working Draft. The recombination rates represent average rates for a window of 1 Mb around each locus estimated through a comparison of the sequence of the human X chromosome with the deCODE Genetics map (KONG et al. 2002), which is based on 5136 microsatellite markers in 146 families with a total of 1257 meioses.
| RESULTS |
|---|
|
|
|---|
|
Comparisons with patterns of variation at other X-linked loci:
Summary statistics for the 10 loci sampled in the same panel of 41 individuals are compared with those from five additional X-linked loci surveyed in global samples in Table 2. Average global levels of variation at these five loci (average
= 0.00076 and
= 0.00068) are very similar to the 10 others in Table 2 (Mann-Whitney test, P = 0.668 and 0.951, respectively), as are summaries of the frequency spectra (average TD = 0.552, FLD = 1.401; Mann-Whitney test, P = 0.582 and 0.951, respectively). Heterogeneity among loci is also apparent: levels of variation at FIX, MAO-A, and P2Y10 are low, while those at DACH2 and PDHA1 are average and high, respectively. P2Y10, FIX, and DACH2 show an excess of rare and/or singleton polymorphisms (Table 2). In contrast, PDHA1 and MAO-A have positive TD values. As expected given the variation in levels of polymorphism, estimates of the TMRCA also vary considerably among all 15 loci in Table 2. For example, the TMRCA for FIX and MAO-A is <500 KY, while 6 loci have TMRCAs >1 MY.
KITANO et al. (2003) recently surveyed sequence variation at 10 X-linked genes that contain mutations known to cause mental retardation. Global patterns of intron variability within these genes are similar to those reported in Table 2, although the average global level of diversity at their 10 loci (
= 0.00051 and
= 0.00035) is
40% lower than that for the 15 loci in Table 2 (
= 0.00081 and
= 0.00061). Their sequences also had an
30% reduction in human-chimpanzee divergence (0.699%) relative to the 15 X-linked loci in Table 2 (1.01%) and a lower mean TMRCA (474 vs. 1004 KY, respectively), possibly reflecting higher levels of selective constraint on loci involved in human cognitive function (KITANO et al. 2003).
Nucleotide diversity and recombination rate:
Global nucleotide diversity (
) and recombination rate for the 10 loci sampled in the N = 41 panel are positively correlated (Pearson linear correlation, two-tailed t-test, R2 = 0.648, P = 0.005). When all 15 loci in Table 2 are considered, there is still a positive but weaker correlation (R2 = 0.315, P = 0.029; Figure 2A). This increased scatter in the larger sample may reflect variation in levels of diversity that is attributable to different sampling strategies. Similarly, in Drosophila melanogaster, the correlation between diversity and recombination rate observed in heterogeneous samples (BEGUN and AQUADRO 1992) is stronger when studied from multiple loci in a single sample (AQUADRO et al. 1994). When we consider the relationship of human-chimpanzee divergence levels to human recombination rates, there is no statistically significant relationship for either the set of 10 or the set of 15 loci in Table 2 (R2 = 0.073, P = 0.449; R2 = 0.082, P = 0.303, respectively), which may reflect the small number of loci investigated (Figure 2B).
|
= 0.00082 ± 0.00040,
= 0.00076 ± 0.00049) as in non-Africans (
= 0.00045 ± 0.00036,
= 0.00040 ± 0.00042). These differences are statistically significant (Mann-Whitney test, P = 0.002 and 0.004, respectively).
|
|
Under a model of population growth TD and FD are negatively correlated with sample size (PTAK and PRZEWORSKI 2002; HAMMER et al. 2003). To determine whether the more negative mean TD value in our non-African sample compared with our African sample was the result of a larger mean sample size (i.e., 32.3 vs. 12.3), we reanalyzed our non-African data by resampling each locus 100 times and making the sample size equal to the number of Africans sequenced at the locus. In other words, for the n = 41 data set, 10 non-Africans were resampled 100 times. Resampled data sets with no variation (i.e., S = 0) were thrown out and an additional resampling was performed. The FIX locus was not resampled because equal numbers of Africans and non-Africans were surveyed initially (HARRIS and HEY 2001). The mean TD value for the resampled non-African data set was still more negative (TD = 0.639) despite having an identical size as the African sample. This suggests that the more negative TD value in non-Africans compared with Africans is unrelated to sample size differences.
To explore the relationship between frequency spectra and recombination rate we plotted African or non-African FLD vs. recombination rate for each locus in Table 3. For Africans there is no relationship either for the set of 10 loci sampled in the same set of 41 individuals or for all 15 loci (R2 = 0.014, P = 0.747 and R2 = 0.014, P = 0.675, respectively; Figure 3A). In contrast, non-African FLD values exhibit a statistically significant positive correlation for both the 10 loci and 15 loci data sets (R2 = 0.520, P = 0.019 and R2 = 0.454, 0.006, respectively; Figure 3B). Similar trends were observed with TD; however, the non-African correlation was not statistically significant (P = 0.138). Interestingly, there is also a positive relationship between non-African FLD and nucleotide variability (
) for the set of 10 loci sampled in the same individuals (R2 = 0.678, P = 0.003), while no such relationship was observed for the African FLD values (R2 = 0.001, P = 0.936; data not shown). This relationship is only marginally statistically significant for the full set of 15 loci in non-Africans (R2 = 0.235, P = 0.067).
|
| DISCUSSION |
|---|
|
|
|---|
Patterns of heterogeneity seen among loci are different between African and non-African samples. There was a wide range of FST values, with two loci (PDHA1 and MSN) exhibiting some of the highest known levels of differentiation among populations and others with extremely low levels of differentiation (e.g., DACH2, FIX, and DMD44; ROMUALDI et al. 2002). As previously documented for autosomal, Y-linked, and mitochondrial loci (VIGILANT et al. 1991; PRZEWORSKI et al. 2000; SHEN et al. 2000; HAMMER et al. 2003), X-linked loci are more variable in sub-Saharan African populations than in non-African populations. However, the extent of the reduction in non-African diversity at X-linked loci appears to be greater than that observed on the autosomes. For example, the average non-African reduction in
for the 15 X chromosome loci in Table 3 is 45%, while the average non-African reduction in
on the autosomes is 30% (HALUSHKA et al. 1999; FRISSE et al. 2001; STEPHENS et al. 2001). We also note that there is substantial variability among X-linked loci in the degree of reduction in non-African variation, with some loci having <10% of African diversity (e.g., PDHA1, ALAS2, and L1CAM). Similar disparities in African and non-African autosomal levels of diversity have not been reported (PRZEWORSKI et al. 2000; ALONSO and ARMOUR 2001; FRISSE et al. 2001). Mounting evidence suggests that, for many loci, African populations contain more rare alleles than non-African populations (WALL and PRZEWORSKI 2000). We found that our African sample has TD values similar (i.e., slightly negative on average) to those reported in the literature for African populations (PRZEWORSKI et al. 2000). However, our non-African sample exhibits an unusual pattern, whereby the mean TD value is slightly more negative than the mean TD value in our African sample (0.512 and 0.401, respectively). The more negative average TD value for non-Africans held even after subsampling 10 non-Africans to control for differences in sample sizes between Africans and non-Africans. This is driven, in part, by sharply negative TD values at MSN and DMD7 and fewer loci with positive TD outside Africa (Table 3).
In summary, there is substantial heterogeneity in patterns of variation among loci on the X chromosome, even when sampled in the same set of individuals. Previous observations of heterogeneity among loci have been interpreted as evidence for selection. For example, WALL and PRZEWORSKI (2000) tested whether patterns of variation observed at a number of nuclear loci (including some of those examined here) were compatible with a variety of demographic models. They found that the low TD values at some loci (including DMD7 and P2Y10) and the high TD values at other loci (including PDHA1 and DMD44) together were not consistent with a model of constant size or with a model of constant size followed by exponential growth. Even after incorporating more complex demographic components (such as a bottleneck or geographic structure), none of their models could account for the patterns of variation seen at all loci. To explain these contrasting patterns, they suggested that selection influenced variation at several of the loci studied.
Here we consider our observations in light of two alternative demographic models incorporating selection put forward by WALL and PRZEWORSKI (2000). The first is a model with long-term population growth (HARPENDING et al. 1998) that is expected to lead to an excess of rare variants (i.e., negative TD) at all loci that are not subject to selection. Under this model, the negative TD seen at some loci reflects population growth while the positive TD values, or those close to zero, observed at other loci reflect the action of diversity-enhancing selection (HARPENDING and ROGERS 2000; WALL and PRZEWORSKI 2000; ROGERS 2001; EXCOFFIER 2002). The second model has constant population size (i.e., the onset of human population growth is too recent to leave a signature in the nuclear genome). Under this model, TD values near zero reflect constant population size while significantly negative TD values at other loci reflect the recent effects of directional selection.
Results from previous analyses of X-linked loci have been interpreted to support both models. NACHMAN and CROWELL (2000) sampled variation at two DMD introns and showed that DMD7 has much lower levels of nucleotide diversity, many more rare polymorphisms, higher levels of linkage disequilibrium, and different African vs. non-African patterns, compared with DMD44. They suggested that patterns of variation at DMD44 are consistent with a neutral equilibrium model of molecular evolution and that those at DMD7 were shaped by recent directional selection (especially out of Africa). HARRIS and HEY (2001) compared patterns of variation at PDHA1 and FIX and posited that the much lower global nucleotide diversity and skew in the frequency distribution at FIX was the result of a history of positive directional selection, or background selection, acting at or near FIX. In an earlier report, HARRIS and HEY (1999) demonstrated that PDHA1 had an unusual pattern of sequence variation and suggested that this locus experienced some form of diversity-reducing selection outside of Africa. Similarly, NACHMAN et al. (2004) present evidence that MSN and ALAS2 have patterns of variation that reflect a history of diversity-reducing selection, with stronger effects outside of Africa.
In contrast, other authors reached very different conclusions on the basis of analyses of some of these very same loci, as well as others on the X chromosome (HARPENDING and ROGERS 2000; EXCOFFIER 2002). ROGERS (2001) took the opposite view of NACHMAN and CROWELL (2000) by suggesting that patterns of variation at DMD7 reflect demography (i.e., expansion of population size) while those at DMD44 reflect a long history of balancing selection. However, it is difficult to see how a long history of balancing selection could create the patterns of variation seen at DMD44 because there is little linkage disequilibrium among sites at DMD44. WOODING and ROGERS (2000) argued that even though Tajima's D test did not reject the null hypothesis of constant population size at the P2Y10 locus (KAESSMANN et al. 1999), the significantly negative Fu's Fs value at this locus does support a model of a Pleistocene population expansion. YU et al. (2002b) interpreted a significant Fu and Li's D test to indicate a population expansion signature at DACH2, despite a failure of Tajima's D and Fu's Fs tests to reject neutrality. They suggested that ancient population subdivision must be taken into account to interpret these tests properly. We note that when the African and non-African P2Y10 and DACH2 data sets are considered separately, an excess of rare and/or singleton alleles is found only in the African samples (Table 3). Therefore, these loci do not support the simplest model of population expansion.
Non-African levels of polymorphism reject a neutral, constant-size model by the conservative HKA test (Table 4), indicating that the variance in polymorphism among the 15 X-linked loci is too large outside of Africa. Is this large variance in polymorphism across loci primarily the product of either natural selection acting upon a subset of loci or a history of nonequilibrium demography out of Africa (e.g., a population bottleneck)? Although HUDSON et al. (1987) initially dismissed the possibility that a bottleneck systematically influences the HKA test, we have conducted coalescent simulations of intermediate strength bottlenecks that result in an increase in the variance of polymorphism across unlinked loci. These simulations were not intended to estimate bottleneck parameters from the data, but instead were used to examine the effects of a simple population bottleneck on the outcome of the HKA test. Similar to the bottleneck model of FAY and WU (1999), the model we examined assumes constant population size (N = 104) until 3000 generations ago, when a 40-fold bottleneck is imposed upon the population for 500 generations, after which the population reverts to its original size of 104 individuals (i.e., the bottleneck reduces the effective population size to 250 for 10,000 years). These bottleneck parameters were chosen to (1) produce the observed reduction in non-African diversity and (2) maximize the variance in diversity among loci (data not shown). Maximizing variance among loci produces a simulated null distribution of the HKA statistic that is likely to be conservative when assessing the impact of a bottleneck on the test. We also incorporated estimates of the population recombination rate at each locus (data not shown) in the 15 locus simulations, which were replicated 1000 times. We find that our observed non-African HKA test statistic is still significantly too high (P = 0.038) when compared with the null distribution generated by the conservative bottleneck model (Figure 4). This result is compelling evidence that a simple population bottleneck out of Africa is insufficient to account for the increased variance in polymorphism across loci, although more complex demographic models might account for these observations.
|
We also found a positive relationship between recombination rate and frequency spectra in our non-African sample, but not in our African sample (Figure 3). Such a relationship is not expected under a neutral equilibrium model (PRZEWORSKI et al. 2001). One explanation for this observation is that diversity-reducing selective forces (i.e., hitchhiking or background selection) have led to an excess over neutral expectations of singletons at loci in regions of lower recombination (CHARLESWORTH et al. 1993; BRAVERMAN et al. 1995). However, ANDOLFATTO and PRZEWORSKI (2001) demonstrated that a similar positive correlation between the summary of the frequency spectrum of polymorphic mutations (both TD and FLD) and the recombination rate in D. melanogaster, while expected under simple hitchhiking models, was unlikely under a model of background selection. Neither is there an expectation that diversity-enhancing selection would lead to a positive correlation between FLD and recombination rate. Moreover, the expected signature of long-term balancing selectiona peak of polymorphism surrounding a selected sitehas not been observed at loci with high levels of variation and positive TD or FLD values (WALL and PRZEWORSKI 2000). As mentioned above, the correlation between recombination rate and nucleotide diversity was also stronger in non-Africans compared with Africans. The combined data suggest that positive directional selection (i.e., hitchhiking) may be a more important factor influencing X chromosome variation outside Africa.
Finally, our data set included nine intronic regions within functional genes and a pseudogene, which may be less perturbed by selection than introns of genes. We chose the RRM2P4 pseudogene in particular because it maps to a region of high recombination and low gene density on Xq27.3 and thus should provide good estimates of neutral parameters. We found that levels and patterns of variation at this pseudogene are similar to those at other loci exhibiting high levels of variation (e.g., DMD44, DACH2, AMELX, APXL). This region has the third highest level of diversity, exhibits no skew in the frequency spectrum, and harbors similar levels of variation in African and non-African samples. This supports the hypothesis that similar patterns of variation at other high variation loci reflect neutral demographic processes.
If we accept that these five X-linked regions, as well as P2Y10 and TNFSF5, are relatively free from the influences of natural selection, then what can we discern about human demography from patterns of variation at these loci? There is only a minor reduction in non-African diversity (i.e.,
20%), a slightly negative TD in Africa, and a TD close to zero out of Africa. These data do not provide evidence for long-term population growth outside Africa. Rather, they are consistent with a larger effective population size for Africans and the possibility that non-Africans experienced a phase of population size reduction during which rare variants were lost more quickly than common variants (ZIETKIEWICZ et al. 1997; PRZEWORSKI et al. 2000). While it is possible that both diversity-reducing selection and population expansion have left signatures on X-linked loci, it is difficult to explain the heterogeneous patterns observed here by a simple model of population expansion without selection. We note that after removing the five loci showing low variation from our analyses (ALAS2, MSN, DMD7, L1CAM, and FIX) there is still considerable heterogeneity among loci, especially in the non-African samples. More realistic models of human demography might include more complex patterns of subdivision and population size changes (PLUZHNIKOV et al. 2002), changing migration rates over time (WAKELEY 1999) and/or low levels of admixture with archaic Homo. Finally, the unexpected finding of several X-linked loci with a putative signature of selection (PRZEWORSKI 2002) is consistent with the possibility that the colonization of novel environments by modern humans as they migrated out of Africa in the last
50,000 years may have coincided with a burst of adaptive evolution (PAYSEUR et al. 2002; KAYSER et al. 2003; MISHMAR et al. 2003).
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
| LITERATURE CITED |
|---|
|
|
|---|
ALONSO, S., and J. A. ARMOUR, 2001 A highly variable segment of human subterminal 16p reveals a history of population growth for modern humans outside Africa. Proc. Natl. Acad. Sci. USA 98: 864869.
ALTSHULER, D., V. J. POLLARA, C. R. COWLES, W. J. VAN ETTEN, J. BALDWIN et al., 2000 An SNP map of the human genome generated by reduced representation shotgun sequencing. Nature 407: 513516.[CrossRef][Medline]
ANDOLFATTO, P., and M. PRZEWORSKI, 2001 Regions of lower crossing over harbor more rare variants in African populations of Drosophila melanogaster. Genetics 158: 657665.
AQUADRO, C. F., D. J. BEGUN and E. C. KINDAHL, 1994 Selection, recombination, and DNA polymorphism in Drosophila, pp. 4655 in Non-neutral Evolution: Theories and Molecular Data, edited by B. GOLDING. Chapman & Hall, New York.
AVERY, P. J., 1984 The population genetics of haplo-diploids and X-linked genes. Genet. Res. 44: 321341.
BEGUN, D. J., and C. F. AQUADRO, 1992 Levels of naturally occurring DNA polymorphism correlate with recombination rates in D. melanogaster. Nature 356: 519520.[CrossRef][Medline]
BEGUN, D. J., and P. WHITLEY, 2000 Reduced X-linked nucleotide polymorphism in Drosophila simulans. Proc. Natl. Acad. Sci. USA 97: 59605965.
BRAVERMAN, J. M., R. R. HUDSON, N. L. KAPLAN, C. H. LANGLEY and W. STEPHAN, 1995 The hitchhiking effect on the site frequency spectrum of DNA polymorphisms. Genetics 140: 783796.[Abstract]
CARLSON, C. S., M. A. EBERLE, M. J. RIEDER, J. D. SMITH, L. KRUGLYAK et al., 2003 Additional SNPs and linkage-disequilibrium analyses are necessary for whole-genome association studies in humans. Nat. Genet. 33: 518521.[CrossRef][Medline]
CHARLESWORTH, B., J. COYNE and N. BARTON, 1987 The relative rates of evolution of sex chromosomes and autosomes. Am. Nat. 130: 113146.[CrossRef]
CHARLESWORTH, B., M. T. MORGAN and D. CHARLESWORTH, 1993 The effect of deleterious mutations on neutral molecular variation. Genetics 134: 12891303.[Abstract]
EXCOFFIER, L., 2002 Human demographic history: refining the recent African origin model. Curr. Opin. Genet. Dev. 12: 675682.[CrossRef][Medline]
FAY, J. C., and C.-I WU, 1999 A human population bottleneck can account for the discordance between patterns of mitochondrial versus nuclear DNA variation. Mol. Biol. Evol. 16: 10031005.[Medline]
FAY, J. C., and C.-I WU, 2000 Hitchhiking under positive Darwinian selection. Genetics 155: 14051413.
FEARNHEAD, P., and P. DONNELLY, 2001 Estimating recombination rates from population genetic data. Genetics 159: 12991318.
FRISSE, L., R. R. HUDSON, A. BARTOSZEWICZ, J. D. WALL, J. DONFACK et al., 2001 Gene conversion and different population histories may explain the contrast between polymorphism and linkage disequilibrium levels. Am. J. Hum. Genet. 69: 831843.[CrossRef][Medline]
FU, Y. X., and W.-H. LI, 1993 Statistical tests of neutrality of mutations. Genetics 133: 693709.[Abstract]
GILAD, Y., S. ROSENBERG, M. PRZEWORSKI, D. LANCET and K. SKORECKI, 2002 Evidence for positive selection and population structure at the human MAO-A gene. Proc. Natl. Acad. Sci. USA 99: 862867.
HALUSHKA, M. K., J. B. FAN, K. BENTLEY, L. HSIE, N. SHEN et al., 1999 Patterns of single-nucleotide polymorphisms in candidate genes for blood-pressure homeostasis. Nat. Genet. 22: 239247.[CrossRef][Medline]
HAMMER, M. F., 1995 A recent common ancestry for human Y chromosomes. Nature 378: 376378.[CrossRef][Medline]
HAMMER, M. F., T. M. KARAFET, A. J. REDD, H. JARJANAZI, S. SANTACHIARA-BENERECETTI et al., 2001 Hierarchical patterns of global human Y-chromosome diversity. Mol. Biol. Evol. 18: 11891203.
HAMMER, M. F., F. BLACKMER, D. GARRIGAN, M. W. NACHMAN and J. A. WILDER, 2003 Human population structure and its effects on sampling Y chromosome sequence variation. Genetics 164: 14951509.
HARDING, R. M., S. M. FULLERTON, R. C. GRIFFITHS, J. BOND, M. J. COX et al., 1997 Archaic African and Asian lineages in the genetic ancestry of modern humans. Am. J. Hum. Genet. 60: 772789.[Medline]
HARDISON, R. C., K. M. ROSKIN, S. YANG, M. DIEKHANS, W. J. KENT et al., 2003 Covariation in frequencies of substitution, deletion, transposition, and recombination during eutherian evolution. Genome Res. 13: 1326.
HARPENDING, H., and A. ROGERS, 2000 Genetic perspectives on human origins and differentiation. Annu. Rev. Genomics Hum. Genet. 1: 361385.[CrossRef][Medline]
HARPENDING, H. C., M. A. BATZER, M. GURVEN, L. B. JORDE, A. R. ROGERS et al., 1998 Genetic traces of ancient demography. Proc. Natl. Acad. Sci. USA 95: 19611967.
HARRIS, E. E., and J. HEY, 1999 X chromosome evidence for ancient human histories. Proc. Natl. Acad. Sci. USA 96: 33203324.
HARRIS, E. E., and J. HEY, 2001 Human populations show reduced DNA sequence variation at the factor IX locus. Curr. Biol. 11: 774778.[CrossRef][Medline]
HELLMANN, I., I. EBERSBERGER, S. E. PTAK, S. PAABO and M. PRZEWORSKI, 2003 A neutral explanation for the correlation of diversity with recombination rates in humans. Am. J. Hum. Genet. 72: 15271535.[CrossRef][Medline]
HEY, J., 1997 Mitochondrial and nuclear genes present conflicting portraits of human origins. Mol. Biol. Evol. 14: 166172.[Abstract]
HUDSON, R. R., M. KREITMAN and M. AGUADé, 1987 A test of neutral molecular evolution based on nucleotide data. Genetics 116: 153159.
HUDSON, R. R., M. SLATKIN and W. P. MADDISON, 1992 Estimation of levels of gene flow from DNA sequence data. Genetics 132: 583589.[Abstract]
JARUZELSKA, J., E. ZIETKIEWICZ, M. BATZER, D. E. COLE, J. P. MOISAN et al., 1999 Spatial and temporal distribution of the neutral polymorphisms in the last ZFX intron: analysis of the haplotype structure and genealogy. Genetics 152: 10911101.
KAESSMANN, H., F. HEISSIG, A. VON HAESELER and S. PAABO, 1999 DNA sequence variation in a non-coding region of low recombination on the human X chromosome. Nat. Genet. 22: 7881.[CrossRef][Medline]
KAYSER, M., S. BRAUER and M. STONEKING, 2003 A genome scan to detect candidate regions influenced by local natural selection in human populations. Mol. Biol. Evol. 20: 893900.
KITANO, T., C. SCHWARZ, B. NICKEL and S. PAABO, 2003 Gene diversity patterns at 10 X-chromosomal loci in humans and chimpanzees. Mol. Biol. Evol. 20: 12811289.
KONG, A., D. F. GUDBJARTSSON, J. SAINZ, G. M. JONSDOTTIR, S. A. GUDJONSSON et al., 2002 A high-resolution recombination map of the human genome. Nat. Genet. 31: 241247.[CrossRef][Medline]
LERCHER, M. J., and L. D. HURST, 2002 Human SNP variability and mutation rate are higher in regions of high recombination. Trends Genet. 18: 337340.[CrossRef][Medline]
LI, W. H., and L. A. SADLER, 1991 Low nucleotide diversity in man. Genetics 129: 513523.[Abstract]
MAYNARD SMITH, J., and J. HAIGH, 1974 The hitch-hiking effect of a favourable gene. Genet. Res. 23: 2335.[Medline]
MISHMAR, D., E. RUIZ-PESINI, P. GOLIK, V. MACAULAY, A. G. CLARK et al., 2003 Natural selection shaped regional mtDNA variation in humans. Proc. Natl. Acad. Sci. USA 100: 171176.
NACHMAN, M. W., and S. L. CROWELL, 2000 Contrasting evolutionary histories of two introns of the Duchenne muscular dystrophy gene, Dmd, in humans. Genetics 155: 18551864.
NACHMAN, M. W., V. L. BAUER, S. L. CROWELL and C. F. AQUADRO, 1998 DNA variability and recombination rates at X-linked loci in humans. Genetics 150: 11331141.
NACHMAN, M. W., S. L. D'AGOSTINO, C. R. TILLQUIST, Z. MOBASHER and M. F. HAMMER, 2004 Nucleotide variation at Msn and Alas2, two genes flanking the centromere of the X chromosome in humans. Genetics 167: 423437.
NEI, M., 1987 Molecular Evolutionary Genetics. Columbia University Press, New York.
NEI, M., and W. H. LI, 1979 Mathematical model for studying genetic variation in terms of restriction endonucleases. Proc. Natl. Acad. Sci. USA 76: 52695273.
PAYSEUR, B. A., A. D. CUTTER and M. W. NACHMAN, 2002 Searching for evidence of positive selection in the human genome using patterns of microsatellite variability. Mol. Biol. Evol. 19: 11431153.
PLUZHNIKOV, A., A. DI RIENZO and R. R. HUDSON, 2002 Inferences about human demography based on multilocus analyses of noncoding sequences. Genetics 161: 12091218.
PRZEWORSKI, M., 2002 The signature of positive selection at randomly chosen loci. Genetics 160: 11791189.
PRZEWORSKI, M., R. R. HUDSON and A. DI RIENZO, 2000 Adjusting the focus on human variation. Trends Genet. 16: 296302.[CrossRef][Medline]
PRZEWORSKI, M., J. D. WALL and P. ANDOLFATTO, 2001 Recombination and the frequency spectrum in Drosophila melanogaster and Drosophila simulans. Mol. Biol. Evol. 18: 291298.
PTAK, S. E., and M. PRZEWORSKI, 2002 Evidence for population growth in humans is confounded by fine-scale population structure. Trends Genet. 18: 559563.[CrossRef][Medline]
REICH, D. E., M. CARGILL, S. BOLK, J. IRELAND, P. C. SABETI et al., 2001 Linkage disequilibrium in the human genome. Nature 411: 199204.[CrossRef][Medline]
ROGERS, A. R., 2001 Order emerging from chaos in human evolutionary genetics. Proc. Natl. Acad. Sci. USA 98: 779780.
ROMUALDI, C., D. BALDING, I. S. NASIDZE, G. RISCH, M. ROBICHAUX et al., 2002 Patterns of human diversity, within and among continents, inferred from biallelic DNA polymorphisms. Genome Res. 12: 602612.
ROZAS, J., and R. ROZAS, 1999 DnaSP version 3: an integrated program for molecular population genetics and molecular evolution analysis. Bioinformatics 15: 174175.
SAUNDERS, M. A., M. F. HAMMER and M. W. NACHMAN, 2002 Nucleotide variability at g6pd and the signature of malarial selection in humans. Genetics 162: 18491861.
SHEN, P., F. WANG, P. A. UNDERHILL, C. FRANCO, W. H. YANG et al., 2000 Population genetic implications from sequence variation in four Y chromosome genes. Proc. Natl. Acad. Sci. USA 97: 73547359.
STEPHENS, J. C., J. A. SCHNEIDER, D. A. TANGUAY, J. CHOI, T. ACHARYA et al., 2001 Haplotype variation and linkage disequilibrium in 313 human genes. Science 293: 489493.
TAJIMA, F., 1989 Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 123: 585595.
TISHKOFF, S. A., E. DIETZSCH, W. SPEED, A. J. PAKSTIS, J. R. KIDD et al., 1996 Global patterns of linkage disequilibrium at the CD4 locus and modern human origins. Science 271: 13801387.[Abstract]
VIGILANT, L., M. STONEKING, H. HARPENDING, K. HAWKES and A. C. WILSON, 1991 African populations and the evolution of human mitochondrial DNA. Science 253: 15031507.
WAKELEY, J., 1999 Nonequilibrium migration in human history. Genetics 153: 18631871.
WALL, J. D., and M. PRZEWORSKI, 2000 When did the human population size start increasing? Genetics 155: 18651874.
WATERSTON, R. H., K. LINDBLAD-TOH, E. BIRNEY, J. ROGERS, J. F. ABRIL et al., 2