| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Genetics, Vol. 174, 1421-1430, November 2006, Copyright © 2006
doi:10.1534/genetics.106.062588
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||


,2

* Department of Biology, York University, Toronto, Ontario M3J 1P3, Canada,
Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697 and
Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JT, United Kingdom
1 Corresponding author: Department of Biology, York University, 4700 Keele St., Toronto, ON M3J 1P3, Canada.
E-mail: stephenw{at}yorku.ca
| ABSTRACT |
|---|
|
|
|---|
There is consistent evidence for a strong correlation between recombination rate and DNA diversity in Drosophila melanogaster (BEGUN and AQUADRO 1992; ANDOLFATTO and PRZEWORSKI 2001), while no such effect is observed on divergence, suggesting the importance of natural selection in structuring genomewide patterns of polymorphism. However, in other organisms investigated, the evidence is less clear (NACHMAN et al. 1998; NACHMAN 2001; LERCHER and HURST 2002; CUTTER and PAYSEUR 2003; HELLMANN et al. 2003, 2005). In particular, recombination appears to be correlated with divergence as well as diversity in a number of taxa, making it difficult to uncouple the effects of mutation and natural selection.
In plants, the influence of recombination rate on DNA sequence diversity has been investigated in detail only in maize (TENAILLON et al. 2002), tomato (BAUDRY et al. 2001; ROSELIUS et al. 2005), and Arabidopsis thaliana (NORDBORG et al. 2005). In maize, no correlation was found with recombination rates estimated from recombination nodule density—only with estimates based on linkage disequilibrium (LD)—most likely reflecting effects of demographic history causing both increased linkage disequilibrium and reduced neutral genetic diversity (TENAILLON et al. 2002). However, using an extensive polymorphism data set from maize and teosinte, WRIGHT et al. (2005) showed that maize loci with a signature of positive selection tend to lie significantly closer to low-recombining centromeric regions than random loci do. The lack of any similar diversity reduction at these loci in the ancestral teosinte populations suggests that this pattern is a signature of selective sweeps during maize domestication, rather than differences in mutation rate.
In tomato, a consistent positive correlation was observed between recombination rate (estimated in Lycopersicon esculentum) and diversity in five species, but the relationship is weak and nonsignificant for several tests, suggesting that the action of linked selection may not have major effects on sequence diversity across the genome in these taxa (BAUDRY et al. 2001). Furthermore, no correlation with recombination was observed when silent polymorphism was scaled to silent divergence, suggesting that hitchhiking does not explain a significant amount of the variation in diversity in the tomato genome (ROSELIUS et al. 2005).
In A. thaliana, polymorphism is not significantly correlated with recombination rate (NORDBORG et al. 2005; SCHMID et al. 2005), but a significant negative correlation was found between gene density and nucleotide polymorphism (NORDBORG et al. 2005), consistent with stronger effects of selection in regions of high gene density. This is unlikely to be the result of mutagenic recombination; however, without divergence data from another species, neither heterogeneous mutation rates nor selective constraint on noncoding regions of the genome can be ruled out.
One possible explanation for the lack of a strong association between recombination and diversity is the effect of population subdivison and history on patterns of nucleotide variability. In structured populations, local adaptation may inflate between-population nucleotide variability (CHARLESWORTH et al. 1997). Hitchhiking, particularly background selection, will predominantly affect within-population diversity, rather than species-wide diversity. It can thus leave a signature of elevated estimates of between-population differentiation in regions of reduced recombination, without reducing diversity in species-wide samples. Furthermore, population admixture and population size changes may increase the variance in diversity across loci and may complicate the expected effects of genetic hitchhiking. Tests of the role of recombination on patterns of diversity should thus include polymorphism data from both within- and between-population samples from across a species' range (DAS et al. 2004; TAKAHASHI et al. 2004). Such data are rarely available.
To investigate the relationship between sequence diversity and recombination, we studied nucleotide diversity in 26 loci on two chromosomes in samples from six natural populations of the outcrossing species Arabidopsis lyrata. The two linkage groups correspond to chromosome 1 of A. thaliana, originating from a fusion event of the two ancestral chromosomes (HANSSON et al. 2006; LYSAK et al. 2006). A. lyrata is excellent for investigating the role of recombination and selection because (1) the genomic resources from the closely related A. thaliana, combined with comparative mapping between the species, have provided information on rates of recombination in both species; (2) divergence estimates are readily obtained from loci across the genome; and (3) A. lyrata populations exhibit strong population subdivision, yet extensive polymorphism is found within populations, contrasting with the selfing congener A. thaliana (SAVOLAINEN et al. 2000; WRIGHT et al. 2003b; RAMOS-ONSINS et al. 2004; BALANA-ALCAIDE et al. 2006).
| MATERIALS AND METHODS |
|---|
|
|
|---|
PCR and sequencing:
Table 1 shows the genomic location and size for the 26 loci surveyed in this study. With the exception of two of our five pericentromeric loci, we employed a direct sequencing strategy, using single large exons for PCR amplification and sequencing to minimize the chance of unreadable sequences caused by insertion/deletion (indel) variants. Exons from across chromosome 1 of A. thaliana were selected for direct sequencing in A. lyrata, using the annotation from the A. thaliana genome-sequencing project, and were chosen from genomic regions of contrasting recombination rates along chromosome I of A. thaliana (see Figure 1 and below). The exons selected for sequencing were all >800 bp, and, to ensure correct annotation, all encode genes with at least one full-length cDNA sequence. Each exon was submitted to a BLAST search (ALTSCHUL et al. 1997) of the genomic survey sequence database (http://www.ncbi.nlm.nih.gov/BLAST/) to check for the presence of orthologous regions in the shotgun genome sequence of Brassica oleracea. These orthologous regions were aligned to the A. thaliana genomic sequence to identify conserved regions for primer design. PCR primers were designed with the aid of PrimerQuest (Integrated DNA Technologies, http://biotools.idtdna.com/primerquest/) to amplify 650- to 750-bp fragments for sequencing using the same forward and reverse primers. Primers were also submitted to a BLAST search against the A. thaliana genome to ensure amplification of a single-copy region. To increase our sampling in regions expected to have reduced recombination, we designed primers for two additional pericentromeric loci, which included both coding and intron sites. The sequencing procedure for these two intron-containing loci was the same as for the other loci, except when two or more heterozygous indel variants were found. In such cases, the PCR products were cloned and sequences of at least two clones for both alleles were determined. A total of five to seven outbred individuals from the same populations were used for sequencing these latter two loci.
|
|
Recombination rate comparisons:
We tested the effect of recombination rate on nucleotide diversity using two approaches: (1) "regional comparisons" of diversity levels across large-scale chromosomal regions with contrasting recombination rates, comparing the surveyed pericentromeric region (suppressed recombination) with the chromosome arms ("normal" recombination), and (2) tests for correlations between diversity at each locus and local point estimates of recombination rates. Note that while the second test relies to some extent on the assumption of conserved physical distances between the two Arabidopsis species (see below), the first test assumes only suppressed recombination in the pericentromeric region in both species, which is supported by both linkage and cytological results. Since both background selection and selective sweeps are expected to strongly reduce diversity only when recombination rates are low (reviewed in INNAN and STEPHAN 2003), our regional comparison (normal vs. suppressed recombination near the centromere) should be ideal for detecting any such effects in A. lyrata.
Given the low marker density around the centromere of chromosome I (CEN1), we used mapping data from the fine-scale centromere map of chromosome I of A. thaliana (HAUPT et al. 2001) to select loci found in the region of suppressed recombination. On the basis of these results, our data set includes five loci from the region of suppressed recombination around CEN1; three loci (AT1G36310, AT1G36730, AT1G36730) are in a region with an estimated 50-fold reduction in recombination relative to the genome average, and two loci (AT1G42470, AT1G42990) are in a region with an estimated 10-fold reduction in recombination rates (HAUPT et al. 2001). Importantly, the five loci from the region of suppressed recombination surrounding the centromere in A. thaliana were completely linked in the A. lyrata mapping family, strongly suggesting that they lie in a conserved region of suppressed recombination (HANSSON et al. 2006). This is consistent with cytological observations showing the conserved position of the centromere between species (LYSAK et al. 2006).
To obtain point estimates of recombination for each locus, genetic and physical distances of markers from chromosome 1 of A. thaliana were fit to a third-order polynomial (WRIGHT et al. 2003a). In addition, we used the genetic map of A. lyrata (KUITTINEN and AGUADE 2000; KUITTINEN et al. 2004; HANSSON et al. 2006) to assess the similarity in recombination rates between species. Of our 26 genes, 20 have been genetically mapped in A. lyrata (HANSSON et al. 2006) and the results confirm the general genome collinearity for the regions under study, with two exceptions discussed below (HANSSON et al. 2006). Point estimates of recombination from A. lyrata mapping data were obtained in a similar way to A. thaliana, dividing the linkage groups into distinct regions, fitting third-order polynomials to the relationship between physical and genetic distance, and taking the first derivative of this polynomial at various physical positions. Note that our estimates assume that physical distances are similar between the species. Several genome size estimates have been made in A. lyrata and suggest a DNA content
1.5 times larger than that of A. thaliana (JOHNSTON et al. 2005). If the difference in size is distributed uniformly across the genome, estimates of relative recombination rates in different genome regions will be unaffected by the overall difference in DNA content. This must be tested in the future, but there is some evidence suggesting that the difference in genome size may be due to a general difference in the relative rates of insertion and deletion. In particular, intron lengths in A. lyrata are consistently higher than in A. thaliana, on the basis of samples including 50 genes, mostly from A. thaliana chromosomes I and IV (WRIGHT et al. 2002; A. KAWABE, unpublished results).
In the A. lyrata second linkage group, collinearity with chromosome 1 in A. thaliana is disrupted by an inversion in A. thaliana that encompasses four of our genes (Table 1). Furthermore, the location of the centromeric region for chromosome 2 is unknown, since this region was lost in the fusion that created the A. thaliana chromosome I (see Figure 4 of HANSSON et al. 2006). Evidence from chromosome painting and genetic mapping suggests a location close to the inversion breakpoint (LYSAK et al. 2006), bounded by the locus At1G59720 (HANSSON et al. 2006), and we therefore estimated recombination rates separately for loci within the inversion and outside it. Several A. lyrata loci mapping to the ends of the inversion and the chromosome ends were excluded from the correlation analysis due to poorly estimated recombination rates. Although the departures from synteny on chromosome 2 make recombination rate estimates less certain, our results and interpretation remain unchanged when the analyses are restricted to the loci on chromosome 1 (data not shown), where there is no evidence for disruption of synteny, and recombination rate estimates and map distances are highly conserved. Additional information on recombination rate estimates across species is presented as supplemental material at http://www.genetics.org/supplemental/.
Data analysis:
Average numbers of silent pairwise differences within A. lyrata (
), levels of differentiation between populations (FST), and Jukes–Cantor estimates of the number of silent (Ks) and nonsynonymous (Ka) substitutions per site between A. lyrata and A. thaliana were calculated using DNASp. version 4.0. To compare diversity levels across regions with contrasting recombination rates, we used a maximum-likelihood estimate of
= 4Neu, based on the numbers of segregating sites (see WRIGHT et al. 2002) and using the
95% credibility intervals assuming a chi-square distribution of likelihood values. Note that this method assumes no recombination, and therefore our credibility intervals are conservative, particularly in regions of intermediate and high recombination rates. We tested for differences in diversity that might be caused by selection with a maximum-likelihood (ML) extension of the Hudson–Kreitman–Aguadé (HKA) test (HUDSON et al. 1987). Using the MLHKA program (WRIGHT and CHARLESWORTH 2004), we tested for a diversity difference between the chromosome arm genes and the set of five genes in the regions near the centromere, using both polymorphism data and divergence from A. thaliana. Specifically, we tested a neutral null model in multi-locus tests, against an alternative allowing for the uncoupling of diversity and divergence (i.e., hitchhiking) on all centromeric loci. The alternative model estimates five independent parameters for the centromeric loci and thus has 5 d.f. This method assumes independent segregation of the loci, which is incorrect, because the centromere loci are partially linked. An alternative method, taking the centromeric region as a single locus in MLHKA with 1 d.f. for the selection model, yielded results and P-values similar to those presented.
Linkage disequilibrium among pairs of sites was calculated using WEIR's (1996) EM algorithm for estimating the squared correlation coefficient, r2, from unphased diploid genotype data, as implemented by MACDONALD et al. (2005). R code was kindly provided by S. Macdonald for r2 estimates.
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
= 4Neu at silent sites for each population. Levels of polymorphism are heterogeneous; the central European Plech, Germany, population has the highest diversity (maximum-likelihood estimate of
= 0.019 in regions of normal recombination), while the North American populations show an approximately fivefold reduction in diversity (Indiana
= 0.004 and Ontario
= 0.004), consistent with previous results suggesting a strong population bottleneck in North American populations (WRIGHT et al. 2003b; RAMOS-ONSINS et al. 2004). In comparison with central Europe, diversity values are slightly reduced in Sweden (
= 0.008), Russia (
= 0.009), and Iceland (
= 0.009), suggesting additional population bottlenecks associated with postglacial recolonization of northern Europe from an ancestral, central European source population (CLAUSS and MITCHELL-OLDS 2006; S. WRIGHT, J. P. FOXE, A. KAWABE, J. ROSS-IBARRA, L. DEROSE-WILSON, G. GOS, D. CHARLESWORTH and B. GAUT, unpublished results).
Levels of polymorphism and divergence in regions of normal vs. suppressed recombination
As explained above, our primary prediction is reduced nucleotide polymorphism in the region of clearly suppressed recombination, in comparison with regions of normal recombination (the regional test for hitchhiking). Figure 1 compares multi-locus likelihood estimates of
for the five loci from the pericentromeric region of suppressed recombination (complete linkage in the A. lyrata mapping family and an estimated 10- to 50-fold reduction in recombination compared with average rates in A. thaliana) with those for the remaining 22 loci, from chromosome arm regions (normal recombination; the average recombination rate in A. lyrata, estimated from the chromosome 1 and chromosome 2 data, is 4.8 cM/Mb). In contrast with expectation, there is no general reduction in polymorphism in the pericentromere region. The single exception is in the Karhumaki population from Russia; in this population, diversity for the pericentromeric loci is significantly reduced, and we found no segregating synonymous sites in any of the five loci in the reduced recombination region. This finding supports a suppression of recombination in this region; if recombination is infrequent and selection has acted, the loci will have correlated evolutionary histories. For these regions, there is complete collinearity (no inversion or fusion events) between A. lyrata and A. thaliana and the recombination rate estimates are nearly identical in the two species. Note that the failure to find a general reduction in diversity in this pericentromere is unlikely to be due to lack of power in our study; in many populations, polymorphism in these genes is higher in these regions than in chromosome arm genes, although not significantly.
Similarly, except for the Karhumaki population, map-based estimates of individual locus recombination rates are uncorrelated with silent polymorphism within populations or with total diversity (Table 2), or with differentiation between populations (FST, data not shown). For the Karhumaki sample, the silent-site diversity values are positively correlated with the recombination rate estimates from either species (one-tailed P = 0.006 with A. lyrata recombination estimates, P = 0.006 for A. thaliana estimates; Table 2), and the one-tailed test remains significant after a Bonferroni correction for six tests (P < 0.05). Excluding the pericentromeric regions, no population shows a correlation between recombination and diversity. The primary effect in Karhumaki is thus a difference in polymorphism levels between pericentromeric regions of highly suppressed recombination and the rest of the genome. These patterns hold when only data from the 15 loci from chromosome 1 are considered; recombination is significantly positively correlated with synonymous diversity in Karhumaki samples (Spearman's r = 0.734 with A. lyrata recombination estimates, P < 0.01; r = 0.692 with A. thaliana recombination estimates, P < 0.01), but not in any other population. We checked the PCR failure rate for genes in the pericentromeric region in Russia, in case the low diversity was an artifact caused by failure to amplify a very divergent haplotype. As there is no difference in failure rates from that for other genes (supplemental Table S2 at http://www.genetics.org/supplemental/), this explanation is ruled out.
|
s, is significantly correlated with silent-site divergence, Ks (Spearman's r = 0.4, P < 0.05), consistent with an effect of mutation rate variation on diversity levels. However, estimated recombination rates and Ks are uncorrelated in our data set (Table 2; Spearman's r = 0.02, P >> 0.05), so recombination rate variation does not explain the mutation rate heterogeneity. To further test for effects of hitchhiking in the pericentromere, controlling for divergence, we performed maximum-likelihood-ratio HKA tests for diversity differences using multi-locus polymorphism and divergence values (WRIGHT and CHARLESWORTH 2004). The Karhumaki population yielded significant support for diversity-reducing selection (selective sweeps) at pericentromeric loci (
2 = 40.0, d.f. = 5, P < 0.001, significant following a Bonferroni correction for multiple tests, P < 0.01). In contrast, all other populations showed no significant evidence for hitchhiking at pericentromeric loci (P >> 0.05). Although there is no general correlation between recombination and divergence, one of the most polymorphic loci, AT1G36310, lies in the pericentromeric region, and synonymous diversity and divergence are both high (supplemental Table S2 at http://www.genetics.org/supplemental/), suggesting a real difference in mutation rate. This locus is also unusually divergent between A. thaliana and B. oleracea (Ks = 0.82 average Ks between A. thaliana and Brassica spp., 0.474; TIFFIN and HAHN 2002). This isolated case of high divergence of a gene in a putatively low recombination region contrasts with results from other taxa, suggesting generally positive correlations between recombination and mutation rates (CUTTER and PAYSEUR 2003; HELLMANN et al. 2003). Locus AT1G36310, a methyltransferase, was also previously hypothesized to be subject to balancing selection on the basis of a smaller number of loci and a population sample from Iceland (WRIGHT and CHARLESWORTH 2004). This raises the possibility that balancing selection could obscure the expected positive correlation between recombination and diversity. Our previous inference of balancing selection acting on or near the highly polymorphic AT1G36310 locus was based on fewer reference loci than analyzed here and on a sample from Iceland only (WRIGHT and CHARLESWORTH 2004). Our new data set with additional loci gives no evidence for the action of balancing selection in any population, including Iceland, when MLHKA is used to test for selection specifically on this locus (P > 0.05 for all tests). Thus, neither mutation rate heterogeneity nor balancing selection can explain the variability observed in the region of suppressed recombination.
Levels of linkage disequilibrium along chromosome arms vs. pericentromere:
It is important to test whether the pericentromeric regions undergo recombination. If most recombination occurs within genes, our coding sequences in the pericentromeric regions may be hotspots of recombination. Even if recombination is suppressed across the region, local recombination within coding regions could then occur at a significant rate. A related issue is whether our estimates of diversity in the pericentromeric regions are nonindependent, due to low recombination rates. With our present sample we can analyze LD, although a detailed understanding of LD in our populations requires fuller analysis of coding and noncoding regions of more loci, particularly at an intermediate physical distance, taking into account population structure and formally considering the possibility of gene conversion at high rates in regions of low crossing over (e.g., LANGLEY et al. 2000). As expected, average LD (measured by r2) declines with physical distance in this region, and the decline is slower for the pericentromeric region genes than for those in the chromosome arms (Figure 2). This is consistent with a reduced rate of recombination in the former regions and at least partial independence of our observed levels of polymorphism for different loci in the pericentromeric region. In addition, the decline in LD is clearly considerably faster than that estimated in the highly selfing A. thaliana (NORDBORG et al. 2005).
|
However, it is possible that recent between-species introgression from related taxa could be causing a departure from the pattern expected under background selection. Preliminary results suggest some shared polymorphisms between A. lyrata and its close relative, Arabidopsis halleri (RAMOS-ONSINS et al. 2004). Thus, a low rate of introgression into A. lyrata following a period of isolation could elevate levels of diversity in regions of low recombination. While this possibility should be investigated in the future by surveying diversity at these loci in closely related taxa, it remains unclear whether the shared polymorphism is due to recent divergence or to ongoing migration, and we therefore also explore further the possibility that background selection is weak in A. lyrata.
The strength of background selection is a function of the deleterious mutation rate, the strength of purifying selection, and the rate of recombination. If deleterious mutation rates are unusually low in the pericentromeric regions, polymorphism might not be detectably reduced, despite suppression of recombination. One possibility is that the genes located in regions of low recombination are subject to little or no selective constraints. We should then see a high ratio of amino acid to synonymous divergence in regions of low recombination. However, the ratio of nonsynonymous to synonymous divergence correlates positively with recombination rates in our sample of genes (A. lyrata-based recombination rate estimates: Spearman's R = 0.553, P < 0.01; A. thaliana-based rates: R = 0.499, P < 0.05). Thus, either selective constraints are unusually high for our sample of pericentromeric genes or adaptive amino acid substitutions are more common in the genes from higher recombination regions.
These possibilities can be distinguished by comparing the ratio of amino acid replacements to synonymous variants among polymorphisms and fixed differences, using the MCDONALD-KREITMAN (1991) test. Compared with the other loci, the pericentromeric genes show many fewer nonsynonymous relative to synonymous differences among both polymorphisms and divergence (for the chromosome arm genes, we find a ratio of nonsynonymous to synonymous polymorphisms equal to 0.86 vs. 0.67 among fixations and, for the pericentromeric genes, the respective values are 0.1 and 0.07). Thus this global analysis gives no evidence for adaptive protein evolution in our chromosome arm genes (MCDONALD–KREITMAN test P > 0.05). Since both the pericentromeric and the chromosome arm genes have higher ratios of nonsynonymous to synonymous polymorphism than fixed differences, there is a trend suggesting weak purifying selection. Thus, while we cannot rule out positive selection contributing to amino acid substitution for some genes, the general picture suggests stronger selective constraints on our pericentromeric genes. This contrasts with results from Drosophila, where there is evidence for most adaptive protein evolution in regions of high recombination and accumulation of slightly deleterious amino acid polymorphisms in regions of reduced recombination (PRESGRAVES 2005).
Low gene density is another possibility that may cause hitchhiking to be weak in the pericentromeric region. There may simply be too few targets for purifying or positive selection for diversity to be generally reduced. Gene density is low in the pericentromeric regions of A. thaliana (HAUPT et al. 2001; WRIGHT et al. 2003a) and rice (NAGAKI et al. 2004), so the strength of background selection may be low in the highly outcrossing A. lyrata. Using gene expression data from full-length cDNAs (YAMADA et al. 2003), ESTs, whole-genome tiling arrays (YAMADA et al. 2003), and massively parallel signature sequencing (MEYERS et al. 2004), we estimate that there are only
40 genes in the region of suppressed recombination surrounding the centromere of chromosome 1 in A. thaliana, excluding transposable elements. Using the genome annotation, this totals
78 kb of coding sequence. Using the estimated number of deleterious amino acid mutations accumulated per site since the divergence of A. lyrata and A. thaliana (0.077; WRIGHT et al. 2002), this translates to a deleterious mutation rate of
0.0024 in this region per generation, assuming 5 million years since the common ancestor of A. thaliana and A. lyrata (KOCH et al. 2000) and an average generation time of 2 years during this period. Under background selection, the relative amount of diversity in a nonrecombining region follows the equation (CHARLESWORTH et al. 1993)
![]() |
To test for effects of hitchhiking acting in genes outside the pericentromeric regions, we investigated whether gene density per unit of genetic distance affects levels of genetic diversity, as was found in A. thaliana (NORDBORG et al. 2005). In contrast with the A. thaliana results, we observe no correlation between gene density (measured as genes/centimorgan on the basis of the A. thaliana genome sequence) and levels of silent nucleotide polymorphism for the loci tested (P >> 0.05, all populations). If this result is confirmed with additional loci, it will suggest that regions of high gene density per centimorgan in A. lyrata may also experience little hitchhiking. This would be consistent with A. lyrata's higher effective rates of recombination and historical effective population sizes (given its outcrossing mating system), compared with A. thaliana. Hitchhiking causes only slight effects on diversity unless recombination is infrequent, even in high gene density regions. Thus, highly selfing species should experience such effects over a much larger portion of the genome. Recent theoretical work suggests that weak selection against deleterious mutations has a stronger diversity-reducing effect than previously thought, but very large data sets would be required to detect this (B. CHARLESWORTH, personal communication).
Our results contrast with those in D. melanogaster, where reduced diversity has consistently been observed in all low crossing-over regions (reviewed in CHARLESWORTH 1996). The D. melanogaster fourth chromosome has numbers of genes comparable to the pericentromeric region surveyed here and has reduced diversity (JENSEN et al. 2002; WANG et al. 2004). It is possible that Drosophila experiences a higher deleterious mutation rate and/or a larger class of mutations subject to weak purifying selection in regions of low recombination, for example, through transposable element activity driving background selection (CHARLESWORTH 1996). Our results thus suggest either that A. lyrata lacks a large class of slightly deleterious mutations that are segregating and being selectively removed from populations or that D. melanogaster experiences more frequent positive selection.
If background selection is generally weak in A. lyrata, the reduced polymorphism in the Karhumaki population may reflect a local selective sweep associated with recent directional selection. Alternatively, however, a population bottleneck or other demographic situation may have increased the variance in diversity across loci in the Russian population, by chance reducing polymorphism close to the centromere. If recombination is infrequent in this region, the loci will have correlated demographic history and should all have low polymorphism if a recent diversity-reducing event affected the region. Data from genes in the pericentromeric regions of other centromeres, combined with additional polymorphism data from the pericentromere surveyed here, should help distinguish between a population bottleneck and a selected substitution in a gene near the centromere of chromosome 1.
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
| FOOTNOTES |
|---|
| LITERATURE CITED |
|---|
|
|
|---|
ALTSCHUL, S. F., T. L. MADDEN, A. A. SCHAFFER, J. ZHANG, Z. ZHANG et al., 1997 Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25: 3389–3402.
ANDOLFATTO, P., and M. PRZEWORSKI, 2001 Regions of lower crossing over harbor more rare variants in African populations of Drosophila melanogaster. Genetics 158: 657–665.
BALANA-ALCAIDE, D., S. E. RAMOA-ONSINS, Q. BOONE and M. AGUADE, 2006 Highly structured nucleotide variation within and among Arabidopsis lyrata populations at the FAH1 and DFR gene regions. Mol. Ecol. 15: 2059–2068.[CrossRef][Medline]
BARTON, N. H., 2000 Genetic hitchhiking. Philos. Trans. R. Soc. Lond. B Biol. Sci. 355: 1553–1562.[CrossRef][Medline]
BAUDRY, E., C. KERDELHUE, H. INNAN and W. STEPHAN, 2001 Species and recombination effects on DNA variability in the tomato genus. Genetics 158: 1725–1735.
BEGUN, D. J., and C. F. AQUADRO, 1992 Levels of naturally occurring DNA polymorphism correlate with recombination rates in D. melanogaster. Nature 356: 519–520.[CrossRef][Medline]
CHARLESWORTH, B., 1996 Background selection and patterns of genetic diversity in Drosophila melanogaster. Genet. Res. 68: 131–149.[Medline]
CHARLESWORTH, B., M. T. MORGAN and D. CHARLESWORTH, 1993 The effects of deleterious mutations on neutral molecular variation. Genetics 134: 1289–1303.[Abstract]
CHARLESWORTH, B., M. NORDBORG and D. CHARLESWORTH, 1997 The effects of local selection, balanced polymorphism and background selection on equilibrium patterns of genetic diversity in subdivided populations. Genet. Res. 70: 155–174.[CrossRef][Medline]
CLAUSS, M. J., and T. MITCHELL-OLDS, 2003 Population genetics of tandem trypsin inhibitor genes in Arabidopsis species with contrasting ecology and life history. Mol. Ecol. 12: 1287–1299.[CrossRef][Medline]
CLAUSS, M. J., and T. MITCHELL-OLDS, 2006 Population genetic structure of Arabidopsis lyrata in Europe. Mol. Ecol. 15: 2753–2766.[Medline]
CUTTER, A. D., and B. A. PAYSEUR, 2003 Selection at linked sites in the partial selfer Caenorhabditis elegans. Mol. Biol. Evol. 20: 665–673.
DAS, A., S. MOHANTY and W. STEPHAN, 2004 Inferring the population structure and demography of Drosophila ananassae from multilocus data. Genetics 168: 1975–1985.
HANSSON, B., A. KAWABE, S. PREUSS, H. KUITTINEN and D. CHARLESWORTH, 2006 Comparative gene mapping in Arabidopsis lyrata chromosomes 1 and 2 and the corresponding A. thaliana chromosome 1: recombination rates, rearrangements and centromere location. Genet. Res. 87: 75–85.[CrossRef][Medline]
HAUPT, W., T. C. FISCHER, S. WINDERL, P. FRANSZ and R. A. TORRES-RUIZ, 2001 The centromere1 (CEN1) region of Arabidopsis thaliana: architecture and functional impact of chromatin. Plant J. 27: 285–296.[CrossRef][Medline]
HELLMANN, I., I. EBERSBERGER, S. E. PTAK, S. PAABO and M. PRZEWORSKI, 2003 A neutral explanation for the correlation of diversity with recombination rates in humans. Am. J. Hum. Genet. 72: 1527–1535.[CrossRef][Medline]
HELLMANN, I., K. PRUFER, H. JI, M. C. ZODY, S. PAABO et al., 2005 Why do human diversity levels vary at a megabase scale? Genome Res. 15: 1222–1231.
HUDSON, R. R., and N. L. KAPLAN, 1995 Deleterious background selection with recombination. Genetics 141: 1605–1617.[Abstract]
HUDSON, R. R., M. KREITMAN and M. AGUADE, 1987 A test of neutral molecular evolution based on nucleotide data. Genetics 116: 153–159.
INNAN, H., and W. STEPHAN, 2003 Distinguishing the hitchhiking and background selection models. Genetics 165: 2307–2312.
JENSEN, M. A., B. CHARLESWORTH and M. KREITMAN, 2002 Patterns of genetic variation at a chromosome 4 locus of Drosophila melanogaster and D. simulans. Genetics 160: 493–507.
JOHNSTON, J. S., A. E. PEPPER, A. E. HALL, Z. J. CHEN, G. HODNETT et al., 2005 Evolution of genome size in Brassicaceae. Ann. Bot. 95: 229–235.
KOCH, M. A., B. HAUBOLD and T. MITCHELL-OLDS, 2000 Comparative evolutionary analysis of the chalcone synthase and alcohol dehydrogenase loci among different lineages of Arabidopsis, Arabis and related genera (Brassicaceae). Mol. Biol. Evol. 17: 1483–1498.
KUITTINEN, H., and M. AGUADE, 2000 Nucleotide variation at the CHALCONE ISOMERASE locus in Arabidopsis thaliana. Genetics 155: 863–872.
KUITTINEN, H., A. A. DE HAAN, C. VOGL, S. OIKARINEN, J. LEPPALA et al., 2004 Comparing the linkage maps of the close relatives Arabidopsis lyrata and A. thaliana. Genetics 168: 1575–1584.
LANGLEY, C. H., B. P. LAZZARO, W. PHILLIPS, E. HEIKKINEN and J. M. BRAVERMAN, 2000 Linkage disequilibria and the site frequency spectra in the su(s) and su(w(a)) regions of the Drosophila melanogaster X chromosome. Genetics 156: 1837–1852.
LERCHER, M. J., and L. D. HURST, 2002 Human SNP variability and mutation rate are higher in regions of high recombination. Trends Genet. 18: 337–340.[CrossRef][Medline]
LYSAK, M. A., A. BERR, A. PECINKA, R. SCHMIDT, K. MCBREEN et al., 2006 Mechanisms of chromosome number reduction in Arabidopsis thaliana and related Brassicaceae species. Proc. Natl. Acad. Sci. USA 103: 5224–5229.
MACDONALD, S. J., T. PASTINEN and A. D. LONG, 2005 The effect of polymorphisms in the enhancer of split gene complex on bristle number variation in a large wild-caught cohort of Drosophila melanogaster. Genetics 171: 1741–1756.
MAYNARD-SMITH, J., and J. HAIGH, 1974 The hitch-hiking effect of a favorable gene. Genet. Res. 23: 23–35.[Medline]
MCDONALD, J. H., and M. KREITMAN, 1991 Adaptive protein evolution at the Adh1 locus in Drosophila. Nature 351: 652–654.[CrossRef][Medline]
MEYERS, B. C., T. H. VU, S. S. TEJ, H. GHAZAL, M. MATVIENKO et al., 2004 Analysis of the transcriptional complexity of Arabidopsis thaliana by massively parallel signature sequencing. Nat. Biotechnol. 22: 1006–1011.[CrossRef][Medline]
NACHMAN, M. W., 2001 Single nucleotide polymorphisms and recombination rate in humans. Trends Genet. 17: 481–485.[CrossRef][Medline]
NACHMAN, M. W., V. L. BAUER, S. L. CROWELL and C. F. AQUADRO, 1998 DNA variability and recombination rates at X-linked loci in humans. Genetics 150: 1133–1141.
NAGAKI, K., Z. CHENG, S. OUYANG, P. B. TALBERT, M. KIM et al., 2004 Sequencing of a rice centromere uncovers active genes. Nat. Genet. 36: 138–145.[CrossRef][Medline]
NORDBORG, M., T. T. HU, Y. ISHINO, J. JHAVERI, C. TOOMAJIAN et al., 2005 The pattern of polymorphism in Arabidopsis thaliana. PLoS Biol. 3: e196.[CrossRef][Medline]
PRESGRAVES, D. C., 2005 Recombination enhances protein adaptation in Drosophila melanogaster. Curr. Biol. 15: 1651–1656.[CrossRef][Medline]
RAMOS-ONSINS, S. E., B. E. STRANGER, T. MITCHELL-OLDS and M. AGUADE, 2004 Multilocus analysis of variation and speciation in the closely related species Arabidopsis halleri and A. lyrata. Genetics 166: 373–388.
ROSELIUS, K., W. STEPHAN and T. STADLER, 2005 The relationship of nucleotide polymorphism, recombination rate and selection in wild tomato species. Genetics 171: 753–763.
SAVOLAINEN, O., C. H. LANGLEY, B. P. LAZZARO and H. FR, 2000 Contrasting patterns of nucleotide polymorphism at the alcohol dehydrogenase locus in the outcrossing Arabidopsis lyrata and the selfing Arabidopsis thaliana. Mol. Biol. Evol. 17: 645–655.
SCHMID, K. J., S. RAMOS-ONSINS, H. RINGYS-BECKSTEIN, B. WEISSHAAR and T. MITCHELL-OLDS, 2005 A multilocus sequence survey in Arabidopsis thaliana reveals a genome-wide departure from a neutral model of DNA sequence polymorphism. Genetics 169: 1601–1615.
TAKAHASHI, A., Y. H. LIU and N. SAITOU, 2004 Genetic variation versus recombination rate in a structured population of mice. Mol. Biol. Evol. 21: 404–409.
TENAILLON, M. I., M. C. SAWKINS, L. K. ANDERSON, S. M. STACK, J. DOEBLEY et al., 2002 Patterns of diversity and recombination along chromosome 1 of maize (Zea mays ssp. mays L.). Genetics 162: 1401–1413.
TIFFIN, P., and M. W. HAHN, 2002 Coding sequence divergence between two closely related plant species: Arabidopsis thaliana and Brassica rapa ssp. pekinensis. J. Mol. Evol. 54: 746–753.[CrossRef][Medline]
WANG, W., K. THORNTON, J. J. EMERSON and M. LONG, 2004 Nucleotide variation and recombination along the fourth chromosome in Drosophila simulans. Genetics 166: 1783–1794.
WATTERSON, G. A., 1975 On the number of segregating sites in genetical models without recombination. Theor. Popul. Biol. 7: 256–276.[CrossRef][Medline]
WEIR, B. S., 1996 Genetic Data Analysis II. Sinauer Associates, Sunderland, MA.
WRIGHT, S. I., and B. CHARLESWORTH, 2004 The HKA test revisited: a maximum-likelihood-ratio test of the standard neutral model. Genetics 168: 1071–1076.
WRIGHT, S. I., B. LAUGA and D. CHARLESWORTH, 2002 Rates and patterns of molecular evolution in inbred and outbred Arabidopsis. Mol. Biol. Evol. 19: 1407–1420.
WRIGHT, S. I., N. AGRAWAL and T. E. BUREAU, 2003a Effects of recombination rate and gene density on transposable element distributions in Arabidopsis thaliana. Genome Res. 13: 1897–1903.
WRIGHT, S. I., B. LAUGA and D. CHARLESWORTH, 2003b Subdivision and haplotype structure in natural populations of Arabidopsis lyrata. Mol. Ecol. 12: 1247–1263.[CrossRef][Medline]
WRIGHT, S. I., I. V. BI, S. G. SCHROEDER, M. YAMASAKI, J. F. DOEBLEY et al., 2005 The effects of artificial selection on the maize genome. Science 308: 1310–1314.
YAMADA, K., J. LIM, J. M. DALE, H. CHEN, P. SHINN et al., 2003 Empirical analysis of transcriptional activity in the Arabidopsis genome. Science 302: 842–846.
This article has been cited by other articles:
![]() |
J. P. Foxe, V.-u.-N. Dar, H. Zheng, M. Nordborg, B. S. Gaut, and S. I. Wright Selection on Amino Acid Substitutions in Arabidopsis Mol. Biol. Evol., July 1, 2008; 25(7): 1375 - 1383. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Kawabe, A. Forrest, S. I. Wright, and D. Charlesworth High DNA Sequence Diversity in Pericentromeric Genes of the Plant Arabidopsis lyrata Genetics, June 1, 2008; 179(2): 985 - 995. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. M. Hazzouri, A. Mohajer, S. I. Dejak, S. P. Otto, and S. I. Wright Contrasting Patterns of Transposable-Element Insertion Polymorphism and Nucleotide Diversity in Autotetraploid and Allotetraploid Arabidopsis Species Genetics, May 1, 2008; 179(1): 581 - 592. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. D. Cutter Multilocus Patterns of Polymorphism and Selection Across the X Chromosome of Caenorhabditis remanei Genetics, March 1, 2008; 178(3): 1661 - 1672. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Kuittinen, A. Niittyvuopio, P. Rinne, and O. Savolainen Natural Variation in Arabidopsis lyrata Vernalization Requirement Conferred by a FRIGIDA Indel Polymorphism Mol. Biol. Evol., February 1, 2008; 25(2): 319 - 329. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Kamau, B. Charlesworth, and D. Charlesworth Linkage Disequilibrium and Recombination Rate Estimates in the Self-Incompatibility Region of Arabidopsis lyrata Genetics, August 1, 2007; 176(4): 2357 - 2369. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. O. Borevitz, S. P. Hazen, T. P. Michael, G. P. Morris, I. R. Baxter, T. T. Hu, H. Chen, J. D. Werner, M. Nordborg, D. E. Salt, et al. Genome-wide patterns of single-feature polymorphism in Arabidopsis thaliana PNAS, July 17, 2007; 104(29): 12057 - 12062. [Abstract] [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||||||||||||