- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Email this article to a friend
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Quesada, H.
- Articles by Aguadé, M.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Quesada, H.
- Articles by Aguadé, M.
Large-Scale Adaptive Hitchhiking Upon High Recombination in Drosophila simulans
Humberto Quesadaa, Ursula E. M. Ramíreza, Julio Rozasa, and Montserrat Aguadéaa Departament de Genètica, Facultat de Biologia, Universitat de Barcelona, Diagonal 645, 08028 Barcelona, Spain
Corresponding author: Humberto Quesada, Facultat de Biologia, Universitat de Barcelona, Diagonal 645, 08028 Barcelona, Spain., humberto{at}porthos.bio.ub.es (E-mail)
Communicating editor: O. SAVOLAINEN
| ABSTRACT |
|---|
Natural selection is expected to leave a characteristic footprint on neighboring nucleotide variation through the effects of genetic linkage. The size of the region affected is proportional to the strength of selection and greatly reduced with the recombinational distance from the selected site. Thus, the genomic footprint of selection is generally believed to be restricted to a small DNA stretch in normal and highly recombining regions. Here, we study the effect of selection on linked polymorphism (hitchhiking effect) by surveying nucleotide variation across a highly recombining
88-kb genomic fragment in an African population of Drosophila simulans. We find a core region of up to 38 kb with a major haplotype at intermediate frequency. The extended haplotype structure that gradually vanishes until disappearing is unusual for a highly recombining region. Both the presence in the structured genomic domain of a single major haplotype depleted of variability and the detected spatial pattern of variation along the
88-kb fragment are incompatible with neutral predictions in a panmictic population. A major role of demographic effects could also be discarded. The observed pattern of variation clearly provides evidence that directional selection has acted recently on this region, sweeping out variation around a strongly adaptive mutation. Our findings suggest a major role of positive selection in shaping DNA variability even in highly recombining regions.
THE extent to which DNA variation is driven by selective and stochastic factors is one of the greatest standing controversies in molecular evolution (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Detecting the effect of selection on neighboring variation may be facilitated by surveying variation in genomic regions containing loci under putatively recent selection (![]()
![]()
![]()
![]()
![]()
![]()
Positive selection has been proposed to explain the unusual haplotype structure detected in Drosophila simulans for a 1.3-kb region encompassing the rp49 gene (![]()
7 kb from rp49 (![]()
![]()
![]()
![]()
![]()
88 kb. This is among the longest stretches of a highly recombining region of Drosophila whose haplotype structure in a single population has been directly determined by DNA sequencing. We were particularly interested in detecting the spatial pattern of variation associated with positive selection. The survey of variation in such a large region has allowed us not only to unambiguously establish the role of positive selection but also to assess the size of the region affected and to estimate the strength of selection.
| MATERIALS AND METHODS |
|---|
Fly samples and regions studied:
Fourteen D. simulans lines from Maputo (Mozambique) were used in this study. These were highly inbred strains obtained by 10 generations of sib-mating. The DNAs from the same individuals previously sequenced for the rp49 region (![]()
![]()
On the basis of the annotated sequence of D. melanogaster (GenBank accession no.
AE003772; ![]()
88-kb genomic fragment were selected (Fig 1). They consisted primarily of noncoding DNA and their size ranged between 0.5 and 1.3 kb (Appendix).
|
DNA sequencing strategy:
Oligonucleotides for PCR amplification and sequencing were designed on the available sequence of D. melanogaster (GenBank accession no.
AE003772; ![]()
Data analysis for individual loci:
DNA sequences were multiply aligned using the Clustal X program (![]()
![]()
![]()
) and the population recombination parameter (C = 2Nc, where N is the effective population size and c is the per-base recombination rate). It was also used to perform different tests of neutrality: Tajima's D test (![]()
![]()
![]()
![]()
![]()
Multilocus tests of neutrality:
We built a multilocus statistic
where Oi is the observed value in region i, Ei is the value expected under the null hypothesis, and n is the number of regions (n = 10 in our case). We determined whether the number of identical sequences or the haplotype diversity departs from the neutral model (i.e., mutation-drift equilibrium in a panmictic population). The expected values of identical haplotypes or of haplotype diversity for each region and the empirical distribution of
were obtained by neutral coalescent simulations (1000 replicates) with recombination (CM = 0.0368, obtained from the comparison of physical and genetic maps and assuming N = 2 x 106 and c = 0.92 x 10-8; ![]()
values more extreme than observed.
| RESULTS AND DISCUSSION |
|---|
A total of 340 segregating sites (nucleotide diversity
= 0.0157) and 45 insertion/deletion polymorphisms were detected across the sequenced regions in the 14 lines from Maputo (Mozambique; Fig 1). The number of segregating sites in each region was rather high and ranged from 18 to 53. The high recombination rate of the genomic fragment surveyed (see Introduction) is apparent in the polymorphism data. A minimum number of 48 recombination events (RM) were detected in the sample using the four-gamete rule method (![]()
![]()
![]()
![]()
![]()
88-kb genomic fragment analyzed.
Despite extensive recombination, nucleotide variation was highly structured along an extended genomic area encompassing rp49 and the three paralogous genes (Fig 1). The clustering pattern in phylogenetic trees is very similar, in many regions, to that previously found: a cluster of identical or nearly identical sequences within a set of highly diverged and heterogeneous sequences. Two features of the data (Fig 1) are consistent with a selective sweep. First, the unusual haplotype structure does not extend across the entire fragment. The affected area is surprisingly large, but flanked by boundary regions with no appreciable haplotype structure (A, I, and J). Second, the number of DNA sequences within the homogeneous subset decays gradually on both sides of the most structured stretch (E and F). Remarkably, some of the lines that form part of the homogeneous subset in a particular region belong to the heterogeneous subset in the surrounding regions. This indicates that recombination has disrupted the association between regions on either side of the most structured stretch.
Positive selection is also inferred from a nonuniform deviation from neutral expectations along the entire fragment, as revealed by several neutrality tests. In all the tests, probability values were obtained by coalescent-Monte Carlo simulations assuming recombination (![]()
![]()
![]()
![]()
![]()
24-kb stretch covering the regions with strong haplotype structure (C, D, E, F, and G; Fig 1). However, the footprint of selection likely spans >38 kb, since the flanking regions B and H still show a detectable haplotype structure (Fig 1) with a number of identical sequences higher than expected (Fig 2). To test whether the observed heterogeneity along the entire fragment occurred simply by chance, we used a coalescent-based approach to take into account stochastic variation in the number of identical sequences or in the haplotype diversity along the entire fragment. For this analysis, we generated random samples of 14 sequences of 88 kb with 4282 segregating sites (those resulting from scaling up the number observed in the sequenced regions) to test for spatial structure across the surveyed fragment using the multilocus
-statistic (see MATERIALS AND METHODS). Both the overall high number of identical sequences observed and the overall low haplotype diversity are clearly incompatible with the neutral model (P < 0.005 in both cases). The results from the multilocus tests were corroborated using a different method, the sliding-window haplotype test (![]()
|
We also used Tajima's D statistic to examine the fit of the observed frequency spectrum of polymorphic sites to that expected under neutrality. No significant departure from selective neutrality was found in any individual region or in the fragment as a whole (Fig 1). However, 8 out of the 10 regions show negative D values indicating an excess of singleton variants. Although all singletons are derived (as inferred from the D. melanogaster outgroup sequence), many occur in the heterogeneous subset of sequences. The negative Tajima's D values are thus not caused by new mutations occurring after the selective sweep, but may be due, for example, to historical events predating the sweep. On the other hand, comparison of polymorphism and divergence levels between the region(s) studied and the vermilion locus (![]()
![]()
![]()
The haplotype tests clearly indicate that natural selection has acted recently and strongly. The homogeneous subset was most likely a formerly rare haplotype driven to an intermediate frequency due to its linkage to a positively selected variant. This is consistent with its phylogenetically derived position (Fig 1) and the high number of fixed derived nucleotide variants in this subset relative to the heterogeneous subset (21 out of 24 along regions BH). Selection would favor the same mutation in the whole distribution area of the species, since the same haplotype was found at intermediate frequency in the populations surveyed for rp49 and the three paralogous genes (![]()
![]()
![]()
The surprisingly large genomic distance over which selection is likely to exert a strong effect on linked variation (at least 2438 kb of a highly recombining region) implies a high selection coefficient. From these distances, we estimated the selection coefficient s to be 0.011 or higher (d x c/0.01, where d is half the length of the affected fragment; ![]()
![]()
3-kb interval defined by the sliding-window haplotype test. However, valleys of depleted variability may be rather asymmetrical around the site of the beneficial mutation (![]()
Our observation of a single major haplotype depleted of variation strongly supports a partial selective sweep (![]()
![]()
![]()
![]()
![]()
![]()
We have considered the alternative hypothesis that the unusual haplotype structure was due to the recent admixture of two differentiated populations, one of them devoid of variation (![]()
![]()
![]()
Recent studies in Drosophila have yielded promising results in the struggle to understand whether natural selection plays a substantial role in protein evolution, where adaptive amino acid substitutions appear to occur at a remarkably high rate (![]()
![]()
![]()
| ACKNOWLEDGMENTS |
|---|
We thank S. Ramos-Onsins for sharing his unpublished coalescent-simulation program, G. Blasco for technical assistance, and W. Stephan for critical comments on the manuscript. We also thank Serveis Científico-Tècnics from Universitat de Barcelona for automated sequencing facilities and Centre de Supercomputació de Catalunya (CESCA) for computer facilities. This work was supported by grants BMC2001-2906 from Comisión Interdepartamental de Ciencia y Tecnología (CICyT) and 2001SGR-101 from Comissió Interdepartamental de Recerca i Innovació Tecnològica (CIRIT), Catalonia, Spain, to M.A.
Manuscript received February 6, 2003; Accepted for publication June 6, 2003.
| APPENDIX |
|---|
|
| LITERATURE CITED |
|---|
ADAMS, M. D., S. E. CELNIKER, R. A. HOTL, C. A. EVANS, and J. D. GOCAYNE et al., 2000 The genome sequence of Drosophila melanogaster.. Science 287:2185-2195.
ANDOLFATTO, P., 2001 Adaptive hitchhiking effects on genome variability. Curr. Opin. Genet. Dev. 11:635-641.[Medline]
ANDOLFATTO, P. and M. PRZEWORSKI, 2000 A genome-wide departure from the standard neutral model in natural populations of Drosophila. Genetics 156:257-268.
ANDOLFATTO, P., J. D. WALL, and M. KREITMAN, 1999 Unusual haplotype structure at the proximal breakpoint of In(2L)t in a natural population of Drosophila melanogaster.. Genetics 153:1297-1311.
ASHBURNER, M., 1989 Drosophila: A Laboratory Handbook. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY.
BEGUN, D. J. and C. F. AQUADRO, 1991 Molecular population genetics of the distal portion of the X chromosome in Drosophila: evidence for genetic hitchhiking of the yellow-achaete region. Genetics 129:89-99.
BEGUN, D. J. and C. F. AQUADRO, 1995 Molecular variation at the vermilion locus in geographically diverse populations of Drosophila melanogaster and D. simulans.. Genetics 140:1019-1032.[Abstract]
CHARLESWORTH, B., M. T. MORGAN, and D. CHARLESWORTH, 1993 The effect of deleterious mutations on neutral molecular variation. Genetics 134:1289-1303.[Abstract]
DEPAULIS, F. and M. VEUILLE, 1998 Neutrality tests based on the distribution of haplotypes under an infinite site model. Mol. Biol. Evol. 15:1788-1790.[Medline]
FAY, J. C., G. J. WYCKOFF, and C.-I WU, 2002 Testing the neutral theory of molecular evolution with genomic data from Drosophila. Nature 415:1024-1026.[Medline]
GILAD, Y., S. ROSENBERG, M. PRZEWORSKI, D. LANCET, and K. SKORECKI, 2002 Evidence for positive selection and population structure at the human MAO-A gene. Proc. Natl. Acad. Sci. USA 99:862-867.
GILLESPIE, J. H., 1991 The Causes of Molecular Evolution. Oxford University Press, Oxford.
HUDSON, R. R., 1983 Properties of a neutral allele model with intra-genic recombination. Theor. Popul. Biol. 23:183-201.[Medline]
HUDSON, R. R., 1990 Gene genealogies and the coalescent process, pp. 144 in Oxford Surveys in Evolutionary Biology, edited by P. H. HARVEY and L. PARTRIDGE. Oxford University Press, New York.
HUDSON, R. R. and N. L. KAPLAN, 1985 Statistical properties of the number of recombination events in the history of a sample of DNA sequences. Genetics 111:147-164.
HUDSON, R. R. and N. L. KAPLAN, 1995 Deleterious background mutations with recombination. Genetics 141:1605-1617.[Abstract]
HUDSON, R. R., M. KREITMAN, and M. AGUADÉ, 1987 A test of neutral molecular evolution based on nucleotide data. Genetics 116:153-159.
HUDSON, R. R., K. BAILEY, D. SKARECKY, J. KWIATOWSKI, and F. J. AYALA, 1994 Evidence for positive selection in the Superoxide Dismutase (Sod) region of Drosophila melanogaster.. Genetics 136:1329-1340.[Abstract]
KAPLAN, N. L., R. R. HUDSON, and C. H. LANGLEY, 1989 The "hitchhiking effect" revisited. Genetics 123:887-899.
KIM, Y. and W. STEPHAN, 2002 Detecting the local signature of genetic hitchhiking along a recombining chromosome. Genetics 160:765-777.
KIMURA, M., 1983 The Neutral Theory of Molecular Evolution. Oxford University Press, Oxford.
KIRBY, D. A. and W. STEPHAN, 1995 Haplotype test reveals departure from neutrality in a segment of the white gene of Drosophila melanogaster.. Genetics 141:1483-1490.[Abstract]
KIRBY, D. A. and W. STEPHAN, 1996 Multi-locus selection and the structure of variation at the white gene of Drosophila melanogaster.. Genetics 144:635-645.[Abstract]
KUMAR, S. K., K. TAMURA, I. B. JAKOBSEN, and M. NEI, 2001 MEGA2. Molecular evolutionary genetics analysis software. Bioinformatics 17:1244-1245.
LACHAISE, D., M.-L. CARIOU, J. R. DAVID, F. LEMEUNIER, and L. TSACAS et al., 1984 Historical biogeography of the Drosophila melanogaster species group. Evol. Biol. 22:159-255.
MADDISON, W. P., and D. R. MADDISON, 1992 MacClade: Analysis of Phylogeny and Character Evolution, Version 3. Sinauer, Sunderland, MA.
MAYNARD SMITH, J. and J. HAIGH, 1974 The hitch-hiking effect of a favourable gene. Genet. Res. 23:23-35.[Medline]
NURMINSKI, D., D. DE AGUIAR, C. D. BUSTAMANTE, and D. L. HARTL, 2001 Chromosomal effects of rapid gene evolution in Drosophila melanogaster.. Science 291:128-130.
OLSON, S., 2002 Seeking the signs of selection. Science 298:1324-1325.
PARSCH, J., C. D. MEIKLEJOHN, and D. HARTL, 2001 Patterns of DNA sequence variation suggest the recent action of positive selection in the janus-ocnus region of Drosophila simulans.. Genetics 159:647-657.
PRITCHARD, J. K. and M. PRZEWORSKI, 2001 Linkage disequilibrium in humans: models and data. Am. J. Hum. Genet. 69:1-14.[Medline]
PRZEWORSKI, M., 2002 The signature of positive selection at randomly chosen loci. Genetics 160:1179-1189.
ROZAS, J. and R. ROZAS, 1999 DnaSP version 3: an integrated program for molecular population genetics and molecular evolution analysis. Bioinformatics 15:174-175.
ROZAS, J., M. GULLAUD, G. BLANDIN, and M. AGUADÉ, 2001 DNA variation at the rp49 gene region of Drosophila simulans: evolutionary inferences from an unusual haplotype structure. Genetics 158:1147-1155.
SABETI, P. C., D. E. REICH, J. M. HIGGINS, H. Z. P. LEVINE, and D. J. RICHTER et al., 2002 Detecting recent positive selection in the human genome from haplotype structure. Nature 419:832-837.[Medline]
SMITH, N. G. C. and A. EYRE-WALKER, 2002 Adaptive protein evolution in Drosophila. Nature 415:1022-1024.[Medline]
STEPHAN, W., T. H. E. WIEHE, and M. W. LENZ, 1992 The effect of strongly selected substitutions on neutral polymorphism: analytical results based on diffusion theory. Theor. Popul. Biol. 41:237-254.
TAJIMA, F., 1989 Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 123:585-595.
THOMPSON, J. D., T. J. GIBSON, F. PLEWNIAK, F. JEANMOUGIN, and D. G. HIGGINGS, 1997 The CLUSTAL X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 25:4876-4882.
WANG, W., K. THORNTON, A. BERRY, and M. LONG, 2002 Nucleotide variation along the Drosophila melanogaster fourth chromosome. Science 295:134-137.
ZUROVCOVÁ, M. and F. J. AYALA, 2002 Polymorphism patterns in two tightly linked developmental genes, Idgf1 and Idgf3, of Drosophila melanogaster.. Genetics 162:177-188.
This article has been cited by other articles:
![]() |
A. Sanchez-Gracia and J. Rozas Unusual Pattern of Nucleotide Sequence Variation at the OS-E and OS-F Genomic Regions of Drosophila simulans Genetics, April 1, 2007; 175(4): 1923 - 1935. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Ojeda, J. Rozas, J. M. Folch, and M. Perez-Enciso Unexpected High Polymorphism at the FABP4 Gene Unveils a Complex History for Pig Populations Genetics, December 1, 2006; 174(4): 2119 - 2127. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Glinka, D. De Lorenzo, and W. Stephan Evidence of Gene Conversion Associated with a Selective Sweep in Drosophila melanogaster Mol. Biol. Evol., October 1, 2006; 23(10): 1869 - 1878. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Quesada, S. E. Ramos-Onsins, J. Rozas, and M. Aguade Positive Selection Versus Demography: Evolutionary Inferences Based on an Unusual Haplotype Structure in Drosophila simulans Mol. Biol. Evol., September 1, 2006; 23(9): 1643 - 1647. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Baudry, N. Derome, M. Huet, and M. Veuille Contrasted Polymorphism Patterns in a Large Sample of Populations From the Evolutionary Genetics Model Drosophila simulans Genetics, June 1, 2006; 173(2): 759 - 767. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Beisswanger, W. Stephan, and D. De Lorenzo Evidence for a Selective Sweep in the wapl Region of Drosophila melanogaster Genetics, January 1, 2006; 172(1): 265 - 274. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Pearce, A. Malisa, S. P. Kachur, K. Barnes, B. Sharp, and C. Roper Reduced Variation Around Drug-Resistant dhfr Alleles in African Plasmodium falciparum Mol. Biol. Evol., September 1, 2005; 22(9): 1834 - 1844. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. K. Ingvarsson Molecular Population Genetics of Herbivore-induced Protease Inhibitor Genes in European Aspen (Populus tremula L., Salicaceae) Mol. Biol. Evol., September 1, 2005; 22(9): 1802 - 1812. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. D. Jensen, Y. Kim, V. B. DuMont, C. F. Aquadro, and C. D. Bustamante Distinguishing Between Selective Sweeps and Demography Using DNA Polymorphism Data Genetics, July 1, 2005; 170(3): 1401 - 1410. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M. Braverman, B. P. Lazzaro, M. Aguade, and C. H. Langley DNA Sequence Polymorphism and Divergence at the erect wing and suppressor of sable Loci of Drosophila melanogaster and D. simulans Genetics, July 1, 2005; 170(3): 1153 - 1165. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. J. Vilella, A. Blanco-Garcia, S. Hutter, and J. Rozas VariScan: Analysis of evolutionary patterns from large-scale DNA sequence polymorphism data Bioinformatics, June 1, 2005; 21(11): 2791 - 2793. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. A. Schlenke and D. J. Begun Linkage Disequilibrium and Recent Selection at Three Immunity Receptor Loci in Drosophila simulans Genetics, April 1, 2005; 169(4): 2013 - 2022. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. P. Lazzaro Elevated Polymorphism and Divergence in the Class C Scavenger Receptors of Drosophila melanogaster and D. simulans Genetics, April 1, 2005; 169(4): 2023 - 2034. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. D. Meiklejohn, Y. Kim, D. L. Hartl, and J. Parsch Identification of a Locus Under Complex Positive Selection in Drosophila simulans by Haplotype Mapping and Composite-Likelihood Estimation Genetics, September 1, 2004; 168(1): 265 - 279. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Kim and R. Nielsen Linkage Disequilibrium as a Signature of Selective Sweeps Genetics, July 1, 2004; 167(3): 1513 - 1524. [Abstract] [Full Text] [PDF] |
||||
- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Email this article to a friend
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Quesada, H.
- Articles by Aguadé, M.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Quesada, H.
- Articles by Aguadé, M.





