Genetics, Vol. 149, 971-981, June 1998, Copyright © 1998

Allele-Specific Population Structure of Drosophila melanogaster Alcohol Dehydrogenase at the Molecular Level

Michel Veuillea, Véronique Bénassib, Sylvie Aulardb, and Frantz Depaulisa
a Laboratoire d'Ecologie, Ecole Pratique des Hautes Etudes, Unité de Recherche Associée 258, Université Paris-6, 75005 Paris, France
b Laboratoire Populations Génétique Evolution, CNRS, 91190 Gif-sur-Yvette, France

Corresponding author: Michel Veuille, Laboratoire d’Ecologie, C 237, Université Paris-6, 7 quai Saint Bernard, 75252 Paris cedex 05, France, mveuille{at}snv.jussieu.fr (E-mail).

Communicating editor: A. G. CLARK


*  ABSTRACT
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*APPENDIX
*LITERATURE CITED

The history of the Drosophila melanogaster alcohol dehydrogenase (ADH) Fast/Slow polymorphism was studied by recording molecular variation and inversion polymorphism in 233 chromosomes from European and African populations. Silent molecular variation in the Slow allele was very different between standard chromosomes and chromosomes bearing the In(2L)t inversion. Within populations, inverted Slow haplotypes were more variable than standard Slow haplotypes. Between populations, geographical structure was almost nonexistent for inverted Slow haplotypes and highly significant for standard Slow. All Fast haplotypes occurred on standard chromosomes. They showed little variation within and between populations. They were highly significantly closer to standard Slow haplotypes from Europe. These results suggest that the current range of Fast and In(2L)t Slow haplotypes is recent and that an older genetic differentiation between populations was followed by allele-specific gene flow.


AN important part of population genetics theory describes the differentiation of populations for selectively neutral alleles under the counteracting effects of genetic drift and migration (WRIGHT 1943 Down, WRIGHT 1951 Down). Our understanding of population structuring for locally advantageous alleles, although critical for theories of evolution, is far less developed. It relies on empirical evidence that differentiation is extreme for some loci in some populations. A methodological assumption is that the neutral model is a null hypothesis and that natural selection can result in locally increased differentiation. An alternative is that newly evolved genetic systems, when successful, spread to the other populations and drive them toward a new adaptive peak. We thus do not know whether species are more, less or equally geographically structured for recently selected alleles than they are for more ancient neutral variation.

To answer this question, we studied natural populations of Drosophila melanogaster. According to TSACAS and LACHAISE 1974 Down, this species originated in Africa, where it lives on decaying fruit. In neolithic times, it became adapted to human environments (LACHAISE et al. 1988 Down) and colonized Europe. It is thought to have spread through the rest of the inhabited world in historical times (DAVID and CAPY 1988 Down).

This history is likely to have caused recent genetical differentiation between populations. There is evidence that it was accompanied by increased resistance to ethanol (reviewed by DAVID 1988 Down). The alcohol dehydrogenase enzyme presents two electromorphs that differ in their biochemical properties. Fast may be broadly described as being more active, whereas Slow is more thermostable (reviewed by CHAMBERS 1988 Down). Allozyme frequencies have been recorded in America (VIGUE and JOHNSON 1973 Down; JOHNSON and SCHAEFFER 1976; PIPKIN et al. 1976 Down), Australia (OAKESCHOTT et al. 1982), Japan (WATADA et al. 1986 Down), China (JIANG et al. 1989 Down), and the Old World (DAVID et al. 1989 Down). They systematically show clines in which the Fast allele is predominant at higher latitudes than Slow.

The sequencing of 10 alleles has shown that Fast haplotypes are less variable than Slow haplotypes (KREITMAN 1983 Down). This was confirmed by restriction site polymorphism studies (AQUADRO et al. 1986 Down; KREITMAN and AGUADE 1986A Down, KREITMAN and AGUADE 1986B Down; AGUADE 1988 Down; SIMMONS et al. 1989 Down; BENASSI et al. 1993 Down; BERRY and KREITMAN 1993 Down; BENASSI and VEUILLE 1995 Down), thus suggesting that the Fast allele is recent. HUDSON et al. 1987 Down found excess polymorphism at the Adh locus. KREITMAN and HUDSON 1991 Down found excess polymorphism in a world sample of Fast and Slow alleles. This was consistent with a model of balancing selection on the amino acid polymorphism or a site closely linked to it. A similar but somewhat smaller peak was also observed for within Slow comparisons, indicating a more complex history for the Slow haplotypes. The Fast allele lacked polymorphism, indicating a recent origin for Fast/Slow or a recent selective sweep within Fast haplotypes. The next step was to compare geographical structuring for Fast/Slow and for silent variation among Slow haplotypes. Among Old World populations, most silent variation opposes West African to East African populations, whereas Fast/Slow opposes European to African populations (BENASSI and VEUILLE 1995 Down).

The recent appearance of the Fast allele is suggested by its linkage disequilibrium with the nearby In(2L)t inversion. The proximal breakpoint of this chromosome arrangement is located one chromosome division (~1700 kb) away from the Adh locus. Despite the high frequency of this inversion in natural populations of Drosophila (LEMEUNIER and AULARD 1992 Down), the alcohol dehydrogenase (ADH) Fast allele has almost always been reported to occur in the standard chromosomal arrangement (reviewed by VAN DELDEN and KAMPING 1989 Down). Knowledge of the recombination rate between the inversion and the gene, coupled with the high frequency of the In(2L)t inversion in West Africa, suggested that the Fast allele could not be older than a few hundred years, at least in this part of the species range (BENASSI et al. 1993 Down). This observation could be interpreted in two ways. According to one hypothesis, the Fast allele recently appeared in the standard chromosome arrangement and dramatically increased in frequency, its absence in In(2L)t chromosomes being due to history. According to a second hypothesis, the standard-Fast arrangement is adaptive and is maintained by natural selection. The purpose of this study is to find evidence relating to this question by determining the karyotype of Adh haplotypes previously reported from Europe and East Africa (BENASSI and VEUILLE 1995 Down), and comparing them to available data for West Africa (BENASSI et al. 1993 Down).


*  MATERIALS AND METHODS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*APPENDIX
*LITERATURE CITED

Population sampling, restriction site polymorphism data, and cytogenetical techniques have been described previously (BENASSI et al. 1993 Down; BENASSI and VEUILLE 1995 Down). All population samples originate from natural populations from France, Ivory Coast, and Malawi. The distances between these populations are 4200 km (France-Ivory Coast), 4600 km (Ivory Coast-Malawi), and 7000 km (Malawi-France). African samples were caught in ecological stations and nature reserves. Isochromosomal lines were directly obtained either from wild-caught individuals or, for some West African samples, from recently established isofemale lines.

Chromosome and molecular data were obtained for 65 Malawi lines, 84 Ivory Coast lines, and 84 French lines. The Ivory Coast and Malawi samples were random samples. The French sample included a random sample from Cognac (of which 8 Slow lines were available for the cytogenetic study) and 31 additional Slow lines from Bordeaux to compensate for the predominance of the Fast allele in this area (see BENASSI and VEUILLE 1995 Down). This did not bias statistical analysis because the different allelic classes were examined separately below. At the other end of the ADH cline, only two Fast alleles were available for Malawi.

The numbering of indels and restriction sites follows their order along the sequence, as in BENASSI and VEUILLE 1995 Down. From 5' to 3': indel ID1, MspI -346, DdeI 9, BanI 102, indel ID2, HaeIII 287, DdeI 321, AluI 562, indels {Delta}1, {Delta}2, and ID3, CfoI 573, MspI 583, HaeIII 688, HaeIII 816, MspI 818, CfoI 864, indel ID5, Sau3AI 1354, HaeIII 1425, Sau96I 1425, DdeI 1518, DdeI 1527, AluI 1596, indel ID6, DdeI 1875, HaeIII 1925.

This study is concerned with genetic distance between nine "groups" (three haplotype classes in three populations). The three haplotype classes will be defined in the RESULTS section.

Genetic distance was calculated after NEI 1987 Down as the average number of differences between pairs of chromosomes. Let Dx and Dy be the average genetic distance within groups X and Y, respectively (hereafter "molecular diversity"), and let Dxy be the average genetic distance between individuals from the X and Y groups; "molecular divergence" was calculated according to the expression Da = Dxy - (Dx + Dy)/2. We used NEI's and MILLER's (1990) RESTSITE program that takes into account probability differences in restriction sites and includes a jackknifing procedure over restriction enzymes to estimate standard errors.

A phylogenetic analysis was carried out on the 36 genetic distances calculated between the nine groups using the neighbor-joining and unweighted pair-group method with arithmetic mean (UPGMA) procedures from version 1.01 of Molecular Evolutionary Genetics Analysis (MEGA; KUMAR et al. 1993 Down). These two different methods were used in order to check that our distance data exhibited a reliable tree-like structure. Because haplotypes exchange polymorphisms through recombination and because populations exchange haplotypes through migration, internal nodes have little biological meaning, except the most distal ones, which represent the average value of each group. The phylogenetic analysis and the distance matrix were therefore used as two different ways to analyze the relationships between the nine groups. Their significance was assessed in the same way, by comparing distances in order to state if a given group was significantly closer to a second one than to a third one. Comparing measures of molecular divergence cannot be achieved by variance tests like a t-test, because divergence values are not normally distributed and because distances are not independent. We used a resampling procedure. We first established a table of divergence values (Da) between pairs of haplotypes. We then estimated the average distance between groups of 100 haplotypes by resampling haplotypes from pooled samples. One-tailed probability level was estimated over 1000 repetitions as the percentage of departure from the observed value. Samples of 100 were used to equalize the power of the test over observations. The sample size was smaller for several comparisons involving Fast haplotypes from Malawi, for which only two Fast alleles were available. The number of comparisons was set at 81 for comparing inversions from Malawi and France, in each of which 9 haplotypes had been found. The power of the test is the same for all other comparisons. There is much redundancy in all the calculated pairwise comparisons between distances. These data are not shown. Data relating to the questions addressed in this study will be presented below.


*  RESULTS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*APPENDIX
*LITERATURE CITED

Inversion polymorphism:
Primary data are given in Table 3. Slow chromosomes were found to carry In(2L)t inversions in the following proportions: 52 out of 69 for Ivory Coast, 9 out of 63 for Malawi, and 9 out of 39 for France. In the later sample, all inversions were found in the Bordeaux subsample. We also found 2 In(2R)NS (52AB;56F) and 2 In(2R) (44F;51A; 54A; 55F) inversions in Malawi, and 1 In(2L) (26A;30F) inversion in France, the latter 2 being new endemic inversions. This will not be dealt with in any detail, inversions from the right arm of chromosome 2 being beyond the scope of this study, whereas the In(2L) endemic inversion was pooled with the standard arrangement, to which it is evolutionarily closer than to In(2L)t. All Fast lines were standard for the In(2L)t inversion. We will thus only consider the three following classes of haplotypes:

  • "Fast" refers to Fast haplotypes in standard chromosomes for the In(2L)t polymorphism.

  • "st-S" refers to Slow haplotypes in standard chromosomes for the In(2L)t polymorphism.

  • "in-S" refers to Slow haplotypes in In(2L)t chromosomes.

These three haplotype classes are present in each population, making up the nine groups that will be compared below.

Distribution of molecular polymorphism within groups:
Table 1 shows the frequency of 20 restriction sites in the nine groups. Eleven of them were polymorphic among the 62 Fast chromosomes, 16 from the 101 st-S chromosomes, and 15 from the 70 in-S chromosomes. The number of polymorphic sites was not very different in the three classes. Six of them were unique to one or the other Slow classes. They were at a low frequency ranging from 2 to 8%. This value was never significantly different from the 0% value observed in the other groups. Therefore, there is no evidence that there are diagnostic sites between alleles. None of the 11 polymorphisms found in Fast were unique to this allele. All of them were observed in one or both of the Slow alleles. This suggests that they originated from Slow haplotypes through recombination or gene conversion.


 
View this table:
In this window
In a new window

 
Table 1. Frequency of restriction site polymorphisms in each haplotype group (three haplotype classes in three populations)


 
View this table:
In this window
In a new window

 
Table 2. Molecular diversity, molecular distance, and molecular divergencea between Adh, Fast, and Slow haplotypes in standard and In (2L)t chromosomes

Linkage disequilibrium between inversions and molecular polymorphisms:
Linkage disequilibrium between polymorphic sites and the inversion was estimated using Fisher's exact test. We included indels in this survey, because two indels from the first intron of Adh have been shown to affect expression levels (LAURIE et al. 1991 Down; MATTHEWS et al. 1992 Down). Linkage disequilibrium involved four significant cases in Ivory Coast, two in Malawi, and none in France, because In(2L)t was absent from the French random sample (Cognac subsample). The two significant cases for Malawi included sites HaeIII 816 (P = 0.026) and Sau3AI 1354 (P = 0.002). In Ivory Coast, the linkage disequilibrium was highly significant (P < 10-8) for the two sites that were characteristic of Fast haplotypes: the F/S amino acid polymorphism itself and the {Delta}1 indel (KREITMAN 1983 Down). The other two significant linkage disequilibria involved MspI -346 and HaeIII 816 sites.

Genetical distances between haplotype classes:
Genetical distances between the nine groups are shown in Table 2. The diagonal (boldface numbers) indicates molecular diversity within each group. The remaining cells are divided into two halves. The lower left half shows genetic distance Dxy between groups. The upper right half indicates molecular divergence Da.

Intragroup variation (diagonal) is very homogeneous within haplotype classes. In-S chromosomes differ by an average of 6.36 restriction sites (SD = 0.39), st-S chromosomes differ by 5.34 sites (SD = 0.65), and Fast chromosomes differ by 1.60 sites (SD = 0.27). Over all populations, in-S chromosomes are 1.20 times more variable than st-S (range: 1.11–1.26), and Fast chromosomes are 0.30 times as variable as st-S (range: 0.28–0.31).

The difference between Slow haplotypes from each chromosome arrangement is slight. Its significance was assessed by bootstrap. The n1 st-S distances and the n2 in-S distances were pooled, and two samples of the same size were randomly drawn. Over 1000 replicates, the 11% difference observed in France was nonsignificant (P = 0.134). The 21 and 26% differences observed in Malawi and in Ivory Coast were highly significant (P = 0.003 and P < 0.001, respectively).

The upper right sector of each of the squares crossing the diagonal of Table 2 shows the molecular divergence between chromosomes from the same haplotype class but from different populations. The three haplotype classes showed similar patterns of differentiation between populations. Malawi and France were always the most different populations. Ivory Coast was consistently, albeit never significantly, closer to Malawi than to France. An unexpected observation is the difference of orders of magnitude. Although divergence values between the three comparisons of populations were weak and very similar in in-S (0.03, 0.14, and 0.61) and in Fast (0.09, 0.22, and 0.69), the divergence in st-S was much larger (0.83, 1.74, and 2.28). Most of these estimates were not significantly different. However, Malawi st-S was significantly more different from France st-S than from Ivory Coast st-S (P < 0.02).

The divergence between st-S and Fast is shown on the middle right square of Table 2. The Fast haplotypes from all three populations were always highly significantly (P < 10-3) closer to st-S from France (mean = 1.12, SD = 0.23; range: 0.89–1.36) than to st-S from Ivory Coast and Malawi (mean = 4.23, SD = 0.31; range: 3.92–4.79).

The divergence between Fast and in-S is shown in the upper right square of Table 2. It shows a less marked divergence, which is consistent with the fact that in-S haplotypes are geographically less divergent than st-S haplotypes. The Fast haplotypes from all three populations were significantly (P < 0.005) closer to in-S from France than to in-S from Malawi. Ivory Coast in-S were intermediate. Only Fast haplotypes from France were significantly more different from in-S from Ivory Coast than from French in-S (P < 0.05).

The divergence between in-S and st-S from the different populations is shown in the middle upper square of Table 2. The st-S haplotypes from France were not very different from in-S haplotypes from either population. The contrary was observed for st-S haplotypes from Malawi. Their divergence with Malawi in-S was significantly smaller than with Ivory Coast in-S (P < 0.05) and highly significantly smaller than with French in-S (P < 0.005). Ivory Coast st-S were intermediate because a significant result (P < 0.05) was observed between distances with European and Malawi in-S.

Phylogenetic analysis:
The phylogenetic analysis is shown on Figure 1. Results from the UPGMA and the neighbor-joining trees are virtually identical. The three terminal branches corresponding to the Fast alleles from the three populations are the shortest. They join each other distally in the phylogeny. This reflects the low haplotype variability among Fast alleles because external branches contain the average genetic distance in each population. The two nodes joining Fast haplotypes are very close to each other. This reflects the close relationship of these alleles between populations. The three in-S groups corresponding to the three populations are at the opposite of Fast. They branch very deeply (proximally) in the tree. They stem from three neighboring nodes. These two aspects are complementary. They indicate the substantial variability of these haplotypes and their relatively low differentiation between populations. The pattern of distribution of the three st-S groups is very different from the former groups. The sample from France branches off on one side of the tree between Fast haplotypes and in-S haplotypes from France and Ivory Coast. The other two st-S samples, from West and East Africa, are at the other end of the tree. This is in agreement with the highly significant geographical differentiation observed among st-S haplotypes. This confirms that Fast haplotypes from any population are closer to European st-S than to African st-S. This also accounts for the fact that in-S haplotypes are intermediate between st-S haplotypes for their distance with Fast haplotypes.



View larger version (20K):
In this window
In a new window
Download PPT slide
 
Figure 1. —UPGMA (top) and Neighbor-Joining (bottom) trees of nine Adh groups (three haplotype classes in three populations), based on the genetic distance (Dxy) matrix of Table 2.

The phylogenetic analysis provides a consistent interpretation of the distance matrix from which it is derived. For instance, the most rightward column of Table 2, which gives the distance of France Fast with the other groups, predicts exactly the order in which these groups connect to the phylogenetic line leading from France Fast to the most extreme group (Malawi and Ivory Coast st-S) on the NJ-tree. It should be noted that the trees of Figure 1 do not represent lines of descent between sequences, but rather distances between groups of haplotypes arranged in consideration of their chromosomal genotype, allozyme genotype, and population of origin. Each of these groups is polymorphic. These trees have no root. Previous studies have shown that the relationships between Adh haplotypes make up a multiconnected network as a result of intragenic recombination (see e.g., SIMMONS et al. 1989 Down; BENASSI et al. 1993 Down). They do not display a tree-like hierarchical structure. The coalescence of branches at the basis of the trees of Figure 1 represent the average number of differences in pairwise comparisons between the most extreme distal ends. Therefore, Slow haplotypes, which carry much recombined polymorphic sites, especially for in-S, converge closer to the basis. The linear series connecting European st-S and Fast haplotypes on one branch is probably the only well-defined line of descent. Little recombination has altered the relationships between these groups.

A noteworthy feature of these trees is the closeness between Ivory Coast in-S and both French in-S and st-S, whereas Ivory Coast st-S is further away in the topology. This accounts for the small molecular divergence observed between French and West African Slow haplotypes by BENASSI and VEUILLE 1995 Down. This also confirms that geographical distances in Slow alleles are fundamentally heterogeneous at the chromosome level.

The phylogenetic analysis thus confirms and synthesizes what was already apparent from the distance analysis.


*  DISCUSSION
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*APPENDIX
*LITERATURE CITED

At least three haplotype classes can be distinguished in the Adh system:
In a previous study (BENASSI and VEUILLE 1995 Down), the same sample was used for a different purpose. Fixation indices for the Fast/Slow allozymes were compared with fixation indices for Slow molecular variation. Genetic structuring was different in the two cases: the Fast/Slow distribution opposed Europe to Africa, and the Slow molecular variation opposed East Africa to West Africa and Europe. The purpose of this study was to demonstrate the action of selection on the amino acid polymorphism. A similar method was followed by BERRY and KREITMAN 1993 Down to study the ADH cline in North America.

This study assumed that silent variation in Slow was neutral and thus represented historical patterns, whereas changes in Fast/Slow frequencies represented selection. The present analysis shows that the silent variation in Slow is heterogeneous. Molecular variation is differently distributed in inverted and noninverted Slow chromosomes. Furthermore, the In(2L)t inversion frequency changes between populations: from 0.73 in Ivory Coast (SD = 0.05) to 0.23 in Malawi (SD = 0.06) and 0.14 in France (SD = 0.04). The inversion frequency changes as dramatically between populations as does the Fast allele. This variation is likely to have biased former estimates of genetical distance between populations (BENASSI and VEUILLE 1995 Down).

Genetic structuring of Adh between Old World populations:
Three haplotype classes (Fast, in-S, and st-S) can be distinguished instead of two alleles (Fast and Slow). They induce discontinuities in the distribution of silent variation. Silent molecular variation in each of them can be provisionally considered neutral. This is substantiated by the close similarity of relative divergence between Malawi, Ivory Coast, and France in each class. Malawi and Europe are always the most distant populations. Assuming that the best estimate of the divergence between populations is provided by the average for the three classes, and setting the Malawi-France divergence at 1.00, we obtain 0.27 between Ivory Coast and Malawi and 0.59 between France and Ivory Coast. It should be noted that the distances between Ivory Coast and the other two populations are not significantly different among themselves. Moreover, the imperfection of this estimation is apparent in its violation of triangular inequality (i.e., AC <= AB + BC), which is generally considered a requirement of mathematical distances.

Variation patterns differ in the three haplotype classes:
Levels of polymorphism within each haplotype class are remarkably similar between populations. Inverted-Slow haplotypes are 1.20 times more polymorphic than st-S, and Fast haplotypes are 0.30 times less polymorphic than st-S. Levels of divergence between populations, however, are different. There is more geographical structuring in st-S than in in-S or Fast. The three groups thus present three different patterns of polymorphism: st-S shows substantial polymorphic and substantial geographic differentiation; in-S is very polymorphic but shows little geographic differentiation; and Fast shows little polymorphism and little geographic differentiation.

Heterogeneity among Slow haplotypes:
An unexpected result of this study is the heterogeneity among Slow haplotypes. The global amount of variation present in the two chromosome arrangements is similar. For instance, between the most extreme populations surveyed in this study (Europe and Malawi) the average number of restriction site differences between two st-S haplotypes (7.96) is close to that observed between two in-S haplotypes (7.19), or between an st-S haplotype from France and an in-S haplotype from Malawi (7.24). However, the geographical apportionment of this variation is different. The whole variation is shared by in-S chromosomes from all populations, which are therefore very similar among themselves. In st-S, each population is less variable than the pooled sample, whereas more variation segregates geographically. This differentiation is mostly between France and African populations. The fact that the two Slow classes share the same polymorphisms and have close heterozygosity levels indicates that they have been exchanging variation through recombination for a long time. Some event, more recent than the setting of this inversion polymorphism, occurred and produced the present pattern. We can imagine two possibilities. First, natural selection, acting on the Adh locus itself, could have affected Slow haplotypes on one chromosome arrangement, the other arrangement remaining unaffected because inversions inhibit crossovers. This hypothesis is not likely because polymorphism levels indicate no selective sweep event. Second, a selective sweep event could have occurred elsewhere on the chromosome. For instance, an advantageous mutant linked to In(2L)t could have appeared at a remote locus on the chromosome. It could have spread among world populations and have erased any geographical structure without sweeping Adh variation away. If so, this phenomenon could be recorded in other parts of the second chromosome. This hypothesis is currently being tested.

Recent origin of Fast haplotypes:
The low variability of Fast, its low differentiation between populations, and its absence from In(2L)t karyotypes together suggest the recent origin of Fast haplotypes. This may be because the Fast mutation appeared recently, or because it was subject to a recent selective sweep. The karyotype of this allele clearly indicates that it originated from a standard chromosome. It is genetically closer to st-S from Europe than from other populations. These facts are in favor of a European origin of Fast. Its broad distribution in Africa suggests that it subsequently spread throughout the Old World.

Origin of the In(2L)t-Fast linkage disequilibrium:
A long-recognized property of the ADH system is the absence of Fast in In(2L)t chromosomes (reviewed by VAN DELDEN and KAMPING 1989 Down). Given known levels of recombination between the Adh locus and the proximal breakpoint of the In(2L)t inversion (MALPICA et al. 1987 Down), and assuming no epistatic selection between ADH and the inversion, the coexistence of the Fast allele and the inversion in Ivory Coast cannot be older than a few hundred years (BENASSI et al. 1993 Down). The genotype of Fast/Slow vs. In(2L)t/standard arrangements is available for several samples of D. melanogaster chromosomes from Raleigh, NC (MUKAI et al. 1971 Down), Eugene, OR (VOELKER et al. 1977 Down), and Carpenter, NC (LANGLEY et al. 1977 Down). In addition to the random samples from Ivory Coast and Malawi, these published studies make up 1002 chromosomes, including 328 Fast alleles (32.7%) and 150 inversions (14.9%). Only two In(2L)t-Fast chromosomes have been reported in this list (LANGLEY et al. 1977 Down), although 49 would be expected at random. According to ASHBURNER et al. 1979 Down(cited in AQUADRO et al. 1986 Down), the recombination rate within and around the Adh locus is 10-5 events/kilobase/generation. According to MALPICA et al. 1987 Down, the recombination rate between the Adh locus and the In(2L)t proximal breakpoint in In(2L)t/standard heterozygotes is 5 x 10-4 (range: 1.9 x 10-3, 6 x 10-5). The recombination rate between In(2L)t and standard is thus an order of magnitude larger than the recombination rate within the gene in a standard homozygote. This makes sense because the distance between the gene and the breakpoint is ~1000x larger than the gene. Given the substantial heterozygosity of this inversion, more recombination is expected between the F/S amino acid polymorphism and the inversion than between F/S and Adh silent polymorphisms. The contrary is observed. Fast haplotypes are polymorphic for eight restriction sites that are already present in Slow. These Fast haplotypes certainly originated by recombination, because they are polymorphic for substitutions that are also polymorphic in Slow. There are two possible explanations for this. One is that natural selection favors the Fast-Standard configuration. The other is historical and is suggested by the results of this study. According to this hypothesis, Fast evolved in Europe from a standard chromosome and remained isolated for some time. Then Fast and In(2L)t haplotypes spread and came into contact. The absence of recombinants would not indicate a recent origin of the haplotypes, but rather a recent gene flow between populations.

An hypothesis on the history of the Adh genetic system:
A way to account for features of the Adh polymorphism is to assume that the history of the species combines geographic differentiation and gene flow. In a first stage, differentiation would have occurred between standard Slow from Europe and from Africa. The In(2L)t inversion would have been restricted to Africa, in agreement with LEMEUNIER's and AULARD's (1992) observation that D. melanogaster inversions mostly occur in tropical areas. Fast would have originated from a standard Slow chromosome in Europe, where it became predominant. In a second stage, world populations would have come into contact, thus favoring the spreading of Fast to Africa, and possibly that of In(2L)t to Europe.

According to LACHAISE et al. 1988 Down and DAVID and CAPY 1988 Down, D. melanogaster was long confined to Africa and Europe and only recently expanded its range to the rest of the world. This is in agreement with mitochondrial DNA variation, which shows that the founders of American and Far Eastern populations made up a few closely related lineages (HALE and SINGH 1991 Down). According to this classic scenario, gene flow occurred from the center of origin toward the periphery. The variation patterns being currently observed in the Old World would result from long-established equilibrium frequencies. An alternative scenario is that Fast appeared in Europe, increased in frequency, and secondarily spread to Africa. Its allele frequency probably rose from 0 to 17.6% as observed in the Lamto ecological reserve, showing that native fruitfly populations from remote field stations are not isolated. Driven by the ecological success of D. melanogaster in human-modified environments, the Fast allele may be invading old established African populations, possibly displacing them toward new adaptative peaks.


*  ACKNOWLEDGMENTS

We thank MATTHEW COBB for helpful criticism and careful reading of the manuscript, and SUZANNE MAZEAU and OLIVIER LANGELLA for technical assistance. The article greatly benefited from the referees' comments on a previous version of the manuscript. The phylogenetic analysis was suggested by ANDY CLARK.

Manuscript received April 5, 1997; Accepted for publication February 23, 1998.


*  APPENDIX
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*APPENDIX
*LITERATURE CITED


 
View this table:
In this window
In a new window

 
Table 3. Restriction site and indel genotype for each haplotype


*  LITERATURE CITED
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*APPENDIX
*LITERATURE CITED

AGUADE, M., 1988  Restriction map variation at the Adh locus of D. melanogaster in inverted and non inverted chromosomes. Genetics 119:135-140[Abstract/Free Full Text].

AQUADRO, C. F., S. F. DESSE, M. A. BLAND, C. H. LANGLEY, and C. C. LAURIE-AHLBERG, 1986  Molecular population genetics of the Alcohol dehydrogenase gene region of Drosophila melanogaster.. Genetics 114:1165-1190[Abstract/Free Full Text].

ASHBURNER, M., R. CAMFIELD, B. CLARKE, D. THATCHER and R. WOODRUFF, 1979 A genetic analysis of the locus coding for alcohol dehydrogenase, and its adjacent chromosome region, in Drosophila melanogaster, pp. 95–106 in Eucaryotic Gene Regulation, edited by R. AXEL, T. MANIATIS and C. F. FOX. Academic Press, New York.

BENASSI, V. and M. VEUILLE, 1995  Comparative population structuring of molecular and allozyme variation of Drosophila melanogaster Adh between Europe, West Africa and East Africa. Genet. Res. 65:95-103[Medline].

BENASSI, V., S. AULARD, S. MAZEAU, and M. VEUILLE, 1993  Molecular variation of Adh and P6 genes in African populations of Drosophila melanogaster and its relation to chromosomal inversions. Genetics 134:789-799[Abstract].

BERRY, A. and M. KREITMAN, 1993  Molecular analysis of an allozyme cline: alcohol dehydrogenase in Drosophila melanogaster on the East coast of North America. Genetics 134:869-893[Abstract].

CHAMBERS, G. K., 1988  The Drosophila Alcohol dehydrogenase gene-enzyme system. Advances in Genetics 25:39-107.

DAVID, J. R., 1988 Ethanol adaptation and alcohol dehydrogenase polymorphism in Drosophila: from phenotypic functions to genetic structures, pp. 163–172 in Population Genetics and Evolution, edited by G. DE JONG. Springer-Verlag, Berlin.

DAVID, J. R. and P. CAPY, 1988  Genetic variation of Drosophila melanogaster natural populations. Trends Genet. 4:106-111[Medline].

DAVID, J. R, F. ALONSO-PORAGA BORAI, P. CAPY, H. MERÇOT, and S. F. MCEVEY et al., 1989  Latitudinal variation of Adh gene frequencies. Drosophila melanogaster: a Mediterranean instability. Heredity 62:11-16.

HALE, L. R. and R. S. SINGH, 1991  A comprehensive study of genic variations in natural populations of Drosophila melanogaster. IV. Mitochondrial DNA variation and the role of history vs. selection in the genetic structure of geographic populations. Genetics 129:103-117[Abstract].

HUDSON, R. R., M. KREITMAN, and M. AGUADE, 1987  A test of neutral molecular evolution based on nucleotide data. Genetics 116:153-159[Abstract/Free Full Text].

JIANG, C., J. B. GIBSON, and H. CHEN, 1989  Genetic differentiation in populations of Drosophila melanogaster from the People's Republic of China: comparison with patterns on other continents. Heredity 62:193-198.

JOHNSON, F. M. and H. E. SCHAFFER, 1973  Isozyme variability in species of the genus Drosophila. VII. Genotype-environment relationship in populations of D. melanogaster from the eastern United States. Biochem. Genet. 10:149-163[Medline].

KREITMAN, M., 1983  Nucleotide polymorphism at the alcohol dehydrogenase gene region of Drosophila melanogaster.. Nature 304:412-417[Medline].

KREITMAN, M. and M. AGUADE, 1986a  Genetic uniformity of two populations of Drosophila melanogaster as revealed by filter hybridization of four-nucleotide restriction enzyme digests. Proc. Natl. Acad. Sci. USA 83:3562-3566[Abstract/Free Full Text].

KREITMAN, M. and M. AGUADE, 1986b  Excess polymorphim at the Adh locus in Drosophila melanogaster.. Genetics 114:93-110[Abstract/Free Full Text].

KREITMAN, M. and R. HUDSON, 1991  Inferring the evolutionary histories of the Adh and Adh-dup loci in Drosophila melanogaster from patterns of polymorphism and divergence. Genetics 127:565-582[Abstract].

KUMAR, S., K. TAMURA and M. NEI, 1993 MEGA: molecular evolutionary genetic analysis. Pennsylvania State University, University Village.

LACHAISE, D., M.-L. CARIOU, J. R. DAVID, F. LEMEUNIER, and L. TSACAS, 1988  Historical biogeography of the Drosophila melanogaster species subgroup. Evol. Biol. 22:159-225.

LANGLEY, C. H., K. ITO, and R. A. VOELKER, 1977  Linkage disequilibrium in natural populations of Drosophila melanogaster. Seasonal variation. Genetics 86:447-454[Abstract/Free Full Text].

LAURIE, C. C., J. T. BRIDGHAM, and M. CHOUDHARY, 1991  Association between DNA sequence variation and variation in expression of the Adh gene in natural populations of Drosophila melanogaster.. Genetics 129:489-499[Abstract].

LEMEUNIER, F., and S. AULARD, 1992 Inversion polymorphism in Drosophila melanogaster, pp. 339–405 in Drosophila Inversion Polymorphism, edited by C. B. KRIMBAS and J. R. POWELL. CRC Press, Cleveland.

MALPICA, J. M., J. M. VASSALLO, A. FRIAS, and F. FUENTES-BOL, 1987  On recombination among In(2L)t, {alpha}-Gpdh and Adh in Drosophila melanogaster.. Genetics 115:141-142[Abstract/Free Full Text].

MATTHEWS, P., A. AGROSTIS, A. C. TAYLOR, and W. MCKECHNIE, 1992  An association between ADH protein levels and polymorphic nucleotide variation in the Adh gene of Drosophila melanogaster.. Mol. Biol. Evol. 9:526-536[Abstract].

MUKAI, T., L. E. METTLER, and S. I. CHIGUSA, 1971  Linkage disequilibrium in a local population of Drosophila melanogaster.. Proc. Natl. Acad. Sci. USA 68:1065-1069[Abstract/Free Full Text].

NEI, M., 1987 Molecular Evolutionary Genetics. Columbia University Press, New York.

NEI, M. and J. C. MILLER, 1990  A simple method for estimating the average number of nucleotide substitutions within and between populations from restriction data. Genetics 125:873-879[Abstract].

OAKESHOTT, J. G., J. B. GIBSON, P. R. ANDERSON, W. R. KNIBB, and D. G. ANDERSON et al., 1982  Alcohol dehydrogenase and Glycerol-3-phosphate dehydrogenase clines in Drosophila melanogaster on different continents. Evolution 36:86-96.

PIPKIN, S. B., E. FRANKLIN-SPRINGER, S. LAW, and S. LUBEGA, 1976  New studies of the alcohol dehydrogenase cline in Drosophila melanogaster from Mexico. J. Hered. 67:258-266[Free Full Text].

SIMMONS, G. M., M. KREITMAN, W. F. QUATTELBAUM, and N. MIYASHITA, 1989  Molecular analysis of the alleles of Alcohol dehydrogenase along a cline in D. melanogaster.. Evolution 43:393-409.

TSACAS, L. and D. LACHAISE, 1974  Quatre nouvelles espèces de la Côte d'Ivoire du genre Drosophila, groupe melanogaster et discussion de l'origine du sous-groupe melanogaster (Diptera: Drosophiliidae). Ann. Univ. Abidjan E 7:193-211.

VAN DELDEN, W. V. and A. KAMPING, 1989  The relation between the polymorphisms at the Adh and {alpha}Gpdh loci and the In(2L)t inversion in Drosophila melanogaster in relation to temperature. Evolution 43:775-793.

VIGUE, C. L. and F. M. JOHNSON, 1973  Isozyme variability in species of the genus Drosophila. VI. Frequency-property-environment relationships of allelic alcohol dehydrogenases in D. melanogaster. Biochem. Genet. 9:213-227[Medline].

VOELKER, R. A., T. MUKAI, and F. M. JOHNSON, 1977  Genetic variation in populations of Drosophila melanogaster from the western United States. Genetica 47:143-148.

WATADA, M., Y. N. TOBARI, and S. OHTA, 1986  Genetic differentiation in Japanese populations of Drosophila simulans and D. melanogaster. I. Allozyme polymorphisms. Jpn. J. Genet. 61:469-480.

WRIGHT, S., 1943  Isolation by distance. Genetics 28:114-138[Free Full Text].

WRIGHT, S., 1951  The genetical structure of populations. Ann. Eugen. 15:323-354.