help button home button Genetics J Histo Cito
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS

Originally published as Genetics Published Articles Ahead of Print on June 18, 2006.

Genetics, Vol. 173, 2039-2047, August 2006, Copyright © 2006
doi:10.1534/genetics.105.053611

This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
genetics.105.053611v1
173/4/2039    most recent
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Panhuis, T. M.
Right arrow Articles by Swanson, W. J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Panhuis, T. M.
Right arrow Articles by Swanson, W. J.

Molecular Evolution and Population Genetic Analysis of Candidate Female Reproductive Genes in Drosophila

Tami M. Panhuis1 and Willie J. Swanson

Department of Genome Sciences, University of Washington, Seattle, Washington 98195

1 Corresponding author: Department of Biology, University of California, Riverside, CA 92521.
E-mail: tpanhuis{at}jsd.claremont.edu

Manuscript received November 18, 2005. Accepted for publication June 1, 2006.


    ABSTRACT
 TOP
 ABSTRACT
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 ACKNOWLEDGEMENTS
 LITERATURE CITED
 
Molecular analyses in several taxa have consistently shown that genes involved in reproduction are rapidly evolving and subjected to positive selection. The mechanism behind this evolution is not clear, but several proposed hypotheses involve the coevolution between males and females. In Drosophila, several male reproductive proteins (Acps) involved in male–male and male–female interactions show evidence of rapid adaptive evolution. What has been missing from the Drosophila literature is the identification and analysis of female reproductive genes. Recently, an evolutionary expressed sequence tag analysis of Drosophila female reproductive tract genes identified 169 candidate female reproductive genes. Many of these candidate genes still await further molecular analysis and independent verification of positive selection. Our goal was to expand our understanding of the molecular evolution of Drosophila female reproductive genes with a detailed polymorphism and divergence study on seven additional candidate female reproductive genes and a reanalysis of two genes from the above study. We demonstrate that 6 candidate female genes of the 9 genes surveyed show evidence of positive selection using both polymorphism and divergence data. One of these proteins (CG17012) is modeled to reveal that the sites under selection fall around and within the active site of this protease, suggesting potential differences between species. We discuss our results in light of potential function as well as interaction with male reproductive proteins.


MOLECULAR analyses reveal that many proteins involved in reproduction are rapidly evolving. This phenomenon is observed for a number of different organisms, including marine invertebrates, mammals, plants, and insects (SWANSON and VACQUIER 2002; CLARK et al. 2006). For example, one of the fastest evolving metazoan proteins is the abalone sperm protein lysin (METZ et al. 1998). In mammals, the egg coat protein ZP2 is among the most divergent molecules between human and rodent (SWANSON et al. 2001a). In many cases, patterns of nucleotide variation within and between species suggest that natural selection has promoted the divergence of reproductive proteins (SWANSON and VACQUIER 2002; YANG 2005).

In Drosophila, patterns of nucleotide variation reveal rapid evolution by positive selection in a group of male reproductive proteins known as the accessory gland proteins (Acps) (SWANSON et al. 2001b). Darwinian selection on Acps is potentially influenced by their role in mediating both male–male and male–female postmating interactions (CHAPMAN et al. 2002). Acps are synthesized and secreted by cells in a paired reproductive structure called the accessory gland. These proteins are ejaculated into females during mating and affect sperm storage, sperm defense during sperm competition, female remating receptivity, egg-laying rate, and longevity (see review by WOLFNER 1997). Additionally, several Acp proteins have been localized to specific regions in a mated female's reproductive tract (BERTRAM et al. 1996; HEIFETZ et al. 2000; RAM et al. 2005). The negative effect of male Acps on female fitness suggests females should evolve interacting proteins to Acps that might play a role in processes relating to male reproductive success. There is evidence that females affect male reproductive success, both in sperm competition and in the fecundity of a single mating (PRICE 1997; CLARK and BEGUN 1998; CLARK et al. 1999; T. M. PANHUIS and L. NUNNEY, unpublished data). However, there is still little information on the evolution of female reproductive proteins (SWANSON et al. 2004). Insight into the evolution of female reproductive proteins is the missing ingredient to a better understanding of coevolutionary processes involved in male–female postmating interactions (PANHUIS et al. 2006), such as sexual selection and sexual conflict.

Recently, SWANSON et al. (2004) identified candidate Drosophila female reproductive genes from an expressed sequence tag (EST) analysis. The screen for candidate genes was based on an evolutionary EST analysis and presence of a signal sequence (indicating a secreted protein). Candidate female genes may be those that show adaptive divergence and potential to be secreted. Adaptive divergence is the most common recurring observation for reproductive proteins and may be explained by selection pressures, such as sexual conflict, sexual selection, immune defense, and self vs. nonself recognition (SWANSON and VACQUIER 2002). Adaptive divergence was determined for female reproductive genes by generating ESTs from Drosophila simulans and aligning them to their putative orthologs in the completed D. melanogaster genome (ADAMS et al. 2000; SWANSON et al. 2004). The overall analysis revealed that candidate genes, as a group, had a 50% increase in nonsynonymous sequence divergence compared to noncandidate reproductive proteins, and that synonymous sequence divergence was similar to the expected value for the species pair (SWANSON et al. 2004). SWANSON et al. (2004) also performed a polymorphism survey and divergence study on portions of nine candidate genes. This survey revealed positive selection in six of the nine genes independently surveyed.

The SWANSON et al. (2004) study is an important first step in identifying potential female reproductive genes, but only 9 of the 169 candidate genes had been surveyed independently for positive selection. Thus, many of the candidate genes still await further molecular analysis and independent verification of positive selection. In this study, our goal was to expand our understanding of the molecular evolution of Drosophila female reproductive genes with a detailed polymorphism and divergence study on 7 additional candidate female reproductive genes and a reanalysis of 2 genes from SWANSON et al. (2004). Unlike SWANSON et al. (2004), whose polymorphism survey focused on a portion of a gene, we present nucleotide polymorphism surveys on the entire gene (coding and noncoding) for all 9 genes. We also assessed DNA sequence divergence among a number of increasingly divergent species of Drosophila by comparing rates of nonsynonymous and synonymous substitutions using PAML (YANG 2000). Our polymorphism and divergence results identified 6 of 9 genes evolving by positive selection. In combination with SWANSON et al. (2004), this work identifies interesting candidate female reproductive genes to evaluate further in experimental and functional analyses from those identified in the original EST analysis. Future analysis biochemically identifying interacting male–female reproductive genes will be important.


    MATERIALS AND METHODS
 TOP
 ABSTRACT
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 ACKNOWLEDGEMENTS
 LITERATURE CITED
 
Fly stocks and DNA preparation:
D. melanogaster flies used in this study came from 42 isofemale lines established by E. Warren and L. Nunney in 2000 from orange groves at the University of California, Riverside. Genomic DNA was extracted from 10–15 adults/line using Purgene DNA isolation kit from Gentra Systems (Minneapolis). Nine candidate female genes (CG10200, CG5106, CG5273, CG9897, CG13004, CG17108, CG17012, CG8453, CG5976) were sequenced. All genes, except CG17012, were picked from SWANSON et al. (2004) on the basis of the presence of a signal sequence or on an overall ratio of nonsynonymous (dN) to synonymous (dS) changes >0.5 (a value predicted by the authors to be indicative of adaptive evolution). CG17012 was chosen as a female candidate gene on the basis of work by ARBEITMAN et al. (2004), who show this gene to be strongly expressed in the female spermatheca and parovaria, two regions of the female reproductive tract likely to interact with male reproductive proteins or sperm. For all genes, Table 1 shows the chromosome of each locus, the final number of isofemale lines sequenced per gene (N), the potential gene ontology function from FlyBase, the estimated EST dN/dS ratio from SWANSON et al. (2004), and the presence or absence of a signal sequence. Polymerase chain reaction products were diluted fivefold with water, sequenced directly using ABI big dye terminator sequencing chemistry, and analyzed on an ABI 3100 automated sequencer. A consensus sequence for each gene was generated from visualizing both PCR product sequence strands using Sequencer 4.1 for Mac (Genecodes, Ann Arbor, MI). PCR primers and conditions are available from the authors upon request. Sequences are deposited in GenBank under accession nos. DQ539048DQ539337.


View this table:
[in this window]
[in a new window]

 
TABLE 1 General information on female candidate genes examined in this study

 
Polymorphism survey and divergence study:
We performed tests of neutrality for each gene including Tajima's D (TAJIMA 1989), Fu and Li's D (with an outgroup) and D* (without an outgroup) (FU and LI 1993), and Fay and Wu's H (FAY and WU 2000) using DnaSP4.0 (ROZAS and ROZAS 1999). We used a combination of test statistics to reveal more about the pattern of selection (OTTO 2000). Significance for all tests was determined by coalescent simulation with R (recombination) estimated from the data using HUDSON (1987) and putative orthologous D. simulans sequences for an outgroup sequence in Fu and Li's D and Fay and Wu's H. Orthologous D. simulans sequences were determined from NCBI BLAST (ALTSCHUL et al. 1990) and aligned to the D. melanogaster sequences with Clustal W (THOMPSON et al. 1994). Each of the three tests analyze sequences on the basis of a site-frequency spectrum (proportion of alleles at high or intermediate vs. low frequencies) and determine if loci depart from the expectation of a neutral equilibrium model (AQUADRO 1997). An excess of rare alleles is consistent with positive selection and is indicated by a negative Tajima's D and/or Fu and Li's D (TAJIMA 1989, FU and LI 1993). An excess of high-frequency variation is consistent with balancing selection and is indicated by a positive Tajima's D and/or Fu and Li's D (TAJIMA 1989, FU and LI 1993). Fay and Wu's H predicts positive selection on the basis of a site-frequency spectrum comparing the proportion of alleles at intermediate vs. high frequency and is indicated by a negative Fay and Wu's H (FAY and WU 2000). Departures from neutrality are predicted to be associated with recent selection acting at or near the locus. To control for demographic effects, we also sequenced and analyzed five randomly chosen loci (CG12191, CG1124, CG11105, CG33554, CG3022) focusing on intronic regions. Intronic regions were the focus to increase the number of polymorphic sites. Sequence protocol was similar to that used for the candidate female genes.

A Hudson–Kreitman–Aquade test (HUDSON et al. 1987) was used to compare the levels of polymorphism to divergence, as implemented in the HKA program for multiple loci by Jody Hey (http://lifesci.rutgers.edu/~heylab/HeylabSoftware.htm#HKA). Under neutrality, the levels of polymorphism within a species and divergence between species should be proportional to the neutral mutation rate. A McDonald–Kreitman test (MCDONALD and KREITMAN 1991) was performed for each gene individually and for all loci combined using DnaSP 4.0 (ROZAS and ROZAS 1999). The McDonald–Kreitman test tests the prediction that if both synonymous (silent, dS) and nonsynonymous (replacement, dN) mutations are neutral, then the ratio of synonymous to nonsynonymous polymorphism within a species will be similar to the ratio of synonymous to nonsynonymous divergence between species (fixed differences). Statistical departure from neutrality was tested with a G-test on a 2 x 2 contingency table of silent and replacement fixed differences between species and silent and replacement polymorphic changes within D. melanogaster. Polymorphism data is from the D. melanogaster sequences after alignment with the putative orthologous D. simulans sequence. We also report the neutrality index (NI) (RAND and KANN 1996), which shows the directionality of the McDonald–Kreitman test. An NI value >1 is consistent with negative selection, while an NI value <1 is consistent with positive selection. McDonald–Kreitman tests were not performed on the neutral loci because coding regions were not sequenced.

For the divergence study, we used BLAST to determine putative orthologous sequences from several Drosophila species (D. simulans, D. yakuba, D. anannasse, D. pseudoobscura, D. virilis, and D. mojavensis) for all female genes except CG17108 (resulting in a total of eight genes analyzed in the divergence study). CG17108 was not used due to its biased amino acid and codon usage, which may induce errors in parameter estimates that use codon models (SWANSON et al. 2004). D. pseudoobscura putative orthologs could not be identified for CG5273 and CG5106, and D. virilis putative orthologs could not be identified for CG9897, CG10200, and CG13004. Sequences were aligned using MEGA 3.1 (KUMAR et al. 2004) and Se-Al v2.0 (RAMBAUT 1996) and analyzed in the phylogenetic analysis by maximum likelihood program (PAML, YANG 2000). PAML tests for positive selection using a likelihood ratio test that compares a null (neutral) model where no codons could have a dN/dS ratio >1 (L0) with the likelihood of a model in which a subset of sites could have a dN/dS ratio >1 (L1) (NIELSEN and YANG 1998; YANG and BIELAWSKI 2000; YANG et al. 2000). Statistical significance is calculated by the negative of twice the difference in the log-likelihood obtained from these two models (–2[log(L0) – log(L1)]) compared to a chi-square distribution with degrees of freedom equal to the difference in the number of estimated parameters (SWANSON et al. 2004). We examined variation in the dN/dS ratio between sites using both discrete (PAML models M0 and M3) and ß- (PAML models M7 and M8) distributions. The comparison of M0 and M3 is not a robust test of adaptive evolution, but tests for variation in the dN/dS ratio between sites (SWANSON et al. 2004). The M7 and M8 comparison is a robust test of adaptive evolution. Details on the test statistics and distributions can be found at YANG et al. (2000). Sites predicted to be subjected to positive selection, using a Bayes empirical approach (YANG et al. 2005), were mapped onto a three-dimensional structure for CG17012 predicted using SwissModel (SCHWEDE et al. 2003).


    RESULTS
 TOP
 ABSTRACT
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 ACKNOWLEDGEMENTS
 LITERATURE CITED
 
Tajima's D, Fu and Li's D* and D, and Fay and Wu's H:
To test if any of the nine candidate genes have been subjected to positive selection, we performed several different tests of neutrality. Table 2 shows that levels of polymorphism in five of the nine genes surveyed depart significantly from equilibrium neutral expectations. Genes CG5273 and CG9897 show significant departure from neutral expectations on the basis of Tajima's D (TAJIMA 1989) and/or Fu and Li's D* (FU and LI 1993). A significant negative Tajima's D, Fu and Li's D* or D value (as in the case of CG5273) indicates an excess of rare alleles, which may result from a recent selective sweep. When a selective sweep occurs, in the presence of recombination, linked variation is dragged toward fixation, which results in an excess of high-frequency-derived mutations in regions near the site of selection. Polymorphism at sites near the selected site is eliminated as the favored variant is fixed. After the sweep, an excess of rare alleles is observed as new mutations occur in the targeted region and drift upward in frequency, since every new mutation produces a new allele. The time to return to levels expected under neutral equilibrium will depend on the population size and can be slow to occur for large populations. A significantly positive Fu and Li's D* (without an outgroup) for CG9897 suggest that this gene may be experiencing balancing selection, since balancing selection maintains mutations at intermediate frequencies.


View this table:
[in this window]
[in a new window]

 
TABLE 2 Five candidate genes show an indication of positive selection on the basis of polymorphism surveys

 
Three additional genes show positive selection on the basis of Fay and Wu's H when a putative orthologous D. simulans sequence is used as an outgroup. An outgroup sequence is required to infer the ancestral and derived states at segregating sites. These genes are CG5106, CG17108, and CG8453. A negative H value is indicative of genetic hitchhiking. Genetic hitchhiking occurs when a neutral mutation is tightly linked to a locus under positive selection. This linkage allows the mutation to rise to high frequency, which is indicated by a significantly negative H value (FAY and WU 2000).

Statistical tests of neutrality on the basis of the site frequency spectrum are known to be confounded by demographic processes. For example, Tajima's D and Fu and Li's D can be affected by population bottlenecks and migration (TAJIMA 1989; FU and LI 1993). Additionally, population admixture can result in significant Fay and Wu's H values (PRZEWORSKI 2002). Therefore, care needs to be taken to account for demographic effects, particularly in derived (non-African) populations of D. melanogaster (as studied herein) where a bottleneck model is the appropriate demographic null model (HADDRILL et al. 2005; THORNTON and ANDOLFATTO 2006). We took two approaches to control for demographic effects. First, under a simple neutral model, selection is often thought to be a locus-specific effect while demographics may affect the entire genome; therefore, we sequenced five randomly chosen loci (Table 2) from the same individuals used for the candidate female genes. Consistent with a lack of demographic effects, none of these randomly chosen genes showed a departure from neutrality (Table 2). However, it should be noted that certain demographic scenarios may increase the variance between loci (NIELSEN 2005). Therefore, our second approach (see below) compares synonymous and nonsynonymous changes using the McDonald–Krietman test (MCDONALD and KREITMAN 1991) and divergence between species (NIELSEN and YANG 1998; YANG et al. 2000), both of which are not confounded by demographic effects (NIELSEN 2005).

Hudson–Kreitman–Aquadé and McDonald–Kreitman test:
We used two tests that compare the ratio of polymorphisms within species to divergence between species. First, we used a multilocus Hudson–Kreitman–Aquadé (HKA) (HUDSON et al. 1987) test to assess departures from a neutral model (HAMMER et al. 2004). We found significant departure from a neutral model (P < 0.001), with most loci showing a deficit of polymorphism. One exception was CG9897, in which we observed twice as many polymorphisms as expected. Along with the positive Fu and Li D* test (Table 2), this observation from the HKA test is consistent with CG9897 being subjected to some form of balancing selection.

For our second test comparing polymorphism to divergence, we used a McDonald–Kreitman test (MCDONALD and KREITMAN 1991), which uses information from both silent (synonymous) and replacement (nonsynonymous) sites to test for positive selection (whereas the neutrality tests used in the frequency spectrum and HKA test do not distinguish between silent and replacement sites). Unlike the frequency spectrum-based tests and HKA test above, the results of a McDonald–Kreitman test are not confounded by demographic effects (NIELSEN 2005). This analysis depends on the prediction that under neutrality, the number of substitutions between two species and the number of polymorphic changes within a species will both be proportional to the mutation rate, and this will be true for both silent and replacement nucleotide changes (OTTO 2000). Table 3 shows silent and replacement variation and the NI (RAND and KANN 1996) in all female genes for polymorphisms from D. melanogaster sequences and fixed differences between D. melanogaster and a putative orthologous D. simulans sequence. A G-test revealed that CG17012 deviates significantly from homogeneity and has an NI value less than one, which is consistent with positive selection. When all loci are combined there is significant heterogeneity (Table 3). This result may be due to the significant deviation from homogeneity in CG17012. When this locus is removed from the analysis, the analysis remains significant (Table 3). This indicates that overall these female genes have been subjected to positive selection (as indicated by the NI value less than one and significant G-test).


View this table:
[in this window]
[in a new window]

 
TABLE 3 Contingency tables and G-test results for silent and replacement polymorphic and fixed variation in candidate female genes using D. melanogaster (Riverside) sequences and a putative orthologous D. simulans sequence

 
Divergence study:
Unlike the neutrality tests on the basis of polymorphism data, which detect recent selection in a single species, the divergence analysis detects recurrent positive selection events on the same codons of several species. We found a significant amount of variation in the dN/dS ratio between sites for all eight genes analyzed with the discrete model M3 (Table 4, M0 vs. M3). Three of these genes have a class of sites with a dN/dS > 1 (Table 4). This discrete model, however, is only an indication of variable dN/dS ratios between sites and should not be used as a robust test of adaptive evolution (SWANSON et al. 2001a). A more robust test of adaptive evolution is one with a beta distribution of dN/dS for "neutral" or functionally constrained codons that covers the interval 0–1 and permits a class of sites to vary freely, including dN/dS > 1 (model M8, Table 4). Evidence of selection acting on a subset of codons was found for four of the eight genes using this model (Table 4, M7 vs. M8). For one of the genes, CG17012, we were able to map residues predicted to be under positive selection onto a three-dimensional model (Figure 1). The sites under positive selection fall around the active site of this protease, suggesting potential functional differences between species. For example, these methods have been used to identify sites under positive selection that effectively predict the location of the binding sites in MHC (YANG and SWANSON 2002), abalone lysin (YANG and SWANSON 2002), and mammalian egg coat proteins (SWANSON et al. 2001a).


View this table:
[in this window]
[in a new window]

 
TABLE 4 Positive selection detected for female candidate reproductive genes by maximum likelihood analysis

 

Figure 1
View larger version (91K):
[in this window]
[in a new window]
[Download PPT slide]
 
FIGURE 1.— Three-dimensional model of the protease CG17012. The active site catalytic triad is shown in white and the sites predicted to be subjected to positive selection are shown in dark gray. Note that the sites under selection fall around and within the active site of the protease, suggesting potential functional differences between species.

 

    DISCUSSION
 TOP
 ABSTRACT
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 ACKNOWLEDGEMENTS
 LITERATURE CITED
 
Rapid reproductive protein evolution is emerging as a common phenomenon among many taxa. Much of this evolution appears to be adaptive. Sperm–egg binding partners in mammals and abalone, for instance, show rapid adaptive evolution in both the male and the female proteins involved in the interaction (SWANSON et al. 2001a; JANSA et al. 2003; PANHUIS et al. 2006), suggesting the selective pressure relates to a coevolutionary process between the male and the female. In Drosophila, the outcome of male–female postmating interactions reveals that male Acps are influential in altering female physiology and behavior and that female genotype contributes to the reproductive success of male gametes (WOLFNER 1997, 2002). Male proteins have been studied in great detail and several Acps are rapidly evolving by adaptive evolution. One explanation for this rapid evolution is coevolution between the sexes; however, several other evolutionary hypotheses have been proposed (PARKER 1970; EBERHARD 1996; RICE 1996; GAVRILETS 2000; SWANSON and VACQUIER 2002; SWANSON et al. 2004). To fully evaluate a process that facilitates coevolution between interacting proteins we need information on both male and female protein evolution (CHAPMAN et al. 2003; SWANSON et al. 2004; PANHUIS et al. 2006). This study has identified six (including CG17018 previously identified by SWANSON et al. 2004) female reproductive genes that have experienced adaptive evolution. Furthermore, these six genes make excellent candidates for further functional studies on their role in male–female postmating interactions.

The two approaches we used to detect positive selection in these candidate female genes, polymorphism surveys, and divergence analyses, detect different kinds of adaptive evolution. The strength of the polymorphism survey lies in its ability to detect signs of recent selection acting on a locus. The divergence analysis, however, detects recurrent selection that may act upon codons in most lineages studied (ANISIMOVA et al. 2001; SWANSON et al. 2004). A significant result from either method is good evidence of adaptive evolution (SWANSON et al. 2004; NIELSEN 2005). Importantly, by using two methods (McDonald–Kreitman test and dN/dS analyses) that are robust to demographic effects (NIELSEN 2005) we have increased confidence that the signatures we have documented are not the result of demography.

Our polymorphism survey used tests of neutrality to look for signs of recent selective events in each locus. These tests revealed that five of the nine genes surveyed exhibit a recent selective event. Significant negative values for four of the five genes (CG5106, CG17108, CG8453, and CG5273) suggest a role of directional selection, while CG9897 may be influenced by balancing selection, as indicated by a positive Fu and Li's D* (Table 2). Consistent with balancing selection is the high level of replacement polymorphism seen at this locus in the 36 D. melanogaster sequences surveyed (Table 3). A McDonald–Kreitman test on all loci combined revealed a significant departure from homogeneity; even when CG17012 is removed from this analysis (due to its significant departure from homogeneity alone) there is a significant departure from neutrality (Table 3). Our divergence analyses indicated positive selection, a dN/dS ratio >1, for three of the nine loci (Table 4, M7 vs. M8): CG9897, CG10200, and CG17012. Several of the genes surveyed that show signs of positive selection (CG5106, CG5273, CG10200, and CG17108) (Table 1) have no documented function. Three of the genes surveyed, which also show signs of positive selection, have a putative ontology function (Table 1) (ASHBURNER et al. 2000). We discuss these positively selected genes in light of their potential function and interaction with male reproductive proteins. We stress, however, that experiments to demonstrate functionality are needed before we can conclude their role in male–female interactions.

Positive selection detected for two trypsin-like serine proteases—CG17012 and CG9897:
Two loci with signs of positive selection, CG9897 and CG17012, contain a conserved trypsin-like serine protease domain (Blastp, 100% aligned, expect 7e-33 and 3e-46, respectively; 232 and 231 residues, respectively) (ARBEITMAN et al. 2004). Proteases are enzymes that cleave other proteins and may be important in interactions with male reproductive proteins (SWANSON et al. 2004).

CG17012 is an interesting candidate locus for its potential interaction with male proteins. Our study has revealed that this gene shows signs of rapid adaptive evolution (Table 24) using both polymorphism and divergence data. Sites predicted under selection fall right around the active site of the protease, suggesting functional differentiation between species (Figure 1). Furthermore, this locus shows increased expression in the female sperm storage organ (spermatheca) and the lumen of the parovaria (ARBEITMAN et al. 2004). CG17012's high expression level in the spermatheca suggests that it may be involved in sperm storage or sperm motility, as tyrpsins are in Lepidoptera (FRIEDLANDER et al. 2001; ARBEITMAN et al. 2004). As a putative protease this protein may play a role in the proteolytic cleavage of a male prohormone inside the mated female (PARK and WOLFNER 1995; ARBEITMAN et al. 2004). Additionally, it may offset some of the harmful effects of male reproductive proteins after mating (reviewed in WOLFNER 2002; ARBEITMAN et al. 2004). Evolutionary models of genes involved in male–female coevolution, such as sexual conflict and sexual selection, predict these genes to be rapidly evolving by adaptive evolution (RICE 1996; GAVRILETS 2000). A role in any of these above scenarios would be consistent with coevolution between the sexes; our result of rapid adaptive evolution supports this prediction. Further functional studies on this gene are needed to confirm a role for this protein in male–female postmating interactions.

Like CG17012, locus CG9897 also has a conserved protease domain and is rapidly evolving by positive selection. This locus has a significant, positive Fu and Li's D* (Table 3) indicating balancing selection may contribute to the evolution of this gene. Balancing selection tends to maintain mutations at intermediate frequencies, and may evolve in response to male–female interactions if a rare variant is favored, as seen in self vs. nonself recognition systems of several plant species (CHARLESWORTH 2002). Sperm competition and single mating studies in Drosophila show that male fertilization success is partly determined by the female genotype (CLARK and BEGUN 1998; CLARK et al. 1999). This male x female interaction has been suggested to contribute to the maintenance of allelic variation and balancing selection in several male reproductive proteins and may also explain the degree of polymorphism and potential balancing selection seen for CG9897 (CLARK et al. 1999).

Positive selection in a cytochrome P450 gene—CG8453:
CG8453 is a cytochrome P450 gene, Cyp6g1. Cytochrome P450 genes are involved in oxidative degradation of various compounds, such as endogenous and exogenous toxins. This locus has been studied for its potential role in response to insecticide resistance, such as DDT and other harmful molecules (DABORN et al. 2002). Lines in D. melanogaster with high levels of Cyp6g1 transcript appear to be resistant to DDT compared to susceptible lines, which do not show high expression level (DABORN et al. 2001, 2002; SCHLENKE and BEGUN 2004). A transposable element, Accord, inserted several hundred base pairs upstream of the transcription start site appears to be present only in DDT-resistant strains suggesting a role for this element in the upregulation of Cyp6g1 and insecticide resistance (DABORN et al. 2002). A significant Fay and Wu's H seen at this locus (Table 2) is consistent with a recent hitchhiking event due to selection at or near this locus. If the Accord transposable element is indeed favored by selection due to its putative role in the upregulation of Cyp6g1 we might expect a reduction in heterozygosity at this locus (SCHLENKE and BEGUN 2004). This hitchhiking event has been found at this locus in D. simulans (SCHLENKE and BEGUN 2004). SCHLENKE and BEGUN (2004) also found a low level of variation in a region near Cyp6g1 for a small sample of D. melanogaster. This level of variation is similar to what we observe at this locus in our larger sample of D. melanogaster ({theta} = 0.00169; Table 2). We do not have sequence data for the Accord region and cannot comment on whether or not our D. melanogaster sample from Riverside has this transposable element.

The potential detoxification effects of this protein may be important in male–female postmating interactions by reducing harmful molecules introduced to the female upon mating. Interestingly, toxic effects of male reproductive proteins on females have been shown (LUNG et al. 2002) and a greater expression level of this gene in adult female tissue compared to male tissue (ARBEITMAN et al. 2004) may aid in the detoxification of male introduced toxins. A specific role of CG8453 in the detoxification of toxic male proteins awaits further study.

Conclusion:
We have shown several genes expressed in the female reproductive tract that have been targets of positive selection. This includes a broad class of genes with a variety of functions. While identification of positive selection in genes encoding reproductive proteins is a recurrent observation (SWANSON and VACQUIER 2002), there are still very few female genes for which adaptive evolution has been documented. Since reproduction, including sperm competition, is a dynamic process involving both male and female components we believe it is necessary to study the dynamics of reproductive genes from both sexes.


    ACKNOWLEDGEMENTS
 TOP
 ABSTRACT
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 ACKNOWLEDGEMENTS
 LITERATURE CITED
 
We thank David Rand, Jennifer Calkins, and two anonymous reviewers for comments on the manuscript. T.M.P was supported on National Institutes of Health Genome Training Grant HG-000035 and W.J.S. was supported by National Institutes of Health grant HD-42563 and National Science Foundation grant DEB-0410112.


    LITERATURE CITED
 TOP
 ABSTRACT
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 ACKNOWLEDGEMENTS
 LITERATURE CITED
 

ADAMS, M. D., S. E. CELNIKER, R. A. HOLT, C. A. EVANS, J. D. GOGAYNE et al., 2000 The genome sequence of Drosophila melanogaster. Science 287: 2185–2195.[Abstract/Free Full Text]

ALTSCHUL, S. F., W. GISH, W. MILLER, E. W. MYERS and D. J. LIPMAN, 1990 Basic local alignment search tool. J. Mol. Biol. 215: 403–410.[CrossRef][Medline]

ANISIMOVA, M., J. P. BIELAWSKI and Z. YANG, 2001 Accuracy and power of the likelihood ratio test in detecting adaptive molecular evolution. Mol. Biol. Evol. 18: 1585–1592.[Abstract/Free Full Text]

AQUADRO, C. F., 1997 Insights into the evolutionary process from patterns of DNA sequence variability. Curr. Opin. Genet. Dev. 7: 835–840.[CrossRef][Medline]

ARBEITMAN, M. N., A. A. FLEMING, M. L. SIEGAL, B. H. NULL and B. S. BAKER, 2004 A genomic analysis of Drosophila somatic sexual differentiation and its regulation. Development 131: 2007–2021.[Abstract/Free Full Text]

ASHBURNER, M., C. A. BALL, J. A. BLAKE, D. BOTSEIN, H. BUTLER et al., 2000 Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat. Genet. 25: 25–29.[CrossRef][Medline]

BERTRAM, M. J., D. M. NEUBAUM and M. F. WOLFNER, 1996 Localization of the Drosophila male accessory gland protein Acp36DE in the mated female suggests a role in sperm storage. Insect Biochem. Mol. Biol. 26: 971–980.[CrossRef][Medline]

CHAPMAN, T., G. ARNQVIST, J. BANGHAM and L. ROWE, 2002 Sexual conflict. TREE 18: 41–46.

CHAPMAN, T, J. BANGHAM, G. VINTI, B. SEIFRIED, O. LUNG et al., 2003 The sex peptide of Drosophila melanogaster: female post-mating responses analyzed by using RNA interference. Proc. Natl. Acad. Sci. USA 100: 9923–9928.[Abstract/Free Full Text]

CHARLESWORTH, D., 2002 Self-incompatibility: how to stay incompatible. Curr. Biol. 12: R424–R426.[CrossRef][Medline]

CLARK, A. G., and D. J. BEGUN, 1998 Female genotype affects sperm displacement in Drosophila. Genetics 149: 1487–1493.[Abstract/Free Full Text]

CLARK, A. G., D. J. BEGUN and T. PROUT, 1999 Female x male interactions in Drosophila sperm competition. Science 283: 217–220.[Abstract/Free Full Text]

CLARK, N. L., J. E. AAGAARD and W. J. SWANSON, 2006 Evolution of reproductive proteins from animals and plants. Reproduction 131: 11–22.[Abstract/Free Full Text]

DABORN, P., S. BOUNDY, J. YEN, B. PITTENDRIGH, and R. FFRENCH-CONSTANT, 2001 DDT resistance in Drosophila correlates with Cyp6g1 over-expression and confers cross-resistance to the neonicotinoid imidacloprid. Mol. Genet. Genomics 266: 556–563.[CrossRef][Medline]

DABORN, P., J. L. YEN, M. R. BOGWITZ, G. LE GOFF, E. FEIL et al., 2002 A single P450 allele associated with insecticide resistance in Drosophila. Science 297: 2253–2256.[Abstract/Free Full Text]

EBERHARD, W. G., 1996 Female Control: Sexual Selection by Cryptic Female Choice. Princeton University Press, Princeton, NJ.

FAY, J. C., and C.-I WU, 2000 Hitchhiking under positive Darwinian selection. Genetics 155: 1405–1413.[Abstract/Free Full Text]

FRIEDLANDER, M., A. JESHTADI and S. REYNOLDS, 2001 The structural mechanism of trypsin-induced intrinsic motility in Manduca sexta spermatozoa in vitro. J. Insect Physiol. 47: 245–255.[CrossRef][Medline]

FU, Y. X., and W. H. LI, 1993 Statistical tests of neutrality of mutations. Genetics 133: 693–709.[Abstract]

GAVRILETS, S., 2000 Rapid evolution of reproductive barriers driven by sexual conflict. Nature 403: 886–889.[CrossRef][Medline]

HADDRILL, P. R., K. R. THORNTON, B. CHARLESWORTH and P. ANDOLFATTO, 2005 Multilocus patterns of nucleotide variability and the demographic and selection history of Drosophila melanogaster populations. Genome Res. 15: 790–799.[Abstract/Free Full Text]

HAMMER, M. F., D. GARRIGAN, E. WOOD, J. A. WILDER, Z. MOBASHER et al., 2004 Heterogeneous patterns of variation among multiple human x linked loci: the possible role of diversity-reducing selection in non-Africans. Genetics 167: 1841–1853.[Abstract/Free Full Text]

HEIFETZ, Y., O. LUNG, E. A. FRONGILLO, JR. and M. F. WOLFNER, 2000 The Drosophila seminal fluid protein Acp26Aa stimulates release of oocytes by the ovary. Curr. Biol. 10: 99–102.[CrossRef][Medline]

HUDSON, R. R., 1987 Estimating the recombination parameter of a finite population model without selection. Genet. Res. 50: 245–250.[Medline]

HUDSON, R. R., M. KREITMAN and M. AGUADÉ, 1987 A test of neutral molecular evolution based on nucleotide data. Genetics 116: 153–159.[Abstract/Free Full Text]

JANSA, S. A., B. L. LUNDRIGAN and P. K. TUCKER, 2003 Tests for positive selection on immune and reproductive genes in closely related species of the murine genus mus. J. Mol. Evol. 56: 294–307.[CrossRef][Medline]

KUMAR, S., K. TAMURA and M. NEI, 2004 MEGA3: integrated software for molecular evolutionary genetics analysis and sequence alignment. Brief. Bioinformatics 5: 150–163.[Abstract/Free Full Text]

LUNG, O., U. TRAM, C. M. FINNERTY, M. A. EIPPER-MAINS, J. M. KALB et al., 2002 The Drosophila melanogaster seminal fluid protein Acp62F is a protease inhibitor that is toxic upon ectopic expression. Genetics 160: 211–224.[Abstract/Free Full Text]

MCDONALD, J. H., and M. KREITMAN, 1991 Adaptive protein evolution at the Adh locus in Drosophila. Nature 351: 652–654.[CrossRef][Medline]

METZ, E. C., R. ROBLES-SIKISAKA and V. D. VACQUIER, 1998 Nonsynonymous substitution in abalone sperm fertilization genes exceeds substitution in introns and mitochondrial DNA. Proc. Natl. Acad. Sci. USA 95: 10676–10681.[Abstract/Free Full Text]

NIELSEN, H., J. ENGELBRECHT, S. BRUNAK and G. VON HEIGNE, 1997 A neural network method for identification of prokaryotic and eukaryotic signal peptides and prediction of their cleavage sites. Int. J. Neural Syst. 8: 581–599.[CrossRef][Medline]

NIELSEN, R., 2005 Molecular signatures of natural selection. Annu. Rev. Genet. 39: 197–218.[CrossRef][Medline]

NIELSEN, R., and Z. YANG, 1998 Likelihood models for detecting positively selected amino acid sites and applications to the HIV-1 envelope gene. Genetics 148: 929–936.[Abstract/Free Full Text]

OTTO, S. P., 2000 Detecting the form of selection from DNA sequence data. Trends Genet. 16: 526–529.[CrossRef][Medline]

PANHUIS, T. M., N. L. CLARK and W. J. SWANSON, 2006 Rapid evolution of reproductive proteins in Abalone and Drosophila. Phil. Trans. R. Soc. 361: 261–268.[CrossRef]

PARK, M., and W. F. WOLFNER, 1995 Male and female cooperate in the prohormone-like processing of a Drosophila melanogaster seminal fluid protein. Dev. Biol. 171: 694–702.[CrossRef][Medline]

PARKER, G. A., 1970 Sperm competition and its evolutionary consequences in the insects. Biol. Rev. 45: 525–567.

PRICE, C. S., 1997 Conspecific sperm precedence in Drosophila. Nature 388: 663–666.[CrossRef][Medline]

PRZEWORSKI, M., 2002 The signature of positive selection at randomly chosen loci. Genetics 160: 1179–1189.[Abstract/Free Full Text]

RAM, K. R., S. JI and M. F. WOLFNER, 2005 Fates and targets of male accessory gland porteins in mated female Drosophila melanogaster. Insect Biochem. Mol. Biol. 35: 1059–1071.[CrossRef][Medline]

RAMBAUT, A., 1996 Se-Al: Sequence Alignment Editor (http://evolve.zoo.ox.ac.uk/).

RAND, D. M., and L. M. KANN, 1996 Excess amino acid polymorphism in mitochondrial DNA: contrasts among genes from Drosophila, mice, and humans. Mol. Biol. Evol. 13: 735–748.[Abstract]

RICE, W. R., 1996 Sexually antagonistic male adaptation triggered by experimental arrest of female evolution. Nature 381: 232–234.[CrossRef][Medline]

ROZAS, J., and R. ROZAS, 1999 DnaSP version 3: an integrated program for molecular population genetics and molecular evolution analysis. Bioinformatics 15: 174–175.[Abstract/Free Full Text]

SCHLENKE, T. A., and D. J. BEGUN, 2004 Strong selective sweep associated with a transposon insertion in Drosophila simulans. Proc. Natl. Acad. Sci. USA 101: 1626–1631.[Abstract/Free Full Text]

SCHWEDE, T., J. KOPP, N. GUEX and M. C. PEITSCH, 2003 SWISS-MODEL: an automated protein homology-modeling server. Nucleic Acids Res. 31: 3381–3385.[Abstract/Free Full Text]

SWANSON, W. J., and V. D. VACQUIER, 2002 Rapid evolution of reproductive proteins. Nat. Rev. Genet. 3: 137–144.[Medline]

SWANSON, W. J., Z. YANG, M. F. WOLFNER and C. F. AQUADRO, 2001a Positive Darwinian selection drives the evolution of several female reproductive proteins in mammals. Proc. Natl. Acad. Sci. USA 98: 2509–2514.[Abstract/Free Full Text]

SWANSON, W. J., A. G. CLARK, H. M. WALDRIP-DAIL, M. F. WOLFNER and C. F. AQUADRO, 2001b Evolutionary EST analysis identifies rapidly evolving male reproductive proteins in Drosophila. Proc. Natl. Acad. Sci. USA 98: 7375–7379.[Abstract/Free Full Text]

SWANSON, W. J., A. WONG, M. F. WOLFNER and C. F. AQUADRO, 2004 Evolutionary expressed sequence tag analysis of Drosophila female reproductive tracts identifies genes subjected to positive selection. Genetics 168: 1457–1465.[Abstract/Free Full Text]

TAJIMA, F., 1989 Statistical methods for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 123: 585–595.[Abstract/Free Full Text]

THOMPSON, J. D., D. G. HIGGINS and T. J. GIBSON, 1994 Clustal W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22: 4673–4680.[Abstract/Free Full Text]

THORNTON, K., and P. ANDOLFATTO, 2006 Approximate Bayesian inference reveals evidence for a recent, severe bottleneck in a Netherlands population of Drosophila melanogaster. Genetics 172: 1607–1619.[Abstract/Free Full Text]

WOLFNER, M. F., 1997 Tokens of love: functions and regulation of Drosphila male accessory gland products. Insect Biochem. Mol. Biol. 27: 179–192.[CrossRef][Medline]

WOLFNER, M. F., 2002 The gifts that keep on giving: physiological functions and evolutionary dynamics of male seminal proteins in Drosophila. Heredity 88: 85–93.[CrossRef][Medline]

YANG, Z., 2000 Phylogenetic Analysis by Maximum Likelihood (PAML). University College London, London.

YANG, Z., 2005 The power of phylogenetic comparison in revealing protein function. Proc. Natl. Acad. Sci. USA. 102: 3179–3180.[Free Full Text]

YANG, Z., and J. P. BIELAWSKI, 2000 Statistical methods for detecting molecular adaptation. Trends. Ecol. Evol. 15: 496–503.[CrossRef][Medline]

YANG, Z., and W. J. SWANSON, 2002 Codon-substitution models to detect adaptive evolution that account for heterogeneous selective pressures among site classes. Mol. Biol. Evol. 19: 49–57.[Abstract/Free Full Text]

YANG, Z., R. NIELSEN, N. GOLDMAN and A. M. PEDERSEN, 2000 Codon-substitution models for heterogeneous selection pressure at amino acid sites. Genetics 155: 431–449.[Abstract/Free Full Text]

YANG, Z., W. S. WONG and R. NIELSEN, 2005 Bayes empirical bayes inference of amino acid sites under positive selection. Mol. Biol. Evol. 22: 1107–1118.[Abstract/Free Full Text]

Communicating editor: D. M. RAND




This article has been cited by other articles:


Home page
Mol Biol EvolHome page
A. Wong, M. C. Turchin, M. F. Wolfner, and C. F. Aquadro
Evidence for Positive Selection on Drosophila melanogaster Seminal Fluid Protease Homologs
Mol. Biol. Evol., March 1, 2008; 25(3): 497 - 506.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
W. Haerty, S. Jagadeeshan, R. J. Kulathinal, A. Wong, K. Ravi Ram, L. K. Sirot, L. Levesque, C. G. Artieri, M. F. Wolfner, A. Civetta, et al.
Evolution in the Fast Lane: Rapidly Evolving Sex-Related Genes in Drosophila
Genetics, November 1, 2007; 177(3): 1321 - 1335.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
M. K. N. Lawniczak and D. J. Begun
Molecular population genetics of female-expressed mating-induced serine proteases in Drosophila melanogaster
Mol. Biol. Evol., September 1, 2007; 24(9): 1944 - 1951.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
S. Jagadeeshan and R. S. Singh
Rapid Evolution of Outer Egg Membrane Proteins in the Drosophila melanogaster Subgroup: A Case of Ecologically Driven Evolution of Female Reproductive Traits
Mol. Biol. Evol., April 1, 2007; 24(4): 929 - 938.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
genetics.105.053611v1
173/4/2039    most recent
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Panhuis, T. M.
Right arrow Articles by Swanson, W. J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Panhuis, T. M.
Right arrow Articles by Swanson, W. J.


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Copyright © 2006 by the Genetics Society of America.