- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Supporting Information
-
All Versions of this Article:
genetics.108.099549v1
182/1/325 most recent - Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Email this article to a friend
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Omilian, A. R.
- Articles by Lynch, M.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Omilian, A. R.
- Articles by Lynch, M.
Originally published as Genetics Published Articles Ahead of Print on March 19, 2009.
Genetics, Vol. 182, 325-336, May 2009, Copyright © 2009
doi:10.1534/genetics.108.099549
Patterns of Intraspecific DNA Variation in the Daphnia Nuclear Genome
Angela R. Omilian*,
,1 and
Michael Lynch*
* Department of Biology, Indiana University, Bloomington, Indiana 47405 and
Department of Biological Sciences, State University of New York, Buffalo, New York 14260
1 Corresponding author: Department of Biological Sciences, Hochstetter Hall (Room 220) SUNY Buffalo, Buffalo, NY 14260.
E-mail: alr2{at}buffalo.edu
Understanding nucleotide variation in natural populations has been a subject of great interest for decades. However, many taxonomic groups, especially those with atypical life history attributes remain unstudied, and Drosophila is the only arthropod genus for which DNA polymorphism data are presently abundant. As a result of the recent release of the complete genome sequence and a wide variety of new genomic resources, the Daphnia system is quickly becoming a promising new avenue for expanding our knowledge of nucleotide variation in natural populations. Here, we examine nucleotide variation in six protein-coding loci for Daphnia pulex and its congeners with particular emphasis on D. pulicaria, the closest extant relative of D. pulex. Levels of synonymous intraspecific variation,
s, averaged 0.0136 for species in the Daphnia genus, and are slightly lower than most prior estimates in invertebrates. Tests of neutrality indicated that segregating variation conforms to neutral model expectations for the loci that we examined in most species, while Ka/Ks ratios revealed strong purifying selection. Using a full maximum-likelihood coalescent-based method, the ratio of the recombination rate to the mutation rate (c/u), averaged 0.5255 for species of the Daphnia genus. Lastly, a divergence population-genetics approach was used to investigate gene flow and divergence between D. pulex and D. pulicaria.
A thorough understanding of molecular variation in natural populations is fundamental to many important evolutionary issues. Thus, the characterization of molecular variation in natural populations and the discernment of the underlying forces that shape this variation have been a central focus of evolutionary geneticists for decades (e.g., KREITMAN 1983; MORIYAMA and POWELL 1996; SCHMID et al. 2005). The ability to collect vast amounts of DNA sequence data has improved greatly in recent years, and this has expanded our understanding of the mechanisms that maintain genetic variability—especially in model systems. However, many animal groups with atypical life history attributes remain unstudied, and Drosophila is the only arthropod genus for which DNA polymorphism data are presently abundant. Undoubtedly, a more complete understanding of nucleotide variation in natural populations can be obtained by examining additional taxonomic groups.
The waterflea Daphnia (Cladocera, Anomopoda) has been central to thousands of studies in many diverse areas of biology (reviewed in PETERS and DE BERNARDI 1987). Recently, Daphnia has been recognized as a new genetic model system. As a result, the scope of Daphnia research has expanded to include gene expression, mapping, and comparative genomic studies (e.g., WATANABE et al. 2005; CRISTESCU et al. 2006; OMILIAN et al. 2006; SOETAERT et al. 2006). Yet, comparatively little is known about daphniid population genetics at nuclear protein-coding loci. Past studies of genetic variation in daphniid populations mainly employed allozymes, microsatellites, and mitochondrial DNA (e.g., HEBERT 1974; MORT and WOLF 1986; CREASE et al. 1990; CREASE et al. 1997; WEIDER and HOBAEK 1997; PALSSON 2000), and few population-genetics analyses of nuclear DNA sequences are published (LITTLE et al. 2004; ISHIDA and TAYLOR 2007). A better understanding of Daphnia population-genetics parameters will facilitate the interpretation of the new genomic data that are derived from this organism, especially data that are interpreted in an evolutionary context.
In this study, we treat Daphnia pulex as the focal species and also report on smaller surveys of five of its congeners: D. pulicaria, D. obtusa, D. ambigua, D. magna, and D. mendotae. The latter four species are distantly related to D. pulex, being separated by millions of years of divergence (COLBOURNE and HEBERT 1996). However, the relationship between D. pulex and D. pulicaria is less clear. Previous studies on the basis of allozymes and mitochondrial DNA have regarded D. pulicaria as a sister species to D. pulex (HEBERT et al. 1993; COLBOURNE and HEBERT 1996). However, the species barrier between North American D. pulex and D. pulicaria is tenuous; the two are nearly morphologically indistinguishable and hybridize readily in nature. Although D. pulex–D. pulicaria hybrids reproduce via obligate parthenogenesis in the wild (e.g., HEBERT et al. 1993; HEBERT and FINSTON 2001), they are capable of producing progeny via sexual reproduction in the lab (HEIER and DUDYCHA 2009). The most obvious difference between D. pulex and D. pulicaria is their habitat: D. pulex usually reside in ephemeral fishless ponds, whereas D. pulicaria primarily inhabit lakes.
Thus, D. pulex and D. pulicaria appear to be in the process of ecological speciation, with differences between pond and lake habitats being the source of divergent selection. Divergence population genetics can be used to shed light on the divergence process between these two species. This approach is based on the premise that two nascent species share polymorphisms due to their recent divergence from a common ancestor, and as time progresses, shared polymorphisms are lost, and new mutations arise in each species. When multiple loci are examined, the variance in levels of divergence among loci is used to assess gene flow (e.g., WANG et al. 1997; MACHADO et al. 2002; HEY 2006). If some loci show large divergences while others do not, it may be postulated that gene flow is still occurring at some loci. Divergence in the face of gene flow implicates selection as a driving force for the divergence (MAYNARD-SMITH 1966; ENDLER 1977; RICE and HOSTERT 1993).
Here, we investigate the evolutionary processes shaping divergence between D. pulex and D. pulicaria by examining DNA sequence polymorphism data in six nuclear protein-coding loci. In particular, we ask whether an "isolation" speciation model, in which an ancestral population splits with no subsequent gene flow between descendent populations (WAKELEY and HEY 1997), or the "isolation with migration" model (WAKELEY and HEY 1998) is a better fit to the data. In doing so, multiple demographic parameters are estimated that characterize the divergence between D. pulex and D. pulicaria. This article also quantifies specieswide levels of nuclear variation in several other Daphnia species, and we report for all species included in this study the following basic population-genetics information: diversity estimates, tests of neutrality, Ka/Ks ratios, and the recombination rate (c/u).
Taxonomic sampling:
Species delimitation and nomenclature within the D. pulex complex is problematic (MERGEAY et al. 2008). Thus, we present two salient issues that warrant attention for this study. The D. pulex species name is used to denote two different lineages, one that inhabits Europe (sensu LEYDIG 1860), and the other is primarily found in North America (sensu HEBERT 1995). In the same vein, two different lineages are named D. pulicaria; European D. pulicaria (sensu ALONSO 1996) are distinct from North American D. pulicaria populations (sensu FORBES 1893). Here, we refer only to North American D. pulex and D. pulicaria, and restrict our conclusions to these lineages.Six loci were examined in D. pulex, D. pulicaria, and D. obtusa. Due to the difficulty of obtaining PCR products, two loci were analyzed for three additional species that are much more distant in their taxonomic relationship to D. pulex: D. ambigua, D. mendotae, and D. magna. For D. pulex, two individuals per collection site were sampled (36 alleles, supporting information, Table S1). Obligate parthenogens that are known to occur in D. pulex were not included in this study. For all other species, one individual from each of nine collection sites was sampled (18 alleles per species, see Table S1 for exceptions). With the exception of D. pulex, each individual sampled within a species was from a unique collection site, making it impracticable to address questions at the level of the deme. This specieswide sampling strategy from multiple demes may increase the effective population size relative to that of a single deme (WRIGHT 1943). With multiple subdivided demes, the migration rate among demes can influence inferences regarding divergence time, relative population sizes, and phylogenetic relationships (WAKELEY 2000), and the appropriateness of our sampling approach depends on how well Daphnia fit the assumptions of an island model of migration with many demes. Moreover, Daphnia are cyclic parthenogens, and asexual periods of reproduction may influence adherence to the standard coalescent, potentially influencing our analyses. Although it is beyond the scope of the present article to deal formally with this issue, we expect that the influence of selection on linked sites will be greater in a cyclically parthenogenetic population than a purely sexual population, thereby increasing the among-individual variance in reproductive success and reducing effective population size.
Because taxonomic differentiation on the basis of morphology is notoriously difficult in Daphnia, we verified the taxonomic identification of all individuals with sequence data from the 12S rDNA gene (COLBOURNE and HEBERT 1996). Allozyme analysis for the lactate dehydrogenase (LDH) locus was used to differentiate between D. pulex and D. pulicaria (Table S1), following the conventional notion that D. pulicaria is homozygous for the F allele and D. pulex is homozygous for the S allele (HEBERT et al. 1989, 1993).
It has been proposed that certain populations of D. pulex endemic to Oregon are a separate species called D. arenata (HEBERT 1995). Phylogenies on the basis of mitochondrial DNA sequences show that D. pulex is paraphyletic with respect to these Oregon populations (COLBOURNE et al. 1998; LYNCH et al. 2008), but nuclear data reveal support for D. arenata (OMILIAN et al. 2008). This study includes two populations from Oregon (denoted LOG and GI); one population (LOG) consists of D. arenata, while the GI population is likely to be of hybrid origin (OMILIAN et al. 2008). For our population-genetics analyses, we found that the D. arenata individuals were sufficiently divergent to be analyzed separately from D. pulex. The GI individuals were excluded from most analyses.
Locus information:
Loci were chosen on the basis of the presence of conserved regions for primer design in sequences present in cDNA libraries from D. pulex and D. magna (cDNA libraries provided by John Colbourne and Hajime Watanabe, see Table S2 for primer sequences). These loci appeared to be single-copy genes based on BLAST searches to the D. pulex genome (v.1.0), and data obtained from cloning did not reveal evidence for more than two alleles per individual at any locus. We examined sequences from the following loci: ATP synthase epsilon chain (atp-ep), a calcium-binding protein with an EF-hand (cbp-EF), cleavage stimulation factor (cstf), glyceraldehyde-3-phosphate dehydrogenase (g3pdh), a rab GTPase(rab4), and a protein with translation initiation factor and subtilase activity (tif).
PCR amplification, DNA sequencing, and cloning:
Specific PCR information is described in OMILIAN et al. (2008). Briefly, Taq polymerase with proofreading capability (Clontech) was used, and PCR products were sequenced in both directions. Because Daphnia were from natural populations, direct sequencing of PCR products revealed many heterozygous loci, that is, two overlapping peaks were observed for at least one site of the DNA sequence electropherogram. Heterozygous sites were detected with CodonCode Aligner software v1.4.3 set at the highest sensitivity and then verified by eye. Approximately 50% of our amplicons had multiple heterozygous sites and were cloned with the Invitrogen TOPO TA kit to determine the gametic phase. The QIAprep Spin miniprep kit (QIAGEN) was used for plasmid purification, and a T7 primer was used to sequence the cloned inserts. Four to 12 cloned fragments were sequenced, and the results from cloned PCR products were compared to the direct sequences of PCR products to ensure that polymorphisms were not the result of PCR or cloning-induced errors (CRONN et al. 2002).
Sequence analysis and statistical tests:
Polymorphism patterns:
Sequences were aligned with MEGA version 3.1 (KUMAR et al. 2004), using the ClustalW algorithm (THOMPSON et al. 1994) and then manually corrected. Population-genetics parameters and tests of neutrality were calculated with DnaSP version 4.0 (ROZAS et al. 2003) unless noted otherwise. Two measures of nucleotide diversity were estimated:
, the average of pairwise differences among DNA sequences (TAJIMA 1983), obtained using the Jukes–Cantor correction (JUKES and CANTOR 1969; LYNCH and CREASE 1990); and
, based on the total number of segregating mutations in the sample (WATTERSON 1975; TAJIMA 1996). Under neutrality,
and
are expected to be equal and estimators of 4Neu, where Ne is the genetic effective population size and u is the mutation rate per site per generation. Both
and
were estimated for the following categories of nucleotide sites/regions: total (
T,
T), nonsynonymous (
n,
n), synonymous (
s,
s), intron (
i,
i), and UTR (
U,
U).
Recombination:
A full-likelihood coalescence-based approach in LAMARC 2.0 (KUHNER 2006) was used to estimate the recombination rate, rLAMARC = c/µ, where c is the recombination rate per site per generation and µ is the neutral mutation rate per site per generation. LAMARC uses a finite-sites model and operates under the following assumptions: (1) recombination frequency is not affected by sequence divergence, (2) all recombination is homologous, (3) gene conversion and interference do not occur, (4) recombination events are selectively neutral, and (5) the recombination rate is constant across the region. We excluded loci with 10 or fewer variable sites because LAMARC requires sufficient numbers of variable sites to get reliable estimates of the parameters.Because it was not possible to implement the best fitting model (determined by Modeltest v3.7; POSADA and CRANDALL 1998) for most loci in LAMARC, we used the Felsenstein '84 (F84) model with empirical base frequencies (KISHINO and HASEGAWA 1989; FELSENSTEIN 1993); for most loci, the F84 model is a closer fit than the other model offered by LAMARC and is computationally faster. Two categories of relative mutation rate were assigned, accounting for substitution rate differences between nonsynonymous sites and all other sites. Our sampling strategy included 20 initial chains of 1000 and 2 final chains of 50,000 genealogies with 1000 genealogies discarded per chain. Adaptive heating was used to improve the search of parameter space. The entire analysis was replicated three times and the results were combined (GEYER 1991).
Tests of neutrality and Ka/Ks ratios:
Two tests of neutrality were conducted: the Hudson–Kreitman–Aguade test (HKA, HUDSON et al. 1987) and Tajima's D test (TAJIMA 1989). We used a multilocus version of the HKA test (Hey's HKA program; http://lifesci.rutgers.edu/
heylab). Test statistics for both HKA and Tajima's D tests were compared with distributions generated from 10,000 coalescent simulations to determine significance. Values for Ka and Ks were calculated in MEGA using the Kumar method, which is a modification of the Pamilo–Bianchi–Li (PAMILO and BIANCHI 1993; LI 1993) and COMERON (1995) methods; this method accounts for some problematic degeneracy assignments. For both Ka and Ks we subtracted the mean within-species diversity (averaged between the two species being compared) from raw estimates of divergence to get the net divergence between species. Standard errors for Ka/Ks ratios were calculated from equation A1.19b from LYNCH and WALSH (1998).
Phylogenetic analyses:
We used MrBayes 3.1.2 (HUELSENBECK and RONQUIST 2001) to elucidate the genealogical relationships of D. pulex, D. pulicaria, and D. arenata alleles with Bayesian inference. The six loci were concatenated in an alignment of 3160 bp for a total evidence analysis. The tree was rooted with D. obtusa, which is an uncontroversial outgroup. Large indel mutations that were likely to be the result of one insertion event, rather than several independent events, were weighted as one event (e.g., a novel intron insertion in some individuals).
We conducted an analysis without partitioning and an analysis with the sequence data partitioned by one noncoding and three codon positions (first, second, and third codon positions). Model selection for each partition was made according to the Akaike information criterion in Modeltest v3.7 (POSADA and CRANDALL 1998). For MrBayes, default prior settings were used except for the ratepr parameter that was set to variable for the partitioned analyses, so that partitions could evolve at different rates. Branch lengths and topology were shared among partitions, but the substitution rate matrix, state frequency, and shape parameter of the
-distribution were unlinked, allowing for separate parameter estimates. Two independent and simultaneous Markov chain Monte Carlo (MCMC) analyses of 15 heated and one cold chain were run for 6 million generations sampling from the chain every 100 generations. After determining chain convergence (average standard deviation of split frequencies <0.01), we discarded the initial 25% of trees as a burn-in. A 50% majority-rule consensus topology with posterior probability (PP) values for each node was constructed from the post-burn-in trees.
Genealogical relationships were further examined with median-joining haplotype networks using the program Network version 4.5 (BANDELT et al. 1999; http://www.fluxus-engineering.com). This method accounts for the coexistence of ancestral and descendent haplotypes, multifurcations, and reticulate relationships (POSADA and CRANDALL 2001). We used the MP post-processing option, which removes all superfluous median vectors and links that are not contained in the shortest trees of the network.
Divergence population genetics:
A Bayesian methodology employed by the Isolation with Migration (IMa) software (HEY and NIELSEN 2007) was used to generate posterior probability distributions for six parameters relevant to the divergence between D. pulex and D. pulicaria. In turn, these parameters were used to estimate the time since divergence, migration rates, and effective population sizes for D. pulex, D. pulicaria, and their ancestral population. Assuming five generations per year in Daphnia, we applied the mutation rate 2.9 x 10–8 mutations/site/year (single-nucleotide mutation rate from HAAG-LIAUTARD et al. 2007). Although this mutation rate is from a distant relative (Drosophila), it remains the most rigorously determined arthropod nuclear rate that is available. Because there is error associated with mutation-rate estimation (outlined in HAAG-LIAUTARD et al. 2007) and the application of a mutation rate from a distant relative, we also present the scaled IMa parameters.Once appropriate prior distributions were identified (WON and HEY 2005), we used multiple runs with identical conditions except the random number seed to determine convergence. We applied the HKY mutation model (HASEGAWA et al. 1985) to accommodate multiple substitutions at a single nucleotide site. Five chains with heating were used for each run. Runs of 60,000,000 steps and a burn-in period of 100,000 steps provided for effective sample sizes of at least 95.
The isolation with migration methodology is contingent upon several assumptions. First, it is assumed that variation within the dataset is selectively neutral. Second, the sampled populations, D. pulex and D. pulicaria, are assumed to be more closely related to each other than other populations, and gene flow with unsampled populations is assumed to be nonexistent. Lastly, there should be no recombination within loci and free recombination between loci (HEY and NIELSEN 2004). The assumption of selective neutrality is likely to hold as neutrality tests indicated that sequences for D. pulex and D. pulicaria conform to neutral model expectations (see RESULTS). Phylogenetic analyses from previous studies reveal mixed results regarding the closest relative to D. pulex; studies on the basis of mitochondrial DNA usually reconstruct D. arenata as the closest relative to D. pulex (COLBOURNE et al. 1998; MERGEAY et al. 2008), whereas nuclear DNA reveals that D. pulex and D. pulicaria are sister groups (OMILIAN et al. 2008; this study). Thus, we conducted three separate IMa analyses in which the following were compared: (1) D. pulex and D. pulicaria, (2) D. pulex and D. arenata, and (3) D. arenata and D. pulicaria. Because the assumption of no intralocus recombination is violated in our data set, we also assessed migration with a full-likelihood coalescence-based approach in LAMARC that accounts for recombination.
Polymorphism patterns, tests of neutrality, and Ka/Ks ratios:
Levels of DNA polymorphism varied in Daphnia according to functional site/region, locus, and species (Table 1). Nucleotide diversity was far lower for nonsynonymous than synonymous and intron sites (P
0.0001, Figure 1). Diversity in UTRs was threefold lower than at synonymous sites or introns (P
0.002, Figure 1). Synonymous-site diversity averaged across all loci was highest in D. ambigua (
s = 0.0310), lowest in D. arenata (
s = 0.0017), and the focal species D. pulex and D. pulicaria were intermediate with values of 0.0188 and 0.0159, respectively (Table 2). Relative to the mutation rate, the overall recombination rate per site per generation (c/µ) ranged from 0.2383 to 0.7611 for Daphnia species and averaged 0.5255 for the Daphnia genus (Table 3).
|
|
|
|
We did not observe a significant departure from neutrality for any locus with the HKA test (Table 4). A significant negative mean Tajima's D was observed only for D. obtusa (Table 5). Synonymous (Ks) and nonsynonymous (Ka) divergences among D. pulex and its congeners are shown in Table 6; Ks values were higher than Ka values for all loci, and Ka/Ks ratios ranged from –0.0120 to 0.0501. D. pulex and D. pulicaria were not significantly different at silent or replacement sites (Table 6).
|
|
|
Evolutionary relationships between D. pulex and D. pulicaria:
In MrBayes, the lowest average standard deviation of split frequencies was obtained with the unpartitioned data set that employed the GTR + I +
model of nucleotide evolution; the 50% majority-rule consensus Bayesian topology is presented from this analysis (Figure 2). The consensus tree revealed that D. pulex consists of two major monophyletic groups that are paraphyletic with respect to D. pulicaria. We find that D. pulicaria is a strongly supported monophyletic group (PP = 0.99), and D. pulex and D. pulicaria are more closely related to each other than either is to D. arenata, which is also a distinct monophyletic group (PP = 1). However, support was low for the sister relationship of D. pulex and D. pulicaria to the exclusion of D. arenata (PP = 0.65). The GI population is unique in that one of the sampled individuals groups with D. pulicaria and the other groups with D. pulex.
|
Because a bifurcating genealogy neglects recombination within a species, we also constructed a network. Network results were consistent with those obtained from the Bayesian analysis (Figure S1).
Divergence population genetics:
For D. pulex and D. pulicaria, multiple IMa runs with identical conditions except the starting seed revealed clear marginal posterior probability distributions of the parameters, with unimodal peaks for all curves; results are presented from a single representative run (Figure 3, Table 7). For gene flow from D. pulicaria to D. pulex, the highest probability of the posterior distribution is near zero, and the curve drops to zero at higher migration rates (Figure 3). For gene flow from D. pulex to D. pulicaria, a model of no gene flow can be rejected, but migration is low (Figure 3, Table 7). D. pulex and D. pulicaria are estimated to have diverged
82,000 years ago (95% HPD interval: 58,000–126,000). Population size for D. pulex is significantly larger than the ancestral population, suggesting population expansion; this is consistent with the negative mean value for Tajima's D observed in this species (Tables 5 and 7).
|
|
Migration analyes in LAMARC, which accounts for recombination within nuclear loci, determined that migration estimates were roughly an order of magnitude higher than those obtained with IMa (Table 7). The effective number of gene migrants (2Nem) from D. pulicaria to D. pulex, per generation was 1.71 (95% support interval: 1.34–2.14), and for D. pulex to D. pulicaria, 1.67 (95% support interval: 1.28–2.10).
Despite several varied attempts, IMa runs for D. pulex and D. arenata, and D. pulicaria and D. arenata, did not have a narrow unimodal peak for the divergence-time parameter. Instead, the highest likelihood appears to have an infinitely wide range of parameter values. Thus, all parameter estimates from these runs may be invalid and are not presented. This is likely due to the low levels of variation observed in the D. arenata sequences.
Patterns of nucleotide polymorphism:
Averaged across species, daphniid nucleotide diversity (
s = 0.0136,
s = 0.0141) is lower than the average of other invertebrates (
s = 0.0265, LYNCH 2006). In reference to two prominent arthropod species, synonymous diversity in Daphnia is similar to Drosophila melanogaster (
s = 0.0158, ANDOLFATTO 2001) and lower than Anopheles gambiae (
s = 0.0235, BESANSKY et al. 2003; MUKABAYIRE et al. 2001). In Daphnia, there is considerable heterogeneity in polymorphism levels for the various sites/regions and loci spanning a range from 0.0000 to 0.0694 (Table 1, Figure 1). Levels of diversity for nonsynonymous polymorphisms are far lower than for synonymous polymorphisms, reflecting purifying selection against amino acid substitutions. Mean
i and
s are significantly higher than
U (P < 0.0001 and P = 0.002, respectively), suggesting that UTRs are selectively constrained in Daphnia. These results indicate that synonymous, intron, and UTR sites should not be grouped together under the umbrella of silent sites in Daphnia and are consistent with studies that document lower levels of polymorphism in UTR regions than synonymous sites (e.g., ANDOLFATTO 2005). Levels of polymorphism were heterogeneous among the different species examined (Tables 1 and 2). Nucleotide diversity is notably low for D. mendotae and D. arenata, being nearly an order of magnitude lower than the other examined species for some functional sites/regions. These low diversity estimates may be due to sampling effects as only two loci were examined for D. mendotae and only four individuals were available for D. arenata. Alternatively, low estimates of nucleotide diversity could be indicative of small effective population size or lower mutation rates in these species (reviewed in WRIGHT and GAUT 2005).
The population-genetic consequences of cyclical parthenogenesis are largely unexplored empirically. Daphnia are cyclical parthenogens that experience phases of clonal reproduction, so it is of interest if they exhibit low rates of recombination. Low recombination may cause greater vulnerability to hitchhiking effects as recombination separates independently arising mutations. The scaled recombination rate, c/µ, can be used to address this question, as recombinational activity across groups can be compared (e.g., HUDSON 1987; HADDRILL et al. 2005). In general, c/µ estimates are similar among Daphnia species, ranging from 0.2383 to 0.7611 (Table 3). These estimates are likely to be a lower bound on the recombination rate in Daphnia given that gene conversion may be an important factor for closely linked sites (ANDOLFATTO and NORDBORG 1998), and LAMARC does not account for gene conversion. Nevertheless, despite having phases of clonal reproduction, Daphnia appear to have recombination rates that are similar to other organisms for which estimates are available (LYNCH 2007), save Drosophila, which have higher relative recombination rates than most animals (HADDRILL et al. 2005; LYNCH 2007).
A longstanding concern of population geneticists is quantifying how natural selection influences levels of genetic variation. The evaluation of intraspecific polymorphism and interspecific divergence can elucidate whether neutral, selective, and/or demographic forces are acting—especially when multiple loci are examined. Neutrality tests (HKA and Tajima's D) suggest that all species of Daphnia examined in this study conform to neutral model expectations, with the exception of D. obtusa, which had a significantly negative mean value for Tajima's D (Tables 4 and 5). The ratios of the number of nonsynonymous substitutions per nonsynonymous site (Ka) to the number of synonymous substitutions per synonymous site (Ks) were low for all Daphnia species, suggesting strong purifying selection (Table 6). This observation is potentially a consequence of our focus on functionally conserved genes, to maximize PCR success across a wide range of Daphnia species.
Species barrier between D. pulex and D. pulicaria:
The observation of virtually no divergence at synonymous sites (average Ks = 0.0034, standard error = 0.0038) between D. pulex and D. pulicaria suggests that the species status assigned to D. pulicaria may be inappropriate. Furthermore, previous work has shown that D. pulicaria is often paraphyletic with respect to D. pulex (e.g., CREASE et al. 1989; LEHMAN et al. 1995; CREASE et al. 1997; DUDYCHA 2004). Yet, for some loci/types of data, D. pulicaria does have a unique genotypic constellation to the exclusion of D. pulex (e.g., CREASE et al. 1997; COLBOURNE et al. 1998;
ERN
and HEBERT 1999). Our total evidence analysis of six nuclear protein-coding loci reveals that D. pulicaria and D. arenata are monophyletic groups (PP = 0.99, PP = 1, respectively; Figure 2), supporting the notion that D. pulex, D. pulicaria, and D. arenata are distinct species. Many studies report improved resolution and greater node support when data from multiple loci are combined (e.g., REMSEN and DESALLE 1998), perhaps explaining the paraphyly for D. pulex and D. pulicaria in earlier studies that is not observed here. We used a divergence population-genetics approach to address whether a simple isolation model is appropriate for D. pulex and D. pulicaria. Both IMa and LAMARC analyses reveal that gene flow does occur between D. pulex and D. pulicaria, thereby ruling out a strict isolation speciation model. One migrant gene copy per generation, on average, may prevent substantial divergence at a locus (WRIGHT 1931). Although the point estimates obtained with IMa are clearly below this, the upper bound of the 90% highest posterior density intervals exceed one, as do the migration estimates obtained with LAMARC. Despite the fact that LAMARC and IMa differ in their underlying assumptions, the confidence intervals for the estimates obtained with the different programs do overlap, with estimates obtained in LAMARC being higher (Table 7).
In contrast to IMa, LAMARC assumes migration equilibrium and accounts for intralocus recombination. Loci with unacknowledged recombination are expected to have longer gene trees, on average, than nonrecombining loci (WON and HEY 2005). Here, inferred migration events are spread over a longer time period, perhaps explaining the lower gene flow estimates obtained with the IMa program. Other studies that have used IMa with the largest nonrecombining block of sequence data vs. the full locus (with recombination) have revealed that migration estimates are not significantly affected (although in one case they did decrease); however, estimates of population size are substantially inflated when ignoring recombination (BULL et al. 2006; STRASBURG and RIESEBERG 2008). Thus, estimates of population size for D. pulex and D. pulicaria that were obtained in IMa should be regarded as an upper bound.
Numerous studies have outlined physiological and life history differences between North American D. pulex and D. pulicaria (e.g., BRANDLOVA et al. 1972; TESSIER and CONSOLATTI 1991; DUDYCHA 2004; WEIDER et al. 2004). In some parts of their distribution, D. pulex and D. pulicaria switch to sexual reproduction in response to different photoperiodic cues, resulting in a prereproductive isolating barrier (DENG 1997), and they usually occupy different but geographically overlapping habitats. D. pulex typically resides in vernal ponds with leaf litter whereas D. pulicaria occupies lakes that contain fish (e.g., HEBERT et al. 1993; HEBERT 1995; HEBERT and FINSTON 2001). Despite the above observations, hybrids between D. pulex and D. pulicaria do commonly occur, but are found to be obligately asexual in nature (e.g., HEBERT and CREASE 1983; HEBERT et al. 1993; HEBERT and FINSTON 2001). Laboratory studies, however, have revealed that these hybrids are sexual and easily backcrossed (HEIER and DUDYCHA 2009). Our divergence population genetics approach, on the basis of nuclear loci, is consistent with the above observations—D. pulex and D. pulicaria have undergone genetic differentiation as a result of different ecological pressures, but still exchange genes at a level that may be high enough to prevent reproductive isolation.
Supporting information is available online at http://www.genetics.org/cgi/content/full/genetics.108.099549/DC1.
ALONSO, M., 1996 Crustacea, Branchiopoda, Anomopoda. Museum Nacional de Ciencias Naturales, Madrid.
ANDOLFATTO, P., 2001 Contrasting patterns of X-linked and autosomal nucleotide variation in Drosophila melanogaster and Drosophila simulans. Mol. Biol. Evol. 18: 279–290.
ANDOLFATTO, P., 2005 Adaptive evolution of non-coding DNA in Drosophila. Nature 437: 1149–1152.[CrossRef][Medline]
ANDOLFATTO, P., and M. NORDBORG, 1998 The effect of gene conversion on intralocus associations. Genetics 148: 1397–1399.
BANDELT, H. J., P. FORSTER and A. RÖHL, 1999 Median-joining networks for inferring intraspecific phylogenies. Mol. Biol. Evol. 16: 37–48.[Abstract]
BESANSKY, N. J., J. KRZYWINSKI, T. LEHMANN, F. SIMARD, M. KERN et al., 2003 Semipermeable species boundaries between Anopheles gambiae and Anopheles arabiensis: evidence from multilocus DNA sequence variation. Proc. Natl. Acad. Sci. USA 100: 10818–10823.
BRANDLOVA, J., Z. BRANDL and C. H. FERNANDO, 1972 The Cladocera of Ontario with remarks on some species and distribution. Can. J. Zool. 50: 1373–1403.[CrossRef]
BULL, V., M. BELTRAN, C. D. JIGGINS, W. O. MCMILLAN, E. BERMINGHAM et al., 2006 Polyphyly and gene flow between non-sibling Heliconius species. BMC Biol. 4: 11.[CrossRef][Medline]
ERN
, M., and P. D. N. HEBERT, 1999 Intercontinental allozyme differentiation among four holarctic Daphnia species. Limnol. Oceanogr. 44: 1381–1387.
COLBOURNE, J. K., and P. D. N. HEBERT, 1996 The systematics of North American Daphnia (Crustacea: Anomopoda): a molecular phylogenetic approach. Proc. R. Soc. Lond. Ser. B 351: 349–360.
COLBOURNE, J. K., T. J. CREASE, L. J. WEIDER, P. D. N. HEBERT, F. DUFRESNE et al., 1998 Phylogenetics and evolution of a circumarctic species complex (Cladocera: Daphnia pulex). Biol. J. Linn. Soc. 65: 347–365.
COMERON, J. M., 1995 A method for estimating the numbers of synonymous and nonsynonymous substitutions per site. J. Mol. Evol. 41: 1152–1159.[Medline]
CREASE, T. J., D. J. STANTON and P. D. N. HEBERT, 1989 Polyphyletic origins of asexuality in Daphnia pulex. Mitochondrial DNA variation. Evolution 43: 1016–1026.[CrossRef]
CREASE, T. J., M. LYNCH and K. SPITZE, 1990 Hierarchical analysis of isozyme and mitochondrial DNA variation in Daphnia pulex. Mol. Biol. Evol. 7: 444–458.[Abstract]
CREASE, T. J., S. K. LEE, S. L. YU, K. SPITZE, N. LEHMAN et al., 1997 Allozyme and mtDNA variation in populations of the Daphnia pulex complex from both sides of the Rocky Mountains. Heredity 79: 242–251.[CrossRef]
CRISTESCU, M. E. A., J. K. COLBOURNE, Y. RADIVOJAC and M. LYNCH, 2006 A microsatellite based genetic linkage map of the waterflea, Daphnia pulex: on the prospect of crustacean genomics. Genomics 88: 415–430.[CrossRef][Medline]
CRONN, R., M. CEDRONI, T. HASELKORN, C. GROVER and J. F. WENDEL, 2002 PCR-mediated recombination in amplification products derived from polyploid cotton. Theor. Appl. Genet. 104: 482–489.[CrossRef][Medline]
DENG, H. W., 1997 Photoperiodic response of sexual reproduction in the Daphnia pulex group is reversed in two distinct habitats. Limnol. Oceanogr. 42: 609–611.
DUDYCHA, J. L., 2004 Mortality dynamics of Daphnia in contrasting habitats and their role in ecological divergence. Freshw. Biol. 49: 505–514.[CrossRef]
ENDLER, J. A., 1977 Geographic Variation, Speciation, and Clines. Princeton University Press, Princeton, NJ.
FELSENSTEIN, J., 1993 PHYLIP (Phylogeny Inference Package), version 3.5c. Department of Genetics, University of Washington, Seattle.
FORBES, S. A., 1893 A preliminary report on the aquatic invertebrate fauna of the Yellowstone National Park, Wyoming, and the Flathead Region of Montana. Bull. U.S. Fish Comm. 11: 207–256.
GEYER, C. J., 1991 Estimating normalizing constants and reweighting mixtures in Markov chain Monte Carlo. Technical Report No. 568, revised 1994. School of Statistics, University of Minnesota, Minneapolis.
HAAG-LIAUTARD, C., M. DORRIS, X. MASIDE, S. MACASKILL, D. HALLIGAN et al., 2007 Direct estimation of per nucleotide and genomic deleterious mutation rates in Drosophila. Nature 445: 82–85.[CrossRef][Medline]
HADDRILL, P. R., K. R. THORNTON, B. CHARLESWORTH and P. ANDOLFATTO, 2005 Multilocus patterns of nucleotide variability and the demographic and selection history of Drosophila melanogaster populations. Genome Res. 15: 790–799.
HASEGAWA, M., H. KISHINO and T. A. YANO, 1985 Dating of the human-ape splitting by a molecular clock of mitochondrial DNA. J. Mol. Evol. 22: 160–174.[CrossRef][Medline]
HEBERT, P. D. N., 1974 Enzyme variability in natural populations of Daphnia magna II. Genotypic frequencies in permanent populations. Genetics 77: 323–334.
HEBERT, P. D. N., 1995 The Daphnia of North America: An Illustrated Fauna. CD-ROM, University of Guelph, Ontario, Canada.
HEBERT, P. D. N., and T. J. CREASE, 1983 Clonal diversity in populations of Daphnia pulex reproducing by obligate parthenogenesis. Heredity 51: 353–369.[CrossRef]
HEBERT, P. D. N., and T. L. FINSTON, 2001 Macrogeographic patterns of breeding system diversity in the Daphnia pulex group from the United States and Mexico. Heredity 87: 153–161.[CrossRef][Medline]
HEBERT, P. D. N., M. J. BEATON, S. S. SCHWARTZ and D. J. STANTON, 1989 Polyphyletic origins of asexuality in Daphnia pulex. I. Breeding system variation and levels of clonal diversity. Evolution 43: 1004–1015.[CrossRef]
HEBERT, P. D. N., S. S. SCHWARTZ, R. D. WARD and T. L. FINSTON, 1993 Macrogeographic patterns of breeding system diversity in the Daphnia pulex group. I. Breeding systems of Canadian populations. Heredity 70: 148–161.[CrossRef]
HEIER, C. R., and J. L. DUDYCHA, 2009 Ecological speciation in cyclic parthenogens: sexual capability of experimental hybrids between Daphnia pulex and Daphnia pulicaria. Limnol. Oceanogr. 52: 492–502.
HEY, J., 2006 Recent advances in assessing gene flow between diverging populations and species. Curr. Opin. Genet. Dev. 16: 592–596.[CrossRef][Medline]
HEY, J., and R. NIELSEN, 2004 Multilocus methods for estimating population sizes, migration rates, and divergence time, with applications to the divergence of Drosophila pseudoobscura and D. persimilis. Genetics 167: 747–760.
HEY, J., and R. NIELSEN, 2007 Integration within the Felsenstein equation for improved Markov chain Monte Carlo methods in population genetics. Proc. Natl. Acad. Sci. USA 104: 2785–2790.
HUDSON, R. R., 1987 Estimating the recombination parameter of a finite population model without selection. Genet. Res. 50: 245–250.[Medline]
HUDSON, R. R., M. KREITMAN and M. AGUADE, 1987 A test of neutral molecular evolution based on nucleotide data. Genetics 116: 153–159.
HUELSENBECK, J. P., and F. RONQUIST, 2001 MRBAYES: Bayesian inference of phylogenetic trees. Bioinformatics 17: 754–755.
ISHIDA, S., and D. J. TAYLOR, 2007 Mature habitats associated with genetic divergence despite strong dispersal ability in an arthropod. BMC Evol. Biol. 7: 52.[CrossRef][Medline]
JUKES, T. H., and C. R. CANTOR, 1969 Evolution of protein molecules, pp. 21–132 in Mammalian Protein Metabolism, edited by H. N. MUNRO. Academic Press, New York.
KISHINO, H., and M. HASEGAWA, 1989 Evaluation of the maximum likelihood estimate of the evolutionary tree topologies from DNA sequence data and the branching order in Hominoidea. J. Mol. Evol. 29: 170–179.[CrossRef][Medline]
KREITMAN, M., 1983 Nucleotide polymorphism at the alcohol-dehydrogenase locus of Drosophila melanogaster. Nature 304: 412–417.[CrossRef][Medline]
KUHNER, M. K., 2006 LAMARC 2.0: maximum likelihood and Bayesian estimation of population parameters. Bioinformatics 22: 768–770.
KUMAR, S., K. TAMURA and M. NEI, 2004 MEGA3: integrated software for molecular evolutionary genetics analysis and sequence alignment. Brief. Bioinformatics 5: 150–163.
LEHMAN, N., M. E. PFRENDER, P. A. MORIN, T. J. CREASE and M. LYNCH, 1995 A hierarchical molecular phylogeny within the genus Daphnia. Mol. Phylogenet. Evol. 4: 395–407.[CrossRef][Medline]
LEYDIG, F., 1860 Naturgeschichte der Daphniden. Verlag der H. Laupp'shen Buchhandlung, Tübingen, Germany.
LI, W-H., 1993 Unbiased estimation of the rates of synonymous and nonsynonymous substitution. J. Mol. Evol. 36: 96–99.[CrossRef][Medline]
LITTLE, T. J., J. K. COLBOURNE and T. J. CREASE, 2004 Molecular evolution of Daphnia immunity genes: polymorphism in a gram-negative binding protein gene and an alpha-2-macroglobulin gene. J. Mol. Evol. 59: 498–506.[CrossRef][Medline]
LYNCH, M., 2006 The origins of eukaryotic gene structure. Mol. Biol. Evol. 23: 450–468.
LYNCH, M., 2007 The Origins of Genome Architecture. Sinauer Associates, Sunderland, MA.
LYNCH, M., and T. J. CREASE, 1990 The analysis of population survey data on DNA sequence variation. Mol. Biol. Evol. 7: 377–394.[Abstract]
LYNCH, M., and B. WALSH, 1998 Genetics and Analysis of Quantitative Traits. Sinauer Associates, Sunderland, MA).
LYNCH, M., A. SEYFERT, B. EADS and E. WILLIAMS, 2008 Localization of the genetic determinants of meiosis suppression in Daphnia pulex. Genetics 180: 317–327.
MACHADO, C. A., R. M. KLIMAN, J. A. MARKERT and J. HEY, 2002 Inferring the history of speciation from multilocus DNA sequence data: the case of Drosophila pseudoobscura and close relatives. Mol. Biol. Evol. 19: 472–488.
MAYNARD SMITH, J., 1966 Sympatric speciation. Am. Nat. 100: 637–650.[CrossRef]
MERGEAY, J., X. AGUILERA, S. DECLERCK, A. PETRUSEK, T. HUYSE et al., 2008 The genetic legacy of Bolivian Daphnia: the tropical Andes as a source for the North and South American D. pulicaria complex. Mol. Ecol. 17: 1789–1800.[Medline]
MORIYAMA, E. N., and J. R. POWELL, 1996 Intraspecific nuclear DNA variation in Drosophila. Mol. Biol. Evol. 13: 261–277.[Abstract]
MORT, M. A., and H. G. WOLF, 1986 The genetic structure of large lake Daphnia populations. Evolution 40: 756–766.[CrossRef]
MUKABAYIRE, O., J. CARIDI, X. WANG, Y. T. TOURE, M. COLUZZI et al., 2001 Patterns of DNA sequence variation in chromosomally recognized taxa of Anopheles gambiae: evidence from rDNA and single copy loci. Insect Mol. Biol. 10: 33–46.[CrossRef][Medline]
OMILIAN, A. R., M. E. A. CRISTESCU, J. L. DUDYCHA and M. LYNCH, 2006 Ameiotic recombination in asexual lineages of Daphnia. Proc. Natl. Acad. Sci. USA 103: 18638–18643.
OMILIAN, A. R., D. G. SCOFIELD and M. LYNCH, 2008 Intron presence-absence polymorphisms in Daphnia. Mol. Biol. Evol. 25: 2129–2139.
PALSSON, S., 2000 Microsatellite variation in Daphnia pulex from both sides of the Baltic Sea. Mol. Ecol. 9: 1075–1088.[Medline]
PAMILO, P., and N. O. BIANCHI, 1993 Evolution of the Zfx and Zfy genes: rates and interdependence between the genes. Mol. Biol. Evol. 10: 271–281.[Abstract]
PETERS, R. H., and R. DE BERNARDI, 1987 Memorie dell'Istituto Italiano di Idrobiologia, Vol. 45: Daphnia. Verbania Palanza, Palanza, Italy.
POSADA, D., and K. A. CRANDALL, 1998 MODELTEST: testing the model of DNA substitution. Bioinformatics 14: 817–818.
POSADA, D., and K. A. CRANDALL, 2001 Intraspecific gene genealogies: trees grafting into networks. Trends Ecol. Evol. 16: 37–45.[CrossRef][Medline]
REMSEN, J., and R. DESALLE, 1998 Character congruence of multiple data partitions and the origin of the Hawaiian Drosophilidae. Mol. Phylogenet. Evol. 9: 225–235.[CrossRef][Medline]
RICE, W. R., and E. F. HOSTERT, 1993 Laboratory experiments on speciation: what have we learned in 40 years. Int. J. Org. Evol. 47: 1637–1653.
ROZAS, J., J. C. SANCHEZ-DELBARRIO, X. MESSEGUER and R. ROZAS, 2003 DnaSP, DNA polymorphism analyses by the coalescent and other methods. Bioinformatics 19: 2496–2497.
SCHMID, K. J., S. RAMOS-ONSINS, H. RINGYS-BECKSTEIN, B. WEISSHAAR and T. MITCHELL-OLDS, 2005 A multilocus sequence survey in Arabidopsis thaliana reveals a genomewide departure from a neutral model of DNA sequence polymorphism. Genetics 169: 1601–1615.
SOETAERT, A., L. N. MOENS, K. VAN DER VEN, K. VAN LEEMPUT, B. NAUDTS et al., 2006 Molecular impact of propiconazole on Daphnia magna using a reproduction-related cDNA array. Comp. Biochem. Physiol. 142: 66–76.
STRASBURG, J. L., and L. H. RIESEBERG, 2008 Molecular demographic history of the annual sunflowers Helianthus annuus and H. petiolaris: large effective population sizes and rates of long-term gene flow. Evolution 62: 1936–1950.[CrossRef][Medline]
TAJIMA, F., 1983 Evolutionary relationship of DNA sequences in finite populations. Genetics 105: 437–460.
TAJIMA, F., 1989 Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 123: 585–595.
TAJIMA, F., 1996 The amount of DNA polymorphism maintained in a finite population when the neutral mutation rate varies among sites. Genetics 143: 1457–1465.[Abstract]
TESSIER, A. J., and N. L. CONSOLATTI, 1991 Resource quantity and offspring quality in Daphnia. Ecology 72: 468–478.[CrossRef]
THOMPSON, J. D., D. G. HIGGINS and T. J. GIBSON, 1994 ClustalW: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position specific gap penalties, and weight matrix choice. Nucleic Acids Res. 22: 4673–4680.
WAKELEY, J., 2000 The effects of subdivision on the genetic divergence of populations and species. Evolution 54: 1092–1101.[CrossRef][Medline]
WAKELEY, J., and J. HEY, 1997 Estimating ancestral population parameters. Genetics 145: 847–855.[Abstract]
WAKELEY, J., and J. HEY, 1998 Testing speciation models with DNA sequence data, pp. 157–175 in Molecular Approaches to Ecology and Evolution, edited by R. DESALLE and B. SCHIERWATER. Birkhäuser Verlag, Boston.
WANG, R. L., J. WAKELEY and J. HEY, 1997 Gene flow and natural selection in the origin of Drosophila pseudoobscura and close relatives. Genetics 147: 1091–1106.[Abstract]
WATANABE, H., N. TATARAZAKO, S. ODA, H. NISHIDE, I. UCHIYAMA et al., 2005 Analysis of expressed sequence tags of the water flea Daphnia magna. Genome 48: 606–609.[Medline]
WATTERSON, G. A., 1975 Number of segregating sites in genetic models without recombination. Theor. Popul. Biol. 7: 256–276.[CrossRef][Medline]
WEIDER, L. J., and A. HOBAEK, 1997 Postglacial dispersal, glacial refugia, and clonal structure in Russian/Siberian populations of the Arctic Daphnia pulex complex. Heredity 78: 363–372.[CrossRef]
WEIDER, L. J., K. L. GLENN, M. KYLE and J. J. ELSER, 2004 Associations among ribosomal (r)DNA intergenic spacer length, growth rate, and C: N: P stoichiometry in the genus Daphnia. Limnol. Oceanogr. 49: 1417–1423.
WON, Y. J., and J. HEY, 2005 Divergence population genetics of chimpanzees. Mol. Biol. Evol. 22: 297–307.
WRIGHT, S., 1931 Evolution in Mendelian populations. Genetics 16: 97–159.
WRIGHT, S., 1943 Isolation by distance. Genetics 28: 114–138.
WRIGHT, S. I., and B. S. GAUT, 2005 Molecular population genetics and the search for adaptive evolution in plants. Mol. Biol. Evol. 22: 506–519.
Communicating editor: D. BEGUN
- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Supporting Information
-
All Versions of this Article:
genetics.108.099549v1
182/1/325 most recent - Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Email this article to a friend
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Omilian, A. R.
- Articles by Lynch, M.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Omilian, A. R.
- Articles by Lynch, M.


