| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |


* Department of Ecology and Evolutionary Biology and the Centre for the Analysis of Genome Evolution and Function, University of Toronto, Toronto, Ontario M5S 3G5, Canada,
Program for Molecular Structure and Function, Hospital for Sick Children, Toronto, Ontario M5G 1X8, Canada and
Life Sciences Division, Lawrence Berkeley National Laboratories, Berkeley, California 94720
1 Corresponding author: Department of Ecology and Evolutionary Biology, University of Toronto, 25 Harbord St., Toronto, ON M5S 3G5, Canada.
E-mail: asher.cutter{at}utoronto.ca
| ABSTRACT |
|---|
|
|
|---|
4 million years ago, which is less than a quarter of the time since they shared a common ancestor with outcrossing species. We conclude that the profound changes in mating behavior, physiology, and developmental mechanisms that accompanied the transition from an obligately outcrossing to a primarily selfing mode of reproduction evolved in the not-too-distant past.
The onset of a self-fertilizing mode of reproduction is an example of an evolutionary transition that will cause a reduced effective population size compared to the obligately outcrossing ancestor and close relatives (POLLAK 1987). This results as a consequence of the direct effects of homozygosity, induced by selfing, and indirectly through reduced effective recombination (NORDBORG 2000; CHARLESWORTH and WRIGHT 2001). The strong linkage disequilibrium within selfing species will cause selective sweeps and purifying selection to reduce genetic diversity across large portions of the genome, rather than just at the targets of selection, which further reduces the effective population size (MAYNARD SMITH and HAIGH 1974; CHARLESWORTH et al. 1993). Additional reductions in the effective size of selfing populations can occur if they tend to have greater metapopulation dynamics (PANNELL and CHARLESWORTH 1999; PANNELL 2003); more generally, the generation of population structure is facilitated by inbreeding and leads to lower effective population sizes locally (CHARLESWORTH et al. 1997). The combined action of these forces can result in drastically depressed effective breeding sizes of selfing populations beyond the simple twofold reduction due to elevated homozygosity, with potentially dramatic consequences for genomic patterns of molecular evolution (CHARLESWORTH and WRIGHT 2001; CHARLESWORTH 2003).
Self-fertilization has arisen independently twice within the nematode genus Caenorhabditis (KIONTKE et al. 2004) (Figure 1), but it remains unknown as to how long this mode of reproduction has persisted in these lineages and whether sufficient time has elapsed for the molecular evolutionary consequences of self-fertilization to become apparent. Surveys of population genetic variation show that the selfing species of Caenorhabditis, Caenorhabditis elegans and C. briggsae, do have lower polymorphism levels and effective population sizes than the obligate outcrossing C. remanei (GRAUSTEIN et al. 2002; JOVELIN et al. 2003; SIVASUNDAR and HEY 2003; BARRIÈRE and FÉLIX 2005, 2007; CUTTER 2006; CUTTER et al. 2006a,b), and observed levels of linkage disequilibrium indicate that the effective selfing rate is exceedingly high (KOCH et al. 2000; BARRIÈRE and FÉLIX 2005; HABER et al. 2005; CUTTER 2006; CUTTER et al. 2006b). Studies of polymorphism also have determined that weak selection for preferred codons operates in contemporary populations of C. remanei, with the intensity of selection (Nes) estimated to be
0.1 on average (CUTTER and CHARLESWORTH 2006; CUTTER 2008b). For C. elegans, genomic analyses have demonstrated that selection for translational efficiency and/or accuracy has shaped codon usage in its history (STENICO et al. 1994; DURET and MOUCHIROUD 1999; DURET 2000; CUTTER et al. 2006c), although the
20-fold smaller effective sizes of present-day C. elegans and C. briggsae populations imply that current selection for preferred codons will effectively be absent. The extent of codon usage bias, the conservation of preferred codon identity, and any association of codon usage bias with reproductive mode is presently uncharacterized in Caenorhabditis generally.
|
4 million years have elapsed since the onset of relaxed selection on codon usage in C. elegans and C. briggsae and, by implication, the origin of a predominantly self-fertilizing mode of reproduction. We therefore conclude that the novel features involving hermaphroditism and self-fertilization in these species are not ancient in origin. | MATERIALS AND METHODS |
|---|
|
|
|---|
-TriplEx2 phage vector and plated with Escherichia coli strain XL1-Blue for blue-white screening of recombinants. Recombinant plaques were picked into buffer, from which an aliquot was used for PCR amplification with primers TW5 (CTCGGGAAGCGCGCCATTGTGTTGGT) and TW3 (AGGCGGCCGACATGTTTTTTTTTTTT), followed by direct sequencing with primer TW5. Sequencing was performed on ABI 3730 automated sequencers by the University of Edinburgh School of Biological Sciences sequencing service and by the University of Arizona Genome and Technology Core sequencing service. EST sequencing was performed on a total of 42 96-well microtitre plates for C. sp. 5, 27 microtitre plates for C. brenneri, and 18 microtitre plates for C. japonica.
EST analysis:
The PartiGene software system (PARKINSON et al. 2004a) was used to process the raw EST sequence data to trim low quality sequence, remove contaminating vector sequence, and cluster replicate sequences into a single sequence object. Hereafter, we refer to these clustered sequence objects as "genes," recognizing that in most cases they do not represent full-length coding sequences. The resulting genes were then passed through prot4EST (WASMUTH and BLAXTER 2004), a computational pipeline that infers the most appropriate peptide translation, identifies and masks putative insertion/deletion sequencing errors, and identifies putative mitochondrial vs. nuclear-encoded loci. This procedure generated 2320 genes (3857 ESTs) for C. sp. 5, 1405 genes (2449 ESTs) for C. brenneri, and 1073 genes (1906 ESTs, including 218 already present in dbEST) for C. japonica, including mitochondrial loci. Of these, 4760 genes in total are putatively nuclear encoded. The resulting sequences have been deposited in GenBank (C. japonica, FD512256–FD513938; C. brenneri, FD509784–FD512255; and C. sp. 5, FD513939–FD517806) and added to Nembase (PARKINSON et al. 2004b).
For the sequenced genomes, we obtained EST sequences from dbEST for C. elegans (346,064), C. remanei (28,205), and C. briggsae (2517). Using a custom Perl script, we associated these ESTs to corresponding predicted coding sequences from WormBase WS170. After applying the restriction that the BLAST alignment must match at least half the length of the EST with
90% identity, we associated 280,966 ESTs with 15,207 genes for C. elegans, 15,868 ESTs with 5603 genes for C. remanei, and 1798 ESTs with 765 genes for C. briggsae. Although predefined associations between EST and coding sequence are publicly available for C. elegans, we used the EST hit counts from this procedure to ensure uniform criteria for associating ESTs with genes among these three species.
Analysis of codon usage:
We used the species-specific EST counts for each locus as a measure of the expression level for the corresponding locus in each species, for nuclear-encoded loci only (supplemental Data 1). Following DURET and MOUCHIROUD (1999) and CUTTER et al. (2006c), we partitioned the 26,390 loci described above into two classes: those that had a single EST "hit" and those with EST "hits" numbering greater than or equal to the 90th percentile (n90). Loci in these categories are taken to represent genes with low and high levels of expression. For C. japonica, C. brenneri, and C. sp. 5, n90 = 3. We also associated ESTs in dbEST with the predicted coding sequences of C. elegans, C. remanei, and C. briggsae (see above), and partitioned them into two categories as for the other species, with n90 = 37, 5, and 4, respectively. We then calculated the relative synonymous codon usage (RSCU) separately for the loci in the two expression categories, which for codon j of amino acid i is simply
, where d is the degeneracy of the corresponding amino acid and nij is codon frequency.
To identify those codons that are disproportionately represented in highly expressed genes, termed preferred or optimal codons, we used t-tests to test for significant differences in RSCU between high- and low-expression loci (
RSCU) using JMP v.5.01. For this analysis, we restricted the data set to those loci with
100 codons. Codons experiencing significantly elevated RSCU in highly expressed genes (optimal codons) were subsequently used to calculate the frequency of optimal codons (Fop) codon usage bias statistic (IKEMURA 1985) with a customized optimal codon table in CodonW (J. Peden, http://codonw.sourceforge.net). The effective number of codons (ENC) (WRIGHT 1990) and base composition at fourfold degenerate sites (GC3s) were also calculated for each locus with CodonW and INCA 2.1 (SUPEK and VLAHOVICEK 2004). On the basis of these summary statistics, we constructed an ANOVA model to explain variation in codon bias (Fop) as a function of base composition (GC3s), length (log-transformed), and EST hit counts (log-transformed), and their first-order interactions using JMP. A significant positive effect of gene expression (EST hit counts) is consistent with selection having caused codon bias, although this analysis will underestimate the role of selection on codon bias because an association between codon bias and GC3s could result from selection rather than mutational effects. To obtain an overall index of codon usage bias for the species, we report the average of positive
RSCU values:
(CUTTER et al. 2006c). This metric does not depend strongly on genomic base composition or on the identities of specific optimal codons and therefore represents a useful index for contrasting the strength of selection on codon usage among species (CUTTER et al. 2006c), although it is not known how strongly this metric is affected by sample size (and therefore the 90th percentile cutoff level relative to the class of genes with a single EST hit).
Divergence:
The coding sequences resulting from the computational procedures applied to ESTs from C. japonica, C. brenneri, and C. sp. 5 were combined with gene predictions for C. elegans, C. briggsae, and C. remanei acquired from WormBase release WS170 to infer putative one-to-one orthologs with OrthoMCL using default parameters (LI et al. 2003). Multiple-sequence alignments of the resulting putative orthologs with ClustalW (THOMPSON et al. 1994) were followed by calculation of lineage-specific synonymous (dS) and nonsynonymous (dN) site substitution rates with PAML (YANG 1997) assuming the Caenorhabditis phylogenetic topology of KIONTKE et al. (2004, 2007) and KIONTKE and SUDHAUS (2006) (Figure 1), implemented with custom Bioperl-based scripts. Alignments were based on canonical peptide translations and substitution rates used the Goldman–Yang codon-based maximum-likelihood method, permitting branch-specific dN/dS (i.e., model = 1, codon model F3X4) (YANG et al. 1994). Because the EST collections do not fully represent the gene complement of their respective genomes, we eliminated sets of genes for which orthology designation was likely to be inappropriate; specifically, we excluded putative orthologous groups that exhibited excessively high substitution rates along any branch (dN > 0.5 or dS > 5; <3% of genes fit these criteria). This procedure resulted in 63 orthologous groups with representatives in all six species of Caenorhabditis and 1244 orthologous groups with other combinations of three to five species (supplemental Data 2). We exclude from consideration the 6398 orthologous groups specific to the three sequenced genomes (C. elegans, C. briggsae, and C. remanei), because in the six-species context, they provide only lineage-specific data to C. remanei. However, we also followed the above procedure to infer orthologs and calculate divergence for the three sequenced genomes only, excluding the EST-derived sequences, which provided lineage-specific divergence for C. remanei and C. briggsae (7846 orthologous groups) with respect to their common ancestor (using C. elegans as outgroup).
The substitution rate at neutral sites will occur at a rate equal to the mutation rate and independently of population size (KIMURA 1968), and therefore can be used to standardize replacement-site substitutions for genomic heterogeneity in mutation rate. Divergence at synonymous sites (dS) is typically used as a neutral reference. However, all of these Caenorhabditis species demonstrate correlations between dS and codon bias (ENC) (Spearman's nonparametric correlations: C. elegans = 0.55, C. brenneri = 0.60, C. remanei = 0.50, C. briggsae = 0.58, C. sp. 5 = 0.51; all P < 0.0001), reflecting selection on codon usage (SHARP and LI 1987). To represent neutral substitution rates, we consequently adjusted the divergence values to correct for selection on codon bias by adding the residuals of least-squares regressions of branch-specific dS on ENC to the value of dS expected at ENC = 61 (where codon bias is not present). These adjusted synonymous-site substitution rates are referred to as dS', and are used to standardize nonsynonymous substitution rates as dN/dS'. We restrict assessment of substitution rates to those sequences with a Caenorhabditis outgroup, and therefore exclude divergence for the basal C. japonica lineage from consideration (and for C. elegans for the sequenced-genome inference of orthologous groups). This restriction, and the exclusion of loci for which ENC could not be calculated, results in the different sample sizes of genes for each species.
| RESULTS |
|---|
|
|
|---|
|
|
|
) across all loci (CUTTER et al. 2006c). Second, we calculated the average ENC (WRIGHT 1990) and Fop for the set of 63 orthologous groups of loci that were common to all six taxa (values for the full data set are composed of different sets of loci for each species; Table 1). Because base composition is similar in this group of species (Table 1), it is permissible to contrast mean ENC and Fop across taxa. For all of these metrics, the outcrossing C. sp. 5 exhibits the most extreme value in the direction of strong codon bias (Table 1), although all species of Caenorhabditis demonstrate strong overall codon bias relative to many other nematodes, particularly parasites that have
typically
0.1 (CUTTER et al. 2006c). However, average codon bias did not differ significantly among species (63 orthologous groups, Kruskal–Wallis P > 0.2 for both ENC and Fop). We also tested for a difference in codon bias levels in a pairwise contrast of the >800 orthologs shared between C. briggsae and C. sp. 5 (the two closest relatives). This analysis revealed that codon bias is significantly weaker in the selfing C. briggase (median Fop C. briggsae = 0.461, C. sp. 5 = 0.498, Wilcoxon's P = 0.0022; median ENC C. briggsae = 48.40, C. sp. 5 = 46.67, Wilcoxon's P = 0.0003). Finally, to investigate the correspondence of codon bias in matched orthologous sequences, we performed orthogonal regressions of the frequency of optimal codons for each pair of species for the 63 orthologous groups common to all six species (Figure 4). This third analysis revealed that C. elegans exhibits reduced codon bias relative to most other species of Caenorhabditis, despite the small magnitude of effect. In summary, we find evidence for weaker codon bias in both selfing species, although the magnitude of the reduction is relatively small.
|
= 4Nes) estimated to be only
0.4 in Caenorhabditis (CUTTER and CHARLESWORTH 2006; CUTTER 2008b). Consequently, a sharp reduction in effective population size, like that brought on by the evolution of self-fertilization, will result in relaxed selection on codon usage such that its evolution will be governed solely by the neutral processes of mutation and genetic drift. Applying this logic, we draw on the results of SUEOKA (1962) and MARAIS et al. (2004a) to infer the degree of codon bias at time t, pt, following a complete relaxation of selection on codon usage. We begin with the Li–Bulmer (LI 1987; BULMER 1991) equation for the expected equilibrium frequency of preferred codons in the ancestor that experienced selection for codon bias
, where k is the ratio of the mutation rate from (u) and to (v) preferred codons and
is the selection intensity (4Nes). The equilibrium frequency of preferred codons in the absence of selection is simply p** = v/(u + v). Analogous to previous results for the evolution of base composition (SUEOKA 1962; MARAIS et al. 2004a), we find that change in codon bias over time following relaxation of selection can be described by the relation
![]() | (1) |
3.8 million years. It is plausible that the generation time in nature is faster than we have assumed, which would imply that relaxation of selection, and selfing, originated more recently (assuming a 14- to 90-day generation time, 8.9 x 105 < t < 5.7 x 106). Likewise, a lower ancestral codon bias would imply a more recent origin than suggested by these calculations (assuming Fop = p* range 0.66–0.71, 1.9 x 105 < t < 4.7 x 106; Figure 5).
|
|
|
| DISCUSSION |
|---|
|
|
|---|
0.1 in C. remanei (CUTTER and CHARLESWORTH 2006; CUTTER 2008b). Therefore, selection on codon usage is expected to be relaxed completely in the present day for those species that reproduce by self-fertilization, by virtue of their
20-fold lower effective population sizes (GRAUSTEIN et al. 2002; JOVELIN et al. 2003; CUTTER et al. 2006a). However, we report that the overall levels of codon bias are not much reduced in the selfing species, implying that little time has elapsed to permit the accumulation of slightly deleterious mutations in the form of unpreferred codons. Consistent with the conclusion that the selfing mode of reproduction is not ancient, we also find little evidence for differences in rates of substitution at replacement sites within coding sequences for orthologous groups of genes in these six species of Caenorhabditis. In the large data set of orthologs defined for the three sequenced genomes (C. elegans, C. remanei, and C. briggsae), we do observe an elevation in the rate of replacement substitutions in the selfing C. briggsae lineage. However, the synonymous substitution rate is also higher, perhaps as a consequence of a more rapid turnover of generations in C. briggsae and its recent ancestors or to a higher mutation rate per generation (BAER et al. 2005). Due to the high absolute levels of synonymous site divergence (saturation is observed in pairwise comparisons) and the strong correlation between synonymous and replacement divergence, it is possible that the observed differences in dN and dN/dS' are due to a more rapid pace of mutation per year rather than to a change in population size coincident with the origin of selfing in C. briggsae's ancestry. The contrast of C. briggsae with the more closely related C. sp. 5 is the more informative comparison, for which no significant difference in replacement-site divergence was detected. It is worth noting that if selection coefficients on replacement sites are sufficiently large, then even a significant reduction in effective population size due to selfing might not lead them to be effectively neutral. However, given a broad distribution of selection coefficients, as observed in Drosophila (PIGANEAU and EYRE-WALKER 2003; LOEWE and CHARLESWORTH 2006; LOEWE et al. 2006), we would nonetheless expect a reasonable fraction of replacement sites to become effectively neutral with a 20-fold smaller effective population size.
Using a model that describes the change in codon bias over time due to relaxed selection, we estimate that selfing likely evolved in the C. elegans lineage within the last
4 million years, assuming coincident onset of relaxed selection with the evolution of selfing. If a divergence time of
18 million years separating C. elegans from its closest relatives is roughly correct (CUTTER 2008a), then this implies that the lineage leading to C. elegans spent <25% of the time since the common ancestor in a selfing state. These calculations could overestimate the date for the origin of selfing for two reasons. First, if codon bias were not as strong in the outcrossing ancestor of C. elegans as we assume, which is plausible because the outgroup C. japonica also exhibits nominally weaker codon bias than the monophyletic group that is sister to C. elegans (Table 1; Figure 1), then less time would be required to account for the smaller resultant change in codon bias to the present day. For example, taking the level of codon bias in C. japonica as the ancestral state for C. elegans, we would infer an origin of selfing of only
2.1 x 106 generations or
347 thousand years ago (given an average 60-day generation time in nature). Second, if C. elegans's generation time is more rapid than the conservative rate used in our calculations, then a more recent origin of selfing would be inferred. The lack of difference in nonsynonymous substitution rates among species also is consistent with the conclusion that low population sizes in the selfing taxa have not persisted for very long. C. briggsae demonstrates very little reduction in codon bias relative to its sister species, C. sp. 5, suggesting that selfing might have evolved even more recently in C. briggsae's ancestry than for C. elegans. These findings imply that the evolution of hermaphrodite mating behavior (CHASNOV and CHOW 2002; GARCIA et al. 2007; KLEEMANN and BASOLO 2007), chemo-attraction (CHASNOV et al. 2007), sperm production (CUTTER 2004; GELDZILER et al. 2006), and developmental mechanisms (HILL et al. 2006) likely do not have ancient origins.
Additional circumstantial support for the notion of a recent origin of selfing in Caenorhabditis comes from the lack of hermaphrodite-only clades comprising multiple species (KIONTKE et al. 2004, 2007), although further sampling of biodiversity in this genus could refute this line of reasoning. Selfing hermaphroditism appears to arise regularly in Rhabditid nematodes, but occurs primarily among tip-taxa (KIONTKE and FITCH 2005), suggesting that extinction of hermaphroditic lineages likely outpaces speciation. In addition, to the extent that relaxed selection on male function and hermaphrodite attractiveness to males is occurring in C. elegans and C. briggsae, or active selection against these traits, insufficient time has elapsed for the complete loss of a functional male phenotype. According to the model of male maintenance by CHASNOV and CHOW (2002) (Equation 13), it would seem that too many male-specific genes occur in the C. elegans genome to permit male maintenance via a balance between degeneration and mating (see also CUTTER et al. 2003). Such a balance could maintain males if there were fewer than
62 male genes, given an average per gene deleterious mutation rate each generation of 0.48/20,000 = 2.4 x 10–5 (DENVER et al. 2004), an average X chromosome nondisjunction rate of 0.0033 per generation (TEOTONIO et al. 2006), and an average male mating-ability parameter of 0.18 (TEOTONIO et al. 2006)—yet there are likely >100 male genes in the genome (KIM et al. 2001; REINKE et al. 2004; CUTTER and WARD 2005). Consequently, this suggests that male genes are expected to degenerate, which is consistent with phenotypic observations of strains in nature with genetically sterile males (HODGKIN and DONIACH 1997; LINTS and EMMONS 2002) and generally poor male C. elegans mating ability relative to gonochoristic males (CHASNOV and CHOW 2002). The lack of significantly elevated rates of sequence evolution in male genes (CUTTER and WARD 2005) therefore is consistent with a relatively recent origin of the extreme form of selfing currently inferred for C. elegans.
Contrasts with other taxa:
How does codon usage bias in Caenorhabditis compare with other eukaryotes? Differences in background base composition make it infeasible to use average ENC or Fop as a general comparative tool, whereas the
statistic (the average difference in RSCU between highly and lowly expressed genes) provides a useful index for comparison among species with very different GC content (CUTTER et al. 2006c). The consistently strong selection-mediated codon bias among Caenorhabditis species is consistent with other free-living nematodes and contrasts with the limited role for selection causing codon bias in parasitic nematodes (CUTTER et al. 2006c). For more distant relatives, a reanalysis of DURET and MOUCHIROUD's (1999) data to calculate
for short proteins indicates that differential selection on codon usage among genes with high vs. low expression has been substantially stronger in C. elegans history (
) than for Drosophila melanogaster (
) or Arabidopsis thaliana (
). The different value for C. elegans
than reported here (0.335; Table 1) stems from their data set containing only short proteins, which experience stronger codon bias and a different definition of highly and lowly expressed genes. KANAYA et al. (2001) also showed that C. elegans exhibits stronger selection on codon usage than D. melanogaster, Xenopus laevis, and human. The substantially higher average frequency of optimal codons observed in D. melanogaster than C. elegans (DURET and MOUCHIROUD 1999) likely reflects the greater overall GC content of the fly genome relative to the worm, rather than differences in selection on codon usage, because optimal codons tend to be GC rich. However, polymorphism-based inference of codon bias suggests stronger selection intensities for species of Drosophila than Caenorhabditis (reviewed in CUTTER and CHARLESWORTH 2006), indicating that further work is necessary to elucidate the conflicting implications of population polymorphism and genomic trends in codon usage for these groups.
Breeding system variation analogous to that in Caenorhabditis is found within the plant genus Arabidopsis. Like the results reported here, no differences in substitution rates or codon bias have been observed between the inbreeding A. thaliana and outbreeding A. lyrata (WRIGHT et al. 2002). Overall codon usage bias is much weaker in Arabidopsis than in Caenorhabditis (see above), so an effect of selfing would be more difficult to detect in Arabidopsis because codon usage will be closer to the no-selection equilibrium levels. Furthermore, a recent origin of selfing in A. thaliana (CHARLESWORTH and WRIGHT 2001) or population bottlenecks in A. lyrata might also obscure a detectable effect of selfing on the expected patterns of molecular evolution (WRIGHT et al. 2002, 2003). Analyses of the self-incompatibility locus in Arabidopsis on the basis of patterns of diversity (SHIMIZU et al. 2004) and linkage disequilibrium (TANG et al. 2007) implicate an origin of selfing of
1 million generations ago. This contrasts with the broader plausible range for C. elegans of 0.32 x 106–23.3 x 106 generations ago, on the basis of patterns of nucleotide diversity (CUTTER 2006) or the decay of codon bias.
Effects of selfing on other genomic features:
The most straightforward prediction of the smaller effective population sizes induced by selfing is that nucleotide diversity will be reduced genomewide. This prediction is clearly met in Caenorhabditis (GRAUSTEIN et al. 2002; JOVELIN et al. 2003; CUTTER et al. 2006a).
We have focused on the potential for accumulation of slightly deleterious mutations in orthologous coding sequences. However, relaxed selection might also impact multigene families disproportionately in selfing lineages, through patterns of divergence and pseudogene production. Such an effect would be important for understanding whether accelerated rates of protein evolution in some gene families result from positive selection or relaxed selection (STEWART et al. 2005; THOMAS et al. 2005), making sure to account for past selection on synonymous sites when standardizing by synonymous site divergence.
In addition, the rapid generation of homozygosity produced by self-fertilization might facilitate more rapid fixation of genomic rearrangements, if heterozygous fitness is most strongly affected (CHARLESWORTH 1992). It will be important to determine whether the extensive intrachromosomal rearrangements observed between C. elegans and C. briggsae (COGHLAN and WOLFE 2002; HILLIER et al. 2007) is a general feature of the genus or whether the rates of translocation, inversion, and transposition are elevated specifically in the selfing lineages.
Models of transposable element dynamics in selfers vs. outcrossers depend critically on the relative importance of deleterious insertion vs. ectopic exchange (unequal crossing-over caused by repetitive elements) as selective agents against transposable element proliferation (NUZHDIN 1999; MORGAN 2001). A dominant effect of deleterious insertion could cause mobile elements to be eliminated completely from highly selfing lineages. Consequently, if it is generally true that outcrossing Caenorhabditis have higher mobile element loads than the selfing species, as appears to be the case for retroelements in C. remanei (Z. BAO and N. JIANG, personal communication), then it might support a primary role for the deleterious effects of element insertion in controlling copy numbers, in contrast to the prevailing view for Drosophila (PETROV et al. 2003). The greater abundance of transposable elements in the genome of C. briggsae than in C. elegans (STEIN et al. 2003) therefore might be a consequence of a more recent origin of selfing in C. briggsae. It is also important to note that differences among species in the effectiveness of an RNA-interference response to limit transposition could play a fundamental role in transposable element activity and genomic copy number (SIJEN and PLASTERK 2003).
Conclusions:
Transitions in breeding system from obligately outcrossing to predominantly self-fertilizing will shift genomic patterns of molecular evolution, through the accumulation of slightly deleterious mutations by genetic drift due to the concomitant reduction in effective population size. However, the extent of change resulting from relaxed selection depends on the mutation rate and the time since the origin of selfing and will be more apparent for genomic features that are more susceptible to becoming selectively neutral, such as codon usage bias. We have shown that Caenorhabditis genomes have been shaped by weak selection in their history, yet selfing species do not differ greatly from outcrossing ones in terms of the magnitude of codon bias or replacement-site substitution rate. As a result, a model of relaxed selection suggests that it is unlikely that selfing evolved much longer than
4 million years ago. We therefore conclude that evolution over a modest timescale resulted in the novel features of hermaphrodites relative to females, in such traits as mate searching and avoidance (LIPTON et al. 2004; KLEEMANN and BASOLO 2007), activity during mating (GARCIA et al. 2007), loss of pheromone attraction (CHASNOV et al. 2007), self-sperm production, activation, and numerical optimization (CUTTER 2004; GELDZILER et al. 2006), and the sex-determination pathway (HILL et al. 2006). Additional consequences of selfing for other aspects of genome architecture, such as chromosomal rearrangements, intron size, and gene family and transposable element dynamics, will be much informed by comparisons among the ongoing genome sequencing projects in this group.
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
| LITERATURE CITED |
|---|
|
|
|---|
AKASHI, H., 1995 Inferring weak selection from patterns of polymorphism and divergence at silent sites in Drosophila DNA. Genetics 139: 1067–1076.[Abstract]
BAER, C. F., F. SHAW, C. STEDING, M. BAUMGARTNER, A. HAWKINS et al., 2005 Comparative evolutionary genetics of spontaneous mutations affecting fitness in rhabditid nematodes. Proc. Natl. Acad. Sci. USA 102: 5785–5790.
BARRIÈRE, A., and M. A. FÉLIX, 2005 High local genetic diversity and low outcrossing rate in Caenorhabditis elegans natural populations. Curr. Biol. 15: 1176–1184.[CrossRef][Medline]
BARRIÈRE, A., and M. A. FÉLIX, 2007 Temporal dynamics and linkage disequilibrium in natural Caenorhabditis elegans populations. Genetics 176: 999–1011.
BETANCOURT, A. J., and D. C. PRESGRAVES, 2002 Linkage limits the power of natural selection in Drosophila. Proc. Natl. Acad. Sci. USA 99: 13616–13620.
BIERNE, N., and A. EYRE-WALKER, 2006 Variation in synonymous codon use and DNA polymorphism within the Drosophila genome. J. Evol. Biol. 19: 1–11.[CrossRef][Medline]
BULMER, M., 1991 The selection-mutation-drift theory of synonymous codon usage. Genetics 129: 897–907.[Abstract]
C. ELEGANS SEQUENCING CONSORTIUM, 1998 Genome sequence of the nematode C. elegans: a platform for investigating biology. Science 282: 2012–2018.
CHARLESWORTH, B., 1992 Evolutionary rates in partially self-fertilizing species. Am. Nat. 140: 126–148.[CrossRef]
CHARLESWORTH, B., M. T. MORGAN and D. CHARLESWORTH, 1993 The effect of deleterious mutations on neutral molecular variation. Genetics 134: 1289–1303.[Abstract]
CHARLESWORTH, B., M. NORDBORG and D. CHARLESWORTH, 1997 The effects of local selection, balanced polymorphism and background selection on equilibrium patterns of genetic diversity in subdivided populations. Genet. Res. 70: 155–174.[CrossRef][Medline]
CHARLESWORTH, D., 2003 Effects of inbreeding on the genetic diversity of populations. Phil. Trans. R. Soc. Lond. B Biol. Sci. 358: 1051–1070.[CrossRef][Medline]
CHARLESWORTH, D., and S. I. WRIGHT, 2001 Breeding systems and genome evolution. Curr. Opin. Genet. Dev. 11: 685–690.[CrossRef][Medline]
CHASNOV, J. R., and K. L. CHOW, 2002 Why are there males in the hermaphroditic species Caenorhabditis elegans? Genetics 160: 983–994.
CHASNOV, J. R., W. K. SO, C. M. CHAN and K. L. CHOW, 2007 The species, sex, and stage specificity of a Caenorhabditis sex pheromone. Proc. Natl. Acad. Sci. USA 104: 6730–6735.
COGHLAN, A., and K. H. WOLFE, 2002 Fourfold faster rate of genome rearrangement in nematodes than in Drosophila. Genome Res. 12: 857–867.
COMERON, J. M., and T. B. GUTHRIE, 2005 Intragenic Hill-Robertson interference influences selection intensity on synonymous mutations in Drosophila. Mol. Biol. Evol. 22: 2519–2530.
CUTTER, A. D., 2004 Sperm-limited fecundity in nematodes: How many sperm are enough? Evolution 58: 651–655.[CrossRef][Medline]
CUTTER, A. D., 2006 Nucleotide polymorphism and linkage disequilibrium in wild populations of the partial selfer Caenorhabditis elegans. Genetics 172: 171–184.
CUTTER, A. D., 2008a Divergence times in Caenorhabditis and Drosophila inferred from direct estimates of the neutral mutation rate. Mol. Biol. Evol. 25: 778–786.
CUTTER, A. D., 2008b Multilocus patterns of polymorphism and selection across the X-chromosome of Caenorhabditis remanei. Genetics 178: 1659–1670.
CUTTER, A. D., and B. CHARLESWORTH, 2006 Selection intensity on preferred codons correlates with overall codon usage bias in Caenorhabditis remanei. Curr. Biol. 16: 2053–2057.[CrossRef][Medline]
CUTTER, A. D., and B. A. PAYSEUR, 2003 Rates of deleterious mutation and the evolution of sex in Caenorhabditis. J. Evol. Biol. 16: 812–822.[CrossRef][Medline]
CUTTER, A. D., and S. WARD, 2005 Sexual and temporal dynamics of molecular evolution in C. elegans development. Mol. Biol. Evol. 22: 178–188.
CUTTER, A. D., L. AVILÉS and S. WARD, 2003 The proximate determinants of sex ratio in C. elegans populations. Genet. Res. 81: 91–102.[CrossRef][Medline]
CUTTER, A. D., S. E. BAIRD and D. CHARLESWORTH, 2006a High nucleotide polymorphism and rapid decay of linkage disequilibrium in wild populations of Caenorhabditis remanei. Genetics 174: 901–913.
CUTTER, A. D., M. A. FELIX, A. BARRIERE and D. CHARLESWORTH, 2006b Patterns of nucleotide polymorphism distinguish temperate and tropical wild isolates of Caenorhabditis briggsae. Genetics 173: 2021–2031.
CUTTER, A. D., J. WASMUTH and M. L. BLAXTER, 2006c The evolution of biased codon and amino acid usage in nematode genomes. Mol. Biol. Evol. 23: 2303–2315.
DENVER, D. R., K. MORRIS, M. LYNCH and W. K. THOMAS, 2004 High mutation rate and predominance of insertions in the Caenorhabditis elegans nuclear genome. Nature 430: 679–682.[CrossRef][Medline]
DURET, L., 2000 tRNA gene number and codon usage in the C. elegans genome are co-adapted for optimal translation of highly expressed genes. Trends Genet. 16: 287–289.[CrossRef][Medline]
DURET, L., and D. MOUCHIROUD, 1999 Expression pattern and, surprisingly, gene length shape codon usage in Caenorhabditis, Drosophila, and Arabidopsis. Proc. Natl. Acad. Sci. USA 96: 4482–4487.
GARCIA, L. R., B. LEBOEUF and P. KOO, 2007 Diversity in mating behavior of hermaphroditic and male-female Caenorhabditis nematodes. Genetics 175: 1761–1771.
GELDZILER, B., I. CHATTERJEE, P. KADANDALE, E. PUTIRI, R. PATEL et al., 2006 A comparative study of sperm morphology, cytology and activation in Caenorhabditis elegans, Caenorhabditis remanei and Caenorhabditis briggsae. Dev. Genes Evol. 216: 198–208.[CrossRef][Medline]
GRAUSTEIN, A., J. M. GASPAR, J. R. WALTERS and M. F. PALOPOLI, 2002 Levels of DNA polymorphism vary with mating system in the nematode genus Caenorhabditis. Genetics 161: 99–107.
HABER, M., M. SCHUNGEL, A. PUTZ, S. MULLER, B. HASERT et al., 2005 Evolutionary history of Caenorhabditis elegans inferred from microsatellites: evidence for spatial and temporal genetic differentiation and the occurrence of outbreeding. Mol. Biol. Evol. 22: 160–173.
HILL, R. C., C. EGYDIO DE CARVALHO, J. SALOGIANNIS, B. SCHLAGER, D. PILGRIM et al., 2006 Genetic flexibility in the convergent evolution of hermaphroditism in Caenorhabditis nematodes. Dev. Cell 10: 531–538.[Medline]
HILLIER, L. W., R. D. MILLER, S. E. BAIRD, A. CHINWALLA, L. A. FULTON et al., 2007 Comparison of C. elegans and C. briggsae genome sequences reveals extensive conservation of chromosome organization and synteny. PLoS Biol. 5: e167.[CrossRef][Medline]
HODGKIN, J., and T. DONIACH, 1997 Natural variation and copulatory plug formation in Caenorhabditis elegans. Genetics 146: 149–164.[Abstract]
IKEMURA, T., 1985 Codon usage and transfer-RNA content in unicellular and multicellular organisms. Mol. Biol. Evol. 2: 13–34.[Abstract]
JOVELIN, R., B. C. AJIE and P. C. PHILLIPS, 2003 Molecular evolution and quantitative variation for chemosensory behaviour in the nematode genus Caenorhabditis. Mol. Ecol. 12: 1325–1337.[CrossRef][Medline]
KANAYA, S., Y. YAMADA, M. KINOUCHI, Y. KUDO and T. IKEMURA, 2001 Codon usage and tRNA genes in eukaryotes: correlation of codon usage diversity with translation efficiency and with CG-dinucleotide usage as assessed by multivariate analysis. J. Mol. Evol. 53: 290–298.[CrossRef][Medline]
KIM, S. K., J. LUND, M. KIRALY, K. DUKE, M. JIANG et al., 2001 A gene expression map for Caenorhabditis elegans. Science 293: 2087–2092.
KIMURA, M., 1968 Evolutionary rate at molecular level. Nature 217: 624–626.[CrossRef][Medline]
KIONTKE, K., and D. H. A. FITCH, 2005 The phylogenetic relationships of Caenorhabditis and other rhabditids, in WormBook, edited by THE C. ELEGANS RESEARCH COMMUNITY. WormBook, http://www.wormbook.org.
KIONTKE, K., and W. SUDHAUS, 2006 Ecology of Caenorhabditis species, in WormBook, edited by D. H. A. FITCH and THE C. ELEGANS RESEARCH COMMUNITY. WormBook, http://www.wormbook.org.
KIONTKE, K., N. P. GAVIN, Y. RAYNES, C. ROEHRIG, F. PIANO et al., 2004 Caenorhabditis phylogeny predicts convergence of hermaphroditism and extensive intron loss. Proc. Natl. Acad. Sci. USA 101: 9003–9008.
KIONTKE, K., A. BARRIÈRE, I. KOLOTUEV, B. PODBILEWICZ, R. SOMMER et al., 2007 Trends, stasis, and drift in the evolution of nematode vulva development. Curr. Biol. 17: 1925–1937.[CrossRef][Medline]
KLEEMANN, G. A., and A. L. BASOLO, 2007 Facultative decrease in mating resistance in hermaphroditic Caenorhabditis elegans with self-sperm depletion. Anim. Behav. 74: 1339–1347.
KOCH, R., H. G. A. M. VAN LUENEN, M. VAN DER HORST, K. L. THIJSSEN and R. H. A. PLASTERK, 2000 Single nucleotide polymorphisms in wild isolates of Caenorhabditis elegans. Genome Res. 10: 1690–1696.
KREITMAN, M., and M. ANTEZANA, 1999 The population and evolutionary genetics of codon bias, pp. 82–101 in Evolutionary Genetics: From Molecules to Morphology, edited by R. S. SINGH and C. B. KRIMBAS. Cambridge University Press, New York.
LI, L., C. J. STOECKERT, JR. and D. S. ROOS, 2003 OrthoMCL: identification of ortholog groups for eukaryotic genomes. Genome Res. 13: 2178–2189.
LI, W. H., 1987 Models of nearly neutral mutations with particular implications for nonrandom usage of synonymous codons. J. Mol. Evol. 24: 337–345.[CrossRef][Medline]
LINTS, R., and S. W. EMMONS, 2002 Regulation of sex-specific differentiation and mating behavior in C. elegans by a new member of the DM domain transcription factor family. Genes Dev. 16: 2390–2402.
LIPTON, J., G. KLEEMANN, R. GHOSH, R. LINTS and S. W. EMMONS, 2004 Mate searching in Caenorhabditis elegans: a genetic model for sex drive in a simple invertebrate. J. Neurosci. 24: 7427–7434.
LOEWE, L., and B. CHARLESWORTH, 2006 Inferring the distribution of mutational effects on fitness in Drosophila. Biol. Lett. 2: 426–430.