| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Genetics, Vol. 176, 1119-1130, June 2007, Copyright © 2007
doi:10.1534/genetics.106.069690
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Center for Environmental Genomics Department of Biology, McMaster University, Hamilton, Ontario L8S 4K1, Canada
1 Address for correspondence: Center for Environmental Genomics, Department of Biology, McMaster University, Life Sciences Building, Room 328, 1280 Main St. W., Hamilton, ON L8S 4K1, Canada.
E-mail: evansb{at}mcmaster.ca
| ABSTRACT |
|---|
|
|
|---|
African clawed frogs (genera Xenopus and Silurana) offer a promising model with which to examine the impact of ancestry on gene fate in an allopolyploid genome because multiple independent instances of allopolyploidization occurred (EVANS et al. 2004, 2005), and the long term effects of this type of genome duplication can be explored with replication. All extant species of clawed frogs in the genus Xenopus (i.e., those species with multiples of 2x = 18 chromosomes) share a common tetraploid ancestor (EVANS et al. 2004, 2005). A definitive test of allopolyploid vs. autopolyploid origin of this ancestor is not currently possible because no extant Xenopus diploids (2x = 18) are known. However, this ancestor is suspected to have been an allotetraploid because (a) other polyploid clawed frogs are definitively allopolyploids (EVANS et al. 2005), (b) Xenopus genomes are diploidized and duplicated pairs of chromosomes have visible differences in secondary constrictions (TYMOWSKA 1991), (c) allopolyploid individuals can be created in the laboratory by crossing extant species (KOBEL 1996), and (d) multiple unlinked loci indicate that the phylogenetic signal of many paralogs is not blended by recombination (EVANS et al. 2005; CHAIN and EVANS 2006; F. J. J. CHAIN, D. ILIEVA and B. J. EVANS, unpublished results). Silurana and Xenopus diversified from one another
5364 million years ago (MYA) and tetraploidization in Xenopus occurred
2141 MYA (EVANS et al. 2004; CHAIN and EVANS 2006).
RAG1 and RAG2 proteins form a heterodimer that is crucial for the process of somatic rearrangement of DNA known as V(D)J recombination, making possible the extraordinary molecular variation of B-cell and T-cell antigen receptors that is needed to combat pathogen attack. The core region of RAG1, which, when paired with RAG2 is sufficient to carry out V(D)J recombination, spans human residues 3861011 out of a total of 1040 amino acids in the protein (SADOFSKY et al. 1993). The core region of RAG1 was derived from the Transib transposon superfamily, whereas the RAG2 and the N-terminal domain of RAG1 probably was derived from other sources (KAPITONOV and JURKA 2005). These genes are tightly linked in jawed vertebrates; a recent build (version 4.1) of the complete genome sequence of the diploid clawed frog Silurana tropicalis indicates that there is only one copy of RAG1 and one of RAG2, and that each is convergently transcribed and tightly linked by an
6.5-kb intergenic region. This is consistent with findings in Xenopus laevis that suggest that this tetraploid was derived from the fusion of two diploid genomes and that each of these diploid genomes carried only one copy of RAG1 (GREENHALGH et al. 1993; EVANS et al. 2005). Thus, in the absence of paralog deletion and degeneration, a null expectation is that the number of RAG1 and RAG2 paralogs corresponds with the ploidy level of a species: diploids should have one copy of each gene, tetraploids should have two, octoploids should have four, and dodecaploids should have six (Figure 1). Each of these paralogs would have two alleles.
|
" and "ß"). This pattern of degeneration could be explained by (a) a shared ancestral degeneration of a coding or regulatory region of RAG1 paralog ß that was not sequenced by EVANS et al. (2005), (b) multiple independent and biased degenerations in different species, or (c) some combination of shared and independent degenerations (EVANS et al. 2005). This study aims to further investigate these possibilities and track the evolutionary history of this heterodimer through multiple episodes of speciation and genome duplication. To this end, (a) new sequences were obtained from most or all upstream coding regions of RAG1 paralogs to test for shared coding-region degeneration, (b) expression analyses were performed on multiple species to test for ancestral silencing of paralogs, and (c) sequences were obtained from paralogs of the linked partner gene RAG2.
| MATERIALS AND METHODS |
|---|
|
|
|---|
3135 bp total per RAG1 paralog and 1123 bp out of 1575 bp total per RAG2 paralog (supplemental Figure 1 at http://www.genetics.org/supplemental/). The sequenced portion of each RAG1 paralog varied but generally covers most or all of the core region of RAG1 (supplemental Figure 2 at http://www.genetics.org/supplemental/). Some paralogs were not detected using a variety of primer combinations, and these may have been deleted (supplemental Figure 1). Sequencing of individual paralogs was accomplished through a combination of TA cloning (Invitrogen) and targeted amplification using paralog-specific primers (supplemental Figure 1). Allelic clones with the least number of autapomorphic mutations were selected for analysis. When both alleles of a paralog were directly sequenced with paralog-specific primers, polymorphic positions were analyzed using IUPAC degenerate nucleotide symbols. Some sequence data from expressed paralogs were obtained from nonvouchered individuals; these data were conatenated with sequences from the corresponding paralog from other individuals that usually were vouchered, as detailed in (EVANS et al. 2004, 2005). Following EVANS et al. (2005), paralogs of RAG1 and RAG2 that are closely related to the linked X. laevis paralogs identified by GREENHALGH et al. (1993) are referred to as ß paralogs and the others are referred to as
paralogs. In Silurana, tetraploid paralogs closely related to the diploid S. tropicalis are designated
paralogs and the others are ß paralogs.
Attempts to amplify and clone the
paralogs of RAG2 failed in all species of Xenopus using a variety of primer combinations (supplemental information 1 at http://www.genetics.org/supplemental/), even though both paralogs were detected in the Silurana tetraploids. To rigorously test whether RAG2
paralogs were deleted in clawed frogs as opposed to just not being amplified, a systematic effort was made to coamplify both paralogs using seven pairwise combinations of three forward and three reverse primers (supplemental Figure 1 at http://www.genetics.org/supplemental/). S. tropicalis was used as a positive control to demonstrate that in addition to Xenopus RAG2 paralog ß, these primers can successfully amplify RAG2 in a more distantly related lineage than Xenopus RAG2 paralog
.
Amplification of expressed RAG1 paralogs:
To determine which RAG1 paralogs are expressed in various species, cDNA was amplified across an intron in the 5' untranslated region of the RAG1 transcript (GREENHALGH et al. 1993). Negative controls with no DNA and with genomic DNA were performed to ensure that only expressed and spliced paralogs were amplified; previously published and new primers were used (supplemental Figure 1 at http://www.genetics.org/supplemental/; GREENHALGH et al. 1993). RNA was extracted using the RNeasy mini kit (QIAGEN) and converted to cDNA using the Omniscript RT kit (QIAGEN). Individual expressed paralogs were amplified from cDNA generated from different tissues (brain, liver, spleen, testis, and/or bone marrow) from a variety of species (X. laevis, X. gilli, X. borealis, X. muelleri, X. amieti, X. andrei, X. new octoploid, and X. boumbaensis), and then cloned and sequenced. Additionally, cDNA from some tissues was directly sequenced and chromatograms were inspected for paralog-specific single nucleotide polymorphisms.
Phylogenetic analyses:
Phylogenetic analysis was performed on coding portions of RAG1 and RAG2 in three types of data configurations: (1) each locus was analyzed independently, (2) putatively linked paralogs were combined into single taxonomic units, and (3) for Xenopus only, a "synthetic" data set was constructed in which
and ß paralogs of RAG1 and RAG2 that were derived from the same most recent tetraploid ancestor were combined into a single taxonomic unit. This third configuration exploits the redundant phylogenetic information available in co-inherited paralogs. For example, in the third data configuration, tetraploid
and ß paralogs were combined, octoploid
1 and ß1 paralogs were combined, but octoploid
1 and ß2 paralogs were not combined. For the third analysis, S. tropicalis was used as an out-group to RAG1
and RAG2 paralogs and no out-group sequence was used for the portion of the sequences that was composed of RAG1 ß paralogs.
MrBayes version 3.1.2 was used for Bayesian phylogenetic analysis, and Bayes factors were used to select a model of evolution, as described by NYLANDER et al. (2004). Seven partitioned models were explored (supplemental Table 1 at http://www.genetics.org/supplemental/). These models were compared on the basis of the harmonic mean of the posterior probability of trees sampled after a conservative burn-in of 1 million generations from two independent MCMC runs, each of 2 million generations. A highly parameterized model of evolution was favored for phylogenetic analyses of RAG1 and RAG2 (supplemental Table 1), and this model was also used for the combined analyses, using 5 million generations and the same burn-in. Branch support was also evaluated with 2000 nonparametric bootstrapping replicates, each with a single replication of random taxon addition, a limit of 10 million rearrangements per replicate, and the maximum parsimony criterion using PAUP version 4.0b10 (SWOFFORD 2002). Almost all of the well-supported relationships from the Bayesian analyses also have nonparametric bootstrap values of >80% (supplemental Figure 3 at http://www.genetics.org/supplemental/).
To test hypotheses of fewer gene silencing events in RAG1 than was suggested by phylogenetic analysis, parametric bootstrap tests (HUELSENBECK et al. 1996; GOLDMAN et al. 2000) were performed as in EVANS et al. (2005). This procedure tests the fit of the data to alternative phylogenetic hypotheses that are different from the consensus tree that was obtained from the Bayesian analysis, and that would be consistent with fewer instances of independent gene degeneration (supplemental Figure 4 at http://www.genetics.org/supplemental/). To maximize the phylogenetic signal of these tests, simulations were performed according to the synthetic data configuration using Seq-Gen version 1.3.2 (RAMBAUT and GRASSLY 1997). A Perl script was written to modify the simulations to match the observed data in terms of the quantity and positions of missing data for each taxon.
A caveat to the interpretation that autapomorphic degenerations occurred independently is that recurrent substitutions could have erased an ancestral degeneration in one or both descendant paralogs. To explore this possibility, marginal ancestral reconstruction of ancestral character states was performed with a general time-reversible nucleotide model and a gamma-distributed rate heterogeneity parameter using the baseml program of PAML version 3.14 (YANG 1997). Reconstructed sequences were then translated into protein using MacClade version 4.08 (MADDISON and MADDISON 2000) and inspected for stop codons.
Testing for phylogenetic bias in RAG1 degeneration:
To explore whether there is significant bias in gene degeneration of RAG1 with respect to ancestry of each paralog, two approaches were taken. The first approach used a maximum likelihood framework to compare rates of degeneration (
), using Discrete version 4.0 (PAGEL 1994). This framework tested whether rates of degeneration in RAG1 and RAG2 were significantly different, and also whether rates of degeneration in the RAG1
and ß lineages were significantly different. The rate of resuscitation of degenerate paralogs was set to a negligible value, missing paralogs were coded as degenerate, and the most recent ancestor of expressed paralogs with autapomorphic degenerations was set as nondegenerate. Likelihood ratio tests were used to compare models with two degeneration parameters to models with only one, and these tests were performed with and without a gamma-distributed approximation for rate heterogeneity (
). To be conservative, modified topologies were used in which the independent degenerations (numbered 112 in Figure 2) were each forced to be a clade. Branch lengths were estimated under the GTR+I+
model and imposing a molecular clock using PAUP* (SWOFFORD 2002).
|
or ß) whenever possible. For eight extant or ancestral species with observed independent RAG1 degeneration (see RESULTS), 100,000 simulations drew k degeneration events from n 1 paralogs, where n is the total number of nondegenerate paralogs inferred to be present when that species originated. Degeneration was modeled as a delayed transformation phenomenon wherein paralogs degenerated after allopolyploidization, and it was also performed in a phylogenetically independent manner such that ancestral degenerations were inherited and not "re-simulated" in descendant species. Chimerical heterodimers were quantified for each simulation on the basis of observed and suspected degenerations and deletions of RAG2 (i.e., degenerations 1012 in Figure 2).
Recombination between alleles of different paralogs:
Recombination between alleles of different paralogs could blend their phylogenetic signal, and recombination between the intergenic regions of paralogous chromosomes could alter the synteny of paralogs (GREENHALGH et al. 1993). To test for evidence of recombination in Xenopus and Silurana sequences, multiple tests were used because their performance varies with the level of divergence, the extent of recombination, and among site-rate heterogeneity (POSADA and CRANDALL 2001; POSADA 2002). These tests included the recombination detection program, geneconv, chimera, bootscan, and siscan, as implemented by the Recombination Detection Program (MARTIN and RYBICKI 2000). Details of these methods can be found elsewhere (MAYNARD SMITH 1992; SALMINEN et al. 1995; PADIDAM et al. 1999; GIBBS et al. 2000; MARTIN and RYBICKI 2000; POSADA and CRANDALL 2001). A variety of parameter settings were explored for each method as in EVANS et al. (2005).
| RESULTS |
|---|
|
|
|---|
paralog and two RAG2 paralogs whose relationships are suggestive of dodecaploidy (Figure 2). This suggests that this individual from Younde, Cameroon, was previously incorrectly classified as X. boumbaensis (EVANS et al. 2004, 2005), which is an octoploid species (LOUMONT 1983; TYMOWSKA 1991). It appears that X. cf. boumbaensis is a dodecaploid derived from allopolyploidization between an octoploid ancestor of X. boumbaensis and a tetraploid ancestor of X. cf. fraseri 2 (Figure 3). Data from an X. boumbaensis individual from the type locality of Moloundou, Cameroon, which were not included in EVANS et al. (2005), are consistent with octoploidy (Figure 2 and 3) and lead to a re-evaluation of the evolutionary history of this species. Moreover, it appears that X. boumbaensis, X. ameiti, and X. andrei share a common octoploid ancestor, as opposed to each originating independently as was previously proposed (EVANS et al. 2005). Thus these genealogies support three rather than five independent allopolyploid origins of most extant octoploids: (1) X. vestitus, (2) X. wittei and X. new octoploid, and (3) X. amieti and X. andrei and X. boumbaensis, but three rather than two independent origins of dodecaploids: (1) X. ruwenzoriensis, (2) X. longipes, and (3) X. cf. boumbaensis (Figure 3). This new information also changes the number of ancestral species that are predicted but for whom an extant descendant with the same ploidy level is unknown, from three diploids, three tetraploids, and one octoploid (EVANS et al. 2005) to three diploids and three tetraploids (Figure 3). Mitochondrial DNA sequences from X. cf. boumbaensis are almost identical to a X. boumbaensis sample from the type locality (EVANS et al. 2004), suggesting recent dodecaploidization of this individual.
|
Another suspected deletion of RAG2 occurred in an ancestral paralog from which paralog ß2 of X. amieti, X. ruwenzoriensis, X. boumbaensis, and X. longipes and RAG2 paralog ß3 of X. cf. boumbaensis would have descended (deletion 12 in Figure 2). These predicted paralogs were not detected in amplifications from genomic DNA, even after multiple attempts with different primer combinations (supplemental Figure 1 at http://www.genetics.org/supplemental/). Linked RAG1 paralogs from these species also were not detected (deletion 12 in Figure 2), suggesting that the deleted region spans both of them. Apart from these putative deletions, the only other observed gene degeneration in RAG2 was in X. andrei RAG2 paralog ß2, which experienced a frameshift deletion.
Independent degeneration of RAG1 paralogs:
To test whether the unique degenerations in the 3' region of RAG1 (EVANS et al. 2005) could have occurred after a shared ancestral degeneration in the 5' coding region, RAG1 sequences were obtained from most of the coding region of most or all RAG1 paralogs of all known species of clawed frog (supplemental Figure 2 at http://www.genetics.org/supplemental/). Including previously identified degenerate paralogs (EVANS et al. 2005), a total of 17 degenerate RAG1 ß paralogs and 2 degenerate RAG1
paralogs were detected in various Xenopus species (Figure 2, supplemental Figure 2). The only shared stop codons or frameshift mutations that were identified were (1) one frameshift and one stop codon shared by X. pygmaeus paralog ß and X. ruwenzoriensis paralog ß3, (2) a stop codon shared by X. vestitus paralog ß1 and X. longipes paralog ß3, and (3) a stop codon shared by the X. ruwenzoriensis paralog ß1 and X. vestitus paralog ß2. Maximum likelihood reconstructions indicate that the first example is due to shared ancestry, but that the other two evolved independently.
Expression of at least one RAG1 paralog was confirmed in heart, brain, liver, testes, spleen, and bone marrow (Table 1). Xenopus laevis and X. gilli each express both of their RAG1 paralogs and neither is degenerate. Likewise, no degeneration was observed at the DNA level in either RAG1 paralog of some other tetraploids including X. clivii, X. largeni, and X. muelleri (supplemental Figure 2 at http://www.genetics.org/supplemental/).
|
2, for example, are expressed even though each one is degenerate (degenerations 4, 7, and 9, respectively, in Figures 2 and 4). Overall, under the assumptions of no resuscitation of silenced genes, phylogenetic analyses support a minimum of seven independent episodes of degeneration of the Xenopus RAG1 ß paralogs and two independent episodes in the Xenopus RAG1
paralogs (Figures 2 and 4).
|
paralogs are (8) X. ruwenzoriensis paralog
2 and (9) X. amieti paralog
2 (Figure 2). Both instances of degeneration of RAG1 paralog
occurred very recently and potentially after interactions between RAG1 paralog
and RAG2 paralog ß proteins were already established by pseudogenization of other RAG1 paralogs. Nine additional instances of degeneration of RAG1 ß paralogs are possible but their independence depends on whether ancestral gene silencing preceded autapomorphic degeneration of the coding regions of various paralogs (Figure 2). Expression of X. muelleri RAG1 paralog ß, for example, was not detected in multiple tissues (Table 1) and expression of this paralog may have been silenced ancestrally prior to the speciation of X. muelleri and X. borealis and coding-region degeneration in X. borealis (degeneration 1 in Figures 2 and 4). Likewise, a lack of detected expression of X. new octoploid RAG1 paralog ß2 could be explained by silenced expression of paralog ß in one of the tetraploid ancestors of X. vestitus, X. wittei, and X. new octoploid. If this were the case, then degeneration of RAG1 paralog ß1 and ß2 in X. vestitus, paralog ß2 in X. wittei, and paralog ß2 in X. new octoploid should be considered a single event (degeneration 2 in Figures 2 and 4), even though no shared degeneration of the coding region of these paralogs was observed (supplemental information 2 at http://www.genetics.org/supplemental/).
Parametric bootstrap tests strongly reject hypotheses of fewer independent degenerations in the ß lineage of RAG1 (P < 0.001). Of note is that the full sequence of X. boumbaensis RAG1 paralog ß1 was not obtained (supplemental Figure 2 at http://www.genetics.org/supplemental/) so the possibility of shared degeneration with another closely related paralog cannot be completely dismissed. Also of interest is the observation that in X. new octoploid, the intensity of paralog-specific polymorphisms on sequence chromatograms suggests that expression of RAG1 paralog
1 was higher than paralog
2 in the brain, whereas the opposite was observed in amplifications of RAG1 from heart cDNA from this species (Table 1), a pattern of expression that is suggestive of subfunctionalization (FORCE et al. 1999).
Significant bias to RAG1 degeneration:
Comparison of alternative parameterizations of a model of stochastic degeneration indicates that the rate of degeneration of the RAG1 ß lineage is significantly higher than the RAG1
lineage with a gamma rate heterogeneity model (P = 0.0274,
RAG1
= 6.5519,
RAG1ß = 50.0343,
RAG1
= 0.0082,
RAG1ß = 0.0042, d.f. = 2), or without one (P = 0.0054,
RAG1
= 4.8927,
RAG1ß = 34.4187, d.f. = 1). Additionally, the overall rate of degeneration of RAG1 is significantly higher than the rate of degeneration of RAG2 when modeled with a gamma rate heterogeneity model (P = 0.0431,
RAG1 = 31.2407,
RAG2 = 3.08511,
RAG1 = 0.0082,
RAG2 = 0.0096, d.f. = 2), or without one (P = 0.0323,
RAG1 = 7.8978,
RAG2 = 1.8141, d.f. = 1).
Because some species may have inherited RAG1 ß paralogs that were already degenerate, even if subsequent degeneration of RAG1 were unbiased, chimerical heterodimers would still be expected to occur. However, under the assumptions discussed above and given the observed pattern of deletion in RAG2, simulations indicate that the observed number of chimerical heterodimers that resulted from RAG1 degeneration is significantly in excess of expectations if RAG1 degeneration were unbiased (P < 0.0001, Figure 5).
|
Linkage of RAG1 paralog ß and RAG2 paralog ß has been demonstrated (GREENHALGH et al. 1993) and a similar linkage structure is suggested by a putative deletion that spanned RAG1 and RAG2 paralogs of an ancestor of X. amieti, X. ruwenzoriensis, X. boumbaensis, X. cf. boumbaensis, and X. longipes (deletion 12 in Figure 2). The possibility that recombination occurred between the intergenic region that separates linked alleles of RAG1 and RAG2 is difficult to conclusively rule out because a diploid Xenopus (2x = 18), which could confirm the ancestral synteny of RAG1 and RAG2 paralogs, is not available. However, phylogenetic congruence among RAG1, RAG2, and mtDNA, and the internal consistency between the
and ß genealogies of RAG1 support the contention that recombination between these paralogs is rare (Figure 2; EVANS et al. 2004, 2005). Recombination among paralogous alleles is expected to be rare if diploidization of these allopolyploid genomes occurred soon after their formation, which is typical of allopolyploid cotton, for example (CRONN et al. 1999).
| DISCUSSION |
|---|
|
|
|---|
Before the tetraploid Xenopus ancestor diversified to give rise to the extant species, it appears that one RAG2 paralogparalog
was deleted, leaving only the other RAG2 paralogparalog ßbut still two functional paralogs of RAG1,
and ß (Figures 2 and 4). Later, after speciation of this tetraploid ancestor without change in genome size and also after further episodes of allopolyploidization, paralogs of RAG1 in different species independently degenerated, making their copy number similar or equal to RAG2. Surprisingly, until very recently degeneration of RAG1 paralogs occurred exclusively in closely related members of paralog lineage ß. As a result, many Xenopus species have functional paralogs of RAG1 that are exclusively
paralogs, and their protein products must heterodimerize with proteins encoded by RAG2 ß paralogs (Figures 2 and 4). Moreover, in many species it appears that the functional copy of RAG1 is linked to a region where a paralog of RAG2 was deleted, and the functional copy of RAG2 is linked to a paralog of RAG1 that either degenerated or was deleted. While the tetraploid species X. laevis and X. gilli still express functional linked ß paralogs of RAG1 and RAG2 and also an
paralog of RAG1, in most other Xenopus species the only functional paralogs of RAG1 and RAG2 are unlinked, suggesting that each one was derived from a different diploid ancestor. In contrast, allotetraploid clawed frogs that evolved independently in the genus Silurana (4x = 40) retain both paralogs of RAG1 and of RAG2, all appear functional at the DNA level in the portions of these genes that were sequenced, and both RAG1 paralogs are expressed in at least one Silurana tetraploid (S. new tetraploid 1).
The rate of degeneration is significantly higher in RAG1 ß paralogs than in RAG1
paralogs and significantly higher in RAG1 than in RAG2. Simulations also indicate that, given the observed pattern of degeneration of RAG2, the probability of unbiased degeneration of RAG1 producing by chance such a high number of chimerical heterodimers derived from unlinked paralogs of RAG1 and RAG2 is very low. It is surprising that degeneration of RAG1 paralogs occurred in this manner because an allopolyploid origin of the tetraploid ancestor of Xenopus would suggest that unlinked paralogs share a shorter coevolutionary history than do the linked ones.
Unique characteristics of each subgenome could account for the biased degeneration of RAG1 paralog ß. Epigenetic phenomena after allopolyploidization could contribute to this bias, for example, if paralog expression were differently affected by asymmetric mobility of transposable elements in each subgenome. Alternatively, genetic explanations for nonrandom degeneration of RAG1 paralogs include (1) "intraparalog" phenomena, such as differences in dosage of each RAG1 paralog, or (2) "intermolecular" phenomena involving selection on interactions between the protein products of specific RAG1 paralogs and other molecules. This second genetic explanation includes scenarios involving a disadvantage (negative selection) or an advantage (positive selection) to interactions between proteins derived from specific paralogs of RAG1 and other molecules, which may or may not include proteins encoded by specific paralogs of RAG2.
Dosage as an explanation for nonrandom gene silencing of RAG1:
One explanation for biased degeneration of RAG1 is that, after allotetraploidization in Xenopus, expression of the RAG1 paralog that was derived from diploid ancestor
was greater than the one that was derived from diploid ancestor ß. If this difference in dosage meant that the RAG1 paralog
could operate alone in a polyploid genome whereas RAG1 paralog ß could not, then the former would be both necessary and sufficient. Under this scenario, the function of each RAG1 paralog would be identical and interchangeable, and the difference driving degeneration of RAG1 lineage ß would be one of quantity, not of quality, of proteins encoded by RAG1 paralogs. However, because copy number of RAG2 was reduced by deletion in an ancestor before any RAG1 copies degenerated, dosage constraints on RAG1 after allopolyploidization would have to have been imposed by other cofactors of RAG1.
Haplo-insufficient phenotypes result from reduced expression or activity of a heterozygous locus, and these phenotypes could stem from multiple mechanisms including altered enzymatic stoichiometry (VEITIA 2002). The importance of stoichiometric requirements is suggested in yeast in that proteins that form complexes are rarely members of large gene families and underexpression or overexpression of these genes can be deleterious (PAPP et al. 2003). In the same way, ancestral differences in gene dosage could influence allopolyploid phenotypes and ultimately affect the genetic fate of paralogs in an allopolyploid genome. For example, laboratory generated allopolyploids with two Z chromosomes from X. gilli and one W chromosome from X. laevis were mostly male whereas individuals with the same W chromosome but one Z chromosome from X. gilli and one Z chromosome from X. muelleri (or X. laevis) were only female (KOBEL 1996). Dosage could also be an important factor in the evolution of dominance. That wild-type alleles are generally dominant over new mutations could be a byproduct of selection for surplus capability that is needed to operate under heterogeneous conditions (WRIGHT 1934; CHARLESWORTH 1979; KACSER and BURNS 1981; ORR 1991; FORSDYKE 1994).
Selection on interactions between specific paralogs of RAG1 and other molecules:
If the tetraploid ancestor of Xenopus evolved through allopolyploidization, negative selection on molecular interactions could arise from DobzhanskyMuller incompatibilities (DOBZHANSKY 1937; MULLER 1942). However, DobzhanskyMuller incompatibilities between paralogs of RAG1 and RAG2 could explain the degeneration of specific RAG1 paralogs only if the paralogs that are currently functional were derived from the same diploid ancestor. But this does not appear to be the case because comparison to linked paralogs in X. laevis (GREENHALGH et al. 1993) suggests that in most species the only functional RAG1 and RAG2 paralogs are not in synteny. Thus, similar to the dosage hypothesis, if DobzhanskyMuller incompatibilities drove degeneration of RAG1 paralog ß, they would probably involve an interaction other than that with the protein product of the RAG2 gene. This could include interactions with other protein cofactors of V(D)J recombination or with the recombination signal sequences (RSSs) or the spacer regions between RSSs that flank variable, diversity, and joining segments (SAKANO et al. 1979; RAMSDEN et al. 1994).
If the tetraploid ancestor of Xenopus was allopolyploid, positive selection could account for this pattern of gene silencing in at least two ways. First, coadaptation of paralogs of RAG1 and RAG2 could have occurred in one of the diploid ancestors and then these paralogs could have been unlinked by recombination after allopolyploidization. However, the coadaptation hypothesis is disfavored for the same reason that DobzhanskyMuller incompatibilities between RAG1 and RAG2 are an unlikely explanation: recombination between alleles of different paralogs appears rare and functional RAG1 and RAG2 paralogs in many species therefore are probably derived from different diploid ancestors. A second possibility is that there is a performance advantage to proteinprotein interactions between RAG1 and RAG2 paralogs that are derived from different diploid ancestors. This scenario posits an advantage, or heterosis, to a combination of two evolutionarily naive proteins over a heterodimer whose constituents have a longer period of coevolutionary history. Heterosis is also suggested, for example, in Xenopus polyploids that are resistant to parasites that infect both of their parental taxa (JACKSON and TINSLEY 2003).
Explanations for heterosis include overdominance, dominance, or epistasis. The overdominance hypothesis posits that heterozygous loci are more fit than homozygous loci (whether dominant or recessive) (SHULL 1908; HULL 1945), and this could apply to allopolyploids that coexpress alleles from each parental species. In fact, allopolyploidization provides a way to maintain heterosis associated with heterozygosity because alleles from each parental species are forced to cosegregate in disomic allopolyploids. In the current case, however, the overdominance explanation for heterosis does not apply because, while many allopolyploids have a RAG1/RAG2 heterodimer composed of proteins derived from genes with different ancestry, each paralog that encodes the proteins in these heterodimers is homozygous with respect to their diploid ancestry. The dominance explanation for heterosis posits that if dominant alleles are more fit, hybrids (or allopolyploids) would benefit from the dominant alleles from each parent (DAVENPORT 1908). Dominance can result from differences in dosage, which was discussed earlier, and could also be a consequence of epistasis, so these explanations for heterosis are not mutually exclusive. Epistasis could lead to heterosis in an allopolyploid if there were a performance advantage to interactions between proteins derived from different ancestors.
Taken together, these observations suggest that allopolyploid transcriptomes are sculpted by natural selection on each subgenome. Mutational or regulatory differences that accumulated in each ancestor may be advantageous or deleterious, and their paralogs can be preserved or discarded after genome fusion on the basis of their performance and their interactions with molecules from the other subgenome. In some cases, pseudogenization is strongly biased with respect to ancestry over millions of years, and chimerical interactions between proteins from different ancestors may be favored over interactions between proteins that share a longer period of coevolution.
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
| FOOTNOTES |
|---|
| LITERATURE CITED |
|---|
|
|
|---|
ADAMS, K. L., and J. F. WENDEL, 2005 Polyploidy and genome evolution in plants. Curr. Opin. Plant Biol. 8: 135141.[CrossRef][Medline]
ADAMS, K. L., R. CRONN, R. PERCIFELD and J. F. WENDEL, 2003 Genes duplicated by polyploidy show unequal contributions to the transcriptome and organ-specific reciprocal silencing. Proc. Natl. Acad. Sci. USA 100: 4649.
ADAMS, K. L., R. PERCIFELD and J. F. WENDEL, 2004 Organ-specific silencing of duplicated genes in a newly synthesized cotton allotetraploid. Genetics 168: 22172226.
CHAIN, F. J. J., and B. J. EVANS, 2006 Molecular evolution of duplicate genes in Xenopus laevis is consistent with multiple mechanisms for their retained expression. PLoS Genet. 2: e56.[CrossRef][Medline]
CHARLESWORTH, B., 1979 Evidence against Fisher's theory of dominance. Nature 278: 848849.[CrossRef]
COMAI, L., 2000 Genetic and epigenetic interactions in allopolyploid plants. Plant Mol. Biol. 43: 387399.[CrossRef][Medline]
CRONN, R. C., R. L. SMALL and J. F. WENDEL, 1999 Duplicated genes evolve independently after polyploid formation in cotton. Proc. Natl. Acad. Sci. USA 96: 1440614411.
DAVENPORT, C. B., 1908 Degeneration, albinism and inbreeding. Science 28: 454455.
DOBZHANSKY, T., 1937 Genetics and the Origin of Species. Columbia University Press, New York.
EVANS, B. J., D. B. KELLEY, R. C. TINSLEY, D. J. MELNICK and D. C. CANNATELLA, 2004 A mitochondrial DNA phylogeny of clawed frogs: phylogeography on sub-Saharan Africa and implications for polyploid evolution. Mol. Phylogenet. Evol. 33: 197213.[CrossRef][Medline]
EVANS, B. J., D. B. KELLEY, D. J. MELNICK and D. C. CANNATELLA, 2005 Evolution of RAG-1 in polyploid clawed frogs. Mol. Biol. Evol. 22: 11931207.
FORCE, A., M. LYNCH, B. PICKETT, A. AMORES, Y. L. YAN et al., 1999 Preservation of duplicate genes by complementary, degenerative mutations. Genetics 151: 15311545.
FORSDYKE, D. R., 1994 The heat-shock response and the molecular basis of genetic dominance. J. Theor. Biol. 167: 15.[CrossRef][Medline]
GIBBS, M. J., J. S. ARMSTRONG and A. J. GIBBS, 2000 Sister-Scanning: a Monte Carlo procedure for assessing signals in recombinant sequences. Bioinformatics 16: 573582.
GOLDMAN, N., J. P. ANDERSON and A. G. RODRIGO, 2000 Likelihood-based tests of topologies in phylogenetics. Syst. Biol. 49: 652670.[CrossRef][Medline]
GREENHALGH, P., C. E. M. OLESEN and L. A. STEINER, 1993 Characterization and expression of Recombination Activating Genes (RAG-1 and RAG-2) in Xenopus laevis. J. Immunol. 151: 31003110.[Abstract]
GU, X., 2003 Evolution of duplicate genes versus genetic robustness against null mutations. Trends Genet. 19: 354356.[CrossRef][Medline]
HUELSENBECK, J. P., D. M. HILLIS and R. JONES, 1996 Parametric bootstrapping in molecular phylogenetics: applications and performance, pp. 1945 in Molecular Zoology: Advances, Strategies, and Protocols, edited by J. D. FERRARIS and S. R. PALUMBI. Wiley-Liss, New York.
HUGHES, M. K., and A. L. HUGHES, 1993 Evolution of duplicate genes in a tetraploid animal, Xenopus laevis. Mol. Biol. Evol. 10: 13601369.[Abstract]
HULL, F. G., 1945 Recurrent selection and specific combining ability in corn. J. Am. Soc. Agronomy 37: 134145.
JACKSON, J. A., and R. C. TINSLEY, 2003 Parasite infectivity to hybridising host species: A link between hybrid resistance and allopolyploid speciation? Int. J. Parasitol. 33: 137144.[CrossRef][Medline]
JIANG, C.-X., P. W. CHEE, X. DRAYE, P. L. MORRELL, C. W. SMITH et al., 2000 Multilocus interactions restrict gene introgression in interspecific populations of polyploid Gossypium (cotton). Evolution 54: 798814.[CrossRef][Medline]
KACSER, H., and J. A. BURNS, 1981 The molecular basis of dominance. Genetics 97: 639666.
KAPITONOV, V., and J. JURKA, 2005 RAG1 core and V(D)J recombination signal sequences were derived from Transib transposons. PLoS Biol. 3: e181.[CrossRef][Medline]
KOBEL, H. R., 1996 Allopolyploid speciation, pp. 391401 in The Biology of Xenopus, edited by R. C. TINSLEY and H. R. KOBEL. Clarendon Press, Oxford.
LEE, H.-S., and Z. J. CHEN, 2001 Protein-coding genes are epigenetically regulated in Arabidopsis polyploids. Proc. Natl. Acad. Sci. USA 98: 67536758.
LEVY, A. A., and M. FELDMAN, 2004 Genetic and epigenetic reprogramming of the wheat genome upon allopolyploidization. Biol. J. Linn. Soc. 82: 607613.[CrossRef]
LIU, B., and J. F. WENDEL, 2002 Non-mendelian phenomena in allopolyploid genome evolution. Curr. Genomics 3: 489505.[CrossRef]
LIU, B., and J. F. WENDEL, 2003 Epigenetic phenomena and the evolution of plant allopolyploids. Mol. Phylogenet. Evol. 29: 365379.[CrossRef][Medline]
LIU, B., C. L. BRUBAKER, G. MERGEAI, R. C. CRONN and J. F. WENDEL, 2001 Polyploid formation in cotton is not accompanied by rapid genomic changes. Genome 44: 321330.[Medline]
LOUMONT, C., 1983 Deux especes nouvelles de Xenopus du Cameroun (Amphibia, Pipidae). Rev. Suisse Zool. 90: 169177.
LUKENS, L. N., P. A. QUIJADA, J. UDALL, J. C. PIRES, M. E. SCHRANZ et al., 2004 Genome redundancy and plasticity within ancient and recent Brassica crop species. Biol. J. Linn. Soc. 82: 665674.[CrossRef]
LYNCH, M., and A. FORCE, 2000 The probability of duplicate gene preservation by subfunctionalization. Genetics 154: 459473.
LYNCH, M., M. O'HELY, B. WALSH and A. FORCE, 2001 The probability of preservation of a newly arisen gene duplicate. Genetics 159: 17891804.
MADDISON, D. R., and W. P. MADDISON, 2000 MacClade. Sinauer Associates, Sunderland, MA.
MADLUNG, A., R. MASUELLI, B. WATSON, S. H. REYNOLDS, J. DAVISON et al., 2002 Remodeling of DNA methylation and phenotypic and transcriptional changes in synthetic Arabidopsis allotetraploids. Plant Physiol. 129: 733746.
MARTIN, D., and E. RYBICKI, 2000 RDP: detection of recombination amongst aligned sequences. Bioinformatics 16: 562563.
MAYNARD SMITH, J., 1992 Analyzing the mosaic structure of genes. J. Mol. Evol. 34: 126129.[Medline]
MOSCONE, E. A., M. A. MATZKE and A. J. M. MATZKE, 1996 The use of combined FISH/GISH in conjunction with DAPI counterstaining to identify chromosomes containing transgene inserts in amphidiploid tobacco. Chromosoma 105: 231236.[Medline]
MULLER, H. J., 1942 Isolating mechanisms, evolution and temperature. Biol. Symp. 6: 71125.
NYLANDER, J. A. A., F. RONQUIST, J. P. HUELSENBECK and J. L. NIEVES-ALDREY, 2004 Bayesian phylogenetic analysis of combined data. Syst. Biol. 53: 4767.[CrossRef][Medline]
OHNO, S., 1970 Evolution by Gene Duplication. Springer-Verlag, Berlin.
ORR, H. A., 1991 A test of Fisher's theory of dominance. Proc. Natl. Acad. Sci. USA 88: 1141311415.
OZKAN, H., A. A. LEVY and M. FELDMAN, 2001 Allopolyploidy-induced rapid genomic evolution of the wheat (Aegilops-Triticum) group. Plant Cell 13: 17351747.
PADIDAM, M., S. SAWYER and C. M. FAUQUET, 1999 Possible emergence of new geminiviruses by frequent recombination. Virology 265: 218225.[CrossRef][Medline]
PAGEL, M., 1994 Detecting correlated evolution on phylogenies: a general method for the comparative analysis of discrete characters. Proc. R. Soc. Lond. Ser. B 255: 3745.
PAPP, B., C. PÁL and L. D. HURST, 2003 Dosage sensitivity and the evolution of gene families in yeast. Nature 424: 194197.[CrossRef][Medline]
POSADA, D., 2002 Evaluation of methods for detecting recombination from DNA sequences: empirical data. Mol. Biol. Evol. 19: 708717.
POSADA, D., and K. A. CRANDALL, 2001 Evaluation of methods for detecting recombination from DNA sequences: computer simulations. Proc. Natl. Acad. Sci. USA 98: 1375713762.
RAMBAUT, A., and N. C. GRASSLY, 1997 Seq-Gen: an application for the Monte Carlo simulation of DNA sequence evolution along phylogenetic trees. Comput. Appl. Biosci. 13: 235238.
RAMSDEN, D. A., K. BAETZ and G. E. WU, 1994 Conservation of sequence in recombination signal sequence spacers. Nucleic Acids Res. 22: 17851796.
SADOFSKY, M. J., J. E. HESSE, J. F. MCBLANE and M. GELLERT, 1993 Expression and V(D)J recombination activity of mutated RAG-1 proteins. Nucleic Acids Res. 21: 56445650.
SAKANO, H., K. HUPPI, G. HEINRICH and S. TONEGAWA, 1979 Sequences at the somatic recombination sites of immunoglobulin light-chain genes. Nature 280: 288294.[CrossRef][Medline]
SALMINEN, M. O., J. K. CARR, D. S. BURKE and F. E. MCCUTCHAN, 1995 Identification of breakpoints in intergenotypic recombinants of HIV-1 by bootscanning. AIDS Res. Hum. Retroviruses 11: 14231425.[Medline]
SHAKED, H., K. KASHKUSH, H. OZKAN, M. FELDMAN and A. A. LEVY, 2001 Sequence elimination and cytosine methylation are rapid and reproducible responses of the genome to wide hybridization and allopolyploidy in wheat. Plant Cell 13: 17491759.
SHULL, G. H., 1908 The composition of a field of maize. Am. Breed. Assoc. 4: 296301.
SKALICKÁ, K., K. Y. LIM, R. MATYASEK, M. MATZKE, A. R. LEITCH et al., 2005 Preferential elimination of repeated DNA sequences from the paternal, Nicotiana tomentosiformis genome donor of a synthetic, allotetraploid tobacco. New Phytol. 166: 291303.[CrossRef][Medline]
SOLTIS, D. E., P. S. SOLTIS, J. C. PRIRES, A. KOVARIK, J. A. TATE et al., 2004 Recent and recurrent polyploidy in Tragopogon (Asteraceae): cytogenetic, genomic and genetic comparisons. Biol. J. Lin. Soc. 82: 485501.[CrossRef]
SWOFFORD, D. L., 2002 Phylogenetic Analysis Using Parsimony (* and Other Methods), Version 4. Sinauer Associates, Sunderland, MA.
TYMOWSKA, J., 1991 Polyploidy and cytogenetic variation in frogs of the genus Xenopus, pp. 259297 in Amphibian Cytogenetics and Evolution, edited by D. S. GREEN and S. K. SESSIONS. Academic Press, San Diego.
VEITIA, R. A., 2002 Exploring the etiology of haploinsufficiency. BioEssays 24: 175184.[CrossRef][Medline]
VOLKOV, R. A., N. V. BORISJUK, I. I. PANCHUK, D. SCHWEIZER and V. HEMLEBEN, 1999 Elimination and rearrangement of parental rDNA in the allotetraploid Nicotiana tabacum. Mol. Biol. Evol. 16: 311320.[Abstract]
WANG, J., L. TIAN, A. MADLUNG, H.-S. LEE, M. CHEN et al., 2004 Stochastic and epigenetic changes of gene expression in Arabidopsis polyploids. Genetics 168: 22172226.
WENDEL, J. F., A. SCHNABEL and T. SEELANAN, 1995 Bidirectional interlocus concerted evolution following allopolyploid speciation in cotton (Gossypium). Proc. Natl. Acad. Sci. USA 92: 280284.
WRIGHT, S., 1934 Molecular and evolutionary theories of dominance. Am. Nat. 63: 2453.