- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Email this article to a friend
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Katju, V.
- Articles by Lynch, M.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Katju, V.
- Articles by Lynch, M.
The Structure and Early Evolution of Recently Arisen Gene Duplicates in the Caenorhabditis elegans Genome
Vaishali Katjua and Michael Lynchaa Department of Biology, Indiana University, Bloomington, Indiana 47405
Corresponding author: Vaishali Katju, Jordan Hall 142, Indiana University, 1001 E. Third St., Bloomington, IN 47405., vkatju{at}bio.indiana.edu (E-mail)
Communicating editor: S. P. OTTO
| ABSTRACT |
|---|
The significance of gene duplication in provisioning raw materials for the evolution of genomic diversity is widely recognized, but the early evolutionary dynamics of duplicate genes remain obscure. To elucidate the structural characteristics of newly arisen gene duplicates at infancy and their subsequent evolutionary properties, we analyzed gene pairs with
10% divergence at synonymous sites within the genome of Caenorhabditis elegans. Structural heterogeneity between duplicate copies is present very early in their evolutionary history and is maintained over longer evolutionary timescales, suggesting that duplications across gene boundaries in conjunction with shuffling events have at least as much potential to contribute to long-term evolution as do fully redundant (complete) duplicates. The median duplication span of 1.4 kb falls short of the average gene length in C. elegans (2.5 kb), suggesting that partial gene duplications are frequent. Most gene duplicates reside close to the parent copy at inception, often as tandem inverted loci, and appear to disperse in the genome as they age, as a result of reduced survivorship of duplicates located in proximity to the ancestral copy. We propose that illegitimate recombination events leading to inverted duplications play a disproportionately large role in gene duplication within this genome in comparison with other mechanisms.
THE enormous disparity in the genome sizes of extant organisms is a striking reminder that genic and genome-wide duplications are a ubiquitous and evolutionarily important feature of genomes. Although the evolutionary significance of gene duplicates had been recognized by early geneticists and evolutionary biologists (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Despite a widespread acceptance of the significance of gene duplication and extensive theoretical work relating to aspects of persistence and functionality of duplicated genes, empirical insight into the early evolution of newly arisen gene duplicates has been limited. Past studies of natural populations have revealed a handful of relatively young gene duplicates and intraspecific polymorphism for gene copy number (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Population-genetic models have been employed to study the evolutionary dynamics of gene duplicates with regard to the probabilities of fixation (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
To test the common assumption of complete structural resemblance between gene duplicates at birth, we analyzed a population of recent gene duplicates (290 duplicate pairs with 10% or less divergence at synonymous sites) in the Caenorhabditis elegans genome. We posed several additional questions with respect to the features of gene duplicates at conception and early in their history. First, to what extent does a duplicated gene copy structurally resemble the progenitor copy at the nucleotide level, and how is this structural homology altered with time? Second, are the introns of the progenitor gene maintained in the duplicate copy and does reverse transcription of processed mRNA play a significant role in the gene duplication process? Third, where do duplicate copies tend to reside at or close to the time of conception, and does their location alter over evolutionary time? Finally, what is the approximate span (length) of the duplicated stretch of DNA?
| MATERIALS AND METHODS |
|---|
Identification of gene duplicates within the C. elegans genome:
A total of 333 gene-duplicate pairs with KS (number of substitutions per synonymous site) values ranging from 0.00 to 0.10 within the C. elegans genomic data set of ![]()
![]()
![]()
Grouping of gene-duplicate pairs into cohorts:
To facilitate the comparison of putatively different cohorts of gene duplicates and discern evolutionary change with increasing sequence divergence, the 290 pairs of gene duplicates were originally classified into six cohorts on the basis of divergence at synonymous sites (KS = 0, 0 < KS
0.01, 0.01 < KS
0.03, 0.03 < KS
0.05, 0.05 < KS
0.07, and 0.07 < KS
0.10). However, statistical analyses revealed that the salient differences occurred between the putative youngest cohort (KS = 0) and duplicate pairs with 0 < KS
0.10. Hence, we present only results for the data set as classified into two cohorts, namely KS = 0 and 0 < KS
0.10 (68 and 222 gene-duplicate pairs, respectively).
Accession and analysis of sequences:
For each gene within a duplicate pair, two nucleotide sequence files were obtained from WormBase: (i) the unspliced version of the gene containing all exons and introns as well as 1 kb of flanking region in both the 5' and 3' directions and (ii) the predicted spliced version lacking introns. In addition, information about genomic location, strand orientation, cDNAs, and potential function was collected from the gene sequence report. All sequence analysis was implemented in the BioEdit Sequence Alignment Editor, Version 5.0.9 (![]()
![]()
Classification of duplicate pairs into structural categories:
On the basis of a direct comparison of the ORF nucleotide sequence of the two copies within a gene-duplicate pair, we classified duplicates as exhibiting one of three categories of structural resemblance, namely (i) complete, (ii) partial, or (iii) chimeric (Fig 1). Gene duplicates exhibiting nucleotide sequence homology between the initiation and the termination codons were categorized as having a complete structure. An assignment to this category is straightforward for duplicates with amino acid sequences of identical length. In the case of gene duplicates with amino acid sequences of differing length, duplicate pairs were designated as complete if the shorter copy exhibited nucleotide sequence homology to the lengthier copy throughout the latter's ORF, irrespective of differently demarcated exon-intron and flanking region(s) boundaries. The disruption of sequence homology between the two copies as a result of indels (including intron loss in one copy) was ignored as long as nucleotide sequence homology between the two copies was resumed within the ORF boundaries of the lengthier reference sequence, before the start of flanking region(s). Another class was composed of duplicate pairs with gene copies of differing amino acid sequence length wherein the entire ORF of the shorter gene was homologous to the longer gene's ORF, but the longer gene had a unique ORF sequence absent in the shorter gene. These duplicate pairs were classified as exhibiting a partial structure. A third class was composed of duplicate pairs with gene copies of differing amino acid sequence length wherein sequence homology between the two copies was disrupted within the ORFs of both genes, such that both had some unique ORF sequence to the exclusion of the other copy. These were classified as exhibiting a chimeric structure. Simply put, gene-duplicate copies with complete resemblance were homologous over their entire ORFs; those with partial resemblance had one copy with a unique ORF sequence that was absent in the other copy; and those with chimeric resemblance comprised pairs in which both copies had a unique ORF sequence to the exclusion of the other copy. The observed frequencies of the three structural categories of gene duplicates within the two cohorts (KS = 0 and 0 < KS
0.10) were compared using a G-test (likelihood-ratio test) for goodness of fit (![]()
|
Two issues deserve mention with respect to our structural classification scheme. First, our nucleotide sequence analysis revealed a frequent occurrence of small insertion/deletions (indels) in one copy relative to the other for duplicate pairs with increasing divergence at synonymous sites. These indels ranged from a few to several hundred base pairs and were located in both the coding and noncoding regions. Insofar as sequence homology between the two duplicate copies was resumed on both sides of the indel within the ORF and flanking regions, it was ignored under the parsimonious assumption that it occurred in the postduplication period as a mutation event and does not accurately reflect structural resemblance at origin. The second issue relates to gene annotation. The exon-intron organization and therefore the structure of many annotated genes within a genome are essentially predicted by computer programs such as Genefinder. Despite sequence homology, two duplicate copies can be assigned different exon-intron boundaries due to either inaccurate predictions by such programs or a disruption of the reading frame in one of the duplicate copies as a result of mutation(s). We frequently encountered cases wherein homologous nucleotide sequences are alternatively depicted as an exon and a flanking region in the two duplicate copies. Similarly, a genic region can be depicted alternatively as an exon and an intron in the two duplicate copies. To avoid the influence of erroneous annotations by gene-predicting programs, our method of structural classification is directly based on comparisons of nucleotide sequences between the start and termination codons of the two duplicates, irrespective of exon-intron predictions. In addition, we collected cDNA information from WormBase for all putative gene duplicates in our data set to indirectly verify the accuracy of gene predictions.
Span of duplication:
Another aspect of gene duplication is concerned with understanding the frequency distribution of the span of duplication. In this study, this measure is restricted to duplicated nucleotide stretches containing identifiable open reading frames. The length of sequence homology (in kilobases) between two duplicate copies was taken to be the span of duplication. Of the 290 pairs of gene duplicates, the majority of the cases (276 of 290) involved the duplication of a single gene. However, 7 cases involved the duplication of multiple loci with intervening flanking regions. These linked sets of duplicated loci were treated as a single duplication event and assigned a single value for duplication span. Therefore, the KS = 0 and 0 < KS
0.10 cohorts comprise 62 and 221 duplication events, respectively.
As mentioned earlier, duplicate genes with increasing synonymous-site divergence often have numerous indels within their homologous regions, ranging in length from a few to several hundred base pairs. Under these circumstances, we generated two values for duplication span by separately considering each of the two duplicate loci in turn as the ancestral copy. The lower of the two duplication span values was included in the analysis. This may lead to a slight deflation in our estimate of the average length of duplication. For logistic purposes, we had also assumed that the duplication was terminated at the points beyond which no homology was apparent between the two gene copies for a continuous stretch of 1 kb on either end. This methodology is therefore biased against the detection of large indels. In other words, if an insertion/deletion of >1 kb occurred in one copy, we would fail to detect the resumption of homology between the two copies beyond the indel location. This too would lead to a deflation in our estimate of the length of duplication. Therefore, the values reported here are minimal estimates of duplication span.
Physical organization of duplicates residing on the same chromosome:
We determined the relative strand orientation and physical organization of gene-duplicate copies located on the same chromosome. Duplicates on the same chromosome were categorized as having direct orientation if the direction of transcriptional orientation was preserved in both copies (i.e., both duplicates were located on the positive strand or on the negative strand). Duplicates with inverse orientation on the same chromosome had one copy each on the positive and negative strands. Additionally, with respect to physical organization on the same chromosome, duplicates were classified as tandem if there were no intervening genes between the two copies or nontandem if intervening gene(s) were present. The observed frequencies of (i) direct vs. inverse duplicates and (ii) tandem vs. nontandem duplicates across the two cohorts (KS = 0 and 0 < KS
0.10) were compared using a G-test (likelihood-ratio test) for goodness of fit (![]()
Measures of genomic movement of one duplicate relative to the other:
To determine if the genomic location of duplicate copies is altered over evolutionary time, two measures of location and dispersion were calculated as a function of divergence at synonymous sites: (i) the relative frequencies of duplicate pairs residing on the same vs. a different chromosome and (ii) the physical distance separating duplicate copies residing on the same chromosome.
With respect to chromosomal location, we calculated the relative frequencies of gene-duplicate pairs with both member copies residing on the same chromosome vs. different chromosomes across both cohorts of gene duplicates (KS = 0 and 0 < KS
0.10). The observed frequencies of the two categories of chromosomal location across the two cohorts were compared using a G-test (likelihood-ratio test) for goodness of fit (![]()
The physical distance (in base pairs) between duplicate copies on the same chromosome was plotted against synonymous-site divergence between gene duplicates to determine if duplicates on the same chromosome increasingly disperse with evolutionary time, which would be suggestive of intrachromosomal secondary movement by one copy or differential loss in the postduplication period. We independently calculated the correlation coefficient between physical distance and synonymous-site divergence for (i) all 180 gene-duplicate pairs on the same chromosome across both cohorts (0
KS
0.10) and (ii) 125 gene-duplicate pairs within the 0 < KS
0.10 cohort only. We tested for a significant sample correlation coefficient by employing (i) the nonparametric Kendall's coefficient of rank correlation test and (ii) the product-moment correlation coefficient (under the assumption of normality).
| RESULTS |
|---|
Early presence of structural heterogeneity between gene duplicates:
Structural comparisons revealed that duplicates with partial and chimeric structural resemblance are present in high frequency even within the cohort with no synonymous-site divergence in homologous regions. Together, gene-duplicate pairs exhibiting partial or chimeric structure between the two copies comprise 50 and 64% of all duplicate pairs in the KS = 0 and 0 < KS
0.10 cohorts, respectively (Fig 2). A G-test for goodness of fit revealed no significant difference in the frequencies of the three structural categories between the two cohorts of gene duplicates (Gadj = 4.24, d.f. = 2, 0.1< P < 0.5).
|
Currently, 70% (203/290) of all predicted gene-duplicate pairs in our data set have cDNA sequence identified for at least one copy of a gene-duplicate pair. There were no significant differences among structural categories or KS classes in the frequency of genes for which cDNA has been identified.
Minor role of reverse transcription in the origin of gene duplicates:
The structural comparisons of gene duplicates also addressed the extent to which reverse transcription of processed mRNA contributes to gene duplication. Of the 290 gene-duplicate pairs analyzed, 278 were gene duplicates with introns in at least one gene copy. Intron(s) are preserved along the region of homology between the two copies in all but 3 of these 278 cases (
99%; Fig 3).
|
The first such case involves the duplicate pair C54C6.1/W01D2.1 wherein the two copies have different chromosomal locations and differ by one nonsynonymous substitution and a 3-bp indel (Fig 3). Both genes are members of the ribosomal protein L37 protein family. Gene locus W01D2.1, the lengthier copy, is composed of two exons separated by an intron of 300 bp. Locus C54C6.1 is composed of a single exon that is homologous to the two exons of locus W01D2.1, with the precise deletion of the intron.
The second case involves the gene-duplicate pair C03A7.14/C03A7.7 with 5.6% substitutions per synonymous site (Fig 3). The lengthier and shorter loci comprise three and two exons, respectively. Exons 1 and 2 of the lengthier copy (C03A7.14) are fused as one exon minus the intervening intron in the shorter copy (C03A7.7). The two duplicates are separated by nine intervening genes and a physical distance of
31 kb on chromosome V.
The third case involves the gene-duplicate pair B0035.2/C47A4.1 with 6.9% substitutions per synonymous site (Fig 3). The two loci display a chimeric structure relative to one another, each having unique exons to the exclusion of the other locus. The region of homology toward the 3' end is composed of three exons with intervening introns in one gene and a single exon minus both introns in the other gene. The two duplicates are separated by a physical distance of 2.4 Mb on chromosome IV.
Predominance of duplications involving short sequence tracts:
Fig 4 displays the distribution of duplication spans for all 283 duplication events analyzed. With the exception of four cases involving duplicated clusters of genes spanning
10.1, 15.8, 23.5, and 108.3 kb, respectively, all duplication span values were <8.7 kb. The L-shape of the distribution implies that duplications involving relatively short tracts of sequence are extremely frequent. In contrast, lengthier duplication events, including partial chromosomal duplications, are relatively rare. In this data set 70% (199/283) of all duplication events resulted in a duplication span of <2 kb. The >0.5- to 1-kb duplication span class has the highest frequency of duplicate pairs (57/283 = 20%), followed by the >1- to 1.5-kb class (52/283 = 18% of all duplicate pairs). The median value for duplication span within this data set was 1419 bp.
|
Increase in genomic distance between duplicates over evolutionary time:
With respect to chromosomal location, we calculated the frequencies of gene-duplicate pairs with both member copies residing (i) on the same chromosome vs. (ii) on different chromosomes. Approximately 89% (55/62) of duplicate pairs comprising the KS = 0 cohort had both copies residing on the same chromosome compared to only 56% (125/221) in the 0 < KS
0.10 cohort (Fig 5). A G-test for goodness of fit revealed chromosomal location to be highly associated with the degree of divergence at synonymous sites (Gadj = 24.6, d.f. = 1, P << 0.0001). With respect to physical distance between duplicate copies residing on the same chromosome, Kendall's coefficient of rank correlation revealed a significant positive correlation between physical distance and sequence divergence at synonymous sites if all 180 gene-duplicate pairs are considered (
= 0.317; P < 0.0001; Fig 6). The median physical distance between duplicates residing on the same chromosome was 1138 and 8644 bp for the KS = 0 and 0 < KS
0.10 cohorts, respectively. Therefore, not only are gene duplicates within the KS = 0 cohort more likely to occur on the same chromosome relative to older cohorts, but also they tend to be closely spaced together on the same chromosome (often as tandem loci; see Table 1). Hence, these distance measures are consistent with a pattern of increased genomic distance between gene duplicates over evolutionary time.
|
|
|
A gradual increase in genomic distance (greater likelihood of residence on different chromosomes and/or increased distance between gene copies on the same chromosome) between gene duplicates with increased synonymous-site divergence would support secondary movement by one or both copies in the postduplication period as the mechanism for genomic dispersal. Conversely, a lack of correlation between distance measures and synonymous-site divergence would argue against the hypothesis of secondary movement leading to genomic dispersal of gene duplicates.
Logistic regression analysis on the chromosomal location data found no significant effect of synonymous-site divergence on chromosomal location of gene duplicates (Wald test statistic = 0.181, d.f. = 1, P = 0.67). When gene duplicates are broken up into smaller cohorts (0 < KS
0.01, 0.01 < KS
0.03, 0.03 < KS
0.05, 0.05 < KS
0.07, 0.07 < KS
0.10), there is a large jump in frequency of duplicates residing on different chromosomes from the KS = 0 to the next cohort (0 < KS
0.01) but no further increase in older cohorts. These results argue against the hypothesis of secondary movement by gene duplicates to different chromosomes with increasing evolutionary time.
Likewise, we find no evidence for a gradual increase in distance between duplicate copies residing on the same chromosome with evolutionary time. As mentioned earlier, we found a significant positive relationship between synonymous-site divergence and physical distance between gene duplicates residing on the same chromosome. However, if gene-duplicate pairs within the KS = 0 cohort (55 pairs) are removed from the data set, the correlation between the two variables is no longer evident (
= 0.055; P = 0.366; Fig 6). Significance tests of the sample correlation coefficient under the assumption of normality yielded P values similar to the nonparametric Kendall's coefficient of rank correlation test (see Fig 6). Thus, there is a significant excess of closely spaced gene duplicates in the KS = 0 cohort and this excess alone might have caused the positive correlation between KS and physical distance between gene duplicates on the same chromosome. The difference in median distance between the KS = 0 and the 0 < KS
0.10 is quite dramatic. For gene duplicates with KS = 0, half of the duplicates are within 6.5 kb of each other, whereas half of the duplicates in the 0 < KS
0.10 group are within 32 kb of each other. Given that most gene duplicates in the 0 < KS
0.10 cohort are still relatively close to each other, the dispersion of duplicates uniformly across a chromosome does not explain the lack of relationship between KS and distance.
The majority of gene duplicates in the KS = 0 cohort occur on the same chromosome as tandem genes with inverse transcriptional orientation:
As demonstrated earlier, the majority of gene-duplicate pairs in the KS = 0 cohort have both gene copies located on the same chromosome (Fig 5). Table 1 represents the relative strand orientation and physical organization of 180 gene-duplicate pairs with both copies on the same chromosome. Within the KS = 0 cohort, we observe the following frequencies: inverse tandem (45%) > direct tandem and inverse nontandem (24% each) > direct nontandem (7%). The following pattern is observed for the 0 < KS
0.10 cohort: inverse nontandem (40%) > direct nontandem (34%) > direct tandem and inverse tandem (13% each).
We observed a striking difference between the two cohorts of gene duplicates with respect to strand orientation of the two copies. The KS = 0 cohort of gene duplicates had a twofold excess of duplicate copies in inverse orientation (69%) relative to those exhibiting direct orientation (31%). Within the 0 < KS
0.10 cohort, gene duplicates are equally likely to occur in direct vs. inverse orientation (47 and 53%, respectively). A G-test for goodness of fit comparing the two duplicate cohorts rejects the null hypothesis that the frequencies of strand orientation are independent of sequence divergence at synonymous sites (Gadj = 4.199, d.f. = 1, P < 0.05).
Likewise, the two gene-duplicate cohorts also exhibit differences with respect to physical organization when both copies are present on the same chromosome. The majority (69%) of the 55 gene-duplicate pairs on the same chromosome within the KS = 0 cohort appear as tandemly organized loci. In contrast, the majority (74%) of the 125 gene-duplicate pairs on the same chromosome within the 0 < KS
0.10 cohort exhibit a nontandem organization. A G-test for goodness of fit comparing the two cohorts rejects the null hypothesis that physical organization on the same chromosome (tandem vs. nontandem) is independent of sequence divergence at synonymous sites (Gadj = 30.341, d.f. = 1, P << 0.001).
| DISCUSSION |
|---|
We have focused on 290 gene-duplicate pairs in the C. elegans genome with <10% sequence divergence at synonymous sites to address questions relating to the structure and genomic location of presumably young gene duplicates and the possible mechanisms of gene duplication. We conducted our analysis under the initial assumption that the number of substitutions per synonymous site (KS) is an appropriate indicator of the evolutionary age of a duplicate pair, at least for low estimates of KS. However, concerted evolution, particularly gene conversion, has the potential to homogenize the sequence of previously diverged duplicate copies in relation to one another, so that they appear evolutionarily young. Unfortunately, the methods to detect and test for gene conversion in the absence of a close outgroup sequence do not work when there is high sequence identity between the copies (![]()
![]()
![]()
![]()
Slippage and unequal exchange are expected to result in tandem gene duplicates with direct orientation and these mechanisms are often invoked as an explanation for closely spaced gene duplicates. On the basis of an apparent excess of tandem duplicates in a partial-genome analysis of gene duplicates in C. elegans, it was concluded that slippage or unequal crossing over rather than transposition was the primary gene duplication mechanism within this genome (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Several models of inverted duplications have been proposed, especially in conjunction with the phenomenon of gene amplification in mammalian cells (![]()
![]()
![]()
![]()
![]()
![]()
It is quite possible that slippage or unequal exchange does indeed lead to a large number of tandem duplications. Direct tandem repeats, however, are expected to be very unstable and unless under selection from the outset, are easily lost by the very same mechanisms that created them in the first place (![]()
![]()
![]()
![]()
Gene-duplicate copies typically reside close to one another in the genome, most often as tandem and inversely oriented genes on the same chromosome. The observation that gene duplicates often reside on the same chromosome has been noted in other eukaryotic genomes as well (![]()
![]()
![]()
![]()
If gene duplicates are being moved apart predominantly by interchromosomal rearrangements (secondary movement), we would expect a progressive increase with age (KS) in the frequency of duplicate pairs with the two copies located on different chromosomes. The chromosomal location data analyzed here do indeed show a significant enrichment with time of duplicate pairs with the two copies located on different chromosomes. However, the increase in the frequency of gene duplicates on separate chromosomes with increasing sequence divergence is primarily due to the fact that gene duplicates within the KS = 0 cohort are overwhelmingly located on the same chromosome. When these are excluded from the data, no further increase in frequency of gene duplicates occurs on separate chromosomes with increasing synonymous-site divergence. Similar results emerge when the distance between duplicates residing on the same chromosome is analyzed, in that there is no relationship between synonymous-site divergence and distance when the KS = 0 class is excluded. There is no doubt that chromosomal rearrangements (inversions and translocations) occur frequently in the C. elegans genome (![]()
0.01). However, the mechanisms responsible for moving gene duplicates apart presumably cannot distinguish between young and old gene duplicates and stop operating once one of the duplicated pair has been hit by a point mutation.
There are two alternatives to the secondary movement explanation for the relationship between synonymous-site divergence and distance between gene duplicates, namely (i) differential retention of gene duplicates and (ii) gene conversion; the frequency of both may depend on the distance between duplicate copies. First, gene duplicates in genomic proximity to the cognate copy are probably less stable than duplicates far apart. Although all gene duplicates can be lost by a simple deletion, closely spaced gene duplicates can also be lost by slippage or recombination with the cognate partner resulting in unequal exchange. For example, tandem duplications in yeast are known to be extremely unstable, given the high level of homologous recombination within this genome (![]()
![]()
![]()
![]()
![]()
The nontandem duplications in our data set are more likely to occur on the same chromosome than on different chromosomes. This suggests that the duplication mechanisms involve interaction between sites that are in physical proximity in the nucleus. Such mechanisms could involve replicative processes such as "transposition without transposase" (![]()
![]()
![]()
We found only three pairs of gene duplicates for which intron(s) were missing in one member relative to the other copy. Such a condition could result from either the insertion of intron(s) in one copy or their precise deletion in the other duplicate. In each of these three cases, the duplicate copies were located either on different chromosomes or at a considerable distance away from each other on the same chromosome. Since reverse-transcribed genes are expected to randomly reintegrate into the genome, these cases may represent gene duplication by reverse transcription of processed or partially processed mRNA. An analysis of pseudogenes in the C. elegans genome (which were excluded from this study) found that only a small fraction (10%) of these appear to be processed (![]()
![]()
![]()
The average gene length in C. elegans is
2.5 kb (![]()
![]()
1.4 kb, and 70% (199/283) of all duplication events resulted in a duplication span of <2.5 kb (Fig 4). The L-shaped frequency distribution of duplication spans indicates that, aside from a few lengthy regional duplications, the average duplication event within this genome is fairly localized and spawns relatively short tracts of duplicate sequence that may not encompass entire genes. These results lend credence to the idea that partial gene duplications are to be expected (![]()
The mechanisms responsible for gene duplication (except for reverse transcription) are unlikely to respect gene boundaries. Many, if not most, gene duplications should therefore include gene fractions rather than complete copies, resulting in either a partial copy of the original or a chimeric gene fusion of a partial copy to another gene. This hypothesis is bolstered by our duplication span analysis demonstrating that the median duplication tract falls short of the average gene length in C. elegans. Furthermore, our structural comparison results (see below) are consistent with the idea that incomplete gene duplications are common.
We compared the ORF nucleotide sequences of both duplicates to determine the extent of sequence homology between them. Our results indicate that mosaicism or structural heterogeneity between duplicate copies is visible very early in their evolutionary history, if not at birth. Approximately half of the C. elegans gene duplicates within both the KS = 0 and 0 < KS
0.10 cohorts have unique coding region sequence to the exclusion of the other copy, in addition to the region of homology. To what degree such partial or chimeric gene duplicates contribute to the creative process in evolution by gene duplication is an important question. Some partial gene duplications could be maintained by a process of duplication, degeneration, and conservation (![]()
![]()
The creative potential of chimeric duplicates is well appreciated in the context of evolution of organismal diversity. For example, the demands imposed by a multicellular existence in metazoans were met by an enormous assemblage of novel animal-specific proteins that arose as a result of partial/chimeric duplications in conjunction with shuffling events (![]()
![]()
0.10 cohort). Taken at face value, this means that partial and chimeric gene duplicates not only are present at "birth," but also have at least as much potential to contribute to long-term evolution as do complete gene duplicates. Of course, it is also possible that gene duplicates do not stay within their structural category. For example, if many partial and chimeric gene duplicates were nonfunctional and had no initial evolutionary potential, their loss could have been countered by the creation of new partial and chimeric genes from complete duplicates. The relationship between a gene duplicate's structure at birth and its future evolutionary potential remains to be determined.
| ACKNOWLEDGMENTS |
|---|
We thank Ulfar Bergthorsson for critical reading of the manuscript and are grateful to two anonymous reviewers for helpful comments on the manuscript. We give a special thanks to Sarah Otto, the communicating editor, for extremely insightful and constructive suggestions. This research has been supported by a National Science Foundation Integrative Graduate Education and Research Traineeship Program in Evolution, Development, and Genomics graduate fellowship to V.K. and a National Institutes of Health grant RO1-GM36827 to M.L.
Manuscript received July 11, 2003; Accepted for publication September 8, 2003.
| LITERATURE CITED |
|---|
ACHAZ, G., E. COISSAC, A. VIARI, and P. NETTER, 2000 Analysis of intrachromosomal duplications in yeast Saccharomyces cerevisiae: a possible model for their origin. Mol. Biol. Evol. 17:1268-1275.
ACHAZ, G., P. NETTER, and E. COISSAC, 2001 Study of intrachromosomal duplications among the eukaryote genomes. Mol. Biol. Evol. 18:2280-2288.
ALLENDORF, F., 1979 Rapid loss of duplicate gene expression by natural selection. Heredity 43:247-258.
ANDERSON, R. P. and J. R. ROTH, 1977 Tandem genetic duplications in phage and bacteria. Annu. Rev. Microbiol. 31:473-505.[Medline]
AVEROF, M., R. DAWES, and D. FERRIER, 1996 Diversification of arthropod Hox genes as a paradigm for the evolution of gene functions. Semin. Cell Dev. Biol. 7:539-551.
BAE, Y.-S., I. KAWASAKI, H. IKEDA, and L. F. LIU, 1988 Illegitimate recombination mediated calf thymus DNA topoisomerase II in vitro.. Proc. Natl. Acad. Sci. USA 85:2076-2080.
BAILEY, G. S., R. T. POULTER, and P. A. STOCKWELL, 1978 Gene duplication in tetraploid fish: model for gene silencing at unlinked duplicated loci. Proc. Natl. Acad. Sci. USA 75:5575-5579.
BEGUN, D. J., 1997 Origin and evolution of a new gene descended from alcohol dehydrogenase in Drosophila. Genetics 145:375-382.[Abstract]
BI, X. and L. F. LIU, 1996 DNA rearrangement mediated by inverted repeats. Proc. Natl. Acad. Sci. USA 93:819-823.
BRIDGES, C. B., 1935 Salivary chromosome maps. J. Hered. 26:60-64.
CHEN, L., A. L. DEVRIES, and C. H. CHENG, 1997 Evolution of antifreeze glycoprotein gene from a trypsinogen gene in Antarctic notothenioid fish. Proc. Natl. Acad. Sci. USA 94:3811-3816.
CLARK, A. G., 1994 Invasion and maintenance of a gene duplication. Proc. Natl. Acad. Sci. USA 91:2950-2954.
COGHLAN, A. and K. H. WOLFE, 2002 Fourfold faster rate of genome rearrangement in nematodes than in Drosophila.. Genome Res. 16:857-867.
COHEN, A., D. HASSIN, S. KARBY, and S. LAVI, 1994 Hairpin structures are the primary amplification products: a novel mechanism for generation of inverted repeats during gene amplification. Mol. Cell. Biol. 14:7782-7791.
COOKE, J., M. A. NOWAK, M. BOERLIJST, and J. MAYNARD-SMITH, 1997 Evolutionary origins and maintenance of redundant gene expression during metazoan development. Trends Genet. 13:360-364.[Medline]
DOOLITTLE, R. F., 1985 The geneaology of some recently evolved vertebrate proteins. Trends Biochem. Sci. 10:233-237.
DROUIN, G., 2002 Characterization of the gene conversions between the multigene family members of the yeast genome. J. Mol. Evol. 55:14-23.[Medline]
DUNHAM, I., N. SHIMIZU, B. A. ROE, S. CHISSOE, and A. R. HUNT et al., 1999 The DNA sequence of human chromosome 22. Nature 402:489-495.[Medline]
DURET, L. and D. MOUCHIROUD, 1999 Expression pattern and, surprisingly, gene length shape codon usage in Caenorhabditis, Drosophila and Arabidopsis.. Proc. Natl. Acad. Sci. USA 96:4482-4487.
EISEN, J. A., J. F. HEIDELBERG, O. WHITE and S. L. SALZBERG, 2000 Evidence for symmetric chromosomal inversions around the replication origin in bacteria. Genome Biol. 1: 0011.00110011.0019.
FISCHER, G., C. NEUVEGLISE, P. DURRENS, C. GAILLARDIN, and B. DUJON, 2001 Evolution of gene order in the genomes of two related yeast species. Genome Res. 11:2009-2019.
FORCE, A., M. LYNCH, F. BRYAN PICKETT, A. AMORES, and Y. YAN et al., 1999 Preservation of duplicate genes by complementary degenerative mutations. Genetics 151:1531-1545.
GALITSKI, T. and J. R. ROTH, 1997 Pathways of homologous recombination between chromosomal direct repeats in Salmonella typhimurium.. Genetics 146:751-767.[Abstract]
GILLEY, J. and M. FRIED, 1999 Extensive gene order differences within regions of conserved synteny between the Fugu and human genomes: implications for chromosomal evolution and the cloning of disease genes. Hum. Mol. Genet. 8:1313-1320.
GORDON, A. J. and J. A. HALLIDAY, 1995 Inversions with deletions and duplications. Genetics 140:411-414.[Abstract]
GU, Z., L. M. STEINMETZ, X. GU, C. SCHARFE, and R. W. DAVIS et al., 2003 Role of duplicate genes in genetic robustness against null mutations. Nature 421:63-66.[Medline]
HALDANE, J. B. S., 1933 The part played by recurrent mutation in evolution. Am. Nat. 67:5-19.
HALL, T. A., 1999 BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/99/NT. Nucleic Acids Symp. Ser. 41:95-98.
HARRISON, P. M., N. ECHOLS, and M. B. GERSTEIN, 2001 Digging for dead genes: an analysis of the characteristics of the pseudogene population in the Caenorhabditis elegans genome. Nucleic Acids Res. 29:818-830.
HIGGINS, D. G., A. J. BLEASBY, and R. FUCHS, 1992 CLUSTAL V: improved software for multiple sequence alignment. Comput. Appl. Biosci. 8:189-191.
HOLT, S. J., W. A. CRESS, and J. VAN STADEN, 2002 Evidence for dynamic alteration in histone gene clusters of Caenorhabditis elegans: A topoisomerase II connection? Genet. Res. 79:11-22.[Medline]
HUGHES, A. L., 1994 The evolution of functionally novel proteins after gene duplication. Proc. R. Soc. Lond. Ser. B Biol. Sci. 256:119-124.[Medline]
HYRIEN, O., M. DEBATISSE, G. BUTTIN, and B. ROBERT DE SAINT VINCENT, 1988 The multicopy appearance of a large inverted duplication and the sequence at the inversion joint suggest a new model for gene duplication. EMBO J. 7:407-417.[Medline]
KIMURA, M., 1983 The Neutral Theory of Molecular Evolution. Cambridge University Press, Cambridge, UK.
KIMURA, M. and J. L. KING, 1979 Fixation of a deleterious allele at one of two "duplicate" loci by mutation pressure and random drift. Proc. Natl. Acad. Sci. USA 76:2858-2861.
KRAKAUER, D. C. and M. A. NOWAK, 1999 Evolutionary preservation of redundant duplicated genes. Semin. Cell Dev. Biol. 10:555-559.[Medline]
LENORMAND, T., T. GUILLEMAUD, D. BOURGUET, and M. RAYMOND, 1998 Appearance and sweep of a gene duplication: adaptive response and potential for new functions in the mosquito Culex pipiens.. Evolution 52:1705-1712.
LERCHER, M. J., T. BLUMENTHAL, and L. D. HURST, 2003 Coexpression of neighboring genes in Caenorhabditis elegans is mostly due to operons and duplicate genes. Genome Res. 13:238-243.
LI, W. H., 1980 Rate of gene silencing at duplicate loci: a theoretical study and interpretation of data from tetraploid fishes. Genetics 95:237-258.
LIN, C. T., W. H. LIN, Y. L. LYU, and J. WHANG-PENG, 2001 Inverted repeats as genetic elements for promoting DNA inverted duplication: implications in gene amplification. Nucleic Acids Res. 29:3529-3538.
LLORENTE, B., A. MALPERTUY, C. NEUVEGLISE, J. DE MONTIGNY, and M. AIGLE et al., 2000 Genomic exploration of the hemiascomycetous yeasts: 18. Comparative analysis of chromosome maps and synteny with Saccharomyces cerevisiae. FEBS Lett. 487:101-112.[Medline]
LONG, M. and C. H. LANGLEY, 1993 Natural selection and the origin of jingwei, a chimeric processed functional gene in Drosophila.. Science 260:91-95.
LOOTENS, S., J. BURNETT, and T. B. FRIEDMAN, 1993 An intraspecific gene duplication polymorphism of the urate oxidase gene of Drosophila virilis: a genetic and molecular analysis. Mol. Biol. Evol. 10:635-646.[Abstract]
LOVETT, S. T., T. J. GLUCKMAN, P. J. SIMON, V. J. SUTERA, and P. T. DRAPKIN, 1994 Recombination between repeats in Escherichia coli by a recA-independent, proximity-sensitive mechanism. Mol. Gen. Genet. 245:294-300.[Medline]
LYCKEGAARD, E. M. and A. G. CLARK, 1989 Ribosomal DNA and Stellate gene copy number variation on the Y chromosome of Drosophila melanogaster.. Proc. Natl. Acad. Sci. USA 86:1944-1948.
LYNCH, M. and J. CONERY, 2000 The evolutionary fate and consequences of duplicate genes. Science 290:1151-1155.
LYNCH, M. and A. FORCE, 2000 The probability of duplicate gene preservation by subfunctionalization. Genetics 154:459-473.
LYNCH, M., M. O'HELY, B. WALSH, and A. FORCE, 2001 The probability of preservation of a newly arisen gene duplicate. Genetics 159:1789-1804.
MARONI, G., J. WISE, J. E. YOUNG, and E. OTTO, 1987 Metallothionein gene duplications and metal tolerance in natural populations of Drosophila melanogaster.. Genetics 117:739-744.
MAYNARD-SMITH, J., 1992 Analyzing the mosaic structure of genes. J. Mol. Evol. 34:126-129.[Medline]
MULLER, H. J., 1935 The origination of chromatin deficiencies as minute deletions subject to insertion elsewhere. Genetica 17:237-252.
MULLER, H. J., 1936 Bar duplication. Science 83:528-530.
NEI, M. and A. K. ROYCHOUDHURY, 1973 Probability of fixation of nonfunctional genes at duplicate loci. Am. Nat. 107:362-372.
NOWAK, M. A., M. C. BOERLIJST, J. COOKE, and J. M. SMITH, 1997 Evolution of genetic redundancy. Nature 388:167-171.[Medline]
NURMINSKY, D. I., M. V. NURMINSKAYA, D. DE AGUIAR, and D. L. HARTL, 1998 Selective sweep of a newly evolved sperm-specific gene in Drosophila. Nature 396:572-575.[Medline]
OHNO, S., 1970 Evolution by Gene Duplication. Springer-Verlag, New York.
OHTA, T., 1987 Simulating evolution by gene duplication. Genetics 115:207-213.
OHTA, T., 1988a Further simulation studies on evolution by gene duplication. Evolution 42:375-386.
OHTA, T., 1988b Time for acquiring a new gene by duplication. Proc. Natl. Acad. Sci. USA 85:3509-3512.
OLSON, M. V., 1991 Genome structure and organization in Saccharomyces cerevisiae, pp. 139 in The Molecular and Cellular Biology of the Yeast Saccharomyces: Genome Dynamics, Protein Synthesis, and Energetics, edited by J. R. BROACH, J. R. PRINGLE and E. W. JONES. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY.
PASSANANTI, C., B. DAVIES, M. FORD, and M. FRIED, 1987 Structure of an inverted duplication formed as a first step in a gene amplification event: implications for a model of gene amplification. EMBO J. 6:1697-1703.[Medline]
PATTHY, L., 1985 Evolution of the proteases of blood coagulation and fibrinolysis by assembly from modules. Cell 41:657-663.[Medline]
PATTHY, L., 1994 Introns and exons. Curr. Opin. Struct. Biol. 4:383-392.
PATTHY, L., 1999 Genome evolution and the evolution of exon-shuffling: a review. Gene 238:103-114.[Medline]
PETES, T. D. and C. W. HILL, 1988 Recombination between repeated genes in microorganisms. Annu. Rev. Genet. 22:147-168.[Medline]
RAPPLEYE, C. A. and J. R. ROTH, 1997 Transposition without transposase: a spontaneous mutation in bacteria. J. Bacteriol. 179:2047-2052.
RUBIN, G. M., M. D. YANDELL, J. R. WORTMAN, G. L. GABOR MIKLOS, and C. R. NELSON et al., 2000 Comparative genomics of the eukaryotes. Science 287:2204-2215.
SAWYER, S., 1989 Statistical tests for detecting gene conversion. Mol. Biol. Evol. 6:526-538.[Abstract]
SEMPLE, C. and K. H. WOLFE, 1999 Gene duplication and gene conversion in the Caenorhabditis elegans genome. J. Mol. Evol. 48:555-564.[Medline]
SEOIGHE, C., N. FEDERSPIEL, T. JONES, N. HANSEN, and V. BIVOLAROVIC et al., 2000 Prevalence of small inversions in yeast gene order evolution. Proc. Natl. Acad. Sci. USA 97:14433-14437.
SOKAL, R. R., and F. J. ROHLF, 1997 Biometry: The Principles and Practice of Statistics in Biological Research. W. H. Freeman, San Francisco.
SPOFFORD, J. B., 1969 Heterosis and the evolution of duplications. Am. Nat. 103:407-432.
STEIN, L., P. STERNBERG, R. DURBIN, J. THIERRY-MIEG, and J. SPIETH, 2001 WormBase: network access to the genome and biology of Caenorhabditis elegans.. Nucleic Acids Res. 29:82-86.
TAKAHATA, N. and T. MARUYAMA, 1979 Polymorphism and loss of duplicate gene expression: a theoretical study with application of tetraploid fish. Proc. Natl. Acad. Sci. USA 76:4521-4525.
THEODORE, L., A. S. HO, and G. MARONI, 1991 Recent evolutionary history of the metallothionein gene Mtn in Drosophila.. Genet. Res. 58:203-210.[Medline]
THOMSON, T. M., J. J. LOZANO, N. LOUKILI, R. CARRIO, and F. SERRAS et al., 2000 Fusion of the human gene for the polyubiquitination coeffector UEV1 with Kua, a newly identified gene. Genome Res. 10:1743-1756.
VELLAI, T. and G. VIDA, 1999 The origin of eukaryotes: the difference between prokaryotic and eukaryotic cells. Proc. R. Soc. Lond. Ser. B Biol. Sci. 266:1571-1577.[Medline]
WAGNER, A., 1999 Redundant gene functions and natural selection. J. Evol. Biol. 12:1-16.
WALSH, J. B., 1995 How often do duplicated genes evolve new functions? Genetics 139:421-428.[Abstract]
WATTERSON, G. A., 1983 On the time for gene silencing at duplicate loci. Genetics 105:745-766.
This article has been cited by other articles:
![]() |
J. C. Opazo, A. M. Sloan, K. L. Campbell, and J. F. Storz Origin and Ascendancy of a Chimeric Fusion Gene: The {beta}/{delta}-Globin Gene of Paenungulate Mammals Mol. Biol. Evol., July 1, 2009; 26(7): 1469 - 1478. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. L. McGrath, C. Casola, and M. W. Hahn Minimal Effect of Ectopic Gene Conversion Among Recent Duplicates in Four Mammalian Genomes Genetics, June 1, 2009; 182(2): 615 - 622. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. D. Cutter, A. Dey, and R. L. Murray Evolution of the Caenorhabditis elegans Genome Mol. Biol. Evol., June 1, 2009; 26(6): 1199 - 1234. [Abstract] [Full Text] [PDF] |
||||
![]() |
Q. Zhou, G. Zhang, Y. Zhang, S. Xu, R. Zhao, Z. Zhan, X. Li, Y. Ding, S. Yang, and W. Wang On the origin of new genes in Drosophila Genome Res., September 1, 2008; 18(9): 1446 - 1455. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Li, W. Ma, L. Gao, Y. Zhang, A. Wang, K. Ji, K. Wang, R. Appels, and Y. Yan A Novel Chimeric Low-Molecular-Weight Glutenin Subunit Gene From the Wild Relatives of Wheat Aegilops kotschyi and Ae. juvenalis: Evolution at the Glu-3 Loci Genetics, September 1, 2008; 180(1): 93 - 101. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Katju, E. M. LaBeau, K. J. Lipinski, and U. Bergthorsson Sex Change by Gene Conversion in a Caenorhabditis elegans fog-2 Mutant Genetics, September 1, 2008; 180(1): 669 - 672. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. D. Cutter Divergence Times in Caenorhabditis and Drosophila Inferred from Direct Estimates of the Neutral Mutation Rate Mol. Biol. Evol., April 1, 2008; 25(4): 778 - 786. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Qian and J. Zhang Evolutionary dynamics of nematode operons: Easy come, slow go Genome Res., March 1, 2008; 18(3): 412 - 421. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Irimia, J. L. Rukov, D. Penny, J. Garcia-Fernandez, J. Vinther, and S. W. Roy Widespread Evolutionary Conservation of Alternatively Spliced Exons in Caenorhabditis Mol. Biol. Evol., February 1, 2008; 25(2): 375 - 382. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Stanvitch and L. L. Moore cin-4, a Gene With Homology to Topoisomerase II, Is Required for Centromere Resolution by Cohesin Removal From Sister Kinetochores During Mitosis Genetics, January 1, 2008; 178(1): 83 - 97. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. W. Ganko, B. C. Meyers, and T. J. Vision Divergence in Expression between Duplicated Genes in Arabidopsis Mol. Biol. Evol., October 1, 2007; 24(10): 2298 - 2309. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. W. Roy and D. Penny On the Incidence of Intron Loss and Gain in Paralogous Gene Families Mol. Biol. Evol., August 1, 2007; 24(8): 1579 - 1581. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Lynch Colloquium Papers: The frailty of adaptive hypotheses for the origins of organismal complexity PNAS, May 15, 2007; 104(suppl_1): 8597 - 8604. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. L. Rukov, M. Irimia, S. Mork, V. K. Lund, J. Vinther, and P. Arctander High Qualitative and Quantitative Conservation of Alternative Splicing in Caenorhabditis elegans and Caenorhabditis briggsae Mol. Biol. Evol., April 1, 2007; 24(4): 909 - 917. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. P. Cusack and K. H. Wolfe Not Born Equal: Increased Rate Asymmetry in Relocated and Retrotransposed Rodent Gene Duplicates Mol. Biol. Evol., March 1, 2007; 24(3): 679 - 686. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. D. Akhunov, A. R. Akhunova, and J. Dvorak Mechanisms and Rates of Birth and Death of Dispersed Duplicated Genes during the Evolution of a Multigene Family in Diploid and Tetraploid Wheats Mol. Biol. Evol., February 1, 2007; 24(2): 539 - 550. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. H. Thomas Adaptive evolution in two large families of ubiquitin-ligase adapters in nematodes and plants Genome Res., August 1, 2006; 16(8): 1017 - 1030. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Katju and M. Lynch On the Formation of Novel Genes by Duplication in the Caenorhabditis elegans Genome Mol. Biol. Evol., May 1, 2006; 23(5): 1056 - 1067. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. H. Thomas Concerted Evolution of Two Novel Protein Families in Caenorhabditis Species Genetics, April 1, 2006; 172(4): 2269 - 2281. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Lynch The Origins of Eukaryotic Gene Structure Mol. Biol. Evol., February 1, 2006; 23(2): 450 - 468. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. H. Thomas Analysis of Homologous Gene Clusters in Caenorhabditis elegans Reveals Striking Regional Cluster Domains Genetics, January 1, 2006; 172(1): 127 - 143. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Bhushan, A. Stahl, S. Nilsson, B. Lefebvre, M. Seki, C. Roth, D. McWilliam, S. J. Wright, D. A. Liberles, K. Shinozaki, et al. Catalysis, Subcellular Localization, Expression and Evolution of the Targeting Peptides Degrading Protease, AtPreP2 Plant Cell Physiol., June 1, 2005; 46(6): 985 - 996. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. D. Jones, A. W. Custer, and D. J. Begun Origin and Evolution of a Chimeric Fusion Gene in Drosophila subobscura, D. madeirensis and D. guanche Genetics, May 1, 2005; 170(1): 207 - 219. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Force, W. A. Cresko, F. B. Pickett, S. R. Proulx, C. Amemiya, and M. Lynch The Origin of Subfunctions and Modular Gene Regulation Genetics, May 1, 2005; 170(1): 433 - 446. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. B. Pereira-Leal and S. A. Teichmann Novel specificities emerge by stepwise duplication of functional modules Genome Res., April 1, 2005; 15(4): 552 - 559. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. H. Shah, R. C. MacFarlane, D. Bhattacharya, J. C. Matese, J. Demeter, S. E. Stroup, and U. Singh Comparative Genomic Hybridizations of Entamoeba Strains Reveal Unique Genetic Fingerprints That Correlate with Virulence Eukaryot. Cell, March 1, 2005; 4(3): 504 - 515. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Huminiecki and K. H. Wolfe Divergence of Spatial Gene Expression Profiles Following Species-Specific Gene Duplications in Human and Mouse Genome Res., October 1, 2004; 14(10a): 1870 - 1879. [Abstract] [Full Text] [PDF] |
||||
![]() |
A.-M. Mallon, L. Wilming, J. Weekes, J. G.R. Gilbert, J. Ashurst, S. Peyrefitte, L. Matthews, M. Cadman, R. McKeone, C. A. Sellick, et al. Organization and Evolution of a Gene-Rich Region of the Mouse Genome: A 12.7-Mb Region Deleted in the Del(13)Svea36H Mouse Genome Res., October 1, 2004; 14(10a): 1888 - 1901. [Abstract] [Full Text] [PDF] |
||||
- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Email this article to a friend
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Katju, V.
- Articles by Lynch, M.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Katju, V.
- Articles by Lynch, M.











