- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Zhang, Z.
- Articles by Kishino, H.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Zhang, Z.
- Articles by Kishino, H.
Genomic Background Predicts the Fate of Duplicated Genes: Evidence From the Yeast Genome
Ze Zhanga,b and Hirohisa Kishinoaa Laboratory of Biometrics and Bioinformatics, Graduate School of Agriculture and Life Sciences, University of Tokyo, Tokyo 113-8657, Japan
b Institute for Bioinformatics Research and Development, Japan Science and Technology Agency, Tokyo 102-0081, Japan
Corresponding author: Ze Zhang, University of Birmingham, Birmingham B15 2TT, United Kingdom.
Communicating editor: S. YOKOYAMA
| ABSTRACT |
|---|
Gene duplication with subsequent divergence plays a central role in the acquisition of genes with novel function and complexity during the course of evolution. With reduced functional constraints or through positive selection, these duplicated genes may experience accelerated evolution. Under the model of subfunctionalization, loss of subfunctions leads to complementary acceleration at sites with two copies, and the difference in average rate between the sequences may not be obvious. On the other hand, the classical model of neofunctionalization predicts that the evolutionary rate in one of the two duplicates is accelerated. However, the classical model does not tell which of the duplicates experiences the acceleration in evolutionary rate. Here, we present evidence from the Saccharomyces cerevisiae genome that a duplicate located in a genomic region with a low-recombination rate is likely to evolve faster than a duplicate in an area of high recombination. This observation is consistent with population genetics theory that predicts that purifying selection is less effective in genomic regions of low recombination (Hill-Robertson effect). Together with previous studies, our results suggest the genomic background (e.g., local recombination rate) as a potential force to drive the divergence between nontandemly duplicated genes. This implies the importance of structure and complexity of genomes in the diversification of organisms via gene duplications.
GENE duplication has long been thought to be one of the principal engines powering the evolution of protein function, allowing for increases in genomic complexity (![]()
![]()
![]()
![]()
![]()
![]()
![]()
Increasing evidence from genomic data indicates that asymmetric sequence divergence of duplicate genes is quite common. ![]()
5% of the 101 duplicated gene pairs analyzed, whereas ![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
In this article, we compare evolutionary rates for the duplicated genes with different local recombination rates by the analysis of the Saccharomyces cerevisiae genome. It is well known that the arrayed structure of duplicated genes in the genome also affects their evolutionary pattern. Tandemly duplicated genes usually evolve in a concerted manner due mainly to unequal crossover and gene conversion mechanisms (![]()
![]()
![]()
The data set used in this article consists of the S. cerevisiae gene duplication database (http://wolfe.gen.tcd.ie/). The Smith-Waterman algorithm was used to search for paralogs. As most duplicate genes occurred a long time ago [
100 million years ago (MYA)], resulting from whole-genome duplication in S. cerevisiae (![]()
![]()
![]()
![]()
We then searched for the corresponding orthologous sequence for each of the selected paralogs from the Candida albicans genome data as a reference for the relative rate test between nontandem paralogs after gene duplication (Fig 1; ![]()
140330 MYA (![]()
![]()
![]()
|
All the protein alignments were produced using CLUSTAL X (![]()
![]()
![]()
Because most duplicate gene pairs are very old, synonymous substitutions between the two copies are almost saturated. Therefore, we focused on the difference in the evolutionary rate of amino acid sequence between the two copies. Table 1 compares KDH, the branch length of a copy at a high-recombination-rate region, with KDL, that of another copy at a low-recombination-rate region (Fig 1). The average of the difference was 0.129 and the t value was 2.325 (d.f. = 29), suggesting a significantly lower rate of the copies at higher-recombination-rate regions (P = 0.014). We found that 14 of the 30 (46.6%) duplicate gene pairs showed asymmetry in the evolutionary rate of amino acid sequence (Table 1). Among the 14 duplicated gene pairs showing asymmetric evolution, 11 experienced an acceleration in the evolutionary rate of the copy in the region of low recombination (Table 1). This proportion is significantly higher than one-half (
2 = 4.28, P < 0.05). This evidence that the paralog in a region of low recombination rate is more likely to evolve faster than the paralog in an area of high-local-recombination rate is consistent with population genetics theory, which predicts that purifying selection is less effective in genomic regions of low recombination (![]()
![]()
![]()
![]()
|
We did not detect a significant difference in the evolutionary rate between the two copies for 16 of the 30 (53.4%) duplicate gene pairs. The averages of KDH for the groups for which two copies show evolutionary asymmetry and symmetry after duplication are 0.522 and 0.622, respectively (Fig 1 and Table 1). Although there is no significant difference in average evolutionary rate between the two groups (P = 0.18 for t-test), the group showing evolutionary symmetry has a higher average of KDH than the group with the pattern of evolutionary asymmetry. The GC3 contents of copies with higher recombination rates were examined for the two groups. They had very similar average GC3 contents (0.461 and 0.465, respectively). Therefore, different evolutionary patterns of two groups do not appear to be due to base composition. Probably this is because the duplication events and divergence between S. cerevisiae and C. albicans are too old to guarantee sufficient power of the relative rate test. Alternatively, some of them may have been in the process of subfunctionalization, during which the sites of the two copies experienced complementary acceleration. As a result, the different rates were cancelled out over the sites.
![]()
55 duplicate blocks in the S. cerevisiae genome. To contrast the above result with a negative control, we randomly selected 40 duplicate gene pairs of which two copies of each pair have similar recombination rates and come from different paralogous blocks in the S. cerevisiae gene duplication database (http://wolfe.gen.tcd.ie/). Of 40 duplicate pairs, 8 were found to show asymmetric evolution (results not shown). This proportion (20%) is much lower than that (46.6%) of duplicate pairs, which have very different recombination rates. This suggests that recombination rate may be a potential force to drive the divergence of duplicate genes.
During the past decade, much attention has been focused on the effect of recombination on evolutionary rates and patterns of genes (![]()
![]()
![]()
![]()
![]()
![]()
Recently, ![]()
![]()
![]()
In this study, we found a significant effect of recombination on divergence in protein evolution between the two duplicates in different recombination rate regions. Differences in purifying selection intensity triggered by differences in local recombination rate can explain these observations. When genome data from many closely related species become available, it will be interesting to examine the site-specific asymmetry of duplicate genes (![]()
![]()
![]()
![]()
| ACKNOWLEDGMENTS |
|---|
We thank J. A. Birdsell for kindly sending us sequence data of S. cerevisiae duplicated genes and for helpful discussions. This study has been supported by the Institute for Bioinformatics Research and Development, Japan Science and Technology Agency.
Manuscript received September 24, 2003; Accepted for publication December 31, 2003.
| LITERATURE CITED |
|---|
BETRAN, E., K. THORNTON, and M. LONG, 2002 Retroposed new genes out of the X in Drosophila.. Genome Res. 12:1854-1859.
BIRDSELL, J. A., 2002 Integrating genomics, bioinformatics, and classical genetics to study the effects of recombination on genome evolution. Mol. Biol. Evol. 19:1181-1197.
CARVALHO, A. B. and A. G. CLARK, 1999 Intron size and natural selection. Nature 401:344.[CrossRef][Medline]
COMERON, J. M., M. KREITMAN, and M. AGUADE, 1999 Natural selection on synonymous site is correlated with gene length and recombination in Drosophila. Genetics 151:239-249.
CONANT, G. C. and A. WAGNER, 2003 Asymmetric sequences divergence of duplicate genes. Genome Res. 13:2052-2058.
DROUIN, G., 2002 Characterization of the gene conversions between the multigene family members of the yeast genome. J. Mol. Evol. 55:14-23.[CrossRef][Medline]
FORCE, A., M. LYNCH, F. B. PICKETT, A. AMORES, and Y. YAN et al., 1999 Preservation of duplicate genes by complementary, degenerative mutations. Genetics 151:1531-1545.
GERTON, J. L., J. DERISI, R. SHROFF, M. LICHTEN, and P. O. BROWN et al., 2000 Global mapping of meiotic recombination hotspots and coldspots in the yeast Saccharomyces cerevisiae. Proc. Natl. Acad. Sci. USA 97:11383-11390.
GRAUR, D., and W.-H. LI, 1999 Fundamentals of Molecular Evolution, Ed. 2. Sinauer Associates, Sunderland, MA.
GU, X., 1999 Statistical methods for testing functional divergence after gene duplication. Mol. Biol. Evol. 16:1664-1674.[Abstract]
HALL, T. A., 1999 BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symp. Ser. 41:46-49.
HILL, W. G. and A. ROBERTSON, 1966 The effect of linkage on limits to artificial selection. Genet. Res. 8:269-294.[Medline]
HUGHES, A. L., 1999 Adaptive Evolution of Genes and Genomes. Oxford University Press, Oxford.
KNUDSEN, B. and M. M. MIYAMOTO, 2001 A likelihood ratio test for evolutionary rate shifts and functional divergence among proteins. Proc. Natl. Acad. Sci. USA 98:14512-14517.
KNUDSEN, B., M. M. MIYAMOTO, P. J. LAIPIS, and D. N. SILVERMAN, 2003 Using evolutionary rates to investigate protein functional divergence and conservation: a case study of the carbonic anhydrases. Genetics 164:1261-1269.
KONDRASHOV, F. A., I. B. ROGOZIN, Y. I. WOLF and E. V. KOONIN, 2002 Selection in the evolution of gene duplications. Genome Biol. 3: RESEARCH0008.10008.9.
LI, W.-H., 1997 Molecular Evolution. Sinauer Associates, Sunderland, MA.
MUNTE, A., M. AGUADE, and C. SEGARRA, 2001 Changes in the recombinational environment affect divergence in the yellow gene of Drosophila. Mol. Biol. Evol. 18:1045-1056.
NOWAK, M. A., M. C. BOERLIJST, J. COOKE, and J. M. SMITH, 1997 Evolution of genetic redundancy. Nature 388:167-170.[CrossRef][Medline]
OHNO, S., 1970 Evolution by Gene Duplication. Springer-Verlag, Heidelberg, Germany.
PEARSON, W. R., 1991 Searching protein sequence libraries: comparison of the sensitivity and selectivity of the Smith-Waterman and FASTA algorithms. Genomics 11:635-650.[CrossRef][Medline]
REMM, M., C. E. V. STORM, and E. L. SONNHAMMER, 2001 Automatic clustering of orthologs and in-paralogs from pairwise species comparisons. J. Mol. Biol. 314:1041-1052.[CrossRef][Medline]
ROBINSON-RECHAVI, M. and D. HUCHON, 2000 RRTree: relative-rate tests between groups of sequences on a phylogenetic tree. Bioinformatics 16:296-297.
ROONEY, A. P., H. PIONTKIVSKA, and M. NEI, 2002 Molecular evolution of the histone 3 multigene family. Mol. Biol. Evol. 19:68-75.
SEOIGHE, C., N. FEDERSPIEL, T. JONES, N. HANSEN, and V. BIVOLAROVIC et al., 2000 Prevalence of small inversions in yeast gene order evolution. Proc. Natl. Acad. Sci. USA 97:14433-14437.
STEPHAN, W. and C. H. LANGLEY, 1989 Molecular genetic variation in the centromeric region of the X chromosome in three Drosophila ananassae populations. I. Contrasts between the vermilion and forked loci. Genetics 121:89-99.
TAKANO-SHIMIZU, T., 1999 Local recombination and mutation effects on molecular evolution. Genetics 153:1285-1296.
TAKANO-SHIMIZU, T., 2001 Local changes in GC/AT substitution biases and in crossover frequencies on Drosophila chromosomes. Mol. Biol. Evol. 18:606-619.
THOMPSON, J. D., D. G. HIGGINS, and T. J. GIBSON, 1994 CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22:4673-4680.
THORNTON, K. and M. LONG, 2002 Rapid divergence of gene duplicates on the Drosophila melanogaster X chromosome. Mol. Biol. Evol. 19:918-925.
VAN DE PEER, Y., J. S. TAYLOR, I. BRAASCH, and A. MEYER, 2001 The ghost of selection past: rates of evolution and functional divergence of anciently duplicated genes. J. Mol. Evol. 53:436-446.[CrossRef][Medline]
WANG, Y. and X. GU, 2001 Predicting functional divergence of caspase gene family. Genetics 158:1311-1320.
WOLFE, K. H. and D. S. SHIELDS, 1997 Molecular evidence for an ancient duplication of the entire yeast genome. Nature 387:708-713.[CrossRef][Medline]
WU, C.-I and W.-H. LI, 1985 Evidence for higher rates of nucleotide substitution in rodents than in man. Proc. Natl. Acad. Sci. USA 82:1741-1745.
ZHANG, J., 2003 Evolution by gene duplication: an update. Trends Ecol. Evol. 18:292-298.[CrossRef]
ZHANG, P., Z. GU, and W.-H. LI, 2003 Different evolutionary patterns between young duplicate genes in the human genome. Genome Biol. 4:R56.[CrossRef][Medline]
ZHANG, Z. and H. KISHINO, 2004 Genomic background drives the divergence of duplicated amylase genes at synonymous sites in Drosophila. Mol. Biol. Evol. 21:222-227.
This article has been cited by other articles:
![]() |
H. M. Fischer, C. W. Wheat, D. G. Heckel, and H. Vogel Evolutionary Origins of a Novel Host Plant Detoxification Gene in Butterflies Mol. Biol. Evol., May 1, 2008; 25(5): 809 - 820. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. J. Leach, Z. Zhang, C. Lu, M. J. Kearsey, and Z. Luo The Role of Cis-Regulatory Motifs and Genetical Control of Expression in the Divergence of Yeast Duplicate Genes Mol. Biol. Evol., November 1, 2007; 24(11): 2556 - 2565. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. He and J. Zhang Higher Duplicability of Less Important Genes in Yeast Genomes Mol. Biol. Evol., January 1, 2006; 23(1): 144 - 151. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Kuepfer, U. Sauer, and L. M. Blank Metabolic functions of duplicate genes in Saccharomyces cerevisiae Genome Res., October 1, 2005; 15(10): 1421 - 1430. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. L. Prigoda, A. Nassuth, and B. K. Mable Phenotypic and Genotypic Expression of Self-incompatibility Haplotypes in Arabidopsis lyrata Suggests Unique Origin of Alleles in Different Dominance Classes Mol. Biol. Evol., July 1, 2005; 22(7): 1609 - 1620. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. H. Shah, R. C. MacFarlane, D. Bhattacharya, J. C. Matese, J. Demeter, S. E. Stroup, and U. Singh Comparative Genomic Hybridizations of Entamoeba Strains Reveal Unique Genetic Fingerprints That Correlate with Virulence Eukaryot. Cell, March 1, 2005; 4(3): 504 - 515. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Thornton and M. Long Excess of Amino Acid Substitutions Relative to Polymorphism Between X-Linked Duplications in Drosophila melanogaster Mol. Biol. Evol., February 1, 2005; 22(2): 273 - 284. [Abstract] [Full Text] [PDF] |
||||
- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Zhang, Z.
- Articles by Kishino, H.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Zhang, Z.
- Articles by Kishino, H.



