| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Genetics, Vol. 173, 1787-1791, July 2006, Copyright © 2006
doi:10.1534/genetics.106.057919
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Molecular Biology Institute, Center for Genomics and Proteomics, Department of Chemistry and Biochemistry, University of California, Los Angeles, California 90095-1570
1 Corresponding author: Molecular Biology Institute, Center for Genomics and Proteomics, Department of Chemistry and Biochemistry, University of California, Los Angeles, CA 90095-1570.
E-mail: leec{at}mbi.ucla.edu
| ABSTRACT |
|---|
|
|
|---|
Recent comparative genomic analyses indicate a reduced nucleotide substitution rate at flanks of alternatively spliced exons. Sorek and Ast analyzed the 100-nt flanks of exons that were alternatively spliced in human and mouse transcripts. They observed a nearly 20% increase of percentage of sequence identity at 30-nt exon flanks of alternatively spliced exons, compared to flanks of constitutive exons (SOREK and AST 2003). The average lengths of conserved sequences were 103 nt for upstream flanks of alternative exons and 94 nt for downstream flanks of alternative exons (SOREK and AST 2003). Other studies reported similar results (SUGNET et al. 2004, 2006; XING and LEE 2005). In fact, the increased sequence identity at flanks of alternative exons has been successfully used to predict alternative splicing from raw genomic sequences (PHILIPPS et al. 2004; SOREK et al. 2004b; YEO et al. 2005). It has been suggested that such a higher sequence conservation reflects stronger negative selection associated with regulation of alternative splicing (SOREK and AST 2003; ITOH et al. 2004).
However, there are other factors that affect the nucleotide substitution rate in a genomic region. One important factor is the mutation rate. It is well known that the mutation rate varies significantly across the entire genome, leading to mutational hot spots and cold spots (CHUANG and LI 2004). Even within the same gene, there can be remarkable site-by-site variations of the mutation rate. For instance, CpG dinucleotides are highly mutable due to methylation effects (LI et al. 2002). Keightley et al. analyzed the substitution rate in rodent genomes. They found that the substitution rate was significantly higher in CpG-susceptible sites (i.e., sites preceded by C or followed by G, which thus were likely to be parts of ancestral CpG dinucleotides) compared to non-CpG-susceptible sites (KEIGHTLEY and GAFFNEY 2003; GAFFNEY and KEIGHTLEY 2005). A similar observation was made through genomewide humanchimpanzee comparisons (EBERSBERGER et al. 2002; HELLMANN et al. 2003) and more recently at distant intronic sites and fourfold redundant synonymous sites using humanchimpanzee genome alignments (KONDRASHOV et al. 2006). These data indicate the heterogeneity of mutation rate even within a small region of a gene, depending on CpG contexts.
In this article, we performed an in-depth analysis on the evolutionary divergence of exon flanks. We are interested in the following questions. First, how does the local CpG context affect substitution rates at flanks of constitutive and alternative exons? Second, what is the major cause for the increased sequence conservation at flanks of alternative exons? Is it reduced mutability or increased selection or both? Since multiple substitutions at a single site can obscure the impact of ancestral CpG dinucleotides, it is important to compare closely related species, where multiple substitutions are unlikely to occur (EBERSBERGER et al. 2002). The human and chimpanzee exon flanks are the most suitable for our comparison, because these two species diverged only
5 million years ago (HEDGES 2002). The goal of our analysis is to dissect the contribution of mutability and selection to the evolutionary rate of exon flanks. We divided sites within exon flanks into distinct types according to their local CpG contexts. Windows ranging from 50 to 350 nt were used in previous analyses of exon flanks (SOREK and AST 2003; KAUFMANN et al. 2004; SUGNET et al. 2004, 2006; XING and LEE 2005; YEO et al. 2005). Throughout this article, following the consensus choice in the current literature (SOREK and AST 2003; SUGNET et al. 2004, 2006), we defined our exon flanks as the 100-nt intronic sequences immediately upstream (upstream exon flanks) or downstream (downstream exon flanks) of the exonintron junctions. A different definition (e.g., 50-nt exon flanks) produced very similar results (data not shown).
| MATERIALS AND METHODS |
|---|
|
|
|---|
For each ancestral alternative or constitutive exon, we calculated the evolutionary divergence of its flanks between human and chimpanzee. We extracted the alignment of its flanks between human and chimpanzee using the UCSC humanchimpanzee pairwise genome alignments (http://hgdownload.cse.ucsc.edu/goldenPath/hg17/vsPanTro1/axtNet/) (KENT et al. 2003; SCHWARTZ et al. 2003). We divided the nucleotide sites at exon flanks into four distinct types on the basis of the local CpG context: sites preceded by a C and followed by a G (postCpreG), sites followed by a G but not preceded by a C (preG), sites preceded by a C but not followed by a G (postC), and sites neither preceded by a C nor followed by a G (nonCpG) (KEIGHTLEY and GAFFNEY 2003; KONDRASHOV et al. 2006). The first three types of sites were CpG susceptible according to the definition of Keightley and Gaffney, i.e., they were likely to be parts of ancestral CpG dinucleotides (KEIGHTLEY and GAFFNEY 2003). We required each site to be flanked by two conserved nucleotides in the alignment and excluded sites that corresponded to insertions or deletions. For each nucleotide site, we checked the alignment to see whether it was conserved or substituted between human and chimpanzee. For each type of CpG sites, we counted the number of substituted sites and the total numbers of sites. The alignment data can be accessed at http://www.bioinformatics.ucla.edu/yxing/cpg/.
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
We observed a systematic difference of nucleotide substitution rates among different types of sites. In upstream flanks of the constitutive exons, the nucleotide substitution rate averaged over all four types of sites was 0.0113. postCpreG sites had the highest rate of nucleotide substitution at 0.0172, followed by postC (0.0134) and preG (0.0124) sites. nonCpG sites had the lowest rate of nucleotide substitution at 0.0097, a nearly twofold reduction compared to postCpreG sites (see Table 1). The nucleotide composition at different types of sites also showed a systematic difference: postC sites had a very low frequency of G; preG sites had a very low frequency of C; while postCpreG sites were strongly depleted of both C and G. We observed a similar trend at upstream flanks of ancestral alternatively spliced exons. The overall nucleotide substitution rate was 0.0075. The rate was the lowest (0.0063) for non-CpG-susceptible sites (nonCpG sites, compared to CpG-susceptible sites including postCpreG, postC, and preG sites) (see Table 1). Analyses of downstream exon flanks yielded very similar results (see Table 2). Our results were consistent with humanchimpanzee analyses on distant intronic sites and fourfold redundant synonymous sites in exons (KONDRASHOV et al. 2006). These data indicate increased mutability at CpG-susceptible sites compared to non-CpG-susceptible sites in exon flanks, regardless of constitutive splicing or alternative splicing.
|
|
|
Our study extended previous comparative analyses of intronic sites flanking alternatively spliced exons (SOREK and AST 2003; SUGNET et al. 2004; XING and LEE 2005). Instead of analyzing the evolutionary divergence of the entire exon flanks, we divided all nucleotide sites at exon flanks into distinct types, according to their local CpG contexts. Such treatment allowed us to separate the contribution of mutability and selection, two most important factors that affect the divergence rate of exon flanks. Our study indicates significant site-by-site variations of mutability in the exon flanks, depending on the local CpG context. Such a phenomenon was observed in flanks of both alternative and constitutive exons. However, the site-by-site variation of mutability is not the major contributor to the increased conservation at flanks of alternative exons. The higher sequence conservation is not due to a reduced fraction of hypermutable sites (i.e., CpG-susceptible sites). In fact, at all types of CpG sites (with different propensities for mutation) inside flanks of alternative exons, we observed a significant reduction of the nucleotide substitution rate (see Figure 1). This consistent pattern, independent of the variation in mutability due to different CpG contexts, strongly argues for an increased selection at flanks of alternative exons. Such a strong purifying selection might reflect RNA sequence motif (ITOH et al. 2004) or secondary structure (MEYER and MIKLOS 2005) requirements for proper controls of alternative splicing. Our study provides concrete evidence for widespread selection pressure in mammalian genomes that is associated with alternative splicing (SOREK and AST 2003; BAEK and GREEN 2005; XING and LEE 2005). Understanding selection pressure at exon flanks will shed light on the regulation of alternative splicing and how it evolves.
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
| LITERATURE CITED |
|---|
|
|
|---|
ALTSCHUL, S. F., T. L. MADDEN, A. A. SCHÄFFER, J. ZHANG, Z. ZHANG et al., 1997 Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25: 33893402.
BAEK, D., and P. GREEN, 2005 Sequence conservation, relative isoform frequencies, and nonsense-mediated decay in evolutionarily conserved alternative splicing. Proc. Natl. Acad. Sci. USA 102: 1281312818.
CHUANG, J. H., and H. LI, 2004 Functional bias and spatial organization of genes in mutational hot and cold regions in the human genome. PLoS Biol. 2: E29.[CrossRef][Medline]
COSTA, D. B., L. LOZOVATSKY, P. G. GALLAGHER and B. G. FORGET, 2005 A novel splicing mutation of the {alpha}-spectrin gene in the original hereditary pyropoikilocytosis kindred. Blood 106: 43674369.
COUTINHO, G., J. XIE, L. DU, A. BRUSCO, A. R. KRAINER et al., 2005 Functional significance of a deep intronic mutation in the ATM gene and evidence for an alternative exon 28a. Hum. Mutat. 25: 118124.[CrossRef][Medline]
EBERSBERGER, I., D. METZLER, C. SCHWARZ and S. PAABO, 2002 Genomewide comparison of DNA sequences between humans and chimpanzees. Am. J. Hum. Genet. 70: 14901497.[CrossRef][Medline]
ENG, L., G. COUTINHO, S. NAHAS, G. YEO, R. TANOUYE et al., 2004 Nonclassical splicing mutations in the coding and noncoding regions of the ATM gene: maximum entropy estimates of splice junction strengths. Hum. Mutat. 23: 6776.[CrossRef][Medline]
FAIRBROTHER, W. G., and L. A. CHASIN, 2000 Human genomic sequences that inhibit splicing. Mol. Cell. Biol. 20: 68166825.
GAFFNEY, D. J., and P. D. KEIGHTLEY, 2005 The scale of mutational variation in the murid genome. Genome Res. 15: 10861094.
GARCIA-BLANCO, M. A., A. P. BARANIAK and E. L. LASDA, 2004 Alternative splicing in disease and therapy. Nat. Biotechnol. 22: 535546.[CrossRef][Medline]
HEDGES, S. B., 2002 The origin and evolution of model organisms. Nat. Rev. Genet. 3: 838849.[CrossRef][Medline]
HELLMANN, I., S. ZOLLNER, W. ENARD, I. EBERSBERGER, B. NICKEL et al., 2003 Selection on human genes as revealed by comparisons to chimpanzee cDNA. Genome Res. 13: 831837.
ITOH, H., T. WASHIO and M. TOMITA, 2004 Computational comparative analyses of alternative splicing regulation using full-length cDNA of various eukaryotes. RNA 10: 10051018.
KAUFMANN, D., O. KENNER, P. NURNBERG, W. VOGEL and B. BARTELT, 2004 In NF1, CFTR, PER3, CARS and SYT7, alternatively included exons show higher conservation of surrounding intron sequences than constitutive exons. Eur. J. Hum. Genet. 12: 139149.[CrossRef][Medline]
KEIGHTLEY, P. D., and D. J. GAFFNEY, 2003 Functional constraints and frequency of deleterious mutations in noncoding DNA of rodents. Proc. Natl. Acad. Sci. USA 100: 1340213406.
KENT, W. J., R. BAERTSCH, A. HINRICHS, W. MILLER and D. HAUSSLER, 2003 Evolution's cauldron: duplication, deletion, and rearrangement in the mouse and human genomes. Proc. Natl. Acad. Sci. USA 100: 1148411489.
KOL, G., G. LEV-MAOR and G. AST, 2005 Human-mouse comparative analysis reveals that branch-site plasticity contributes to splicing regulation. Hum. Mol. Genet. 14: 15591568.
KONDRASHOV, F. A., A. Y. OGURTSOV and A. S. KONDRASHOV, 2006 Selection in favor of nucleotides G and C diversifies evolution rates and levels of polymorphism at mammalian synonymous sites. J. Theor. Biol. 240: 616626.
LI, W. H., S. YI and K. MAKOVA, 2002 Male-driven evolution. Curr. Opin. Genet. Dev. 12: 650656.[CrossRef][Medline]
MEYER, I. M., and I. MIKLOS, 2005 Statistical evidence for conserved, local secondary structure in the coding regions of eukaryotic mRNAs and pre-mRNAs. Nucleic Acids Res. 33: 63386348.
MINOVITSKY, S., S. L. GEE, S. SCHOKRPUR, I. DUBCHAK and J. G. CONBOY, 2005 The splicing regulatory element, UGCAUG, is phylogenetically and spatially conserved in introns that flank tissue-specific alternative exons. Nucleic Acids Res. 33: 714724.
MODREK, B., and C. LEE, 2003 Alternative splicing in the human, mouse and rat genomes is associated with an increased rate of exon creation / loss. Nat. Genet. 34: 177180.[CrossRef][Medline]
MODREK, B., A. RESCH, C. GRASSO and C. LEE, 2001 Genome-wide analysis of alternative splicing using human expressed sequence data. Nucleic Acids Res. 29: 28502859.
PAN, Q., O. SHAI, C. MISQUITTA, W. ZHANG, A. L. SALTZMAN et al., 2004 Revealing global regulatory features of Mammalian alternative splicing using a quantitative microarray platform. Mol. Cell 16: 929941.[CrossRef][Medline]
PHILIPPS, D. L., J. W. PARK and B. R. GRAVELEY, 2004 A computational and experimental approach toward a priori identification of alternatively spliced exons. RNA 10: 18381844.
SCHWARTZ, S., W. J. KENT, A. SMIT, Z. ZHANG, R. BAERTSCH et al., 2003 Human-mouse alignments with BLASTZ. Genome Res. 13: 103107.
SOREK, R., and G. AST, 2003 Intronic sequences flanking alternatively spliced exons are conserved between human and mouse. Genome Res. 13: 16311637.
SOREK, R., R. SHAMIR and G. AST, 2004a How prevalent is functional alternative splicing in the human genome? Trends Genet. 20: 6871.[CrossRef][Medline]
SOREK, R., R. SHEMESH, Y. COHEN, O. BASECHESS, G. AST et al., 2004b A non-EST-based method for exon-skipping prediction. Genome Res. 14: 16171623.
SUGNET, C. W., W. J. KENT, M. ARES, JR. and D. HAUSSLER, 2004 Transcriptome and genome conservation of alternative splicing events in humans and mice. Pac. Symp. Biocomput, 6677.
SUGNET, C. W., K. SRINIVASAN, T. A. CLARK, G. O'BRIEN, M. S. CLINE et al., 2006 Unusual intron conservation near tissue-regulated exons found by splicing microarrays. PLoS Comput. Biol. 2: e4.[CrossRef][Medline]
SUN, H., and L. A. CHASIN, 2000 Multiple splicing defects in an intronic false exon. Mol. Cell. Biol. 20: 64146425.
XING, Y., and C. LEE, 2005 Evidence of functional selection pressure for alternative splicing events that accelerate evolution of protein subsequences. Proc. Natl. Acad. Sci. USA 102: 1352613531.
YEO, G. W., E. VAN NOSTRAND, D. HOLSTE, T. POGGIO and C. B. BURGE, 2005 Identification and analysis of alternative splicing events conserved in human and mouse. Proc. Natl. Acad. Sci. USA 102: 28502855.
ZHANG, X. H., K. A. HELLER, I. HEFTER, C. S. LESLIE and L. A. CHASIN, 2003 Sequence information for the splicing of human pre-mRNA identified by support vector machine classification. Genome Res. 13: 26372650.
ZHANG, X. H., C. S. LESLIE and L. A. CHASIN, 2005 Dichotomous splicing signals in exon flanks. Genome Res. 15: 768779.
This article has been cited by other articles:
![]() |
A. Lofrano-Porto, G. B. Barra, L. A. Giacomini, P. P. Nascimento, A. C. Latronico, L. A. Casulari, and F. d. A. da Rocha Neves Luteinizing Hormone Beta Mutation and Hypogonadism in Men and Women N. Engl. J. Med., August 30, 2007; 357(9): 897 - 904. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Kim, A. V. Alekseyenko, M. Roy, and C. Lee The ASAP II database: analysis and comparative genomics of alternative splicing in 15 animal species Nucleic Acids Res., January 12, 2007; 35(suppl_1): D93 - D98. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |