Genetics, Vol. 173, 401-417, May 2006, Copyright © 2006
doi:10.1534/genetics.105.055202
Department of Biology, Washington University, St. Louis, Missouri 63130
1 Corresponding author: Department of Biology, Campus Box 1137, One Brookings Dr., St. Louis, MO 63130.
E-mail: richards{at}wustl.edu
| ABSTRACT |
|---|
3.5 kb), which represents the relatively simple organization of subtelomeric regions in this species. PCR fragment-length variation across the subtelomeric region indicated that the 1.4-kb distal region showed elevated structural variation relative to the centromere-proximal region. Examination of nucleotide sequences from this 1.4-kb region revealed diverse DNA rearrangements, including an inversion, several deletions, and an insertion of a retrotransposon LTR. The structures at the deletion and inversion breakpoints are characteristic of simple deletion-associated nonhomologous end-joining (NHEJ) events. There was strong linkage disequilibrium between the distal subtelomeric region and the proximal telomere, which contains degenerate and variant telomeric repeats. Variation in the proximal telomere was characterized by the expansion and deletion of blocks of repeats. Our sample of accessions documented two independent chromosome-healing events associated with terminal deletions of the subtelomeric region as well as the capture of a scrambled mitochondrial DNA segment in the proximal telomeric array. This natural variation study highlights the variety of genomic events that drive the fluidity of chromosome termini.
The role of subtelomeric regions in chromosome stability and function remains elusive. Subtelomeric regions in most organisms generally contain nonfunctional repetitive sequences, and in cases where subtelomeric sequences have been lost or omitted, cell viability and chromosome stability were not affected (see review by MEFFORD and TRASK 2002). However, subtelomeric regions can provide a backup mechanism for acquisition of a new telomeric end through ectopic recombination with shared subtelomeric sequences on nonhomologous chromosome ends (WANG and ZAKIAN 1990). In extreme cases where conventional telomere repeat addition is compromised in budding yeast, amplification of subtelomeric repeats can ensure chromosome maintenance through a so-called alternative telomere lengthening (ALT) mechanism (LUNDBLAD and BLACKBURN 1993). In addition to processes that might directly affect chromosome stability and telomere function, subtelomeric plasticity may also play a role as a platform for elaborating novel gene expression programs (e.g., VSG gene expression and host immune response avoidance in Trypanosoma) (BORST et al. 1996; BARRY et al. 2003). Another potential function of the subtelomeric region is to insulate the distal genes from the suppressive effects imposed by telomeric chromatin (telomere positional effect) (GOTTSCHLING et al. 1990; BAUR et al. 2001; GARCIA-CAO et al. 2004). Alternatively, the subtelomeric region can provide a location for genes to be modulated by chromatin level regulation, as has been demonstrated in Plasmodium (DURAISINGH et al. 2005; FREITAS-JUNIOR et al. 2005). However, the plastic nature of subtelomeric regions can also cause harm to an organism, as seen in human diseases associated with subtelomeric gene rearrangements (see review by MEFFORD and TRASK 2002).
These considerations suggest that subtelomeric regions are under a range of functional constraints and that different evolutionary processes are shaping the boundary and organization of the subtelomeric regions in diverse organisms.
In contrast to subtelomeric regions, the telomere plays a direct and essential role in maintaining genomic integrity and cell vitality (MULLER 1938; MCCLINTOCK 1941; BLACKBURN 2000). In most eukaryotes that have been examined, the extreme chromosomal termini are formed by the interaction of specialized telomere-binding proteins and tandem arrays of short G-rich repeats. These telomeric repeat arrays are synthesized by a ribonucleoprotein complex, called telomerase, using an RNA template. The telomeric structure protects the chromosome end from shortening through rounds of incomplete replication or degradation and also from being recognized as broken ends subject to DNA repair (see review by BLACKBURN 2001; BUCHOLC et al. 2001). Telomeric repeat arrays are dynamic structures that undergo expansion and contraction throughout development and possibly in response to physiological stresses (BLACKBURN 2001; EPEL et al. 2004). As a consequence of this equilibrium, telomeric sequences added at the end of the chromosomal molecule are turned over at a higher rate than those sequences located in the centromere-proximal portion of the telomeric repeat arrays. This imbalance in turnover rate can account for the higher frequency of degenerate and variant telomeric repeats in the centromere-proximal domain of the telomere that has been observed in various organisms (ALLSHIRE et al. 1989; RICHARDS et al. 1992; KIRK and BLACKBURN 1995).
Studies using the flowering plant Arabidopsis thaliana have provided basic information on both telomere DNA structure and its maintenance (RIHA and SHIPPEN 2003), but relatively little is known about the organization, function, and dynamics of A. thaliana subtelomeric regions. In most higher plant species studied, a variety of tandemly repeated sequences and transposons are associated with subtelomeric regions (RODER et al. 1993; WU and TANKSLEY 1993; VERSHININ et al. 1995; PEARCE et al. 1996; OHMIDO et al. 2001; ALKHIMOVA et al. 2004). In contrast, A. thaliana subtelomeric regions are remarkably small and simple, in accordance with this species' small genome size and paucity of repetitive sequences. The two chromosome ends (chromosome 2 north and 4 north) that constitute the nucleolus organizer regions have telomeric repeats adjoined to the tandem array of rRNA-coding sequences (COPENHAVER and PIKAARD 1996). The remaining eight chromosome ends contain short subtelomeric regions (<5 kb) that are devoid of highly repetitive sequences or transposons (ARABIDOPSIS GENOME INITIATIVE 2000; HEACOCK et al. 2004). Although some subtelomeric regions in Arabidopsis do share a few blocks of similarity of low-copy sequences among nonhomologous chromosomes (KOTANI et al. 1999; HEACOCK et al. 2004), subtelomeric regions in Arabidopsis do not share extensive similarity among most nonhomologous chromosomes, such as that seen in yeast and humans (LOUIS 1995; MEFFORD and TRASK 2002; LINARDOPOULOU et al. 2005).
We examined natural variation in the nucleotide sequence structure of the single-copy subtelomeric region and adjacent proximal region of the telomere from chromosome 1 north (1N) in A. thaliana to determine whether this unique region of the genome is evolving differently from other genomic loci. In addition, we examined the pattern of variation at the chromosome 1N end to infer the molecular mechanisms that shape the unusually simple genomic organization present at chromosomal termini in Arabidopsis.
| MATERIALS AND METHODS |
|---|
θw was calculated following the method of WATTERSON (1975). For the DV (degenerate and variant telomere repeat) region of the centromere-proximal telomere, sequences were aligned manually and organized into repeat units starting with signature of TTH (H: A or T) or TYY (Y: T or C) (RICHARDS et al. 1992, 1993). Gaps, grouped into consecutive repeat units, were generated to maximize the alignment of contiguous conserved repeats as well as to minimize the mutational steps within a repeat unit.
| RESULTS |
|---|
We examined the organization of chromosome 1N ends in 35 wild accessions of A. thaliana (Table 1) by PCR analysis using primers corresponding to various positions throughout At1g01010 and the subtelomeric region (Figure 1). Amplification using a combination of centromere-proximal primers (F3883 with R3270, R2789, or R2364) generated DNA products of predicted sizes from genomic DNA of all 35 accessions (data not shown). This finding indicates that the sequence corresponding to At1g01010 is adjacent to the chromosome 1N subtelomeric region for all accessions surveyed and that the accessions do not show detectable length variation in the centromere-proximal portion of the subtelomeric region. In contrast, genomic amplifications using the telomere-proximal primers F1640, F784, or F555 together with a primer corresponding to the canonical telomeric repeat [TeloRep(R): (CCCTAAA)3] resulted in predominant products of variable length among some accessions and no amplification in five accessions (Bur-0, Ct-1, Cvi-0, Oy-0, and Ws-2; data not shown). Subsequent nucleotide sequence analysis indicated that these five accessions contain a complex rearrangement of the subtelomeric region (see below). Thus, our tiling PCR analysis suggested that the general organization of the centromere-proximal portion of the chromosome 1N subtelomeric region is similar among the 35 accessions, while the telomere-proximal region has a higher degree of variation among accessions.
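The relationship between the reverse primer and the canonical repeat can be checked with a minimal sketch. It assumes only what the text states, namely that the canonical repeat is TTTAGGG and that TeloRep(R) is (CCCTAAA)3; the helper name `reverse_complement` is introduced here for illustration and is not part of the original methods.

```python
# Watson-Crick complements for the four unambiguous DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


canonical_repeat = "TTTAGGG"   # canonical Arabidopsis telomeric repeat, as given in the text
telorep_r = "CCCTAAA" * 3      # TeloRep(R): (CCCTAAA)3, as given in the text

# A reverse primer that anneals to three canonical repeats should equal
# the reverse complement of those three repeats.
primer_matches = reverse_complement(canonical_repeat * 3) == telorep_r
```

Under these assumptions `primer_matches` is true, which is consistent with TeloRep(R) priming from within the canonical repeat array back toward the subtelomeric region.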
Sequence variation in the 1.4-kb telomere-proximal subtelomeric region:
The PCR fragment-length variation apparent in the telomere-proximal portion of the subtelomeric region led us to investigate this variation at the nucleotide sequence level. The polymorphisms observed in our sample of 35 accessions are depicted in Figure 2. We detected a total of 87 polymorphic sites, including 70 sites of single-nucleotide substitutions, 10 small indels (≤11 bp), and 7 larger-scale DNA rearrangements. The single-nucleotide substitutions include 44 transitions and 28 transversions; 2 sites had three segregating nucleotides. Two of the 10 small indels occurred at mononucleotide sequences [(dA)7 and (dA)4], and another small indel resulted from a deletion/duplication of adjacent sequence.
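The substitution counts above can be reconciled with a short sketch. It classifies a base change as a transition or a transversion and checks the bookkeeping under one assumption that is not stated explicitly in the text: each of the 2 sites with three segregating nucleotides is counted as two substitutions. The function name `substitution_type` is hypothetical.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def substitution_type(a: str, b: str) -> str:
    """Classify a single-base change as a transition or a transversion."""
    a, b = a.upper(), b.upper()
    valid = PURINES | PYRIMIDINES
    if a not in valid or b not in valid or a == b:
        raise ValueError("expected two different bases from A, C, G, T")
    # Transitions stay within a chemical class (purine<->purine or
    # pyrimidine<->pyrimidine); transversions cross classes.
    same_class = (a in PURINES) == (b in PURINES)
    return "transition" if same_class else "transversion"


# Bookkeeping from the text (assumption: a site with three segregating
# nucleotides contributes two substitutions, a biallelic site one).
substitution_sites = 70
triallelic_sites = 2
inferred_changes = (substitution_sites - triallelic_sites) + 2 * triallelic_sites
reported_changes = 44 + 28   # transitions + transversions
```

With that assumption, the 70 substitution sites account for 72 changes, matching the 44 transitions plus 28 transversions reported.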
Among the larger-scale DNA rearrangements, deletions ranging from 31 to 418 bp were detected at five different positions (Figure 2), which corresponded to the length variation of predominant products observed in the PCR analysis. A sixth rearrangement corresponded to a replacement of a 409-bp sequence with an insertion of a 470-bp copia-like LTR (described below). The final large-scale rearrangement detected was a 1432-bp sequence inversion flanked by two deletions of 191 and 97 bp at the centromeric and telomeric boundaries of the inversion, respectively. This complex rearrangement underlies the distinct variation found in the five accessions that showed no amplification in the tiling PCR analysis.
Nucleotide diversity in the 1.4-kb subtelomeric region measured θw = 0.012, on the basis of the number of segregating sites (WATTERSON 1975) (supplemental Table 1, http://www.genetics.org/supplemental/). As shown in Figure 2, the polymorphic sites are not evenly distributed throughout the 1.4-kb region. A sliding-window analysis showed that nucleotide diversity is lowest in an ~500-bp region closest to the centromere and peaks at the center of the 1.4-kb region (within a 250-bp region; data not shown). The central region with higher diversity is characterized by a cluster of singleton single nucleotide substitutions that are not associated with any particular sequence structure. The pattern of polymorphism in this region is consistent with a null hypothesis of neutral evolution (Tajima's D = 1.02, P > 0.1) (TAJIMA 1989). We did not detect evidence for homologous recombination events in this region on the basis of the pattern of polymorphism and the haplotype relationships among the accessions (see next section).
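For readers unfamiliar with the two statistics cited in this paragraph, the following is a minimal sketch of Watterson's estimator and Tajima's D using the standard published formulas (WATTERSON 1975; TAJIMA 1989). The inputs used at the bottom (70 segregating sites, 35 sequences, 1400 aligned sites) are illustrative assumptions read off the text, not the authors' exact data set; their treatment of indels, multiallelic sites, and alignment length may differ.

```python
from math import sqrt


def harmonic_terms(n: int) -> tuple[float, float]:
    """Return a1 = sum(1/i) and a2 = sum(1/i^2) for i = 1 .. n-1."""
    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / (i * i) for i in range(1, n))
    return a1, a2


def watterson_theta(seg_sites: int, n: int, length: int) -> float:
    """Per-site Watterson estimator: S / (a1 * L)."""
    a1, _ = harmonic_terms(n)
    return seg_sites / (a1 * length)


def tajimas_d(pi_total: float, seg_sites: int, n: int) -> float:
    """Tajima's D from the mean number of pairwise differences (pi_total,
    summed over the region, not per site), S, and the sample size n."""
    a1, a2 = harmonic_terms(n)
    b1 = (n + 1) / (3 * (n - 1))
    b2 = 2 * (n * n + n + 3) / (9 * n * (n - 1))
    c1 = b1 - 1 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1 ** 2
    e1 = c1 / a1
    e2 = c2 / (a1 ** 2 + a2)
    variance = e1 * seg_sites + e2 * seg_sites * (seg_sites - 1)
    return (pi_total - seg_sites / a1) / sqrt(variance)


# Illustrative inputs (assumptions, see lead-in).
theta_w = watterson_theta(seg_sites=70, n=35, length=1400)
```

With these assumed inputs `theta_w` comes out close to the reported 0.012. Tajima's D additionally needs the mean pairwise difference, which is not given in this section, so no value is computed for it here; D is zero when the two diversity estimates agree and positive when pairwise diversity exceeds the Watterson expectation.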
Subtelomeric structure and phylogenetic relationships among haplotypes:
The large-scale DNA rearrangements over the 1.4-kb subtelomeric region were grouped into five distinct structural classes (Figure 3A). The most common class includes the standard lab strains Col and Ler-1 (class 1). The class 2 subtelomeric structure is characterized by the large sequence inversion event accompanied by flanking deletions. Class 3 subtelomeric structures are marked by three independent interstitial deletions (31, 43, and 76 bp) at various positions. Two of these deletions are flanked by short direct repeat sequences (4 and 5 bp, respectively). These interstitial deletions are distinguished from class 4 subtelomeric regions, which contain deletions immediately adjacent to the telomeric repeats (subterminal deletions; Figure 3). The class 4 subtelomeric structures include two subterminal deletions with endpoints that are 42 bp apart. One subterminal deletion resulted in a 376-bp deletion (distal to coordinate 491, Figure 3B); the breakpoint was followed immediately by canonical telomeric repeats (TTTAGGG). The other subterminal deletion removed 418 bp (distal to coordinate 533), resulting in a boundary containing a short degenerate telomeric sequence (TTAAGGATTCAGAGA) followed by canonical telomeric repeats. Finally, the insertion of a 470-bp copia-like LTR in replacement of a 409-bp sequence defines the class 5 subtelomeric structure, which was found in a single accession, N13. This solo LTR has inverted repeat termini (5'-tg ... ca-3') characteristic of retroviral and retrotransposon LTRs flanked by a 5-bp target site duplication (AATGT) (Figure 3).
Linkage disequilibrium between the subtelomeric region and the adjacent telomeric region:
We extended our DNA sequence analysis into the telomeric region adjacent to the subtelomeric sequence, where motifs were found that conform to a greater or lesser degree to canonical telomeric repeats (TTTAGGG) (RICHARDS et al. 1993). In most of the accessions (with the exception of Kz-1) this centromere-proximal telomeric region consisted of a mosaic of degenerate repeats and variant repeats (Figure 5). We refer to this heterogeneous zone as the DV region, which is followed by more distal canonical repeats. Despite the complexity of the DV region, we were able to align the different sequences in this region by referring to nucleotide substitutions characteristic of particular degenerate and variant repeat motifs (Figure 5). The length of the DV region is variable among the accessions, ranging from 14 to 280 bp. However, two major DV region length morphs, ~85 and ~221 bp, exist that distinguish two clusters of subtelomeric haplotypes on the haplotype tree (distinguished by the dividing shaded vertical line shown in Figure 4A). Accessions sharing a particular subtelomeric haplotype contain nearly identical sequences over the DV region (Figure 5), indicating that the subtelomeric region and the adjacent DV region are in linkage disequilibrium. To compare explicitly the haplotype structure of the two regions, we performed a phylogenetic analysis on the DV region, which yielded 24 equally parsimonious trees (tree length = 50 steps; CI = 1; RI = 1). As with the subtelomeric tree, ambiguities in the haplotype topology are attributable to gaps (in this case, those generated by indels of telomeric repeats). A DV consensus tree (Figure 4B) has a topology that is completely congruent with the subtelomeric region haplotype tree (Figure 4A), consistent with our observation that these two adjacent regions share the same underlying haplotype structure and are in strong linkage disequilibrium. The alignment shown in Figure 5 also highlights the duplication and deletion of variable numbers of repeats, and in some cases (marked by gold arrows) we could detect duplication/deletion of adjacent repeats by referring to their unique sequence signatures.
Capture of mitochondrial DNA in the proximal telomere:
We previously isolated and partially characterized a set of A. thaliana telomeric genomic clones that corresponded to junctions between the terminal telomeric repeat arrays and the adjacent subtelomeric sequence (RICHARDS et al. 1992). One of these clones derived from accession Ler had a small island of nontelomeric sequence embedded in the telomeric repeat array (E. J. RICHARDS, unpublished data). A comparison with the genomic sequence from the accession Col indicated that this Ler telomere clone corresponds to the termini of chromosome 1N; however, the current version of the sequence database does not cover the position of the nontelomeric sequence. We confirmed the presence of this unique sequence distal to the subtelomeric region of chromosome 1N by PCR analysis with the genomic DNA isolated from Ler (data not shown). This nontelomeric sequence is 104 bp in length and is located in the homogeneous telomeric repeat array distal to the DV region (sequence marked by colored arrows in Figure 6A). A BLAST search of the database showed that this nontelomeric sequence consists of two fragments (a proximal 68-bp segment and a distal 36-bp segment) identical to two noncontiguous sequences in the mitochondrial genome of A. thaliana (Figure 6B) (UNSELD et al. 1997). In addition, the distal mtDNA fragment also matched a sequence in the pericentromeric region of chromosome 2 (coordinates 3262720–3262755) where a 620-kb duplicated and rearranged mitochondrial genomic sequence was transferred (LIN et al. 1999; STUPAR et al. 2001). The proximal 68-bp mtDNA segment is identical to an intergenic region between orf315 and NAD5 of the mitochondrial genome and overlaps with an intronic region between exon b and exon c of NAD2. The distal 36-bp fragment is identical to an intergenic region between orf106e and orf107f.
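The segment lengths in this paragraph can be cross-checked with a few lines of arithmetic. The sketch assumes that the chromosome 2 coordinates are an inclusive range from 3262720 to 3262755; the helper `inclusive_length` is introduced here only for illustration.

```python
def inclusive_length(start: int, end: int) -> int:
    """Length of a sequence interval whose start and end are both included."""
    return end - start + 1


proximal_segment_bp = 68   # proximal mtDNA-derived segment (from the text)
distal_segment_bp = 36     # distal mtDNA-derived segment (from the text)

# The two segments together should account for the whole nontelomeric island.
insertion_total_bp = proximal_segment_bp + distal_segment_bp

# Assumed reading of the chromosome 2 coordinates as an inclusive range.
chr2_match_bp = inclusive_length(3262720, 3262755)
```

Under that reading, the two segments sum to the 104-bp island and the chromosome 2 match spans 36 bp, the same length as the distal fragment it is said to match.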
Among the 12 accessions containing the mtDNA insertion event, we observed variation corresponding to apparent deletions of one to five canonical telomeric repeats (alignment gaps indicated by red dashes in Figure 6A). This variation was found in a group of accessions sharing a common subtelomeric haplotype (Br-0, NFA-8, Ra-1, and Ts-1) (Figures 4A and 6A), implying that the canonical telomeric repeat deletion and amplification is proceeding at a comparatively fast rate. We incorporated the polymorphisms within the proximal telomeric region (DV plus the region extending to the mtDNA) to resolve the phylogenetic relationship among a number of closely related class 1 subtelomeric haplotypes (Figure 6C). The refined tree suggests that the mtDNA insertion is a unique event that occurred after the divergence of accession Lz-0 from the ancestral haplotype of the 12 accessions (indicated in the shaded boxes in Figure 6C).
| DISCUSSION |
|---|
θw = 0.012) is comparable to the average genomewide silent nucleotide diversity [θw = 0.00896 from SCHMID et al. (2005); θw = 0.007–0.01 from NORDBORG et al. (2005)]. The indel diversity in this subtelomeric region is also comparable to that present in other noncoding regions along chromosome 1 (NORDBORG et al. 2005; see supplemental Table 1, http://www.genetics.org/supplemental/). Therefore, the subtelomeric region of chromosome 1N appears to be evolving in a manner similar to that seen in other noncoding regions of the genome. We note that the subtelomeric region of chromosome 1N displays an elevated frequency of larger-scale rearrangements in the distal region adjacent to the telomeric repeats. As discussed below, these larger-scale rearrangements were caused by diverse molecular mechanisms. This observation is reminiscent of a recent report that the repertoire of DNA damage-induced rearrangements in budding yeast increases in subtelomeric regions (RICCHETTI et al. 2003). A wealth of cytological observations indicates that initiation of homologous chromosome synapsis and recombination occurs toward the chromosome ends (SCHERTHAN 2001; SCHWARZACHER 2003; HARPER et al. 2004). These observations suggest that homologous recombination in subtelomeric regions may be frequent. Consistent with this expectation, the ratio of genetic to physical distance increases toward the ends of linkage maps in many organisms (NIH/CEPH COLLABORATIVE MAPPING GROUP 1992; LUKASZEWSKI and CURTIS 1993; SCHWARZACHER 1996; LIN et al. 1999; MAYER et al. 1999). It is not clear, however, whether the elevated recombination extends to the extreme terminal region of the chromosome. The presence of strong linkage disequilibrium between the distal subtelomeric region and the adjacent proximal telomere, which is observed in humans (BAIRD et al. 2000) and Arabidopsis (this study), argues that recombination is infrequent at the junctions between the subtelomeric regions and the telomeres.
The evolutionary neutrality and well-defined haplotype structure of the Arabidopsis chromosome 1N subtelomeric region suggest that the region could be useful for addressing evolutionary questions in this genus. We applied this marker to examine the evolutionary origins of the allotetraploid A. suecica (2n = 4X = 26), a putative hybrid of A. thaliana (2n = 10) and A. arenosa (2n = 16) (HYLANDER 1957; O'KANE et al. 1996). Several studies indicate A. thaliana was the maternal parent of A. suecica (MUMMENHOF and HURKA 1994; SÄLL et al. 2003); on the basis of chloroplast DNA sequences, SÄLL et al. (2003) suggested that A. suecica (endemic populations in Sweden and Finland) arose through a single hybridization event. To test this hypothesis, we analyzed a set of 10 A. suecica accessions collected from endemic geographic locations and 1 of unknown origin (Table 1). We found that all accessions shared identical sequences over the 2-kb telomere-proximal region and contained sequence rearrangement characteristic of class 2 subtelomeric structures (Figure 3A). Phylogenetic analysis indicated that these A. suecica accessions originated from a common ancestral A. thaliana haplotype closely related to the extant class 2 haplotypes (Figure 4A). Thus, our data are consistent with the single-origin hypothesis for A. suecica.
Molecular mechanisms underlying the dynamics of the proximal telomeric region:
Mutations in telomeric repeats have been shown to alter the interaction between the telomere and its associated proteins, resulting in alterations in telomere structure and developmental anomalies (YU et al. 1990; MCEACHERN and BLACKBURN 1995; PRESCOTT and BLACKBURN 1997). The accumulation of mutations in the centromere-proximal portion of the telomere suggests that this region may be under less functional constraint. In addition to the ongoing nucleotide substitutions and small indels within the repeats, comparison of the proximal telomeric region among natural accessions revealed that deletion and expansion of blocks of repeats also contribute to polymorphisms. This mutational pattern is reminiscent of the instability of microsatellites (SCHLOTTERER 2000; ELLEGREN 2004). The molecular mechanisms underlying the instability of microsatellites and other tandem repetitive sequences include unequal exchange and replication slippage (BZYMEK and LOVETT 2001; SCHLOTTERER and TAUTZ 1992). The expansion of human telomeric repeat sequences [(TTAGGG)n] via replication slippage has been demonstrated in an in vitro DNA synthesis assay, and instability of telomeric repeat sequences (expansions and deletions) carried out on plasmid DNA was also observed during propagation in bacterial cells (NOZAWA et al. 2000). In humans, the proximal telomeric region is also characterized by accumulation of variant telomeric repeats, which are postulated to arise from intra-allelic mutational processes, such as replication slippage or unequal sister chromatid exchange, in light of the evidence that inter-allelic homologous recombination is suppressed in this region (BAIRD et al. 1995, 2000). As discussed above, we have found no evidence for reciprocal homologous recombination in the Arabidopsis chromosome 1N subtelomeric-proximal telomere region that would accompany unequal exchange.
However, it is important to note that this plant's predominant selfing habit (and associated high homozygosity) would make detection of these homologous recombination events less likely. Regardless of the precise molecular mechanisms at work, the pattern of genetic variation suggests that similar evolutionary processes are acting on the proximal telomeric regions in humans and Arabidopsis.
An unusual type of polymorphism identified within the Arabidopsis chromosome 1N proximal telomere repeat array is the insertion of mitochondrial DNA. The composite structure of the 104-bp fragment is similar to structures generated by filler DNA captured in double-strand break (DSB) repaired sites through NHEJ. Insertion of filler sequences (e.g., nuclear genomic sequences, reverse transcribed products of retrotransposons, mitochondrial DNA) at repaired DSB sites is found in many eukaryotes (NASSIF et al. 1994; MOORE and HABER 1996; GORBUNOVA and LEVY 1997; SALOMON and PUCHTA 1998; RICCHETTI et al. 1999; YU and GABRIEL 1999; LIN and WALDMAN 2001). These filler sequences integrate either as one contiguous segment or as an assemblage of scrambled segments. The synthesis-dependent strand annealing model has been proposed to explain the insertion of filler sequences at DSB sites. In this model, a 3'-protruding strand of a broken end primes DNA synthesis and copies a stretch of DNA using an ectopic template before rejoining the other broken terminus (FORMOSA and ALBERTS 1986; NASSIF et al. 1994; GORBUNOVA and LEVY 1997). This model could explain the two noncontiguous mtDNA sequences captured in a reverse orientation in the telomeric array found here: repair synthesis had switched between two ectopic mitochondrial genomic fragments, using short stretches of complementarity at the junctions between the telomeric sequence and the flanking genomic sequences of the captured fragments (Figure 6B, underlined nucleotides).
We estimated that this mtDNA insertion is a recent evolutionary event (30,000 YA) supporting the view that mtDNA integration into the nuclear genome is an ongoing process (YU and GABRIEL 1999; ADAMS et al. 2000; RICCHETTI et al. 2004). Furthermore, our results demonstrate that the telomeres can be a port of entry for organellar DNA into the nuclear genome. Interestingly, another example of a transfer of a short, scrambled mitochondrial sequence to the chromosome termini has been reported in yeast, but in this case the mtDNA was inserted into the subtelomeric XY' element boundary rather than the terminal telomeric repeat array (LOUIS and HABER 1991). The inclusion of filler sequences into the telomere repeat array has been reported in a special circumstance: NHEJ-mediated end-to-end fusions of critically shortened telomeres (100–400 bp) in an Arabidopsis telomerase mutant (HEACOCK et al. 2004). This result suggests that when the telomere length falls below a minimum threshold it will be processed by DSB-initiated DNA repair in the absence of telomerase. The captured mtDNA described in the present study occurred ~140 bp from the subtelomeric–telomeric boundary, suggesting that DSB repair machinery can compete with telomerase-mediated telomere addition for access to the proximal region of the telomere to repair the broken telomere in wild-type backgrounds where telomerase is expected to be active.
Molecular mechanisms underlying the dynamics of the subtelomeric region:
The larger-scale DNA rearrangements found over the distal subtelomeric region in this study are associated with deletions of >30 bp. All of these deletions, with the exception of the inversion characterizing class 2 subtelomeric regions, occur without additional rearrangements, including the transposition event seen in class 5 (see below). Deletions at nonrepetitive random sequences can be derived from replication slippage between short direct repeats or NHEJ after exonucleolytic processing of DSB sites. Although little is known about the requirements for replication slippage in plants, deletions caused by replication slippage are associated with 3- to 9-bp perfect or imperfect direct repeats in the budding yeast (TRAN et al. 1995). Among the seven characterized DNA rearrangements in the distal subtelomeric region, the two smallest deletions, the 31- and 43-bp interstitial deletions, are flanked by 4-bp perfect direct repeats and could have resulted from either replication slippage or an NHEJ event. In contrast, the 76-bp interstitial deletion shows no recognizable sequence similarity flanking the breakpoints and is likely to have resulted from NHEJ. The remaining four rearrangements show 0- to 3-bp identity flanking the breakpoints and are likely to be caused by an NHEJ process rather than replication slippage, a conclusion further supported by additional characteristics of these rearrangements. For example, in the class 2 subtelomeric inversion event, we observed limited regions of identity at the sequences flanking the rejoined sites (1 and 3 bp for the proximal and the distal deletion breakpoints, respectively). This rearrangement appears to involve two DSB events, followed by exonucleolytic processing and NHEJ rejoining of three chromosome fragments, with the central fragment inverted. This rearrangement resembles the irradiation-induced DNA rearrangements found in the A. thaliana transparent testa 3 (tt3) allele (SHIRLEY et al. 1992).
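The breakpoint reasoning in this paragraph can be sketched as follows. The function measures the direct repeat (microhomology) flanking a deletion, and a second helper applies a rough cutoff taken from the discussion above: 4 bp or more is treated as compatible with either replication slippage or NHEJ, and shorter identity as favoring NHEJ. The cutoff, the function names, and the toy sequences are illustrative assumptions, not the authors' procedure or data.

```python
def _common_prefix_len(a: str, b: str) -> int:
    """Number of leading characters shared by a and b."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


def breakpoint_microhomology(left_flank: str, deleted: str, right_flank: str) -> int:
    """Length of the direct repeat flanking a deletion.

    The reference is left_flank + deleted + right_flank. A deletion between
    direct repeats can be slid along the repeat without changing the product,
    so the repeat length is the identity between the start of the deleted
    segment and the right flank plus the identity between the end of the
    deleted segment and the end of the left flank.
    """
    head = _common_prefix_len(deleted, right_flank)
    tail = _common_prefix_len(deleted[::-1], left_flank[::-1])
    return min(head + tail, len(deleted))


def likely_mechanism(repeat_len: int, slippage_min: int = 4) -> str:
    """Heuristic label; slippage_min is an assumed cutoff for illustration."""
    if repeat_len >= slippage_min:
        return "replication slippage or NHEJ"
    return "NHEJ more likely"


# Toy sequences (hypothetical): a deletion between two copies of GATC.
toy_repeat = breakpoint_microhomology("ACGTAG", "GATCAAACCC", "GATCTTG")
# Toy sequences (hypothetical): no identity at either breakpoint.
toy_blunt = breakpoint_microhomology("ACGTAG", "CTTTGGA", "GCCAT")
```

In the first toy case the deletion is flanked by a 4-bp direct repeat, so both mechanisms remain possible; in the second there is no flanking identity, which is the pattern the text attributes to NHEJ.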
Telomere addition at a broken chromosome end (chromosome healing) has been observed in a number of organisms (HABER and THORBURN 1984; POLOGE and RAVETCH 1988; WILKIE et al. 1990; YU and BLACKBURN 1991; WERNER et al. 1992; SPRUNG et al. 1999). Chromosome healing can occur through various processes, including de novo telomere synthesis by telomerase or NHEJ with a preexisting telomere (telomere capturing). The class 4 subtelomeric structures (subterminal deletions) resemble healed chromosome ends resulting from DSBs and deletions of the distal subtelomeric region. Intriguingly, such deletions occurred twice independently at nearby positions in the evolutionary history represented by the accessions examined (Figures 3B and 4). The different signatures of telomeric sequences adjacent to the subterminal breakpoints suggest that different processes underlie the two independent events.
In one accession, Kz-1, the deletion breakpoint was followed by an array of homogeneous repeats, the majority of which match the canonical telomere repeat motif. This structure is most easily explained by de novo telomere synthesis by telomerase. Under such a model, the telomere addition could have initiated using 5'-TT-3' as a priming site (underlined TT dinucleotide in Figure 3B), which is in phase with the first added telomere repeat sequence (TTTAGGG). Alternatively, telomerase can also add telomeric repeats to a DNA 3' terminus lacking any complementarities to the telomerase RNA template using a so-called default mechanism (MELEK and SHIPPEN 1996). Under such a scenario, the 5'-AC-3' dinucleotide directly proximal to the first complete canonical telomere repeat could have served as a priming site for telomerase-mediated telomere addition. Although G-rich sequences in the proximity of a noncomplementary 3' terminus are postulated to recruit telomerase by the default mechanism (HARRINGTON and GREIDER 1991; MULLER et al. 1991; BOTTIUS et al. 1998), we did not observe such a feature in the flanking sequence of the Kz-1 subterminal deletion breakpoint. It is noteworthy that an in vitro study demonstrated that Arabidopsis telomerase has a relaxed specificity for DNA recognition (FITZGERALD et al. 2001). While 5'-G13-3' is a preferred priming site for Arabidopsis telomerase in vitro, elongation of nontelomeric 3' ends can occur by positioning the DNA at a default site in the RNA template, leading to the addition of 5'-GGTTTAG-3' as the first telomeric sequence. However, the novel subtelomeric–telomeric junction created in Kz-1 does not show the sequence signature predicted by the in vitro result. This discrepancy may be due to the different biochemical activity of the telomerase in vivo or a nontelomerase-mediated mechanism (e.g., NHEJ-mediated telomere capture) could underlie this healing event.
In contrast to the situation in Kz-1, the junction of the subterminal deletion shared by accessions Kas-2, Shakdara, and Tamm-27 contains two degenerate telomeric repeats before transitioning into a more homogeneous array of repeats conforming to the canonical telomere sequence (Figure 3B). Moreover, these two degenerate telomeric repeats are nearly identical to repeats that are also located adjacent to more uniform canonical telomere repeat arrays in the DV region of other accessions with different subtelomeric structures (Figure 5). These observations make NHEJ-mediated telomere capture a more plausible mechanism to explain the origin of this subterminal deletion. If a telomerase-mediated mechanism is invoked, it is also necessary to postulate that an accumulation of mutations at the subtelomeric-telomeric region boundary occurred subsequent to de novo telomere synthesis.
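The distinction drawn here between a junction that runs directly into canonical repeats and one that begins with degenerate repeats can be illustrated with a minimal Python sketch. It assumes the telomeric side of the junction has already been extracted and is in phase with the 7-nt repeat unit. The function names and the toy sequences are hypothetical and are not the sequences shown in Figure 3B.

```python
CANONICAL = "TTTAGGG"  # canonical telomere repeat motif named in the text


def repeat_mismatches(telomeric_seq, unit=CANONICAL):
    """Cut the sequence into consecutive, complete unit-length windows and
    count the mismatches of each window against the canonical repeat."""
    k = len(unit)
    windows = [telomeric_seq[i:i + k]
               for i in range(0, len(telomeric_seq) - k + 1, k)]
    return [sum(a != b for a, b in zip(w, unit)) for w in windows]


def leading_degenerate_repeats(telomeric_seq, unit=CANONICAL):
    """Count variant repeats that precede the first perfect canonical repeat."""
    count = 0
    for mismatches in repeat_mismatches(telomeric_seq, unit):
        if mismatches == 0:
            break
        count += 1
    return count


# Hypothetical toy junctions (assumptions for illustration only).
homogeneous_array = "TTTAGGG" * 4
degenerate_then_canonical = "TTCAGGG" + "TTTAGAG" + "TTTAGGG" * 3

print(leading_degenerate_repeats(homogeneous_array))
print(leading_degenerate_repeats(degenerate_then_canonical))
```

A count of zero would be the pattern expected for de novo synthesis onto a broken end, whereas a nonzero count resembles the captured-telomere pattern. A real analysis would also need to handle phase shifts and indels within repeats, which this sketch ignores.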
The other complex rearrangement found in the subtelomeric regions of chromosome 1N is a deletion apparently coupled with an insertion of a solo LTR of a copia class retrotransposon in accession N13. Retrotransposons make up a significant portion of plant genomes and are important players in plant genome evolution (KUMAR and BENNETZEN 1999). In A. thaliana, retrotransposons predominantly cluster in the heterochromatic centromeres and pericentromeric regions but not in the subtelomeric regions, although subtelomeric regions have been regarded as heterochromatic and hot spots for retroelement insertion in some organisms (PEARCE et al. 1996; ZOU et al. 1996; ARABIDOPSIS GENOME INITIATIVE 2000; PETERSON-BURCH et al. 2004). A genomewide study of retrotransposon distribution in A. thaliana showed that solo LTRs are abundant (composing 1.57% of the genome) and localize predominantly in pericentromeric regions (PETERSON-BURCH et al. 2004). These elements can be derived from various processes, such as recombination between the 5' and 3' LTRs of a single element (intra-element) to generate a recombinant LTR flanked by identical target site duplications (TSDs). Alternatively, an unequal reciprocal recombination event between two elements (either intrachromatid or interchromosomal) will result in a recombinant solo LTR flanked by different TSDs (DEVOS et al. 2002). The copia class solo LTR found in N13 is flanked by identical 5-bp TSDs and is also associated with a 409-bp deletion of the subtelomeric sequence at the insertion site. A simple intra-element recombination cannot explain the associated deletion at the same location. The presence of the same TSD sequence also makes inter-element recombination (either intra- or interchromatid) unlikely, because this scenario requires two independent transposition events targeting the same 5-bp sequence motif at a nearby location. 
Capture of LTR sequences (derived from incomplete reverse transcribed products) at induced DSB sites by illegitimate recombination has been observed in yeast (MOORE and HABER 1996; TENG et al. 1996; YU and GABRIEL 1999), but the retention of an intact solo LTR together with identical flanking TSDs argues against such an explanation. These considerations suggest that the N13 subtelomeric haplotype arose from a sequence of events, starting with a 409-bp deletion, followed by an insertion of a copia element, and subsequent deletion of the internal region of the retrotransposon by intra-element recombination.
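The reasoning about the origin of a solo LTR can be summarized as a small decision rule. The Python sketch below is a simplification under stated assumptions: it looks only at the two target site duplications and at whether a deletion is present at the insertion site. The function name, the returned labels, and the example TSD sequence are hypothetical, since the actual N13 TSD sequence is not given in the text.

```python
def classify_solo_ltr(left_tsd, right_tsd, deletion_bp=0):
    """Suggest the simplest origin of a solo LTR from its flanking TSDs.

    left_tsd, right_tsd: target site duplication sequences on each flank.
    deletion_bp: size of any deletion of host sequence at the insertion site.
    """
    if left_tsd.upper() != right_tsd.upper():
        # Different TSDs point to unequal recombination between two elements.
        return "inter-element recombination"
    if deletion_bp > 0:
        # Identical TSDs plus a deletion: intra-element recombination alone
        # does not account for the lost host sequence, so a multistep
        # history (deletion, insertion, then LTR-LTR recombination) is
        # the simplest reading.
        return "deletion, insertion, then intra-element recombination"
    return "intra-element recombination"


# Hypothetical 5-bp TSD; only the 409-bp deletion size comes from the text.
print(classify_solo_ltr("ACGTA", "ACGTA", deletion_bp=409))
print(classify_solo_ltr("ACGTA", "ACGTA"))
print(classify_solo_ltr("ACGTA", "TTGCA"))
```

The rule deliberately omits the alternative of LTR capture at a DSB, which the text argues against on the grounds that the LTR is intact and flanked by identical TSDs.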
Implications for genome evolution:
The organization of the subtelomeric region of chromosome 1N is characteristic of the apparent simplicity of subtelomeric regions in Arabidopsis. Although some subtelomeric regions share sequence similarity (KOTANI et al. 1999; HEACOCK et al. 2004), most Arabidopsis subtelomeric regions are characterized by their small size and paucity of repetitive sequences. In contrast, subtelomeric regions found in many other organisms, such as yeast and human, are mosaics of repetitive sequences, many of which share extensive similarity among subtelomeric regions of nonhomologous chromosomes (FLINT et al. 1997). Complex mechanisms operate to generate these patchwork structures, including ectopic sequence translocations, NHEJ, and homology-mediated recombination between nonhomologous chromosome ends (see Introduction). The simple organization of the chromosome 1N subtelomeric region in Arabidopsis is not due to a lack of fluidity of this genomic region, as we observed diverse large-scale rearrangements over the distal end of this subtelomeric region. The structures of the naturally occurring DNA rearrangements that we observed indicate that the predominant mechanism operating on this region is simple deletion-associated NHEJ repair (at least five of seven rearrangements are rejoined at sites with limited identity).
In plants, NHEJ is a major mechanism underlying DSB repair (GORBUNOVA and LEVY 1999; PUCHTA 2005). Characterization of experimentally induced DSBs has demonstrated that complex processes are employed during NHEJ in plants. In tobacco, two independent experimental systems demonstrate that many DSB repair events involve deletions and approximately one-third of the events result in addition of filler DNA (GORBUNOVA and LEVY 1997; SALOMON and PUCHTA 1998). In a direct comparison of NHEJ processing at DSBs in Arabidopsis and tobacco (with a genome size 20-fold greater than that of Arabidopsis), the deletions recovered in tobacco were on average smaller and often accompanied by sequence insertions, whereas in A. thaliana deletions were frequently larger and without insertions (KIRIK et al. 2000). This observation led to a model that the species-specific difference in DSB repair is a cause for genome size evolution in plants (KIRIK et al. 2000). This hypothesis fits the inverse relationship between deletion and genome size previously demonstrated in insects (PETROV et al. 2000). The observations of KIRIK et al. (2000) are consistent with spontaneous deletions found in other plants with large genomes, such as that described in the maize Waxy locus that was associated with insertion of filler DNA (WESSLER et al. 1990). The hypothesis also gains further support from the study of turnover of retrotransposons, which have contributed significantly to the genome expansion in higher plants (DEVOS et al. 2002; KUMAR and BENNETZEN 1999). DEVOS et al. (2002) estimated that deletion-associated illegitimate recombination is more than fivefold more frequent than homologous recombination-mediated elimination of retroelements in A. thaliana and suggested that such a mechanism may counteract the genome expansion in A. thaliana. The predominant simple deletion-associated NHEJ process evident in our study of natural variation in the subtelomeric region of Arabidopsis chromosome 1N parallels the process that appears to be constraining the growth of the genome of this species through the diminution of retrotransposon sequences. It is likely that this bias in processing NHEJ events dictates not only the small size but also the simple organization of the subtelomeric regions in Arabidopsis, in light of the studies that show how translocation-associated NHEJ leads to an accumulation of a complex patchwork of subtelomeric sequences shared between nonhomologous chromosome ends (LINARDOPOULOU et al. 2005).
Further investigation of natural variation in A. thaliana and other organisms with small and tightly managed genomes will determine how well simple deletion-associated NHEJ repair explains the dynamics and maintenance of simple subtelomeric structures.
| LITERATURE CITED |
|---|
ADAMS, K. L., D. O. DALEY, Y. L. QIU, J. WHELAN and J. D. PALMER, 2000 Repeated, recent and diverse transfers of a mitochondrial gene to the nucleus in flowering plants. Nature 408: 354-357.
ALKHIMOVA, O. G., N. A. MAZUROK, T. A. POTAPOVA, S. M. ZAKIAN, J. S. HESLOP-HARRISON et al., 2004 Diverse patterns of the tandem repeats organization in rye chromosomes. Chromosoma 113: 42-52.
ALLSHIRE, R. C., M. DEMPSTER and N. D. HASTIE, 1989 Human telomeres contain at least three types of G-rich repeat distributed non-randomly. Nucleic Acids Res. 17: 4611-4627.
AMARGER, V., D. GAUGUIER, M. YERLE, F. APIOU, P. PINTON et al., 1998 Analysis of distribution in the human, pig, and rat genomes points toward a general subtelomeric origin of minisatellite structures. Genomics 52: 62-71.
ARABIDOPSIS GENOME INITIATIVE, 2000 Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature 408: 796-815.
BAIRD, D. M., and N. J. ROYLE, 1997 Sequences from higher primates orthologous to the human Xp/Yp telomere junction region reveal gross rearrangements and high levels of divergence. Hum. Mol. Genet. 6: 2291-2299.
BAIRD, D. M., A. J. JEFFREYS and N. J. ROYLE, 1995 Mechanisms underlying telomere repeat turnover, revealed by hypervariable variant repeat distribution patterns in the human Xp/Yp telomere. EMBO J. 14: 5433-5443.
BAIRD, D. M., J. COLEMAN, Z. H. ROSSER and N. J. ROYLE, 2000 High levels of sequence polymorphism and linkage disequilibrium at the telomere of 12q: implications for telomere biology and human evolution. Am. J. Hum. Genet. 66: 235-250.
BARRY, J. D., M. L. GINGER, P. BURTON and R. MCCULLOCH, 2003 Why are parasite contingency genes often associated with telomeres? Int. J. Parasitol. 33: 29-45.
BAUR, J. A., Y. ZOU, J. W. SHAY and W. E. WRIGHT, 2001 Telomere position effect in human cells. Science 292: 2075-2077.
BERGELSON, J., E. STAHL, S. DUDEK and M. KREITMAN, 1998 Genetic variation within and among populations of Arabidopsis thaliana. Genetics 148: 1311-1323.
BLACKBURN, E. H., 2000 Telomere states and cell fates. Nature 408: 53-56.
BLACKBURN, E. H., 2001 Switching and signaling at the telomere. Cell 106: 661-673.
BORST, P., G. RUDENKO, M. C. TAYLOR, P. A. BLUNDELL, F. VAN LEEUWEN et al., 1996 Antigenic variation in trypanosomes. Arch. Med. Res. 27: 379-388.
BOTTIUS, E., N. BAKHSIS and A. SCHERF, 1998 Plasmodium falciparum telomerase: de novo telomere addition to telomeric and nontelomeric sequences and role in chromosome healing. Mol. Cell. Biol. 18: 919-925.
BROUN, P., M. W. GANAL and S. D. TANKSLEY, 1992 Telomeric arrays display high levels of heritable polymorphism among closely related plant varieties. Proc. Natl. Acad. Sci. USA 89: 1354-1357.
BUCHOLC, M., Y. PARK and A. J. LUSTIG, 2001 Intrachromatid excision of telomeric DNA as a mechanism for telomere size control in Saccharomyces cerevisiae. Mol. Cell. Biol. 21: 6559-6573.
BZYMEK, M., and S. T. LOVETT, 2001 Instability of repetitive DNA sequences: The role of replication in multiple mechanisms. Proc. Natl. Acad. Sci. USA 98: 8319-8325.
CARLSON, M., J. L. CELENZA and F. J. ENG, 1985 Evolution of the dispersed SUC gene family of Saccharomyces by rearrangements of chromosome telomeres. Mol. Cell. Biol. 5: 2894-2902.
COMAI, L., A. P. TYAGI, K. WINTER, R. HOLMES-DAVIS, S. H. REYNOLDS et al., 2000 Phenotypic instability and rapid gene silencing in newly formed Arabidopsis allotetraploids. Plant Cell 12: 1551-1568.
COPENHAVER, G. P., and C. S. PIKAARD, 1996 RFLP and physical mapping with an rDNA-specific endonuclease reveals that nucleolus organizer regions of Arabidopsis thaliana adjoin the telomeres on chromosomes 2 and 4. Plant J. 9: 259-272.
DEVOS, K. M., J. K. BROWN and J. L. BENNETZEN, 2002 Genome size reduction through illegitimate recombination counteracts genome expansion in Arabidopsis. Genome Res. 12: 1075-1079.
DURAISINGH, M. T., T. S. VOSS, A. J. MARTY, M. F. DUFFY, R. T. GOOD et al., 2005 Heterochromatin silencing and locus repositioning linked to regulation of virulence genes in Plasmodium falciparum. Cell 121: 13-24.
EICHLER, E. E., and D. SANKOFF, 2003 Structural dynamics of eukaryotic chromosome evolution. Science 301: 793-797.
ELLEGREN, H., 2004 Microsatellites: simple sequences with complex evolution. Nat. Rev. Genet. 5: 435-445.
EPEL, E. S., E. H. BLACKBURN, J. LIN, F. S. DHABHAR, N. E. ADLER et al., 2004 Accelerated telomere shortening in response to life stress. Proc. Natl. Acad. Sci. USA 101: 17312-17315.
FITZGERALD, M. S., E. V. SHAKIROV, E. E. HOOD, T. D. MCKNIGHT and D. E. SHIPPEN, 2001 Different modes of de novo telomere formation by plant telomerases. Plant J. 26: 77-87.
FLINT, J., G. P. BATES, K. CLARK, A. DORMAN, D. WILLINGHAM et al., 1997 Sequence comparison of human and yeast telomeres identifies structurally distinct subtelomeric domains. Hum. Mol. Genet. 6: 1305-1313.
FORMOSA, T., and B. M. ALBERTS, 1986 DNA synthesis dependent on genetic recombination: characterization of a reaction catalyzed by purified bacteriophage T4 proteins. Cell 47: 793-806.
FREITAS-JUNIOR, L. H., E. BOTTIUS, L. A. PIRRIT, K. W. DEITSCH, C. SCHEIDIG et al., 2000 Frequent ectopic recombination of virulence factor genes in telomeric chromosome clusters of P. falciparum. Nature 407: 1018-1022.
FREITAS-JUNIOR, L. H., R. HERNANDEZ-RIVAS, S. A. RALPH, D. MONTIEL-CONDADO, O. K. RUVALCABA-SALAZAR et al., 2005 Telomeric heterochromatin propagation and histone acetylation control mutually exclusive expression of antigenic variation genes in malaria parasites. Cell 121: 25-36.
GARCIA-CAO, M., R. O'SULLIVAN, A. H. PETERS, T. JENUWEIN and M. A. BLASCO, 2004 Epigenetic regulation of telomere length in mammalian cells by the Suv39h1 and Suv39h2 histone methyltransferases. Nat. Genet. 36: 94-99.
GORBUNOVA, V., and A. A. LEVY, 1997 Non-homologous DNA end joining in plant cells is associated with deletions and filler DNA insertions. Nucleic Acids Res. 25: 4650-4657.
GORBUNOVA, V. V., and A. A. LEVY, 1999 How plants make ends meet: DNA double-strand break repair. Trends Plant Sci. 4: 263-269.
GOTTSCHLING, D. E., O. M. APARICIO, B. L. BILLINGTON and V. A. ZAKIAN, 1990 Position effect at S. cerevisiae telomeres: reversible repression of Pol II transcription. Cell 63: 751-762.
GRAUR, D., and W.-H. LI, 2000 Fundamentals of Molecular Evolution. Sinauer Associates, Sunderland, MA.
HABER, J. E., and P. C. THORBURN, 1984 Healing of broken linear dicentric chromosomes in yeast. Genetics 106: 207-226.