| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Genetics, Vol. 176, 27-41, May 2007, Copyright © 2007
doi:10.1534/genetics.106.069724
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

* Institute of Cell Biology, School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JR, United Kingdom and
Biomathematics and Statistics Scotland, Edinburgh EH9 3JZ, United Kingdom
1 Corresponding author: Institute of Cell Biology, School of Biological Sciences, University of Edinburgh, The King's Buildings, Mayfield Rd., Edinburgh EH9 3JR, United Kingdom.
E-mail: d.leach{at}ed.ac.uk
| ABSTRACT |
|---|
|
|
|---|
, Pol
, PCNA, Fen1/Rad27, Dna2 helicase, and primasehave profound effects on repeat instability (SCHWEITZER and LIVINGSTON 1999; CALLAHAN et al. 2003). On the other hand, studies of instability in mice and in cultured cells have revealed evidence for replication-independent expansion of repeat arrays (TAKANO et al. 1996; HASHIDA et al. 1997; KOVTUN and MCMURRAY 2001). This is consistent with expansion of repeats in human tissues, such as the brain where cells seldom replicate. However, even these replication-independent expansion events are likely to be dependent on DNA synthesis at sites of DNA repair. It is therefore essential to understand how the machinery required to copy DNA interacts with these repetitive sequences. E. coli provides an attractive model system in which to study the basic properties of DNA replication, recombination, and genetic instability because of the detailed understanding of its genetic and biochemical pathways, its rapid growth, and sophisticated biotechnology. Despite these advantages, all previous studies have considered instability on bacterial plasmid-borne repeats apart from one recently reported study on chromosomal trinucleotide repeat instability (KIM et al. 2006). The studies based on plasmid systems have included investigations of the effects of transcription (BOWATER et al. 1997; SCHUMACHER et al. 2001), mismatch repair (JAWORSKI et al. 1995; SCHMIDT et al. 2000), nucleotide excision repair (OUSSATCHEVA et al. 2001), proofreading (IYER et al. 2000), and recombination (JAKUPCIAK and WELLS 1999, 2000a,b; HASHEM et al. 2004). This body of work has revealed many different effects, and contradictory conclusions have sometimes been reached. It is likely that aspects of plasmid biology such as copy number, size, and specialized processing pathways have contributed to the differences observed.
Orientation dependence of TNR instability has previously been seen for E. coli plasmids and in yeast. This has been explained on the basis of lagging-strand replication dynamics coupled to the greater thermodynamic stability of CTG repeat hairpins relative to CAG repeat hairpins (GACY et al. 1995; KANG et al. 1995; ROSCHE et al. 1995; MAURER et al. 1996; FREUDENREICH et al. 1997; MIRET et al. 1998). The model suggests that CTG repeat hairpins formed on the lagging-strand template lead to deletions while CTG repeat hairpins formed on the nascent lagging strand lead to expansions. Since the E. coli system is biased toward deletions and not many expansions are observed, even when CTG repeats are present on the nascent lagging strand, this model is at best an incomplete explanation of orientation dependence of repeat instability in E. coli.
Trinucleotide repeats, when single stranded, can fold into hairpins in vitro (GACY et al. 1995; MITAS et al. 1995a; SMITH et al. 1995; YU et al. 1995a,b; PETRUSKA et al. 1996), which can be attacked by SbcCD (CONNELLY et al. 1999). The SbcCD nuclease of E. coli is the homolog of the Mre11/Rad50 nuclease in eukaryotes (SHARPLES and LEACH 1995). The enzyme has double-strand exonuclease and hairpin endonuclease activities and acts on a variety of substrates, including DNA hairpins (CONNELLY and LEACH 1996; CONNELLY et al. 1997, 1999). It has also been shown to affect the frequency and nature of deletions between 101-bp direct repeats flanking an inverted repeat sequence and can affect the nature of a deletion event between 101-bp direct repeats, even in the absence of an inverted repeat sequence, in a manner consistent with nucleolytic processing of the slippage intermediate (BZYMEK and LOVETT 2001a,b). The observations of CTG and CAG hairpins (GACY et al. 1995; MITAS et al. 1995b; SMITH et al. 1995; YU et al. 1995a; PETRUSKA et al. 1996) and of slipped mispairing structures in vitro (PEARSON et al. 1998; SINDEN et al. 2002) have given rise to the hypothesis that misfolded structures play a role in instability. Preformed slipped mispairing structures are repaired in vitro in human cell extracts to delete, retain, or shorten the looped-out repeat containing strand, demonstrating the presence of activities capable of processing misfolded structures (PANIGRAHI et al. 2005). The hypothesis that secondary structures may be involved in the processing of CAG·CTG repeats in vivo has received some support from observations implicating the SbcCD and Rad50Mre11 complexes in E. coli and yeast. In E. coli, it has been reported that a multiply mutant strain (SURE) shows dramatic expansion of CAG·CTG repeats in a plasmid that is prevented by SbcCD (SARKAR et al. 1998). In yeast, a reduction in CAG repeat expansions associated with double-strand break repair in mre11
strains was suppressed by overexpressing the Rad50Mre11 complex, suggesting that it may cleave hairpin structures (RICHARD et al. 2000). Furthermore, CTG repeat-induced spontaneous double-strand breaks were reduced in a rad50 mutant (FREUDENREICH et al. 1998). Arguments against the involvement of secondary structures come from the inability to drive a structural transition in vitro in supercoiled templates (BACOLLA et al. 1997) and the lack of effect of the sbcCD genotype on CAG·CTG repeat instability in a plasmid system different from that of SARKER et al. (1998; SCHMIDT et al. 2000). Furthermore, the small changes in repeat length observed in plasmid substrates show no bias to even-numbered patterns of deletion products (except as a consequence of mismatch repair, which eliminates +1 and 1 repeat changes; see SCHMIDT et al. 2000) despite the existence of preferred folding patterns comprising even numbers of trinucleotide repeats in vitro (PETRUSKA et al. 1998) and in vivo (DARLOW and LEACH 1995). Relevant to many of these arguments is the observation of an SbcCD effect on strand slippage in the absence of a hairpin-forming substrate (BZYMEK and LOVETT 2001a,b), which weakens any argument for DNA structures based on SbcCD or Rad50/MRE11 effects. It is clear that despite the strong evidence that CAG·CTG repeats can form secondary structures in vitro that are substrates for enzymatic processing, the question of whether secondary structure plays a role in instability in vivo has been more difficult to determine experimentally and the hypothesis remains controversial.
In E. coli, proofreading during DNA replication is performed by the 3'5' exonucleolytic
-subunit of DNA polymerase III, which is encoded by the dnaQ gene. During replication, the proofreading function prevents slipped-strand pairing events that can lead to instability in repeated sequences. A mutation in the proofreading function of DNA polymerase III, dnaQ49ts, was shown to enhance instability of CAG·CTG trinucleotide repeats (IYER et al. 2000). Another mutation, mutD5, along with dnaQ49ts, was shown to enhance instability of tandem repeat sequences (SAVESON and LOVETT 1997; BZYMEK et al. 1999).
Here, we describe a study of trinucleotide repeat instability carried out in the E. coli chromosome. We demonstrate that instability is length and orientation dependent. Longer repeat arrays are more unstable than shorter repeat arrays and CTG repeats on the lagging-strand template are more unstable than on the leading-strand template. Furthermore, for both orientations of CAG·CTG trinucleotide repeat tracts, the distributions of deletion lengths are skewed in a way that is consistent with deletions stimulated by hairpin structures. This is direct evidence that secondary structure plays a role in CAG·CTG repeat instability in vivo. We also demonstrate that mutation of the gene encoding the proofreading subunit of DNA polymerase III (DnaQ) destabilizes CAG·CTG trinucleotide repeat tracts. Furthermore, orientation dependence of instability is lost in the dnaQ mutant, and SbcCD, whose homolog in eukaryotes is Rad50/Mre11, modulates the effect of DnaQ. These data argue that intermediates in the replicative pathway leading to trinucleotide repeat instability are detected and processed by both proofreading and the SbcCD (Rad50/Mre11) complex. Furthermore, we demonstrate that this replicative instability is not caused by transcription or recombination.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Plasmid pLacD2 was derived from pLacD1. pLacD1 had an extra site for BbsI in one lac homology arm, which was removed by changing A to G using primers SDM_BbsI_F [5'-GGG ATA CGA CGA TAC CGA GGA CAG CTC ATG-3' (underlined sequences define the position of the BbsI site and the boldface G is the base that has been changed from A by SDM)] and SDM_BbsI_R (5'-CAT CAG CTG TCC TCG GTA TCG TCG TAT CCC-3') by site-directed mutagenesis, resulting in the plasmid pLacD2. A list of plasmids is provided in Table 1.
Building of long repeat arrays:
Repeat arrays were generated in the plasmid pLacD2, which was further used to integrate the repeats in chromosomes. (CAG)5 and (CTG)5 repeats were introduced between the lac homology arms of pLacD2 by site-directed mutagenesis using primer pairs ExCAG-01 (5'-CTA TGA CCA TGG TCT CGC AGC AGC AGC AGC AGG TCT TCG TCG TTT TAC-3'), ExCTG-01 (5'-GTA AAA CGA CGA AGA CCT GCT GCT GCT GCT GCG AGA CCA TGG TCA TAG-3'), ExCAG-02 (5'-CTA TGA CCA TGG TCT CGC TGC TGC TGC TGC TGG TCT TCG TCG TTT TAC-3'), and ExCTG-02 (5'-GTA AAA CGA CGA AGA CCA GCA GCA GCA GCA GCG AGA CCA TGG TCA TAG-3'), removing the MfeI site. Another unique restriction site, HindIII, was used to perform double digestions. The pLacD2 plasmid containing the repeats was digested by BsaI and HindIII and by BbsI and HindIII, giving two fragments of 2774 and 3702 bp. The fragments containing the repeats were extracted from gels and ligated together to increase the repeat number.
Integration into chromosome:
Repeat sequences were integrated into the chromosome in both CAG and CTG orientations in the 5' part of the lacZ gene using the pKO3 integration strategy (LINK et al. 1997). Figure 1A shows the structure of the CAG-leading (CAG) orientation where CAG repeats are on the leading-strand template and CTG repeats are on the lagging-strand template. Following integration, the presence of the repeat tract and the absence of the lacL8 mutation were checked by PCR and sequencing. A list of bacterial strains is provided in Table 2.
|
|
|
GeneMapper analysis of repeats:
Repeat tracts were amplified using primers Ex-test-F (5'-TTA TGC TTC CGG CTC GTA TG-3') and Ex-test-R (5'-GGC GAT TAA GTT GGG TAA CG-3'). Primer Ex-test-F was labeled with the fluorescence tag 6-Fam (Metabion). A size standard (GeneScan-500 LIZ from ABI) was added to the fluorescent PCR product(s), and fragments were resolved by capillary electrophoresis on a polyacrylamide medium in an ABI 3730 genetic analyzer. The results were analyzed by using GeneMapper software version 4. Characteristic result outputs are shown in Figure 1, CE. Here it can be seen that, in addition to the main peaks characteristic of the repeat array lengths, several "stutter" peaks are observed. These represent deletions and expansions that have arisen during the PCR reaction and not in vivo.
The instability proportion was defined as the proportion of sibling colonies that had a repeat length changed from the parental length. To avoid counting deletions that had arisen on the plates, mixed colonies (containing cells with parental and new lengths) were classified as parental in the instability proportion. However, these lengths were included in the analysis of deletion length distributions. Rare sibling colonies, derived from one parental colony containing the same length of deletion, were counted only once on the assumption that they were sister clones. Rare expansions of the repeat array were detected but have not been included in this analysis. Logistic regression models were fitted to the CAG and CTG arrays separately, using Genstat 8th edition, to compare the instability proportions in the different arrays. Approximate 95% confidence intervals were calculated for each estimated instability proportion as the mean ± 2 x standard error.
| RESULTS |
|---|
|
|
|---|
Both CAG and CTG repeats of length 5 were introduced between BsaI and BbsI restriction sites of plasmid pLacD2 (Figure 1B) by site-directed mutagenesis. This was followed by rounds of DNA restriction and ligation to construct longer repeat lengths. The recognition sites of BsaI and BbsI direct cleavage inside the repeat sequence (Figure 2B), so in every restriction and ligation round, there was a doubling of the repeat array length coupled to the loss of two repeats. This method follows the formula nx = 2nx1 2, where n is the number of repeat units in the repeat tract and x is the round of restriction and ligation.
|
The following strategy was used to measure the instability of the repeat arrays. In every instability assay, 12 parental colonies were taken and grown in the presence and absence of IPTG (2 mM). For each parental colony, 10 sibling colonies were analyzed. PCR was carried out across the repeat array and the length of the repeat tract was determined by running the PCR products on an ABI 3730 genetic analyzer (Applied Biosystems, Foster City, CA), which automatically detects and determines the sizes of DNA fragments based on electrophoretic separation. The data collected from the ABI 3730 Genetic Analyzer were analyzed using GeneMapper software version 4.0. The fragments were visualized as peaks on a graph as displayed in Figure 1.
Instability increases with increasing repeat length and depends on repeat orientation with respect to the direction of replication:
Three different lengths of the two orientations of CAG·CTG repeats were initially studied in wild-type cells. In both orientations, instability was found to be dependent on repeat tract length as the proportion of instability increased with the length of the repeat tract. The instability proportion represents the frequency of sibling colonies that had a repeat length changed (expanded or contracted) from the parental length (see experimental procedures for a more detailed description of the instability proportion). Figure 3A shows the instability proportions of all repeat lengths studied. The instability proportion for (CAG)84 was 31-fold higher than that for (CAG)45. Similarly, (CTG)140 had a 28-fold higher instability proportion than (CTG)48. Notably, the highest repeat length studied in the CAG orientation (CAG)84 had an instability proportion 1.7-fold higher than that for the highest length of the CTG orientation i.e., (CTG)140. It is the orientation where CAG repeats lie on the leading-strand template that is more unstable than the opposite orientation, where CTG repeats lie on the leading-strand template. The same orientation dependence with respect to the direction of replication was observed for CAG·CTG repeats inserted at the
attB site (data not shown).
|
CAG repeat instability is reduced in an sbcCD mutant and CTG repeat instability is reduced in an sbcCD dnaQ double mutant:
The effect of SbcCD was investigated for both CAG and CTG repeat orientations. The instability proportion for the (CAG)75 repeat array was 1.8-fold lower in the sbcCD mutant (P = 0.027). The stabilization observed in an sbcCD mutant was lost in an sbcCD dnaQ double mutant (Figure 3B).
The CTG orientation was too stable at the length studied to obtain sufficient data permitting a statistical distinction between wild-type and sbcCD mutants (Figure 3B). However, an effect of sbcCD could be measured in an sbcCD dnaQ mutant where a significant decrease in the instability proportion was observed relative to that in dnaQ, even though the repeat length was shorter in the dnaQ mutant (P = 0.004). In contrast to the dnaQ mutant, orientation dependence was retained in sbcCD dnaQ (P < 0.001), suggesting that intermediates in the CTG orientation pathway can escape deletion in the absence of SbcCD and proofreading more easily than intermediates in the CAG orientation pathway.
Large deletions predominate over small deletions in sbcCD and sbcCD dnaQ mutants in the CAG orientation and in a dnaQ mutant in both repeat orientations:
To see the sizes of deletions obtained in CAG·CTG repeats in wild-type and mutant cells, all observed deletions were plotted as a function of the percentage of deletion size against the number of events (Figure 4). Both CAG and CTG repeat deletion distributions are negatively skewed as seen from the long tails toward the left in Figure 4. The median (midpoint of the distribution) for (CAG)75 comes at 65% and for (CAG)84 at 70%. The (CTG)95 deletion distribution has a median at 58% while, for (CTG)140, it can be seen at 61%. These distributions suggest that large deletions are more frequent than small deletions, consistent with the existence of intermediates comprising many repeats as would be predicted if large hairpins could form. Given the low frequency of deletions and their origins in populations grown from single cells carrying the original length of repeat array, the vast majority of deletions will have arisen in single events from arrays of parental length. The distributions of deletion lengths in CAG repeat tracts are not influenced substantially by mutations in dnaQ or sbcCD (Figure 4), suggesting that these genes do not affect the nature of the primary intermediate in the pathway but instead the frequency of its processing to a product with a new repeat length. In the CTG repeat orientation, the number of events observed make a comparison most meaningful between the dnaQ and dnaQ sbcCD mutants. Here, the negatively skewed distribution observed in dnaQ (median 63%) disappears in dnaQ sbcCD, giving a flat distribution with a median of 51% (Figure 4). This suggests that, contrary to the CAG orientation, the presence of SbcCD (in the absence of proofreading) favors the formation of large deletions.
|
|
|
|
| DISCUSSION |
|---|
|
|
|---|
CAG·CTG trinucleotide repeats show length- and orientation-dependent instability in the E. coli chromosome:
We show that CAG·CTG repeat instability in the E. coli chromosome increases with increasing repeat tract length and that instability is orientation dependent (Figure 3A). In humans, length dependence of repeat array mutation underlies the phenomenon of dynamic mutation where expanded repeat arrays have an increased probability of further expansion leading to anticipation in the inheritance of disease phenotypes. The orientation in which elevated instability is observed is that where the CTG repeat lies on the template for the lagging strand of the replication fork. This is the same orientation dependence with respect to replication as observed in previous studies using bacterial plasmids and yeast (KANG et al. 1995; ROSCHE et al. 1995; MAURER et al. 1996; FREUDENREICH et al. 1997; MIRET et al. 1998), but differs from a recent study of repeat instability in the E. coli chromosome (KIM et al. 2006) where the instability observed shows variable length and orientation dependence. The authors have used the same assay in both plasmid and chromosomal contexts and observed no length or orientation dependence in the plasmid. The system of KIM et al. (2006) also differs from ours in that it detects only a subset of deletion events that gives rise to chloramphenicol resistance and complete deletion of the repeat array is a common outcome. Since we do not observe complete deletion of the repeat array, it seems that the two assays are detecting different molecular events.
In both E. coli plasmids and yeast, instability is replicative and is strongly biased toward deletions, but in mice instability can occur in nondividing cells (TAKANO et al. 1996; HASHIDA et al. 1997; KOVTUN and MCMURRAY 2001) and can be influenced by the position of the transgenic insert (MONCKTON et al. 1997; SEZNEC et al. 2000). This correlates with the expansion patterns observed in humans (SEZNEC et al. 2000). However, the observation of nonreplicative instability in a mammalian system does not exclude replicative sources of instability or the importance of replicative stability in mammals. Furthermore, the nonreplicative instability itself is likely to involve DNA synthesis. This is particularly clear in the case of nonreplicative expansion where expansion in the absence of accompanying contraction implies net DNA synthesis.
The proofreading subunit of DNA polymerase III (DnaQ) determines orientation dependence of replicative instability in cells with active SbcCD nuclease:
We demonstrate that a mutation in dnaQ destabilizes CAG·CTG repeat arrays and that orientation dependence of instability is lost in this mutant. This is a specific effect of dnaQ mutation since a mutation in mutS does not have a corresponding effect (Figure 3B). Previous studies suggested that the orientation dependence of CAG·CTG repeat instability is caused by the dynamics of lagging-strand DNA synthesis accompanied by the greater thermodynamic stability of CTG repeat hairpins compared to CAG repeat hairpins (MAURER et al. 1996; FREUDENREICH et al. 1997; MIRET et al. 1998). However, it has not previously been shown how this process is mediated. Here, we suggest that proofreading is inefficient on the CTG repeat template of the lagging strand (CAG orientation), leading to orientation-dependent instability. It is the more foldable CTG repeat strand that behaves as if it is more refractory to proofreading. Although we favor the interpretation that the dnaQ mutation destabilizes the repeat tract because of its effect on proofreading, we cannot exclude the possibility that other indirect effects of the dnaQ mutation contribute or are responsible.
Our work is consistent with the previous observation that the dnaQ49ts mutation destabilizes CTG repeats in bacterial plasmids (IYER et al. 2000) but is in divergence with experiments demonstrating that proofreading mutants of DNA polymerases
and
do not destabilize these repeats in yeast (SCHWEITZER and LIVINGSTON 1999). The yeast results are interesting, given that homo and dinucleotide repeats are destabilized in these mutants (STRAND et al. 1993; TRAN et al. 1997), suggesting some particular resistance to proofreading of CAG·CTG triplet repeats by polymerases
and
. In this context, it should be noted that DnaQ has been shown to share sequence homology with human DNA editing enzyme DNase III (HOSS et al. 1999). This enzyme is present in equal amounts in nondividing and proliferating cells, which suggests that it is involved in repair processes as well as in replication. An alternative is that eukaryotic cells might correct trinucleotide repeat slippage by sharing proofreading activities between polymerases (PAVLOV et al. 2006). So, it is plausible that proofreading during replication and repair in human cells may contribute to repeat stability.
A number of studies have documented the stabilizing and destabilizing effects of MutS and its homologs on CAG·CTG repeat instability in E. coli, yeast, and mouse (JAWORSKI et al. 1995; SCHUMACHER et al. 1998; MANLEY et al. 1999; SCHMIDT et al. 2000). In this system, we observe a small destabilizing effect of the mutS mutation in one orientation (CAG on leading-strand template) that lies on the border of significance and no significant effect in the other orientation (CTG on leading-strand template). The absence of a substantial effect of mutS on the frequency of deletion formation is consistent with our observation here that deletions are distributed over a wide range of sizes, while mismatch repair can correct only small insertion/deletion loops of up to three nucleotides.
The SbcCD nuclease increases CAG repeat instability when proofreading is active and CTG repeat instability when proofreading is inactive:
We show that CAG·CTG repeat instability is reduced in an sbcCD mutant when the CTG-containing strand is the template for the lagging strand of the replication fork (CAG orientation). This stabilizing effect of an sbcCD mutation is lost in a dnaQ sbcCD double mutant. This result argues for antagonistic action of SbcCD and the proofreading subunit of DNA polymerase III. It is consistent with the existence of an SbcCD-dependent pathway of deletion formation for the CAG orientation that is significant only in the presence of proofreading. This may be because the action of SbcCD is antagonistic to proofreading through removal of the structure signaling the need to proofread.
In the CTG orientation (CAG on the lagging-strand template), SbcCD plays an active role in stimulating deletions in the absence of proofreading. This is evidenced by the small but significant decrease in instability in an sbcCD dnaQ strain compared to dnaQ and the shift from a skewed distribution of deletion sizes (in dnaQ) to a flat distribution (in sbcCD dnaQ) for the CTG orientation. Furthermore, the observation of orientation dependence in sbcCD dnaQ (but not in dnaQ) implies a role of SbcCD in removing orientation dependence in the absence of proofreading. These observations are consistent with SbcCD having access to an intermediate in the CTG deletion pathway (CAG on the lagging-strand template) in the absence of dnaQ and stimulating its conversion to a deletion product rather than its return to a parental template. It has recently been shown that fluorescently tagged fusions of Bacillus subtilis SbcC localized with a pattern similar to that of the replication factory, consistent with action of SbcCD at the site of DNA replication (MEILE et al. 2006). A similar colocalization of fluorescently tagged SbcC with a replication factory protein has also been observed in E. coli (E. DARMON, personal communication).
CAG·CTG repeat instability in the chromosome is not caused by recombination:
We demonstrate that, at the lengths studied here, CAG·CTG repeat instability in the E. coli chromosome is not affected by mutations in recombination genes recA, recB, and recF. These findings are interesting since recombination has been reported to influence instability of repeats in plasmids (JAKUPCIAK and WELLS 1999, 2000a,b; NAPIERALA et al. 2002; PLUCIENNIK et al. 2002; HASHEM et al. 2004; HEBERT et al. 2004). Further work is required to reconcile these observations.
Transcription does not influence CAG·CTG instability in the E. coli chromosome:
We report another observation that contrasts with several previous plasmid studies. We show here that, at the repeat lengths studied, instability in the chromosome is not affected by transcription. Previously, transcription was reported to affect plasmid-borne CAG·CTG repeat instability (BOWATER et al. 1997; SCHUMACHER et al. 2001; MOCHMANN and WELLS 2004) although one study did not detect such an effect (SCHMIDT et al. 2000). As we observed no effect of transcription, we reasoned that the absence of effect might be caused by the enzymatic removal of stalled transcription complexes before they were able to affect instability. We therefore tested the effect of an mfd mutant on instability. Mfd protein, a transcription-repair coupling factor, ensures the repair of DNA damage in transcribed strands of active genes. It can bind DNA, RNA polymerase, and the UvrA protein, removes RNA polymerase from the DNA, and recruits the excision repair apparatus to the damaged site. It is also required in the removal of stalled transcription complexes (PARK et al. 2002). A mutation in the mfd gene did not significantly change the proportion of instability in the CAG or CTG repeat orientations, suggesting no role of the transcription-repair coupling factor Mfd in repeat instability at the lengths studied. This result lends no support to the hypothesis that stalled transcription complexes influence CAG·CTG repeat instability in E. coli (KIM et al. 2006).
Evidence for the influence of secondary structures on instability in vivo:
Several of our observations lend weight to the hypothesis that secondary structures do influence the instability of CAG·CTG repeat arrays. First, in the CAG orientation (CTG on the lagging-strand template), the distribution of sizes of deletion products is negatively skewed, consistent with a preference for large deletions comprising enough repeats to form stable hairpins. This is independent of any of the genotypes tested here and argues for the formation of hairpins stable enough to influence the spectrum of deletion products irrespective of the presence of SbcCD. Second, we observe an effect of SbcCD on the frequency of instability in the orientation predicted to form the more stable CTG repeat hairpins, and we know that SbcCD is a hairpin nuclease that has been shown to cleave CTG repeat hairpins (CONNELLY et al. 1999). Third, we observe a negatively skewed distribution of deletion products for the CTG orientation (CAG on the lagging-strand template) in the presence of SbcCD and the absence of proofreading, suggesting the existence of less stable secondary structures in this orientation that require the presence of SbcCD to manifest themselves as deletions. Longer repeat arrays give rise to longer deletion products, arguing against the formation of a specific structure composed of a set number of repeats. Instead, it would appear that larger secondary structures are free to form within longer repeat arrays. The strongest evidence for secondary structures comes from the skewed distributions of deletion products as these reflect the nature of the slippage intermediates. Any arguments based on the action of SbcCD as a hairpin nuclease must be moderated by the observation that SbcCD can affect the nature of a deletion event between 101-bp direct repeats, even in the absence of an inverted repeat sequence (BZYMEK and LOVETT 2001a,b).
A model for replicative instability of CAG·CTG repeats:
Combining the results of sbcCD and dnaQ mutations, we propose a model to explain the orientation dependence of CAG·CTG repeat instability during replication (Figure 8). Orientation dependence is determined by proofreading of slippage intermediates formed during replication of the lagging strand. We suggest that the CTG repeat template for the lagging strand is partially refractory to proofreading, leading to elevated frequencies of deletions in wild-type cells. Intermediates in the slippage reaction in this orientation of the repeat array are accessible to the SbcCD nuclease, which can increase instability by digesting the strands that signal the presence of a substrate for proofreading. In the absence of proofreading, SbcCD can no longer affect instability and we suggest that this is because its effect is to divert intermediates from effective proofreading. An alternative possibility is that cleavage with SbcCD is not possible in a proofreading mutant. The latter hypothesis is made less likely with the observation of an effect of SbcCD on instability in a dnaQ mutant when CAG repeats are the template for the lagging strand. In this orientation, we hypothesize that more unstable intermediates are formed and that the effect of SbcCD is to divert them from a proofreading-independent pathway of return to parental length. The model is explained in detail in the legend of Figure 8.
|
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
attB site. In addition, we thank Ewa Okely for technical support, Elise Darmon for critical reading of the manuscript, John Eykelenboom for providing the strain DL1786, and the School of Biological Sciences sequencing service for DNA fragment analysis. R.Z. holds a Ph.D. studentship funded by the Commonwealth Scholarship Commission, UK. The work is supported by a grant from the Medical Research Council to D.R.F.L. | LITERATURE CITED |
|---|
|
|
|---|
BACOLLA, A., R. GELLIBOLIAN, M. SHIMIZU, S. AMIRHAERI, S. KANG et al., 1997 Flexible DNA: genetically unstable CTG.CAG and CGG.CCG from human hereditary neuromuscular disease genes. J. Biol. Chem. 272: 1678316792.
BOWATER, R. P., A. JAWORSKI, J. E. LARSON, P. PARNIEWSKI and R. D. WELLS, 1997 Transcription increases the deletion frequency of long CTG.CAG triplet repeats from plasmids in Escherichia coli. Nucleic Acids Res. 25: 28612868.
BZYMEK, M., and S. T. LOVETT, 2001a Evidence for two mechanisms of palindrome-stimulated deletion in Escherichia coli: single-strand annealing and replication slipped mispairing. Genetics 158: 527540.
BZYMEK, M., and S. T. LOVETT, 2001b Instability of repetitive DNA sequences: the role of replication in multiple mechanisms. Proc. Natl. Acad. Sci. USA 98: 83198325.
BZYMEK, M., C. J. SAVESON, V. V. FESCHENKO and S. T. LOVETT, 1999 Slipped misalignment mechanisms of deletion formation: in vivo susceptibility to nucleases. J. Bacteriol. 181: 477482.
CALLAHAN, J. L., K. J. ANDREWS, V. A. ZAKIAN and C. H. FREUDENREICH, 2003 Mutations in yeast replication proteins that increase CAG/CTG expansions also increase repeat fragility. Mol. Cell. Biol. 23: 78497860.
CONNELLY, J. C., and D. R. LEACH, 1996 The sbcC and sbcD genes of Escherichia coli encode a nuclease involved in palindrome inviability and genetic recombination. Genes Cells 1: 285291.[Abstract]
CONNELLY, J. C., E. S. DE LEAU, E. A. OKELY and D. R. LEACH, 1997 Overexpression, purification, and characterization of the SbcCD protein from Escherichia coli. J. Biol. Chem. 272: 1981919826.
CONNELLY, J. C., E. S. DE LEAU and D. R. LEACH, 1999 DNA cleavage and degradation by the SbcCD protein complex from Escherichia coli. Nucleic Acids Res. 27: 10391046.
CUMMINGS, C. J., and H. Y. ZOGHBI, 2000 Fourteen and counting: unraveling trinucleotide repeat diseases. Hum. Mol. Genet. 9: 909916.
DARLOW, J. M., and D. R. LEACH, 1995 The effects of trinucleotide repeats found in human inherited disorders on palindrome inviability in Escherichia coli suggest hairpin folding preferences in vivo. Genetics 141: 825832.[Abstract]
FREUDENREICH, C. H., J. B. STAVENHAGEN and V. A. ZAKIAN, 1997 Stability of a CTG/CAG trinucleotide repeat in yeast is dependent on its orientation in the genome. Mol. Cell. Biol. 17: 20902098.[Abstract]
FREUDENREICH, C. H., S. M. KANTROW and V. A. ZAKIAN, 1998 Expansion and length-dependent fragility of CTG repeats in yeast. Science 279: 853856.
GACY, A. M., G. GOELLNER, N. JURANIC, S. MACURA and C. T. MCMURRAY, 1995 Trinucleotide repeats that expand in human disease form hairpin structures in vitro. Cell 81: 533540.[CrossRef][Medline]
GRABCZYK, E., and K. USDIN, 1999 Generation of microgram quantities of trinucleotide repeat tracts of defined length, interspersion pattern, and orientation. Anal. Biochem. 267: 241243.[CrossRef][Medline]
HASHEM, V. I., W. A. ROSCHE and R. R. SINDEN, 2004 Genetic recombination destabilizes (CTG)n.(CAG)n repeats in E. coli. Mutat. Res. 554: 95109.[Medline]
HASHIDA, H., J. GOTO, H. KURISAKI, H. MIZUSAWA and I. KANAZAWA, 1997 Brain regional differences in the expansion of a CAG repeat in the spinocerebellar ataxias: dentatorubral-pallidoluysian atrophy, Machado-Joseph disease, and spinocerebellar ataxia type 1. Ann. Neurol. 41: 505511.[CrossRef][Medline]
HEBERT, M. L., L. A. SPITZ and R. D. WELLS, 2004 DNA double-strand breaks induce deletion of CTG.CAG repeats in an orientation-dependent manner in Escherichia coli. J. Mol. Biol. 336: 655672.[CrossRef][Medline]
HOSS, M., P. ROBINS, T. J. NAVEN, D. J. PAPPIN, J. SGOUROS et al., 1999 A human DNA editing enzyme homologous to the Escherichia coli DnaQ/MutD protein. EMBO J. 18: 38683875.[CrossRef][Medline]
IYER, R. R., and R. D. WELLS, 1999 Expansion and deletion of triplet repeat sequences in Escherichia coli occur on the leading strand of DNA replication. J. Biol. Chem. 274: 38653877.
IYER, R. R., A. PLUCIENNIK, W. A. ROSCHE, R. R. SINDEN and R. D. WELLS, 2000 DNA polymerase III proofreading mutants enhance the expansion and deletion of triplet repeat sequences in Escherichia coli. J. Biol. Chem. 275: 21742184.
JAKUPCIAK, J. P., and R. D. WELLS, 1999 Genetic instabilities in (CTG.CAG) repeats occur by recombination. J. Biol. Chem. 274: 2346823479.
JAKUPCIAK, J. P., and R. D. WELLS, 2000a Gene conversion (recombination) mediates expansions of CTG[middle dot]CAG repeats. J. Biol. Chem. 275: 4000340013.
JAKUPCIAK, J. P., and R. D. WELLS, 2000b Genetic instabilities of triplet repeat sequences by recombination. IUBMB Life 50: 355359.[CrossRef][Medline]
JAWORSKI, A., W. A. ROSCHE, R. GELLIBOLIAN, S. KANG, M. SHIMIZU et al., 1995 Mismatch repair in Escherichia coli enhances instability of (CTG)n triplet repeats from human hereditary diseases. Proc. Natl. Acad. Sci. USA 92: 1101911023.
KANG, S., A. JAWORSKI, K. OHSHIMA and R. D. WELLS, 1995 Expansion and deletion of CTG repeats from human disease genes are determined by the direction of replication in E. coli. Nat. Genet. 10: 213218.[Medline]
KAYTOR, M. D., E. N. BURRIGHT, L. A. DUVICK, H. Y. ZOGHBI and H. T. ORR, 1997 Increased trinucleotide repeat instability with advanced maternal age. Hum. Mol. Genet. 6: 21352139.
KIM, S. H., M. J. PYTLOS, W. A. ROSCHE and R. R. SINDEN, 2006 (CAG)*(CTG) repeats associated with neurodegenerative diseases are stable in the Escherichia coli chromosome. J. Biol. Chem. 281: 2795027955.
KOVTUN, I. V., and C. T. MCMURRAY, 2001 Trinucleotide expansion in haploid germ cells by gap repair. Nat. Genet. 27: 407411.[CrossRef][Medline]
KRASILNIKOVA, M. M., and S. M. MIRKIN, 2004 Replication stalling at Friedreich's ataxia (GAA)n repeats in vivo. Mol. Cell. Biol. 24: 22862295.
LINK, A. J., D. PHILLIPS and G. M. CHURCH, 1997 Methods for generating precise deletions and insertions in the genome of wild-type Escherichia coli: application to open reading frame characterization. J. Bacteriol. 179: 62286237.
MANLEY, K., T. L. SHIRLEY, L. FLAHERTY and A. MESSER, 1999 Msh2 deficiency prevents in vivo somatic instability of the CAG repeat in Huntington disease transgenic mice. Nat. Genet. 23: 471473.[CrossRef][Medline]
MAURER, D. J., B. L. O'CALLAGHAN and D. M. LIVINGSTON, 1996 Orientation dependence of trinucleotide CAG repeat instability in Saccharomyces cerevisiae. Mol. Cell. Biol. 16: 66176622.[Abstract]
MCMURRAY, C. T., and I. V. KORTUN, 2003 Repair in haploid male germ cells occurs late in differentiation as chromatin is condensing. Chromosoma 111: 505508.[Medline]
MEILE, J. C., L. J. WU, S. D. EHRLICH, J. ERRINGTON and P. NOIROT, 2006 Systematic localisation of proteins fused to the green fluorescent protein in Bacillus subtilis: identification of new proteins at the DNA replication factory. Proteomics 6: 21352146.[CrossRef][Medline]
MERLIN, C., S. MCATEER and M. MASTERS, 2002 Tools for characterization of Escherichia coli genes of unknown function. J. Bacteriol. 184: 45734581.
MIRET, J. J., L. PESSOA-BRANDAO and R. S. LAHUE, 1998 Orientation-dependent and sequence-specific expansions of CTG/CAG trinucleotide repeats in Saccharomyces cerevisiae. Proc. Natl. Acad. Sci. USA 95: 1243812443.
MITAS, M., A. YU, J. DILL and I. S. HAWORTH, 1995a The trinucleotide repeat sequence d(CGG)15 forms a heat-stable hairpin containing Gsyn.Ganti base pairs. Biochemistry 34: 1280312811.[CrossRef][Medline]
MITAS, M., A. YU, J. DILL, T. J. KAMP, E. J. CHAMBERS et al., 1995b Hairpin properties of single-stranded DNA containing a GC-rich triplet repeat: (CTG)15. Nucleic Acids Res. 23: 10501059.
MOCHMANN, L. H., and R. D. WELLS, 2004 Transcription influences the types of deletion and expansion products in an orientation-dependent manner from GAC*GTC repeats. Nucleic Acids Res. 32: 44694479.
MONCKTON, D. G., M. I. COOLBAUGH, K. T. ASHIZAWA, M. J. SICILIANO and C. T. CASKEY, 1997 Hypermutable myotonic dystrophy CTG repeats in transgenic mice. Nat. Genet. 15: 193196.[CrossRef][Medline]
NAPIERALA, M., P. PARNIEWSKI, A. PLUCIENNIK and R. D. WELLS, 2002 Long CTG.CAG repeat sequences markedly stimulate intramolecular recombination. J. Biol. Chem. 277: 3408734100.
OUSSATCHEVA, E. A., V. I. HASHEM, Y. ZOU, R. R. SINDEN and V. N. POTAMAN, 2001 Involvement of the nucleotide excision repair protein UvrA in instability of CAG*CTG repeat sequences in Escherichia coli. J. Biol. Chem. 276: 3087830884.
PANIGRAHI, G. B., R. LAU, S. E. MONTGOMERY, M. R. LEONARD and C. E. PEARSON, 2005 Slipped (CTG)*(CAG) repeats can be correctly repaired, escape repair or undergo error-prone repair. Nat. Struct. Mol. Biol. 12: 654662.[CrossRef][Medline]
PARK, J. S., M. T. MARR and J. W. ROBERTS, 2002 E. coli transcription repair coupling factor (Mfd protein) rescues arrested complexes by promoting forward translocation. Cell 109: 757767.[CrossRef][Medline]
PAVLOV, Y. I., C. FRAHM, N. MCELHINNY, A. NIIMI, M. SUZUKI et al., 2006 Evidence that errors made by DNA polymerase alpha are corrected by DNA polymerase delta. Curr. Biol. 16: 202207.[CrossRef][Medline]
PEARSON, C. E., Y. H. WANG, J. D. GRIFFITH and R. R. SINDEN, 1998 Structural analysis of slipped-strand DNA (S-DNA) formed in (CTG)n. (CAG)n repeats from the myotonic dystrophy locus. Nucleic Acids Res. 26: 816823.
PELLETIER, R., B. T. FARRELL, J. J. MIRET and R. S. LAHUE, 2005 Mechanistic features of CAG*CTG repeat contractions in cultured cells revealed by a novel genetic assay. Nucleic Acids Res. 33: 56675676.
PETRUSKA, J., N. ARNHEIM and M. F. GOODMAN, 1996 Stability of intrastrand hairpin structures formed by the CAG/CTG class of DNA triplet repeats associated with neurological diseases. Nucleic Acids Res. 24: 19921998.
PETRUSKA, J., M. J. HARTENSTINE and M. F. GOODMAN, 1998 Analysis of strand slippage in DNA polymerase expansions of CAG/CTG triplet repeats associated with neurodegenerative disease. J. Biol. Chem. 273: 52045210.
PLUCIENNIK, A., R. R. IYER, M. NAPIERALA, J. E. LARSON, M. FILUTOWICZ et al., 2002 Long CTG.CAG repeats from myotonic dystrophy are preferred sites for intermolecular recombination. J. Biol. Chem. 277: 3407434086.
RICHARD, G. F., G. M. GOELLNER, C. T. MCMURRAY and J. E. HABER, 2000 Recombination-induced CAG trinucleotide repeat expansions in yeast involve the MRE11RAD50XRS2 complex. EMBO J. 19: 23812390.