The Tol2 element is a naturally occurring active transposable element found in vertebrate genomes. The Tol2 transposon system has been shown to be active from fish to mammals and considered to be a useful gene transfer vector in vertebrates. However, cis-sequences essential for transposition have not been characterized. Here we report the characterization of the minimal cis-sequence of the Tol2 element. We constructed Tol2 vectors containing various lengths of DNA from both the left (5′) and the right (3′) ends and tested their transpositional activities both by the transient excision assay using zebrafish embryos and by analyzing chromosomal transposition in the zebrafish germ lineage. We demonstrated that Tol2 vectors with 200 bp from the left end and 150 bp from the right end were capable of transposition without reducing the transpositional efficiency and found that these sequences, including the terminal inverted repeats (TIRs) and the subterminal regions, are sufficient and required for transposition. The left and right ends were not interchangeable. The Tol2 vector carrying an insert of >11 kb could transpose, but a certain length of spacer, <276 but >18 bp, between the left and right ends was necessary for excision. Furthermore, we found that a 5-bp sequence, 5′-(A/G)AGTA-3′, is repeated 33 times in the essential subterminal region. Mutations in the repeat sequence at 13 different sites in the subterminal region, as well as mutations in TIRs, severely reduced the excision activity, indicating that they play important roles in transposition. The identification of the minimal cis-sequence of the Tol2 element and the construction of mini-Tol2 vectors will facilitate development of useful transposon tools in vertebrates. Also, our study established a basis for further biochemical and molecular biological studies for understanding roles of the repetitive sequence in the subterminal region in transposition.
TRANSPOSONS are genetic elements that move from one locus to another locus and reside in the genome as repetitive sequences. These transposon sequences are grouped into two classes: (1) autonomous members, such as Ac in maize (Fedoroff et al. 1983; Müller-Neumann et al. 1984; Pohlman et al. 1984), that encode a functional transposase and can transpose by itself; (2) nonautonomous members, such as Ds in maize (Döring et al. 1984; Sutton et al. 1984), that have mutations or deletions in the transposase gene, but retain essential cis-sequences, and can transpose in the presence of the transposase activity. The Tol2 element was found from the genome of a small teleost, the medaka fish. Tol2 belongs to the hAT family of transposons (Koga et al. 1996). An autonomous member of the Tol2 element was identified (Kawakami et al. 1998). The autonomous Tol2 contains a gene encoding a fully functional transposase that is capable of catalyzing transposition (Kawakami and Shima 1999; Kawakami et al. 2000). Thus, to date, Tol2 is the only active autonomous transposable element found in a vertebrate genome.
We have identified mRNA transcribed from the transposase gene, cloned its cDNA (Kawakami and Shima 1999), and developed two-component transposition systems in vertebrate animals. In zebrafish, the transposition system is composed of a transposon-donor plasmid containing a nonautonomous Tol2 construct and mRNA synthesized in vitro by using the transposase cDNA as a template (Kawakami et al. 2000, 2004b). The nonautonomous Tol2 construct cannot transpose by itself since part of the transposase coding region is replaced by a foreign gene, but can transpose when the transposase activity is supplied in trans. These are introduced into zebrafish cells by micro-injection into fertilized eggs. Then, in developing embryos, the transposase protein produced from the mRNA catalyzes excision of the Tol2 construct from the plasmid and integration of the excised construct into the chromosomal DNA. Since transposition can occur in the zebrafish germ lineage very efficiently, the Tol2 transposon system has been successfully applied to transgenesis and gene trapping (Kawakami et al. 2000, 2004b; Kawakami 2005). In mouse, a transposon-donor plasmid containing the Tol2 construct with the neomycin resistance gene and the helper plasmid containing the transposase cDNA placed under the control of a strong promoter are introduced in embryonic stem (ES) cells by electroporation. In the ES cells, the transposase produced from the helper plasmid catalyzes transposition of the Tol2 construct from the plasmid to the mouse genome (Kawakami and Noda 2004). The transposition frequency was increased as the transposase activity was increased (Kawakami and Noda 2004), suggesting that the Tol2 transposon system may not suffer from a phenomenon termed “overproduction inhibition,” which has been observed in the case of the Sleeping Beauty transposon system (Miskey et al. 2005). In Xenopus, the transposon-donor plasmid and the transposase mRNA are injected into fertilized eggs. The transposase produced in the injected embryos catalyzes excision of the transposon construct from the plasmid (Kawakami et al. 2004a). Recently, we have shown that the excised transposon was integrated in the Xenopus genome and transmitted to the next generation through the germ lineage (Johnson Hamlet et al. 2006). Thus, Tol2 can be used as a gene transfer vector in vertebrates. The current Tol2 vectors contain ∼2.8 kb of DNA from the original Tol2 element (Kawakami and Noda 2004; Kawakami et al. 2004b). To increase the usefulness of Tol2 as a genetic tool, it is important to remove the unnecessary part from the vector. First, manipulation of the transposon vector will become easier. Second, possible effects of the Tol2 sequence on expression of a foreign gene cloned in the transposon vector and/or on expression of chromosomal genes located near the transposon integration site will be minimized. To this end, it is important to analyze cis-sequences that are sufficient and required for transposition.
cis-Elements of other transposons belonging to the hAT family have been studied. In the case of Ac from maize, 238 bp from the 5′-end and 209 bp from the 3′-end are sufficient and required for transposition (Coupland et al. 1989). These subterminal sequences contain 13 AAACGG sequences (Chatterjee and Starlinger 1995), which are the binding motif for the Ac transposase (Kunze and Starlinger 1989). While mutations in some of these motifs reduced or abolished the cis-activity required for excision (Chatterjee and Starlinger 1995), some binding motifs were dispensable, and roles of these sequences in transposition have not been elucidated (Chatterjee and Starlinger 1995). In the case of Tag1 from Arabidopsis, 98 bp from the 5′-end and 109 bp from the 3′ end are sufficient for transposition. Tag1 is unique since the 5′ and 3′ subterminal sequences contain repeats of different sequences (Liu et al. 2001). Mutagenesis of these repetitive sequences, however, has not been carried out for Tag1. Thus, although there have been evidences that the subterminal regions are important for transposition of transposons of the hAT family, their roles in transposition have not been well understood. Therefore, it is important to know the structure–function relationship in the subterminal region of Tol2, a vertebrate member of the hAT family.
In this study, we aimed to characterize cis-elements of the Tol2 transposable element essential for transposition. First, to determine minimal sequences sufficient and required for transposition, Tol2 vectors carrying various sizes of the Tol2 end sequences were created and tested for their activities both in excision in zebrafish embryos and in transposition in the zebrafish germ lineage. Second, to further characterize sequences required for transposition, base-substitution mutations were introduced in the terminal inverted repeats (TIRs) and in the subterminal regions and their effects on the excision activity were analyzed. These studies led to identification of the minimal cis-sequence of Tol2 and the functional repetitive sequence in the subterminal region.
MATERIALS AND METHODS
The structures of the Tol2 constructs created and used in this study are shown in Figure 1A. To construct pT2KXIGΔin, pT2KXIG (Kawakami et al. 2004b) was digested with NruI, ligated to a BglII linker and then digested with BglII and self-ligated. To construct T2AL200R200G (accession no. AB262448), T2AL175R200G (AB262449), T2AL150R200G (AB262450), T2AL200R175G (AB262451), T2AL200R150G (AB262452), T2AL200R100G (AB262453), and T2AL50R50G (AB262454), DNA fragments containing various lengths of the left and right end sequences plus the backbone plasmid DNA were amplified by PCR and ligated with the GFP expression cassette, which contains the Xenopus EF1α enhancer–promoter, the rabbit β-globin intron, the EGFP gene, and the SV40 poly(A) signal at XhoI and BglII sites. The structures of all of these plasmids were confirmed by DNA sequencing. T2AL200L200G (AB262455) and T2AR200R200G (AB262456) were constructed by replacing the right end and the left end of T2AL200R200G with the PCR-amplified BglII–KpnI fragment of the left end and the PCR-amplified SpeI–XhoI fragment of the right end, respectively. The primers used for the PCR amplification were designed to retain the 8-bp target duplication sequence. The structures of the plasmids were confirmed by DNA sequencing. T2AL200R150-1139, T2AL200R150-502, T2AL200r150-276, and T2AL200R150-18 were constructed by deleting, respectively, the ApaI–ApaI, BglII–HindIII, ClaI–XhoI, and BglII–XhoI fragmentsfrom T2AL200R150G. The structures of Tol2 constructs (T2AL200R150GmL series and T2AL200R150GmR series) carrying substitution mutations in the left- and right-end sequences, respectively, are shown in Figure 4 and Table 4. To construct these plasmids, the backbone DNA fragments were amplified by PCR using pT2AL200R150G as a template and primers containing an MluI site at the 5′-end. The amplified DNA was digested with MluI and self-ligated. The structures of the plasmids were confirmed by DNA sequencing.
Micro-injection and excision assay:
Transposase mRNA was synthesized as described previously (Kawakami 2004; Kawakami et al. 2004b). Approximately 1 nanoliter of a DNA–RNA solution containing 25 ng/μl circular DNA of a transposon-donor plasmid and 25 ng/μl transposase mRNA were injected into fertilized eggs. Approximately 10 hr after the injection, DNA samples were prepared from the injected embryos and the transient excision assay was performed as described (Kawakami and Shima 1999; Kawakami 2004) with some modifications. In the experiments described in Figure 1 and Figure 5, the excision products were amplified by using Takara Ex Taq (Takara Bio, Shiga, Japan) and the primers T2AexL (5′-CGC AAT TAA CCC TCA CTA AAG G-3′) and T2AexR (5′-ACC CAA CTG ATC TTC AGC ATC T-3′). In the experiments described in Figure 3, the DNA samples were digested with ScaI prior to PCR to cut five ScaI sites in the transposon constructs on the plasmid DNA, which sometimes hampered the amplification of the excision products. PCR was carried out by using the Expand high fidelity PCR system (Roche Diagnostics GmbH, Mannheim, Germany) and the primers T2AexL-2 (5′-ACC CTC ACT AAA GGG AAC AAA AG-3′) and T2AexR-2 (5′-GTG CGG GCC TCT TCG CTA TTA C-3′).
GFP expression in embryos:
GFP expression in embryos was analyzed by using a fluorescence stereomicroscope MZ16 FA (Leica Microsystems GmbH, Wetzlar, Germany), and photos were taken by using a digital color camera DFC300 FX (Leica Microsystems GmbH).
Southern blot hybridization, inverse PCR, and adaptor ligation PCR:
Southern blot hybridization was performed as described previously (Kawakami 2004; Kawakami et al. 2004b). The genomic DNA samples were digested either with BglII (Figure 2) or with NcoI (Figure 3), which gave rise to one hybridization band from one insertion. The junction fragments containing the Tol2 ends and the genomic DNA were cloned either by inverse PCR (Kawakami 2004; Kawakami et al. 2004b) or by adaptor ligation PCR (Kotani et al. 2006). The adaptor ligation PCR method described previously (Siebert et al. 1995; Spertini et al. 1999) was applied to our system as follows. The genomic DNA from transgenic fish is digested with MboI, BglII, BamHI, SpeI, XbaI, or NheI. MboI, BglII, and BamHI generate a 5′-GATC-3′ cohesive end, and SpeI, XbaI, and NheI generate a 5′-CTAG-3′ cohesive end. The adaptors are prepared by annealing AL (5′-CTA ATA CGA CTC ACT ATA GGG CTC GAG CGG CCG CGG GGG CAG GT-3′) with phosphorylated GATC-AS (5′-pGAT CAC CTG CCC CCG CTT-3′) or phosphoylated CTAG-AS (5′-pCTA GAC CTG CCC CCG CTT-3′) at 95° for 1 min. These adaptors are ligated to the digested genomic DNA with appropriate cohesive ends. The ligated samples are diluted to 10-fold and used for PCR. To clone 5′ junctions, the first PCR is performed by using the primers Ap1 (5′-GGA TCC TAA TAC GAC TCA CTA TAG GG-3′) and 175L-out (5′-TTT TTG ACT GTA AAT AAA ATT G-3′), and the second PCR is performed by using the primers Ap2 (5′-CAC TAT AGG GCT CGA GCG G-3′) and 150L-out (5′-GAG TAA AAA GTA CTT TTT TTT CT-3′). To clone 3′ junctions, the first PCR is performed by using the primers Ap1 and 175R-out (5′-TTC TTG CTT TTA CTT TTA CTT CC-3′), and the second PCR is performed by using the primers Ap2 and 150R-out (5′-AAT ACT CAA GTA CAA TTT TA-3′). The PCR products for the 5′ and 3′ junctions are gel extracted and sequenced by using 100L-out (5′-AGT ATT GAT TTT TAA TTG TA-3′) and 100R-out (5′-AGA TTC TAG CCA GAT ACT-3′), respectively.
The minimal sequence sufficient and required for excision:
To characterize the cis-sequence essential for transposition of the Tol2 transposable element, we created Tol2 constructs that contained various lengths of DNA from the 5′-and 3′-ends (Figure 1A). The 5′- and 3′-ends were designated with respect to the orientation of the transposase gene (Kawakami and Shima 1999) and are also referred to as left (L) and right (R), respectively, in this article (Figure 1A). The activities of these transposon constructs were analyzed by the transient excision assay using zebrafish embryos (Figure 1, A and B). First, the excision product was detected from all embryos injected with the transposase mRNA and a transposon-donor plasmid containing T2KXIGΔin (100%: 32/32), which carried 517 bp of DNA from the 5′-end and 536 bp of DNA from the 3′-end. It should be noted that a Tol2 construct containing the end sequences of similar sizes was not excised in the initial excision assay (Kawakami and Shima 1999). We have found that the transposase activity in the initial condition was much lower than that in the present condition (Kawakami et al. 2000, 2004b). We hypothesize that there may be sequences that are essential for excision under lower transposase activities, but are dispensable for excision under higher transposase activities. While such sequences remain to be determined, by the transient excision assay and using the present condition, we further analyzed a smaller Tol2 construct to determine minimal cis-sequences. The T2AL200R200G construct, containing 200 bp of DNA from both ends, fully retained the excision activity (100%: 61/61). To determine the minimal DNA sequence from the left end, T2AL175R200G and T2AL150R200G were constructed and analyzed. The excision activity was reduced in T2AL175R200G (32%: 10/31) and abolished nearly completely in T2AL150R200G (<3%: 0/29), indicating that 200 bp of DNA from the left end is sufficient and >175 bp of DNA is required for the full excision activity. Then, to determine the minimal DNA sequence from the right end, T2AL200R175G, T2AL200R150G, and T2AL200R100G were constructed. T2AL200R175G and T2AL200R150G retained the excision activity (34/34 and 55/55, respectively). However, in T2AL200R100G the excision activity was drastically decreased (<3%: 0/32), indicating that 150 bp of DNA from the right end is sufficient and >100 bp of DNA is required for the excision activity.
The left and right ends are not interchangeable:
To determine whether the left and right ends of the Tol2 element are functionally equivalent, we constructed T2AL200L200G containing the 200-bp left-end sequence at both ends and T2AR200R200G containing the 200-bp right-end sequence at both ends and analyzed their activities by the transient excision assay (Figure 1A). Excision products were detected from neither of these plasmids (0/32 and 0/30, respectively), indicating that the left and right ends of Tol2 are functionally different and not interchangeable.
Germline transmission using mini-Tol2 vectors:
We have developed a highly efficient transgenesis method in zebrafish using the Tol2 transposon system (Kawakami et al. 2004b; Kawakami 2005). To determine whether the smaller “mini-Tol2” constructs are active not only for excision but also for integration in new loci, and whether they can transpose in the zebrafish germ lineage as efficiently as the authentic vector, we injected transposase mRNA and transposon-donor plasmids containing T2KXIGΔin, T2AL200R200G, T2AL200R150G, and T2AL50R50G into fertilized eggs. T2AL50R50G was used as a negative control since no excision products were detected from embryos injected with this construct (Figure 1A). The injected fish were raised to adulthood and crossed with uninjected wild-type fish, and F1 embryos were analyzed for GFP expression under a fluorescent microscope. In the progeny from 7 of 11 fish (63%) injected with pT2KXIGΔin, 7 of 10 fish (70%) injected with pT2AL200R200G, and 6 of 10 fish (60%) injected with pT2AL200R150G, embryos expressing GFP were identified (Figure 2, A, D, and G), indicating that the injected DNA was integrated into the genome and transmitted to the F1 generation very efficiently (Table 1). None of the F1 embryos from 10 fish injected with pT2AL50R50G expressed GFP, indicating that the frequency of integration of this construct in the genome was considerably low (Table 1). The injected fish that transmit the transposon insertions to the F1 generation are hereafter referred to as founder fish. These results are consistent with the results obtained from the transient excision assay using the same transposon constructs. Thus, 200 bp of DNA from the left end and 150 bp of DNA from the right end are sufficient for highly efficient transposition in the zebrafish germ lineage.
We then characterized the chromosomal insertions of the transposon constructs. First, the GFP-positive F1 embryos were raised to adulthood and the genomic DNA prepared from their tail fins were analyzed by Southern blot hybridization. In total, 68 transgenic F1 fish from seven founder fish injected with T2KXIGΔin (Figure 2, B and C), 84 transgenic F1 fish from seven founder fish injected with T2AL200R200G (Figure 2, E and F), and 72 transgenic F1 fish from six founder fish injected with T2AL200R150G (Figure 2, H and I) were subjected to Southern blot analysis. The germline of these founder fish was highly mosaic, and GFP-positive F1 fish contained different numbers of insertions in the genome, from 1 to >10 (Figure 2, B, C, E, F, H, and I). The total number of insertions transmitted by a single founder fish also varied, from 1 to 15 (Table 2). The number of bands detected by Southern blot analysis was counted and summarized in Table 2. The average number of transposon insertions transmitted by one founder fish was 6.9, 6.9, and 5.5 for T2KXIGΔin, T2AL200R200G, and T2AL200R150G, respectively (Table 1). Transgenic fish carrying a single copy of the T2KXIGΔin, T2AL200R200G, or T2AL200R150G insertion, which was identified by Southern blot analysis, expressed GFP in a quite similar manner (Figure 2, A, D, and G), indicating that those vectors have a similar ability to express a foreign gene in transgenic fish. Second, to determine whether these insertions were created by transposition, we established more transgenic fish lines with single insertions, cloned junction fragments from those fish by the inverse PCR and adaptor ligation PCR methods, and analyzed them by DNA sequencing. Both 5′ and 3′ flanking genomic sequences were identified for 4 T2KXIGΔin insertions, 6 T2AL200R200G insertions, and 6 T2AL200R150G insertions (Table 3). Duplication of the 8-bp target sequence, which is characteristic of the integration site of transposons of the hAT family, was found in all of these 16 insertions, indicating that these insertions were indeed created by transposition. The genomic sequences surrounding the insertions were analyzed by BLAST search against the ensembl database Zv6. Of the 16 insertions, 11 were mapped within either an intron or an exon of known and predicted genes (Table 3).
Spacing capacity and requirements:
To test whether the transposon vector can carry a large DNA fragment, we replaced the EF1α promoter sequence of T2KXIGΔin with an ∼9-kb fragment of the zebrafish genomic DNA. The total length between the left and right ends of the resulting Tol2 construct became ∼11.7 kb (T2KIGΔin-9k; Figure 3). Transposase mRNA and a plasmid DNA containing this construct were injected into fertilized eggs and the germline transmission frequency was analyzed. Five of six injected fish (83%) transmitted the transposon insertion to the F1 generation (Figure 3A). We identified transgenic fish carrying a single insertion by Southern blot hybridization (Figure 3B) and tested whether any structural changes, such as deletions or rearrangements, occurred with integration of the large transposon construct. We performed PCR using two sets of primers (Figure 3A: f1 and r1; f2 and r2) within the construct and compared the sizes of DNA bands amplified from the genomic DNA of the transgenic fish with those amplified from the plasmid DNA used for injection. No significant difference was detected (Figure 3C). Further, we cloned the junction fragments and identified 8-bp direct repeats at both ends of the insertion (Figure 3D), indicating that the insertion was indeed created by transposition. The insertion was mapped within an intron of a known gene (zgc:65966). From these results, we concluded that the Tol2 transposon construct of 11.7 kb can transpose without reducing its transpositional activity and without causing anomalies in its structure.
We also tested whether the distance between the left and right ends is important for transposition. We showed that T2AL200R150G, which is 2552 bp in length, could transpose efficiently (Figures 1 and 2, Table 1). We created T2AL200R150-based constructs, which contained various lengths of DNA inserts between the 200-bp left end and the 150-bp right end and analyzed their activities either by germline transmission or by the transient excision assay. Of 20 fish injected with the transposase mRNA and a transposon-donor plasmid containing T2AL200R150-1139, which carried a 1139-bp DNA insert between the left and right ends, 14 fish transmitted transposon insertions to the F1 generation, and the excision products were detected from 97% (31/32) of embryos injected with the transposase mRNA and a transposon-donor plasmid containing T2AL200R150-502, which contained a 502-bp DNA insert (Figure 3A). These frequencies were comparable to those of T2AL200R150G (Figure 1A). However, the excision activity was slightly decreased in the excision assay using T2AL200R150-276 (75%: 24/32), which contained a 276-bp DNA insert and was not detected in the experiment using T2AL200R150-18 (<3%: 0/32), which contained a DNA insert of only 18 bp (Figure 3A), indicating that a certain length of a spacer between the left and the right ends is required for transposition.
Identification of a repetitive sequence in the subterminal region:
In this study, we found that 200 bp of DNA from the 5′-end and 150 bp of DNA from the 3′-end are sufficient and required for efficient transposition of Tol2. Tol2 has perfect TIRs of 12 bp at both ends. Which sequence(s) in these regions is indeed required for transposition? To answer the question, we first created the mTIRL1-6 and mTIRR1-6 mutations in the left and right TIR, respectively (Table 4). The mutations in TIRs resulted in a loss of the excision activity (<3%; Figure 4), indicating that these sequences are essential for transposition. Then we carefully analyzed the sequence of the subterminal region and found that 5-bp sequences, 5′-AAGTA-3′ and 5′-GAGTA-3′, are repeated 24 and 9 times, respectively, in these regions. We tentatively considered 5′-(A/G)AGTA-3′ to be a consensus sequence. The 200-bp left- and the 150-bp right-subterminal region contain 17 and 16 copies of this repeat, respectively (Figure 5). Overlapping of the repeat in an opposite direction creates ScaI sites (5′-AGTACT-3′). Indeed, five ScaI sites are found in the subterminal regions.
To determine whether the repeats in the subterminal regions are functionally important, we created mutations in these sequences by “MluI mutagenesis.” In the mL13-16, mL26-30, mL48-51, mL66-70, mL106-108, mL136-148, and mL169-181 mutations, wild-type sequences containing one, two, or three repeats at seven different sites in the left subterminal region were substituted for mutant sequences containing one MluI site (Table 4). Also, in the mR15-18, mR27-31, mR48-51, mR63-67, mR103-108, and mR114-126 mutations, wild-type sequences containing one, two, or three repeats at seven different sites in the right subterminal region were substituted for mutant sequences containing one MluI site (Table 4). The activities of the T2AL200R150G constructs carrying these mutations were analyzed by the transient excision assay. The excision products were detected in 3% (1/31), 3% (1/30), 3% (1/32), 17% (5/30), <3% (0/31), 3% (1/31), 6% (2/34), 25% (8/32), 7% (2/30), 22% (7/32), 3% (1/32), 13% (4/32), and 13% (4/30) of embryos injected with the T2AL200R150G construct carrying the mL13-16, mL26-30, mL48-51, mL66-70, mL106-108, mL136-148, mL169-181, mR15-18, mR27-31, mR48-51, mR63-67, mR103-108, and mR114-126 mutations, respectively (Figure 4). Although the transient excision assay is based on micro-injection and PCR techniques, which are not easy to be controlled quantitatively, and factors that make the difference between 25% (8/32) and 3% (1/30) are unknown, it is obvious that all of these mutations drastically decreased the excision activity in comparison with the wild-type T2AL200R150G. One may argue that any substitutions created by the MluI mutagenesis in the subterminal region may cause such decreases in the excision frequency. To exclude this possibility, we created three mutations, mL88-92, mL158-162, and mR88-94, in which sequences outside of the repeats were substituted for one MluI site (Table 4), and analyzed their activities. The Tol2 constructs carrying these mutations, mL88-92, mL158-162, and mR88-94, exhibited the excision activities of 94% (30/32), 81% (25/31), and 87% (26/30), respectively (Figure 4), indicating that these mutant constructs retained the excision activity comparable to that of T2AL200R150G. From these results, we concluded that the repeats that we identified in the subterminal region should play important roles in transposition.
Minimal cis-sequences of Tol2:
In this study, we demonstrated that 200 bp of DNA from the left end and 150 bp of DNA from the right end are sufficient for excision, the first step of transposition. Although the lengths may be shortened slightly, most of those sequences are required for excision since T2AL175R200G, carrying 175-bp of DNA from the left end, showed a reduced excision activity, and T2AL200R100G, carrying 100 bp of DNA from the right end, could not be excised. The previous Tol2 construct with the GFP expression cassette T2KXIG contains 2229 bp from the left and 536 bp from the right end (Kawakami et al. 2004b). The 2229-bp left-side sequence contains ∼300-bp inverted repeats that may form a large secondary structure (Izsvák et al. 1999). We have been sometimes told by others that they encountered difficulties in manipulating the previous transposon vector. We infer that this may have happened because of the large secondary structure; i.e., a decreased copy number of the plasmid in Escherichia coli or reduced E. coli transformation efficiencies, etc. Such a problem will not happen when the smaller Tol2 vectors such as T2KXIGΔin and T2AL200R150, which lack the inverted repeats, are used. Furthermore, Tol2 contains the promoter and the poly(A) signal for the transposase gene near the 5′-end and the 3′-end, respectively (Kawakami and Shima 1999). The minimal Tol2 vector T2AL200R150 also lacks these regulatory elements. Therefore, it can be expected that the minimal vector should have minimal effects both on expression of a foreign gene cloned in the vector and on expression of chromosomal genes located near the integration site of the vector. Thus, identification of minimal cis-sequences should increase the usefulness of Tol2 as a gene transfer vector.
Transposition of mini-Tol2 vectors:
The Tol2 transposon system can create chromosomal integration in the zebrafish germ lineage very efficiently (Kawakami et al. 2004b; Kawakami 2005). Two aspects of the transpositional activities can be examined. One is the frequency of obtaining germline-transmitting founder fish and the other is the number of different transposon insertions transmitted by a single founder fish. First, in our previous study, 50% (5/10) of the fish injected with the transposase mRNA and a transposon-donor plasmid containing T2KXIG could transmit transposon insertions to the next generation (Kawakami et al. 2004b). In this study, we demonstrated that 64, 70, and 60% of fish injected with the transposase mRNA and transposon-donor plasmids containing T2KXIGΔin, T2AL200R200G, and T2AL200R150G, respectively, transmitted transposon insertions to their offspring (Table 1). Thus, the germline transmission frequencies using these mini-Tol2 vectors are comparable to, or even higher than, that observed in transgenesis using T2KXIG. Second, in the previous study, the number of insertions transmitted by a single founder fish injected with T2KXIG was 5.6 on average (Kawakami et al. 2004b). In this study, the average number of insertions transmitted by fish injected with T2KXIGΔin, T2AL200R200G, and T2AL200R150G were 6.9, 6.9, and 5.5, respectively (Table 1). Although the number with T2AL200R150G was slightly lower than those with the other mini-Tol2 constructs, the numbers of insertions transmitted by fish injected with T2KXIGΔin, T2AL200R200G, and T2AL200R150G are still comparable to that with T2KXIG. Therefore, we concluded that 200 bp from the left end and 150 bp from the right end contain cis-sequences sufficient not only for excision but also for efficient transposition.
In this study, we characterized a total of 17 integration sites of Tol2 at the sequence level. The 8-bp duplication of the target site in these 17 cases was created as has been described (Kawakami et al. 2000, 2004b). Although we did not find any obvious consensus sequences at the integration sites, we found that 12 of 17 target loci (71%) were mapped within either an intron or an exon of known or predicted genes. Although further analyses are needed to determine whether all of these predicted genes are indeed transcribed, this result may suggest that Tol2 is preferentially integrated within transcriptional units.
Insert-size capacity and requirements:
We demonstrated that the Tol2 construct of 11.7 kb could transpose without reducing the germline transmission frequency in zebrafish. This construct contained a DNA insert of 10.6 kb, including the zebrafish genomic DNA, the rabbit β-globin intron, the EGFP gene, and the poly(A) signal, between the 517-bp left and 536-bp right ends of the Tol2 sequence. Therefore, if the mini-Tol2, T2AL200R150, was used as a vector, one can clone ∼11.3-kb of DNA into the vector and perform transgenesis with high efficiencies in zebrafish and presumably in other vertebrates. This may be a great advantage for Tol2 to be used as a gene transfer vector in comparison to transposons belonging to the Tc1/mariner family since, in the case of the Tc1/mariner-type transposon Himar1, an ∼38% decrease in the transposition frequency has been observed for each 1-kb increase in transposon size within a range of 2–8 kb (Lampe et al. 1998). The transgenic fish carrying the T2KIGΔin-9k insertion in the genome expressed GFP in the kidney as expected from the nature of the zebrafish genomic DNA placed upstream of the EGFP gene (K. Hoshijima, personal communication), indicating that a large insert can be put in the transposon vector to achieve tissue- or organ-specific expression of a gene of interest.
We found that the T2AL200R150 vector carrying 18 bp of DNA between the left and the right ends could not be excised even though it contained all of the cis-sequences required for excision. The same vector carrying 276 or 502 bp of DNA was active in the transient excision assay, indicating that Tol2 has a minimal size requirement for transposition. The spacer between two ends may be essential for the left and right ends in forming a synaptic complex with oligomers of the transposase proteins.
Essential repeats in the subterminal region:
The 200-bp left-end and the 150-bp right-end sequences essential for transposition contain 12-bp perfect TIRs and the subterminal regions. Mutations in the TIRs drastically reduced the excision activity, indicating that TIRs have a critical role in transposition. In the subterminal regions, we found that a 5-bp sequence, 5′-(A/G)AGTA-3′, is repeated 33 times. Mutations in these repeats reduced the excision activity to various levels, suggesting that the repeats are functionally important and the activity of each repeat is somewhat different. In the case of Ac, 238 bp from the 5′-end and 209 bp from the 3′-end, containing 11-bp TIRs and the subterminal regions, are required for the excision activity (Coupland et al. 1989). The TIRs of Ac are also essential for excision (Chatterjee and Starlinger 1995). The Ac transposase binds to a 6-bp sequence, AAACGG (Kunze and Starlinger 1989), and the 5′ and 3′ subterminal sequences contain 10 and 3 copies of this motif, respectively (Chatterjee and Starlinger 1995). While mutations in these motifs reduce the excision activity, none of them by itself is indispensable and only combinations of mutations in more than six motifs abolish the excision activity nearly completely (Chatterjee and Starlinger 1995). The mechanism of how these multiple binding sequences regulate transposition is unknown. These observations are similar to what we observed in the case of Tol2. Therefore, one possible function of the subterminal repeat of Tol2 may be interaction with the transposase protein. Although similar numbers of repeats are observed in the left end (17 repeats) and the right end (16 repeats) (Figure 5), these ends are not functionally interchangeable. Such dissimilar ends are observed also in Ac (Coupland et al. 1989) and Tag1 (Liu et al. 2001). It will be interesting to learn the mechanism that determines how the left and right subterminal regions play different roles in transposition, which may be common for transposons belonging to the hAT family.
In summary, our study provides useful information for Tol2 to be used as a gene transfer vector and also establishes a basis for further biochemical and molecular biological studies of the roles of the subterminal sequence in transposition.
We thank S. Miyabayashi, N. Mouri, T. Uematsu, and S. Takeda for fish room maintenance, and K. Hoshijima for personal communication. G. Morvan was supported by a fellowship from the Japan Society for the Promotion of Science. This work was supported by National Institutes of Health/National Institute of General Medical Sciences grant R01GM069382 and by grants from the Ministry of Education, Culture, Sports, Science and Technology of Japan.
↵1 Present address: Department of Regulations, Development and Molecular Diversity, Muséum National d'Histoire Naturelle, 7 rue Cuvier, 75231 Paris, France.
Communicating editor: D. J. Grunwald
- Received May 2, 2006.
- Accepted August 4, 2006.
- Copyright © 2006 by the Genetics Society of America