Abstract
We explored the feasibility of a strategy based on transposons to generate identified mutants of most Caenorhabditis elegans genes. A total of 1088 random new insertions of C. elegans transposons Tc1, Tc3, and Tc5 were identified by anchored PCR, some of which result in a mutant phenotype.
PROJECTS to build large-scale collections of mutants exist for several model organisms amenable to genetics, including yeast, flies, and mouse (Zambrowiczet al. 1998; Liaoet al. 2000; Vidan and Snyder 2001). In Caenorhabditis elegans, there are currently mutants for <10% of the estimated 19,000 genes of C. elegans. In an approach complementary to knockouts, nondirected transposon-based insertional mutagenesis can be used to generate collections of mutants. In the mutants obtained, insertion sites are determined a posteriori by molecular analysis. C. elegans Tc elements from the mariner family have been used extensively for both forward and reverse genetics purposes (Anderson 1995; Plasterk and van Luenen 1997). However, a systematic study of Tc distribution at the genome level in mutator strains and of their potential mutagenic effect is not available. In this study, we characterized random insertions of Tc1 and Tc3 elements of the mariner superfamily (Plasterket al. 1999) and of the distantly related Tc5 element (Collins and Anderson 1994) to assay the feasibility of such an approach to generate a large-scale collection of identified mutants of C. elegans.
A technical difficulty in C. elegans is that natural elements used for mutagenic purposes exist in multiple copies in all strains. Approximately 30 copies of Tc1, 20 copies of Tc3, and 6 copies of Tc5 are present in the reference N2 strains. As a consequence, molecular screens need to distinguish between new and preexisting transposon copies in genomes to be analyzed.
Detection of the insertions: To generate new insertions of Tc1, Tc3, and Tc5, we propagated strains carrying the mut-7 mutation (Kettinget al. 1999) over 10 generations. Worms were frozen to be included in the strain collection, and an aliquot was used to extract DNA. We detected insertions by a modification of the transposon display protocol (Wickset al. 2000), such that the radioactive and polyacrylamide gel steps were removed. Worms (15 μl) were collected in M9 buffer, frozen at -80° for at least 30 min, and incubated in 100 mm Tris-HCl (pH 8.5), SDS 1%, EDTA 50 mm, NaCl 0.1 m, and 100 μg/ml proteinase K for 3 hr at 65°. DNA was extracted with phenol/chloroform, precipitated with 0.1 vol of sodium acetate and 0.8 vol of isopropanol, and resuspended in 50 μl TE (10 mm Tris-HCl pH 7.6, 1 mm EDTA) + 10 μg/ml RNAse. Then 5 μl of genomic DNA was digested with restriction enzymes. For enzymes producing cohesive ends, fragments were blunted with the Klenow enzyme. After heat inactivation of the enzymes, a linker formed of 5′-ACCTGCCC-3′ and 5′-CTA ATACGACTCACTATAGGGCTCGAGCGGCCGCCCGG GCAGGT-3′ was ligated to the fragments. A nested PCR was performed on 0.5 μl of the ligation reaction, using primer AP1 and one of the Tc primers (see below) for the first round of PCR (20 cycles). The second round of PCR was performed on 0.01 μl of the first PCR product using primer AP2 and one of the Tc primers (30 cycles). PCRs were performed with the Eurobio enzyme (Evry, France) in the buffer provided by the manufacturer.
Primer sequences are as follows: AP1, CCATCCTAAT ACGACTCACTATAGGGC; AP2, ACTCACTATAGGGC TCGAGCGGC; TC1R1S, GATCGACTCGATGCCACGT CGTTG; TC1R2, GATTTTGTGAACACTGTGGTGAAG; TC3.1, GGTCCTATAGAAGTTTCACACTGG; TC3.2, TTC GGAAGTTCCTCAAACCTTC; TC5.1, GCCAAACCTGCT CTGAAGCAG; and TC5.4, GGATCATCTGTAACTATCC TCTATCG.
Annealing temperature was 58° for Tc1 and 48° for Tc3 and Tc5. PCR products were run on 2% agarose gels. Bands that were unique to a clone were picked with a plastic inoculator, quickly vortexed in 100 μl water, and re-PCRed using the set of primers used for the second round of PCR. The PCR products were purified on Promega (Madison, WI) Wizard columns and sequenced.
Each DNA sample was analyzed independently with three enzymes. Most sequences reported in this study were obtained using enzymes ClaI, HindIII, and PmlI. Only a fraction of the insertions were detected by each enzyme. On the basis of the number of insertions detected with each enzyme, we estimate that ∼2/3 of the insertions are detected using this protocol.
A total of 862 clones were generated. On average, two extra bands per clone (corresponding to new insertions) were observed, reamplified, and sequenced. Sequences were subsequently compared to the C. elegans genome by Blast. A total of 1088 sequences resulted in unambigous Blast results (625 Tc1, 253 Tc3, 210 Tc5). The list of insertions can be found on http://cgmc.univlyon1.fr and on http://www.wormbase.org.
To validate the protocol, we assayed whether we could recover predicted insertions from frozen worms. This is a critical point because (1) the anchored-PCR technique might produce artifacts and (2) original plates might carry a mixture of heterozygous and homozygous animals. We designed insertion-specific primers located ∼500 bp from the predicted insertion sites. Thawed worms were singled out and PCR tested using as primers the Tc-specific primer that had been used in the first instance and the insertion-specific primer. In this way, we could recover an insertion in 64 of 69 strains tested. In most cases, more than half of the worms carried the insertion.
Distribution of the insertions: One goal of this study was to observe the distribution of randomly generated Tc insertions in the genome. The distribution of each transposon (Tc1, Tc3, and Tc5) is shown on the physical map (Figure 1). No gross bias for any of the three transposons is observed, and we detected insertions in all chromosomic regions. The greatest disequilibrium between chromosomes is seen for Tc1, for which chromosome V has three times as many insertions as chromosome III (two times if standardized to chromosome length). The current resolution is not sufficient to determine if some genes are more prone to transposon integration than others.
Another important parameter of transposon distribution is the type of genomic region in which the transposons jump (exons, introns, intergenic regions). We observed that ∼20% of the insertions are located in coding sequences, 30% in introns, and 5% in 5′- or 3′-untranslated regions (arbitrarily defined as 100 bp before and after the coding sequence as defined in ACeDB/Wormbase databases). The remainder are located in intergenic regions. Coding sequence accounts for 26% of the C. elegans genome and introns for 14% (C. elegans Consortium 1998). Results were similar for each of the transposons. A likely explanation for the high proportion of intronic insertions comes from the AT-rich content of introns (the target sequence of Tc1 and Tc3 is TA, whereas Tc5 recognizes TNA).
—Distribution of insertions on the C. elegans physical map. Tc1, Tc3, and Tc5 are shown separately. Each chromosome is shown in the standard orientation. The length of each bar indicates the number of insertions within a 300-kb interval.
Mutagenic properties of insertions located in exons: We next tried to assay the mutagenic properties of insertions located in exons by asking whether these insertions produced a phenotype comparable to that resulting from a reduction of gene function. The loss-of-function phenotype of 6 out of the 277 exon insertions was known, as a result of either genetic (4 genes) or RNAi (2 genes) studies. We performed RNAi on 92 additional genes that were chosen at random. Ten genes produced obvious phenotypes by RNAi (Table 1), a ratio similar to that published in systematic screens (Fraseret al. 2000; Gonczyet al. 2000; Pianoet al. 2000; Hanazawaet al. 2001; Maedaet al. 2001). The corresponding 16 clones were thawed, and the insertion could be recovered in 14 of them. Homozygous insertions led to phenotypes resembling those obtained by mutation or inactivation of the gene in 7 out of 14 cases (Table 1). In most cases, the phenotype of the insertion was weaker than the one observed by mutation or RNAi, a feature that is common to transposon-induced mutations (Anderson 1995).
Phenotype associated with homozygous insertion alleles
The mutagenic properties of randomly generated transposon insertions in exons in C. elegans have been a matter for discussion for many years. It has been clearly demonstrated that insertions located in the coding sequence of several genes do not lead to a phenotype because the transposon is spliced out of the mRNA (Rushforth and Anderson 1996), but it is unclear how general this phenomenon is. Although n is limited, our results indicate that Tc insertions in exons are not silent; half of randomly generated insertions located in exons of genes having a visible phenotype led to a similar phenotype.
This work was designed as a pilot experiment to analyze the feasibility of a large-scale production of mutants based on C. elegans Tc elements. It performs a community service by providing novel Tc insertions in or near 600 genes. In addition, the issues addressed in this study will be relevant to other types of transposons that may be used as mutagenic tools in C. elegans in the future.
Acknowledgments
The authors are grateful to J. Godet, J. Samarut, J.-A. Lepesant, M. Philippe, and P. Couble for their continuous support for this project. We also thank T. Drynda for help with the web site, S. Wicks for the transposon display protocol, and L. Stein for incorporating the data into public databases. This work was supported by the Centre National de la Recherche Scientifique (CNRS), the Rhône-Alpes district, and the Association Française pour la Recherche contre le Cancer (ARC).
Footnotes
-
Communicating editor: P. Anderson
- Received December 26, 2001.
- Accepted June 18, 2002.
- Copyright © 2002 by the Genetics Society of America