| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Genetics, Vol. 175, 1089-1104, March 2007, Copyright © 2007
doi:10.1534/genetics.106.065995
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
,

,**,4
* Department of Genetics, ** Department of Cell Biology and
Department of Molecular, Cellular and Developmental Biology, Yale University School of Medicine, New Haven, Connecticut 06520,
Department of Embryology, Carnegie Institution of Washington, Baltimore, Maryland 21218 and
Department of Genome Biology, Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720
4 Corresponding author: Department of Genetics, Yale University School of Medicine, P.O. Box 208005, 333 Cedar St., New Haven, CT 06520-8005.
E-mail: lynn.cooley{at}yale.edu
| ABSTRACT |
|---|
|
|
|---|
The more recent use of transposons based on the Lepidopteran PBac element has expanded the number of genes disrupted by single transposon insertions (HORN et al. 2003; BELLEN et al. 2004; BONIN and MANN 2004; THIBAULT et al. 2004). First introduced into the Drosophila melanogaster germline by HANDLER and HARRELL (1999), the PBac elements were shown to transpose and insert at TTAA sequences. Like the P element, PBac contains a single open reading frame encoding transposase and is bounded by short terminal inverted repeats. PBac elements have been demonstrated to insert into new genes that have not previously been hit using P-element techniques.
Engineered transposable elements have been productively employed to analyze gene expression by enhancer trapping, gene trapping, and, most recently, protein-trapping strategies. Enhancer trapping in Drosophila typically involves a transposon carrying a reporter gene, such as ß-galactosidase or green fluorescent protein (GFP), linked to a weak basal promoter. Insertion in the genome near genes can result in activation of the promoter and expression of the reporter gene in a pattern similar to that of the endogenous gene (O'KANE and GEHRING 1987; WILSON et al. 1989). This technology is useful in a variety of organisms including bacteria (DUNN and HANDELSMAN 1999) and zebrafish (BALCIUNAS et al. 2004).
From enhancer-trapping methods have evolved gene-trapping strategies, using transposons lacking a promoter. These elements rely on an endogenous gene's promoter and control region to report expression. First described in mouse embryonic stem cells (GOSSLER et al. 1989; SKARNES et al. 1992), these constructs usually use a splice acceptor site upstream of the reporter gene. Upon insertion into an intron, the splicing machinery incorporates, or "traps," the element into a gene's normal transcription. Variations of the mammalian constructs contain the lac Z gene and were designed with a transcription termination sequence to increase the possibility of mutation (SKARNES et al. 1995; CANNON et al. 1999; HIRASHIMA et al. 2004). GFP-tagged gene traps have also been generated in zebrafish (KOTANI et al. 2006) and Drosophila (BONIN and MANN 2004).
In addition to a gene's transcriptional readout, its protein localization pattern can also be critical in deciphering gene function. Protein traps were developed to study protein localization without the need to make antibodies or use fixed tissue. Protein traps were first developed as "CD tagging" in Chlamydomonas and Drosophila (JARVIK et al. 1996) and later in mammalian cell lines as "mini-exon epitope tagging" (SMITH 1997). Both of these methods use an open reading frame flanked by splice acceptor (SA) and donor (SD) sites. Upon insertion into an intron and subsequent transcription, a small peptide is incorporated into the endogenous protein. Since the tags are unique, specific antibodies can be used for functional studies. The concept was later adapted to use GFP as a tag, thus eliminating the use of antibodies, in Drosophila (MORIN et al. 2001; CLYNE et al. 2003). In these studies, GFP is used as a mobile artificial exon carried by a transposable P element. The GFP gene lacks initiation and stop codons and is flanked by SA and SD sites. When the intron phase matches the frame of the GFP exon, GFP is seamlessly inserted into an otherwise intact transcript and a GFP-expressing fusion protein is generated.
Like enhancer traps, protein traps provide a starting point for mutagenesis, information on gene expression, and easy identification of the trapped gene by sequencing the flanking genomic DNA. However, protein traps additionally provide information on the expression and subcellular localization of gene products. The use of GFP also allows a real-time dynamic study of the endogenous protein distribution in live tissue using flourescence microscopy. Other protein-trap vectors have recently been developed for use in systems such as mammalian cells (SINESHCHEKOVA et al. 2004).
Here, we report the results of a series of innovative protein-trap screens in D. melanogaster, which include the addition of a high-throughput embryo sorter, so that millions of animals were screened. We also describe the employment of both P and PBac transposable elements to generate a series of GFP-encoding protein traps and discuss the strategies, execution, results, and limitations of these approaches. This work was conducted in collaboration with our colleagues in the Chia laboratory in Singapore and the Spradling laboratory at the Carnegie Institution (see accompanying article by BUSZCZAK et al. 2007, this issue).
| MATERIALS AND METHODS |
|---|
|
|
|---|
2-3}99B and yw; Sb, P{
2-3}99B were obtained from the Bloomington Drosophila Stock Center at Indiana University. Deficiency stocks for complementation tests of lethal lines were also obtained from the Bloomington Drosophila Stock Center at Indiana University (see supplemental Table 2 at http://www.genetics.org/supplemental/). P-element mutator lines w; P{PTT-GA}III-1b, w; P{PTT-GB}III-3b, and P{PTT-GC}III-4b were obtained from William Chia (Temasek Lifesciences Laboratory, Singapore). Lines in the "G" and "ZCL" collections (MORIN et al. 2001) were obtained from William Chia.
P-element mutator lines:
P-element-based protein traps used in this work contain the enhanced green fluorescent protein (EGFP) gene encoded as a mobile artificial exon carried by a transposable P element (MORIN et al. 2001). The SA and SD sequences from the Drosophila Myosin heavy chain II gene flank the EGFP coding sequence, which lacks translation initiation and termination codons. The construct also encodes a 6xHis epitope fused to the amino terminus of EGFP. The transposon is marked with the eye color selection gene, mini-white. Fly lines containing three independent constructs were used, each containing the EGFP gene flanked by splicing sequences for one of the three reading frames. Splicing of the EGFP exon in PTT-GA maintains the reading frame interrupted by a phase 1 intron, PTT-GB is compatible with phase 2 introns, and PTT-GC with phase 0 introns. To move the mutators onto the CyO balancer chromosome, the insertions in w; P{PTT-GB}III-3b and P{PTT-GC}III-4b were remobilized by crossing to P{
2-3}99B in the presence of CyO. Insertions of the P{PTT-GB} and the P{PTT-GC} mapping to CyO and not expressing EGFP were used for the YB and YC screens.
piggyBac mutator and transposase lines:
A PBac protein trap construct was made by inserting the 6xHisEGFP A-frame exon into pBac[D.m. w+] (HANDLER and HARRELL 1999) and substituting the yellow gene for mini-white to create pBAC{HpaI-GFP; y+} (see accompanying article by BUSZCZAK et al. 2007 for details). Transgenic flies were created using the phspBac helper plasmid (HANDLER and HARRELL 1999), and the insertion on the X chromosome in pBAC{HpaI; y+}24.3 was remobilized to recover new insertions on the CyO balancer chromosome. An insertion on CyO that did not express EGFP was used in the YD screen.
A genomic source of piggyBac transposase was made by P-element-mediated transformation. The piggyBac transposase gene was excised from phspBac and joined with the ubiquitin promotor by cloning into the pCasper3-Up2-RX poly(A) P-element vector (WARD et al. 1998) to create P{w+, Ub-pBACtrans} (see BUSZCZAK et al. 2007 for details).
Genetic crosses:
P screen:
The first strategy was a modification of that described in MORIN et al. (2001). To accelerate the rate of recovering GFP-positive lines, we used a high-throughput automated Drosophila embryo sorter (COPAS Select; Union Biometrica). For the pilot screen, we used protein trap lines with the highest frequency of transposition (w; P{PTT-GA}III-1b, w; P{PTT-GB}III-3b, and P{PTT-GC}III-4b) (MORIN et al. 2001). Virgin females carrying P{
2-3}99B were crossed to a mixed population of males carrying the GA, GB, or GC protein trap vector. All of the starting elements were on the third chromosome and did not express detectable GFP. Male progeny carrying both P{
2-3}99B and a protein trap were mated to y,w virgin females. A virgining stock, y,w/Y, P{hs:hid}, was used to facilitate collecting large numbers of virgins. Embryos and first and second instar larvae in standard bottles were heat-shocked in 37° sand pits for 2 hr on 2 consecutive days. Females were collected shortly after eclosion. Approximately 250 males and 750 virgins were placed in large culture containers (condos). The flies were allowed to acclimate in the condos for 2–3 days with daily feeding of wet yeast paste on grape juice agar plates. For embryo collection, females were allowed to lay embryos on a fresh plate for 12–20 hr. Embryos were collected and screened from the same parents for 2 consecutive days, and then the parents were discarded. Embryos were dechorionated by incubation in 50% bleach for 2 min and rinsed extensively in PBS. The embryos were suspended in 100–200 ml of PBS with 2% Tween-20 and transferred to the sample cup of the embryo sorter. Using multiple condos, between 50,000 and 200,000 embryos were collected for each sorting session. Embryos flowed from a continuously mixed sample cup to a preanalysis chamber, where the embryos are surrounded by a sheath solution (Union Biometrica) to produce a stabilized flow and focus the embryos in the center of the flow stream. Embryos pass through the flow cell where a red diode laser (670 nm) measures size and optical density, while a multiline argon laser excites the GFP fluorophore. Emission signals are measured and real-time analysis of the measured parameters is used to make sorting decisions. Only those embryos that are GFP positive are dispensed into the standard food vials. GFP-negative embryos are directed by a puff of air to a sample container where they are discarded. The embryo sorter examined
70,000 embryos/hr. After GFP-positive embryos completed development, single animals with eye pigment (w+) were mated to y,w flies to establish stocks. Only flies that did not inherit the P{
2-3}99B chromosome were used to establish stocks. Stocks produced in this pilot screen were labeled "P" followed by an isolation number.
YB and YC screens:
The screen strategy was modified by moving the starting elements to the CyO balancer chromosome. In addition, one reading frame was screened at a time. Crosses to generate embryos for screening by the COPAS Select embryo sorter were carried out as follows: (1) w/Y; CyO, P{w+, GB} (or CyO, P{w+, GC}) x w; Sb, P{
2-3}99B (or Ki, P{
2-3}99B) virgins and (2) w/Y; CyO, P{w+, GB} (or CyO, P{w+, GC}); Sb, P{
2-3}99B (or Ki, P{
2-3}) x y,w virgins.
Cross 1 was carried out in standard fly bottles, and cross 2 was done in condos as described above. Lines resulting from this screen were designated "YB" or "YC." For the YC screen, only flies that did not inherit the P{
2-3} chromosome and the CyO chromosome were used to establish stocks. During the YB screen, we included flies carrying CyO for initial stocks. Only those lines in which CyO segregated from white+, indicating a new insertion, were kept.
YD screen:
To test the utility of PBac elements for protein trapping, the same EGFP artificial exon was mobilized in the context of a PBac transposon marked with yellow+. For this screen, the starting element encoded EGFP flanked by A-frame splicing signals compatible with the phase 1 introns. The crosses to produce embryos for screening were: (1) y,w/Y; CyO, PBac{y+, GA} x y,w; P{w+, Ub-pBACtrans} virgins and (2) y,w/Y; Cyo, PBac{y+, GA}; P{w+, Ub-pBACtrans} x y,w virgins.
Cross 1 was carried out in standard fly bottles, and cross 2 was done in condos as described above. Flies lacking the PBac transposase were identified by a white eye color. Only lines in which yellow+ segregated from CyO were kept. Lines resulting from this screen were designated "YD."
Imaging ovarian expression patterns from protein traps:
Ovaries were dissected from females fed on wet yeast paste for
36 hr. Following dissection in IMADS buffer (SINGLETON and WOODRUFF 1994a), ovaries were fixed in 6:1 of heptane and devitellinizing buffer (6% formaldehyde, 16.7 mM potassium phosphate, pH 6.8, 75 mM potassium chloride, 25 mM sodium chloride, 3.3 mM magnesium chloride) for 10 min (COOLEY et al. 1992). After washing in PBT (0.5% bovine serum albumin, 0.3% Triton X-100, PBS), egg chambers were separated and mounted in Antifade solution [30% glycerol, 2.5% DABCO (Sigma, St. Louis) in PBS]. Confocal images were acquired on a Zeiss LSM510 or LSM510 meta- (Yale Center for Cell and Molecular Imaging) or Bio-Rad (Hercules, CA) MRC600 laser-scanning microscope. Images were processed using Adobe Photoshop 7.0 and rotated to display with anterior on the left.
Analysis of DNA flanking protein trap insertions:
Genomic DNA flanking transposon insertions was amplified for DNA sequencing using inverse PCR (iPCR) or thermal asymmetric interlaced–PCR (TAIL–PCR). The procedure for iPCR was previously described (LIAO et al. 2000; BELLEN et al. 2004), and the detailed protocol is available on the website of the Drosophila Gene Disruption Project at http://flypush.imgen.bcm.tmc.edu/pscreen/. Amplification and sequencing of iPCR products was carried out either in the Cooley lab or in the Hoskins lab; amplification and sequencing of TAIL–PCR products was carried out in the Cooley lab. DNA sequencing at Yale was performed by the Keck Foundation Biotechnology Resource Laboratory.
iPCR:
For P-element insertions, genomic DNA was digested with the restriction enzyme HinP1I or Sau3A; 5' flanks were amplified by iPCR with the primers Plac1 (CACCCAAGGCTCTGCTCCCACAAT) and Pwht1 (GTAACGCTAATCACTCCGAACAGGTCACA) at an annealing temperature of 60° and sequenced with 5.SUP.seq1 (TCCAGTCACAGCTTTGCAGC); 3' flanks were amplified with EY.3.F (CCTTTCACTCGCACTTATTG) and EY.3.R (GTGAGACAGCGATATGATTGT) at an annealing temperature of 55° and sequenced with EY.3.F.
For PBac insertions, genomic DNA was digested with HaeIII; 5' flanks were amplified with PLF (CTTGACCTTGCCACAGAGGACTATTAGAGG) and PLR (CAGTGACACTTACCGCATTGACAAGCACGC) at an annealing temperature of 65° and sequenced with PLR; 3' flanks were amplified with PRF (CCTCGATATACAGACCGATAAAACACATGC) and PRR (AGTCAGTCAGAAACAACTTTGGCACATATC) at an annealing temperature of 55° and sequenced with PRF.
TAIL–PCR:
The procedure for TAIL–PCR was modified from SINGER and BURKE (2003). TAIL–PCR uses three nested primers complementary to the transposon and arbitrary degenerate (AD) primers that anneal to genomic locations. The annealing temperatures of the specific and AD primers differ, and PCRs with alternating cycles of high and low annealing temperatures allow amplification of specific products that include the junction of transposon and genomic DNA. AD primers were: AD1 (NGTCGASWGANAWGAA), AD2 (TGWGNAGSANCASAGA), AD3 (AGWGNAGWANCAWAGG), AD4 (STTGNTASTNCTNTGC), AD5 (NTCGASTWTSGWGTT), and AD6 (WGTGNAGWANCANAGA), where W equals A or T, S equals G or C, and N equals A or T or G or C. These were used singly or in various mixtures. Specific nested primers were designed for the 5' and 3' P-element ends, as well as for GFP in both directions. The 5' P-element primers for TAIL1, TAIL2, and TAIL3 reactions were Plac1 (CACCCAAGGCTCTGCTCCCACAAT), Ep5-2 (TACTCCAGTCACAGCTTTGCAGCA), and Sp1 (ACACAACCTTTCCTCTCAACAA). The P-element 3' primers for TAIL1, TAIL2, and TAIL3 reactions were Pry1out (ATTCAAACCCCACGGACATGCTAAGG), Pry4a (ACAATCATATCGCTGTCTCACTCAG), and Spep1a (CGACACTCAGAATACTATTCCTTTCAC). For GFP TAIL1, TAIL2, and TAIL3 reactions, the 5' primer set consisted of Egfp-750R (CTACGGCAAGCTGACCCTGAA), Egfp-gsp1 (GAACTTGTGGCCGTTTACGTCGCC), and GgfpRT-A (GTCCAGCTCGACCAGGATGGGCAC), while the 3' end was amplified with the primers gfpF1 (GGAGGACGGCAACATCCTGG), gfpF2 (CAACGTCTATATCATGGCCG), and gfpF3 (AGACCCCAACGAGAAGCGCG).
Genomic DNA was prepared by homogenizing 50 flies in 180 µl of PBS with a disposable microtube pestle and then isolated using the DNeasy Tissue kit membranes (QIAGEN, Valencia, CA) and eluted in 200 µl buffer AE (10 mM Tris–HCl, 0.5 mM EDTA, pH 9.0). For the TAIL1 PCR reaction,
600 ng genomic DNA were placed in a 20-µl reaction containing 2 µl 10x PCR reaction plus Mg buffer (Roche, Indianapolis), 0.4 µl 10 mM mixed dNTPs (Roche), 0.2 µl 5 units/µl Taq DNA polymerase (Roche), 1.5 µl 2 µM specific TAIL1 primer (listed above), 1.5 µl 40 µM mixed AD (or 40 µM AD3) primer (SINGER and BURKE 2003; LIU et al. 2005), and water. TAIL1 reactions were run in a Bio-Rad MyCycler thermal cycler programmed to settings decribed below. Reaction products were then diluted 1:50, 1 µl of which was used as template for 20-µl TAIL2 reactions (as TAIL1 reactions above, but substituting nested specific TAIL2 primer). For TAIL3 reactions, TAIL2 products were again diluted 1:50, and 1 µl was placed in a 20-µl reaction, as above, substituting nested specific TAIL3 primer. The thermal cycler settings for the three TAIL reactions were as follows: TAIL1, 94° for 3 min, 95° for 1 min, followed by 5 cycles of 94° for 1 min, 62° for 1 min, 72° for 2 min, 94° for 1 min, 25° for 3 min, ramping to 72° over 3 min, 72° for 2.5 min and 15 cycles of 94° for 30 sec, 65° for 1 min, 72° for 2.5 min, 94° for 30 sec, 65° for 1 min, 72° for 2.5 min, 94° for 30 sec, 44° for 1 min, 72° for 2.5 min, with an ensuing extension step of 72° for 10 min and cooling to 4°; TAIL2, 94° for 3 min, followed by 12 cycles of 94° for 30 sec, 64° for 1 min, 72° for 2.5 min, 94° for 30 sec, 64° for 1 min, 72° for 2.5 min, 94° for 30 sec, 44° for 1 min, 72° for 2.5 min, with a final extension step of 72° for 10 min and then cooling to 4°; TAIL3, 94° for 3 min, followed by 20 cycles of 94° for 1 min, 44° for 1 min, 72° for 2.5 min, with a final extension step of 72° for 10 min and then cooling to 4°.
TAIL3 PCR products were directly sequenced (unless multiple bands were visualized on an agarose gel in which case bands were extracted using a QIAquick gel extraction kit from QIAGEN) using the TAIL3-specific primer by the Keck Foundation Biotechnology Resource Laboratory at Yale.
Sequence analysis:
DNA sequences amplified by iPCR or TAIL–PCR were examined for the tranposon–genome junction by scanning for the proximal P or PBac ends. The distal junction made by iPCR was identified by text searching for the appropriate restriction site. All transposon and low-quality sequences were masked, and remaining sequences >20 bases were aligned to the D. melanogaster genome (Release 3.2) using BLASTN (ALTSCHUL et al. 1997). If sequences from both flanks aligned to the same position, the insertion site was assigned to the first base of the 8-bp target site duplication of P-element sequences or the 4-bp target site duplication of PBac insertions. If the flanking sequences aligned to different locations, the two were assumed to derive from separate insertions. If multiple alignments were present, the insertion was in repetitive DNA. When the proximal transposon–genome junction was not apparent in the flanking sequence, the insertion site was defined as the first base of the alignment to genomic DNA.
Analysis of GFP-containing transcripts in protein trap lines:
Ovarian RNA was isolated from lines that expressed EGFP during oogenesis. RT–PCR was used to determine the junction between EGFP coding sequence and the endogenous sequence in protein trap lines. Analysis of the 5' and 3' ends of GFP-containing transcripts was carried out by RACE.
RNA isolation:
Ovaries from 10 female flies fed on wet yeast paste were dissected in IMADS buffer (SINGLETON and WOODRUFF 1994a) using forceps cleaned in chloroform, then placed into 1.5-ml Eppendorf tubes containing 300 µl Trizol reagent (Invitrogen, San Diego) on ice, and homogenized within 30 min using disposable plastic micropestles (cleaned in chloroform). Homogenate was incubated at room temperature for 5 min, then 30 µl DEPC-treated water was added, and samples were vortexed vigorously before adding 60 µl chloroform and shaking tubes vigorously by hand. Samples were spun at 12,000 rpm for 10 min at 4°, and the aqueous layer was transferred to a new tube to which was added 0.8 vol isopropanol and then vortexed to mix and allowed to precipitate overnight at –20°. Samples were pelleted at 14,000 rpm for 30 min at 4°, supernatant was removed, and pellets were washed with 75% ethanol and allowed to dry 1–2 min at room temperature before dissolving in 50 µl DEPC-treated water. Following spectrophotometry, 10 µg of each sample were DNase treated in a 50-µl reaction, using the DNA Free kit (Ambion, Austin, TX).
3' RACE:
For the first-strand reaction 1 µg of RNA was extended with 0.5 µM of AdP primer (GGCCACGCGTCGACTAGAACTTTTTTTTTTTTTTTTT) and 10 units of M-MuLV reverse transcriptase (Roche no. 11062603001) in a total of 20 µl at 42° for 50 min. The reaction was terminated with a 15-min incubation at 70°. The first PCR was performed with 2 µl of the first-strand reaction and 0.2 µM AdAmpP primer (GGCCACGCGTCGACTAGAAC) and 0.2 µM gfpF2 primer (CAACGTCTATATCATGGCCG) with an annealing temperature of 55°, an extension time of 3 min, and 30 cycles. Nested PCR was performed with 2 µl of the first PCR reaction and 0.2 µM AdAmpP primer (GGCCACGCGTCGACTAGAAC) and 0.2 µM gfpF3 primer (AGACCCCAACGAGAAGCGCG) with an annealing temperature of 55°, an extension time of 3 min, and 30 cycles. Individual bands were isolated (QIAGEN QIAquick gel extraction kit) and sequenced using the gfpF3 primer (W. M. Keck Foundation Biotechnology Resource Laboratory, Yale University).
5' RACE:
For the first-strand reaction 5 µg of RNA were extended with 0.5 µM of gfp R1 primer (GTCGTGCTGCTTCATGTGGTCG) and 10 units of M-MuLV reverse transcriptase (Roche no. 11062603001) in a total of 25 µl at 42° for 50 min. The reaction was terminated with a 15-min incubation at 70°. First-strand reactions were cleaned with the QIAquick PCR purification kit (QIAGEN) and eluted in 30 µl water. A terminal transferase reaction was then done with 10 µl of the first-strand reaction with 400 units terminal transferase (Roche no. 03333566001) and 0.2 µM dCTP. The first PCR was performed with 5 µl of dC-tailed cDNA, 0.2 µM AchP primer (GGCCACGCGTCGACTAGTTCGGGIIGGGIIGGGIIG), and 0.2 µM gfpR2 primer (GACACGCTGAACTTGTGGCCG) with an annealing temperature of 55°, an extension time of 3 min, and 30 cycles. Nested PCR was performed with 5 µl of the first PCR reaction diluted 1:100 and 0.2 µM AchAmpP primer (GGCCACGCGTCGACTAGTTCG) and 0.2 µM gfpR3 primer (AGCTCCTCGCCCTTGCTCACC) with an annealing temperature of 55°, an extension time of 3 min, and 30 cycles. Individual bands were isolated (QIAGEN QIAquick gel extraction kit) and sequenced using the gfpR3 primer (W. M. Keck Foundation Biotechnology Resource Laboratory, Yale University).
RT–PCR:
One-half microgram of DNase-treated total ovarian RNA was placed in a 50-µl reaction containing DEPC-treated water, 1 µl each of both 2 µM sense and 2 µM anti-sense primers, 25 µl reaction buffer and 1 µl RT/Taq enzyme from the SuperScript One-Step RT–PCR kit (Invitrogen). Reactions were processed in a GeneAmp PCR System 9700 thermal cycler (PE Applied Biosystems, Foster City, CA): RT at 50° 30 min, denaturation at 94° 2 min, followed by 40 cycles of 94° 30 sec, 55° 30 sec, 72° 1 min, with a final extension at 72° 10 min and cooling to 4°. PCR products were visualized on agarose gels and sequenced either directly or following gel extraction of bands using the QIAquick gel extraction kit (QIAGEN).
Western blots:
Ovaries were dissected in IMADS buffer (SINGLETON and WOODRUFF 1994b), homogenized in 10 µl sample buffer (10% glycerol, 2% SDS, 0.06 M Tris pH 6.8, 0.7 M ß-mercaptoethanol) per ovary, and boiled for 5 min. Proteins were separated by SDS–PAGE electrophoresis and transferred to nitrocellulose. Polyclonal rabbit antisera against GFP (Torrey Pines TP401) was diluted 1:1000 in PBS with 5% powdered milk. Following overnight incubation at 4° in primary antibody, membranes were washed with PBS. Membranes were incubated for 1 hr with horseradish–peroxidase-conjugated secondary antibodies at 1:1000, and bands were detected using SuperSignal West Dura Extended Duration substrate (no. 34076; Pierce Chemical, Rockford, IL).
FlyTrap website:
Information on fly lines generated during this project and how to obtain them is available on our FlyTrap website (http://flytrap.med.yale.edu) (KELSO et al. 2004). This site is significantly enhanced with new searching tools and displays for insertion sites and images of GFP expression in tissues. FlyTrap was implemented using MySQL (version 5) (http://www.mysql.com). Our web server was implemented using Apache running on a Linux server. The web interface was built using PHP (http://www.php.net). The system also incorporates the open source gif-draw (GD) library V2 to generate the gene and insertion diagrams. Routine updates to the MySQL database are made from a FileMaker Pro (version 7) database using custom Perl scripts.
| RESULTS |
|---|
|
|
|---|
2-3}99B). Males with both a mutator and P{
2-3}99B transposase were crossed to y,w virgins, and their progeny were screened using the embryo sorter. New insertions of a mutator element into a gene that result in production of GFP can be detected by monitoring GFP fluorescence. During the course of 3 months, we recovered >4000 GFP+ embryos from 2.5 million embryos (Table 1, P screen); however, low embryo survival (25%) reduced the number of lines that could be established to 1060. Lines were established from surviving fertile adults that did not inherit the P{
2-3}99B-bearing chromosome and designated P lines. We focused on examining ovaries for evidence of GFP-fusion protein expression and found that
75% of the lines had a GFP pattern during oogenesis. Thus, the embryo sorter can capture rare GFP+ animals and can improve the rate of screening for protein trap insertions.
|
2-3}99B when choosing GFP+ animals, all initial lines set up contained the starting mutator elements. This complicated mapping and balancing of new insertions on the basis of following eye pigment produced by the white+ marker in the mutator. Reexamination of lines after balancing revealed many cases where the GFP+ pattern was lost. An additional complication was encountered when we attempted to determine the sequence of genomic DNA flanking protein trap insertions. New insertions on the third chromosome were accompanied by the starting element. Therefore, iPCR of third chromosome lines resulted in amplification of the starting elements as well as the protein trap insertions. The same problem occurred when sequencing flanking DNA from lines made by MORIN et al. (2001). We tried to identify insertion sites in 132 G lines and 293 ZCL lines (Table 2) sent to us by William Chia. No informative sequence was recovered for 26 G lines and 107 ZCL lines. Most of the lines in which sequencing attempts failed mapped to the third chromosome: 14 of 26 G lines and 83 of 107 ZCL lines. Because of this, we focused on sequencing P lines that mapped to the second and X chromosomes and obtained informative sequence from 79 of 95 (Table 2). Therefore, the pilot screen showed that protein trapping with an automated embryo sorter was promising; however, the starting position of the mutator elements needed to be addressed.
|
2-3}99B. This allowed us to select against both the starting mutator elements and the transposase source.
We screened >2.2 million embryos to obtain GFP+ embryos carrying insertions of P{PTT-GC} and nearly 3.8 million embryos for P{PTT-GB} (Table 1). These screens were done separately; therefore, the reading frame of the protein traps in the resulting lines was known. The date each GFP+ embryo was sorted was recorded and used to keep track of independent insertions. Lines made with P{PTT-GB} were designated with a YB prefix followed by an isolation number, and lines made with P{PTT-GC} were designated with YC. Survival of GFP+ embryos was improved compared to the pilot screen: 80% for YB embryos and 75% for YC embryos. Crosses to establish stocks and map insertions to chromosomes were set up from 1713 YB adults and 832 YC adults, resulting in 217 balanced YB lines and 110 YC lines (Table 1). There were several reasons the number of balanced lines was reduced compared with the initial number of adult survivors: death of the adults prior to mating, sterility of the adults, and discarding of stocks in which the new insert mapped to either the P{
2-3} chromosome or CyO. In addition, there were cases where no white+ progeny were produced; these were false positives collected by the sorter. We set the parameters for embryo collection close to the background level of GFP detection, reasoning that this would allow us to capture embryos with weak GFP expression. Overall for the YB and YC screens, one balanced line was produced for every
17,000 and
21,000 embryos sorted, respectively. This was lower than the rate for the P screen in which three mutator elements were mobilized simultaneously.
piggyBac screen:
The final screen we carried out used a PBac mutator that allows in-frame splicing of EGFP in the A reading frame. PBac has a simple AT-rich target sequence, TTAA (CARY et al. 1989), that allows PBac to insert into introns, which are relatively AT rich, more readily than P elements (BONIN and MANN 2004; THIBAULT et al. 2004). Therefore, we tested whether PBac-based protein trapping is more efficient than P-element screens.
Over 10 million embryos were sorted, resulting in 3279 embryos scored as GFP+ (Table 1). A total of 1929 animals survived to adulthood (59%) and were mated to y,w flies followed by crosses to map the insertions to chromosomal locations. A total of 433 balanced lines were produced and designated as YD lines. The overall production rate (1 line per 23,098 embryos screened) was similar to the rate for P-element mutators (Table 1).
Mapping and sequence analysis of insertion sites:
We mapped new insertions of protein trap mutators to chromosomal locations before analyzing expression patterns or sequencing insertion sites. We chose this strategy so that any expression patterns we found later were associated with mapped, stable inserts. Initial crosses of GFP+ animals occasionally produced progeny with two different (nonwhite) eye colors; independent lines were established and most cases were found to represent two independent insertions. Because we did not keep lines with insertions on the CyO chromosome (carrying the starting element) or the P{
2-3}-bearing chromosome, the available targets for new insertions were the X chromosome and one each of the major autosomes. Therefore, the expected distribution of inserts was 20% on the X, 40% on the second, and 40% on the third on the basis of the lengths of the euchromatic portions of these chromosomes. Genetic mapping of the lines made with P-element mutators (YB and YC) revealed marked deviations from this distribution. There were fewer insertions than expected on the second chromosome in both the YB and the YC lines (25%) (Table 3). The YC lines were skewed dramatically with 69% of insertions on the third chromosome and only 5% on the X chromosome. The distribution of insertions in the lines made with the PBac mutator (YD lines) was closest to the expected distribution (Table 3).
|
Carrying out en masse matings to produce embryos for screening in the embryo sorter introduced the possibility of capturing premeiotic clusters of nonindependent transpositions. To produce a list of independent events among the YB, YC, and YD lines, we used genetic mapping data, the date the founding GFP+ animal was obtained, and the insertion site to identify duplicate events and remove them from further consideration. For the P lines, we used genetic mapping and insertion sites as criteria since the sort dates were not recorded for each line. For the G and ZCL lines we sequenced, we relied on the insertion site and the line designation to judge which lines had independent events. A total of 819 lines represent independent events (Table 4). In several instances more than one insertion was recovered per gene (Table 5); therefore, these insertions were associated with a total of 335 genes (see supplemental Table 1 at http://www.genetics.org/supplemental/ for complete list).
|
|
Insertions can create both protein and enhancer traps:
The gene for EGFP in the mutator lines lacks start and stop codons; therefore, expression of EGFP was expected to depend on insertion into an intron of a host gene and incorporation into the open reading frame of mature mRNAs by the splicing apparatus. We analyzed the position of insertions relative to nearby genes to predict whether they were within coding regions and found evidence not only for the expected protein traps, but also for enhancer traps. We included G and ZCL lines (MORIN et al. 2001) that we sequenced, the P lines from the pilot screen, and the Y lines in this analysis.
Protein traps:
A total of 295 of the insertions, representing 115 genes (Table 4), were in introns between annotated protein-coding exons. These insertions were positioned to function as protein traps. Analysis of GFP expression in ovaries confirmed that the EGFP-fusion proteins had specific subcellular localization patterns. We obtained protein traps in both known and predicted genes that had specific expression patterns in the ovary. For example, lines with insertions in I'm not dead yet (Indy), which encodes a plasma membrane cation transporter (KNAUF et al. 2002), localized specifically to follicle cell membranes (Figure 1A). Protein traps in Rtnl1, an endoplasmic reticulum (ER) protein (WAKEFIELD and TEAR 2006), showed EGFP localization to the ER in nurse cells and oocytes (Figure 1B). Insertions in calmodulin (Cam) produced GFP fluorescence localized primarily to cell membranes and less so to nuclei (Figure 1C). Finally, insertions in trailer hitch (WILHELM et al. 2005) expressed GFP specifically in germline cytoplasm with enrichment in the oocyte (Figure 1D). Insertions in uncharacterized annotated genes also had specific subcellular localizations. For example, insertions in CG6416 expressed GFP in the ovarian muscle sheath (Figure 1E), and insertions in CG15926 resulted in GFP localization to follicle cell membranes (Figure 1F).
|
Enhancer traps:
Unexpectedly, we found that many lines had insertions not predicted to be protein traps, suggesting that GFP was produced by other mechanisms. A total of 254 lines (Table 4, Enhancer) had insertions within 500 nucleotides of the 5' end of a gene and with the GFP gene in the same 5'–3' orientation, but upstream of the annotated protein-coding sequence. Thus, production of GFP in these lines depended on the availability of an upstream methionine codon. The GFP fluorescence was nuclear (Figure 2, A, C, and G) in 234 of 254 of these lines. Western blot analysis of ovary extracts probed with antibodies to GFP showed most bands to be similar in size,
40 kDa (Figure 2I, solid arrowhead), which is
14 kDa larger than GFP alone. We found two additional types of insertions: those that were inserted within an annotated gene but in the opposite orientation (Table 4, enhancer opposite) and those that were inserted >500 nucleotides from any annotated genes (Table 4, no gene). These lines also had almost exclusively nuclear GFP fluorescence patterns (Figure 2, B and D–F) and had GFP bands on Western blots of similar sizes to those seen in enhancer lines (Figure 2I, solid arrowhead). The highest frequency of enhancer, enhancer opposite, and no gene traps was in the YB and YD lines: 122 of 159 YB lines and 255 of 314 YD lines (Table 4).
|
We found three different types of cDNA sequence downstream of the GFP exon in the P-element lines. In five lines (three enhancers and two no genes), the transcript was unspliced and continued through the splice donor sequence (Figure 2J, circled 1). This introduced one additional amino acid codon followed by a stop codon. A polyadenylation signal was present 196 bases from the end of the GFP coding sequence within the 3' P end. Other lines had evidence of splicing to downstream exons. In nine lines designated "no gene," the GFP exon spliced into an unannotated genomic exon with partial canonical SA sites and poly(A) signal. These added between 1 and 19 codons to the end of the GFP. The lack of significant coding sequence and the noncanonical SA sequences indicate that these are most likely cryptic exons and not unannotated genes (Figure 2J, circled 2). None of these 3' splice variants add significantly to the size of the predicted GFP-containing protein produced.
The third type of cDNA sequence found downstream of GFP in these lines was the first coding exon of an annotated gene where the reading frame was in frame with that gene (Figure 2J, circled 3). This was the case for YB0011, which is 5' of the D1 chromosomal protein (D1) gene, and ZCL3170, which is 5' of string of pearls (sop). The use of the Met codon from exon 0 of the P-element transposase gene allows for what is essentially an N-terminal tag to the native protein with a short linker sequence between. In support of this, the proteins seen on Western blots were much larger than those from lines that appear to be enhancer traps (Figure 2I, open arrowheads). In addition, the expresson pattern in egg chambers for these two lines was consistent with the type of proteins encoded by the genes. D1::GFP is present in nuclei with prominent dots in nurse cell nuclei (Figure 2G). Sop is a ribosomal protein, and Sop::GFP is located in the cytoplasm of both germline and follicle cells (Figure 2H). Although we originally designated these lines as enhancer traps because the insertion sites were 5' of genes, the cDNA sequence, Western blot data, and expression patterns supported reclassifying these as protein traps.
RNA analysis of PBac enhancer traps revealed a splicing pattern in each of five lines examined that results in an identical predicted GFP-containing protein. In all cases, splicing 3' of GFP was to the yellow genomic region in the vector (Figure 2K). This "exon" is not part of the yellow gene but lies in the upstream genomic region and on the strand to the yellow coding region. Like the downstream cryptic exons seen in the P-element lines, the cryptic exon in yellow contained a noncanonical SA site and poly(A) addition signal, but resulted only in six amino acid codons before a stop codon. Additionally, all lines analyzed showed 5' splicing to sequences from the 5' PBac end (Figure 2K). Two segments of the 5' PBac end were incorporated into cDNAs, neither of which is a bona fide exon. The second segment contained the start codon of PBac transposase, which normally reads through the entire length of the 5' PBac sequence. However, in these lines splicing results in the first 91 amino acids being fused to GFP, resulting in a predicted protein of 39 kDa. Unlike the P-transposase::GFP fusion protein, there is no predicted nuclear localization sequence in the PBac-transposase::GFP fusion protein.
These results suggest that the large number of non-protein traps was isolated due to splicing in frame to sequences in the 5' P-element and PBac ends, thus contributing an upstream start codon to the GFP exon so that it can act as an enhancer trap. These lines are useful indicators of which cells express the trapped gene, but, unlike the protein traps, they do not provide information on endogenous protein localization.
Lethality as a function of trap type:
Once the trap types were known, we could determine the fraction of each that caused lethality among the YB, YC, and YD balanced lines (Table 6). Of the 81 genes in these lines with Protein trap insertions, we determined the homozygous phenotype for 70 and found that 36% had only lethal inserts. In contrast, for the enhancer trap insertions, 53% of the genes investigated had only lethal insertions. This difference may be because the enhancer trap insertions are near the transcription start sites and more likely to disrupt expression of the genes. The enhancer opposite type of insertion had the lowest rate of lethality; 29% of the examined genes had only lethal insertions. The lower lethality of enhancer opposite traps compared to enhancer traps is probably because enhancer opposites were not restricted to the vicinity of transcription start sites, but were distributed along the length of the genes.
|
To test whether transposon insertions caused the observed lethality, we focused on the protein traps that were homozygous lethal and determined whether the lethality was complemented by deficiencies that delete the insertion sites. Deficiencies were available from the Bloomington Stock Center for lethal protein trap insertions in 17 of the 25 genes (Table 6). Of these, 10 were lethal over a deficiency, one was male lethal over a deficiency, and six were viable (see supplemental Table 2 at http://www.genetics.org/supplemental/). These data suggest that the lethality in
35% of the protein trap lines was not due to the insertion we mapped. Therefore, lethal lines must be tested for secondary lesions before continuing with analysis of the genes.
Protein traps can reveal isoform-specific expression patterns:
Multiple protein trap insertions in a gene of interest can be valuable for determining expression patterns of protein isoforms encoded by different transcripts. One example we discovered through our protein-trapping screen is the IGF-II mRNA-binding protein (Imp) gene for which we collected a total of 16 independent lines, incorporating both P and PBac elements. Imp encodes eight annotated protein isoforms by alternative splicing. The insertion sites were arrayed widely across the span of the gene within introns (Figure 3A). The insertions in the YD lines (PBac elements) were 5' of all annotated start codons (Figure 3A) and had very low GFP expression during oogenesis. For the P-element insertions, we found several distinguishable expression patterns, three of which are shown in Figure 3. The G00080 line had the most 3' insertion sites, and expression of GFP was in the cytoplasm of both the germline and the somatic follicle cells throughout oogenesis (Figure 3, D and G). GFP expression in G00080 showed a dramatic oocyte enrichment beginning in region 2 of the germarium (Figure 3D). In stage 10 egg chambers, GFP fluorescence was enriched subcortically in oocytes with the highest level of protein at the posterior pole (Figure 3G). The insertion in G00152 represents a group of insertions within the first exon of Imp-RA, Imp-RB, and Imp-RC, but still within an intron of the remaining transcripts. Expression was low within the germline, but strong in follicle cells throughout oogenesis (Figure 3, C and F). There was a slight enrichment of GFP in the oocytes in G00152 egg chambers (Figure 3C). YB0057 represents one of the insertions located upstream of Imp transcripts Imp-RA, Imp-RB, and Imp-RC (Figure 3A). GFP expression in YB0057 was highest in somatic follicle cells (Figure 3, B and E), and, in contrast to G00152 (Figure 3C), expression was reduced during stages 7–9 (Figure 3B).
|
Comparing P-element and piggyBac protein traps:
We acquired confocal micrographs of adult ovarian tissue from a total of 431 P-element lines and 96 PBac lines and noted that the PBac lines generally appeared to have lower expression levels than the P-element lines. This was the case both for protein traps and for enhancer traps. Ninety-four percent of the P-element lines had expression in adult ovaries, 15% of which we classified as "faint." In contrast, 70% of the PBac lines had expression during oogenesis, and 60% of these were faint.
To investigate this in more detail, we compared expression levels produced by PBac and P-element protein traps in several genes for which we recovered insertions of both elements (Figure 4). Since we had only PBac lines with the "A" reading frame, we compared these to P elements also carrying the A frame. These were available in the G and ZCL lines that we sequenced. The PBac insertion in all four genes we examined in detail produced weaker GFP fluorescence than a P-element insertion within the same intron. Although weaker, the localization of GFP was the same. Fasciclin3::GFP was expressed highly in the follicle cells of the germarium and later resolved to specific expression in the polar follicle cells (Figure 4, A and B). This matches published Fasciclin3 antibody localization (LEE et al. 1997; HAN et al. 2000). Sgg::GFP was present in the cytoplasm of both nurse cells and follicle cells, with some enrichment on plasma membranes and nurse cell ring canals (Figure 4C). This expression is similar to that reported in BOBINNEC et al. (2006). Expression from the PBac insertion in sgg was similar to the P-element insertion, but less intense (Figure 4D). The Oda gene (also called gutfeeling) encodes Ornithine decarboxylase antizyme. The P element and PBac protein trap insertions were both within the first intron of Oda, such that GFP tags the Oda-PA isoform. In egg chambers Oda::GFP was present in both the cytoplasm and the nuclei of nurse cells and follicle cells (Figure 4E). Although localization of Oda has not been reported previously, the Oda gene has been shown to regulate Sex-lethal in the germline (VIED et al. 2003) so its presence in egg chambers was expected. Again, the level of expression produced in the PBac line was lower than that in the P-element line (Figure 4F). Finally, we compared expression of GFP::Pdi from P-element and PBac protein traps. Protein disulfide isomerase (Pdi) is present in the lumen of the ER, and GFP::Pdi expressed from P-element protein traps have been used to reveal ER dynamics during oogenesis (BOBINNEC et al. 2003). We found the same pattern of expression of GFP::Pdi from PBac protein traps in Pdi, although the level of expression was again lower compared to that from the P-element protein traps (Figure 4, G and H).
|
| DISCUSSION |
|---|
|
|
|---|
To help reach the goal of producing GFP fusions to the majority of endogenous Drosophila proteins, we have explored transposon-mediated approaches for introducing an artificial GFP exon into genes (MORIN et al. 2001). Because productive protein traps require insertion into introns, this approach has the advantage of introducing tags into genes at their normal chromosomal location with potentially minimal damage to promoters. The resulting protein expression patterns are controlled by the normal transcriptional control sequences and reflect endogenous expression. However, since the GFP-coding sequence must be in the same orientation as the host gene, and the reading frame of the GFP gene must match the intron phase, insertion events that produce bona fide protein traps are rare. Therefore, we used a high-throughput embryo sorter to accelerate the rate of recovering protein traps. We screened >18 million embryos to obtain a pool of >5000 animals carrying potential new protein traps. Analysis of the resulting insertions revealed that we recovered not only the expected protein traps, but also a variety of enhancer traps that result from unexpected splicing patterns. Our work both validated the use of transposons to produce protein traps and provided key insights into the limitations of current configurations of protein trap constructs.
Trap types:
Extensive sequence and expression analysis showed that, as expected, we identified protein trap insertions within introns that interrupt coding regions. A total of 295 fly lines were associated with 115 genes, including both characterized and uncharacterized genes. By RNA analysis, we confirmed that the GFP coding sequence is fused to predicted exons in 34 lines examined. This strongly suggests that all the protein traps are behaving as expected. Even though we screened for GFP-fusion proteins during embryogenesis, >70% of the lines were also expressed during oogenesis where they displayed a range of expression patterns revealing both cell-specific expression and subcellular localization. For example, Indy and CG15926 were expressed only in follicle cells, Fas3 was in polar follicle cells, while Rtnl and Trailer hitch were almost exclusively expressed in the germline. We observed many examples of uniform cytoplasmic localization, nuclear localization, association with plasma membranes, endoplasmic reticulum, and muscle fibers. Furthermore, in cases where we could compare, the majority of the protein trap expression patterns were similar to antibody staining patterns. Therefore, these lines will be useful for examining expression patterns in other cell types.
Although this screen was explicitly designed for the purpose of engineering protein traps, we in fact identified enhancer traps in more genes than we did for protein traps (136 enhancer traps vs. 115 protein traps). The enhancer traps were almost exclusively in the P-element lines that contained the B reading frame and the PBac lines that contained the A reading frame. RNA analysis indicated that this was due to splicing of sequences within the 5' portion of the vector to provide a methionine start codon in frame with the coding region of the GFP. In the case of P-element enhancer traps, it appears that transcription can initiate in the 5' end of the transposon, and the cellular splicing machinery can then create a usable poly(A)+ mRNA from the transposon alone, or by splicing to a cryptic downstream exon (see Figure 2). In the PBac enhancer traps we analyzed, transcription initiated in the first (noncoding) exon of a gene, and splicing produced GFP-coding mRNAs that terminated within the transposon. The enhancer trap lines therefore produce GFP that is not fused to endogenous proteins. This was confirmed by Western blot analysis of several enhancer traps; proteins detected with GFP antibodies were all
40 kDa in molecular weight. Another hallmark of this type of enhancer trap is accumulation of GFP in nuclei. For the P-element enhancer traps this is explained by the presence of a nuclear localization signal in the transposase peptide fused to GFP. The mechanism for nuclear targeting of the PBac-derived proteins is not known. While they are clearly not protein traps, the enhancer traps are still very useful since they provide accurate readouts of transcriptional activity.
Among the lines with insertions upstream of the coding sequence of a gene, we did find cases where the GFP-coding sequence was spliced to an annotated genomic coding exon and produced a fusion protein with GFP linked to the amino terminus. These have been reclassified as protein traps. However, the vast majority of the predicted enhancer trap lines had nuclear expression patterns and small GFP-containing proteins, suggesting that N-terminal fusion of GFP to endogenous proteins is uncommon in the lines we analyzed. Therefore, the majority of insertions that lie upstream of the coding region of an annotated gene likely represent enhancer traps.
This work underscores the importance of careful vector design. A number of constructs (protein traps, RNAi, and genomic rescuing fragments) contain exons that must be spliced for proper expression. Attempts should be made to avoid the possibility of splicing unintended portions of the transposon into the desired mRNAs. One easy way to do this may be to put genes in the reverse orientation of the transposable element used. It should be noted, however, that although the possibility of P-element exon 0 being spliced to the B frame of GFP could have been predicted, the splicing that occurred in the PBac 5' end could not. This splicing pattern does not reflect any splicing that normally occurs in PBac.
We classified 125 lines, representing 82 genes, as enhancer opposite on the basis of the orientation of the insertions and proximity to an annotated gene. Although we did not analyze RNA from these lines, two pieces of evidence suggest they are also acting as enhancer traps. First, as with the enhancer traps, the enhancer opposites are overrepresented in the YB and YD lines, both of which we have shown include many lines that produce GFP-encoding mRNAs (see above). Second, 10 of 15 lines we examined on Western blots had GFP-positive bands
40 kDa. Therefore, expression of GFP in these lines is in response to enhancers that may or may not be associated with the gene they are inserted in.
Finally, we classified 128 lines as no gene because their insertions were located >500 nucleotides from any annotated gene. Although these could have been picking up unannotated genes, our evidence suggests they are acting as enhancer traps. The lines represent 75 different locations in the genome, 7 of which had at least two independent insertions. Of 19 lines analyzed by Western blotting, 17 had GFP-positive bands
40 kDa. One of the sites with multiple independent insertions is between CG6301 and mir-8 where we found 25 PBac insertions in the plus orientation among the YD lines and five independent P-element insertions in the minus orientation among the P, G, ZCL, and YB lines. RNA analysis of two of the P-element insertions showed that the lines were splicing into the same cryptic exon. Such hot spots could indicate that the genome contains strong transcriptional control sequences far removed from known transcription units (MANAK et al. 2006). Alternatively, they may point to the positions of short noncoding RNA genes. However, very few of the no gene lines were lethal (14% compared to 42% lethality overall), suggesting that associated genes are not essential.