- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Email this article to a friend
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Dufourcq, P.
- Articles by Labouesse, M.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Dufourcq, P.
- Articles by Labouesse, M.
lir-2, lir-1 and lin-26 Encode a New Class of Zinc-Finger Proteins and Are Organized in Two Overlapping Operons Both in Caenorhabditis elegans and in Caenorhabditis briggsae
Pascale Dufourcq1,2,a, Philippe Chanal1,a, Serge Vicairea, Elise Camuta, Sophie Quintina, Bart G. W. den Boera, Julia M. Boshera, and Michel Labouesseaa IGBMC, CNRS/INSERM/ULP, 67404 Illkirch Cedex, France
Corresponding author: Michel Labouesse, IGBMC, CNRS/INSERM/ULP, BP163, 1, rue Laurent Fries, 67404 Illkirch Cedex, France., lmichel{at}igbmc.u-strasbg.fr (E-mail)
Communicating editor: R. K. HERMAN
| ABSTRACT |
|---|
lin-26, which encodes a unique Zn-finger protein, is required for differentiation of nonneuronal ectodermal cells in Caenorhabditis elegans. Here, we show that the two genes located immediately upstream of lin-26 encode LIN-26-like Zn-finger proteins; hence their names are lir-1 and lir-2 (lin-26 related). lir-2, lir-1, and lin-26 generate several isoforms by alternative splicing and/or trans-splicing at different positions. On the basis of their trans-splicing pattern, their intergenic distances, and their expression, we suggest that lir-2, lir-1, and lin-26 form two overlapping transcriptional operons. The first operon, which is expressed in virtually all cells, includes lir-2 and long lir-1 isoforms. The second operon, which is expressed in the nonneuronal ectoderm, includes short lir-1 isoforms, starting at exon 2 and lin-26. This unusual genomic organization has been conserved in C. briggsae, as shown by cloning the C. briggsae lir-2, lir-1, and lin-26 homologs. Particularly striking is the sequence conservation throughout the first lir-1 intron, which is very long in both species. Structural conservation is functionally meaningful as C. briggsae lin-26 is also expressed in the nonneuronal ectoderm and can complement a C. elegans lin-26 null mutation.
WHEN the embryo develops from a single pluripotent cell (the zygote) to a multicellular organism, decisions become more and more complex as they involve more and more different cell types. A commonly used genetic strategy to achieve this complexity is to use a combination of different transcription factors in different cells. As a source of diversity for transcription factors, animals have sometimes evolved partially homologous and redundant genes that can be either unlinked (e.g., the retinoic acid receptors in mammals, ![]()
![]()
![]()
In this study, we focus on the developmental decision that leads an ectodermal cell to become neuronal, glial or epidermal. We have previously described a gene in Caenorhabditis elegans, lin-26, that is expressed in and required for nonneuronal ectodermal cells to acquire their normal fates (![]()
![]()
![]()
![]()
![]()
![]()
![]()
Because the decision to become neuronal vs. nonneuronal certainly depends on more than one gene, we decided to determine if there are genes partially homologous to lin-26 that could be involved in the same decision as lin-26. The gene lin-26 encodes a putative transcription factor (![]()
![]()
rst putative Zn-finger motif is characterized by the presence of five additional amino acids between the second cysteine and the first histidine and by the absence of the conserved phenylalanine normally located four amino acids after the second cysteine. The second putative Zn-finger motif is more canonical, except that it also misses the conserved phenylalanine.
Here we describe two genes that encode several isoforms with the same unusual C2H2 motif as LIN-26. These genes, which we named lir-1 and lir-2 (lin-26-related genes), map immediately upstream of lin-26. We report their structure and expression pattern. In addition, we describe the structure of the homologous lin-26 and lir genes in the nematode C. briggsae and show that the C. briggsae lin-26 homolog can substitute for the C. elegans lin-26 gene.
| MATERIALS AND METHODS |
|---|
Strains and general methods:
Methods for maintaining animals and genetic analysis were as described by ![]()
![]()
![]()
Isolation of cDNAs:
lir-2 and lir-1 cDNAs were isolated from two libraries prepared from size-selected cDNAs (1- to 2- and 2- to 3-kb ranges) primed with oligo(dT) (![]()
lir-2: Seven lir-2 cDNAs were isolated. The longest cDNA (pML124) contains 1781 bp. Four clones were found to contain the first exon, among which three had additional nucleotides (nts) at their 5' end that were identical to the final nts of the spliced leader type 1 (SL1); a smaller cDNA starting at the beginning of the second exon had three nts matching the last three nts of SL1.
lir-1:
Forty-two lir-1-positive clones were isolated. We found that cDNAs can start either within the first exon (9 clones) with a member of the SL2 family of spliced leaders (1 clone had nine additional nts identical to the final nts of the SL2 variant, called SL4, and another had five additional nts identical to the final nts of SL4 and SL5, see ![]()
|
|
lin-26:
Twenty-three partial and full-length lin-26 cDNA clones were isolated after screening 4 x 105 plaques of an oligo(dT) cDNA library (![]()
![]()
Isolation of C. briggsae lir-2, lir-1, and lin-26 homologs:
Southern blots of gels containing restriction digests of C. briggsae genomic DNA (prepared according to ![]()
![]()
DNA sequencing and sequence analysis:
DNA (~20 µg) from recombinant clones #a3 and #d1 was sheared by sonication. The ends were repaired with Klenow enzyme and T4 DNA polymerase (![]()
![]()
In the remainder of the text, nts will be numbered starting at the +1 nt of cosmid F18A1 (GenBank accession no. U41535) for C. elegans genes and at the +1 nt of the a3d1 sequence for C. briggsae genes.
Northern blot analysis:
C. elegans and C. briggsae RNA were isolated from mixed-stage populations, as described by ![]()
![]()
Reverse transcriptase-PCR experiments:
Reverse transcriptase reactions (RT) were performed using the first-strand Synthesis kit from Stratagene. To determine the respective ratios of certain C. elegans lir-1 and lin-26 isoforms, we performed PCR experiments for only 20 cycles (to avoid becoming rate limiting for one component of the PCR reaction) starting from different amounts of RT products. RT-PCR products were detected by Southern blot analysis with a probe that was PCR generated using the same primers as for the RT-PCR reaction. RT-PCR experiments involving C. briggsae transcripts were carried out for 35 cycles; RT-PCR products were detected by ethidium bromide staining after agarose gel electrophoresis. We used the primers described below.
- C.e.lir-1: 5'TACGGATCCGACGCCGTCTCGGAAAGCTGAAG (exon 3) and 5'TCTAAGCTTTAATAACCTGATTAAATTTATTATTTG (3' UTR);
- C.e.lin-26: 5'TCTAAGCTTGAAGAGAGTCGATCA (exon 4) and 5'GGTTGAGCCATTGGCTCGAG (3' UTR);
- C.e.act-123: 5'GAGTCATGGTCGGTATGGGAC and 5'GGGAGAGGACAGCTTGGATGG;
- C.b.lir-2: 5'TGACTACAAATGGCCCACGG (exon 1) and 5'CAAGTATTGCTCGGCAGGGC (exon 4), or 5'GAAAATGGGCACTCACAGGGCC (exon 4) and 5'AAATTGAAAACATCGCGACC (3' UTR);
- C.b.lir-1: 5'GAACGATGAAGCAGCAACCA (exon 1) and 5'TATGGCTTGATGTGCGAGTA (exon 3), or 5'CACAGACTTTGGCTATGGAA (exon 2) and 5'ATTAAATTTGCACACGGCAG (3' UTR);
- C.b.lin-26: 5'TGAAGTGAGCAATACCAACA (exon 1) and 5'TTCTTTGATGGCACACTTCC (exon 2), or 5'CAGGAAGTGTGCCATCAAAG (exon 2) and 5'AATCTCGAGTTTCTGATGGG (3' UTR).
We also used the primers 5'TCTAGAATTCCGCGGTTTAATTACCCAAGT and 5'TCTAGAATTCCGCGGTTTTAACCCAGTTAC, which are derived from the sequences of the C. elegans SL1 and SL2 splice leaders, respectively, as alternatives to the exon 1 primers listed above; note that the SL2-specific primer cannot discriminate between SL2 and its variants in RT-PCR experiments.
lacZ reporter and other constructs:
Standard protocols (![]()
![]()
lir-2::lacZ construct:
A 4781-nt genomic fragment was generated by PCR from the cosmid F18A1 using the primers 5'TACACTGCAGAAGCTCCAGCTACTATTGCG and 5'GCAGGATCCGGTTCGTGTTCCGAATTGTG and was subcloned into the PstI andBamHI sites of pPD21.28 (![]()
lir-1::lacZ constructs:
Plasmid pML169 carries the F18A1 BamHI/EcoRI fragment spanning nts 12,01327,160 (an 8-mer BglII linker was added at the EcoRI site after making it blunt), the BamHI/StuI fragment containing the ß-galactosidase coding sequence from plasmid pPD16.43 (![]()
lin-26::lacZ construct: Plasmid pML20 carries the F18A1 PstI/NheI fragment spanning nts 19,87328,789, the SmaI/SpeI fragment containing the ß-galactosidase coding sequence from plasmid pPD16.43 (with an 8-mer NheI linker at the SmaI site), and the F18A1 NheI/BssHII fragment spanning nts 28,78936,308.
Other lin-26 constructs:
Plasmid pML224 is a derivative of pML301 (![]()
C. briggsae construct: Plasmid pML303 contains the SacII/SpeI fragment of C. briggsae genomic DNA spanning nts 14,04325,168.
Immunostaining and in situ hybridization:
Animals and embryos were fixed and stained as described previously (![]()
![]()
![]()
![]()
The accession numbers of the sequences described in this work are AJ130959 (lir-2A), AJ130960 (lir-2B), AJ012485 (lir-1A), AJ012486 (lir-1C), AJ130957 (lin-26B), and AJ131134 (C. briggsae a3d1 sequence).
| RESULTS |
|---|
LIN-26 and three LIN-26-related proteins define a new group of Zn-finger proteins:
Sequencing of an 8-kb fragment surrounding lin-26 (![]()
![]()
Figure 1 shows that the predicted proteins encoded by lir-1, lir-2 and lin-26 in both species and by C. elegans lir-3 share the same unique features within their C2H2 motifs as LIN-26. In particular, there are four (LIR-1 and LIR-3) or five (LIR-2 and LIN-26) additional amino acids (aa) between the second cysteine and the first histidine of the first motif compared to the canonical TFIIIA zinc finger (![]()
The overall similarity among the C. elegans LIR proteins is rather low (the highest pairwise similarity is 28% identity, as determined by the Bestfit program, see Figure 1 legend). Besides the C2H2 motifs, another region, within the C end (PEL/M/VR/KAEWxxxL/MxxCFP, see Figure 1), is conserved between all proteins and does not show any similarity with proteins in the databases. A region rich in acidic residues, as can be found in the transactivation domain of certain transcription factors (![]()
Genomic organization of the lir-2/lir-1/lin-26 gene cluster:
To confirm the predicted structures of lir-2 and lir-1, we isolated cDNAs (see MATERIALS AND METHODS) and compared their sequences to the sequence of cosmid F18A1 (Figure 2). We performed RT-PCR experiments to confirm the existence of cDNAs (Figure 3 and data not shown).
|
lir-2 structure:
We found a major lir-2 isoform (lir-2A), as well as a minor isoform (lir-2B, see Figure 2A) that misses the first lir-2A exon. Both isoforms are trans-spliced to SL1 (![]()
![]()
lir-1 structure:
A complex pattern of trans-splicing and alternative splicing could potentially lead to nine different lir-1 isoforms (Figure 2, AC) that would code for proteins ranging in size between 141 and 309 aa. The first and second lir-1 exons are separated by a 9147-bp intron, which is very striking, as most C. elegans introns are <60 nts (![]()
![]()
![]()
lin-26 structure:
We had previously identified a long lin-26 cDNA (![]()
![]()
The intergenic regions separating the probable polyadenylation signal (AATAAA) of lir-2 and the beginning of lir-1A-C or that of lir-1 and the beginning of lin-26B are short (125 and 136 bp, respectively). In addition, the major lir-1 and lin-26 isoforms start with SL2 or its variants. These criteria indicate that lir-2, lir-1 and lin-26 have characteristics typical of genes organized as transcriptional operons in C. elegans (![]()
![]()
Synteny of the lir-2, lir-1, and lin-26 region in C. briggsae:
To gain further insights into the function of the genes lir-2, lir-1, and lin-26 and to identify potential sequences important to control their expression, we cloned their homologs from the nematode C. briggsae using low-stringency hybridization conditions. We discovered that the C. briggsae lir-2, lir-1, and lin-26 homologs are also tightly linked (Figure 2D). This led us to sequence a unique 27,242-nt fragment (which will be referred to as a3d1, see MATERIALS AND METHODS). Comparison of cDNA (see above), F18A1, and a3d1 sequences showed that a3d1 contains six complete putative genes corresponding to the homologs of lir-2, lir-1, lin-26 and of three genes located upstream of lir-2 in C. elegans; a3d1 might also contain part of a gene located downstream of lin-26. Each predicted gene in a3d1 was named after its homolog in C. elegans (as referred to in the ACeDB databank; ![]()
We observed that the gene order and the intron/exon positions and sizes are strikingly similar in this region (Figure 2D and Figure 4). This is not the first reported example of synteny between C. briggsae and C. elegans (![]()
![]()
![]()
|
As shown in Figure 3B, C.b.lir-2, C.b.lir-1, and C.b. lin-26 transcripts have approximate sizes of 1.8, 1.2, and 2.0 kb, respectively, in agreement with expected sizes and similar to sizes observed in C. elegans. Sequencing of RT-PCR products (see MATERIALS AND METHODS) confirmed the predicted intron/exon boundaries and showed that the trans- and alternative splicing patterns have been conserved, although the number of isoforms that can be produced in C. briggsae seems smaller. We specifically found (i) a single SL1 trans-spliced C.b.lir-2 equivalent to C.e.lir-2A; (ii) four potential C.b.lir-1 isoforms equivalent to C.e.lir-1A, C.e.lir-1C, C.e.lir-1D, and C.e.lir-1E that, as in C. elegans, originate from trans-splicing at the beginning of exon 1 with SL1 or at the beginning of exon 2 with one of the SL2 variants and from alternative splicing at the 3' end (Figure 2C); (iii) one SL2 trans-spliced C.b.lin-26 isoform equivalent to C.e.lin-26B. We did not find the C.b.lir-1 isoform corresponding to C.e.lir-1B (see also Figure 2 legend) nor the C.b.lin-26 isoform corresponding to C.e.lin-26A. Although we did not directly search for the C.e.lin-26C homolog, its generation in C. briggsae is possible given the precise conservation of intron/exon positions; the LIN-26C isoforms would be conserved, even within their unique terminal exon. Absence of these isoforms in C. briggsae coincides with poor sequence conservation in the lir-1/lin-26 intergenic region and suggests that these forms might not be essential for lir-1 and lin-26 function in C. elegans.
lir-2 is less conserved than lir-1 and lin-26:
The six predicted proteins encoded within a3d1 have been strongly conserved. Whereas C.b.LIR-2 is only 62% identical to C.e.LIR-2, all other predicted proteins show >77% identity with their C. elegans counterparts (Table 1 and Figure 4). This weaker conservation of LIR-2 extends to its zinc fingers, which have accumulated the most changes in comparison to LIR-1 and LIN-26 (Figure 1). We also estimated the evolutionary divergence using the parameters KS and KA, which count the number of substitutions per synonymous and nonsynonymous sites, respectively (![]()
![]()
![]()
|
Analysis of noncoding regions:
Noncoding regions have generally been less conserved during evolution than coding regions (![]()
![]()
![]()
![]()
![]()
![]()
The most remarkable homologies, however, reside in the first intron of lir-1, which is unusually long in both species (almost 9 kb). As shown in Figure 4C and Figure F, several long blocks (>500 nts) that are spread over the entire intron display up to 88% identity and are interspersed with smaller blocks of nonconserved sequences. This is very different from other noncoding areas in the a3d1 sequence (see the lin-26/F18A1.1 intergenic region in Figure 4) or from noncoding regions belonging to other genes that have been sequenced in nematodes (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
LIN-26B is the major functional isoform:
To assess the functional requirement for the C.e.LIN-26B and C.e.LIN-26A isoforms, we generated two constructs that should allow the expression of only one isoform. Compared to plasmid pML301, which can rescue the null mutation lin-26(mc15) (![]()
Further support in favor of the idea that LIN-26A is nonfunctional comes from the fact that a plasmid containing C.b.lin-26 with its promoter (pML303, note that there is no lin-26A-like isoform in C. briggsae, see above) rescued all phenotypes conferred by lin-26(mc15) (Table 2). Specifically, pML303 rescued the lethality, which assays lin-26 function in hypodermal cells; the amphid and phasmid defects, which assay lin-26 function in support cells; and partially the sterility conferred by lin-26(mc15), which assays lin-26 function in the somatic gonad (![]()
![]()
![]()
|
lir-2, lir-1, and lin-26 expression patterns are overlapping:
To analyze the lir-2 and lir-1 expression patterns, we performed in situ hybridization and analyzed the expression of lacZ reporter constructs (Figure 5 and Figure 6).
|
|
lir-2 transcripts were detected by in situ hybridization in all cells between the 2- and 200-cell stages, the signal being strongest at the 50-cell stage (Figure 5, AD). Expression of a lir-2::lacZ construct was also detected ubiquitously between the 50- and 200-cell stages (Figure 6, AB) and disappeared after the 350-cell stage. Presence of ß-galactosidase after the 200-cell stage probably reflects a difference in stability between the fusion protein and native lir-2 transcripts.
lir-1 transcripts were detected by in situ hybridization in all cells between the 2- and 50-cell stages (Figure 5E and Figure F), disappearing after that stage. They reappeared between the 200- and 350-cell stages, but they were too weak to allow a clear identification of cells. However, we presume that cells expressing lir-1 at the 200-cell stage are precursors of the major hypodermal cells (e.g., P, seam, and hyp4-7 cells), as these were clearly labeled at the 350-cell stage (Figure 5G). At the comma stage (Figure 5H), a signal was detected mostly in the anterior and ventral parts of the head and in the tail. Because this pattern resembles the pattern observed at the same stage for lin-26 (Figure 5L), we believe that it could correspond to head and tail support cells and to all minor hypodermal cells (hyp1-3, hyp8-11, and cells of the anus and excretory system). Expression of the lir-1::lacZ construct pML169, which should reflect the expression of all lir-1 isoforms, and of the lir-1::lacZ construct pML165, which should reflect only the expression of lir-1 isoforms starting after the second exon, were first very weakly detected at the 200-cell stage and then clearly detected at the 350-cell stage in major hypodermal cells (Figure 6E and Figure H) and at the early comma stage in those same hypodermal cells, as well as in support cells and in all minor hypodermal cells (Figure 6F and data not shown). Starting at the late comma stage, expression of the lir-1::lacZ construct pML169, but not that of pML165, was detected in all cells (i.e., in intestinal, pharyngeal, muscle, neuronal, hypodermal, and support cells, Figure 6G). Similarly, during larval development, expression of the lir-1::lacZ pML169 was detected in virtually all cells, including cells of the somatic gonad, but not in intestinal cells after the late L1 stage nor in germ cells (Figure 6O).
The expression pattern of lin-26 as determined by in situ hybridization and lacZ reporter constructs correlates perfectly with that reported for the LIN-26 protein (![]()
To assess the expression pattern of C.b.lin-26, we reasoned that the strong conservation of LIN-26 proteins in both species meant that the antiserum raised against C.e.LIN-26 was likely to detect C.b.LIN-26. We found that the C.e.LIN-26 antiserum does detect a nuclear protein in C. briggsae embryos (larvae were not examined). Assuming that this staining is as specific as with C. elegans animals, C.b.LIN-26 was detected at very low levels in all or nearly all cells of embryos with <200 cells (Figure 6C and Figure D), whereas C.e.LIN-26 is detected only in the germline precursors until the 100-cell stage and only in hypodermal precursors between the 100- and 200-cell stages (![]()
![]()
In summary, we have observed that lir-2 and lir-1A-C isoforms are expressed in all or most cells, whereas lir-1D-F and lin-26 are expressed in the nonneural ectoderm.
| DISCUSSION |
|---|
Multiple lir-2, lir-1, and lin-26 isoforms define a new zinc-finger motif:
This work reports the analysis of the structure and the expression of the C. elegans genes lir-2, lir-1, and lin-26, which span a 20-kbp region on chromosome II, as well as their conservation in the nematode C. briggsae. Several features characterize lir-2, lir-1, and lin-26. First, they encode proteins that define a new C2H2 motif related to that of TFIIIA zinc fingers. Their first C2H2 motif deviates from the canonical TFIIIA zinc
nger by the presence of four or five additional amino acids between the second cysteine and the first histidine, and by the absence of a conserved phenylalanine close to the second cysteine. Second, each gene can generate multiple isoforms by alternative splicing and trans-splicing. Trans-splicing is a mechanism that involves the transfer of a 22-nt leader sequence from either of two classes of spliced leader RNAs (SL1 or one of the SL2 variants) to the 5' ends of mRNAs (![]()
Although C. elegans and C. briggsae appear to have diverged a long time ago (current estimates put the divergence at 2050 mya; ![]()
![]()
![]()
Possible function of lir-2, lir-1, and lin-26 isoforms:
The presence of C2H2 motifs related to TFIIIA zinc fingers and the presence of potential transcription transactivation domains suggest that LIR-2, LIR-1, and LIN-26 could be transcription factors. Because the LIR-1A/LIR-1D and LIR-1C/LIR-1E specific exons correspond to the potential transactivation domains, these isoforms could have different activation specificity and/or strength. The fact that lir-2 and lir-1 are expressed in most cells raises the possibility that they could act as cofactors of other transcription factors. Determining the function of lir-1 and lir-2 will require the isolation of specific mutations. We would be surprised if lir-1 had no function because it has been strongly conserved in C. briggsae. By contrast, lir-2 has been less conserved, both at the nucleotide and amino acid levels. Either selective pressures are not high enough to maintain strict conservation, or its function has diverged between the two species.
As far as lin-26 is concerned, we show that C.e.lin-26B, but not C.e.lin-26A, can rescue the null phenotype of lin-26, and that C.e.lin-26C is much less abundant than C.e.lin-26B. Taken together, these data suggest that lin-26B is the major isoform, that lin-26A might be dispensable, and, on the basis of its relative abundance, that lin-26C is required only in a subset of cells. We are currently exploring the possibility that LIN-26C is required during the L1 stage in the Pn.p cells, which are hypodermal precursors. The rationale for this hypothesis is threefold: first, lin-26C is mainly detected at the L1 stage; second, the Pn.p cells are transformed into neuroblasts at the L1 stage by the mutation lin-26(n156); third, the Pn.p cells are the only cells to be transformed with a high penetrance into neuronal cells by any lin-26 mutation (![]()
![]()
lir-2, lir-1, and lin-26 form two overlapping operons:
Although eukaryotic genes are usually transcribed individually, ~25% of C. elegans genes appears to be organized in transcriptional operons that resemble bacterial operons (![]()
![]()
![]()
The organization and expression of lir-2, lir-1, and lin-26 bear many features characteristic of C. elegans operons. First, the intergenic distances between lir-2 and lir-1 and between lir-1 and lin-26 are short: the first lir-1 exon is located at 125 bp from the putative lir-2 polyadenylation signal; the first lin-26 exon of lin-26B, the major isoform, is located at 136 bp from the putative lir-1 polyadenylation signal. Second, we found that lir-2 is trans-spliced by SL1, while lir-1 transcripts, starting at the first exon, are trans-spliced by members of the SL2 family. In addition, we found that lir-1 transcripts starting at the beginning of the second or the third exons are trans-spliced by SL1, while major lin-26 isoforms are trans-spliced by SL2. Third, as further discussed below, the expression domains of lir-2, lir-1, and lin-26 partially overlap. Finally, this organization has been conserved in C. briggsae.
Two features of the lir-2, lir-1, and lin-26 genes suggest that these genes actually form a complex of two overlapping operons: a lir-2/lir-1A-C operon followed by a lir-1D-F/lin-26B operon. Indeed, transcripts from the most upstream genes (lir-2 and lir-1D-F) get trans-spliced to SL1, and the intergenic distances between all three genes are short. Although we cannot formally exclude the possibility that these three genes form a single operon, we strongly favor the two-operon model mainly because lir-2 and lir-1A-C are nearly ubiquitously expressed, whereas lir-1D-F and lin-26 are expressed in the nonneuronal ectoderm (see below). A unique operon, whose transcription would be under the control of a unique promoter, would not easily account for these differences. One would have to postulate some specific and complex mechanism that prevents the accumulation of lir-1D-F and lin-26 transcripts in many cells. By contrast, the two different expression patterns are easily explained in the context of two operons transcribed from distinct promoters. Further support in favor of the two-operon model is provided by our characterization of the lin-26 promoter. We have shown that it is possible to fully rescue the lin-26 null phenotype using plasmids that lack the first half of what would be a single operon (lir-2 and part of lir-1 intron 1, ![]()
![]()
![]()
|
Strong sequence similarity of a 9-kb noncoding intron:
On the basis of in situ hybridization, we found that lir-2 and lir-1 are maternally expressed and present in all cells until the 50-cell stage. On the basis of the expression of lacZ reporter fusions, we conclude that lir-2 and lir-1 are expressed in nearly all cells, lir-2 until the 200-cell stage and lir-1 starting at the comma stage (except in the gut of larvae). It is important to note that ubiquitous lir-1::lacZ expression was observed only for a construct containing 5.6 kb upstream of the first lir-1 exon (pML169) but not for constructs starting within the first exon (pML165). Thus, we suggest that there is a promoter element, which could correspond to the conserved sequences called CSE4 and CSE5 in Figure 4B, that would drive expression of the lir-2/lir-1A-C operon in the hermaphrodite adult germline and subsequently in all cells. The fact that we failed to detect ubiquitous lir-1::lacZ expression in early embryos might reflect a difficulty of the rapidly dividing cells of the early embryo in transcribing and splicing out, in a short time range, the first lir-1 intron, which is very long by C. elegans standards (9 kb). Such time constraints have previously been suggested to account for the apparent absence of function during early Drosophila embryogenesis of Dwnt-4 and invected, which are homologs of wingless and engrailed, respectively, because of the presence of large introns (![]()
![]()
On the basis of in situ hybridization and lacZ reporter fusions, we found that lir-1 and lin-26 are expressed at midembryogenesis in all hypodermal and support cells. Because the construct pML165 reflects the expression of lir-1D-F isoforms and was expressed in these cells, we suggest that common promoter elements drive the expression of the lir-1D-F/lin-26B operon in hypodermal and support cells. The most likely position of these promoter elements is within the first lir-1 intron. Interestingly, this intron has been very well conserved in size and sequence in C. briggsae. Genefinder predictions, careful examination, and C. elegans/C. briggsae sequence comparisons did not reveal any significant and conserved open reading frame within this 9-kb intron. Because most of this intron is required for normal lin-26 expression (![]()
/
T-cell receptor loci (![]()
In conclusion, the lir-2/lir-1/lin-26 complex has revealed some unique features in terms of genomic organization. We believe that it constitutes two separate and overlapping operons that contain an unusually large intron. Our initial motivation for characterizing lir-2 and lir-1 was to determine if they would be involved in the same process as lin-26, by analogy, for instance, with the four proneural genes from the achaete-scute complex in Drosophila (![]()
| FOOTNOTES |
|---|
1 These authors have equally contributed to this work. ![]()
2 Present address: Harvard Medical School, Department of Pathology, 200 Longwood Ave., Boston, MA 02115. ![]()
| ACKNOWLEDGMENTS |
|---|
We are very grateful to David Baillie for the C. briggsae genomic library, to David Eisenman for the mutant lin-26(ga91) and to Chad Nusbaum for the gift of RNA samples prepared from staged populations. The C. briggsae strain was provided by the CGC, which is supported by the National Institutes of Health and the National Center for Research Resources. We thank Satis Sookhareea for expert technical assistance. We also thank Thomas Bürglin for sharing cDNA filters, Elisabeth Georges for useful comments about the manuscript, as well as Sandra Metz and Bernard Boulay for help in the preparation of figures. P.D. was supported by fellowships from the Association de Recherche contre le Cancer, and P.C. received support from the Ministère de l'Education Nationale and the Ligue Nationale de Recherche contre le Cancer. This work was funded by the CNRS, INSERM, Hôpital Universitaire de Strasbourg, Human Frontier Science Program Organization, Association pour la Recherche contre le Cancer, Groupement de Recherches et d'Etudes sur les Génomes, Action Concertée Coordonnée des Sciences du Vivant (thème no. 4) du Ministère de la Recherche et de l'Education Nationale.
Manuscript received December 2, 1998; Accepted for publication February 1, 1999.
| LITERATURE CITED |
|---|
BARSTEAD, R. J. and R. H. WATERSTON, 1989 The basal component of the nematode dense-body is vinculin. J. Biol. Chem. 264:10177-10185
BLUMENTHAL, T., and K. STEWARD, 1997 RNA processing and gene structure, pp. 117145 in C. elegans II, edited by D. L. RIDDLE, T. BLUMENTHAL, B. J. MEYER and J. R. PRIESS. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY.
BRENNER, S., 1974 The genetics of Caenorhabditis elegans.. Genetics 77:71-94
CHAMBON, P., 1996 A decade of molecular biology of retinoic acid receptors. FASEB J. 10:940-954[Abstract].
CHANAL, P. and M. LABOUESSE, 1997 A screen for genetic loci required for hypodermal cell and glial-like cell development during Caenorhabditis elegans embryogenesis. Genetics 146:207-226[Abstract].
DEN BOER, B., S. SOOKHAREEA, P. DUFOURCQ, and M. LABOUESSE, 1998 A tissue specific knock-out strategy reveals that lin-26 is required for the formation of the somatic gonad epithelium in C. elegans. Development 125:3213-3224[Abstract].
DEN DUNNEN, J. T., J. W. VAN NECK, R. P. M. CREMERS, N. H. LUBSEN, and J. G. SCHOENMAKERS, 1989 Nucleotide sequence of the rat
crystallin gene region and comparison with an orthologous human region. Gene 78:201-213[Medline].
DEVEREUX, J., P. HAEBERLI, and O. SMITHIES, 1984 A comprehensive set of sequence analysis programs for VAX. Nucleic Acids Res. 12:387-395.
EECKMAN, F. H., and R. DURBIN, 1995 ACeDB and Macace, pp. 583605 in Methods in Cell Biology, Vol. 48, edited by H. F. EPSTEIN and D. C. SHAKES. Academic Press, San Diego.
EVANS, D., D. ZORIO, M. MACMORRIS, C. E. WINTER, and K. LEA et al., 1997 Operons and SL2 trans-splicing exist in nematodes outside the genus Caenorhabditis.. Proc. Natl. Acad. Sci. USA 94:9751-9756
FERGUSON, E. L., P. W. STERNBERG, and H. R. HORVITZ, 1987 A genetic pathway for the specification of the vulval cell lineages of Caenorhabditis elegans.. Nature 326:259-267[Medline].
FILES, J. G., S. CARR, and D. HIRSH, 1983 Actin gene family of Caenorhabditis elegans.. J. Mol. Biol. 164:355-375[Medline].
FIRE, A., S. W. HARRISON, and D. DIXON, 1990 A modular set of lacZ fusion vectors for studying gene expression in Caenorhabditis elegans.. Gene 93:189-198[Medline].
FRANCIS, R. and R. H. WATERSTON, 1991 Muscle cell attachment in Caenorhabditis elegans.. J. Cell Biol. 114:465-479
GRABA, Y., K. GIESELER, D. ARAGNOL, P. LAURENTI, and M. C. MARIOL et al., 1995 DWnt-4, a novel Drosophila Wnt gene acts downstream of homeotic complex genes in the visceral mesoderm. Development 121:209-218[Abstract].
GUSTAVSON, E., A. S. GOLDSBOROUGH, Z. ALI, and T. B. KORNBERG, 1996 The Drosophila engrailed and invected genes: partners in regulation, expression and function. Genetics 142:893-906[Abstract].
HESCHL, M. F. P. and D. L. BAILLIE, 1990 Functional elements and domains inferred from sequence comparisons of a heat-shock gene in two nematodes. J. Mol. Evol. 31:3-9[Medline].
HOOPER, J. E., M. PEREZ-ALONSO, J. R. BERMINGHAM, M. PROUT, and B. A. ROCKLEIN et al., 1992 Comparative studies of Drosophila Antennapedia genes. Genetics 132:453-469[Abstract].
KENNEDY, B. P., E. J. AAMODT, F. L. ALLEN, M. A. CHUNG, and M. F. HESCHL et al., 1993 The gut esterase gene (ges-1) from the nematodes Caenorhabditis elegans and Caenorhabditis briggsae.. J. Mol. Biol. 229:890-908[Medline].
KOOP, B. F. and L. HOOD, 1994 Striking sequence similarity over almost 100 kilobases of human and mouse T-cell receptor DNA. Nature Genet. 7:48-53[Medline].
KRAUSE, M., 1995 Techniques for analyzing transcription and translation, pp. 513529 in Methods in Cell Biology, Vol. 48, edited by H. F. Epstein and D. C. Shakes. Academic Press, San Diego.
KRAUSE, M. and D. HIRSH, 1987 A trans-spliced leader sequence on actin mRNA in C. elegans.. Cell 49:753-761[Medline].
KRAUSE, M., S. W. HARRISON, S. Q. XU, L. CHEN, and A. FIRE, 1994 Elements regulating cell- and stage-specific expression of the C. elegans MyoD family homolog hlh-1.. Dev. Biol. 166:133-148[Medline].
KUWABARA, P. E. and S. SHAH, 1994 Cloning by synteny: identifying C. briggsae homologues of C. elegans genes. Nucleic Acids Res. 21:4414-4418
LABOUESSE, M., S. SOOKHAREEA, and H. R. HORVITZ, 1994 The Caenorhabditis elegans gene lin-26 is required to specify the fates of hypodermal cells and encodes a presumptive zinc-finger transcription factor. Development 120:2359-2368
LABOUESSE, M., E. HARTWIEG, and H. R. HORVITZ, 1996 The Caenorhabditis elegans LIN-26 protein is required to specify and/or maintain all non-neuronal ectodermal cell fates. Development 122:2579-2588[Abstract].
LEE, Y. H., X. Y. HUANG, D. HIRSH, G. E. FOX, and R. M. HECHT, 1992 Conservation of gene organization and trans-splicing in the glyceraldehyde-3-phosphate dehydrogenase-encoding genes of Caenorhabditis briggsae.. Gene 121:227-235[Medline].
LI, W. H., 1993 Unbiased estimation of the rates of synonymous and nonsynonymous substitution. J. Mol. Evol. 36:96-99[Medline].
LUCHE, R. M., W. C. SMART, T. MARION, M. TILLMAN, and R. A. SUMRADA et al., 1993 Saccharomyces cerevisiae BUF protein binds to sequences participating in DNA replication in addition to those mediating transcriptional repression (URS1) and activation. Mol. Cell Biol. 13:5749-5761
LUKOWITZ, W., C. SCHRODER, G. GLASER, M. HULSKAMP, and D. TAUTZ, 1994 Regulatory and coding regions of the segmentation gene hunchback are functionally conserved between Drosophila virilis and Drosophila melanogaster.. Mech. Dev. 45:105-115[Medline].
MA, J. and M. PTASHNE, 1987 Deletion analysis of GAL4 defines two transcriptional activating segments. Cell 48:847-853[Medline].
MADURO, M. and D. PILGRIM, 1996 Conservation of function and expression of unc-119 from two Caenorhabditis species despite divergence of non-coding DNA. Gene 183:77-85[Medline].
MELLO, C. C., J. M. KRAMER, D. STINCHCOMB, and V. AMBROS, 1991 Efficient gene transfer in C. elegans: extrachromosomal maintenance and integration of transforming sequences. EMBO J. 10:3959-3970[Medline].
NILSEN, T.W., 1993 Trans-splicing of nematode premessenger RNA. Annu. Rev. Microbiol. 47:413-440[Medline].
RHODES, D. and A. KLUG, 1986 An underlying repeat in some transcriptional control sequences corresponding to half a double helical turn of DNA. Cell 46:123-132[Medline].
ROSS, L. H., J. H. FREEDMAN, and C. S. RUBIN, 1995 Structure and expression of novel spliced leader RNA genes in Caenorhabditis elegans.. J. Biol. Chem. 270:22066-22075
SAMBROOK, J., E. F. FRITSCH and T. MANIATIS, 1989 Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY.
SEYDOUX, G. and A. FIRE, 1994 Soma-germline asymmetry in the distributions of embryonic RNAs in Caenorhabditis elegans.. Development 120:2823-2834[Abstract].
SKEATH, J. B. and S. B. CARROLL, 1994 The achaete-scute complex: generation of cellular pattern and fate within the Drosophila nervous system. FASEB J. 8:714-721[Abstract].
STENICO, M., A. T. LLOYD, and P. M. SHARP, 1994 Codon usage in Caenorhabditis elegans: delineation of translational selection and mutational biases. Nucleic Acids Res. 22:2437-2446
THOMPSON, J. D., T. J. GIBSON, F. PLEWNIAK, F. JEANMOUGIN, and D. G. HIGGINS, 1997 The ClustalX windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tool. Nucleic Acids Res. 25:4876-4882
WATERSTON, R., C. MARTIN, M. CRAXTON, C. HUYNH, and A. COULSON et al., 1992 A survey of expressed genes in Caenorhabditis elegans.. Nature Genet. 1:114-123[Medline].
WAY, J. C., L. WANG, J. Q. RUN, and A. WANG, 1991 The mec-3 gene contains cis-acting elements mediating positive and negative regulation in cells produced by asymmetric cell division in Caenorhabditis elegans.. Genes Dev. 5:2199-2211
WILSON, R., R. AINSCOUGH, K. ANDERSON, C. BAYNES, and M. BERKS et al., 1994 2.2 Mb of contiguous nt sequence from chromosome III of C. elegans.. Nature 368:32-38[Medline].
ZORIO, D. A., N. N. CHENG, T. BLUMENTHAL, and J. SPIETH, 1994 Operons as a common form of chromosomal organization in C. elegans.. Nature 372:270-272[Medline].
This article has been cited by other articles:
![]() |
D. M. Eisenmann and S. K. Kim Protruding Vulva Mutants Identify Novel Loci and Wnt Signaling Factors That Function During Caenorhabditis elegans Vulva Development Genetics, November 1, 2000; 156(3): 1097 - 1116. [Abstract] [Full Text] |
||||
![]() |
H. M. Robertson The Large srh Family of Chemoreceptor Genes in Caenorhabditis Nematodes Reveals Processes of Genome Evolution Involving Large Duplications and Deletions and Intron Gains and Losses Genome Res., February 1, 2000; 10(2): 192 - 203. [Abstract] [Full Text] |
||||
![]() |
J. M. Bosher, P. Dufourcq, S. Sookhareea, and M. Labouesse RNA Interference Can Target Pre-mRNA: Consequences for Gene Expression in a Caenorhabditis elegans Operon Genetics, November 1, 1999; 153(3): 1245 - 1256. [Abstract] [Full Text] |
||||
- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Email this article to a friend
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Dufourcq, P.
- Articles by Labouesse, M.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Dufourcq, P.
- Articles by Labouesse, M.


symbol (members of the SL2 family of spliced leaders are collectively referred to as SL2). Inverted triangles show the positions of potential methionine initiation codons. lir-1 isoforms starting at exon 1 can have three possible 3' ends: in lir-1C, which contains four exons and can code for a potential protein of 296 aa, the last exon is not spliced; in lir-1A and lir-1B, which can code for proteins of 309 and 300 aa, respectively, the fourth exon is alternatively spliced from the same donor site, but onto different acceptor sites (see C). lir-1 isoforms starting at the beginning of exon 2 undergo alternative splicing similarly to lir-1A and lir-1C and were called lir-1D and lir-E, respectively; we did not identify any lir-1B-like cDNA among cDNAs starting at exon 2, possibly because we did not isolate enough cDNAs. RT-PCR experiments using an SL1 primer identified an isoform starting at lir-1 exon 3 (lir-1F); we have not determined the structure of the lir-1F 3' end (two possible structures are shown by analogy with lir-1D-E). (B) Detailed nt coordinates of exon boundaries for lir-2, lir-1, and lin-26 within the F18A1 sequence. (C) Enlargement of the region where lir-1 and lin-26 overlap, focusing on the alternative splicing pattern (C. elegans in the top part and C. briggsae in the bottom part). Below, the alternative C-terminal region of the different LIR-1 proteins is shown, starting at the level of the fourth exon; vertical arrows show the positions of introns, and an asterisk symbolizes the stop codon. Note that C.e.lin-26A, which is the cDNA described in 





