- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Supplemental data
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Ferris, P. J.
- Articles by Goodenough, U. W.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Ferris, P. J.
- Articles by Goodenough, U. W.
Genetic Structure of the Mating-Type Locus of Chlamydomonas reinhardtii
Patrick J. Ferrisa, E. Virginia Armbrust2,a, and Ursula W. Goodenoughaa Department of Biology, Washington University, St. Louis, Missouri 63130
Corresponding author: Patrick J. Ferris, Campus Box 1229, 1 Brookings Dr., Washington University, St. Louis, MO 63130., ferris{at}biosgi.wustl.edu (E-mail)
Communicating editor: S. L. ALLEN
| ABSTRACT |
|---|
Portions of the cloned mating-type (MT) loci (mt+ and mt-) of Chlamydomonas reinhardtii, defined as the
1-Mb domains of linkage group VI that are under recombinational suppression, were subjected to Northern analysis to elucidate their coding capacity. The four central rearranged segments of the loci were found to contain both housekeeping genes (expressed during several life-cycle stages) and mating-related genes, while the sequences unique to mt+ or mt- carried genes expressed only in the gametic or zygotic phases of the life cycle. One of these genes, Mtd1, is a candidate participant in gametic cell fusion; two others, Mta1 and Ezy2, are candidate participants in the uniparental inheritance of chloroplast DNA. The identified housekeeping genes include Pdk, encoding pyruvate dehydrogenase kinase, and GdcH, encoding glycine decarboxylase complex subunit H. Unusual genetic configurations include three genes whose sequences overlap, one gene that has inserted into the coding region of another, several genes that have been inactivated by rearrangements in the region, and genes that have undergone tandem duplication. This report extends our original conclusion that the MT locus has incurred high levels of mutational change.
THE mating-type (MT) locus of the haploid green alga Chlamydomonas reinhardtii, located 30 cM from the centromere of linkage group (chromosome) VI, is involved in generating mating-type plus or minus gametic phenotypes in response to nitrogen starvation (![]()
![]()
![]()
1-Mb region of recombinational suppression, in the center of which is an
200-kb domain [the rearranged (R) domain] that has undergone numerous translocations and inversions involving four large segments of the domain (Fig 1). These rearrangements presumably suppress meiotic crossing over in the flanking telomere-proximal (T) and centromere-proximal (C) domains of the locus.
|
Of the genetic markers under recombinational suppression, three define genes that are selectively transcribed in response to nitrogen starvation and are directly involved with generating either the plus or the minus gametic phenotypes.
- The Fus1 gene, originally marked by the imp1 mutation, encodes an 810-amino-acid glycoprotein that is necessary for gametic cell fusion. It is located in region c of the mt+ R domain (Fig 1) and has no homolog in the mt- locus (
FERRIS et al. 1996 ).
- The Mid gene, originally marked by the imp11 mutation (the mutant allele and mutant strain are henceforth designated mid-1), encodes a 147-amino-acid regulatory protein, related to a family of nitrogen-sensitive transcriptional regulators (
SCHAUSER et al. 1999 ), that induces cells to differentiate as minus gametes. It is located in region f of the mt- locus and has no homolog in the mt+ locus (
FERRIS and GOODENOUGH 1997 ).
- The Sad1 gene, marked by the imp10/imp12 mutations (
HWANG et al. 1981 ) and the agl mutation (
MATSUDA et al. 1988 ), encodes a 3875-amino-acid protein that serves as the flagellar sexual agglutinin of minus gametes. It is located just centromere-proximal to the mt- R domain (Fig 1), with an allele located in the homologous position in the mt+ locus, an allele that is ordinarily not expressed because its expression is Mid-dependent and plus cells lack the Mid gene. A full report on the characterization of the Sad1 gene is in preparation.
Although several genes involved with mating map to the MT locus, including several new genes that are described in this report, many other gamete-specific genes are not linked to MT and are designated as "autosomal" (![]()
![]()
![]()
The MT locus is also involved in mediating uniparental transmission of organelle genomes during the zygotic phase of the C. reinhardtii life cycle. All four meiotic products of zygote germination ordinarily inherit chloroplast DNA (cpDNA) from the plus parent only and mitochondrial DNA from the minus parent only, the nontransmitted organellar DNAs having been selectively degraded during zygote maturation (![]()
![]()
![]()
Mutant alleles in the MT locus that fail to recombine also mark several genes that are expressed in vegetative (mitotic) cells and play no known specific role in gametogenesis or zygote development. Five of these "housekeeping" genes have been cloned and at least partially characterized; all lie outside the R domain at positions designated in Fig 1. The Nic7 (nicotinamide-requiring), Ac29 (acetate-requiring), and Thi10 (thiamine-requiring) gene sequences have been identified by their ability to complement mutant alleles (![]()
![]()
![]()
![]()
![]()
![]()
The finding that such housekeeping genes are intermixed with life-cycle-specific genes suggests that the MT locus arose in an "ordinary" chromosome, in much the same way that the sex chromosomes of mammals were once ordinary chromosomes and continue to encode non-sex-related proteins (![]()
We also report the characterization of several genes that are found in one MT locus but not the other, expanding our understanding of the coding capacity of MT and providing additional evidence for high mutational change in the region (![]()
| MATERIALS AND METHODS |
|---|
Northern analysis:
The C. reinhardtii strains used to prepare RNA for Northern analysis were wild-type strains CC-620 (mt+) and CC-621 (mt-) all strains are available from the Chlamydomonas Genetics Center, Duke University (Durham, North Carolina). Cultures were maintained in continuous light on Tris-acetate-phosphate (TAP) medium (![]()
![]()
![]()
![]()
EMBL3 genomic phage clones that comprise the chromosome walk through the MT loci (![]()
-32P]dCTP (DuPont/New England Nuclear Research Products) by random priming.
Isolation of cDNA clones:
The cDNA clones for pr6(+), pr6(-), Mta1, Mta2, and Ezy2 were identified by screening plaque lifts of a cDNA expression library in Uni-ZAPXR (Stratagene, La Jolla, CA) prepared from 1-hr zygotic poly(A)+ RNA (![]()
DNA sequencing and analysis:
The strategy for DNA sequencing included subcloning, gene-specific primers, nested deletions using the double strand nested deletion kit (Pharmacia, Piscataway, NJ) and use of the GPS-1 genome priming system (New England Biolabs, Beverly, MA). Some sequence data were obtained by making single-stranded DNA according to ![]()
![]()
![]()
The sequences described in this article have the following GenBank accession numbers: Nic7 partial genomic,
AY032929; pr6(-) cDNA,
AY032930; pr6(+) cDNA,
AY032931; Mtd1 cDNA,
AF417574; Pr46 genomic,
AF387366; Pdk genomic,
AF387365; Ezy2 genomic (mt+),
AF399653;
-Ezy2 genomic (mt-),
AF399654; autosomal a region, left border of the duplication,
AF417573; autosomal a region, right border of the duplication (Mta2 and Mta3 genes),
AF309495; mt+ a region, left border of the duplication,
AF417572; mt+ a region, right border of the duplication (Mta1,
-Mta2, and
-Mta3 genes),
AF417571.
RNase protection analysis:
Total RNA was isolated essentially as described by ![]()
![]()
Uniparental inheritance crosses:
Genetic crosses were performed using standard protocols (![]()
![]()
| RESULTS |
|---|
Transcriptional patterns in the MT locus: methodology:
Northern blots containing poly(A)+ RNA from vegetative cells of both mating types, gametes of both mating types, and zygotes 30 min and 3 hr into development were prepared, and these were screened with 128 probes from the MT locus. The data are presented on the GENETICS website at http://www.genetics.org/supplemental. The probes were chosen to give near total coverage of the R domain (
90% covered, with six gaps of 23 kb, and most <1 kb). The C and T domains were covered less extensively (except near the R domain borders), primarily using probes known to give single-copy bands on Southern blots (![]()
![]()
From these primary data we attempted to identify all the bona fide genes within the R domains of the mt+ and mt- loci, an analysis complicated by false negatives and false positives.
False negatives (a gene failing to be identified by the Northern analysis) could result for several reasons.
- The message is of low abundance and the blots are not sensitive enough. For example, probe 54, known to contain part of the Mid gene, and probe 93, known to contain part of the Mat3 gene, did not generate Northern-blot signals under the conditions used.
- Signals produced by cross-hybridizing repetitive sequences in the probe may obscure gene-specific signals. For example, one cannot discern the Nic7 mRNA against the smeared hybridization signals produced by probe 5 (Fig 2A).

View larger version (71K):
In this window
In a new window
Download PPT slide
Figure 2. Northern blots hybridized to selected MT locus probes. (AD) Poly(A)+ RNA was isolated from mt+ vegetative cells (veg+), mt- vegetative cells (veg-), mt+ gametes (gam+), mt- gametes (gam-), zygotes 30 min after mating (zyg 30'), or zygotes 3 hr after mating (zyg 3 hr). The size of the RNA is indicated on the right (in kilobases). (A) Blot hybridized with probe 5 (Nic7 gene). (B) Blot hybridized with probe 6. (C) Blot hybridized with probe 61 (Mtd1 gene). (D) Blot hybridized with probes derived from the Mta1 and Mta2 cDNAs. (E) Blot hybridized with an Mta1 cDNA probe. Total RNA was isolated from veg+ and gam+, from zygotes 2.5 hr after mating (zyg 2.5 hr), and from gametes of an mt+/mt- diploid (diploid gamete). (F) Poly(A)+ RNA from the designated stages hybridized with the 6.5-kb XhoI fragment from the 16-kb repeat of the Ezy2 locus of mt+(Fig 7); the 3.9-kb Ezy2 signal is visible only in the 1-hr zygote sample (the minor band beneath it is assumed to be artifactual since it is not always present; cf. Fig 9). - The gene may not be expressed under the growth conditions used or during the life cycle stages tested.
- The gene may not have been represented in any of our probes, although this is unlikely for the R domain. Given these considerations, the gene density displayed in Fig 1 is very likely to be an underestimate.
False positives result if the probe cross-hybridizes to messages derived from elsewhere in the genome. This could result from repetitive-sequence elements in the probe that are present in unrelated messages, most likely in the 3' untranslated region (UTR), or a probe that detects a transposon or a duplicated gene. In the case of duplicates, the copies in the MT locus might be functional (although we have no documented examples of this), or, like the genes in the a region of mt+ (see below), they might be pseudogenes. In constructing Fig 1 and Table 1 we endeavored to eliminate false positives, but this is necessarily subjective. In general, multiple bands or smears were considered false positives, as were cases in which a DNA fragment known to be present in only the mt+ or mt- locus generated signals in RNA blots derived from both mating types.
|
Several regions where messages were identified by Northern blots were analyzed in more detail by DNA sequencing to confirm that the Northern analysis accurately predicts genes. This was considered particularly important to verify the existence of vegetatively expressed genes within the R domain and to identify new genes involved in the mating process. The Chlamydomonas expressed sequence tag (EST) data from the Kazusa Institute (![]()
![]()
Transcriptional patterns in the mt locus: observations:
Fig 1 shows the location of major transcriptional units in the mt+ and mt- loci, with additional information on the various transcripts provided in Table 1. Genes designated by boxes are expressed during the vegetative phase of the life cycle; most of these were also expressed in gametes and early zygotes (Table 1). These presumably represent genes whose products function throughout the life cycle, and they are henceforth referred to as housekeeping genes. Genes designated by circles are expressed only in gametes, with (+) transcripts found only in plus gametes, (-) transcripts found only in minus gametes, and (+/-) found in both; the expression of these genes is presumably regulated directly or indirectly by nitrogen starvation. Genes designated by triangles are expressed only in early zygotes (whether their expression continues into the late stages of zygote development/germination has not been investigated); the expression of these genes is presumably regulated directly or indirectly by gametic cell fusion (![]()
![]()
The following sections describe genes or MT regions that were subjected to in-depth analysis.
Nic7:
Probe (Pr) 5 is a 2.1-kb genomic fragment from the center of the Nic7 gene in the T domain, as defined by rescue of the nic7 mutation using transformation (![]()
![]()
Database searches with this partial Nic7 sequence yielded no matches to C. reinhardtii ESTs (which is not surprising given that Nic7 is probably a low-abundance message, and the sequence is not near the 5' or 3' ends). After excluding six putative introns from the Chlamydomonas sequence, a significant homology (63% identity) was found to an Arabidopsis protein predicted from genomic sequencing (GenBank no. BAB09392). The function of the Arabidopsis gene is unknown. However, both sequences display a weak homology to prokaryotic quinolinate synthetase A genes (e.g., 24% identity to the Escherichia coli nadA sequence). Since quinolinate synthetase participates in one pathway of NAD biosynthesis (![]()
Pr6:
The Pr6 gene, detected by a cDNA called probe 6, is not expressed in vegetative cells but is transcribed at low levels in gametes, abundantly in 30-min zygotes, and somewhat less abundantly in 3-hr zygotes (Fig 2B). The gene is located in the T domain (Fig 1).
Two distinct classes of Pr6 cDNA clones, with slightly different sequences, are present in 1-hr-zygote cDNA libraries, indicating that both the mt+ (CC-620 parent) and mt- (CC-621 parent) alleles are expressed. Restriction-site polymorphisms allowed us to assign the two cDNA types to their respective alleles (![]()
|
A single recombinant between nic7 and ac29 has been isolated (![]()
The sequence of Pr6p is 40% identical, over 190 amino acids, to E. coli Endopeptidase IV (P08395), the signal peptide peptidase (![]()
Pyruvate dehydrogenase kinase:
The results using probes 3739 highlight the problems of false positives and negatives. Probe 38 hybridizes to a 1.6- and a 1.9-kb message; probe 39 hybridizes to a 3.2- and a 1.1-kb message; probe 37, which partly overlaps probe 38, gives a negative result. The four signals all appear to be false positives; in fact, the message for the protein encoded in this region is not visualized.
We sequenced a 5813-bp region from segment 2 of the mt- R domain (Fig 1) that covers the region represented by these three probes plus a few hundred flanking nucleotides. A BLAST search identified three C. reinhardtii ESTs to this sequencetwo from the Chlamydomonas Genome Project mt+ set (AW758420 and AW758419 are the 5' and 3' ends, respectively, of the same clone) and one from the Kazusa mt- set (AV643090)which all correspond to the same mRNA. We sequenced the AV643090 clone completely to identify intron borders and the 3' end of the gene. This region contains a 4974-bp gene (GenBank no.
AF387365) predicted to produce a message of
2.6 kb, which does not correspond to any of the bands seen on Northerns.
The predicted protein product is homologous to both pyruvate dehydrogenase kinase (Pdk) and the closely related branched chain
-keto acid dehydrogenase kinase (Bckdk), containing all the conserved motifs (![]()
The Pdk gene resides near one end of segment 2 (Fig 1) such that its 3' UTR extends beyond the sequence discontinuity that marks the end of segment 2. This means that the final 146 bp of the mt- 3' UTR and the final 125 bp of the mt+ 3' UTR are unrelated.
Glycine decarboxylase complex subunit H, Pr46a, and Pr46b:
Probe 46, a 5.4-kb Sal fragment from segment 3 of the mt+ R domain, hybridizes to a single 1.1-kb transcript seen only in vegetative cells; however, the sequence of probe 46 matched ESTs representing three different transcripts, two of 1.1 kb and one of 1.4 kb (Fig 3). In mt+ and mt- genomic Southern blots, probe 46 hybridized to a single band, indicating that these three genes are present only in the MT locus.
|
The sequence of the leftmost message (Fig 3) encodes the mitochondrial enzyme glycine decarboxylase complex subunit H (GdcH), which participates in photorespiration (![]()
Gene Pr46a is represented by four ESTs, all from the Kazusa collection and hence derived from the mt- allele of the gene. One of these (AV390703) was sequenced to determine the intron locations and the 3' end. The predicted Pr46a protein of 96 amino acids is highly conserved (80% identity to an Arabidopsis protein, 75% identity to a Caenorhabditis elegans protein) but of unknown function. The sequence does not appear in the yeast genome. A number of polymorphisms exist between the mt+ and mt- alleles, only one of which is in the coding region, resulting in an Ile in mt+ and a Thr in mt- at position 69, a poorly conserved region of the protein.
Gene Pr46b is represented by a single EST in the Kazusa collection (AV626473); this was sequenced to determine the positions of the two introns and the 3' end. One EST from the Chlamydomonas Genome Project collection confirmed the 3' end, and a second includes additional 5' sequence. The predicted Pr46b protein of 267 amino acids shows 30% identity to a human cDNA (GenBank no. AK023156) and its mouse homolog (GenBank no. AK006639), of unidentified function. Again there are polymorphisms between the mt+ genomic sequence and the mt- cDNA: The six changes in the sequenced coding regions are all synonymous, suggesting that the gene is under selection.
Remarkably, the Pr46a and Pr46b mRNAs also overlap, in this case by 1005 bp: The 3' end of one message is within the last intron of the other gene and vice versa (Fig 3). The 3' UTR of each message overlaps part of the 3' UTR and part of the coding region of the other, but there is no overlap of their coding regions.
Region f:
In a previous publication we documented that the Mid gene, marked by the mid-1 mutation, resides in region f, which is flanked by segments 3 and 4 and unique to the mt- R domain (![]()
![]()
Gene Mtd1:
Region d is a single-copy sequence found only within segment 4 of the mt- locus (![]()
|
Whatever function the Mtd1 protein provides to mt- gametes, it cannot be essential in the laboratory, since mt+ gametes transformed with the Mid gene can mate as minus and produce meiotic progeny with a mt+ partner even though the Mtd1 gene is absent from both parents (![]()
![]()
![]()
Genes Mta1, Mta2, Mta3,
-Mta2, and
-Mta3:
Region a was originally defined (![]()
|
Eleven probes (111121 in Fig 5) were used in the Northern analysis. Of those flanking the duplicated a region, probes 111 and 112 detect the same 6.5-kb RNA, probe 119 detects a 3.0-kb RNA, and probe 120 detects a 0.9-kb RNA. However, since these signals are present in mt- lanes as well, we interpret them to be false positives.
Probe 118, which lies within the duplicated a region, detects a 1.8-kb message at all life-cycle stages analyzed. The entire probe has been sequenced, and guided by EST matches we found that this message derives from a gene we call Mta3. The gene (GenBank no.
AF309495) has one intron and encodes a predicted gene product of 166 amino acids, with a molecular weight of 18.5 kD, an isoelectric point (pI) of 11.3, and no homologs in the database. Since the Mta3 sequence lies within the duplicated region, the Mta3 ESTs from the vegetative plus library could have originated either from the autosomal copy or from the copy in the mt+ locus. However, the three ESTs analyzed all contain sequence polymorphisms specific to the autosomal copy, suggesting that the mt+ copy may not be transcribed. This inference is supported by the finding that the mt+ copy carries a mutation that deletes the intron 5' splice site so that an alternative splice junction would have to be used for the mt+ Mta3 gene to be functional. Our working assumption, therefore, is that the mt+ copy of Mta3 is a pseudogene,
-Mta3, and that the expressed Mta3 gene is autosomal.
Probe 117, which also lies within the duplicated a region, detects two messagesone 2.2 kb and one 0.8 kbboth of which are absent from vegetative cells, present in gametes and 30-min zygotes, and at reduced levels in 3-hr zygotes (Fig 2D). The 2.2-kb species is present in gametes of both mating types, whereas the 0.8-kb species is present in plus gametes only (Fig 2D). cDNA clones that correspond to each have been isolated.
The smaller 0.8-kb message derives from a gene we call Mta1, which is present in the mt+ copy of the a region but absent from the autosomal copy. The Mta1 gene is expressed in mt+/mt- diploid gametes (Fig 2E), indicating that its expression is not repressed by the Mid protein (diploids differentiate as minus gametes; ![]()
![]()
The Mta1 gene encodes a predicted 126-amino-acid protein, Mta1 (Fig 6), of 14.6 kD, pI 7. Its C terminus is predicted to adopt a coiled-coil motif, generating BLAST matches to proteins such as lamin B. Amino acids 48102, the main components of the coiled-coil domain, comprise five imperfect repeats of an 11-amino-acid sequence (Fig 6). A strikingly similar 11-amino-acid repeat domain is found in the ROPE protein of Plasmodium chabaudi (![]()
|
The larger message detected by probe 117 derives from a gene that is expressed from the autosomal a region in gametes of both mating types but not vegetative cells. As reported elsewhere (![]()
![]()
When the autosomal and mt+ genomic sequences are compared, it becomes clear that the Mta1 coding region has been inserted into the Mta2 gene in the mt+ locus (Fig 5): The promoter region, the 5' UTR (and its intron), and the first nine codons of Mta1 correspond to the Mta2 sequences in the autosome, after which the two sequences diverge completely, with the rest of the Mta1 sequence being totally unrelated to the autosomal Mta2 sequence. Downstream of the 3' end of the Mta1 gene, Mta2 sequences pick up again: Although most of the second Mta2 exon and a portion of its second intron are missing, the remainder of the gene is present. Since these Mta2 sequences are not included in the Mta1 transcript, this means that transcriptional termination signals downstream of the Mta1 gene prevent expression of the adjacent Mta2 sequences. We therefore designate this region as
-Mta2.
Table 2 shows the level of homology between Mta2 and
-Mta2 and between Mta3 and
-Mta3. The density of codon and noncodon differences is comparable in the two gene pairs, consistent with the possibility that the two pseudogenes were created at a similar time during C. reinhardtii evolution.
Insertions in the a region:
As detailed in the DISCUSSION, the configuration of the a sequences in the mt+ locus is most readily explained by proposing that the Mta1 gene transposed into the region, thereby inactivating the resident Mta2 gene and creating
-Mta2. The presence of three insertions between the
-Mta2 and
-Mta3 sequences (Fig 5), which may have participated in Mta3 inactivation, offers additional evidence of transpositional activity in the region.
The first insertion is a 1278-bp sequence related to the TOC2 element described by ![]()
The second insertion is a 249-bp sequence that resembles the 12-kb Gulliver transposon (![]()
![]()
The third insertion is a 361-bp sequence with a direct repeat of 34 bp at each end (1-bp mismatch). There is no unambiguous target-site duplication and no homology to previously characterized Chlamydomonas transposons.
The Ezy2 gene cluster:
An obvious structural difference between the mt+ and mt- locus is a 16-kb DNA sequence tandemly repeated six to eight times in segment 3 of the mt+ R domain (Fig 1). This sequence is found in the mt- locus as a single copy, split in two, a portion resident at the end of segment 3 and the remainder resident in the C domain (![]()
To determine whether gene(s) are located within the 16-kb element, an mt+ genomic clone of the repeat unit was used to probe Northern blots. No signals were detected using vegetative or gametic samples, whereas a single 3.9-kb mRNA was detected in the 1-hr zygote sample (Fig 2F). A cDNA library generated from 1-hr zygotes was also screened with the probe, and one full-length cDNA was recovered and sequenced. An open reading frame of 3078 bp defines the unit gene, hereafter called Ezy2 (Early zygote 2; Fig 7). A genomic copy was also sequenced, which showed polymorphisms in its 3' UTR sequence to the full-length cDNA. Additional partial cDNAs were also characterized, some displaying polymorphisms to the full-length clone, suggesting that several, and perhaps all, of the Ezy2 repeats are transcribed.
|
The predicted Ezy2 polypeptide is shown in Fig 8. It displays a putative 42-amino-acid chloroplast transit peptide (Fig 8, boxed): An alanine follows the initiator methionine and the N-terminal region displays a high content of valine, alanine, and serine, albeit there are fewer arginines than expected for a transit peptide (![]()
![]()
![]()
|
Fig 9A shows the pattern of Ezy2 expression during zygote development as monitored by RNase protection assays. The message appears almost immediately after zygote formation, peaks at 30 min, is greatly reduced by 2 hr, and is undetectable by 4 hr into zygote development. By comparison, Ezy1 expression peaks later (Fig 9A), as does the expression of most other zygote-specific genes (![]()
![]()
![]()
![]()
|
To determine whether the bisected copy of Ezy2 in the mt- locus is expressed, a mating was performed between a normal minus strain and a mid-1 mt- strain transformed with the Fus1 gene [mid-1 mt- (Fus1)]. The mid-1 mutant, lacking a functional Mid gene, differentiates as plus and, when transformed with Fus1, is able to mate with minus gametes and form apparently normal zygotes (![]()
-Ezy2. As a control, RNase protection was also performed using the Ezy1 sequence, a gene tandemly repeated in both the mt+ and mt- loci (Fig 1 and ![]()
The mt+ Ezy2 gene is
6 kb, with a contiguous "spacer" of
10 kb, meaning that the repeats in mt+ segment 3 span
100140 kb. The gene has one intron in the 5' UTR and seven introns in the coding region (Fig 7). The 214-amino-acid internal repeat is encoded by exons 3 and 4 (Fig 7). The first internal repeat is 829 bp and the second is 842 bp, the length differences created by three insertions/deletions (indels) in the intervening intron. The introns are otherwise identical, and one synonymous codon difference is found between the duplicated exons. Restriction analysis indicates that the internal repeat is present in all the mt+ Ezy2 copies.
The genomic sequence of
-Ezy2 was also determined. Whereas the restriction maps of the mt+ versions of Ezy2 are very similar, the restriction maps of Ezy2 and
-Ezy2 share few common sites (![]()
-Ezy2 sequence, the spacer domain has been truncated at a downstream position, the missing portion now being located in the C domain (Fig 7). The most obvious difference between the coding regions of Ezy2 and
-Ezy2 is that
-Ezy2 lacks exon 4 and hence the internal direct repeat (Fig 7). In addition, a frameshift at the 5' end of
-Ezy2 shifts the location of the first candidate initiator methionine to a more downstream position (Fig 7), and numerous nucleotide differences and indels have accumulated throughout the two versions of the gene (Table 2).
Zygote development in the absence of a mt+ locus:
As noted earlier, because the mid-1 mt- mutant lacks a functional Mid gene, it expresses plus gametic traits; moreover, if it has been transformed with the Fus1 gene from the mt+ locus, it is able to fuse with mt- gametes, generating zygotes that have two copies of the mt- version of chromosome VI. These zygotes are apparently able to mature and germinate normally, indicating that the program for zygote development does not require genes such as Mta1 or Ezy2 that are exclusively encoded in the mt+ locus.
We went on to ask whether the uniparental transmission of plus cpDNA is affected in these zygotes. Table 3 compares the transmission patterns of chloroplast markers in control crosses and crosses in which a mid-1 mt- (Fus1) strain served as the plus parent. Inheritance of chloroplast traits is seen to be biparental in the crosses involving the strain lacking an mt+ chromosome.
|
| DISCUSSION |
|---|
Coding capacity of the MT locus:
The rearranged R domain of the C. reinhardtii MT locus as well as the flanking T and C domain sequences that are under recombinational suppression are shown to contain genes that are expressed throughout the life cycle of the organism as well as genes expressed exclusively during the gametic or the zygotic phases of the life cycle. Although a comparable transcription map has not yet been generated for other regions of the C. reinhardtii genome, this distribution of genes is what one would expect if an ordinary chromosome had undergone large-scale rearrangements and had also gained a few gene sequences in one homolog but not the other. Large-scale rearrangements are found in the mouse T locus, which includes genes affecting male fertility (![]()
![]()
![]()
![]()
![]()
![]()
![]()
A particular goal of this study was to ascertain whether the four large segments of rearranged DNA in the R domain contain active genes or are instead noncoding structural elements, as is the case, for example, for most of the mammalian Y chromosome (![]()
![]()
![]()
![]()
It is widely assumed that one of the functions of meiotic recombination is to promote genomic integrity, and it has been demonstrated that chromosomes prevented from engaging in meiotic recombination are subject to deterioration, a model for the ontogeny of XY differentiation (![]()
![]()
MT-unique sequences:
We also examined closely six regions of the MT locus that are found in one chromosome but not the other; these are hereafter referred to as MT-unique sequences. We were unable to detect any genes in two of theseregion b in mt+ and region e in mt- (see data at http://www.genetics.org/supplemental)albeit it is of interest that region b is duplicated, in inverted orientation, at a site 1 cM telomere-proximal to the mt+ locus (![]()
- Region a in mt+ contains the gene Mta1 that is expressed in plus gametes only. The Mta1 protein is predicted to contain a leucine-histidine zipper and is of unknown function.
- Region c in mt+ contains the Fus1 gene, encoding the Fus1 protein, that is expressed in plus gametes only and is necessary for plus-mediated gametic cell fusion (
FERRIS et al. 1996 ).
- Region d in mt- contains the Mtd1 gene that is expressed in minus gametes only. The predicted Mtd1 gene product is a putative triple-span membrane protein with putative extracellular N-glycosylation sites. When mt+ gametes are transformed with the Mid gene, which causes them to differentiate as minus, their flagellar agglutination is strong but their cell fusion is very slow and erratic. Since such gametes lack the Mtd1 gene, there is a pleasing symmetry to the possibility that regions c and d might contain genes Fus1 and Mtd1 that code for plus and minus cell-fusion proteins, respectively. The Mtd1 sequence shows no homology to known membrane-fusion motifs, so if it proves to participate in membrane fusion it may do so by a novel mechanism.
- Region f in mt- contains the Mid gene that is expressed in minus gametes only. The Mid protein is necessary for minus gametic differentiation (
FERRIS and GOODENOUGH 1997 ). Whereas there would be a pleasing symmetry in the postulate that the Mta1 protein is necessary for plus gametic differentiation, this is ruled out by the ability of mid-1 and mid-2 mutants to differentiate as plus gametes in the absence of an Mta1 gene.
In addition to these four genes, the Ezy2 gene is MT-unique as well, being expressed from the mt+ locus only. It differs from the four genes above in three respects: It is present in multiple tandem copies; its expression is initiated in the zygote rather than in the gamete; and it is not strictly unique to the mt+ locus in that a nonexpressed Ezy2 pseudogene is located in the mt- locus.
Codon bias:
The first two genes to be sequenced from the C. reinhardtii MT locus were Fus1 (![]()
![]()
![]()
Chloroplast DNA inheritance:
During the first 2 hr of zygote maturation in C. reinhardtii, cpDNA derived from the mt- parent is normally degraded by nuclease digestion whereas cpDNA from the mt+ parent is preserved and later selectively replicated (![]()
![]()
![]()
![]()
![]()
![]()
The mid-1 mt- (Fus1) x mt- cross is not lethal, but neither is it normal: cpDNA is inherited biparentally (Table 3), suggesting that the missing mt+ chromosome is somehow necessary for the selective destruction of minus cpDNA in the zygote. For example, if the mt+-encoded Ezy2 protein participates in cpDNA destruction and is selectively targeted to minus chloroplasts in the zygote (perhaps because minus chloroplasts carry specific receptors for Ezy2 translocation; cf. ![]()
Taken together, the results available at present are best explained by proposing that the mt+ locus encodes both a protection function and a destruction function, with Mta1 being a candidate participant in protection and Ezy2 in destruction. Both of these functions would be operative in the heterozygous-diploid crosses of ![]()
Evolutionary history of the MT locus:
A common way to model the evolution of separate sexes (heterothallism, dioecy) is to start with a self-fertile (homothallic, monoecious) ancestor and propose steps that would lead to self-sterility (e.g., ![]()
![]()
To model the subsequent "invasion" of the homothallic population by these two chromosomes, one can invoke the benefits of outcrossing as driving the process. Alternatively, or in addition, one can invoke positive selection for advantageous genes linked to the proto-mt loci and propose that the linkage would come to be buttressed by recombinational suppression (![]()
![]()
![]()
![]()
![]()
![]()
We can now consider possible origins of the MT-unique genes, using Fus1 as an example. There are two possibilities: Either Fus1 originally happened to reside in the proto-MT region of chromosome VI and was subsequently lost from the proto-mt- chromosome or it was originally autosomal and then moved into the






