| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Corresponding author: Gertrud Lund, Department of Plant Biology, Thorvaldsensvej 40, DK-1871 Frederiksberg C, Denmark., gel{at}kvl.dk (E-mail)
Communicating editor: J. BIRCHLER
| ABSTRACT |
|---|
This study investigates the prevalence of CG and CNG suppression in single- vs. multicopy DNA regions of the maize genome. The analysis includes the single- and multicopy seed storage proteins (zeins), the miniature inverted-repeat transposable elements (MITEs), and long terminal repeat (LTR) retrotransposons. Zein genes are clustered on specific chromosomal regions, whereas MITEs and LTRs are dispersed in the genome. The multicopy zein genes are CG suppressed and exhibit large variations in CG suppression. The variation observed correlates with the extent of duplication each zein gene has undergone, indicating that gene duplication results in an increased turnover of cytosine residues. Alignment of individual zein genes confirms this observation and demonstrates that CG depletion results primarily from polarized C:T and G:A transition mutations from a less to a more extensively duplicated gene. In addition, transition mutations occur primarily in a CG or CNG context suggesting that CG suppression may result from deamination of methylated cytosine residues. Duplication-dependent CG depletion is likely to occur at other loci as duplicated MITEs and LTR elements, or elements inserted into duplicated gene regions, also exhibit CG depletion.
IN many organisms, nuclear DNA is methylated at cytosine residues, resulting in 5-methylcytosine (5mC). In plants, symmetrical 5'-CpG-3' (CG) and 5'-CpNpG-3' (CNG) are the most frequent targets of cytosine methylation, whereas in mammals 90% of methylation is restricted to the CG dinucleotide (![]()
![]()
![]()
![]()
28% of cytosine residues are methylated compared to only 6% in Arabidopsis (![]()
![]()
![]()
![]()
CG suppression, or depletion, refers to the underrepresentation of the CG dinucleotide compared to an estimated value based on the G + C content of the sequence investigated. CG suppression is especially evident in the mammalian genome where the frequency of the CG dinucleotide can be up to fivefold lower than the expected value (![]()
![]()
![]()
![]()
The most commonly explained mechanism of CG depletion relates to the tendency of 5mC to undergo spontaneous deamination to thymidine, resulting in C:T or G:A transition mutations (![]()
![]()
![]()
![]()
![]()
Although the majority of methylation in plants is associated with repetitive DNA sequences such as transposons, duplicated gene regions can also be methylated (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
In mammals, duplicated genes are more CG suppressed compared to single-copy genes. This observation has led to the suggestion that duplicated genes have a history of methylation and subsequent mutation of methylated residues (![]()
![]()
![]()
The zeins constitute 5060% of total endosperm protein and can be divided into two major fractions, zein-1 and zein-2, on the basis of solubility characteristics (![]()
-zeins. On the basis of DNA sequence identity and hybridization characteristics, this family can be further divided into four subfamilies, z1A, z1B, z1C, and z1D (![]()
![]()
![]()
![]()
-zein genes can show considerable variation among different inbred lines (![]()
![]()
![]()
![]()
![]()
-zeins (![]()
![]()
![]()
The majority of 22-kD zein genes form a dense gene cluster,
168 kb in size, on chromosome 4 of maize (![]()
![]()
![]()
![]()
![]()
![]()
-zein genes has revealed that the 19- and 22-kD zein genes share a common ancestor (![]()
![]()
-zein gene family in maize occurred within the last 34 million years (![]()
![]()
In contrast to the clustered zein genes, MITEs and LTR elements are dispersed in the genome. MITEs are frequently associated with promoter and 3' regulatory regions of genes, whereas the larger LTR-transposons are typically found in intergenic regions (![]()
![]()
![]()
![]()
![]()
![]()
Our analysis shows that duplicated zein genes are CG suppressed and that the degree of suppression correlates with the copy of each subfamily. Likewise, within the 19- and 22-kD zein gene families the extent of CG depletion correlates with the number of duplications each individual gene has undergone. Despite their high copy number, most MITEs and LTR elements are not CG suppressed, except when duplicated or located in a duplicated DNA sequence. This suggests that the process leading to CG depletion is activated upon duplication or is a consequence of the duplication process itself. We discuss the possible role of duplication-dependent CG depletion in the evolution of the GC-poor isochores in which the zein genes are located.
| MATERIALS AND METHODS |
|---|
Sequences:
The di-and trinucleotide composition of the coding region of 32 zein genes belonging to the zein-1 fraction and 6 genes belonging to zein-2 fraction was analyzed. All 22-kD zein genes were derived from the inbred line BSSS53 (af090447), whereas the 19-kD gene sequences were derived from the B73 inbred line (af546187, af546188; af546189, af546190; ![]()
![]()
![]()
![]()
![]()
![]()
![]()
CG analysis:
To measure the extent of repression of a given di- or trinucleotide, a score
was calculated by the formula
= O/E, where O and E denote the observed and expected counts, respectively. Overall expected counts for di- and trinucleotides were calculated by multiplying the observed counts of each nucleotide and dividing the product by the total number of nucleotides found in the sequence. Position-dependent expected counts were calculated assuming the absence of any codon bias. The positions of the 5' and 3' di- or trinucleotides relative to codon triplets are indicated by roman numerals; e.g., I-II denotes a dinucleotide including the first two nucleotides of a codon triplet. For the CG dinucleotide, position-dependent expected counts were calculated as follows: I-II = 2/3 of arginine-specifying codons; II-III = the sum of 1/6 of serine-, 1/4 of proline-, threonine-, and alanine-specifying codons; III-I = NNC x GNN/T; N represents any nucleotide and T represents the total number of triplets in the sequence. In the case of CNG trinucleotide, position-dependent expected counts were calculated as follows: I-III = the sum of 1/6 of arginine- and leucine-, 1/4 of proline-, and 1/2 of glutamine-specifying codons; II-I = NCN x GNN/T; and III-II = NNC x NGN/T.
Alignment between members of the
-zein gene family:
Pairwise alignments were conducted of individual expressed members of the 19- and 22-kD zein genes by GAP analysis (GCG analysis software package). For each alignment, the percentage of C:T and G:A transition mutations was compared to the total number of single-base-pair point mutations. To establish if transition mutations occur in a polarized fashion, i.e., from a younger to an older duplication, or from a gene that has undergone less to more duplications, transition mutations of each gene were counted in individual alignments. Importantly, only transition mutations occurring in a CG or CNG context were considered.
Statistical analysis:
Differences between O and E values were tested using the chi-square analysis. To test for differences in
-values between groups, the Mann-Whitney U-test was employed. All statistical tests were performed using the STATISTICA software package for Macintosh (StatSoft, Tulsa, OK).
Bisulfite analysis:
Genomic DNA was extracted from young leaf tissue of the inbred lines W64A and W22 using the DNAeasy kit (QIAGEN, Valencia, CA). The W64 and W22 inbred lines contain the Tourist element located in the 5' flanking region of the single-copy or duplicated 27-kD zein gene, respectively (![]()
![]()
| RESULTS |
|---|
CG and CNG analysis of the zein genes:
Table 1 shows
-values (observed/expected) of CG dinucleotides and CNG trinucleotides of genes belonging to the zein-1 and zein-2 fractions. All 19-kD zein genes analyzed were isolated from the B73 inbred line, whereas the 22-kD zein genes were derived from the BSSS53 inbred line (![]()
![]()
-average was calculated of both zein fractions, of the 19- and 22-kD zein genes, and of each of the three 19-kD zein subfamilies, z1A, z1B, and z1D. The zein-1 fraction, representing the multicopy
-zeins has a CG average of 0.40 (P < 0.001), whereas the
-average of single-copy zein genes of zein-2 fraction is 0.75 (P < 0.001). Indeed, the zein-1 fraction is more suppressed than the zein-2 fraction (P < 0.001). Furthermore, the average GC content of the zein-1 fraction is 48% compared to 66% of the zein-2 fraction (results not shown), indicating that CG suppression is accompanied by a decrease in G + C (GC) content. In contrast, none of the zein fractions are suppressed at the CNG trinucleotide.
|
It can also be observed that the degree of suppression varies between subfamilies of the zein-1 fraction. The more abundant 19-kD zein genes are the more CG suppressed compared to the less abundant 22-kD zein genes (0.34 and 0.49, respectively; P < 0.001) and, likewise, within the 19-kD gene family the degree of suppression is associated with the copy number of each subfamily. The z1A subfamily, which has the highest copy number, is more suppressed compared to the less abundant z1B and z1C subfamilies (P < 0.009 and P < 0.027, respectively). We also analyzed 18 19-kD genes from different genetic backgrounds and found no differences in the average CG and CNG scores (results not shown).
The amino acid content of zein-1 and zein-2 fractions differs considerably, which could potentially influence overall CG and CNG scores. For example, the
-zeins are particularly rich in glutamine, leucine, proline, and alanine, whereas the single-copy
-zeins have a high content of methionine. To address this problem, CG and CNG frequencies were analyzed in a position-dependent context (Table 2). Position-dependent frequencies correct for differences in amino acid content but not for amino acid codon bias. Essentially, position-dependent frequencies of the zein genes largely reflect overall CG and CNG frequencies. The multicopy
-zeins are suppressed at positions II-III and III-I (Table 2;
= 0.41 and 0.55, respectively; P < 0.001); in contrast, the single-copy zein genes are not CG suppressed at any position. This implies that the low overall CG score observed for the 10-kD zein gene, m23537 (Table 1), is related to the amino acid content of this gene. Within the zein-1 fraction, the 19-kD zein genes are CG suppressed at positions II-III and III-I (P < 0.001). The z1A subfamily is more suppressed than the z1B subfamily at position III-I, whereas the opposite is true of position II-III (P < 0.002 and P < 0.025, respectively). The 22-kD zein genes are also CG suppressed at position II-III (
= 0.52; P < 0.001), but to a lesser degree than the 19-kD zein genes (
= 0.33; P < 0.001). The CNG trinucleotide is suppressed only at position I-III of the zein-1 fraction (
= 0.60; P < 0.009; results not shown) and, again, the 19-kD zein genes are more suppressed compared to the 22-kD zein genes at this position (P < 0.001).
|
A recent analysis of the B73 and BSSS53 inbred lines has shown that only a relatively small number of
-zein genes are expressed (![]()
![]()
![]()
![]()
![]()
![]()
-zein genes have been assessed in the B73 inbred line (![]()
= 0.71). This is also true within the 22-kD zein gene family, and a similar tendency was observed for the 19-kD zein gene family (results not shown). This indicates that highly expressed genes are more CG suppressed compared to low expressing genes.
To understand if CG suppression is a general effect of a specific chromosomal region, the CG content of intergenic regions of the 22-kD zein gene cluster located on chromosome 4S was analyzed. The lengths of the 22-kD zein intergenic regions varied from 2517 to 14,438 bp. The
-average CG of the intergenic region was 0.68, which is significantly higher than the
-average of 0.50 of the 22-kD genes (P < 0.001).
CG analysis of MITEs and LTR-retrotransposons:
The observed copy number variation in CG suppression prompted us to investigate whether the high-copy-number LTR-retrotranposons and MITEs exhibited similar behaviors. Three MITE families (Tourist, Stowaway, and Heartbreaker) and three groups of LTR-retrotransposon (Ty1-copia, Ty3-gypsy, and an unclassified group), differing in element copy number between and within each group, were analyzed for evidence of CG suppression. The
-value and G + C content was calculated for MITEs and LTR elements in different sequence contexts (Table 3 and Table 4, respectively). Despite the high copy number of MITEs and LTR elements, no association was found between the degree of suppression and element copy number. Most transposons were not, or were only slightly, CG suppressed. For example, the
-average of Tourist and Heartbreaker families was 0.66 and 0.60 (P < 0.041 and P < 0.018), respectively, whereas no suppression was observed of Stowaway family or LTR elements. However, large differences in CG suppression were observed of both MITEs and LTR elements (
ranging from 0.16 to 1.46 and 0.26 to 1.12, respectively). We found that the copy number of the insertion site could largely explain the variation in
; i.e., elements inserted into multicopy gene regions were more CG suppressed than elements inserted into single-copy genes. This is nicely illustrated by the
-values of Stowaway found 3' of the single-copy 10- and 27-kD zein genes and the multicopy 22-kD zein genes (0.66, 0.96, and 0.25, respectively; Table 3B; ![]()
-values of Tourist and Stowaway elements located 5' and 3', respectively, of the 27-kD zein genes are indicated (x53514 and x56118; Table 3A and Table 3B). x56118 represents an allele of a tandem duplication of a 27-kD zein gene (![]()
![]()
-value of a single-copy or duplicated Tourist element located in the 3' region of the Adh1 locus (x17556 and x04049, respectively; Table 3A; ![]()
= 1.46), whereas the tandemly duplicated element is severely CG suppressed (
= 0.16). Again, CG suppression of the duplicated element is accompanied by a 4% decrease in G + C content compared to the single-copy insertion. LTR regions also exhibit severe CG suppression upon duplication or if located in a duplicated DNA region (Table 4). For example, CG suppression is observed of x58700, a Hopscotch-like transposon inserted in the promoter region of a multicopy 19-kD zein gene (![]()
![]()
|
|
An average
-value was calculated of elements inserted into single- vs. multicopy gene regions or tandemly duplicated elements of selected data points (Table 3 and Table 4; labeled with superscript a and b, respectively). The criterion for selection of data points was knowledge of copy number of both element and insertion sequence. In addition, only clustered multicopy genes were represented in the multicopy group. Therefore, the Gpc4 gene, which belongs to a small dispersed multigene family, was analyzed as a single-copy insertion (![]()
= 0.94), whereas tandemly duplicated elements or gene sequences were suppressed (
= 0.33; P < 0.020). Furthermore, the
-average of MITEs and LTR elements inserted in single-copy gene regions was higher than the average value of tandemly duplicated elements and elements inserted into duplicated DNA regions (P < 0.004). Simply analyzing
-values of MITEs and LTR elements located in single- vs. multi-copy gene regions produced results identical to the selected data set, the latter being less significant (P < 0.009).
CG depletion as a function of time or gene duplication:
If CG suppression results from passive deamination of methylated residues, the extent of depletion should reflect the number of years a sequence has been methylated. Table 5A shows a comparison between
, the estimated time of insertion, and the number of duplications of the seven expressed 22-kD zein genes clustered on chromosome 4S. Similarly, the estimated times of retrotransposon insertion at the Adh1-F locus has been compared to CG frequencies (Table 5B). Neither CG nor CNG suppression correlated with time of insertion of the 22-kD zein genes or of the LTR-retrotransposons. However, the 22-kD zein genes showed an inverse correlation between CG content and the number of duplications each gene has undergone (r = 0.85; P < 0.014). This was found to be specific of the CG dinucleotide. Similarly, analysis of the expressed 19-kD zein genes isolated from the B73 cluster also showed that the degree of CG suppression correlated with the extent of duplication (r = 0.52; P < 0.05; n = 14; results not shown).
|
Sequence divergence of duplicated DNA sequences:
To identify the fate of CG dinucleotides, pairwise alignments of the seven expressed zein genes was conducted (Table 5). For each of the 21 alignments, the total number of C:T and G:A transition mutations and the total number of transition mutations occurring in a CG or CNG context were counted and compared to the total number of point mutations (results not shown). In addition, transition mutations occurring in a CG or CNG context were counted for each gene in the pairwise alignments. Transition mutations were the most common point mutations observed, representing on average 61% of total point mutations, and the vast majority occurred in a CG or CNG context (average 74%). In 7/21 alignments performed, it was possible to distinguish if CG depletion resulted from the time or extent of duplication, whereas the remaining alignments were noninformative. The informative alignments included zp22/D87 and azs22;8, azs22;10 and azs22;4, azs22;10 and azs22;14, azs22;12 and zp22/6, zp22/6 and azs22;14, azs22;12 and azs22;10, and zp22/6 and azs22;14. Four of these alignments showed that transition mutations occurring in CG context were consistent with CG suppression resulting from duplication, whereas none were consistent with the time of amplification. For example, 11 transition mutations (occurring in a CG context) were observed from azs22;12 to zp22/6, whereas only 7 were observed in the opposite direction. This is in agreement with duplication-dependent CG suppression as zp22/6 has undergone a larger number of duplications compared to azs22;12 (six vs. four, respectively). If CG suppression were the result of time-dependent deamination of cytosine residues, an equal number of polarized transition mutations would have been expected given the fact that these genes have both amplified 0.6 MYA. In contrast, no association between transition mutations occurring at the CNG trinucleotide and duplication was found for the informative alignments. Interestingly, 5 of the informative alignments did show a time-dependent decrease in CNG content. For the noninformative alignments, the majority of transition mutations observed at CG dinucleotides (12/14) also occurred in a polarized fashion from a less to a more duplicated gene (or from a younger to an older duplication). However, only 6/14 alignments showed a similar behavior at the CNG trinucleotide. These results support the prior observation that only CG suppression correlates with the extent of duplication.
Alignments were also performed between genes belonging to the 19-kD zein subfamilies, z1A, z1B, and z1D, which have undergone an average of eight, six, and four duplications, respectively. Two expressed genes representative of the z1A subfamily (Z448F14-3 and Z448F14-4) were aligned to all the expressed genes of z1B and z1C. In 16/18 alignments, a higher number of transition mutations at CG dinucleotides were observed from the less duplicated z1B and z1C genes to the more extensively duplicated z1A genes. The same could be concluded by alignments of two z1B genes (Z492M16-1 and Z492M16-4) to the two expressed z1D genes. Of the 18 alignments performed, transition mutations represented on average 57% of total single base pair mutations and, of these, 48% occurred in a CG or CNG context.
Similar conclusions were found by aligning the duplicated 27-kD zein gene, x56118, to the single-copy 27-kD zein gene, zc2 (x53514). These genes exhibit 97% sequence similarity. Only nine point mutations were observed, six being C:T or G:A transition mutations. Again, transition mutations occurred in a polarized fashion from the single copy to the duplicated gene (four vs. two, respectively). However, only transition mutations observed from the zc2 to the x56118 allele were in a CG context (3/4). This pattern of mutation was also mirrored by alignment of the Tourist and Stowaway elements located in the 5' and 3' flanking region of the single-copy and duplicated 27-kD zein gene, respectively. Finally, alignment of the single-copy and duplicated Tourist element 3'of the Adh1 locus confirmed that C:T and G:A transition mutations are polarized (i.e., occur from the single to the duplicated MITE sequence).
The degree of sequence identity between individual MITEs varies between 46 and 88% (![]()
Methylation status of single- vs. multicopy genic regions:
C:T and G:A transition mutations are the presumed products resulting from deamination of methylated cytosines. Therefore, a possible explanation of the enhanced turnover of CG dinucleotides might be that duplicated sequences exhibit qualitative or quantitative differences in methylation compared to single-copy genes. To this end, we analyzed the methylation status of the Tourist element located in the single-copy or tandem duplication of the 27-kD zein gene (zc2 and x56118) by bisulfite sequencing. These elements were chosen because they exhibit small differences in CG and G + C content (see Table 3). The results showed that in both sequence contexts the MITE was hypermethylated at all CG dinucleotides, in addition to a proportion of methylated cytosines in a nonsymmetrical sequence context (see Fig 1). All 29 cytosine residues of the MITE located 5' of the zc2 coding region were methylated compared to 25/30 cytosine residues of the element inserted in the tandem duplication. In addition, quantitative analysis of 16 independent clones indicated that the MITE located in the duplicated 27-kD zein gene showed 19 and 37% reduction in symmetrical and asymmetrical methylation, respectively, compared to the element located in the single-copy gene.
|
| DISCUSSION |
|---|
Our results show that during a short evolutionary time span, individual
-zein genes have accumulated large variations in CG content. In particular, within the 19- and 22-kD zein gene families, extensively duplicated genes are more CG depleted compared to less duplicated genes. In addition, the 19-kD zein genes are more suppressed than the 22-kD zein genes, indicating that the former have undergone a greater number of gene duplications. This is supported by the fact that the 19-kD genes are more abundant compared to the 22-kD genes (![]()
![]()
![]()
Duplication-dependent depletion of the
-zein genes results largely from C:T and G:A transition mutations in a CG or CNG context. Similarly, elevated levels of transition mutations have also been identified in other multigene families in plants such as the GAPA and rDNA gene families of maize and the 5S RNA genes from Arabidopsis (![]()
![]()
![]()
![]()
![]()
-zein gene family, as the most highly expressed genes are the most CG depleted. We speculate that the high expression levels of the most CG-suppressed
-zeins are caused by the lack of CG dinucleotides available for methylation, thus relieving methylation-mediated transcriptional repression. Indeed, the fact that
-zein genes are methylated in both the coding and noncoding regions and exhibit an inverse relation between CG methylation and expression lends support to this idea (![]()
![]()
![]()
![]()
![]()
The in vivo rate of deamination of methylated cytosine residues in plants is not known, whereas in mammals the estimated half-life of a cytosine residue is between 24 and 60 million years (![]()
![]()
![]()
![]()
![]()
If duplicated sequences are methylated compared to their single-copy counterparts, an increase in methylation-related deaminations might be expected, subsequently resulting in a reduction in CG content. However, bisulfite analysis of a MITE inserted in the single-copy or duplicated 27-kD zein gene locus showed that the element was methylated in both sequence contexts. In addition, as most transposons are hypermethlyated (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
An alternative explanation for the observed differences in CG content is that selective pressures differ between chromosomal regions, which could result in different mutation rates of CG dinucleotides. For example, this might explain why the 22-kD zein genes, which map to chromosome 4S, are suppressed, whereas no suppression of DNA sequences that map to the Adh1-F chromosomal region was observed. However, analysis of intergenic and LTR regions of retrotransposons, located in the 340-kb 22-kD zein gene cluster, showed that CG suppression was localized to zein gene regions and was not a common feature of this chromosomal region. That CG suppression is independent of chromosomal content is also supported by the fact that many sequences analyzed exhibit large variations in CG despite being located in identical genomic regions, e.g., Tourist and Stowaway elements located in the noncoding regions of the single-copy or duplicated 27-kD zein gene or the single or duplicated Tourist element located in the 3' region of the Adh1 locus.
A study of 101 maize genes has shown that 40% of codon-usage variation is due to a bias toward G or C vs. A or U ending codons (![]()
-zeins have low GC3 values. We found that CG dinucleotides of the
-zeins were suppressed at positions II-III and III-I, whereas the single-copy genes showed an excess of CG at these positions. This is interesting as CG suppression at position III-I seems to be specific of methylating species, whereas nonmethylating species show an excess of CG at this position (![]()
-zein genes reflect the fact that the
-zein genes have undergone extensive gene duplication and are not due to a bias in codon usage between the single- and multicopy zein genes.
Mammalian and plant genomes are made up of large regions of relatively homogeneous base composition known as isochores (![]()
![]()
![]()
-zeins and ribosomal genes that are located in GC-poor and GC-rich fractions, respectively (![]()
-zeins could be explained by duplication-dependent CG depletion. Given that the GC content and, in particular, the GC3 content of a gene is highly correlated to the overall GC content of the isochore in which it is located (![]()
![]()
![]()
We have shown that duplicated zein genes, LTR elements, and MITEs undergo specific changes in nucleotide sequence. These changes have been observed in CG dinucleotides and result in C:T and G:A transition mutations and a net reduction in GC content. Such a process is reminiscent of RIP in N. crassa, where duplications are de novo methylated and riddled with point mutations (![]()
![]()
From the detailed analysis of the 22-kD zein genes it is clear that the rate of CG depletion of duplicated sequences is enhanced compared to the average deamination of methylated residues of dispersed multicopy sequences. Further analysis will reveal if this mechanism is common to other duplicated gene families. Suppression of MITEs and LTR elements inserted in duplicated gene regions indicates that this might be the case. Interestingly, rapid responses to genome-wide duplication of genomes have been shown to occur in wheat and Arabidopsis allotetraploids. These changes include nonrandom alterations in methylation and gene silencing caused by methylation or gene loss (![]()
![]()
| ACKNOWLEDGMENTS |
|---|
The authors thank Mik Noordeweir for assisting in part of the sequence analysis, Angelo Viotti and Vincenzo Rossi for critical reading of the manuscript, and E. Linton for estimates of insertion time of the 22-kD zein genes. This work was supported by a grant from the Danish National Research Foundation.
Manuscript received February 28, 2003; Accepted for publication June 13, 2003.
| LITERATURE CITED |
|---|
ASHIKAWA, I., 2001 Gene-associated CpG islands in plants as revealed by analyses of genomic sequences. Plant J. 26:617-625.[Medline]
BARRY, C., G. FAUGERON, and J. L. ROSSIGNOL, 1993 Methylation induced premeiotically in Ascobolus: coextension with DNA repeat lengths and effect on transcription elongation. Proc. Natl. Acad. Sci. USA 90:4557-4561.
BENDER, J., 1998 Cytosine methylation of repeated sequences in eukaryotes; the role of DNA pairing. Trends Biochem. Sci. 23:252-256.[Medline]
BENDER, J. and G. R. FINK, 1995 Epigenetic control of an endogenous gene family is revealed by a novel blue fluorescent mutant of Arabidopsis. Cell 83:725-734.[Medline]
BENNETZEN, J. L., K. SCHRICK, P. S. SPRINGER, W. E. BROWN, and P. SANMIGUEL, 1994 Active maize genes are unmodified and flanked by diverse classes of modified, highly repetitive DNA. Genome 37:565-576.[Medline]
BERNARDI, G., 2000 Isochores and the evolutionary genomics of vertebrates. Gene 241:3-17.[Medline]
BERNARDI, G., B. OLOFSSON, J. FILIPSKI, M. ZERIAL, and J. SALINAS et al., 1985 The mosaic genome of warm blooded vertebrates. Science 228:953-958.
BIANCHI, M. W. and A. VIOTTI, 1988 DNA methylation and tissue-specific transcription of the storage protein genes of maize. Plant Mol. Biol. 11:203-214.
BIRD, A. P., 1980 DNA methylation and the frequency of CpG in animal DNA. Nucleic Acids Res. 8:1499-1504.
BUREAU, T. E. and S. R. WESSLER, 1992 Tourist: a large family of small inverted repeat elements frequently associated with maize genes. Plant Cell 4:1283-1294.
BUREAU, T. E. and S. R. WESSLER, 1994 Stowaway: a new family of inverted repeat elements associated with the genes of monocotyledonous and dicotyledonous plants. Plant Cell 6:907-916.[Abstract]
CAMBERERI, E. B., B. C. JENSEN, E. SCHABTACH, and E. SELKER, 1989 Repeat-induced G-C to A-T mutations in Neurospora. Science 244:1571-1575.
CARELS, N., A. BARAKAT, and G. BERNARDI, 1995 The gene distribution of the maize genome. Proc. Natl. Acad. Sci. USA 92:11057-11060.
CLAY, O., S. CACCIO, Z. ZOUBAK, D. MOUCHIROUD, and G. BERNARDI, 1996 Human coding and noncoding DNA: compositional correlations. Mol. Phylogenet. Evol. 5:2-12.[Medline]
COOPER, D. N. and M. KRAWCZAK, 1989 Cytosine methylation and the fate of CpG dinucleotides in vertebrate genomes. Hum. Genet. 83:181-188.[Medline]
COULOUNDRE, C., J. H. MILLER, P. J. FARABAUGH, and W. GILBERT, 1978 Molecular basis of base substitution hotspots in Escherichia coli.. Nature 274:775-780.[Medline]
DAS, O. P. and J. MESSING, 1987 Allelic variation and differential expression of the 27-kDa zein locus in maize. Mol. Cell. Biol. 7:4490-4497.
DAS, O. P., K. WARD, S. RAY, and J. MESSING, 1991 Sequence variation between alleles reveals two types of copy correction at the 27 kDa zein locus of maize. Genomics 11:849-856.[Medline]
EDWARD, S., I. V. BUKLER, and T. P. HOLTSFORD, 1996 Zea mays ribosomal repeat evolution and substitution patterns. Mol. Biol. Evol. 14:623-632.
ESEN, A., 1986 Separation of alcohol-soluble proteins (zeins) from maize into three fractions by differential solubility. Plant Physiol. 80:623-627.
EYRE-WALKER, A. and L. D. HURST, 2001 The evolution of isochores. Nat. Genet. Rev. 2:549-555.
FENNOY, S. L. and J. BAILEY-SERRES, 1993 Synonymous codon usage in Zea mays L. nuclear genes is varied by levels of C-ending and G-ending codons. Nucleic Acids Res. 23:5294-5300.
FLAVELL, R. B., 1994 Inactivation of gene expression in plants as a consequence of specific sequence duplication. Proc. Natl. Acad. Sci. USA 91:3490-3496.
FLAVELL, R. B., M. O'DELL, and W. F. THOMPSON, 1988 Regulation of cytosine methylation in ribosomal DNA and nculeolus organizer expression in wheat. J. Mol. Biol. 204:523-534.[Medline]
FRYXELL, K. J. and E. ZUCKERKANDL, 2000 Cytosine deamination plays a primary role in the evolution of mammalian isochores. Mol. Biol. Evol. 17:1371-1383.
GARDINER-GARDEN, M. and M. FROMMER, 1987 CpG islands in vertebrate genomes. J. Mol. Biol. 196:261-282.[Medline]
GARDINER-GARDEN, M., J. A. SVED, and M. FROMMER, 1992 Methylation sites in angiosperm genes. J. Mol. Evol. 34:219-230.
GOYON, C. and G. FAUGERON, 1989 Targeted transformation of Ascobolus immersus and de novo methylation of the resulting duplicated DNA sequences. Mol. Cell. Biol. 9:2818-2827.
GRUENBAUM, Y., T. NAVEH-MANY, H. CEDAR, and A. RAZIN, 1981 Sequence specificity of methylation in higher plant DNA. Nature 292:860-862.[Medline]
GRUENBAUM, Y., H. CEDAR, and A. RAZIN, 1982 Substrate and sequence specificity of a eukaryotic DNA methylase. Nature 295:620-622.[Medline]
HAGEN, G. and I. RUBENSTEIN, 1981 Complex organization of zein genes in maize. Gene 13:239-249.[Medline]
HEINDECKER, G. and J. MESSING, 1986 Structural analysis of plant genes. Annu. Rev. Plant Physiol. 37:439-466.
HOLSTEIN, M., M. S. GREENBLATT, K. RICE, T. SOUSSI, and R. FUCHS et al., 1994 Database of p53 gene somatic mutations in human tumors and cell lines. Nucleic Acids Res. 22:3551-3555.
JEDDELOH, J. A. and E. J. RICHARDS, 1996 mCCG methylation in angiosperms. Plant J. 9:579-586.[Medline]
JONES, P. A., W. M. RIDEOUT, J. C. SHEN, C. H. SPRUCK, and Y. C. TSAI, 1992 Methylation, mutation and cancer. Bioessays 14:33-36.[Medline]
KASHKUSH, K., M. FELDMAN, and A. A. LEVY, 2002 Gene loss, silencing and activation in a newly synthesized wheat allotetraploid. Genetics 160:1651-1659.
KIRIHARA, J., J. B. PETRI, and J. MESSING, 1988 Isolation and sequence of a gene encoding a methionine rich 10-kDa zein protein from maize. Gene 71:359-370.[Medline]
KOVARIK, A., R. MATYASEK, A. LEITCH, B. GAZDOVA, and J. FULNECEK et al., 1997 Variability in CpNpG methylation in higher plant genomes. Gene 201:25-33.
KRICKER, M. C., J. W. DRAKE, and M. RADMAN, 1992 Duplication-targeted DNA methylation and mutagenesis in the evolution of eukaryotic chromosomes. Proc. Natl. Acad. Sci. USA 89:1075-1079.
KUMAR, A. and J. L. BENNETZEN, 1999 Plant retrotransposons. Annu. Rev. Genet. 33:479-532.[Medline]
LEITE, A., L. M. M. OTTOBONI, M. L. P. N. TARGON, M. J. SILVA, and S. R. TURCINELLI et al., 1990 Phylogenetic relationship of zein and coixins as determined by immunological cross-reactivity and Southern blot analysis. Plant Mol. Biol. 14:743-751.[Medline]
LEUTWILER, L. S., B. R. HOUGH-EVANS, and E. M. MEYEROWITZ, 1984 The DNA of Arabidopsis thaliana.. Mol. Gen. Genet. 194:15-23.
LIU, C. N. and I. RUBENSTEIN, 1992 Molecular characterization of two types of 22 kilodalton
-zein genes in a gene cluster in maize. Mol. Gen. Genet. 234:244-253.[Medline]
LLACA, V. and J. MESSING, 1998 Amplicons of maize genes are conserved within genic but expanded and constricted in intergenic regions. Plant J. 15:211-220.[Medline]
LUND, G., P. CICERI, and A. VIOTTI, 1995 Maternal-specific demethylation and expression of specific alleles of zein genes in the endosperm of Zea mays L. Plant J. 8:571-581.[Medline]
MADLUNG, A., R. W. MASUELLI, B. WATSON, S. H. REYNOLDS, and J. DAVIDSON et al., 2002 Remodeling of DNA methylation and phenotypic and transcriptional changes in synthetic Arabidopsis allotetraploids. Plant Physiol. 129:733-746.
MALOISEL, L. and J. L. ROSSIGNOL, 1998 Suppression of crossing-over by DNA methylation in Ascobolus. Genes Dev. 12:1381-1389.
MATASSI, G., R. MELIS, K. C. KUO, G. MACAYA, and C. W. GEHRKE et al., 1992 Large-scale methylation patterns in the nuclear genomes of plants. Gene 122:239-245.[Medline]
MATIEU, O., Y. YUKAWA, M. SUGIURA, G. PIKARD, and S. TOURMENTE, 2002 5S rRNA genes expression is not inhibited by DNA methylation in Arabidopsis. Plant J. 29:313-323.[Medline]
MCCLELLAND, M., 1983 The frequency and distribution of methylatable DNA sequences in leguminous plant protein coding genes. J. Mol. Evol. 19:346-354.[Medline]
MITTELSTEIN-SCHEID, O., K. AFSAR, and J. PASZKOWSKI, 1994 Gene inactivation on Arabidopsis thaliana is not accompanied by an accumulation of repeat-induced point mutations. Mol. Gen. Genet. 244:325-330.[Medline]
MONROE, J. J., M. G. MANJANATHA, and T. R. SKOPEK, 2001 Extent of CpG methylation is not proportional to the in vivo spontaneous mutation frequency at transgenic loci in Big Blue rodents. Mutat. Res. 476:1-11.[Medline]
MONTERO, L. M., J. FILIPSKI, P. GIL, J. CAPEL, and J. M. MARTINEZ-ZAPATER et al., 1992 The distribution of 5-methylcytosine in the nuclear genome of plants. Nucleic Acids Res. 20:3207-3210.
PRAT, S., J. CORTADAS, P. PUIGDOMENECH, and J. PALAU, 1985 Multiple variability in the sequence of a family of maize endosperm proteins. Nucleic Acids Res. 13:1493-1504.
QUIGLEY, F., H. BRINKMANN, W. F. MARTIN, and R. CERFF, 1989 Strong functional GC pressure in a light regulated maize gene encoding subunit GAPA of chloroplast gyceraldehyde-3-phosphate dehydrogenase: implications for the evolution of GAPA pseudogenes. J. Mol. Evol. 29:412-421.[Medline]
RABINOWICZ, P. D., K. SCHUTZ, N. DEDHIA, C. YORDAN, and L. D. PARNELL et al., 1999 Differential methylation of genes and retrotransposons facilitates shotgun sequencing of the maize genome. Nat. Genet. 23:305-308.[Medline]
REINA, M., P. GUILLEN, I. PONTE, A. BORONAT, and J. PALAU, 1990 Sequence analysis of a genomic clone encoding a Zc2 protein from Zea mays W64A. Nucleic Acids Res. 18:6425.
RONCHI, A. K., K. PETRONI, and C. TONELLI, 1995 The reduced expression of endogenous duplications (REED) in the maize R gene family is mediated by DNA methylation. EMBO J. 14:5318-5328.[Medline]
ROUNTREE, M. R. and E. U. SELKER, 1997 DNA methylation inhibits elongation but not initiation of transcription in Neurospora crassa.. Genes Dev. 11:2383-2395.
RUBENSTEIN, I., and D. E. GERAGHTY, 1986 The genetic organization of zeins, pp. 297315 in Advances in Cereal Science and Technology, edited by Y. POMERANZ. American Association of Cereal Chemists, St. Paul.
RUSSELL, D. A. and M. M. SACHS, 1991 The maize cytosolic glyceraldehyde 3-phosphate dehydrogenase gene family: organ specific expression and genetic analysis. Mol. Gen. Genet. 229:219-228.[Medline]
SANMIGUEL, P., A. TIKHONOV, Y. K. JIN, N. MOTCHOULSKAIA, and D. ZAKHAROV et al., 1996 Nested retrotransposons in the intergenic regions of the maize genome. Science 274:737-738.
SANMIGUEL, P., B. S. GAUT, A. TIKHONOV, Y. NAKAJIMA, and J. L. BENNETZEN, 1998 The paleontology of intergene retrotransposons of maize. Nat. Genet. 20:43-45.[Medline]
SCHORDERET, D. F. and S. M. GARTLER, 1992 Analysis of CpG suppression in methylated and non-methylated species. Proc. Natl. Acad. Sci. USA 89:957-961.
SELKER, E. U., 1990 Premeiotic instability of repeated sequences in Neurospora crassa.. Annu. Rev. Genet. 24:579-613.[Medline]
SKOPEK, T., D. MARINO, K. KORT, J. MILLER, and M. TRUMBAUER et al., 1998 Effect of target gene CpG content on spontaneous mutation in transgenic mice. Mutat. Res. 400:77-88.[Medline]
SOAVE, C., R. REGGIANI, N. DIFONZO, and F. SALAMINI, 1981 Clustering of genes for 20 kd zein subunits in the short arm of maize chromosome 7. Genetics 97:363-377.