Genetics, Vol. 162, 1389-1400, November 2002, Copyright © 2002

Different Types and Rates of Genome Evolution Detected by Comparative Sequence Analysis of Orthologous Segments From Four Cereal Genomes

Wusirika Ramakrishnaa, Jorge Dubcovskyb, Yong-Jin Park1,a, Carlos Bussob, John Embertona, Phillip SanMiguelc, and Jeffrey L. Bennetzena
a Department of Biological Sciences, Purdue University, West Lafayette, Indiana 47907,
b Department of Agronomy and Range Science, University of California, Davis, California 95616
c Purdue University Genomics Core, WSLR, Purdue University, West Lafayette, Indiana 47907

Corresponding author: Jeffrey L. Bennetzen, Hansen Bldg., Purdue University, West Lafayette, IN 47907., maize{at}bilbo.bio.purdue.edu (E-mail)

Communicating editor: J. A. BIRCHLER


*  ABSTRACT
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Orthologous regions in barley, rice, sorghum, and wheat were studied by bacterial artificial chromosome sequence analysis. General microcolinearity was observed for the four shared genes in this region. However, three genic rearrangements were observed. First, the rice region contains a cluster of 48 predicted small nucleolar RNA genes, but the comparable region from sorghum contains no homologous loci. Second, gene 2 was inverted in the barley lineage by an apparent unequal recombination after the ancestors of barley and wheat diverged, 11–15 million years ago (mya). Third, gene 4 underwent direct tandem duplication in a common ancestor of barley and wheat 29–41 mya. All four of the shared genes show the same synonymous substitution rate, but nonsynonymous substitution rates show significant variations between genes 4a and 4b, suggesting that gene 4b was largely released from the strong purifying selection that acts on gene 4a in both barley and wheat. Intergenic retrotransposon blocks, many of them organized as nested insertions, mostly account for the lower gene density of the barley and wheat regions. All but two of the retrotransposons were found in the regions between genes, while all but 2 of the 51 inverted repeat transposable elements were found as insertions in genic regions and outside the retrotransposon blocks.


THE grass (Poaceae) family of plants, including barley, maize, millets, oat, rice, rye, sorghum, and wheat, contributes ~60% of the world's food production. Beyond their agronomic importance, these cereals also serve as a model system for comparative genetics (BENNETZEN and FREELING 1993 Down; FREELING 2001 Down), wherein several species have highly developed molecular and genetic tool kits plus a long history of detailed physiological, developmental, and genetic characterization.

One of the most significant differences in grass genomes is their nuclear DNA content, ranging from <200 to >80,000 Mb (BENNETT and SMITH 1976 Down; BENNETT et al. 2000 Down). Among the cereals, the approximate haploid DNA contents of rice, sorghum, maize, barley, and diploid wheat are 430, 750, 2500, 4800, and 5700 Mb, respectively (ARUMUGANATHAN and EARLE 1991 Down). In maize, we know that most (>60%) of the nuclear genome is composed of retrotransposons, often arranged as nested insertions within insertions (SANMIGUEL et al. 1996 Down; SANMIGUEL and BENNETZEN 1998 Down; MEYERS et al. 2001 Down). In barley and wheat, it is also clear that much of the genome is composed of repetitive DNA and that much of this repetitive DNA consists of retrotransposons (PANSTRUGA et al. 1998 Down; SHIRASU et al. 2000 Down; STEIN et al. 2000 Down; WICKER et al. 2001 Down; SANMIGUEL et al. 2002 Down), but too few genomic segments have been investigated to identify any consistent patterns of arrangement. The smaller grass genomes, including rice and sorghum, appear to have both a lower amount of repetitive DNA and fewer retrotransposons inserted between genes (CHEN et al. 1997 Down, CHEN et al. 1998 Down; TIKHONOV et al. 1999 Down; KLEIN et al. 2000 Down; TARCHINI et al. 2000 Down). Further comparative DNA sequence analysis is needed to determine how these genomes differ in gene density and in the nature/organization of repetitive DNAs in genic regions.

Comparative genetic mapping of several cereal genomes using DNA markers has shown that all can be depicted as simple variants of a single genetic map (MOORE et al. 1995 Down; GALE and DEVOS 1998 Down). However, a small number of large genomic rearrangements (commonly full arm translocations or inversions) differentiate many of the genomes, and a certain percentage of DNA markers (perhaps as high as 30%) do not fit into any clear colinear pattern (BENNETZEN 2000 Down). Many of the large rearrangements mark specific lineages, as in the case of three translocations that occurred in the Panicoideae ancestor that gave rise to maize, sorghum, pearl millet, foxtail millet, and finger millet after their divergence from a common ancestor with barley, wheat, and rice ~50–60 million years ago (mya; GALE and DEVOS 1998 Down).

Despite the general colinearity of their genetic maps, we do not yet know the frequency or nature of the small rearrangements that differentiate cereal genomes. Any rearrangements smaller than a few centimorgans (e.g., several megabases) would have been missed by standard comparative genetic maps. In the grasses, comparative genomic sequencing studies that involved genomic segments >30 kb have been limited to maize, sorghum, and rice for the sh2/a1 region (CHEN et al. 1997 Down, CHEN et al. 1998 Down), maize and sorghum for the adh1 region (TIKHONOV et al. 1999 Down), and barley and rice for a region near the Vrn1 gene (DUBCOVSKY et al. 2001 Down). These studies indicated that only the genes were conserved in these segments, but that some small rearrangements involving one or two genes had occurred also. Because so few segments and species were investigated in each study, no overall patterns could be discerned.

Comparative sequence analysis of small genomic regions (14–23 kb) at the Lrk/Tak loci in wheat and homologous regions from barley, maize, and rice uncovered a high gene density and numerous rearrangements (FEUILLET and KELLER 1999 Down). This observation is not surprising for disease resistance genes of this type, which are organized in tandem clusters that tend to undergo rapid reorganization.

In all grasses studied, gene density has been higher than that predicted for a random dispersal of genes. Because investigators have always begun their studies by selecting a clone that contained a gene, an inherent bias was present in the gene-density outcome. Regardless, the observed gene densities in barley and wheat sequences (15–22 and 5–42 kb, respectively) are much lower than the 200–250 kb/gene that would be predicted by random dispersal (KELLER and FEUILLET 2000 Down; SHIRASU et al. 2000 Down; DUBCOVSKY et al. 2001 Down; FEUILLET et al. 2001 Down; WICKER et al. 2001 Down). Cytogenetic studies also strongly support the existence of gene-rich islands in barley and wheat (GILL et al. 1996A Down, GILL et al. 1996B Down; FARIS et al. 2000 Down; KUNZEL et al. 2000 Down).

In dicotyledonous plants, comparative sequence analyses involving large genomic segments are restricted to comparisons with Arabidopsis thaliana, a species that has undergone a very high frequency of ancestral rearrangements involving small chromosomal segments, primarily genic deletions (LAGERCRANTZ 1998 Down; BLANC et al. 2000 Down; KU et al. 2000 Down; O'NEILL and BANCROFT 2000 Down; VISION et al. 2000 Down). Extensive chromosomal rearrangement is observed between Arabidopsis and Brassica species (O'NEILL and BANCROFT 2000 Down). However, significant residual microcolinearity is observed between Arabidopsis and its close relative, Capsella, two species that diverged 6–10 mya (ROSSBERG et al. 2001 Down).

Genes in plants, and in other eukaryotes, can evolve at very different rates, perhaps due to differences in selective pressure or to different local rates of mutation (WOLFE et al. 1987 Down, WOLFE et al. 1989A Down, WOLFE et al. 1989B Down; GAUT 1998 Down; ZHANG et al. 2001 Down). For instance, adh1 was found to evolve faster than adh2 at nonsynonymous sites and at similar rates at synonymous sites (GAUT et al. 1999 Down). Overall, little is known about substitution rate variations among nuclear genes of grass genomes and how this phenomenon may be related to local genome organization.

In this study, we present comparative sequence analysis of an orthologous chromosomal region from barley, rice, sorghum, and wheat. Our analyses indicate the frequency, nature, and lineages of several different types of genome evolution. These include variation in gene order and number, differences in the local rates of nucleotide variation, and patterns of transposable element accumulation. These combined studies provide a first indication of the value of a multi-species analysis of local genome structure and evolution in the Poaceae.


*  MATERIALS AND METHODS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

BAC selection, restriction mapping, and sequencing:
Restriction fragment length polymorphism (RFLP) marker WG644 was used to screen the Morex barley bacterial artificial chromosome (BAC) library (YU et al. 2000 Down), the HindIII BAC library made from Nipponbare rice DNA (http://www.genome.clemson.edu/orders/lib_desc/nippon.html), the HindIII BAC library made from Btx623 sorghum DNA (http://www.genome.clemson.edu/orders/lib_desc/SB_BBc.html), and the HindIII BAC library made from DV92 diploid wheat DNA (LIJAVETZKY et al. 1999 Down). The diploid wheat and sorghum libraries were also screened with probes corresponding to different genes of barley and rice BACs (DUBCOVSKY et al. 2001 Down) to identify the orthologous BAC. Positive BACs were fingerprinted with restriction enzyme HindIII, transferred to nylon membranes, and hybridized to confirm that all BACs contained the homologous locus.

Restriction maps of 36I5 (rice), 635P2 (barley), 116F2 (diploid wheat), 115G1 (diploid wheat), and 170F8 (sorghum) were constructed to experimentally validate computer sequence assembly. This experimental confirmation was important to determine the effect of large retroelements with large direct repeats on the assembly algorithms. BACs were individually digested with the 8-bp specificity restriction enzymes AscI, NotI, PacI, PmeI, and SwaI. All possible single and double digestions were analyzed for restriction enzymes with one or more sites within the mapped BAC. Restriction fragments were separated by pulsed-field electrophoresis, as described earlier (DUBCOVSKY et al. 2001 Down).

Preparation of shotgun libraries, sequencing, and analysis were as described by DUBCOVSKY et al. 2001 Down. For completing BAC sequences, gaps were closed by a combination of different approaches, including the use of different sequence chemistries, the thermofidelase enzyme, PCR amplification of gaps, shotgun sequencing of transposon-inserted subclones that span a gap, and direct sequencing of BAC template. When gaps were due to repetitive regions, subclones that either started or ended in unique regions with the remaining portion in the repetitive region were assembled separately and inserted into the main assembly.

Sequence analysis:
Annotation and sequence analysis were performed as described earlier (DUBCOVSKY et al. 2001 Down). FGENESH (http://www.softberry.com/nucleo.html) with the maize training set was used for gene prediction in addition to GENSCAN (http://genes.mit.edu/GENSCAN.html) and GeneMark.hmm (http://genemark.biology.gatech.edu/Gene Mark/).

Estimation of nucleotide substitution rates and phylogenetic reconstructions:
Genes were aligned using CLUSTALX (THOMPSON et al. 1997 Down). Rates of nucleotide substitution were estimated using the distance measures of NEI and GOJOBORI (1986) and the Jukes-Cantor correction as implemented in the MEGA2 (molecular evolutionary genetic analysis) package (KUMAR et al. 2001 Down). Phylogenetic reconstructions were performed by the neighbor-joining method, with synonymous sites for the analyses. Synonymous and nonsynonymous substitution rates were estimated as described by GAUT et al. 1996 Down. Divergence and duplication times (T) were estimated for all genes, except those shown by relative rate tests to be evolving at different rates, using . k is the absolute rate of synonymous substitution/site/year; Ks is the estimated number of synonymous substitutions per site between homologous sequences and from the neighbor-joining trees generated using the MEGA2 package.

Relative rate tests:
Relative rate tests were used to assess heterogeneity in the numbers of substitutions per site estimated from the one-parameter method (JUKES and CANTOR 1969 Down). Tajima's relative rate test was used as implemented in MEGA2 for testing the molecular clock hypothesis because it can be applied even when the substitution rate varies among different sites (TAJIMA 1993 Down). We used this test as implemented in MEGA2. It compares two sequences with an outgroup sequence and counts the number of unique substitutions in both lineages. When one of the two sequences accumulates a significantly larger number of substitutions, it indicates that the molecular clock hypothesis can be rejected.


*  RESULTS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Isolation and sequencing of orthologous barley, rice, sorghum, and wheat BACs:
To study genomic organization near the Vrn1 locus, BACs Hv635P2, Os36I5, Sb170F8, and Tm115G1/Tm116F2 were selected by their hybridization to DNA marker WG644. WG644 had been mapped to orthologous regions in rice, barley, and wheat (KLEINHOFS et al. 1993 Down; DUBCOVSKY et al. 1998 Down; SARMA et al. 1998 Down). Colinearity between the long arm of rice chromosome 3 and the long arm of homeologous group 5 in the Triticeae has been studied extensively because of the presence of vernalization and frost tolerance genes Vrn1 and Fr1 in wheat and barley and the heading date gene hd6 in rice (DUBCOVSKY et al. 1998 Down; SARMA et al. 1998 Down; KATO et al. 1999 Down; SUTKA et al. 1999 Down). The DNA on the sorghum BAC has not been mapped to the sorghum genetic map, but all selected sorghum clones formed a single contiguous set, indicating that there was only one homologous region in sorghum.

Previously, we reported the sequence of one of these BACs (barley clone Hv635P2) and part (50 kb) of the rice BAC Os36I5 (DUBCOVSKY et al. 2001 Down). Transposable element composition, dinucleotide arrangement, and recombination frequencies have also been reported for the wheat region (SANMIGUEL et al. 2002 Down). The three BACs (two wheat, one sorghum) and an extra 21.3 kb of the rice BAC were sequenced (accession nos. AY013246, AY013245, AY099491, AF503433, and AF459639) by a shotgun approach like that described earlier (DUBCOVSKY et al. 2001 Down). The final error rate was <1 bp/10 kb and the consensus sequence was of high quality (PHRED value of >=25). The two diploid wheat (Triticum monococcum) BACs formed a 215-kb contiguous sequence with 20,573 bp of overlap. The genome sequences from BACs Hv635P2, Sb170F8, and Tm115G1/116F2 are 102,433, 142,376, and 215,220 bp, respectively. The insert in BAC Os36I5 is ~75 kb. This BAC has two regions of contiguous sequence of 65.5 and 5.8 kb, with a gap of ~3.7 kb that we have been unable to close (Fig 1). In every case, restriction maps of the BAC clones agree completely with the sequence assemblies.



View larger version (33K):
In this window
In a new window
Download PPT slide
 
Figure 1. Microcolinearity of an orthologous region in barley, rice, sorghum, and diploid wheat. Arrows indicate genes, plus their size and proposed direction of transcription. Nested insertions of retrotransposons are shown in barley and diploid wheat. Asterisks indicate simple sequence repeats that contain tandem repeat numbers of eight or more. MITEs are indicated by small triangles. In the rice BAC, a dashed line represents a region with homology to a cluster of rice small nucleolar RNA genes.

Gene content and organization in the sequenced clones:
As previously reported (DUBCOVSKY et al. 2001 Down), barley BAC Hv635P2 contains five predicted genes, four long terminal repeat (LTR) retrotransposons, a solo LTR of BAGY-2 (SHIRASU et al. 2000 Down), a LINE retroposon, at least nine miniature inverted repeat transposable elements (MITEs), and a Mutator-like transposable element (Fig 1). Two inverted repeats flank barley gene 2 and a novel transposable element, Inysub, is inserted in the rightward inverted repeat.

Rice BAC Os36I5 has eight predicted genes, one LTR retrotransposon, one Mutator transposable element, and at least 12 MITEs. Gene 6 is a hypothetical gene with no significant similarity to any known gene, protein, or expressed sequence tag (EST), but has been postulated to be a gene by all three gene prediction programs. The predicted protein product of gene 7 shows highest homology to an Arabidopsis unknown protein (NP189619, 1e-42). Gene 7 of rice is orthologous to sorghum gene 6. The predicted gene 8 protein product shows highest homology to an Arabidopsis ribosomal protein (NP199657, 3e-45) and was also conserved as sorghum gene 7. However, gene 8 appears to be a pseudogene in rice because it has a stop codon in the first predicted coding exon. In addition, exon 4 (the last exon) is missing in the rice BAC. Downstream of this region, a single 1.9-kb segment is repeated four times (94.8–99% sequence identity). A fifth repeat is truncated at the end and extends into the unsequenced gap. This 1.9-kb repeat shows homology to rice small nucleolar RNA (snoRNA) genes that are organized in a cluster (AJ310377). Each cluster in the rice BAC, Os36I5, has 10 snoRNA-like genes that show homology (93–99%) to snoRNA genes ranging from 79 bp (AJ307932) to 188 bp (AJ320263). The last cluster contains eight candidate snoRNA genes. snoRNAs are small in size and essential for processing ribosomal RNAs (LEADER et al. 1997 Down, LEADER et al. 1999 Down; BROWN et al. 2001 Down). Plant snoRNA gene clusters are transcribed as a polycistronic pre-snoRNA transcript (LEADER et al. 1997 Down, LEADER et al. 1999 Down). A comparable snoRNA gene cluster is not present in the orthologous region in the sorghum BAC, Sb170F8. Identified homologies are shown in Table 1.


 
View this table:
In this window
In a new window

 
Table 1. Predicted genes in BACs Os36I5 and Sb170F8

The sorghum genomic segment in BAC Sb170F8 contains 20 predicted genes, two LTR retrotransposons, and at least 20 MITEs (Fig 1). At the most-leftward end of the BAC is a partial retrotransposon (Fig 1). The predicted gene 5 of sorghum shows 100% identity to a sorghum EST (BE592631). The sorghum BAC also has a cluster of putative glucosyl transferase genes (genes 13–19). Interestingly, all but genes 8–11 are in the same transcriptional orientation on this BAC. In fact, for all BACs investigated, only these 4 genes and gene 2 on the barley BAC are in the opposite transcriptional organization from all other genes (Fig 1).

The contiguous sequence contained on diploid wheat BACs Tm115G1 and Tm116F2 harbors five predicted genes and 21 intact or partially deleted LTR retrotransposons. As in barley, gene 4 is tandemly duplicated. Within the 215-kb wheat sequence, the five genes are arranged in clusters of 13 and 29 kb, containing two and three genes, respectively (Fig 1). Within these genic regions, the average gene density is one gene per 8.4 kb.

Mobile DNA organization:
The majority of the retrotransposons in wheat were similar to those previously identified in barley and wheat (PANSTRUGA et al. 1998 Down; SHIRASU et al. 2000 Down; WICKER et al. 2001 Down), although we discovered and named six new elements (Eway, Latidu, Miuse, Nusif, Veju, and Wham). We also discovered new retrotransposons in barley (Inav, Ikeros, and Sedef), rice (Alulu), and sorghum (Pyrubu and Unum). Over one-half of the retrotransposons in the wheat BACs are organized in a nested fashion like that seen in the maize genome (SANMIGUEL et al. 1996 Down). The 21 LTR retrotransposons comprise ~160 kb of the 215-kb region. These retroelements were named by the approach described in SANMIGUEL et al. 2002 Down. Nested insertions of LTR retrotransposons were found only on wheat BACs, except in the case of barley BAC Hv635P2, where one BARE-1 element was inserted in the opposite orientation into another BARE-1 element.

BARE-1-like retrotransposons have been named Angela in wheat (WICKER et al. 2001 Down) and constitute ~11% of the wheat region. Solo LTRs were not found in any of the sequenced regions except a BAGY-2 solo LTR in barley. Sabrina-like and Wham LTR retrotransposons constitute 10 and 9% of the sequenced wheat region, respectively. Overall, there are nine largely intact LTR retrotransposons in addition to several partial elements in the wheat region. Angela elements appear to show the most recent insertional activity, because none of them contain another inserted LTR retrotransposon. Five of the identified retroelements (not including the two partial Angela elements at the ends of the BAC contig) were partly deleted. Sorghum BAC Sb170F8 contains two novel LTR retrotransposons, Unum and Pyrubu, and a partial LTR retrotransposon that together account for ~15% of the 142-kb BAC insert. Rice BAC Os36I5 contains a retrotransposon, Alulu. All retrotransposons except one (Unum inserted in intron 13 of sorghum gene 3) were found in the intergenic regions.

MITEs are equally numerous, relative to genes, in all regions. Identified MITEs include 9 in barley, 12 in rice, 20 in sorghum, and 10 in wheat. On average, there are 2 MITEs per gene in barley, rice, and wheat. In sorghum, the ends of at least eight more MITE-like elements could not be determined. With two exceptions (one each in barley and wheat), all MITEs were found as insertions in the genic regions. A MITE is inserted in the LTR retrotransposon Inav in barley, and a second MITE is inserted in a partial element (downstream of the first partial Angela) in wheat that has homology to RIRE2.

Comparison of orthologous barley, rice, sorghum, and wheat regions:
Four predicted genes are conserved across the four genomes in this study. However, they are distributed across 102 kb in barley, 30 kb in rice, 35 kb in sorghum, and 215 kb in diploid wheat (Fig 2).



View larger version (19K):
In this window
In a new window
Download PPT slide
 
Figure 2. Comparison of an orthologous region in barley, rice, sorghum, and diploid wheat. The compactness of rice and sorghum genomes is evident compared to barley and diploid wheat genomes. Lines connect the genes that are common among the BACs. Gene 5 of sorghum and gene 6 of rice (connected by a dotted line) show limited sequence homology and may not be truly orthologous. Large transposons are shown as blocks. Arrows indicate genes and their transcriptional orientation.

Adjacent to the four conserved genes, the sorghum BAC shares two additional genes (6 and 7) with rice BAC Os36I5 (Fig 2). In addition, the predicted protein product (116 amino acids) of a divergent gene (gene 6) in rice has a small stretch of similarity (63% identity in a region of 19 amino acids) to the predicted protein product (117 amino acids) of sorghum gene 5. Sorghum genes 11–20 are colinear with a segment of a rice BAC (AC079887) in the GenBank database. This BAC is downstream from Os36I5 and includes RFLP marker R2404, which is 0.3 cM from marker R2311 located in Os36I5 on chromosome 3. The cluster of putative glucosyl transferase genes (genes 13–19) in the sorghum BAC, Sb170F8, is also present in the rice BAC (AC079887). Sorghum has seven of these genes, compared to five in rice.

Gene 2 is in inverted orientation in barley relative to rice, sorghum, and wheat. This indicates that the inversion occurred in the barley lineage after wheat and barley ancestors diverged. Gene 4 is duplicated in both barley and wheat, suggesting that this duplication occurred before the divergence of barley and wheat ancestors but after their divergence from rice and sorghum ancestors. Alternatively, the duplicated gene may have been lost from the rice and sorghum lineages.

Intron-exon and exon-intron boundaries were analyzed for the four genes that were common among the four genomes. These four genes were also compared to the most closely related genes in Arabidopsis. In the grasses, exon-intron boundaries predominantly exhibit the sequence GT/A (56%), followed by GT/G (26%) as the next most frequent. This compares to 62% GT/A and 14% GT/G in Arabidopsis. Intron-exon boundaries are C/AG (78%) or T/AG (22%) in the grass genomes, compared to 63% C/AG and 29% T/AG in the putative orthologous genes in Arabidopsis. Hence, the grasses show a narrower range of variation in intron splicing sites than is seen in Arabidopsis.

Expansion of intergenic spaces was largely caused by LTR retrotransposon insertion in barley and wheat (Fig 2). Retrotransposons account for ~60, 8, 15, and 70% of the sequenced barley, rice, sorghum, and wheat regions, respectively. However, none of the retrotransposons, MITEs, or other mobile elements were in orthologous locations, except a small truncated element of 752 bp in Hv635P2, which was conserved in Tm115G1/ 116F2 (84% identity; Fig 1) and was similar (84–89% identities) to parts of the LTR of the Barbara element (AF326781; WICKER et al. 2001 Down). Hence, almost all these elements were inserted after the ancestors of these species diverged from each other.

Detailed comparison of the homologous barley and wheat regions:
Exons and introns in the five common genes between wheat and barley were well conserved, facilitating a comprehensive analysis of these sequences. The coding regions of the five genes were very similar, varying from 95.9 to 97.5% identity at the DNA level. The 16 predicted exon-intron boundaries found within a codon and the noncanonical 5' "GC" splice site at the end of exon 6 in gene 4 were all conserved between barley and wheat, as they were between barley and rice (DUBCOVSKY et al. 2001 Down). These perfect alignments of barley and wheat exons facilitated the delimitation of homeologous intron regions. The total intron size for these five genes (66 introns) was 16,442 bp in barley and 15,972 bp in wheat, of which 15,182 bp were aligned. Introns were very similar in barley and wheat, ranging from 86.1 to 89.5% identity.

A different proportion of transitions and transversions was observed between introns and exons. In the exons, 66% of the 273 point mutations were transitions and 34% were transversions. In the introns, 58.3% of the 1284 point mutations were transitions and 41.7% were transversions. An analysis of variance of the transition proportions using the five genes as replicates indicated that this difference was significant at P < 0.05.

The unaligned portions of the introns (20%) were caused by insertions of 10 MITEs or MITE-like elements, 262 small indels (<15 bp), 19 intermediate indels (15–200 bp), two large indels (395 and 1532 bp), and three regions of gene 4b (introns 1, 2, and 16) that showed unusually low levels of conservation and could not be accurately aligned. The presence of 10 MITEs or MITE-like elements in these five genes indicates an average insertion rate of approximately one insertion/gene/evolutionary lineage/10 million years. The predicted MITEs (or remainders of partially deleted MITEs) varied in length from 17 to 226 bp and all showed duplication of short host sequences (generally TA) and the presence of perfect or imperfect inverted repeats. These apparent insertions accounted for an increase of 1249 bp in the size of the introns.

Most of the other indels were small, including 1 bp (32%), 2 bp (22%), 3 bp (10%), or 4 bp (8%) events. However, the largest amount of indel size variation in these 66 introns was provided by the 19 intermediate (831 bp total) and 2 large (1927 bp total) indels. To understand the possible origin of these indels, we compared their flanking regions. Forty-four percent of the indels of three or more base pairs included perfect short direct repeats in one border of the indel and in the opposite border of the paired region between barley and wheat. This proportion increased to 65% with the inclusion of direct repeats with 1 bp difference among repeats (>4 bp) or 1–2 bp away from the exact border of the indel.

Nucleotide substitution rates:
Synonymous (Ks) and nonsynonymous (Ka) substitution rates within a gene are often correlated (WOLFE and SHARP 1993 Down). The Ka/Ks ratio indicates the level of selective constraint acting on proteins. For most of the comparisons for genes 1–4, Ka and Ks were also correlated (r = 0.82, P < 0.0001; Table 2). The data show that gene 4 is highly constrained in its nonsynonymous substitution rate in rice and sorghum, but only gene 4a is similarly constrained in barley and wheat (Fig 3). In fact, gene 4a shows a decrease in its nonsynonymous substitution rate in the barley-wheat comparison, resulting in a Ka/Ks ratio of 0.03 that indicates an increase in purifying selection.




View larger version (115K):
In this window
In a new window
Download PPT slide
 
Figure 3. Nucleotide substitution rates of genes 1–4 among different grass genomes. (A) Synonymous substitution (Ks) rates show less variation. Bars represent Ks for pairwise comparisons between different grass genomes for four or five genes. Line extensions on top of the bars represent standard error values. (B) Nonsynonymous substitution (Ka) rates vary by severalfold. Bars and line extensions are as described in A except they are for Ka values.


 
View this table:
In this window
In a new window

 
Table 2. Number of synonymous (Ks) and nonsynonymous (Ka) substitutions per site among barley, rice, sorghum, and wheat for four genes in Vrn1 region

To estimate whether the four genes conserved among the four genomes are evolving at different rates, two-tailed t-tests were performed for different gene pairs. P values were significant at a 5% level only for comparisons of Ks values of gene 4b with genes 1, 3, and 4a after correction for multiple tests (P < 0.002, {alpha} = 0.0051). Comparisons of Ka for different gene pairs showed 5 out of 10 comparisons to be statistically significant at a 5% level after correction for multiple tests (P < 0.0037, {alpha} = 0.0051). This suggests more variable nonsynonymous substitution rates than synonymous substitution rates among these four genes.

To estimate the time of duplication of gene 4 and the divergence of wheat and barley, the synonymous substitution rates must not differ significantly and the duplicated sequences must evolve at similar rates after duplication. Tajima's relative rate test (TAJIMA 1993 Down) was used to scan for substitution rate heterogeneity in the four genes conserved in the four grass genomes. Rice was taken as an outgroup to estimate relative rates of genes 1–4 in barley and wheat. Relative rate tests showed that synonymous substitution rate variations were not significant for genes 1–4. However, highly significant variations in nonsynonymous substitution rates were observed between genes 4a and 4b. Therefore, synonymous substitutions were used to estimate divergence times. Relative rate tests and nucleotide substitution rates show that gene 4b is evolving at a faster rate compared to gene 4a (Fig 3 and Table 3).


 
View this table:
In this window
In a new window

 
Table 3. Tajima's relative rate tests

Rice fossils dated to ~40 mya were described by STEBBINS 1981 Down. Molecular data indicate that the divergence of the rice lineage from the lineages giving rise to barley, sorghum, and wheat occurred 50–70 mya (WOLFE et al. 1989A Down). Therefore, 60 mya was taken as the divergence time between rice and barley/wheat. We used average synonymous substitution rates for the rice and barley/wheat divergence to estimate absolute synonymous substitution rates. The average synonymous substitution rate of the adh1 and adh2 genes in grasses was reported to be 6.5 x 10-9 substitutions/synonymous site/year (GAUT et al. 1996 Down). The average for six nuclear genes (adh1, adh2, waxy, shrunken1, gapC, and chalcone synthase) between maize and wheat or barley was reported to be 6.1 x 10-9 substitutions/synonymous site/year (WOLFE et al. 1989B Down). Our estimate is 4.2 x 10-9 substitutions/synonymous site/year based on genes 1–4 of rice and wheat and a divergence time of 60 mya. On the basis of our estimates, the ancestors of barley and wheat diverged from each other 11–15 mya, which is in a similar range (10–14 mya) to that estimated by WOLFE et al. 1989A Down.

Using our calculated rate of sequence divergence, we determined that the duplication of gene 4 in a Triticeae ancestor occurred 29–41 mya, many millions of years after this lineage diverged from ancestral lineages that gave rise to rice or sorghum. Hence, our data are more consistent with a duplication of gene 4 that occurred specifically in a shared ancestor of barley and wheat than it is with a model proposing deletion of one gene 4 copy from the rice/sorghum or rice and sorghum lineages.


*  DISCUSSION
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Types and times of genic rearrangement:
Comparative sequence analysis can provide a wealth of information about the nature of sequence arrangement and evolution, including gene content, order, and orientation. Microcolinearity among grass genomes has been shown by the sequencing of genomic segments from orthologous loci from rice, maize, sorghum, barley, and wheat (CHEN et al. 1997 Down; TIKHONOV et al. 1999 Down; DUBCOVSKY et al. 2001 Down; FEUILLET et al. 2001 Down). In the present study, comparative sequence analysis of orthologous genomic segments of barley, rice, sorghum, and wheat revealed two small genic rearrangements. Gene 2 was inverted in barley relative to the other three genomes studied, probably by unequal recombination between inverted repeats flanking the gene, after the ancestors of barley and wheat diverged from each other. It is likely that the duplication of gene 4 occurred before the divergence of barley and wheat, because the duplicated genes are present in these two genomes but absent in rice and sorghum.

One dramatic variation between the studied regions is a difference in the presence of an entire duplicated gene family in the comparison of rice and sorghum. Rice BAC Os36I5 contains at least 48 candidate snoRNA genes in five clusters, none of which are present in the comparable sorghum region. This lack of colinear location is also observed with rDNA repeats, storage protein gene clusters, and tandem disease resistance genes (DUBCOVSKY and DVORAK 1995 Down; LEISTER et al. 1998 Down; RICHLY et al. 2002 Down; J. MESSING, personal communication). It is not clear why tandem gene clusters should be less stable in genomic location. However, when this property is shared by several different types of clustered genes, it suggests that this is a consistent and evolved property of higher plant genomes.

Distribution and density of genes and repeats:
Gene density in species from the Triticeae tribe with large genomes such as barley and wheat is of immense interest. The large difference between expected and observed gene density in barley and wheat (KELLER and FEUILLET 2000 Down) and studies based on GC composition (BARAKAT et al. 1997 Down) and cytogenetics (GILL et al. 1996A Down, GILL et al. 1996B Down; FARIS et al. 2000 Down) support the hypothesis that gene-rich regions exist in these large genomes. Previous estimates of gene density of the wheat genome were mostly based on disease resistance genes (STEIN et al. 2000 Down; FEUILLET et al. 2001 Down; WICKER et al. 2001 Down), which are largely exceptional in their organization and lack of stability (LEISTER et al. 1998 Down; HULBERT et al. 2001 Down; RICHLY et al. 2002 Down). The map position of Vrn1 places it in a region (between breakpoints in deletion lines 5AL-6 and 5AL-17) that undergoes a moderate level of recombination (GILL et al. 1996B Down) and, thus, it should be an average genic region of the wheat genome. In our study, sorghum had the highest gene density of one gene per 7 kb, while rice (excluding the snoRNA loci), barley, and diploid wheat had gene densities of one gene per 9, 20, and 43 kb, respectively. These observed gene densities compare to predicted respective gene densities of one per 25, 15, 160, and 190 kb for a random gene dispersal model if each of these species contains ~30,000 genes. Hence, all species show higher-than-random gene densities, but this effect is much more pronounced for the large genomes. This result suggests that most of the larger plant genomes are composed of largely gene-free regions, like paracentromeric heterochromatin, as is also suggested by cytogenetic studies (GILL et al. 1996A Down, GILL et al. 1996B Down; FARIS et al. 2000 Down).

We identified numerous novel LTR retrotransposons in the sequenced regions and found that retrotransposons constitute >70% of the 215-kb region sequenced in wheat. Retrotransposons and genes are organized in separate clusters in wheat and to a lesser extent in barley. The most striking features in the wheat region are the nested insertions of retrotransposons. Hence, the preferential insertion of the abundant classes of retrotransposons into each other reported in maize (SANMIGUEL et al. 1996 Down) appears to be a general characteristic of the wheat genome. In addition, we found that MITEs in wheat and barley also show the preferential accumulation near genes that has been reported in maize (BUREAU and WESSLER 1994 Down; TIKHONOV et al. 1999 Down). As in maize (SANMIGUEL et al. 1998 Down), all of the LTR retrotransposons discovered in this study appear to be of recent origin, partly evidenced by the lack of orthologous elements inserted in the barley and wheat regions. This may reflect a recent explosion in LTR retrotransposon activity in these lineages (accounting for the large overall genome size) or may indicate relatively rapid removal of older transposon sequences (DEVOS et al. 2002 Down).

Mutation, selection, and drift:
Comparison of barley and wheat genic regions revealed several interesting features. The ubiquitous presence of direct tandem repeats in the indels suggests that replication slippage, illegitimate recombination (DEVOS et al. 2002 Down), and/or transposable element excision were responsible for these indels. If all the indels >4 bp and flanked by perfect repeats were considered scars from transposon activity, the previous estimate of the rate of transposon insertion would change from 1 to 2.2 (or 2.6 if imperfect repeats are also included) insertion per gene per evolutionary lineage per 11 million years. If the indels (including the two large indels of uncertain origin) are assumed to be deletions, they would have removed at least 3.5 kb of an 18.5-kb region of aligned intron sequences in the last 11–15 million years. These deletions could provide an adequate counterbalance to the increase in size originated by transposon insertion in barley, wheat, and other plant species.

In the present study, we compared the modes and rates of evolution of four orthologous genes that are next to each other in the four cereal genomes. Our data indicate that the synonymous substitutions evolve at a more uniform rate than nonsynonymous substitutions, not only among different genes but also in different lineages.

Rice-sorghum and wheat-barley (Triticeae) are separately derived from ancestors that diverged 50–70 mya (WOLFE et al. 1989A Down). We used this date and synonymous substitutions to estimate divergence and duplication times. Genes 1–4 show 4.2 x 10-9 substitutions per synonymous site per year, which is close to that reported previously for grass genomes (WOLFE et al. 1989B Down; GAUT et al. 1996 Down). Our data suggest that the duplication of gene 4 occurred 29–41 mya. This conclusion is also supported by the absence of the duplicated gene 4 in the orthologous region in both rice and sorghum. However, the duplication may have happened earlier, if it was followed by rounds of unequal gene conversion.

Nonsynonymous substitutions tend to be uninformative over short evolutionary time periods, partly because they are subject to positive selection. We found that the nonsynonymous substitution rates deviated significantly from clock-like behavior after duplication of gene 4, as revealed by relative rate tests. Similar variations were observed between the two duplicated grass adh loci (GAUT et al. 1996 Down). In contrast, several of the 26 groups of duplicated genes analyzed in zebra fish showed significant differences in the rate of evolution as measured by both Ka and Ks (VAN DE PEER et al. 2001 Down). In nine genomes previously studied (including Arabidopsis and rice), duplicated genes were found to arise at a rate averaging 0.01 duplications/gene/million years (LYNCH and CONERY 2000 Down). However, these duplicated genes often have a short half-life, 23.4 million years, for instance, in Arabidopsis (LYNCH and CONERY 2001 Down). Biological innovation and functional diversification can lead to evolution of new functions for duplicated genes. In fact, gene 4a seems to be the most conserved locus of the four genes compared, while gene 4b is the least conserved. Thus, these duplicated genes appear to be on different evolutionary routes that may lead (or may have already led) to a new role for gene 4b.

Small-scale gene rearrangements have been found in other eukaryotic genome comparisons and can be a major factor in speciation and genome evolution (WOLFE and SHIELDS 1997 Down; WAGNER 2001 Down). The very high rate of genic rearrangement observed in this study and other plant genome comparisons (reviewed in BENNETZEN and RAMAKRISHNA 2002 Down) indicates that plants have much more unstable genomes than those seen in mammals. This instability can provide the extensive haplotype variability characteristic of most plants, providing broad opportunities for the action of natural selection. Further studies are warranted to provide a more comprehensive indication of the nature, mechanisms, rates, and lineages of genome rearrangement in plants.


*  FOOTNOTES

Sequence data from this article have been deposited with the EMBL/GenBank Data Libraries under accession nos. AY013246, AY013245, AY09949, AF503433, and AF459639. Back
1 Present address: Department of Bio-Resources, National Institute of Agricultural Biotechnology, Suwon 441-707, Republic of Korea. Back


*  ACKNOWLEDGMENTS

This work was supported by the National Science Foundation Plant Genome Program (grant no. 9975793) and United States Department of Agriculture-National Research Initiative grant no. 2000-1678.

Manuscript received April 18, 2002; Accepted for publication July 17, 2002.


*  LITERATURE CITED
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

ARUMUGANATHAN, K. and E. D. EARLE, 1991  Nuclear DNA content of some important plant species. Plant Mol. Biol. Reporter 9:208-218.

BARAKAT, A., N. CARELS, and G. BERNARDI, 1997  The distribution of genes in the genomes of Gramineae. Proc. Natl. Acad. Sci. USA 94:6857-6861.[Abstract/Free Full Text]

BENNETT, M. D. and J. B. SMITH, 1976  Nuclear DNA amounts in angiosperms. Philos. Trans. R. Soc. Lond. B Biol. Sci. 274:227-274.[Medline]

BENNETT, M. D., P. BHANDOL, and I. J. LEITCH, 2000  Nuclear DNA amounts in angiosperms and their modern uses: 807 new estimates. Ann. Bot. 86:859-909.[Abstract/Free Full Text]

BENNETZEN, J. L., 2000  Comparative sequence analysis of plant nuclear genomes: microcolinearity and its many exceptions. Plant Cell 12:1021-1030.[Abstract/Free Full Text]

BENNETZEN, J. L. and M. FREELING, 1993  Grasses as a single genetic system: genome composition, collinearity and compatibility. Trends Genet. 9:259-261.[Medline]

BENNETZEN, J. L. and W. RAMAKRISHNA, 2002  Numerous small rearrangements of gene content, order and orientation differentiate grass genomes. Plant Mol. Biol. 48:821-827.[Medline]

BLANC, G., A. BARAKAT, R. GUYOT, R. COOKE, and M. DELSENY, 2000  Extensive duplication and reshuffling in the Arabidopsis genome. Plant Cell 12:1093-1101.[Abstract/Free Full Text]

BROWN, J. W. S., G. P. CLARK, D. J. LEADER, C. G. SIMPSON, and T. LOWE, 2001  Multiple snoRNA gene clusters from Arabidopsis.. RNA 7:1817-1832.[Abstract]

BUREAU, T. E. and S. R. WESSLER, 1994  Mobile inverted-repeat elements of the tourist family are associated with the genes of many cereal grasses. Proc. Natl. Acad. Sci. USA 91:1411-1415.[Abstract/Free Full Text]

CHEN, M., P. SANMIGUEL, A. C. D. OLIVEIRA, S.-S. WOO, and H. ZHANG et al., 1997  Microcolinearity in sh2-homologous regions of the maize, rice, and sorghum genomes. Proc. Natl. Acad. Sci. USA 94:3431-3435.[Abstract/Free Full Text]

CHEN, M., P. SANMIGUEL, and J. L. BENNETZEN, 1998  Sequence organization and conservation in sh2/a1-homologous regions of sorghum and rice. Genetics 148:435-443.[Abstract/Free Full Text]

DEVOS, K. M., J. K. M. BROWN, and J. L. BENNETZEN, 2002  Genome size reduction through illegitimate recombination counteracts genome expansion in Arabidopsis.. Genome Res. 12:1075-1079.[Abstract/Free Full Text]

DUBCOVSKY, J. and J. DVORAK, 1995  Ribosomal RNA multigene loci: nomads of the Triticeae genomes. Genetics 140:1367-1377.[Abstract]

DUBCOVSKY, J., D. LIJAVETZKY, L. APPENDINO, and G. TRANQUILLI, 1998  Comparative RFLP mapping of Triticum monococcum genes controlling vernalization requirement. Theor. Appl. Genet. 97:968-975.

DUBCOVSKY, J., W. RAMAKRISHNA, P. SANMIGUEL, C. S. BUSSO, and L. YAN et al., 2001  Comparative sequence analysis of colinear barley and rice BACs. Plant Physiol. 125:1342-1353.[Abstract/Free Full Text]

FARIS, J. D., K. M. HAEN, and B. S. GILL, 2000  Saturation mapping of a gene-rich recombinant hot spot region in wheat. Genetics 154:823-835.[Abstract/Free Full Text]

FEUILLET, C. and B. KELLER, 1999  High gene density is conserved at syntenic loci of small and large grass genomes. Proc. Natl. Acad. Sci. USA 96:8265-8270.[Abstract/Free Full Text]

FEUILLET, C., A. PENGER, K. GELLNER, A. MAST, and A. KELLER, 2001  Molecular evolution of receptor-like kinase genes in hexaploid wheat: independent evolution of orthologs after polyploidization and mechanisms of local rearrangements at paralogous loci. Plant Physiol. 125:1304-1313.[Abstract/Free Full Text]

FREELING, M., 2001  Grasses as a single genetic system: reassessment. Plant Physiol. 125:1191-1197.[Free Full Text]

GALE, M. D. and K. M. DEVOS, 1998  Comparative genetics in the grasses. Proc. Natl. Acad. Sci. USA 95:1971-1974.[Abstract/Free Full Text]

GAUT, B. S., 1998  Molecular clocks and nucleotide substitution rates in higher plants. Evol. Biol. 30:93-120.

GAUT, B. S., B. R. MORTON, B. C. MCCAIG, and M. T. CLEGG, 1996  Substitution rate comparisons between grasses and palms: synonymous rate differences at the nuclear gene Adh parallel rate differences at the plastid gene rbcl.. Proc. Natl. Acad. Sci. USA 93:10274-10279.[Abstract/Free Full Text]

GAUT, B. S., A. S. PEEK, B. R. MORTON, and M. T. CLEGG, 1999  Patterns of genetic diversification within the Adh gene family in the grasses (Poaceae). Mol. Biol. Evol. 16:1087-1097.

GILL, K. S., B. S. GILL, T. R. ENDO, and E. V. BOYKO, 1996a  Identification and high-density mapping of gene-rich regions in chromosome group 5 of wheat. Genetics 143:1001-1012.[Abstract]

GILL, K. S., B. S. GILL, T. R. ENDO, and T. TAYLOR, 1996b  Identification and high-density mapping of gene-rich regions in chromosome group 1 of wheat. Genetics 144:1883-1891.[Abstract]

HULBERT, S. H., C. A. WEBB, S. M. SMITH, and Q. SUN, 2001  Resistance gene complexes: evolution and utilization. Annu. Rev. Phytopathol. 39:285-312.[Medline]

JUKES, T. H., and C. R. CANTOR, 1969 Evolution of protein molecules, pp. 21–132 in Mammalian Protein Metabolism, edited by H. N. MUNRO. Academic Press, New York.

KATO, K., H. MIURA, and S. SAWADA, 1999  Comparative mapping of the wheat Vrn1 region with the rice Hd-6 region. Genome 42:204-209.

KELLER, B. and C. FEUILLET, 2000  Colinearity and gene density in grass genomes. Trends Plant Sci. 5:246-251.[Medline]

KLEIN, P. E., R. R. KLEIN, S. W. CARTINHOUR, P. E. ULANCH, and J. M. DONG et al., 2000  A high-throughput AFLP-based method for constructing integrated genetic and physical maps: progress toward a sorghum genome map. Genome Res. 10:789-807.[Abstract/Free Full Text]

KLEINHOFS, A., A. KILIAN, M. A. SAGHAI, R. M. BIYASHEV, and P. HAYES et al., 1993  A molecular, isozyme and morphological map of the barley (Hordeum vulgare) genome. Theor. Appl. Genet. 86:705-712.

KU, H.-M., T. VISION, J. LIU, and S. D. TANKSLEY, 2000  Comparing sequenced segments of the tomato and Arabidopsis genomes: large-scale duplication followed by selective gene loss creates a network of synteny. Proc. Natl. Acad. Sci. USA 97:9121-9126.[Abstract/Free Full Text]

KUMAR, S., K. TAMURA, I. B. JAKOBSEN, and M. NEI, 2001  MEGA2: molecular evolutionary genetics analysis software. Bioinformatics 17:1244-1245.[Abstract/Free Full Text]

KUNZEL, G., L. KORZUN, and A. MEISTER, 2000  Cytologically integrated physical restriction fragment length polymorphism maps for the barley genome based on translocation breakpoints. Genetics 154:397-412.[Abstract/Free Full Text]

LAGERCRANTZ, U., 1998  Comparative mapping between Arabidopsis thaliana and Brassica nigra indicates that Brassica genomes have evolved through extensive genome replication accompanied by chromosome fusions and frequent rearrangements. Genetics 150:1217-1228.[Abstract/Free Full Text]

LEADER, D. J., G. P. CLARK, J. WATTERS, A. F. BEVEN, and P. J. SHAW et al., 1997  Clusters of multiple different small nucleolar RNA genes in plants are expressed as and processed from polycistronic pre-snoRNAs. EMBO J. 16:5742-5751.[Medline]

LEADER, D. J., G. P. CLARK, J. WATTERS, A. F. BEVEN, and P. J. SHAW et al., 1999  Splicing-independent processing of plant box C/D and box H/ACA small nucleolar RNAs. Plant Mol. Biol. 39:1091-1100.[Medline]

LEISTER, D., J. KURTH, D. A. LAURIE, M. YANO, and T. SASAKI et al., 1998  Rapid reorganization of resistance gene homologues in cereal genomes. Proc. Natl. Acad. Sci. USA 95:370-375.[Abstract/Free Full Text]

LIJAVETZKY, D., G. MUZZI, T. WICKER, B. KELLER, and R. WING et al., 1999  Construction and characterization of a bacterial artificial chromosome (BAC) library for the A genome of wheat. Genome 42:1176-1182.[Medline]

LYNCH, M. and J. S. CONERY, 2000  The evolutionary fate and consequences of duplicate genes. Science 290:1151-1155.[Abstract/Free Full Text]

LYNCH, M. and J. C. CONERY, 2001  Gene duplication and evolution: response. Science 293:U2-U3.

MEYERS, B. C., S. V. TINGLEY, and M. MORGANTE, 2001  Abundance, distribution, and transcriptional activity of repetitive elements in the maize genome. Genome Res. 11:1660-1676.[Abstract/Free Full Text]

MOORE, G., K. M. DEVOS, Z. WANG, and M. D. GALE, 1995  Grasses, line up and form a circle. Curr. Biol. 5:737-739.[Medline]

NEI, M. and T. GOJOBORI, 1986  Simple methods for estimating the numbers of synonymous and nonsynonymous nucleotide substitutions. Mol. Biol. Evol. 3:418-426.[Abstract]

O'NEILL, C. and I. BANCROFT, 2000  Comparative physical mapping of segments of the genome of Brassica oleracea var. alboglabra that are homoeologous to sequenced regions of chromosomes 4 and 5 of Arabidopsis thaliana.. Plant J. 23:233-243.[Medline]

PANSTRUGA, R., R. BUSCHGES, P. PIFFANELLI, and P. SCHULZE-LEFERT, 1998  A contiguous 60 kb genomic stretch from barley reveals molecular evidence for gene islands in monocot genome. Nucleic Acids Res. 26:1056-1062.[Abstract/Free Full Text]

RICHLY, E., J. KURTH, and D. LEISTER, 2002  Mode of amplification and reorganization of resistance genes during recent Arabidopsis thaliana evolution. Mol. Biol. Evol. 19:76-84.[Abstract/Free Full Text]

ROSSBERG, M., K. THERES, A. ACARKAN, R. HERRERO, and T. SCHMITT et al., 2001  Comparative sequence analysis reveals extensive microcolinearity in the lateral suppressor regions of the tomato, Arabidopsis, and Capsella genomes. Plant Cell 13:979-988.[Abstract/Free Full Text]

SANMIGUEL, P. and J. L. BENNETZEN, 1998  Evidence that a recent increase in maize genome size was caused by the massive amplification of intergene retrotransposons. Ann. Bot. 82:37-44.[Abstract/Free Full Text]

SANMIGUEL, P., A. TIKHONOV, Y.-K. JIN, N. MOTCHOULSKAIA, and D. ZAKHAROV et al., 1996  Nested retrotransposon in the intergenic regions of the maize genome. Science 274:765-768.[Abstract/Free Full Text]

SANMIGUEL, P., B. S. GAUT, A. TIKHONOV, Y. NAKAJIMA, and J. L. BENNETZEN, 1998  The paleontology of intergene retrotransposons of maize. Nat. Genet. 20:43-45.[Medline]

SANMIGUEL, P., W. RAMAKRISHNA, J. L. BENNETZEN, C. S. BUSSO, and J. DUBCOVSKY, 2002  Transposable elements, genes and recombination in a 215-kb contig from wheat chromosome 5Am. Funct. Integr. Genomics 2:70-80.[Medline]

SARMA, R. N., B. S. GILL, T. SASAKI, G. GALIBA, and J. SUTKA et al., 1998  Comparative mapping of the wheat chromosome 5A Vrn-A1 region with rice and its relationship to QTL for flowering time. Theor. Appl. Genet. 97:103-109.

SHIRASU, K., A. H. SCHULMAN, T. LAHAYE, and P. SCHULZE-LEFERT, 2000  A contiguous 66 kb barley DNA sequence provides evidence for reversible genome expansion. Genome Res. 10:908-915.[Abstract/Free Full Text]

STEBBINS, G. L., 1981  Coevolution of grasses and herbivores. Ann. Mo. Bot. Gard. 68:75-86.

STEIN, N., C. FEUILLET, T. WICKER, E. SCHLAGENHAUF, and B. KELLER, 2000  Subgenome chromosome walking in wheat: a 450-kb physical contig in Triticum monococcum L. spans the Lr10 resistance locus in hexaploid wheat (Triticum aestivum L.). Proc. Natl. Acad. Sci. USA 97:13436-13441.[Abstract/Free Full Text]

SUTKA, J., G. GALIBA, A. VAGUJFALVI, B. S. GILL, and J. W. SNAPE, 1999  Physical mapping of the Vrn-A1 and Fr1 genes on chromosome 5A of wheat using deletion lines. Theor. Appl. Genet 99:199-202.

TAJIMA, F., 1993  Simple methods for testing the molecular evolutionary clock hypothesis. Genetics 135:599-607.[Abstract]

TARCHINI, R., P. BIDDLE, R. WINELAND, S. TINGEY, and A. RAFALSKI, 2000  The complete sequence of 340 kb of DNA around the rice Adh1-Adh2 region reveals interrupted colinearity with maize chromosome 4. Plant Cell 12:381-391.[Abstract/Free Full Text]

THOMPSON, J. D., T. J. GIBSON, F. PLEWNIAK, F. JEANMOUGIN, and D. G. HIGGINS, 1997  The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 24:4876-4882.

TIKHONOV, A. P., P. J. SANMIGUEL, Y. NAKAJIMA, N. M. GORENSTEIN, and J. L. BENNETZEN et al., 1999  Colinearity and its exceptions in orthologous adh regions of maize and sorghum. Proc. Natl. Acad. Sci. USA