- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Email this article to a friend
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Kalendar, R.
- Articles by Schulman, A. H.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Kalendar, R.
- Articles by Schulman, A. H.
Large Retrotransposon Derivatives: Abundant, Conserved but Nonautonomous Retroelements of Barley and Related Genomes
Ruslan Kalendara, Carlos M. Vicienta, Ofer Pelegb, Kesara Anamthawat-Jonssonc, Alexander Bolshoyb, and Alan H. Schulmana,da MTT/BI Plant Genomics Laboratory, Institute of Biotechnology, Viikki Biocenter, University of Helsinki, FIN-00014 Helsinki, Finland,
b Genome Diversity Center, Institute of Evolution, University of Haifa, 31905 Haifa, Israel,
c Faculty of Sciences, University of Iceland, 108 Reykjavík, Iceland
d Plant Breeding Biotechnology, Plant Production Research, MTT Agrifood Research Finland, 31600 Jokioinen, Finland
Corresponding author: Alan H. Schulman, Institute of Biotechnology, University of Helsinki, P.O. Box 56, Viikinkaari 4, FIN-00014 Helsinki, Finland., alan.schulman{at}helsinki.fi (E-mail)
Communicating editor: D. F. VOYTAS
| ABSTRACT |
|---|
Retroviruses and LTR retrotransposons comprise two long-terminal repeats (LTRs) bounding a central domain that encodes the products needed for reverse transcription, packaging, and integration into the genome. We describe a group of retrotransposons in 13 species and four genera of the grass tribe Triticeae, including barley, with long,
4.4-kb LTRs formerly called Sukkula elements. The
3.5-kb central domains include reverse transcriptase priming sites and are conserved in sequence but contain no open reading frames encoding typical retrotransposon proteins. However, they specify well-conserved RNA secondary structures. These features describe a novel group of elements, called LARDs or large retrotransposon derivatives (LARDs). These appear to be members of the gypsy class of LTR retrotransposons. Although apparently nonautonomous, LARDs appear to be transcribed and can be recombinationally mapped due to the polymorphism of their insertion sites. They are dispersed throughout the genome in an estimated 1.3 x 103 full-length copies and 1.16 x 104 solo LTRs, indicating frequent recombinational loss of internal domains as demonstrated also for the BARE-1 barley retrotransposon.
RETROTRANSPOSONS are ubiquitous in the genomes of plants, animals, and fungi (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
|
Despite conservation for function, the processes of transcription and reverse transcription are highly error prone (![]()
![]()
![]()
![]()
![]()
![]()
![]()
In the first full-length clone of BARE-1 (Z17327), a barley (Hordeum vulgare) LTR retrotransposon, we reported a 4930-bp sequence in the 3' LTR (![]()
5-kb sequences similar to this insertion in a 66-kb stretch of the barley genome (![]()
![]()
![]()
| MATERIALS AND METHODS |
|---|
Plant materials:
Seeds were a gift of Dr. Ole Seberg (Botanical Institute, Copenhagen, Denmark) or of Boreal Plant Breeding Limited (Jokioinen, Finland). Plant DNA was isolated from seedling leaves by the CTAB method essentially as previously described (![]()
Bacterial artificial chromosome manipulations:
The BAC clones of barley genomic DNA, their handling, and DNA blotting from them were described previously (![]()
![]()
![]()
Determination of LARD LTR sequences:
Primers were designed with the Fast PCR program (http://www.biocenter.helsinki.fi/bi/bare-1_html/oligos.htm). They were made to match the ends of the four LARD LTR, or Sukkula, sequences available, two from a 66-kb contiguous sequence at the Rar1 locus of chromosome 2HL (AF254799, 11,2046645; 20,14524,048, 25,78027,570), the insertion in the 3' LTR of BARE-1a (Z17327, nucleotides 575810,687), and one on chromosome 4H near the Mlo gene (Y14573, 42,14037,153). These primers were 91673, described above, and 91674, 5'-CACGCCCAAGATGCGACCCTATCC-3' (Z17327, nucleotides 10,68410,661). The PCR was carried out with 20 ng genomic DNA in a 20-µl reaction containing 50 mM Tris-HCl, pH 9.0, 15 mM (NH4)2SO4, 1.5 mM MgCl2, 0.1% (v/v) Triton X-100, 200 nM each primer, 200 µM dNTPs, and 1 unit DyNAzymeEXT DNA Polymerase (Finnzymes, Espoo, Finland). The amplification program consisted of denaturation at 94°, 2 min; 15 cycles of 94°, 20 sec, 68°, 4 min; and a final elongation step at 72°, 10 min. The PCR was performed in a Master Cycler Gradient (Eppendorf, Madison, WI) or PTC-225 DNA Engine Tetrad (MJ Research, Watertown, MA) in 0.2-ml tubes (AB Advanced Biotechnologies, Epsom, United Kingdom). Products were cloned in the pGEM-5Zf(+) T-vector (Promega, Madison, WI). Full-length sequences for the LTR were derived by primer walking in both directions, and sequencing was carried out in house on automated sequencers (http://www.biocenter.helsinki.fi/bi/dna/).
Determination of internal LARD sequences:
Long-distance PCR for the cloning of LARD internal domains was carried out as for the LTRs for all species and genera except Triticum and Secale. The 5' primer, LTROL2, faced outward from the 5' LTR 280 nt from the terminus and consisted of 5'-GCAGCCTGGGATAGCAAGGATGG-3'. The 3' primer, LTROL, was located 146 nt from the 5' terminus of the 3' LTR and consisted of 5'-CCGGCAGCTACGAACGGATGCAAG-3'. Strong, single bands were obtained, cloned, and sequenced in both directions. For all except Triticum and Secale, the 5' primer, 9900, consisted of 5'-GATAGGGTCGCATCTTGGGCGTGAC-3' and was used in combination with primer LTROL.
Determination of LTR copy number:
The sources and provenances of barley cv. Bomi and of H. roshevitzii, accession no. 7039, are as described previously (![]()
![]()
![]()
![]()
Relative LTR and internal domain abundance:
Amplification reactions were carried out between the LTR and ()-strand priming site (PBS) using the primers LTROL2, above, and primer InTol, 5'-GGTTCGATCAATCAAGGGGGCTC-3'. The LTR fragment was produced with the primers Gossy-L, 5'-ATATCTTGTGCATCGGGATTCC-3', and Gossy-R, 5'-GACATAACCCCACCGTGTCCTC-3'. Amplification was carried out in a 20-µl reaction containing 75 mM Tris-HCl, pH 8.8, 20 mM (NH4)2SO4, 1.5 mM MgCl2, 0.01% Tween-20, 20 ng barley DNA, 200 nM each primer, 200 µM dNTPs, and 1 unit Taq DNA polymerase. The amplification program consisted of 94°, 2 min; 716 cycles of 94°, 20 sec, 56°, 20 sec, 68°, 30 sec; and a final elongation of 68°, 5 min. One-fifth of each reaction (4 µl) was separated by electrophoresis in 2% agarose gels (RESolute wide range agarose; BIOzym, Landgraaf, The Netherlands) by electrophoresis (80100 V) and visualized by ethidium bromide.
Cloning of LTR flanks:
Ligation-mediated PCR (![]()
Fluorescent in situ hybridization:
Chromosomes were prepared from barley root tips as previously described (![]()
![]()
![]()
Data mining, alignment, and sequence analysis:
Preliminary sequence alignments were in some cases made with the PILEUP program available in the Wisconsin package version 10.2 (Oxford Molecular Group) available on the irix server (Center for Scientific Computing, Espoo, Finland; http://www.csc.fi/suomi/info/index.phtml.en). Sequences were aligned using the CLUSTALW multiple alignment program (![]()
![]()
DNA and RNA information content:
To find the relevant signal in the multiple alignments we used the Kullback-Leibler measure (![]()
![]()
![]()
![]()
![]()
![]()
Analysis of multiple alignments of RNAfold predictions:
Candidate regions of RNA secondary structure were provided by the Mfold, version 3.0 (![]()
![]()
![]()
![]()
![]()
![]() |
(1) |
where the index k runs over all RNA secondary-structure elements and gaps. The quantities qi, (, qi, ), qi, . , and qi, , are the observed fractions of 5 double strands (ds), 3 ds, single strands (ss), and gaps correspondingly at position i. The expected probability of single-stranded RNA at every position i, p . , has been found empirically to equal 0.5; consequently ds probabilities p(, p) are equal to 0.25, and the gap background, p, is equal to 1.
Visualization of RNAfolds:
To illustrate the most conserved features of the putative RNA secondary structure in the region between positions 200 and 400 in the multiple alignment file of LARDs we made two types of RNAfold predictions: a common RNA secondary structure and the specific representative of each fragment. The putative common RNAfold was predicted by the GeneBee (![]()
| RESULTS |
|---|
LARD LTRs have long, conserved ends but highly variable centers:
Primers were designed to match the ends of the LARD LTR sequences available in the database. These primers produced bands of 4.14.5 kb from template DNA of all Hordeum species investigated (data not shown). A set of species from the tribe Triticeae of the Poaceae was chosen for examination. The Triticeae (![]()
![]()
![]()
The alignment of the 5' end of the LTR, consisting of 2734 nucleotides (nt; accession ALIGN_000282; access via ftp://ftp.ebi.ac.uk/pub/databases/embl/align/) reveals a highly conserved segment extending from the 5' universal terminus, 5'-TGTGACAGCCCGA ... -3', for
400 nt into the element (Fig 2A). The region is conserved also in H. marinum and H. murinum, which belong to different genome groups than do barley and H. patagonicum and are not closely related to them. The sequence available for a similar element, tasuk10 from wheat (AF029897), is quite divergent from the others and may be degenerate. Alignment of the succeeding segments of the sequences reveals a highly conserved stretch extending inward for
1880 nt from the 3' end of the LTR (Fig 2B and accession ALIGN_000280). Hence,
2250 bp, or
50%, of the
4.5-kb LTR of LARD elements is conserved, but the region that is variable in the alignment is variable even among the sequences determined from the barley genome alone.
|
Retrotransposon LTRs contain short inverted repeats at their termini. Because the PCR primers were designed to match the termini, the actual termini in the sequenced products shown in Fig 2 cannot be analyzed. However, the large clones containing LARD LTRs present in the database, as well as additional sequence data we have collected using subterminal LTR primers, define imperfect 6-bp terminal inverted repeats: 5'-TGTRAC ... GTRACA-3'. These database accessions also universally display the conserved terminal 5'-TG ... CA-3', found in all retroviruses and LTR-containing retrotransposons, and contain 5-bp direct repeats flanking the termini.
LARD internal domains are highly conserved but noncoding:
To amplify the internal region lying between the left and right LTRs, primers were designed so that the 5' primer matched the 3' end of the LTR and the 3' primer matched the 5' end of the LTR. Amplifications were carried out on genomic DNA from 13 species in four genera in the tribe Triticeae, including barley, and generated single bands of
3.5 kb. In all 35 sequences aligned (alignment database accession ALIGN_000601), a highly conserved domain matching the initiator-methionyl tRNA (imettRNA) was found immediately prior to the 5' LTR (Fig 3A). A domain in this position homologous to tRNA is almost universal among retrotransposons and retroviruses, where it serves as the minus-strand priming site (PBS) for synthesis of the initial cDNA strand during reverse transcription (![]()
![]()
|
Immediately internal to the 3' LTR, functional retroelements contain a conserved segment rich in guanine and adenine referred to as a polypurine tract (PPT). The PPT serves as the priming site for the reverse transcription of the plus-strand, the second strand of the cDNA to be synthesized (![]()
![]()
![]()
![]()
The entire internal domain was determined for these 35 sequences. Beyond the PBS, the internal domain was highly conserved on the DNA level for almost all species in a block extending to position 710 in the alignment (alignment accession ALIGN_000601). All five oat sequences lacked a segment from nucleotides 100 to 1010. The Triticum sequences and those of Elymus repens, H. roshevitzii, H. marinum, and one barley sequence lacked the block extending from nucleotide 740 to nucleotide 920 in the alignment. All sequences except those of oat were well conserved from nucleotide 920 to 1140. The sequences of all accessions from nucleotides 1160 to 2320 are well conserved, with insertions or deletions in individual sequences or species. From nucleotides 2340 to 2660 is poorly conserved in all sequences, but highly conserved between 2660 and 2880 and then poorly conserved until nucleotide 3200 in the alignment. The regions from 3500 to 3920 and the terminal region from 4300 to 4418 are highly conserved.
Given a fairly high degree of conservation between elements in different genera, we expected that the internal domains of the LARD elements would contain open reading frames (ORFs) coding for the products universally found in LTR retrotransposons, including integrase and reverse transcriptase. However, searches for ORFs in all six frames failed to reveal any that are sufficiently long to encode the major retrotransposon proteins, which are furthermore generally expressed as a polyprotein. We present one example from barley, the best characterized for retrotransposons of the species considered, typical of the other sequences (Fig 4). In the sense orientation with respect to the reverse transcriptase priming sites, 18 ORFs were longer than 50 nt, the longest 4 being 330, 255, 234, and 174 nt, respectively. Although it is conceivable, as seen in Fig 4, that coding domains for the expected proteins might be split among the reading frames by indels or stop codons, BLAST searches against the protein, DNA, and expressed sequence tag (EST) databases both for the internal domains and for the individual ORFs failed to yield any significant matches to retrotransposon or retroviral products. Therefore, the internal domains are simultaneously well conserved on the DNA level but noncoding. For this reason, the group has been named LARDs.
|
LARD elements are transcribed:
The Sukkula LARD LTR matched the greatest number of ESTs of any retrotransposon query sequence examined (![]()
LARD internal domains form well-conserved RNA secondary structures:
The conservation on the sequence level of the internal domain, in the absence of protein-coding capacity, led us to explore other features possibly related to sequence conservation. Expansion and contraction of sets of repeat units, for example, might lead to homogenization of a region of DNA. However, when we analyzed the sequence for direct and inverted repeats using COMPARE and DOTPLOT of the Wisconsin package (data not shown), no repeats longer than 20 nt of either kind were found, even at a stringency of 14 matches in a scanning window of 21 nt.
The internal domains were then subjected to statistical analysis for information content. Information content is a convenient way to quantify the sequence conservation because it is additive when the positions are independent (![]()
![]()
Thirteen LARD internal regions were aligned by CLUSTALW and to get the total sequence conservation the information content was summed across all positions in alignment (![]()
![]()
|
The information content, or nonrandomness, of the RNA structure was compared with that for RNA corresponding to shuffled sequences (Fig 5B). Sequences were individually randomized (Fig 5B, "shuffled DNA") or randomized by swapping between aligned sequences at each individual position (Fig 5B, "vertically shuffled DNA"). In much of the internal domain, the predicted secondary structure is considerably above that for randomized sequence, particularly between nucleotides 1 and 500, 1200 and 1700, 1900 and 2100, and 2300 and 2500. Furthermore, of the regions with highest information content in the RNA secondary structure, respectively, at nucleotides 200, 500, 1500, 2200, 2500, and 2900 (Fig 5A), only those at nucleotides 2500 and 2900 have DNA conservation in the alignment (Fig 5B) >90%, the others being <70%. The region of high information content in the RNA structure at nucleotide 500 has DNA conservation of only 55%.
We concentrated further analysis on the region of low variability and high information content in the 5' end of the internal domain and searched for candidate regions forming secondary structure with the programs Mfold and RNAfold. Multiple alignments were made of the RNA secondary structures to reveal RNA motifs common for the various LARD sequences in a way similar to our approach earlier (![]()
![]()
|
Genome prevalence and organization of LARD elements in barley:
The copy number for LARDs in the barley genome was estimated by dot blot as before for BARE-1 (![]()
![]()
![]()
We have estimated the relative abundance of LARD LTRs and internal domains in barley cv. Bomi by PCR. Two sets of primers were designed, one to amplify LTR fragments and the other, matched in Tm and product size, to amplify a fragment bridging the LTR and the PBS region. The amplification efficiency of the two sets of primers was tested with equimolar amounts of cloned template and the results were used in normalizing the data. Amplification efficiency was assessed by densitometry at varying numbers of cycles (716), taking the logarithmic portion for analysis. The primer pair detecting the internal region required 1.84 cycles more for the same intensity of product fluorescence as the LTR, and the internal domain controls amplified 1.6 times more efficiently than the LTR (Fig 7). This indicates that the LARD LTRs are 21.6+1.84 or 10.9 times more abundant in the genome than are internal domains. Because intact LTR retrotransposons contain two LTRs flanking an internal domain, a genome containing few solo LTRs of a given family of retroelements should display a ratio of LTRs to internal domains near 2:1. Hence, the haploid barley genome appears to contain only
1300 full-length LARDs and 11,600 solo LTRs. This yields a solo LTR:full-length ratio of 8.9:1 for LARDs in barley. Previous studies have indicated that solo LTRs, produced by LTR-LTR recombination, are relatively common in the barley genome, particularly for retrotransposon BARE-1 (![]()
![]()
![]()
![]()
|
On the local scale, both the original BARE-1 insertion element (![]()
![]()
![]()
![]()
![]()
![]()
![]()
Insertions nested into retrotransposons and other repeated DNA make it difficult to prove that the LARD polymorphisms (![]()
![]()
Using a LARD LTR probe, 7 of 20 BAC clones gave a total of 12 hybridizing bands (Fig 8). For comparison, 19 of the same 20 BAC clones contained a total of 50 BARE-1 LTRs, and 9 of 20 contained internal domains (![]()
![]()
![]()
![]()
|
|
| DISCUSSION |
|---|
LARDs, a distinct group of retroelements:
The LTR retrotransposons and retroviruses generally display conservation of structure against a background of fairly rapid sequence evolution. The RNA processing signals, encoded proteins, and arrangement of the coding domains are remarkably conserved among the major groups of these elements in the eukaryotes (![]()
![]()
![]()
![]()
![]()
A group of small, nonautonomous retroelements, TRIMs, was recently reported (![]()
4.5 kb, among the longest known. Each group is conserved, and elements intermediate between TRIMs and LARDs have not been found. Therefore, the elements in these two groups appear to follow different replicational or life-cycle strategies and probably have distinct histories. The recently reported Dasheng element of rice is of the LARD type (![]()
![]()
LARD internal domains produce conserved RNA structures:
A remarkable feature of the LARD element sequences, determined for 13 species, is the conservation of the noncoding central domain. Folding predictions for the internal domain revealed a highly conserved common RNAfold in 12 out of 13 species. The predicted RNA transcription products for the elements sequenced all show a hairpin loop of
200 bp starting 200 bp downstream from the 3' end of the 5' LTR. The observed conservation of the secondary structure in the LARD internal regions could not be easily explained as an outcome of high sequence conservation; a rather small number of differences in homologous sequences can cause dramatic changes in secondary structure (![]()
![]()
![]()
Thus, positional conservation of a stem-loop structure among related sequences should be seen as a consequence of the biological function of the RNA structure rather than as a consequence of sequence homology. The high conservation of this 5' hairpin in LARD internal domains suggests a biological function, which is still unknown. Hairpin structures have been predicted for the LTRs of plant (![]()
![]()
![]()
![]()
![]()
![]()
LARD elements are abundant in the barley genome:
A combination of DNA gel blot hybridizations, BAC clone analyses, PCR amplification experiments, and fluorescent in situ hybridizations indicate that LARD elements are abundant in the barley genome. We estimate that there are 1.3 x 103 full-length LARDs, and 1.2 x 104 solo LARD LTRs such as Sukkula, in the haploid barley genome. Full-length BARE-1 retrotransposons, by contrast, are present in
1.32 x 104 copies (![]()
![]()
![]()
It was observed previously (![]()
4.5 kb, although only 400 bp at the 5' end and 1900 bp at the 3' end are well conserved between elements. The conserved 3' end is about as long as BARE-1 LTRs are in total. The ratio of solo LTRs to full-length elements is
11:1 for BARE-1 and 8.9:1 for LARDs, despite the difference in their copy number. This is consistent both with dependence on length and with the probability of recombination being independent for each element. Nested retrotransposons appear characteristic of barley and other cereals (![]()
![]()
![]()
LARD elements are active:
Not only are LARD elements abundant in barley, but also they are found throughout the Triticeae, the tribe that includes barley, and in grass species outside the Triticeae. In the Triticeae, in both barley and A. tauschii, the D-genome ancestor of bread wheat, the LARD elements are polymorphic in their insertion sites and serve well in recombinational map construction (![]()
![]()
![]()
![]()
![]()
Which autonomous retrotransposon drives LARD mobility?
The conserved features described above, combined with the polymorphisms in barley varieties for the LARD element Sukkula insertion sites, suggest that LARDs remain active despite their lack of coding capacity. The LARDs therefore appear to be very large nonautonomous retroelements. This interpretation raises the questions of which retroelement class LARDs belong to and which family may provide the complementary proteins required for LARD replication, packaging, and integration. One sequence with a good match by BLAST search criteria with the Sukkula LTR was found within a 211-kb contiguous sequence from T. monococcum (AF326781, nucleotides 10,4294129,635). The matching region was described as Erika-1, a gypsy-like retrotransposon (![]()
![]()
74% similar to the internal domain of barley LARD elements. Although Erika is interrupted by stop codons, the internal domain finds predicted or identified retroelement translation products in BLASTX searches against protein databases. Bagy-1 produces no extensive alignments with barley LARD elements and their LTRs are not similar. The Bagy-1 PBS, 5'-TGGTAACAGA-3', differs by 1 nucleotide from the PBS of LARD elements. However, the Bagy-1 PPT, 5'-GAGGGGGTGAG-3', corresponds poorly to that of the LARDs.
A "Sukkula polyprotein" has been annotated at the Mla locus (AF427791). The two Sukkula elements in the accession appear to be identical copies and partially deleted. Alignment on the DNA level of the region assigned at Mla to the Sukkula polyprotein to the internal domain of our reported LARDs gives only a 38%, and highly gapped, identity, using a Smith-Waterman method, and only a 50-bp region of 66% identity, using a best-local-alignment method. Hence, the relationship between the conserved internal domains of the LARDs reported here and the putative Sukkula polyprotein remains unclear.
Another gypsy-like element, RIRE3 (AB014739) has been described as being similar to Sukkula, which the authors refer to as BARE101 (![]()
Although we have described LARD elements primarily from barley and related grasses, research by Jiang and co-workers (![]()
![]()
![]()
In conclusion, LARDs are a group of nonautonomous retroelements conserved in sequence and structure that appear to be part of a system involving complementation by a family of probably gypsy-like retrotransposons encoding the protein products necessary for replication and integration. It is interesting, in this context, that LARDs are abundant, raising the question of whether LARDs are parasitic on their corresponding autonomous elements or whether the transpositional success of the latter is unaffected by LARD propagation. The existence of two groups of nonautonomous retrotransposons, TRIMs and LARDs, differing significantly in size, raises interesting questions about their comparative replicational strategies and their role in autonomous-nonautonomous binary systems. To our knowledge, similar elements have not been found in animals or fungi. It remains to be seen if these groups of retroelements represent evolutionary innovations in the plants or if they have parallels elsewhere.
| FOOTNOTES |
|---|
Sequence data from this article have been deposited with the EMBL/GenBank Data Libraries under accession nos.
AF453658,
AF453659,
AF453660,
AF453661,
AF453662,
AF453663,
AF453664,
AF453665,
AF453666,
AF453667,
AF453668,
AF453669,
AF453670,
AF453671,
AF453672,
AF453673,
AF453674,
AF453675,
AF453676,
AF453677,
AF453678,
AF453679,
AF453680,
AF453681,
AF453682,
AF453683,
AF453684,
AY054376,
AY054377,
AY054378,
AY054379,
AY054380,
AY054381,
AY069966,
AY069967, ALIGN_000280, ALIGN_000282, and ALIGN_000601. ![]()
| ACKNOWLEDGMENTS |
|---|
We thank Dr. Ole Seberg (Botanical Institute, Copenhagen, Denmark) and Boreal Plant Breeding Ltd. (Jokioinen, Finland) for gifts of plant materials. Andris Kleinhofs (Washington State University, Pullman, WA) is thanked for gifts of BAC clones. The expert technical assistance of Anne-Mari Narvanto is deeply appreciated. Prof. Eduard Trifonov is gratefully acknowledged for helpful discussions. This study was supported by the European Union Research Directorate (contract no. QLK5-CT-1999-01499) and by the Academy of Finland (project 44404).
Manuscript received August 18, 2003; Accepted for publication November 24, 2003.
| LITERATURE CITED |
|---|
ANAMTHAWAT-JÓNSSON, K., J. S. HESLOP-HARRISON and T. SCHWARZACHER, 1996 Genomic in situ hybridization for whole chromosome and genome analysis, pp. 124 in In Situ Hybridization Laboratory Companion, edited by M. CLARK. Chapman & Hall, Weinheim, Germany.
BARKWORTH, M. E., 1992 Taxonomy of the Triticeae: a historical perspective. Hereditas 116:1-14.
BONHOEFFER, S., J. S. MCCASKILL, P. F. STADLER, and P. SCHUSTER, 1993 RNA multi-structure landscapes: a study based on temperature dependent partition functions. Eur. Biophys. J. 22:13-24.[Medline]
BOYKO, E., R. KALENDAR, V. KORZUN, B. GILL, and A. H. SCHULMAN, 2002 Combined mapping of Aegilops tauschii by retrotransposon, microsatellite, and gene markers. Plant Mol. Biol. 48:767-790.[CrossRef][Medline]
BRODSKY, L. I., V. V. IVANOV, Y. L. KALAYDZIDIS, A. M. LENOTOVICH, and V. K. NIKOLAEV et al., 1995 GeneBee-NET: internet-based server for analyzing biopolymers structure. Biochemistry 60:923-928.
CASACUBERTA, J. M., S. VERNHETTES, and M.-A. GRANDBASTIEN, 1995 Sequence variability within the tobacco retrotransposon Tnt1 population. EMBO J. 14:2670-2678.[Medline]
COVER, T. M., and J. A. THOMAS, 1991 Elements of Information Theory. John Wiley & Sons, New York.
DRISCOLL, M. D., M. P. GOLINELLI, and S. H. HUGHES, 2001 In vitro analysis of human immunodeficiency virus type 1 minus-strand strong-stop DNA synthesis and genomic RNA processing. J. Virol. 75:672-686.
FLAVELL, A. J., E. DUNBAR, R. ANDERSON, S. R. PEARCE, and R. HARTLEY et al., 1992 Ty1-copia group retrotransposons are ubiquitous and heterogeneous in higher plants. Nucleic Acids Res. 20:3639-3644.
FONTANA, W., D. A. KONINGS, P. F. STADLER, and P. SCHUSTER, 1993 Statistics of RNA secondary structures. Biopolymers 33:1389-1404.[CrossRef][Medline]
FRANKEL, A. D. and J. A. YOUNG, 1998 HIV-1: fifteen proteins and an RNA. Annu. Rev. Biochem. 67:1-25.[CrossRef][Medline]
FRIANT, S., T. HEYMAN, F. X. WILHELM, and M. WILHELM, 1996 Role of RNA primers in initiation of minus-strand and plus-strand DNA synthesis of the yeast retrotransposon Ty1. Biochemie 78:674-680.[Medline]
GABRIEL, A. and E. H. MULES, 1999 Fidelity of retrotransposon replication. Ann. NY Acad. Sci. 870:108-118.[CrossRef][Medline]
HARTL, D. L., E. R. LOZOVSKAYA, and J. G. LAWRENCE, 1992 Nonautonomous transposable elements in prokaryotes and eukaryotes. Genetica 86:47-53.[CrossRef][Medline]
HEIDMANN, T., O. HEIDMANN, and J. F. NICOLAS, 1988 An indicator gene to demonstrate intracellular transposition of defective retroviruses. Proc. Natl. Acad. Sci. USA 85:2219-2223.
HIGGINS, D. G., J. D. THOMPSON, and T. J. GIBSON, 1996 Using CLUSTAL for multiple sequence alignments. Methods Enzymol. 266:383-402.[Medline]
HOFACKER, I. L., W. FONTANA, P. F. STADLER, L. BONHOEFFER, and M. TACKER et al., 1994 Fast folding and comparison of RNA secondary structures. Monatsh. Chem. 125:167-188.[CrossRef]
HOFACKER, I. L., M. FEKETE, and P. F. STADLER, 2002 Secondary structure prediction for aligned RNA sequences. J. Mol. Biol. 319:1059-1066.[CrossRef][Medline]
HSIAO, C., N. J. CHATTERTON, K. H. ASAY, and K. B. JENSEN, 1995 Phylogenetic relationships of the monogenomic species of the wheat tribe, Triticeae (Poaceae), inferred from nuclear rDNA (internal transcribed spacer) sequences. Genome 38:211-223.[Medline]
HUDAKOVA, S., W. MICHALEK, G. G. PRESTING, R. TEN HOOPEN, and K. DOS SANTOS et al., 2001 Sequence organization of barley centromeres. Nucleic Acids Res. 29:5029-5035.
HWANG, C. K., E. S. SVAROVSKAIA, and V. K. PATHAK, 2001 Dynamic copy choice: steady state between murine leukemia virus polymerase and polymerase-dependent RNase H activity determines frequency of in vivo template switching. Proc. Natl. Acad. Sci. USA 98:12209-12214.
JAEGER, J. A., D. H. TURNER, and M. ZUKER, 1990 Predicting optimal and suboptimal secondary structure for RNA. Methods Enzymol. 183:281-306.[Medline]
JIANG, N., Z. BAO, S. TEMNYKH, Z. CHENG, and J. JIANG et al., 2002a Dasheng: a recently amplified non-autonomous LTR element that is a major component of pericentromeric regions in rice. Genetics 161:1293-1305.
JIANG, N., K. JORDAN, and S. R. WESSLER, 2002b Dasheng and RIRE2. A nonautonomous long terminal repeat element and its putative autonomous partner in the rice genome. Plant Physiol. 130:1697-1705.
KALENDAR, R., T. GROB, M. REGINA, A. SUONIEMI, and A. H. SCHULMAN, 1999 IRAP and REMAP: two new retrotransposon-based DNA fingerprinting techniques. Theor. Appl. Genet. 98:704-711.[CrossRef]
KALENDAR, R., J. TANSKANEN, S. IMMONEN, E. NEVO, and A. H. SCHULMAN, 2000 Genome evolution of wild barley (Hordeum spontaneum) by BARE-1 retrotransposon dynamics in response to sharp microclimatic divergence. Proc. Natl. Acad. Sci. USA 97:6603-6607.
KANKAANPÄÄ, J., L. MANNONEN, and A. H. SCHULMAN, 1996 The genome sizes of Hordeum species show considerable variation. Genome 39:730-735.[Medline]
KERWOOD, D. J., M. J. CAVALUZZI, and P. N. BORER, 2001 Structure of SL4 RNA from the HIV-1 packaging signal. Biochemistry 40:14518-14529.[CrossRef][Medline]
KULLBACK, S. and R. A. LEIBLER, 1951 On information and sufficiency. Ann. Math. Stat. 22:79-86.[CrossRef]
KUMAR, A. and J. BENNETZEN, 1999 Plant retrotransposons. Annu. Rev. Genet. 33:479-532.[CrossRef][Medline]
KUMEKAWA, N., H. OHTSUBO, T. HORIUCHI, and E. OHTSUBO, 1999 Identification and characterization of novel retrotransposons of the gypsy type in rice. Mol. Gen. Genet. 260:593-602.[CrossRef][Medline]
LUCAS, H., G. MOORE, G. MURPHY, and R. B. FLAVELL, 1992 Inverted repeats in the long-terminal repeats of the wheat retrotransposon Wis-2-1A. Mol. Biol. Evol. 9:716-728.[Abstract]
MANNINEN, I. and A. H. SCHULMAN, 1993 BARE-1, a copia-like retroelement in barley (Hordeum vulgare L.). Plant Mol. Biol. 22:829-846.[CrossRef][Medline]
MANNINEN, O., R. KALENDAR, J. ROBINSON, and A. H. SCHULMAN, 2000 Application of BARE-1 retrotransposon markers to map a major resistance gene for net blotch in barley. Mol. Gen. Genet. 264:325-334.[CrossRef][Medline]
MARQUET, R., C. ISEL, C. EHRESMANN, and B. EHRESMANN, 1995 tRNAs as primer of reverse transcriptases. Biochemie 77:113-124.[Medline]
MCALLISTER, B. F. and J. H. WERREN, 1997 Phylogenetic analysis of a retrotransposon with implications for strong evolutionary constraints on reverse transcriptase. Mol. Biol. Evol. 14:69-80.[Abstract]
MEYERS, B. C., S. V. TINGEY, and M. MORGANTE, 2001 Abundance, distribution, and transcriptional activity of repetitive elements in the maize genome. Genome Res. 11:1660-1676.
MONFORT, A., C. M. VICIENT, R. RAZ, P. PUIGDOMENECH, and J. A. MARTINEZ-IZQUIERDO, 1995 Molecular analysis of a putative transposable retroelement from the Zea genus with internal clusters of tandem repeats. DNA Res. 2:255-261.[Abstract]
PELEG, O., S. BRUNAK, E. N. TRIFONOV, E. NEVO, and A. BOLSHOY, 2002 RNA secondary structure and sequence conservation in C1 region of human immunodeficiency virus type 1 env gene. AIDS Res. Hum. Retroviruses 18:867-878.[CrossRef][Medline]
PETERSEN, G. and O. SEBERG, 1997 Phylogenetic analysis of the Triticeae (Poaceae) based on rpoA sequence data. Mol. Phylogenet. Evol. 7:217-230.[CrossRef][Medline]
ROEBUCK, K. A. and M. SAIFUDDIN, 1999 Regulation of HIV-1 transcription. Gene Exp. 8:67-84.
SAMBROOK, J., E. F. FRITSCH and T. MANIATIS, 1989 Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY.
SANMIGUEL, P., A. TIKHONOV, Y. K. JIN, N. MOTCHOULSKAIA, and D. ZAKHAROV et al., 1996 Nested retrotransposons in the intergenic regions of the maize genome. Science 274:765-768.
SCHNEIDER, T. D., 1999 Measuring molecular information. J. Theor. Biol. 201:87-92.[CrossRef][Medline]
SCHNEIDER, T. and R. STEPHENS, 1990 Sequence logos: a new way to display consensus sequences. Nucleic Acids Res. 18:6097-6100.
SCHNEIDER, T. D., G. D. STORMO, L. GOLD, and A. EHRENFEUCHT, 1986 Information content of binding sites on nucleotide sequences. J. Mol. Biol. 188:415-431.[CrossRef][Medline]
SCHULTZ, S. J., M. ZHANG, C. D. KELLEHER, and J. J. CHAMPOUX, 2000 Analysis of plus-strand primer selection, removal, and reutilization by retroviral reverse transcriptases. J. Biol. Chem. 275:32299-32309.
SHIRASU, K., A. H. SCHULMAN, T. LAHAYE, and P. SCHULZE-LEFERT, 2000 A contiguous 66 kb barley DNA sequence provides evidence for reversible genome expansion. Genome Res. 10:908-915.
SIEBERT, P. D., A. CHENCHIK, D. E. KELLOGG, K. A. LUKYANOV, and S. A. LUKYANOV, 1995 An improved PCR method for walking in uncloned genomic DNA. Nucleic Acids Res. 23:1087-1088.
SUONIEMI, A., K. ANAMTHAWAT-JÓNSSON, T. ARNA, and A. H. SCHULMAN, 1996 Retrotransposon BARE-1 is a major, dispersed component of the barley (Hordeum vulgare L.) genome. Plant Mol. Biol. 30:1321-1329.[CrossRef][Medline]
SUONIEMI, A., D. SCHMIDT, and A. H. SCHULMAN, 1997 BARE-1 insertion site preferences and evolutionary conservation of RNA and cDNA processing sites. Genetica 100:219-230.[CrossRef][Medline]
SUONIEMI, A., J. TANSKANEN, and A. H. SCHULMAN, 1998a Gypsy-like retrotransposons are widespread in the plant kingdom. Plant J. 13:699-705.[CrossRef][Medline]
SUONIEMI, A., J. TANSKANEN, O. PENTIKÄINEN, M. S. JOHNSON, and A. H. SCHULMAN, 1998b The core domain of retrotransposon integrase in Hordeum: predicted structure and evolution. Mol. Biol. Evol. 15:1135-1144.[Abstract]
THOMPSON, J. D., D. G. HIGGINS, and T. J. GIBSON, 1994 Clustal W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position specific gap penalties and weight matrix choice. Nucleic Acids Res. 22:4673-4680.
VICIENT, C. M., A. SUONIEMI, K. ANAMTHAWAT-JÓNSSON, J. TANSKANEN, and A. BEHARAV et al., 1999 Retrotransposon BARE-1 and its role in genome evolution in the genus Hordeum.. Plant Cell 11:1769-1784.
VICIENT, C. M., M. JÄÄSKELÄINEN, R. KALENDAR, and A. H. SCHULMAN, 2001a Active retrotransposons are a common feature of grass genomes. Plant Physiol. 125:1283-1292.
VICIENT, C. M., R. KALENDAR, and A. H. SCHULMAN, 2001b Envelope-containing retrovirus-like elements are widespread, transcribed and spliced, and insertionally polymorphic in plants. Genome Res. 11:2041-2049.
VOYTAS, D. F., M. P. CUMMINGS, A. K. KONIECZNY, F. M. AUSUBEL, and S. R. RODERMEL, 1992 Copia-like retrotransposons are ubiquitous among plants. Proc. Natl. Acad. Sci. USA 89:7124-7128.
WICKER, T., N. STEIN, L. ALBAR, C. FEUILLET, and E. SCHLAGENHAUF et al., 2001 Analysis of a contiguous 211 kb sequence in diploid wheat (Triticum monococcum) reveals multiple mechanisms of genome evolution. Plant J. 26:307-316.[CrossRef][Medline]
WITTE, C. P., Q. H. LE, T. BUREAU, and A. KUMAR, 2001 Terminal-repeat retrotransposons in miniature (TRIM) are involved in restructuring plant genomes. Proc. Natl. Acad. Sci. USA 98:13778-13783.
XIONG, Y. and T. H. EICKBUSH, 1990 Origin and evolution of retroelements based upon their reverse transcriptase sequences. EMBO J. 9:3353-3362.[Medline]
YANG, J., H. BOGERD, S. Y. LE, and B. R. CULLEN, 2000 The human endogenous retrovirus K Rev response element coincides with a predicted RNA folding region. RNA 6:1551-1564.[Abstract]
ZEEBERG, B, 2002 Shannon information theoretic computation of synonymous codon usage biases in coding regions of human and mouse genomes. Genome Res. 12:944-955.
ZUKER, M., 1989 On finding all suboptimal foldings of an RNA molecule. Science 244:48-52.
ZUKER, M., 1994 Prediction of RNA secondary structure by energy minimization. Methods Mol. Biol. 25:267-294.[Medline]
ZUKER, M. and P. STIEGLER, 1981 Optimal computer folding of large RNA sequences using thermodynamic and auxiliary information. Nucleic Acids Res. 9:133-148.
This article has been cited by other articles:
![]() |
I. A. Maksakova, Y. Zhang, and D. L. Mager Preferential Epigenetic Suppression of the Autonomous MusD over the Nonautonomous ETn Mouse Retrotransposons Mol. Cell. Biol., May 1, 2009; 29(9): 2456 - 2468. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Wawrzynski, T. Ashfield, N. W.G. Chen, J. Mammadov, A. Nguyen, R. Podicheti, S. B. Cannon, V. Thareau, C. Ameline-Torregrosa, E. Cannon, et al. Replication of Nonautonomous Retroelements in Soybean Appears to Be Both Recent and Common Plant Physiology, December 1, 2008; 148(4): 1760 - 1771. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Kalendar, J. Tanskanen, W. Chang, K. Antonius, H. Sela, O. Peleg, and A. H. Schulman Cassandra retrotransposons carry independently transcribed 5S RNA PNAS, April 15, 2008; 105(15): 5833 - 5838. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Tomita, K. Shinohara, and M. Morimoto Revolver is a New Class of Transposon-like Gene Composing the Triticeae Genome DNA Res, February 1, 2008; 15(1): 49 - 62. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Wicker and B. Keller Genome-wide comparative analysis of copia retrotransposons in Triticeae, rice, and Arabidopsis reveals conserved ancient evolutionary lineages and distinct dynamics of individual copia families Genome Res., July 1, 2007; 17(7): 1072 - 1081. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Ner-Gaon, N. Leviatan, E. Rubin, and R. Fluhr Comparative Cross-Species Alternative Splicing in Plants Plant Physiology, July 1, 2007; 144(3): 1632 - 1641. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Wang, X. Tang, Z. Cheng, L. Mueller, J. Giovannoni, and S. D. Tanksley Euchromatin and Pericentromeric Heterochromatin: Comparative Composition in the Tomato Genome Genetics, April 1, 2006; 172(4): 2529 - 2540. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Nagaki, P. Neumann, D. Zhang, S. Ouyang, C. R. Buell, Z. Cheng, and J. Jiang Structure, Divergence, and Distribution of the CRR Centromeric Retrotransposon Family in Rice Mol. Biol. Evol., April 1, 2005; 22(4): 845 - 855. [Abstract] [Full Text] [PDF] |
||||
- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Email this article to a friend
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Kalendar, R.
- Articles by Schulman, A. H.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Kalendar, R.
- Articles by Schulman, A. H.















