- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Lukens, L.
- Articles by Osborn, T.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Lukens, L.
- Articles by Osborn, T.
Comparison of a Brassica oleracea Genetic Map With the Genome of Arabidopsis thaliana
Lewis Lukensa, Fei Zoub, Derek Lydiatec, Isobel Parkinc, and Tom Osbornaa Department of Agronomy, University of Wisconsin, Madison, Wisconsin 53711,
b Department of Biostatistics, University of North Carolina, Chapel Hill, North Carolina 27599
c Agriculture and Agri-Food Canada, Saskatoon Research Centre, Saskatoon, Saskatchewan S7N 0X2, Canada
Corresponding author: Lewis Lukens, Crop Science Bldg., University of Guelph, Guelph, ON N1G 2W1, Canada., llukens{at}uoguelph.ca (E-mail)
Communicating editor: O. SAVOLAINEN
| ABSTRACT |
|---|
Brassica oleracea is closely related to the model plant, Arabidopsis thaliana. Despite this relationship, it has been difficult to both identify the most closely related segments between the genomes and determine the degree of genome replication within B. oleracea relative to A. thaliana. These difficulties have arisen in part because both species have replicated genomes, and the criteria used to identify orthologous regions between the genomes are often ambiguous. In this report, we compare the positions of sequenced Brassica loci with a known position on a B. oleracea genetic map to the positions of their putative orthologs within the A. thaliana genome. We use explicit criteria to distinguish orthologous from paralogous loci. In addition, we develop a conservative algorithm to identify collinear loci between the genomes and a permutation test to evaluate the significance of these regions. The algorithm identified 34 significant A. thaliana regions that are collinear with >28% of the B. oleracea genetic map. These regions have a mean of 3.3 markers spanning 2.1 Mbp of the A. thaliana genome and 2.5 cM of the B. oleracea genetic map. Our findings are consistent with the hypothesis that the B. oleracea genome has been highly rearranged since divergence from A. thaliana, likely as a result of polyploidization.
ONE major goal of plant biologists is to compare the genomic information available from model species to other, nonmodel species for which genetic maps are available. If genome structures are highly conserved, candidate genes in the model species that correspond to loci mapped in the nonmodel species can be quickly identified. In addition, differences between the nonmodel genome and model genome can be used to infer the frequency of genome duplications and rearrangements over time.
The genus Brassica is an excellent system with which to develop tools for genome comparison and to examine the divergence of genome structure. Brassica species are closely related to the model plant species, Arabidopsis thaliana. Both Brassica and Arabidopsis are classified within the same family, the Brassicaceae, and diverged
20 MYA (![]()
![]()
![]()
![]()
Despite the close relationship between Brassica species and A. thaliana, whole-genome mapping studies have found that the order of loci in Brassica genetic maps is only infrequently similar to the order of homologous loci in the A. thaliana genome (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
The conserved and rearranged regions between Brassica and Arabidopsis genomes have been interpreted in different ways, leading to fundamental disagreements about Brassica genome structure. ![]()
![]()
![]()
Some of the difficulties in resolving intergenomic relationships and shared orthologous regions have arisen because genome replication confounds orthologous and paralogous relationships between loci (Fig 1A). The A. thaliana genome is partially duplicated (ARABIDOPSIS GENOME INITIATIVE 2000), and Brassica sequences can detect paralogous A. thaliana duplicates (![]()
|
Finally, the lack of explicit criteria when evaluating putative orthologous regions and the potential for bias when making genome comparisons can lead to misclassification of intergenomic relationships (![]()
![]()
![]()
Here, we report on a comparison between a genetic map of B. oleracea and the A. thaliana genome using approaches that reduce the confounding effect of paralogous sequences. In addition, we developed an algorithm written in PERL that uses explicit criteria to identify orthologous regions and to establish their significance. Consistent with previous reports, we found evidence for substantial genomic replication in B. oleracea as compared to A. thaliana and found evidence that multiple chromosomal rearrangements have occurred since the species' divergence. However, we also found that the B. oleracea genetic map and the A. thaliana genome sequence share 34 significant, collinear regions. The average putative orthologous segment has 3.3 markers corresponding to 2.1 Mbp in A. thaliana and 7.1 cM in B. oleracea. In total, the significant regions identified in this study cover well over one-fourth of the B. oleracea genome. Of 22 previously published regions of predicted orthology, our algorithm identified 20, 17 of which were significant at P < 0.05.
Our data suggest three separate inferences. First, in general, published reports of collinear regions appear to have sampled highly conserved areas between the Brassica and A. thaliana genomes. Second, different interpretations of Brassica genome structure may have arisen because of different criteria used to define homologous regions between Brassica and A. thaliana. Finally, differences in the genomic arrangements between A. thaliana and B. oleracea appear to be due to the recent history of polyploidy in B. oleracea.
| MATERIALS AND METHODS |
|---|
A. thaliana sequence information source and B. oleracea map source:
The B. oleracea genetic map was developed by ![]()
The sequences of BACs used to assemble the A. thaliana genome sequence were downloaded from TIGR, http://www.tigr.org, on May 20, 2001. The number of nucleotides within all BACs totaled 132,101,284 bp. Dr. Eva Huala (Arabidopsis Information Resource) kindly provided the order of BACs and the estimated starting and ending position for each BAC within the A. thaliana genome on February 15, 2001. The nucleotide positions of BACs within the genome are estimates.
Sequencing and plasmid insert information:
Brassica DNA mapping fragments were cloned into a pUC18-derived plasmid, pIJ2925, and two sequences were obtained for each clone. Most of these clones contain PstI fragments of genomic DNA, although a few pW clones contain EcoRI genomic DNA fragments, and all have been used in mapping experiments. They are present in low-copy number within the Brassica genome with a mean of 1.8 polymorphic loci in B. oleracea. Sequencing reactions were performed using ABI Big-Dye Terminator cycle sequencing reagents. Reactions contained the ABI mix, ddH20, 500 ng of plasmid DNA, and 3.2 pmol of M13 forward or reverse primer to a final volume of 20 µl. The cycle-sequencing conditions were as follows: 25 cycles for 10 sec at 95°, 5 sec at 50°, and 4 min at 60°. Unincorporated nucleotides were removed by passing the reaction mixture through a Sephadex G-50 column. Sequence reactions were analyzed with an automated DNA sequencer (ABI model 377XL or 377-96) and base-pair calls were confirmed by visual inspection of chromatograms.
To identify molecular markers that had similar sequences but different names, we used blastn (![]()
![]()
![]()
BLAST analysis:
To identify BACs with nucleotide sequences similar to the Brassica query sequences, we used blastn. Low-complexity sequences were filtered in the blast analysis, and default values for cost (mismatch cost = -3.0), reward (match reward = 1.0), and word size (11 bp) were selected. The default gap opening penalty (5.0) and the gap extension penalty (2.0) were also selected. We recorded the bit score to evaluate sequence relationships. We did not align A. thaliana and Brassica sequences by eye in order to calculate additional sequence distances or other statistics. The number of marker sequences and the number of detected homologs in A. thaliana made such an approach impractical.
The results from the blastn analysis were parsed using a spreadsheet and short PERL scripts that we wrote for this purpose. From each "hit" to the A. thaliana BAC database by a Brassica query sequence, we retrieved the BAC name, the bit score, and the significance value. The nucleotide start position of the BAC was used as the approximate position of the Brassica query sequence in the A. thaliana genome. If both sequences from the same fragment detected the same BAC, only the highest scoring match was kept. Additional parsing was done to remove redundant data. If a query sequence had significant sequence similarity to BACs that overlapped (had overlapping base-pair intervals) or were immediately adjacent to each other (had shared beginning or ending nucleotide positions), a single BAC that was assigned the lower nucleotide position in the chromosome was recorded. This procedure would cause local/tandem duplications of a single gene within A. thaliana to be defined as a single locus. The raw and parsed data sets can be downloaded from http://www.plant.uoguelph.ca/faculty/llukens.
Collinearity analysis:
"Conserved linkage" or "collinearity" (![]()
![]()
![]()
|
Our method to assess the significance of an observed collinear region using both definitions is based on the approach outlined by ![]()

A is the approximate distance (in kilobases) between the outermost loci detected within a collinear region in A. thaliana. The term R is the mean ratio of genetic distance (centimorgans) to physical distance (kilobases) for the A. thaliana chromosome on which the segment lies. R is 135/29,000, 97/17,463, 101/23,560, 125/22,140, 139/26,170 for chromosomes 15, respectively (![]()
To evaluate the probability that an observed collinear region arose by chance, an empirical permutation procedure was developed. In each permutation, markers from B. oleracea linkage groups were assigned randomly and uniformly to the A. thaliana genome. After the permutation, the scoring metric was calculated for each collinear region found in the permuted data set. If the permuted data set did not contain a collinear region of a given length, the metric was assigned a high value. The procedure was repeated 1000 times to obtain the expected distribution of scores for a particular length of a collinear region under the null hypothesis that collinear regions are due to chance association between loci. To obtain the P value for an observed collinear region, its distance metric was calculated and compared with the distribution of the metric scores for collinear regions of the same length between the same linkage group and chromosome generated from the permutations. The percentage of the scores from the permutations that were less than the observed score was defined as the P value for the observed collinear region.
The relative importance of the number of loci and the distance between loci in determining the significance of a collinear region is arbitrary. With our standards, S(n) must be considerably lower than S(n + 1) to be significant. For example, between O2 and At5, the 5% quantile of the scoring metric under the null distribution for collinear regions with two shared loci was 2.31 cM vs. 7.14 cM for three shared loci. Collinear regions with more than four shared loci were claimed to be significant at the 5% level.
The comparative data between B. oleracea and A. thaliana were parsed in two ways before testing for significant collinear regions. Both manipulations tended to increase the number of observed, collinear regions. First, if two or more markers had similarity to the same BAC in A. thaliana or shared the same centimorgan location in B. oleracea, we assigned the order of these markers or sequences relative to each other by eye. Second, in each pairwise comparison between a B. oleracea linkage group and A. thaliana chromosome, each locus on a B. oleracea linkage group was allowed a single position on the A. thaliana chromosome. Duplicates caused the algorithm to identify unlikely collinear regions. For example, if one linkage group of B. oleracea contained three closely linked loci of which two were recently duplicated, then this entire region would incorrectly be inferred to be collinear with a region defined by only two loci on an A. thaliana chromosome.
| RESULTS |
|---|
Identification of A. thaliana sequences putatively orthologous to B. oleracea marker sequences:
The A. thaliana BAC database was queried with sequences from a total of 158 Brassica DNA probes using the nucleotide pattern-matching program blastn (![]()
45 million years prior to the divergence of A. thaliana and B. oleracea (![]()
![]()
![]()
|
Of the 158 Brassica probes, sequences from 131 (83%) have significant similarity to one or more A. thaliana BACs. Four-fifths of the probes with significant similarity to A. thaliana have putative orthology to only a single BAC, and each probe has similarity to a mean of 1.4 loci within the A. thaliana genome. Only two probes have similarity to more than three BACs (Fig 4).
|
The majority of B. oleracea linkage groups are strongly associated with a single A. thaliana chromosome. Over one-half of the probes that mapped to five of the nine B. oleracea linkage groups were putatively orthologous to BACs within a single chromosome (O2 and At5; O4 and At2; O5 and At1; O8 and At1; O9 and At5; Table 1). Despite this general association, probes that map to each B. oleracea linkage group have putative orthologs throughout the A. thaliana genome, suggesting extensive chromosomal repatterning has occurred since the divergence of these two species (Table 1). In addition, sequences are not uniformly distributed across A. thaliana chromosomes, and the P value of the goodness-of-fit test was <0.05. On the basis of comparisons of total and expected numbers in Table 1, sequences with similarity to At5 are overrepresented in the B. oleracea genome, while sequences with similarity to At3 are underrepresented.
|
Significant collinear regions between B. oleracea and A. thaliana:
Pairwise comparisons between each B. oleracea linkage group and A. thaliana chromosome show that several adjacent markers within a linkage group may correspond to adjacent markers within a single chromosome. The pairwise comparison between linkage groups and chromosomes reduces the confounding effect of markers mapped to paralogous regions within the B. oleracea genome (Fig 1C) because Brassica markers that have homology to different chromosomes are not simultaneously examined for collinearity.
Long collinear regions are shared between O1 and At4, O5 and At1, O3 and At5, as well as between several other linkage groups and chromosomes (Fig 5). Nonetheless, the relationship between many genomic regions remains ambiguous. Some markers may be closely linked in one genome but not in the other genome. For example, two loci >20 cM apart on O1 lie within 1 Mbp on At1 (Fig 5). Or, some markers closely linked in both genomes may clearly delineate a region of putative orthology, but another marker may not be closely linked to them. For example, two markers are tightly linked within O3 and At3 (both at 17.5 Mbp); a third marker is also tightly linked in B. oleracea, but it lies at 20 Mbp in A. thaliana.
|
The collinearity test identified all collinear, putatively orthologous regions with confidence. Using the strict definition or the general definition of collinearity with both species as testers (see MATERIALS AND METHODS), the algorithm aligned a total of 240 cM of the 872-cM B. oleracea genetic map, or 28% of the B. oleracea genome, to A. thaliana within 34 significant collinear regions at a P < 0.05 level of significance (Table 2). The algorithm identified regions that appeared collinear in the visual examination, regions such as the interval shared between O1 and At4. The largest segment of significant collinearity is the region in O5 that is putatively orthologous to At1. This region has 13 markers and spans 45.3 cM, corresponding to 7.4 Mbp within At1, and is significant at P < 0.01 (Table 2). Every B. oleracea linkage group has a significant collinear region on at least one A. thaliana chromosome (Table 2). Collinear regions contain an average of 3.3 markers corresponding to 2.1 Mbp in A. thaliana and 7.1 cM in B. oleracea. The distribution of the number of markers that define collinear regions is highly skewed because one-half of the regions have only two markers, while all but one of the remaining regions have three, four, or five markers. As a result, the median collinear region length is 2.5 markers corresponding to 695,000 bp and 2.5 cM. Marker density may be an important factor in detecting collinear regions. O6 has the fewest number of markers and only a single significant collinear region.
|
The true length of each collinear region extends for some distance beyond the outermost loci detected here. An estimate of the true length of a particular segment can be made using the equation from ![]()
Seven of the nine B. oleracea linkage groups have regions that are collinear with more than one A. thaliana chromosome (Fig 6), again suggesting that numerous translocations have occurred since the divergence of the two species. In addition, closely linked regions within a single B. oleracea linkage group may be collinear to different segments within the same A. thaliana chromosome, suggesting intrachromosomal rearrangements. On O7, for example, one collinear region (pN64pN101) spans 9.610.1 Mbp within At5. This region lies adjacent to another region defined by pO169 and pN20 on the genetic map, but putative orthologs to these markers span 19.820.4 Mbp within At5 (Fig 6). Because neither interval corresponds to the known duplicated segment within At5 (ARABIDOPSIS GENOME INITIATIVE 2000), this region likely defines an intrachromosomal rearrangement that has occurred since the divergence of B. oleracea and A. thaliana. Additional collinear segments that are contiguous on the genetic map and hybridize to the same A. thaliana chromosome are found on O3, O5, and O9. However, within these linkage groups, neighboring markers on the genetic map have similarity to the same general area within the A. thaliana genome, indicating either a local rearrangement of markers or that markers were misplaced on the B. oleracea genetic map.
|
The relationship between the B. oleracea genetic map distances and the A. thaliana genome sequence can be inferred from regions within both genomes with a large number of shared markers. In most such regions (e.g., O4 and At3, O9 and At5), the genetic distance between each marker on the B. oleracea linkage group is roughly proportional to the nucleotide distance between each marker on the A. thaliana chromosome (Table 2, Fig 5), indicating that the relationship between the genetic distance in B. oleracea and physical distance in A. thaliana is linear for these regions. A notable exception is the longest region of putative homology shared between O5 and At1. As would be expected if the frequency of recombination per nucleotide is greater at chromosome ends than toward the centromere, the change in centimorgans vs. the change in base pairs is greater between markers at the end of the linkage group than between markers in the center. At the top of the linkage group, markers are distantly linked in B. oleracea but lie fairly close together on the A. thaliana chromosome. The slope progressively declines toward the center of the B. oleracea linkage group as the markers approach the centromere of At1 located at
14 Mbp (ARABIDOPSIS GENOME INITIATIVE 2000). Markers that are closely linked in the center of O5 lie very far apart on At1.
| DISCUSSION |
|---|
Sequence similarity and orthology between B. oleracea and A. thaliana:
Identification of orthologous regions between two genomes depends on correctly identifying orthologous sequences that are shared between the genomes (Fig 1). Orthology and the significance of a relationship between two sequences are very difficult to infer (![]()
![]()
![]()
![]()
In this study, we define criteria by which to accept or reject A. thaliana sequences as likely orthologs to Brassica sequences on the basis of the distribution of blastn scores and the a priori knowledge of expected nucleotide differences between paralogous sequences within the A. thaliana genome (![]()
5 Mbp (ARABIDOPSIS GENOME INITIATIVE 2000). Without an outgroup, it is difficult to infer whether the small number of A. thaliana duplicated loci detected as orthologous to Brassica sequences arose before or after the divergence of A. thaliana from B. oleracea.
The degree of conservation inferred between genomes depends on the cutoff score used to define putative orthology. In this analysis, if sequence alignments with low blastn scores were considered orthologous, additional collinear regions would be identified. For example, the marker pW105 is positioned at 27 cM on B. oleracea LG 1 and is similar to a sequence at 10 Mbp in A. thaliana ch. 4 with a score of 119. Under less stringent criteria, pW105 could be considered collinear with pR36, a marker positioned at 35 cM on LG 1 and aligned to A. thaliana ch. 4 at 12 Mbp with a blastn score of 74. Similarly, if only sequence alignments with high blastn scores were considered orthologous, some collinear regions reported in this analysis, such as the region defined by markers pO125 and pR72 on LG 2, would not be identified. Marker pO125 is positioned at 75 cM and is aligned to a sequence at 9 Mbp on A. thaliana ch. 3 with a blastn score of 234. Marker pR72 is positioned at 75 cM and is aligned to a sequence at 9 Mbp on A. thaliana ch. 3 with a blastn score of 92. Finally, Brassica markers may be similar to a region of the A. thaliana genome to which other linked Brassica markers have no similarity. These singleton hits increase as the stringency for orthology is reduced.
Statistical test of collinearity:
Probes may be closely linked in one genome while their orthologs are distantly linked in another (Fig 1D and Fig 5). In such a case, it is difficult to determine if the intervening region between the probes is orthologous. This determination often depends on the researcher's judgment, and orthologous regions may be identified using very liberal criteria (![]()
![]()
Despite these similarities, this collinearity approach differs from Gaut's approach in several major respects. First, the metric to evaluate collinear regions integrates physical distance from one genome with genetic distance from the other. Second, the collinearity program evaluates collinear regions nested within longer regions. This characteristic is important because a long collinear region shared between linkage groups or chromosomes may not be significant, but nested collinear regions within this long interval may be significant. Third, markers that lie at the end of one collinear region and at the beginning of another collinear region are evaluated in both positions (i.e., pN121 in Fig 2). Fourth, although both models use empirical permutations to estimate statistical significance, in each permutation of our model, markers from each linkage group of one species are randomly and uniformly assigned a position within the second species' genome. Thus, to evaluate significance, the algorithm compares the metric of each collinear region with the expected distribution of the metric under the null hypothesis that collinear regions are due to chance association between loci across genomes. In contrast, in Gaut's permutation, markers from each linkage group are randomly and uniformly assigned to positions within every linkage group of the second species. Finally, the algorithm can evaluate genomes for collinear regions using both general and strict definitions of collinearity.
Under the general definition of collinearity, the identity of collinear regions may depend on which chromosome or linkage group is a tester and which is a standard in the comparison (Fig 2). In this study, if a collinear region was detected when a chromosome was used as both a standard and a tester, it was recorded. Collinear regions identified using the strict definition of collinearity were also recorded. Under the general definition of collinearity, many collinear regions that were significant when using one chromosome as the tester but not significant when using the same chromosome as the standard did not correspond to orthologous intervals. For example, the markers pO169 and pR36 define a significant, collinear region between O7 and At5 where At5 is the standard (Fig 2A). However, within this region, six markers lie between the last two markers (pN20 and pR36) within the B. oleracea genetic map, and all six markers have putative orthologs to different regions within At5 (Fig 2B).
Our method of scoring collinear regions, inferring collinearity, and establishing significance is based on several assumptions. First, when calculating the scoring metric, the ratio between genetic distance and physical distance is assumed to be constant over each A. thaliana chromosome. Nonetheless, because the genetic distance and physical distance are known to vary across A. thaliana chromosomes (![]()
![]()
Several published reports of genome comparisons between A. thaliana and Brassica species were compared to our results to test the utility of the collinearity algorithm and the effect of these assumptions. If previously identified collinear regions were among the significant collinear regions identified by our analysis, then we judged that our algorithm has high utility. We used two distinct criteria to infer if previously reported homologous/orthologous regions were among those identified by the collinearity analysis. Under the first criterion, two requirements had to be met. The region of A. thaliana identified in our analysis must overlap with a region detected in the previous analysis. In addition, the B. oleracea region identified in our analysis must lie on a linkage group that was likely homologous or homeologous to the Brassica linkage group reported previously. A second, less stringent criterion was used if relationships between linkage groups could not be inferred because of experimental design (i.e., ![]()
The collinearity test identified almost all putative homologous/orthologous regions reported in previous genetic and/or physical comparative mapping studies (Table 3). Out of 22 previously reported regions of similarity, the collinearity test identified 20. In addition, the test identified several regions that have not been reported previously and could be targets of future studies. Of the 22 previously characterized regions, 17 were collinear and significant at P < 0.05, and 3 additional regions were collinear but not significant, reflecting the conservative nature of the test. Two published regions were not identified by our analysis. Our collinearity analysis assigned somewhat more than one-quarter of the B. oleracea genetic map to putatively orthologous regions within A. thaliana (see below). The fact that our analysis identified over three-quarters of the previously reported collinear regions suggests both that comparative studies have not randomly sampled the Brassica genome for regions of collinearity and that long, conserved regions may be overrepresented in the literature.
|
Analysis of the ancestral Brassica genome:
Several studies have provided evidence that the base Brassica genome is highly duplicated (![]()
![]()
![]()
![]()
![]()
![]()
![]()
|
Replication and rearrangements within the B. oleracea genome:
Our analysis does show that numerous chromosomal translocations, deletions, and duplications differentiate A. thaliana from B. oleracea. For example, different regions of O3 have high similarity to all five A. thaliana chromosomes (Fig 3). In addition, Brassica markers often have putative orthologs within a region of the A. thaliana genome to which other linked Brassica markers have no similarity (i.e., between O5 and At4), suggesting that relatively short sequences have transferred between chromosomes. Such events have been identified in the recent evolutionary history of humans (![]()
![]()
Although inferences about the rate of change within two genomes require a third, outgroup genome for comparison, we nonetheless suggest that most rearrangements reported here occurred within the Brassica lineage since its divergence from the Brassica-Arabidopsis common ancestor. If the A. thaliana genome has had many duplications since its divergence from the Brassica-Arabidopsis common ancestor, one would expect that more than one region of the A. thaliana genome would correspond to a single region within B. oleracea. This occurs infrequently (Fig 4), and only two regions of the B. oleracea genome were found by the collinearity test to be associated with more than one region in A. thaliana (Fig 6). Likewise, if many large deletions have occurred in the A. thaliana genome since its divergence from the Brassica-Arabidopsis common ancestor, large portions of the B. oleracea genome would not be similar to regions within the A. thaliana genome. With the possible exception of the top of O6, all regions of B. oleracea have similarity to A. thaliana regions (Table 1, Fig 5). The concept that genome change has occurred predominantly during Brassica evolution (as opposed to Arabidopsis) is also consistent with both linkage and microcollinearity studies. The genomes of the genus Brassica and that of A. thaliana likely diverged
2024 MYA (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
The large number of differences between the genomes may in part have been the result of the recent polyploidization of B. oleracea. Extensive repatterning of a genome subsequent to polyploidization has been suggested to account for the scattered, duplicate segments within the yeast genome (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
| ACKNOWLEDGMENTS |
|---|
We thank the Arabidopsis Genome Initiative for open access to the sequence data. Financial support was provided by a National Science Foundation Postdoctoral Fellowship in Bioinformatics to L.L. and a National Science Foundation grant to T.O.
Manuscript received July 1, 2002; Accepted for publication January 25, 2003.
| LITERATURE CITED |
|---|
ACARKAN, A., M. ROSSBERG, M. KOCH, and R. SCHMIDT, 2000 Comparative genome analysis reveals extensive conservation of genome organization for Arabidopsis thaliana and Capsella rubella.. Plant J. 23:55-62.[Medline]
ALTSCHUL, S., T. MADDEN, A. SCHÄFFER, J. ZHANG, and Z. ZHANG et al., 1997 Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25:3389-3402.
Analysis of the genome sequence of the flowering plant Arabidopsis thaliana.. (2000) Nature 408:796-812.[Medline]
BENNETZEN, J., 2000 Comparative sequence analysis of plant nuclear genomes: microcolinearity and its many exceptions. Plant Cell 12:1021-1029.
BOHUON, E. J. R., L. D. RAMSAY, J. A. CRAFT, A. E. ARTHUR, and D. F. MARSHALL et al., 1998 The association of flowering time quantitative trait loci with duplicated regions and candidate loci in Brassica oleracea. Genetics 150:393-401.
BRUBAKER, C. L., A. H. PATERSON, and J. F. WENDEL, 1999 Comparative genetic mapping of allotetraploid cotton and its diploid progenitors. Genome 42:184-203.
CAMARGO, L. E. A., L. SAVIDES, G. JUNG, J. NIENHUIS, and T. C. OSBORN, 1997 Location of the self-incompatability locus in an RFLP and RAPD map of B. oleracea. J. Hered. 88:57-60.
CAVELL, A. C., D. LYDIATE, I. PARKIN, C. DEAN, and M. TRICK, 1998 Collinearity between a 30cM segment of Arabidopsis thaliana chromosome 4 and duplicated regions within the Brassica napus genome. Genome 41:62-69.[Medline]
COPENHAVER, G., W. BROWNE, and D. PREUSS, 1998 Assaying genome-wide recombination and centromere functions with Arabidopsis tetrads. Proc. Natl. Acad. Sci. USA 95:247-252.
DEKEN, J., 1983 Probabilistic behavior of longest common subsequence length, pp. 359362 in Time Warps, String Edits, and Macromolecules: The Theory and Practice of Sequence Comparison, edited by D. SANKOFF and J. KRUSKAL. Addison-Wesley, Reading, MA.
EHRLICH, J., D. SANKOFF, and J. NADEAU, 1997 Synteny conservation and chromosome rearrangements during mammalian evolution. Genetics 147:289-296.[Abstract]
GALLOWAY, G. L., R. L. MALMBERG, and R. A. PRICE, 1998 Phylogenetic utility of the nuclear gene arginine decarboxylase: an example from Brassicaceae.. Mol. Biol. Evol. 15:1312-1320.[Abstract]
GAUT, B., 2001 Patterns of chromosomal duplication in maize and their implications for comparative maps of the grasses. Genome Res. 11:55-66.
GRANT, D., P. CREGAN, and R. SHOEMAKER, 2000 Genome organization in dicots: genome duplication in Arabidopsis and synteny between soybean and Arabidopsis.. Proc. Natl. Acad. Sci. USA 97:4168-4173.
GRANT, M., J. MCDOWELL, A. SHARPE, M. DE TORRES ZABALA, and D. LYDIATE et al., 1998 Independent deletions of a pathogen-resistance gene in Brassica and Arabidopsis.. Proc. Natl. Acad. Sci. USA 95:15843-15848.
HAGA, T., 1938 Relationship of genome to secondary pairing in Brassica.. Jpn. J. Genet. 13:277-284.
HELENTJARIS, T., D. WEBER, and S. WRIGHT, 1988 Identification of the genomic locations of duplicate nucleotide sequences in maize by analysis of restriction fragment length polymorphism. Genetics 118:353-356.
HU, J., J. SADOWSKI, T. C. OSBORN, B. S. LANDRY, and C. F. QUIROS, 1998 Linkage alignment from four independent Brassica oleracea RFLP maps. Genome 41:226-235.
KOCH, M., B. HAUBOLD, and T. MITCHELL-OLDS, 2000 Comparative evolutionary analysis of chalcone synthase and alcohol dehydrogenase loci in Arabidopsis, Arabis, and related genera. Mol. Biol. Evol. 17:1482-1498.
KOCH, M., B. HAUBOLD, and T. MITCHELL-OLDS, 2001 Molecular systematics of the Brassicaceae: evidence from coding plastidic matK and nuclear chs sequences. Am. J. Bot. 88:534-544.
KOWALSKI, S., T. LAN, K. FELDMANN, and A. PATERSON, 1994 Comparative mapping of Arabidopsis thaliana and Brassica oleracea chromosomes reveal islands of conserved organization. Genetics 138:499-510.[Abstract]
LAGERCRANTZ, U., 1998 Comparative mapping between Arabidopsis thaliana and Brassica nigra indicates that Brassica genomes have evolved through extensive genome replication accompanied by chromosome fustions and frequent rearrangements. Genetics 150:1217-1228.
LAGERCRANTZ, U. and D. LYDIATE, 1996 Comparative genome mapping in Brassica.. Genetics 144:1903-1910.[Abstract]
LAN, T. and A. PATERSON, 2000 Comparative mapping of quantitative trait loci sculpting the curd of Brassica oleracea.. Genetics 155:1927-1954.
LAN, T., T. DELMONTE, K. REISCHMANN, J. HYMAN, and S. KOWALSKI et al., 2000 An EST-enriched comparative map of Brassica oleracea and Arabidopsis thaliana.. Genome Res. 10:776-788.
LISTER, C. and C. DEAN, 1993 Recombinant inbred lines for mapping RFLP and phenotypic markers in Arabidopsis thaliana.. Plant J. 4:745-750.
LIU, B., J. M. VEGA, and M. FELDMAN, 1998 Rapid genomic changes in newly synthesized amphiploids of Triticum and Aegilops. II. Changes in low-copy coding DNA sequences. Genome 41:535-542.[Medline]
LYNCH, M. and J. CONERY, 2000 The evolutionary fate and consequences of duplicate genes. Science 290:1151-1155.
NADEAU, J. and B. TAYLOR, 1984 Lengths of chromosomal segments conserved since divergence of man and mouse. Proc. Natl. Acad. Sci. USA 81:814-818.
O'KEEFE, C., and E. EICHLER, 2000 The pathological consequences and evolutionary implications of recent human genetic duplications, pp. 2946 in Comparative Genomics, edited by D. SANKOFF and J. H. NADEAU. Kluwer Academic Publishers, Dordrecht, The Netherlands.
O'NEILL, C. and I. BANCROFT, 2000 Comparative physical mapping of the genome of Brassica oleracea var. alloglabra that are homeologous to sequenced regions of chromosome 4 and 5 of Arabidopsis thaliana.. Plant J. 23:233-243.[Medline]
PARKIN, I., 2000 Unraveling crucifer genomes through comparative mapping, pp. 425437 in Comparative Genomics, edited by D. SANKOFF and J. H. NADEAU. Kluwer Academic Publishers, Dordrecht, The Netherlands.
PARKIN, I., D. J. LYDIATE, and M. TRICK, 2002 Assessing the level of collinearity between Arabidopsis thaliana and Brassica napus for A. thaliana chromosome 5. Genome 45:356-366.[Medline]
QUIROS, C. F., F. GRELLET, J. SADOWSKI, T. SUZUKI, and G. LI et al., 2001 Arabidopsis and Brassica comparative genomics: sequence, structure and gene content in the ABI1-RPS2-Ck1 chromosomal segment and related regions. Genetics 157:1321-1330.
RÖBBELEN, G., 1960 Beiträge zur analyse des Brassica genomes. Chromosoma 11:205-228.[Medline]
ROSSBERG, M., K. THERES, A. ACARKAN, R. HERRERO, and T. SCHMITT et al., 2001 Comparative sequence analysis reveals extensive microcolinearity in the lateral suppressor regions of the tomato, Arabidopsis, and Capsella genomes. Plant Cell 13:979-988.
RYDER, C. D., L. B. SMITH, G. R. TEAKLE, and G. J. KING, 2001 Contrasting genome organisation: two regions of the Brassica oleracea genome compared with collinear regions of the Arabidopsis thaliana genome. Genome 44:808-817.[Medline]
SCHRANZ, M. E., P. QUIJADA, S. SUNG, L. LUKENS, and R. AMASINO et al., 2002 Characterization and effects of the replicated flowering time gene FLC in Brassica rapa.. Genetics 162:1457-1468.
SHAKED, H., K. KASHKUSH, H. OZKAN, M. FELDMAN, and A. LEVY, 2001 Sequence elimination and cytosine methylation are rapid and reproducible responses of the genome to wide hybridization and allopolyploidy in wheat. Plant Cell 13:1749-1759.
SONG, K., P. LU, K. TANG, and T. OSBORN, 1995 Rapid genome change in synthetic polyploids of Brassica and its implications for polyploid evolution. Proc. Natl. Acad. Sci. USA 92:7719-7723.
TRUCO, M. J., J. HU, J. SADOWSKI, and C. F. QUIROS, 1996 Inter- and intra-genomic homology of the Brassica genomes: implications for their origin and evolution. Theor. Appl. Genet. 93:1225-1233.
WARWICK, S. I. and L. D. BLACK, 1997 Phylogenetic implications of chloroplast DNA restriction site variation in subtribes Raphaninae and Cakilinae (Brassicaceae, tribe Brassicieae). Can. J. Bot. 75:960-973.
WOLFE, K. and D. SHIELDS, 1997 Molecular evidence for an ancient duplication of the entire yeast genome. Nature 387:708-713.[Medline]
YUAN, Y., O. EULENSTEIN, M. VINGRON, and P. BORK, 1998 Toward detection of orthologues in sequence databases. Bioinformatics 14:285-289.
This article has been cited by other articles:
![]() |
H. T.T. Phan, S. R. Ellwood, K. Adhikari, M. N. Nelson, and R. P. Oliver The First Genetic and Comparative Map of White Lupin (Lupinus albus L.): Identification of QTLs for Anthracnose Resistance and Flowering Time, and a Locus for Alkaloid Content DNA Res, May 26, 2007; (2007) dsm009v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. S. Kim, T. Y. Chung, G. J. King, M. Jin, T.-J. Yang, Y.-M. Jin, H.-I. Kim, and B.-S. Park A Sequence-Tagged Linkage Map of Brassica rapa Genetics, September 1, 2006; 174(1): 29 - 39. [Abstract] [Full Text] [PDF] |
||||
![]() |


0.05; **, statistical significance at P 
60 are plotted. The number of Arabidopsis thaliana BACs detected by markers rises asymptotically at lower BLAST scores. Sequence matches with scores <82 (identified by the arrow) were not included in our analysis. For plotting, one outlier with a bit score of 1104 was removed.




