- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Email this article to a friend
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Chang, Y.-L.
- Articles by Zhang, H.-B.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Chang, Y.-L.
- Articles by Zhang, H.-B.
An Integrated Map of Arabidopsis thaliana for Functional Analysis of Its Genome Sequence
Yueh-Long Chang1,a, Quanzhou Tao1,a, Chantel Scheuringa, Kejiao Dinga, Khalid Meksemb, and Hong-Bin Zhangaa Department of Soil and Crop Sciences and Crop Biotechnology Center, Texas A&M University, College Station, Texas 77843-2123
b Department of Plant Soil and General Agriculture, Southern Illinois University, Carbondale, Illinois 62901-4415
Corresponding author: Hong-Bin Zhang, Department of Soil and Crop Sciences and Crop Biotechnology Center, 2123 TAMUS, Texas A&M University, College Station, TX 77843-2123., hbz7049{at}pop.tamu.edu (E-mail)
Communicating editor: C. S. GASSER
| ABSTRACT |
|---|
The genome of the model plant species Arabidopsis thaliana has recently been sequenced. To accelerate its current genome research, we developed a whole-genome, BAC/BIBAC-based, integrated physical, genetic, and sequence map of the A. thaliana ecotype Columbia. This new map was constructed from the clones of a new plant-transformation-competent BIBAC library and is integrated with the existing sequence map. The clones were restriction fingerprinted by DNA sequencing gel-based electrophoresis, assembled into contigs, and anchored to an existing genetic map. The map consists of 194 BAC/BIBAC contigs, spanning 126 Mb of the 130-Mb Arabidopsis genome. A total of 120 contigs, spanning 114 Mb, were anchored to the chromosomes of Arabidopsis. Accuracy of the integrated map was verified using the existing physical and sequence maps and numerous DNA markers. Integration of the new map with the sequence map has enabled gap closure of the sequence map and will facilitate functional analysis of the genome sequence. The method used here has been demonstrated to be sufficient for whole-genome physical mapping from large-insert random bacterial clones and thus is applicable to rapid development of whole-genome physical maps for other species.
ARABIDOPSIS thaliana is a model system for genomic studies of plant species (![]()
![]()
![]()
![]()
![]()
![]()
![]()
Sequence analysis has indicated that the genome of A. thaliana contains
25,498 genes. However, the functions of >90% of the predicted genes remain to be characterized experimentally (ARABIDOPSIS GENOME INITIATIVE 2000). Experimental determination of the functions of these genes and related sequences has been targeted as a goal for the coming decade (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
In this study, we developed a whole-genome integrated physical and genetic map of the A. thaliana ecotype Columbia from a new plant-transformation-competent BIBAC library and integrated it with the existing sequence map of the species. The integration of the new map with the sequence map will significantly accelerate genome research of the model species in many aspects. The complementarity of the new BIBAC library to the source libraries of the sequence map and the competency of the BIBACs for plant transformation will facilitate gap closure of the sequence map and large-scale functional analysis of the genome sequence. Furthermore, because the new map was constructed using a DNA sequencing gel-based fingerprinting method (![]()
![]()
![]()
![]()
| MATERIALS AND METHODS |
|---|
BAC and BIBAC libraries:
A new plant-transformation-competent binary library (the clones are hereafter referred as to BIBACs; Y.-L. CHANG, K. MEKSEM, H.-W. CHUANG, C. SCHEURING and H.-B. ZHANG, unpublished data) and the TAMU (![]()
![]()
The TAMU and IGF BAC libraries are the source libraries of the existing physical and sequence maps of A. thaliana (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Fingerprinting and contig assembly:
BAC and BIBAC DNA were isolated and fingerprinted according to ![]()
![]()
![]()
|
Library screening:
The source clones of the physical map were double spotted on Hybond N+ membrane in a format of 3 x 3 by using the Biomek 2000 Robotic Workstation (Beckman, Fullerton, CA), and the high-density colony filters were prepared according to ![]()
50 kb apart on the chloroplast genome, were used to screen the source BAC and BIBAC libraries, and the IGF BACs derived from mitochondrial DNA were used to search the database of the new physical map. To verify the accuracy of the contigs and anchor them to Arabidopsis genetic maps, 77 restriction fragment length polymorphism (RFLP) markers were selected from the Arabidopsis genetic map (![]()
|
|
| RESULTS AND DISCUSSION |
|---|
Fingerprinting the BAC and BIBAC clones:
To construct the whole-genome physical map that is integrated with the existing sequence map (ARABIDOPSIS GENOME INITIATIVE 2000), we selected 1536 IGF BACs (4 384-well plates), 3072 TAMU BACs (8 384-well plates), and 6144 BIBACs (16 384-well plates) from their libraries. The 10,752 BAC and BIBAC clones were fingerprinted on 224 autoradiographs using the DNA sequencing gel-based restriction fingerprinting method (![]()
Assembling the BAC/BIBAC map:
We scanned the clone fingerprints into image files and edited with the Image program of the FPC package (![]()
![]()
![]()
|
|
|
Origin of the 196 contigs was investigated by colony hybridization using chloroplast DNA as probes and by analyzing the mitochondrial DNA-derived clones of the IGF library (![]()
![]()
![]()
![]()
![]()
![]()
![]()
Verifying the map:
To test the accuracy of the new map, we compared the BAC/BIBAC contigs constructed in this study with the BAC contigs of the existing physical (![]()
95% of the new BAC/BIBAC contigs were consistent with the contigs of the existing physical and sequence maps in both clone content and order (Fig 2), but 5% of the contigs were different. To further verify the accuracy of the new BAC/BIBAC contigs, we screened them with 77 DNA markers selected from the genetic map (![]()
![]()
![]()
Anchoring the physical map contigs to the genetic map:
To anchor the BAC/BIBAC contigs to a public genetic map of A. thaliana, we used the above screening results of the contig BACs and BIBACs with the 77 mapped DNA markers and the database of the TAMU and IGF BAC-based maps of the IGF (![]()
Potential applications of the new integrated map for accelerated genome research of Arabidopsis:
This new integrated map has provided a platform for accelerated genome research of Arabidopsis in many aspects. To test the utility of the integrated map for gap closure in the sequence map, we attempted to close four gaps in the sequence map using the new map. One of the gaps was between clones T2P3 and F2G19 in the sequence map of chromosome 1, and three gaps were between ctg714-ctg719-ctg731-ctg11 in the sequence map of Arabidopsis chromosome 3 (http://genome.wustl.edu/gsc/arab/arabidopsis.html; communicated with Dr. Christopher Town, The Institute for Genome Research). The TAMU and IGF BACs in the contigs were used to search the new BAC/BIBAC contigs. One contig (ctg1024) was identified from the new map to span the gap between T2P3 and F2G19 by two BIBACs (B05G22 and B09C04). The two BIBACs have been used to close the gap of the existing sequence map (C. TOWN, personal communication). Similarly, we searched the contigs of the new map that span the three gaps between ctg714-ctg719-ctg731-ctg11 in chromosome 3. As a result, from the new map we identified three contigs, ctg3010 (810 kb), ctg3011 (345 kb), and ctg3012 (1308 kb). To further determine whether ctg3011 and ctg3012 overlap, we further analyzed the fingerprints at the contig ends under less comparison stringency and screened the BACs and BIBACs of the new map using one IGF BAC clone (F03O21) at a ctg3012 end as a probe. As a result, eight positive BIBAC clones at one of the ctg3011 ends were identified. The hybridization and fingerprint analysis results suggested that the ctg3011 and ctg3012 overlapped and thus were merged. The merged contig (1618 kb) seems to span the gaps between ctg719, ctg731, and ctg11. Therefore, of the four gaps in the AGI sequence map, three were likely to be closed using the new map. Whether the gap between ctg714 and ctg719 in the sequence map can be closed by ctg3010 and ctg3011 remains to be determined. Similarly, other gaps in the existing sequence map could also be closed using the new map.
Experimental determination of the function of the genes and related sequences predicted by the genome sequence analysis of A. thaliana will be a significant challenge. Since A. thaliana can be readily transformed via Agrobacterium (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
In addition to their utility in gap closure and functional analysis of the Arabidopsis genome sequence, the new integrated map and fingerprint database have provided a platform for numerous other studies of not only A. thaliana, but also of many other plant species. We have already received a number of inquiries, including those from the laboratories of the AGI. These studies include gap closure in the sequence map (see above), isolation of genes by positional cloning, identification of the functional sequence of centromeres, studies of gene regulation, engineering of a cluster of genes at a locus, and comparative genomics research between A. thaliana and crop plants. Since errors have recently been identified in the existing sequence map (![]()
![]()
Accessing the integrated map:
The integrated map and the new BIBAC library have been posted at http://hbz.tamu.edu (Physical Mapping-Arab Map) and made available to the public. Users can access the map using any of the following approaches: clone-FPC hitting; clone-graphic contig map; clone-fingerprint map; contig no.-graphic contig map; or marker/EST-positive clones-contig/PFC hit/ fingerprint matches. The contigs, clones, and libraries can be requested at http://hbz.tamu.edu-BAC Library-Library List.
| FOOTNOTES |
|---|
1 These authors contributed equally to this study. ![]()
| ACKNOWLEDGMENTS |
|---|
We thank Dr. Christopher Town for kindly providing the gap information in the Arabidopsis sequence map of AGI. We thank Drs. Huey-Wen Chuang and David A. Lightfoot for critically reading the manuscript. This study was supported in part by Texas Agricultural Experiment Station (8536-203104) and the Texas Higher Education Coordinating Board (999902-042).
Manuscript received July 11, 2001; Accepted for publication August 13, 2001.
| LITERATURE CITED |
|---|
Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. (2000) Nature 408:796-815[Medline].
AZPIROZ-LEEHAN, R. and K. A. FELDMANN, 1997 T-DNA insertion mutagenesis in Arabidopsis: going back and forth. Trends Genet. 13:152-156[Medline].
BECHTOLD, N., J. ELLIS, and G. PELLETIER, 1993 In planta Agrobacterium-mediated gene transfer by infiltration of adult Arabidopsis thaliana plants. C. R. Acad. Sci. Paris Life Sci. 316:1194-1199.
BENT, A. F., B. N. KUNKEL, D. DAHLBECK, K. L. BROWN, and R. SCHMIDT et al., 1994 RPS2 of Arabidopsis thaliana: a leucine-rich repeated class of plant disease resistance genes. Science 265:1856-1860
CANILLERI, C., J. LAFLEURIEL, C. MACADRE, F. VAROQUAUX, and Y. PARMENTIER et al., 1998 A YAC contig map of Arabidopsis thaliana chromosome 3. Plant J. 15:633-642.
CHANG, S. S., S. K. PARK, B. C. KIM, B. J. KANG, and D. U. KIM et al., 1994 Stable genetic transformation of Arabidopsis thaliana by Agrobacterium inoculation in planta. Plant J. 5:551-558.
CHOI, S. D., R. CREELMAN, J. MULLET, and R. A. WING, 1995 Construction and characterization of a bacterial artificial chromosome library from Arabidopsis thaliana. Weeds World 2:17-20.
DESPREZ, T., J. AMSELEM, M. CABOCHE, and H. HOFTE, 1998 Differential gene expression in Arabidopsis seedlings monitored using cDNA arrays. Plant J. 14:643-652[Medline].
FELDMANN, K. A. and M. D. MARKS, 1987 Agrobacterium-mediated transformation of germinating seeds of Arabibdopsis thaliana: a non-tissue culture approach. Mol. Gen. Genet. 208:1-9.
HAMILTON, C. M., A. FRARY, C. LEWIS, and S. D. TANKSLEY, 1996 Stable transfer of intact high molecular weight DNA into plant chromosomes. Proc. Natl. Acad. Sci. USA 93:9975-9979
HAMILTON, C. M., A. FRARY, Y. XU, S. D. TANKSLEY, and H.-B. ZHANG, 1999 Construction of tomato genomic DNA libraries in a binary-BAC (BIBAC) vector. Plant J. 18:223-229.
HWANG, I., T. KOHCHI, B. M. HAUGE, and H. M. GOODMAN, 1991 Identification and map position of YAC clones comprising one-third of the Arabidopsis genome. Plant J. 1:367-374[Medline].
JONES, J. G., L. SHLUMUKOV, F. CARLAND, J. ENGLISH, and S. R. SCOFIELD et al., 1992 Effective vectors for transformation, expression of heterologous genes, and assaying transposon excision in transgenic plants. Transgenic Res. 1:285-297[Medline].
KLEIN, M., U. ECKERT-OSSENKOPP, I. SCHMIEDEBERG, P. BRANDT, and M. UNSELD et al., 1994 Physical mapping of the mitochondrial genome of Arabidopsis thaliana by cosmid and YAC clones. Plant J. 6:447-455[Medline].
KONCZ, C., N. MARTINI, R. MAYERHOFER, Z. KONCZ-KALMAN, and H. KORBER et al., 1989 High-frequency T-DNA-mediated gene tagging in plants. Proc. Natl. Acad. Sci. USA 86:8467-8471
LIN, X., S. KAUL, S. ROUNSLEY, T. P. SHEA, and M.-I. BENITO et al., 1999 Sequence and analysis of chromosome 2 of the plant Arabidopsis thaliana. Nature 402:761-768[Medline].
LIU, Y.-G., N. MITSUKAWA, C. LISTER, C. DEAN, and R. F. WHITTIER, 1996 Isolation and mapping of a new set of 129 RFLP markers in Arabidopsis thaliana using recombinant lines. Plant J. 10:733-736[Medline].
LIU, Y.-G., Y. SHIRANO, H. FUKAKI, Y. YANAI, and M. TASAKA et al., 1999 Complementation of plant mutants with large genomic DNA fragments by a transformation-competent artificial chromosome vector accelerates positional cloning. Proc. Natl. Acad. Sci. USA 96:6535-6540
MARRA, M., T. KUCABA, M. SEKHON, L. HILLER, and R. MARTIENSSEN et al., 1999 A map for sequence analysis of the Arabidopsis thaliana genome. Nat. Genet. 22:265-270[Medline].
MARTIENSSEN, R. A., 1998 Functional genomics: probing plant gene function and expression with transposons. Proc. Natl. Acad. Sci. USA 95:2021-2026
MEINKE, D. W., J. M. CHERRY, C. DEAN, S. D. ROUNSLEY, and M. KOORNNEEF, 1998 Arabidopsis thaliana: a model plant for genome analysis. Science 282:662-682
MOZO, T., S. FISCHER, H. SHIZUYA, and T. ALTMANN, 1998 Construction and characterization of the IGF Arabidopsis BAC library. Mol. Gen. Genet. 258:562-570[Medline].
MOZO, T., K. DEWAR, P. DUNN, J. R. ECKER, and S. FISCHER et al., 1999 A complete BAC-based physical map of the Arabidopsis thaliana genome. Nat. Genet. 22:271-275[Medline].
PALMER, J. D., S. R. DOWNIE, J. M. NUGENT, P. BRANDT, M. UNSELD et al., 1994 Chloroplast and mitochondrial DNAs of Arabidopsis thaliana: conventional genomes in an unconventional plant, pp. 3762 in Arabidopsis, edited by E. M. MEYEROWITZ and C. R. SOMERVILLE. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY.
RUAN, Y., J. GILMORE, and T. CONNER, 1998 Towards Arabidopsis genome analysis: monitoring expression profiles of 1400 genes using cDNA microarrays. Plant J. 15:821-833[Medline].
SCHENA, M., D. SHALON, R. W. DAVIS, and P. O. BROWN, 1995 Quantitative monitoring of gene expression patterns with a complementary DNA microarray. Science 270:467-470
SCHMIDT, R., J. WEST, K. LOVE, Z. LENEHAN, and C. LISTER et al., 1995 Physical map and organization of Arabidopsis thaliana chromosome 4. Science 270:480-483
SODERLUND, C., I. LONGDEN, and R. MOTT, 1997 FPC: a system for building contigs from restriction fingerprinted clones. Comput. Appl. Biosci. 13:523-535
SOMERVILLE, C. and J. DANGL, 2000 Plant biology in 2010. Science 290:2077-2078
STUPAR, R. M., J. W. LILLY, C. D. TOWN, Z. CHENG, and S. KAUL et al., 2001 Complex mtDNA constitutes an approximate 620-kb insertion on Arabidopsis thaliana chromosome 2: implication of potential sequencing errors caused by large-unit repeats. Proc. Natl. Acad. Sci. USA 98:5088-5103.
SULSTON, J., F. MALLETT, R. STADEN, R. DURBIN, and T. HORSNELL et al., 1988 Software for genome mapping by fingerprinting techniques. Comput. Appl. Biosci. 4:125-132
SUSSMAN, M. R., R. M. AMASINO, J. C. YOUNG, P. J. KRYSAN, and S. AUSTIN-PHILLIPS, 2000 The Arabidopsis knockout facility at the University of Wisconsin-Madison. Plant Physiol. 124:1465-1467
TAO, Q.-Z. and H.-B. ZHANG, 1998 Cloning and stable maintenance of DNA fragments over 300 kb in Escherichia coli with conventional plasmid-based vectors. Nucleic Acids Res. 26:4901-4909
TAO, Q.-Z., Y.-L. CHANG, J. WANG, H. CHEN, and C. SCHEURING et al., 2001 Bacterial artificial chromosome-based physical map of the rice genome constructed by restriction fingerprint analysis. Genetics 158:1711-1724
WU, Y., L. TULSIERAM, Q. TAO, H.-B. ZHANG, and S. J. ROTHSTEIN, 2000 A binary vector-based large insert library for Brassica napus and identification of clones linked to a fertility restorer locus for Ogura cytoplasmic male sterility (CMS). Genome 43:102-109[Medline].
ZACHGO, E. A., M. L. WANG, J. DEWNEY, D. BOUCHEZ, and C. CAMILLERI et al., 1996 A physical map of chromosome 2 of Arabidopsis thaliana. Genome Res. 6:19-25
ZHANG, H.-B., S. CHOI, S.-S. WOO, Z. LI, and R. A. WING, 1996 Construction and characterization of two rice bacterial artificial chromosome libraries from the parents of a permanent recombinant inbred mapping population. Mol. Breed. 2:11-24.
This article has been cited by other articles:
![]() |
D. A. Vasco A Fast and Reliable Computational Method for Estimating Population Genetic Parameters Genetics, June 1, 2008; 179(2): 951 - 963. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Xu, M. A. van den Berg, C. Scheuring, L. Covaleda, H. Lu, F. A. Santos, T. Uhm, M.-K. Lee, C. Wu, S. Liu, et al. Genome physical mapping from large-insert clones by fingerprint analysis with capillary electrophoresis: a robust physical map of Penicillium chrysogenum Nucleic Acids Res., March 14, 2005; 33(5): e50 - e50. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Wu, S. Sun, P. Nimmakayala, F. A. Santos, K. Meksem, R. Springman, K. Ding, D. A. Lightfoot, and H.-B. Zhang A BAC- and BIBAC-Based Physical Map of the Soybean Genome Genome Res., February 1, 2004; 14(2): 319 - 326. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Ren, M.-K. Lee, B. Yan, K. Ding, B. Cox, M. N. Romanov, J. A. Price, J. B. Dodgson, and H.-B. Zhang A BAC-Based Physical Map of the Chicken Genome Genome Res., December 1, 2003; 13(12): 2754 - 2758. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. C. Wendl and R. H. Waterston Generalized Gap Model for Bacterial Artificial Chromosome Clone Fingerprint Mapping and Shotgun Sequencing Genome Res., December 1, 2002; 12(12): 1943 - 1949. [Abstract] [Full Text] [PDF] |
||||
- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Email this article to a friend
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Chang, Y.-L.
- Articles by Zhang, H.-B.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Chang, Y.-L.
- Articles by Zhang, H.-B.

DNA/Sau 3AI markers labeled with [33P]dATP were used in the first lane and every seventh lane thereafter. Note that the bands derived from the cloning vector of the new BIBAC library, pCLD04541 (




