Genetics, Vol. 152, 1723-1731, August 1999, Copyright © 1999

DNA Variation in the Wild Plant Arabidopsis thaliana Revealed by Amplified Fragment Length Polymorphism Analysis

Naohiko T. Miyashitaa, Akira Kawabea, and Hideki Innanb
a Laboratory of Plant Genetics, Graduate School of Agriculture, Kyoto University, Kyoto, 606-8502 Japan
b Department of Biological Sciences, Graduate School of Science, University of Tokyo, Tokyo 113-0033, Japan

Corresponding author: Naohiko T. Miyashita, Laboratory of Plant Genetics, Graduate School of Agriculture, Kyoto University, Sakyo-ku, Kyoto, 606-8502 Japan., arabis{at}kais.kyoto-u.ac.jp (E-mail)

Communicating editor: J. A. BIRCHLER


*  ABSTRACT
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

To investigate the level and pattern of DNA variation of Arabidopsis thaliana at the entire genome level, AFLP analysis was conducted for 38 ecotypes distributed throughout the world. Ten pairs of selective primers were used to detect a total of 472 bands, of which 374 (79.2%) were polymorphic. The frequency distribution of polymorphic bands was skewed toward an excess of singleton variation. On the basis of AFLP variation, nucleotide diversity for the entire genome was estimated to be 0.0106, which was within the range reported previously for specific nuclear genes. The frequency distribution of pairwise distance was bimodal because of an ecotype (Fl-3) with a large number of unique bands. Linkage disequilibrium between polymorphic AFLPs was tested. The proportion of significant linkage disequilibria was close to random expectation after neglecting the ecotype Fl-3. This result indicates that the effect of recombination could not be ignored in this selfing species. A neighbor-joining tree was constructed on the basis of the AFLP variation. This tree has a star-like topology and shows no clear association between ecotype and geographic origin, suggesting a recent spread of this plant species and limited migration between its habitats.


THE amplified fragment length polymorphism (AFLP) technique is a method of DNA fingerprinting (VOS et al. 1995 Down), which combines strategies of restriction fragment length polymorphism (RFLP) and polymerase chain reaction (PCR). AFLP is carried out in four distinct steps: restriction enzyme digestion of DNA, ligation of adaptors to the restricted sites, PCR amplification of restricted fragments with primers that bind to the adaptor sequence and adjacent selective nucleotides, and acrylamide gel electrophoresis. Usually, restriction enzymes with two different specificities, e.g., EcoRI and MseI, are used to generate a large number of fragments. PCR amplification with the specific primers ensures reliable and reproducible detection of restricted fragments. Acrylamide gel electrophoresis is sufficiently sensitive to distinguish fragments differing in length by only 1 bp. Thus, AFLP can be a powerful technique to detect a large number of bands with high reproducibility and sensitivity.

AFLP analysis has been applied to the genomes of many plants, mostly crop species, to construct genetic maps (barley: BECKER et al. 1995 Down; WAUGH et al. 1997 Down; melon: WANG et al. 1997 Down; potato: VAN DER VOORT et al. 1997 Down; Arabidopsis thaliana: ALONSO-BLANCO et al. 1998 Down) and to study phylogenetic relationship and genetic diversity among cultivars (barley: RUSSELL et al. 1997 Down; SCHUT et al. 1997 Down; cassava: ROA et al. 1997 Down; Eucalyptus: GAIOTTO et al. 1997 Down; hop: HARTL and SEEFELDER 1998 Down; maize: MARSAN et al. 1998 Down; wheat: DONINI et al. 1997 Down). In all of these studies, AFLP has detected a large number of polymorphisms more efficiently than other DNA technologies, such as RFLP, random amplified polymorphic DNA (RAPD), and microsatellite or simple sequence repeat polymorphism (SSR) analyses. In A. thaliana, AFLP analysis was used to construct a fine genetic map (ALONSO-BLANCO et al. 1998 Down). AFLP markers were detected in the entire genome, although they were not distributed proportionally to physical length of chromosomes. There were clusterings of AFLP markers around the centromeric regions of all five chromosomes. The reason for this clustering pattern is not known, but might involve low recombination and/or physical abundance of AFLP markers in centromeric regions.

Recently, estimates of nucleotide variation in A. thaliana have been reported (KING et al. 1993 Down; HANFSTINGL et al. 1994 Down; INNAN et al. 1996 Down; KAWABE et al. 1997 Down; PURUGGANAN and SUDDITH 1998 Down). Although this plant species has a high selfing rate (ABBOT and GOMES 1989 Down), the level of DNA variation at the species level (among ecotypes) as revealed by RFLP and sequencing studies was shown to be as high as those in other outcrossing species. These studies estimated that nucleotide diversity (NEI and LI 1979 Down) is ~0.01. In addition, INNAN et al. 1997 Down also reported that A. thaliana had a high level of variation at microsatellite loci, which was comparable to that of animal and outbreeding plant species. The population structure of A. thaliana was studied by RFLP analysis on three CAPS regions by BERGELSON et al. 1998 Down. They reported that interpopulation variation was three to seven times larger than intrapopulation variation. This result was indicative of limited migration and/or frequent extinction/recolonization and consistent with the selfing nature of this plant species. The phylogenetic relationship among ecotypes of A. thaliana and the evolutionary history of the species were also studied on the basis of DNA variations (KING et al. 1993 Down; INNAN et al. 1996 Down, INNAN et al. 1997 Down; KAWABE et al. 1997 Down). In all these studies, no relationship was detected between geographical origin and clustering in the phylogenetic tree, suggesting a recent spread and/or long-distance dispersal of A. thaliana throughout the world.

In this report, AFLP analysis was used to study the level and pattern of naturally occurring DNA variation in the wild plant species A. thaliana. Because this technique can detect polymorphisms in many loci of the genome simultaneously, it is possible to study DNA variation within the entire genome. We estimated genome-wide nucleotide diversity (the average pairwise number of nucleotide substitutions per site) from AFLP data by using a method developed by INNAN et al. 1999 Down and compared the estimate with those obtained by sequencing studies on specific nuclear genes. In addition, linkage disequilibrium between AFLPs was tested to infer the effect of recombination on the pattern of DNA polymorphism. In selfing species like A. thaliana, the whole genome could be regarded as a single linkage group because of effectively low recombination (CHARLESWORTH et al. 1993 Down). Furthermore, it was shown that one characteristic of DNA polymorphism in the A. thaliana nuclear genome was the presence of dimorphism (INNAN et al. 1996 Down; KAWABE et al. 1997 Down; PURUGGANAN and SUDDITH 1998 Down). If the whole genome is a single linkage group, and dimorphism extends over the entire genome, then a high level of linkage disequilibria is expected. Finally, on the basis of AFLP variation, the phylogenetic relationship among ecotypes was studied. The implications of these data for the evolutionary history of A. thaliana are discussed.


*  MATERIALS AND METHODS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Plant materials:
Thirty-eight ecotypes of A. thaliana were used (Table 1). Seeds were obtained from Nobuharu Goto, Sendai Arabidopsis Seed Stock Center, Miyagi University of Education, Sendai, Japan. Plants were grown from seeds in pots placed in an incubator under 24-hr light conditions.


 
View this table:
In this window
In a new window

 
Table 1. Arabidopsis thaliana ecotypes used in this study

AFLP analysis:
Total DNA was extracted from mature plants (KAWABE et al. 1997 Down). AFLP analysis was carried out according to the manufacturer's protocol (AFLP Analysis System II; GIBCO/BRL, Grand Island, NY). Total DNA (100–250 ng) was double digested with EcoRI (recognition sequence 5'-GAATTC-3') and MseI (TTAA). After ligating EcoRI and MseI adaptors to the digested DNA, preamplification was conducted with an EcoRI primer (EcoRI adaptor sequence) and an MseI primer (MseI adaptor sequence + C as a selective nucleotide). The preamplification product was used as a template for selective amplification. Selective amplification was conducted using an EcoRI primer containing two selective nucleotides and a MseI primer containing three selective nucleotides. Ten combinations of the selective primers were used (Table 2). All the EcoRI selective primers were labeled at the 5' end with fluorescein Cy-5 as supplied by the manufacturer (Pharmacia, Piscataway, NJ). The products of selective amplification were denatured at 90° for 3 min. Electrophoresis on a denaturing 6% acrylamide gel was conducted with an ALFred sequencer (Pharmacia). The gel image was visualized by the program ALFwin ver. 1.00 (Pharmacia), and band peaks were detected by the program AlleleLinks ver. 1.00 (Pharmacia) with default settings. Bands were scored by confirming peaks on the gel image. Most bands corresponded to fragments <500 bp long. Band sizes were estimated by AlleleLinks using 2 internal markers (50 and 300 bp) and 10 external markers (50-bp ladder) as size standards.


 
View this table:
In this window
In a new window

 
Table 2. Summary of AFLP variation in Arabidopsis thaliana

Data analysis:
The presence and absence of AFLP bands were scored for each ecotype. As a measure of DNA variation (genetic distance) between ecotypes, the number of nucleotide substitutions per site was estimated, following the method of INNAN et al. 1999 Down. The average value of all the pairwise distances was regarded as a measure of nucleotide polymorphism (nucleotide diversity, {pi}). Linkage disequilibrium between AFLPs was tested by the chi-square test. From Figure 2 and Appendix 1 of ALONSO-BLANCO et al. 1998 Down, we could identify the chromosomal location of 38 of the AFLPs detected in this study. These 38 AFLPs were used to study linkage disequilibrium within and between chromosomes. A neighbor-joining tree was constructed on the basis of the pairwise distance by PHYLIP ver. 3.57 (FELSENSTEIN 1993 Down).

Computer simulation:
The expected frequency spectrum of AFLPs was investigated by computer simulation that was conducted according to the genealogical relationship among sampled individuals (INNAN et al. 1999 Down). In the simulation, it was assumed that the genome size per haploid individual was 1000 kbp and that the number of fragments per haploid individual was ~23, which was the estimate obtained in this study. The genome was divided into five segments (the length of each segment was 200 kbp) because A. thaliana has five chromosomes, although the number of segments does not affect the expected spectrum. For each segment of the genome, a random genealogy and an ancestral sequence were generated, and mutations took place randomly following the genealogy. The estimated {pi} (0.0106 for n = 38 and 0.0100 for n = 37) was used as the mutation parameter (4Nµ). Finally, the AFLP bands of the descendant individuals were scored and the frequency spectrum was obtained. The number of replications was 1000.


*  RESULTS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

AFLP in Arabidopsis thaliana:
A total of 472 bands were detected by AFLP among 38 ecotypes by using 10 combinations of selective primers (Table 1 and Table 2). Average band length was 225.8 bp, but bands of ~150-bp long were detected most often (Figure 1). The number of bands decreased as the size increased. Bands longer than 600 bp were rarely detected. The number of bands varied for different pairs of selective primers (Table 2). The combination of EcoRI + TA and MseI + CTT gave the smallest number of bands (27); all other primer combinations detected >30 bands. The average number of bands per ecotype per primer pair was 22.9 ± 0.2. The ecotype Fl-3 sampled in Finland had the largest number of bands per primer pair (28.7), including 60 unique bands. The smallest number of bands per primer pair was detected in ecotype Mr-0 sampled in Italy (21.0 bands). When all the bands were considered, each ecotype had a distinct haplotype. Even when singleton bands were eliminated, none of the ecotypes shared identical haplotypes. This indicates that AFLP could be used as a powerful tool to identify ecotypes in A. thaliana.



View larger version (14K):
In this window
In a new window
Download PPT slide
 
Figure 1. Frequency distribution of fragment size detected by AFLP analysis on A. thaliana. n = 472; average is 225.8 bp.

Level of nucleotide variation revealed by AFLP:
A total of 374 (79.2%) of the AFLP bands were polymorphic, and 98 (20.8%) were fixed. Although the number of bands varied among primer pairs, no relationship was seen between the number of bands and selective nucleotides used in a particular primer pair. In addition, the number of bands did not correlate with the proportion of polymorphic bands for different primer pairs. The frequency distribution of polymorphic bands was asymmetric for both actual and simulation data (Figure 2). The asymmetric distribution in the simulation experiment is due to the fact that mutation tends to form a new band rather than a band that comigrates with a preexisting AFLP band, even though nucleotide mutation occurs at random. This situation is similar to the infinite allele model (KIMURA and CROW 1964 Down). However, singleton bands (22.0%, all ecotypes; 13.1%, all ecotypes except Fl-3) were found at a higher frequency than expected by the simulation. The frequencies of fixed and single-loss classes were reduced. The other allelic classes were found rather uniformly at low frequencies. This result indicates that DNA variations at intermediate frequencies are not common in the A. thaliana nuclear genome.



View larger version (20K):
In this window
In a new window
Download PPT slide
 
Figure 2. Frequency spectrum of AFLP in A. thaliana.

On average, 83.1% of all bands were shared between ecotypes (Table 2). Because of its large number of unique bands, the ecotype Fl-3 shared only ~70% of bands with other ecotypes. Interestingly, a laboratory strain Col-0 shared 94.6% of bands with the ecotype Xxx-0 sampled in Spain. By using the method of INNAN et al. 1999 Down, a genome-wide estimate of nucleotide diversity ({pi}) was obtained (Table 2). Although estimates varied for different primer pairs, the average value (0.0106) was comparable to the values obtained by sequencing protein coding regions of A. thaliana (0.0080 for Adh: INNAN et al. 1996 Down; 0.0104 for ChiA: KAWABE et al. 1997 Down; 0.007 for Cal: PURUGGANAN and SUDDITH 1998 Down). When the ecotype Fl-3 was eliminated from the calculation, the estimate of nucleotide diversity decreased slightly from 0.0106 to 0.0100.

The frequency distribution of pairwise nucleotide distance had two peaks (Figure 3). Because of many unique bands in Fl-3, estimates of pairwise distance between Fl-3 and the other ecotypes were large, producing a peak around 0.023. The other peak around 0.01 was composed of estimates between the rest of the ecotypes. The smallest distance (0.0031) was obtained between the ecotypes Col-0 and Xxx-1, while the largest (0.0240) was that between Fl-3 and Xxx-1. Figure 3 also shows the frequency distributions of pairwise nucleotide distance for nuclear genes in A. thaliana and Arabis gemmifera (Adh: INNAN et al. 1996 Down; MIYASHITA et al. 1996 Down; ChiA: KAWABE et al. 1997 Down). The distributions had several peaks and wide variance, although averages were about the same as for AFLP. The peaks were due to dimorphic DNA variation in Adh and ChiA. In ChiA of A. thaliana, three separate peaks were detected, caused by the presence of three divergent ecotypes.



View larger version (24K):
In this window
In a new window
Download PPT slide
 
Figure 3. Frequency distribution of pairwise nucleotide difference in A. thaliana and Ar. gemmifera.

Linkage disequilibrium between AFLPs:
Linkage disequilibrium was examined by the chi-square test between all the polymorphic bands (Table 3). There were 69,751 combinations, of which 7952 were significant at least at the 5% level. The proportion of significant pairs (11.4%) was higher than that expected by chance and similar to that of genome-wide linkage disequilibrium (12.1%) obtained between 20 microsatellite loci in A. thaliana (INNAN et al. 1997 Down). Among significant pairs, the proportion of highly significant pairs (P < 0.5%) was in excess, and was detected 15 times more than expected by chance. However, after excluding the ecotype Fl-3 with many unique bands, the overall frequency of significant linkage disequilibrium decreased (6.3%), which was slightly higher than random expectation (5%). This result indicates that unique variations in Fl-3 contributed to the high proportion of significant linkage disequilibria. The unique variations in Fl-3 produced singleton gamete types in two- and three-gamete combinations, which resulted in significance in the chi-square test (or Fisher's exact test).


 
View this table:
In this window
In a new window

 
Table 3. Linkage disequilibrium between 374 AFLPs in 38 ecotypes of Arabidopsis thaliana

For the 38 AFLPs of known chromosomal locations, 7.7% (54/703) of combinations were in significant linkage disequilibrium, which was similar to that of the overall data. In intrachromosomal comparisons, the percentage of significant pairs was 6.5% (9/139), while the percentage for interchromosomal comparisons was 8.0% (45/564). These percentages were not statistically different ({chi}2 = 0.36, d.f. = 1, P = 0.60). For 20 microsatellite loci of A. thaliana (INNAN et al. 1997 Down), the percentage of significant inter- and intrachromosomal linkage disequilibria was also similar: significant interchromosomal linkage disequilibrium was 12.6% (6/55), and intrachromosomal was 10.9% (17/135; {chi}2 = 0.09, d.f. = 1, P = 0.77). Although microsatellite variations gave a higher proportion of significant pairs than AFLP, the difference between types of variation was not statistically significant in either comparison ({chi}2 = 3.05, d.f. = 1, P = 0.08 for overall data; {chi}2 = 0.91, d.f. = 1, P = 0.34 for intrachromosomal comparison; and {chi}2 = 2.34, d.f. = 1, P = 0.13 for interchromosomal comparison). From these results, it could be concluded that the level of significant linkage disequilibria in A. thaliana is not high and is close to the level expected due to random chance.

Phylogenetic relationship between A. thaliana ecotypes based on AFLP variation:
Because AFLP analysis detects DNA sequence variations throughout the entire genome, AFLP data can be used to study the evolutionary relationship among ecotypes of A. thaliana. Previously, evolutionary relationships were studied using data on the polymorphism of 20 microsatellite loci, which were also detected throughout the genome (INNAN et al. 1997 Down). However, the distance measure used in that study (the average squared difference in repeat number) may not vary in a manner that is proportional to time. Although the measure has a linear relationship with time under the stepwise mutation model (GOLDSTEIN et al. 1995 Down), the mutation model was shown to be inappropriate to explain the level of microsatellite polymorphism in A. thaliana (INNAN et al. 1997 Down). In this study, the average number of nucleotide substitutions per site was estimated. Although many factors influence the rate of nucleotide substitutions (KIMURA 1983 Down), it is likely that nucleotide substitution is more linear with respect to time than change in repeat number in microsatellite loci. Therefore, the phylogenetic tree obtained here could reflect phylogenetic relationships more accurately in terms of time.

A neighbor-joining tree was constructed on the basis of the pairwise distance between each pair of ecotypes (Figure 4). The topology is star-like, where the tree diversified rapidly, and each ecotype has a long branch. Branch length is relatively constant except for the ecotype Fl-3. In the tree based on microsatellite variation (Figure 3 in INNAN et al. 1997 Down), there was variation in branch length leading to each ecotype. The variation in branch length of that tree could be due to sporadic occurrence of microsatellite mutations. In Figure 4, ecotypes sampled in relatively adjacent places, e.g., three Finnish ecotypes, were found close to each other in the tree structure, but there was no clear association between tree clusters and geographic origin. This result is consistent with previous results of RFLP and sequencing studies (KING et al. 1993 Down; PRICE et al. 1994 Down; INNAN et al. 1996 Down, INNAN et al. 1997 Down; KAWABE et al. 1997 Down).



View larger version (27K):
In this window
In a new window
Download PPT slide
 
Figure 4. Neighbor-joining tree of 38 ecotypes of A. thaliana based on AFLP variation.


*  DISCUSSION
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

AFLP variation in A. thaliana:
As in other plant species, AFLP analysis in A. thaliana detects many polymorphic bands and is an efficient method for ecotype identification. With a single combination of selective primers, the average number of bands detected was >20 per ecotype, of which ~80% were polymorphic. Considering the technical ease and sensitivity to DNA polymorphism of this technique, it is advantageous for carrying out population studies of various organisms and perhaps more suitable than other similar techniques, i.e., RFLP, RAPD, and SSR.

Every ecotype in this study was distinguished by a moderate amount of AFLP variation; however, the ecotype Fl-3 was unusual in having a large number of unique bands. Because AFLP can detect variation throughout the entire genome, Fl-3 could have increased variation on a genome-wide basis. One explanation for its higher-than-average number of unique bands is that Fl-3 has a higher mutation rate than the other ecotypes. However, no morphological abnormality is detected in this ecotype (GOTO 1996 Down), as might be expected if Fl-3 had a high mutation rate. Another possibility is that this ecotype has additional chromosomes (part or all of another genome). This would be consistent with the fact that many plant species can tolerate additional foreign chromosomes (RICHARDS 1986 Down). In other words, this ecotype could be an interspecific hybrid or the aneuploid descendant of such a hybrid. Because the ecotype has extra bands and is not missing bands, it is certain that something must have been added. A naturally occurring interspecific hybrid of A. thaliana has been identified: A. suecica is thought to be an amphiploid between A. thaliana and Cardaminopsis arenosa (HYLANDER 1957 Down; REDEI 1972 Down). Furthermore, divergent sequences exist in a long stretch in the Adh and ChiA regions of particular ecotypes in addition to DNA dimorphism (INNAN et al. 1996 Down; KAWABE et al. 1997 Down). Although the origin of these divergent sequences is unclear, one explanation is interspecific introgression as suggested here. A cytological examination of the Fl-3 ecotype is under way.

Level of DNA variation in the entire genome of A. thaliana:
This study demonstrates that the level of nucleotide variation in A. thaliana is not low despite the selfing nature of this plant species. On the basis of genome-wide AFLP variation, the estimate of nucleotide diversity (0.0106) was comparable to the values reported for specific nuclear genes (INNAN et al. 1996 Down; KAWABE et al. 1997 Down). This estimate of nucleotide diversity should be understood as representing variation at the species level. It has been shown that the level of genetic variation between populations is much larger than within populations (ABBOT and GOMES 1989 Down; TODOKORO et al. 1996 Down; BERGELSON et al. 1998 Down). A high level of genetic variation between populations is expected under conditions of limited migration between populations and selfing within the population. Each ecotype has a long branch in the neighbor-joining tree determined in this study, suggesting that each ecotype had isolated history. This result supports the idea of low migration between populations. The topology of the tree suggests that this plant species spread its habitat rapidly over the world, and that since then only limited migration has occurred between habitats. This population structure may explain the higher frequency of singleton bands than that of the simulation, because unique bands could be fixed in isolated finite populations independently.

The average proportion of shared bands found in this study was 83.1% in comparison to the 66% shared bands found by ALONSO-BLANCO et al. 1998 Down. This difference may be due to the use of different selective primers in the two studies. As noted, the number of total bands and polymorphic bands varies for different primer pairs in this study. To evaluate the overall level of nucleotide variation more accurately by the AFLP technique in the future, it may be necessary to use more selective primer pairs and to use different restriction enzymes in the initial digestion of genomic DNA. In addition, the 38 ecotypes selected for this study may not represent the whole range of actual variation of this species. However, this group of ecotypes is likely to be representative of the species-wide variation in A. thaliana, because the estimate of nucleotide diversity was similar in this and previous studies. It is possible that a study encompassing even more ecotypes would find an even larger nucleotide variation in A. thaliana than the estimate obtained in this study.

Because AFLP analysis detects polymorphisms regardless of their location in the genome, it was expected that the nucleotide diversity estimated by AFLP might be larger than the value determined in specific genic regions. Also, this technique cannot discriminate indel polymorphisms, which might cause the estimate by AFLP to be larger than the true nucleotide diversity. Actually, INNAN et al. 1999 Down showed that estimated nucleotide diversity based on AFLP of Dioscorea tokoto was larger than for specific nuclear gene regions. In this study, the estimated nucleotide diversity of A. thaliana was about the same for AFLP as for nuclear genes (INNAN et al. 1996 Down; KAWABE et al. 1997 Down). An explanation for this result is the low GC content and the compactness of the A. thaliana genome. BARAKAT et al. 1998 Down showed that the A. thaliana nuclear genome is AT-rich and that gene-coding regions are found at ~5-kb intervals; long noncoding spacer sequences were not observed. Because the restriction enzymes used in this study, EcoRI and MseI, have AT-rich recognition sequences, it is possible that polymorphism was detected evenly throughout coding and noncoding regions of the A. thaliana genome. In this case, estimated nucleotide diversity is expected to be similar for genome-wide AFLP and studies of specific nuclear genes. Unlike A. thaliana, D. tokoro may have long noncoding spacer sequences, which could have accumulated greater nucleotide variation due to fewer functional constraints. If so, estimated nucleotide diversity based on AFLP could be larger than for specific nuclear genes in D. tokoro. It would be interesting to conduct AFLP analysis on organisms with high GC content, e.g., Drosophila and Gramineae, to study the effect of gene organization and higher GC content on the level of nucleotide variation.

Linkage disequilibrium between AFLP variation:
The proportion of significant linkage disequilibria between AFLP variations was not particularly high and is close to the level expected due to random chance if the Fl-3 ecotype is excluded. This result could be due to the long distance between any two AFLP loci. In this study, we detected 472 bands, and the sum of all the band lengths was only ~106 kb, which is ~0.1% of the total genome size (~105 kb) of A. thaliana. This suggests that AFLPs were found at an average interval of ~200 kb. If AFLPs overlap, this distance could be even longer. Four recombination events were detected in the 2.4-kb region of the Adh gene and one in the 1.7-kb region of ChiA, giving about one recombination per 1 kb in the evolutionary history of this plant. Because the expected number of recombinations would be larger than this estimate (HUDSON 1987 Down), there would be a large number of recombination events between each pair of AFLPs identified here. Even if dimorphism exists in a specific region, as in Adh and ChiA, linkage disequilibrium would not be detected by AFLP analysis. It is not surprising that the proportion of significant linkage disequilibria is at about the level expected due to random occurrence. Although selfing means a small effective size and low recombination, the number of recombination events in the evolutionary history of A. thaliana cannot be ignored. Studies of nuclear genes and AFLP indicate that outcrossing does occur in this species and that subsequent recombination events clearly influence the pattern of DNA polymorphism in this selfing plant.


*  ACKNOWLEDGMENTS

We express our thanks to N. Goto for A. thaliana seeds and to T. Sasanuma, R. Terauchi and an anonymous reviewer for comments and suggestions. We are also grateful to S. Nasuda for technical advice. This study is contribution number 557 from the Laboratory of Plant Genetics, Graduate School of Agriculture, Kyoto University.

Manuscript received February 1, 1999; Accepted for publication April 19, 1999.


*  LITERATURE CITED
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

ABBOT, R. J. and M. F. GOMES, 1989  Population genetic structure and outcrossing rate of Arabidopsis thaliana.. Heredity 62:411-418.

ALONSO-BLANCO, C., A. J. PEETERS, M. KOORNNEEF, C. LISTER, and C. DEAN et al., 1998  Development of an AFLP based linkage map of Ler, Col and Cvi Arabidopsis thaliana ecotypes and construction of a Ler/Cvi recombinant inbred line population. Plant J. 14:259-271[Medline].

BARAKAT, A., G. MATASSI, and G. BERNARDI, 1998  Distribution of genes in the genome of Arabidopsis thaliana and its implications for the genome organization of plants. Proc. Natl. Acad. Sci. USA 95:10044-10049[Abstract/Free Full Text].

BECKER, J., P. VOS, M. KUIPER, F. SALAMINI, and M. HEUN, 1995  Combined mapping of AFLP and RFLP markers in barley. Mol. Gen. Genet. 249:65-73[Medline].

BERGELSON, J., E. STAHL, S. DUDEK, and M. KREITMAN, 1998  Genetic variation within and among populations of Arabidopsis thaliana.. Genetics 148:1311-1323[Abstract/Free Full Text].

CHARLESWORTH, B., M. T. MORGAN, and D. CHARLESWORTH, 1993  The effect of deleterious mutations on neutral molecular variation. Genetics 134:1289-1303[Abstract].

DONINI, P., M. L. ELIAS, S. M. BOUGOURD, and R. M. D. KOEBNER, 1997  AFLP fingerprinting reveals pattern differences between template DNA extracted from different plants. Genome 40:521-526.

FELSENSTEIN, J., 1993 Phylogenetic Inference Package (PHYLIP). Version 3.5. University of Washington, Seattle, WA.

GAIOTTO, F. A., M. BRAMUCCI, and D. GRATTAPAGLIA, 1997  Estimation of outcrossing rate in a breeding population of Eucalyptus urophylla with dominant RAPD and AFLP markers. Theor. Appl. Genet. 95:842-849.

GOLDSTEIN, D. B., A. R. LINARES, L. L. CAVALLI-SFORZA, and M. W. FELDMAN, 1995  An evaluation of genetic distances for use with microsatellite loci. Genetics 139:473-485[Abstract].

GOTO, N., 1996 The SENDAI Arabidopsis seed stock center SEED LIST. Department of Biology, Miyagi University of Education, Sendai, Japan.

HANFSTINGL, U., A. BERRY, E. A. KELLOG, J. T. COSTA, III, and W. RUDIGER et al., 1994  Haplotype divergence coupled with lack of diversity at the Arabidopsis thaliana alcohol dehydrogenase locus: role for both balancing and directional selection? Genetics 138:811-828[Abstract].

HARTL, L. and S. SEEFELDER, 1998  Diversity of selected hop cultivars detected by fluorescent AFLPs. Theor. Appl. Genet. 96:112-116.

HYLANDER, N., 1957  Cardaminopsis arenosa (Fr.) Hiit., a northern amphidiploid species. Bull. Jard. Bot. Bruxelles 27:591-604.

HUDSON, R. R., 1987  Estimating the recombination parameter of finite population model without selection. Genet. Res. 50:245-250[Medline].

INNAN, H., F. TAJIMA, R. TERAUCHI, and N. T. MIYASHITA, 1996  Intragenic recombination in the Adh locus of the wild plant Arabidopsis thaliana.. Genetics 143:1761-1770[Abstract].

INNAN, H., R. TERAUCHI, and N. T. MIYASHITA, 1997  Microsatellite polymorphism in natural populations of the wild plant Arabidopsis thaliana.. Genetics 146:1441-1452[Abstract].

INNAN, H., R. TERAUCHI, G. KAHL, and F. TAJIMA, 1999  A method for estimating nucleotide diversity from AFLP data. Genetics 151:1157-1164[Abstract/Free Full Text].

KAWABE, A., H. INNAN, R. TERAUCHI, and N. T. MIYASHITA, 1997  Nucleotide polymorphism in the acidic chitinase locus (ChiA) region of the wild plant Arabidopsis thaliana.. Mol. Biol. Evol. 14:1303-1315[Abstract].

KIMURA, M., 1983 The Neutral Theory of Molecular Evolution. Cambridge University Press, Cambridge, United Kingdom.

KIMURA, M. and J. F. CROW, 1964  The number of alleles that can be maintained in a finite population. Genetics 49:725-738[Free Full Text].

KING, G., J. NIENHUIS, and C. HUSSEY, 1993  Genetic similarity among ecotypes of Arabidopsis thaliana estimated by analysis of restriction fragment length polymorphisms. Theor. Appl. Genet. 86:1028-1032.

MARSAN, P. A., P. CASTIGLIONI, K. FUSARI, M. KUIPER, and M. MOTTO, 1998  Genetic diversity and its relationship to hybrid performance in maize as revealed by RFLP and AFLP markers. Theor. Appl. Genet. 96:219-227.

MIYASHITA, N. T., H. INNAN, and R. TERAUCHI, 1996  Intra- and interspecific variation in the alcohol dehydrogenase locus region of wild plants Arabis gemmifera and Arabidopsis thaliana.. Mol. Biol. Evol. 13:433-436[Medline].

NEI, M. and NEI, M.W-S. LI, 1979  Mathematical model for studying genetic variation in terms of restriction endonuclease. Proc. Natl. Acad. Sci. USA 76:5269-5273[Abstract/Free Full Text].

PRICE, R. A., J. D. PALMER and I. A. AL-SHEHBAZ, 1994 Systematic relationships of Arabidopsis: a molecular and morphological perspective, pp. 7–19 in Arabidopsis, edited by E. M. MEYEROWITZ and C. R. SOMERVILLE. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY.

PURUGGANAN, M. D. and J. I. SUDDITH, 1998  Molecular population genetics of the Arabidopsis CAULIFLOWER regulatory gene: nonneutral evolution and naturally occurring variation in floral homeotic function. Proc. Natl. Acad. Sci. USA 95:8130-8134[Abstract/Free Full Text].

DEI, G. P., 1972  Compatibility of Arabidopsis thaliana with Cardaminopsis arenosa. Arabidopsis Inf. Serv. 9:42.

RICHARDS, A. J., 1986 Plant Breeding Systems. G. Allen and Unwin Ltd., London.

ROA, A. C., M. M. MAYA, M. C. DUQUE, J. TOHME, and A. C. ALLEM et al., 1997  AFLP analysis of relationships among cassava and other Manihot species. Theor. Appl. Genet. 95:741-750.

RUSSELL, J. R., J. D. FULLER, M. MACAULAY, B. G. HATZ, and A. JAHOOR et al., 1997  Direct comparison of levels of genetic variation among barley accessions detected by RFLPs, AFLPs, AARs and RAPDs. Theor. Appl. Genet. 95:714-722.

SCHUT, J. W., X. QI, and P. STAM, 1997  Association between relationship measures based on AFLP markers, pedigree data and morphological traits in barley. Theor. Appl. Genet. 95:1161-1168.

TODOKORO, S., R. TERAUCHI, and S. KAWANO, 1996  Microsatellite polymorphisms in natural populations of Arabidopsis thaliana in Japan. Jpn. J. Genet. 70:543-554.

VAN DER VOORT, J. R., P. WOLTERS, R. FOLKERTSMA, R. HUTTEN, and P. VAN ZANDVOORT et al., 1997  Mapping of the cyst nematode resistance locus Gpa2 in potato using a strategy based on comigrating AFLP markers. Theor. Appl. Genet. 95:874-880.

VOS, P., R. HOGERS, M. BLEEKER, M. REIJANS, and T. VAN DE LEE et al., 1995  AFLP: a new technique for DNA fingerprinting. Nucleic Acids Res. 23:4407-4414[Abstract/Free Full Text].

WANG, Y.-H., C. E. THOMAS, and R. A. DEAN, 1997  A genetic map of melon (Cucumis melo L.) based on amplified fragment length polymorphism (AFLP) markers. Theor. Appl. Genet. 95:791-798.

WAUGH, R., N. BONAR, E. BAIRD, B. THOMAS, and A. GRANER et al., 1997  Homology of AFLP products in three mapping populations of barley. Mol. Gen. Genet. 255:311-321[Medline].




This article has been cited by other articles:


Home page
Proc. Natl. Acad. Sci. USAHome page
N. Scarcelli, J. M. Cheverud, B. A. Schaal, and P. X. Kover
Antagonistic pleiotropic effects reduce the potential adaptive value of the FRIGIDA locus
PNAS, October 23, 2007; 104(43): 16986 - 16991.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
J. O. Borevitz, S. P. Hazen, T. P. Michael, G. P. Morris, I. R. Baxter, T. T. Hu, H. Chen, J. D. Werner, M. Nordborg, D. E. Salt, et al.
Genome-wide patterns of single-feature polymorphism in Arabidopsis thaliana
PNAS, July 17, 2007; 104(29): 12057 - 12062.
[Abstract] [Full Text] [PDF]


Home page
ANN BOT (LOND)Home page
C. Shindo, G. Bernasconi, and C. S. Hardtke
Natural Genetic Variation in Arabidopsis: Tools, Traits and Prospects for Evolutionary Ecology
Ann. Bot., June 1, 2007; 99(6): 1043 - 1054.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
E. M. Goss, M. Kreitman, and J. Bergelson
Genetic Diversity, Recombination and Cryptic Clades in Pseudomonas viridiflava Infecting Natural Populations of Arabidopsis thaliana
Genetics, January 1, 2005; 169(1): 21 - 35.
[Abstract] [Full Text] [PDF]


Home page
Crop Sci.Home page
A. Basu, M. Ghosh, R. Meyer, W. Powell, S. L. Basak, and S. K. Sen
Analysis of Genetic Diversity in Cultivated Jute Determined by Means of SSR Markers and AFLP Profiling
Crop Sci., March 1, 2004; 44(2): 678 - 685.
[Abstract] [Full Text] [PDF]


Home page
ScienceHome page
T. P. Michael, P. A. Salome, H. J. Yu, T. R. Spencer, E. L. Sharp, M. A. McPeek, J. M. Alonso, J. R. Ecker, and C. R. McClung
Enhanced Fitness Conferred by Naturally Occurring Variation in the Circadian Clock
Science, November 7, 2003; 302(5647): 1049 - 1053.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
O. Pontes, R. J. Lawrence, N. Neves, M. Silva, J.-H. Lee, Z. J. Chen, W. Viegas, and C. S. Pikaard
Natural variation in nucleolar dominance reveals the relationship between nucleolus organizer chromatin topology and rRNA gene transcription in Arabidopsis
PNAS, September 30, 2003; 100(20): 11418 - 11423.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
T. Kado, H. Yoshimaru, Y. Tsumura, and H. Tachida
DNA Variation in a Conifer, Cryptomeria japonica (Cupressaceae sensu lato)
Genetics, August 1, 2003; 164(4): 1547 - 1559.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
K. J. Schmid, T. R. Sorensen, R. Stracke, O. Torjek, T. Altmann, T. Mitchell-Olds, and B. Weisshaar
Large-Scale Identification and Analysis of Genome-Wide Single-Nucleotide Polymorphisms for Mapping in Arabidopsis thaliana
Genome Res., June 1, 2003; 13(6): 1250 - 1257.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
J. O. Borevitz and M. Nordborg
The Impact of Genomics on the Study of Natural Variation in Arabidopsis
Plant Physiology, June 1, 2003; 132(2): 718 - 725.
[Full Text] [PDF]


Home page
Mol Biol EvolHome page
H. Kuittinen, D. Salguero, and M. Aguade
Parallel Patterns of Sequence Variation Within and Between Populations at Three Loci of Arabidopsis thaliana
Mol. Biol. Evol., November 1, 2002; 19(11): 2030 - 2034.
[Full Text] [PDF]


Home page
Am. J. Bot.Home page
H. K. Stenoien, C. B. Fenster, H. Kuittinen, and O. Savolainen
Quantifying latitudinal clines to light responses in natural populations of Arabidopsis thaliana (Brassicaceae)
Am. J. Botany, October 1, 2002; 89(10): 1604 - 1608.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
D. Tian, H. Araki, E. Stahl, J. Bergelson, and M. Kreitman
Signature of balancing selection in Arabidopsis
PNAS, August 20, 2002; 99(17): 11525 - 11530.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
V. Le Corre, F. Roux, and X. Reboud
DNA Polymorphism at the FRIGIDA Gene in Arabidopsis thaliana: Extensive Nonsynonymous Variation Is Consistent with Local Selection for Flowering Time
Mol. Biol. Evol., August 1, 2002; 19(8): 1261 - 1271.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
B. Haubold, J. Kroymann, A. Ratzka, T. Mitchell-Olds, and T. Wiehe
Recombination and Gene Conversion in a 170-kb Genomic Region of Arabidopsis thaliana
Genetics, July 1, 2002; 161(3): 1269 - 1278.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
M. Iizuka, H. Tachida, and H. Matsuda
A Neutral Model With Fluctuating Population Size and Its Effective Size
Genetics, May 1, 2002; 161(1): 381 - 388.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
A. Kawabe, K. Yamane, and N. T. Miyashita
DNA Polymorphism at the Cytosolic Phosphoglucose Isomerase (PgiC) Locus of the Wild Plant Arabidopsis thaliana
Genetics, November 1, 2000; 156(3): 1339 - 1347.
[Abstract] [Full Text]


Home page
GeneticsHome page
H. Kuittinen and M. Aguadé
Nucleotide Variation at the CHALCONE ISOMERASE Locus in Arabidopsis thaliana
Genetics, June 1, 2000; 155(2): 863 - 872.
[Abstract] [Full Text]