Genetics, Vol. 165, 759-769, October 2003, Copyright © 2003

Population Structure and Its Effect on Haplotype Diversity and Linkage Disequilibrium Surrounding the xa5 Locus of Rice (Oryza sativa L.)

Amanda J. Garrisa, Susan R. McCoucha, and Stephen Kresovicha
a Institute for Genomic Diversity and Department of Plant Breeding, Cornell University, Ithaca, New York 14853

Corresponding author: Stephen Kresovich, 158 Biotechnology Bldg., Cornell University, Ithaca, NY 14853., sk20{at}cornell.edu (E-mail)

Communicating editor: H. OCHMAN


*  ABSTRACT
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

To assess the usefulness of linkage disequilibrium mapping in an autogamous, domesticated species, we have characterized linkage disequilibrium in the candidate region for xa5, a recessive gene conferring race-specific resistance to bacterial blight in rice. This trait and locus have good mapping information, a tractable phenotype, and available sequence data, but no cloned gene. We sampled 13 short segments from the 70-kb candidate region in 114 accessions of Oryza sativa. Five additional segments were sequenced from the adjacent 45-kb region in resistant accessions to estimate the distance at which linkage disequilibrium decays. The data show significant linkage disequilibrium between sites 100 kb apart. The presence of the xa5 resistant reaction in two ecotypes and in accessions with different haplotypes in the candidate region may indicate multiple origins or genetic heterogeneity for resistance. In addition, genetic differentiation between ecotypes emphasizes the need for controlling for population structure in the design of linkage disequilibrium studies in rice.


THE ability to interpret patterns of molecular genetic diversity and to relate them to phenotypic variation will enhance the use of diverse genetic resource collections in crop improvement. Recently, a primary goal in genetic resource management has been to characterize the structure of diversity within a crop species (BROWN 1989 Down). Increasingly, the focus is shifting to dissecting and understanding diversity in relation to genes underlying agronomic traits (BUCKLER and THORNSBERRY 2002 Down; RAFALSKI 2002 Down).

One possible approach to building the connection from genetic diversity to phenotype is linkage disequilibrium (LD) mapping, recently proposed as an alternative to traditional methods for mapping traits in plants (BUCKLER and THORNSBERRY 2002 Down; NORDBORG and TAVARE 2002 Down). Linkage disequilibrium is defined as the nonrandom association of alleles, and it can result from population structure, selection, drift, or physical linkage. The physical extent of linkage disequilibrium around a gene determines the effectiveness of this approach, and it is the result of many factors, including the rate of outcrossing, the degree of artificial or natural selection on the gene or region of the genome, the recombination rate, chromosomal location, population size and structure, and the age of the allele under study. In cultivated species, the extent of linkage disequilibrium will also be shaped by human selection and the bottlenecks associated with crop dispersal beyond the center of origin.

Recent studies in maize and Arabidopsis have provided contrasting results for the utility of linkage disequilibrium for fine mapping genes in plants on the basis of divergent estimates for the extent of linkage disequilibrium in these two plant genomes. In maize, an outcrossing species, significant linkage disequilibrium was detected only within a range from 100 bp to 7 kb on the basis of analysis of several genic regions (REMINGTON et al. 2001 Down; TENAILLION et al. 2001; THORNSBERRY et al. 2001 Down); in the autogamous species Arabidopsis thaliana, significant linkage disequilibrium persisted for 250 kb in a single region (HAGENBLAD and NORDBORG 2002 Down). Unlike maize, rice is predominantly autogamous, which is predicted to result in more extensive linkage disequilibrium, perhaps even genome-wide linkage disequilibrium. But in contrast to Arabidopsis, the domestication history of rice has presumably introduced numerous bottlenecks as well as diverse hybridization events followed by generations of selection for performance in diverse agricultural environments. Because many of the world's major crop species are autogamous, including many cereals, legumes, and Solanaceous species, the understanding of linkage disequilibrium in rice may assist in evaluating the utility of linkage disequilibrium mapping in other autogamous species.

In this article we provide an analysis of linkage disequilibrium in the genomic region containing xa5, a bacterial blight resistance allele whose identity is still unknown. The gene was first reported by PETPISIT et al. 1977 Down and is a recessive gene conferring race-specific resistance to Xanthomonas oryzae pv. oryzae. After its identification, subsequent screening of the genetic resources collection by the International Rice Research Institute (IRRI) resulted in a group of accessions with the xa5 reaction profile to a panel of isolates, which was designated the "DZ192 group," named for the original donor of xa5 (DZ192) to breeding lines such as IR1545 and isoline IRBB5 developed at IRRI. The gene has been mapped to the short arm of chromosome 5 (YOSHIMURA et al. 1984 Down) and was subsequently localized to a bacterial artificial chromosome of ~136 kb (YANG et al. 1998 Down) and to a region of ~70 kb (BLAIR et al. 2003 Down).

The xa5 resistance allele may be associated with only certain ecotypes of rice. Rice ecotypes are the result of intraspecific differentiation of Orzya sativa L. for diverse environmental conditions during the past 10,000 years since domestication (KHUSH 1997 Down). Broad classification of rice into the subspecies indica and japonica fails to capture these evolutionarily distinct subgroups. For example, indica rices have traditionally included the aus, aman, and boro rices of Bangladesh as well as the tjereh rices of Indonesia. Within the japonica subspecies are the Japanese ecotype nuda and the Indonesian ecotype bulu (TAKAHASHI 1997 Down). Allelism tests that showed xa5 to be in higher frequency in rice accessions from Bangladesh and Nepal than from other Asian countries also suggested that xa5 might be associated specifically with the aus and boro ecotypes (BUSTO et al. 1990 Down). The divergence among the aus, boro, and aman ecotypes of Bangladesh has been shaped by the wet and dry cycles of the growing season (KHUSH 1997 Down). Although the xa5 resistance allele was found primarily in aus and boro ecotypes, the presence of the resistance allele was not assessed in light of a molecular genetic definition of ecotype that would provide the more precise evolutionary characterization required for statistical analysis. As noted previously, population structure resulting from ecotypic differentiation is critical because it could result in spurious associations in linkage disequilibrium analysis.

The objective of this research is to describe the diversity and the decay of linkage disequilibrium in one region of the rice genome. This region consists of a small telomeric area on the short arm of chromosome 5 that harbors xa5. Our goals were (1) to characterize the extent of linkage disequilibrium in the region containing xa5 in resistant accessions and to determine if it is possible to reduce the number of candidate genes, (2) to analyze haplotype diversity in the context of population structure to determine the distribution of the resistance allele among ecotypes, and (3) to make predictions about the allelic diversity underlying the xa5 phenotype.


*  MATERIALS AND METHODS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Plant material:
The 114 rice accessions used in this study (listed in Table 1) were obtained from the IRRI in the Philippines and from the National Plant Germplasm System Small Grains Collection in Aberdeen, Idaho. A subset of the sample had previously been allele tested for xa5 (OLUFOWOTE et al. 1977 Down; SIDHU et al. 1978 Down; SINGH et al. 1983 Down); see Table 1. Plants were grown in a greenhouse and genomic DNA was extracted using a cetyltrimethylammonium bromide protocol (COLOSI and SCHAAL 1993 Down).


 
View this table:
In this window
In a new window

 
Table 1. Rice accessions studied

Bacterial blight inoculation and evaluation:
Accessions were evaluated for their disease response at 60 days after sowing following inoculation with X. oryzae pv. oryzae (Xoo) isolates representing Philippine races one (PXO61), two (PXO86), and four (PXO71; MEW 1987 Down). The inoculum was prepared as in BLAIR et al. 2003 Down, except that the bacteria were cultured for 4 days on modified Wakimoto's medium. Inoculation was performed using the leaf-clipping method (KAUFFMANN et al. 1973 Down). Three leaves were inoculated for each race, and different bacterial isolates were inoculated on separate tillers. IR24 and IRBB5 were included as susceptible and resistant controls, respectively. Plants were maintained in growth chambers with 11 hr of daylight, night temperatures of 28°, and day temperatures of 32°. Lesions were measured 14 days after inoculation. Lesion lengths generally showed a bimodal distribution, and this distribution was used as the basis for defining resistant and susceptible reactions. For typing of disease response, each plant was classified as resistant if the mean lesion length was between 0 and 3 cm. Plants with mean lesion lengths >6 cm were classified qualitatively as susceptible. Plants with lesion lengths intermediate to these two classes were classified as moderately resistant.

Markers:
Twenty-one simple sequence repeats (SSRs) distributed on the 12 chromosomes of rice were employed to analyze population structure (RM11, RM14, RM105, RM109, RM152, RM174, RM202, RM206, RM215, RM225, RM228, RM230, RM232, RM235, RM259, RM317, RM335, RM400, RM481, RM467, and RM415; as summarized in CHEN et al. 1997 Down; TEMNYKH et al. 2000 Down, TEMNYKH et al. 2001 Down; http://www.gramene.org). This number of markers seemed reasonable because it has been shown that 15–20 unlinked SSRs are sufficient to detect population stratification in humans (PRITCHARD and ROSENBERG 1999 Down). In addition, 13 amplicons were analyzed for single nucleotide polymorphisms (SNPs) and used for haplotype and linkage disequilibrium analysis using primer pairs described by BLAIR et al. 2003 Down and summarized in Table 2.


 
View this table:
In this window
In a new window

 
Table 2. Amplicon names, lengths, and matches to genes in the TIGR gene index

PCR amplification:
The SSRs and SNP amplicons were generated using the following PCR conditions: 95° for 4 min; 30 cycles of 94° for 1 min, 55° for 2 min, 72° for 2 min; and a 1-hr extension at 72° to promote nontemplate addition of adenine by Taq.

Genotyping:
PCR products were size separated on 4% polyacrylamide gels using an ABI Prism 377 DNA analyzer (Applied Biosystems, Foster City, CA). SSRs were analyzed with GenScan 3.1.2 software (Applied Biosystems) and scored with Genotyper 2.5 software (Applied Biosystems).

DNA sequencing:
A total of 10 µl of quantified PCR product was treated with 10 units exonuclease I and 2 units shrimp alkaline phosphatase and incubated at 37° for 15 min followed by 80° for 15 min. Single-pass sequencing was performed by automated sequencing using an ABI Prism 3700 DNA analyzer (Applied Biosystems) at the Cornell BioResource Center (Ithaca, NY). Because rice is a diploid, predominantly selfing species and therefore predominantly homozygous, direct sequencing of PCR products resulted in a monomorphic sequence. Sequences were aligned using Sequencher 4.0.5 (Gene Codes, Ann Arbor, MI) for base calling and CLUSTAL W (THOMPSON et al. 1994 Down) with manual quality control for insertion/deletions. The ends of fragments were trimmed to remove low-quality sequence. The resulting sequences are listed in Table 2, along with the putative gene content of each fragment on the basis of TIGR annotation (http://www.tigr.org). Singletons, SSRs, and polymorphisms resulting from the expansion/contraction of polyA and polyTs were eliminated from linkage disequilibrium and diversity analyses to exclude variation potentially introduced by sequencing error.

Analysis:
Population structure was evaluated on the basis of three different analyses of genotypic data from the 21 SSRs: genetic distance, the model-based program "Structure" (http://pritch.bsd.uchicago.edu/), and FST (WRIGHT 1969 Down) implemented in Genepop software (http://wbiomed.curtin.edu.au/genepop/). Genetic distance was calculated using DC (CAVALLI-SFORZA and EDWARDS 1967 Down). Phylogenetic reconstruction was based on the neighbor-joining method (SAITOU and NEI 1987 Down) implemented in PowerMarker, a free genetic analysis software package distributed by Kejun Liu (kliu2{at}unity.ncsu.edu). Linkage disequilibrium, diversity, and recombination analyses of sequence data were performed using SITES (http://lifesci.rutgers.edu/heylab/ProgramsandData/Programs/SITES/SITES_Documentation.htm) and dipdat software (http://home.uchicago.edu/rhudson1/source/misc/dipld/). Linkage disequilibrium was plotted as the squared correlation coefficient r2. The minimum set of recombination intervals was calculated as in HUDSON and KAPLAN 1985 Down. Association tests were performed using Strat software (http://pritch.bsd.uchicago.edu/).


*  RESULTS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Significant divergence among ecotypes was found by using all measures for population structure. Distance-based analysis of 84 accessions detected two major clusters, as illustrated in Fig 1. Although ecotypic designation was not available for all accessions, enough samples of aus, boro, and aman ecotypes were available to anchor the ecotype identities of the clusters (Table 1). The first group consisted of the Bangladeshi indica rice ecotype called aman, breeding lines and landraces from Nepal of unknown ecotype, and a single accession from Malaysia, hereafter referred to as indica. The second group was populated by aus and boro ecotypes, mainly from Bangladesh and Nepal, as well as accessions of unknown ecotype.



View larger version (36K):
In this window
In a new window
Download PPT slide
 
Figure 1. Neighbor-joining tree of 84 rice accessions based on DC (CAVALLI-SFORZA and EDWARDS 1967 Down) using 21 unlinked SSR markers. Resistant accessions are indicated in red.

Results from model-based analysis generally concur with the relationships determined by genetic distance analysis. In this study, the model-based analysis gave high significance levels for several theoretical population sizes, but the highest posterior probability was for a model with three populations. These results provided evidence for substructure within the indica group, formalizing the subclusters into two populations: one consisting mainly of aman ecotypes and another consisting of Nepali breeding lines. The third model-derived population corresponded to the aus-boro group of the distance-based analysis. Only four accessions differed in their population assignment by the two approaches; these were individuals that clustered with the indica's in the genetic distance analysis but were assigned to the aus-boro group in the model-based analysis.

When FST values were computed using the model-based population subdivisions for two and three populations (Table 3), they showed a high degree of population structure (overall FST for two populations = 0.89; overall FST for three populations = 0.85). There was a higher FST for the pairwise comparison of the aus-boro group with the indica group than between the two indica groups, indicating that the aus-boro subgroup was more differentiated from the indica groups than either was from the other. Because the sample size was small for the indica group and because several samples were breeding lines that were closely related, these two groups were treated as one for the remaining analyses. The population structure data support a hypothesis of hierarchical levels of divergence within rice, with greater divergence between the indica and aus-boro groups and no detectable divergence between the aus and boro ecotypes at this level of genomic resolution. This suggests that the divergence between the indica and aus-boro ecotypes is more ancient than that between the aus and boro ecotypes.


 
View this table:
In this window
In a new window

 
Table 3. Overall and pairwise estimates of FST for 21 SSR loci, using model-based population subdivisions

Analysis of the xa5 phenotype in relation to population structure analysis of the accessions confirmed the presence of xa5 in the Bangladeshi aus and boro ecotypes. Of the 45 resistant rice accessions for which genotypic information was available, all were assigned to the aus-boro subgroup except three accessions originating from Malaysia, Bangladesh, and Nepal (accessions Tolil 14, Lal Chamara, and Jumula 2). The presence of the xa5 phenotype outside of the aus-boro group could indicate gene flow or multiple origins.

Linkage disequilibrium in the 70-kb xa5 region was extensive but potentially informative in reducing the candidate region for xa5 described in BLAIR et al. 2003 Down. Because the accessions showed significant population structure, indica's and aus-boro's were analyzed separately. Linkage disequilibrium, measured as r2, showed significant linkage disequilibrium for the distal 45 kb of the candidate region for resistant accessions from both indica and aus-boro accessions (Fig 2), a pattern that was not observed in the susceptible groups. Association tests showed all sites to be equally significant due to the low frequency of recombination events in the haplotypes (data not shown).



View larger version (70K):
In this window
In a new window
Download PPT slide
 
Figure 2. Pairwise value for r2 between all pairs of SNP loci, analyzed by population and phenotype: (A) aus-boro resistant. (B) aus-boro susceptible. (C) Non-aus-boro resistant. (D) Non-aus-boro susceptible. The shade of blue indicates the value for r2. The position of each site in the candidate region is indicated by lines that connect the loci with a chromosomal segment, labeled with physical distance measured in kilobase pairs. Candidate genes, represented by numbered boxes below the chromosome segment, are 1, putative ABC transporter; 2, putative TFIIa small subunit; 3, putative 23.6-kD protein; 4, putative tRNA synthase; 5, putative 46.2-kD protein; 6, putative 61.5-kD kinase; 7, hypothetical 33.3-kD protein; 8, putative cysteine protease. Arrows indicate the direction of transcription.

A putative recombination event was detected on the distal side of the candidate region only, raising the question of how far linkage disequilibrium extended on the proximal side of the candidate region. To observe a decay of linkage disequilibrium, five additional amplicons spanning an additional 45 kb were analyzed in resistant accessions and added to the previous data set. Results confirmed that extensive linkage disequilibrium was present; r2 approaches 0.1 only after 100 kb (Fig 3).



View larger version (34K):
In this window
In a new window
Download PPT slide
 
Figure 3. The decay of linkage disequilibrium between all pairs of SNP loci in the region, shown as a function of the distance between the loci. Linkage disequilibrium was measured as r2.

Analysis of haplotype diversity for xa5 indicates that the xa5 resistance phenotype either derives from multiple origins or is genetically heterogeneous. Sequence diversity and haplotype structure were assessed in a larger sample of 114 accessions at 13 amplicons in the xa5 candidate region. Additional accessions not analyzed previously were included for two purposes: to serve as indica outgroups and to allow examination of possible additional sources of xa5. To this end, 12 accessions from Southeast Asia (Cambodia, Vietnam, Indonesia, and Myanmar) were included; all exhibited the xa5 phenotype reaction profile and had previously been identified as members of the DZ192 varietal group by IRRI. However, these accessions had not been allele tested for xa5.

Sixty-six variable sites (with insertion-deletions counted as a single site) in the 4725 bp of sequence from 13 amplicons in the xa5 region were observed, resulting in a frequency of one SNP per 100 bp. The 66 variable sites were organized into 26 distinct haplotypes in the 70-kb candidate region (Fig 4). Because of the great divergence between haplotypes and the absence of an outgroup to determine ancestral polymorphisms, it was not possible to build a single haplotype network to include them all (data not shown). Fig 4 displays the haplotypes in the order in which they appear in a neighbor-joining diagram. At total of 10 different haplotypes were present in resistant accessions. Haplotypes associated with resistance in the aus-boro ecotypes were very different from haplotypes associated with resistance in the indica ecotypes (Fig 4). A set of 4 highly similar haplotypes predominated in the aus-boro accessions that had been allele tested for xa5. This cluster of highly similar haplotypes (numbers 23, 24, 25, and 26) formed the bulk of xa5-containing accessions in the sample and includes DZ192, the original donor of xa5. A putative recombination event in haplotype 23, with a haplotype in higher frequency in susceptible accessions, suggests that the distal side of the candidate region is not involved in resistance. However, the possibility of recombination with the resistant (but not allele tested) haplotype 11 or of double recombination cannot be excluded. Furthermore, there were distinctive haplotypes in two allele-tested accessions, one from Malaysia and the other from Bangladesh. In addition to the major cluster of resistant haplotypes in the aus-boro group, allele-tested accession Aus 449 had a haplotype that was distinct from the others, and it was very similar (1/66 sites differed) to a haplotype found in accessions showing complete susceptibility or moderate resistance to Xoo race 1. Within phenotypically resistant non-aus-boro accessions that had not been allele tested for xa5, there were 4 additional related haplotypes, but there they are not supported by accessions that were allele tested for xa5, so this may indicate genetic heterogeneity for resistance.



View larger version (36K):
In this window
In a new window
Download PPT slide
 
Figure 4. Illustration of the 26 SNP haplotypes in the 70-kb candidate region for xa5. For each site, a dash represents the more common nucleotide, and a circle represents the more rare nucleotide. The vertical lines below indicate the predicted recombination intervals (HUDSON and KAPLAN 1985 Down). The haplotypes are arranged in order of the tips of a neighbor-joining tree, so that more similar haplotypes are clustered. Numbers in parentheses indicate the number of accessions containing that haplotype. The phenotypes (R, all resistant; S, all susceptible; R/S, both resistant and susceptible accessions; RR2, resistant to Xoo race 2 only; RR3 resistant to Xoo race 3 only; MRR1, moderately resistant to Xoo race 1 only), the populations, and the countries of origin are listed to the right of the haplotypes. An asterisk indicates a haplotype present in accessions that had been allele tested.

In general, each haplotype was found in a single subpopulation, and frequently several closely related haplotypes were found in the same subpopulation. Haplotypes 1–9, 11, 15, 16, and 20–22 were found in indica ecotypes. Aus-boro accessions contained haplotypes 10, 12, 13, 14, and 23–26. The apparent restriction of a haplotype to a specific, genetically defined subpopulation did not preclude a wide geographical distribution. The detection of the global distribution of haplotypes was hindered because of the strategy to sample accessions from Bangladesh and Nepal, where the resistance allele was expected to be in highest frequency. However, the example of haplotype 3, which was present in rice collected from Vietnam, Cambodia, Indonesia, Myanmar, and Bangladesh, attests to the global spread of certain haplotypes and suggests that geographic origin may be a poor indicator of genetic distance.

The data showed a high-enough level of diversity both within and between populations for effective mapping and indicated a higher role for mutation than for recombination in generating the observed haplotype diversity. The sequenced amplicons containing the 66 variable sites were predominantly noncoding, although parts of five amplicons had significant matches to genes in the TIGR gene index (Table 2). Of the variable sites, 4 were insertion-deletions ranging in length from 1 to 33 bp and 62 were SNPs.

Variation in the xa5 region was similarly distributed between the indica and aus-boro ecotypes, despite the much larger sample size for the aus-boro's. Specifically, 50 sites varied within the aus-boro subpopulation and 53 sites within the indica's. The 13 additional variable sites were found in outgroups and nonallele-tested accessions of xa5. Many haplotypes (n = 11) differ from their most similar haplotype by a single site, indicating an important role for mutation in generating haplotype diversity. In contrast, the minimum set of recombination intervals is four, indicated in Fig 4. There is evidence for recombination only in haplotypes 7, 17, and 23.


*  DISCUSSION
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

The analysis of population structure underscores the need for genetic analysis of ecotypic differentiation if linkage disequilibrium and association mapping approaches are to be of value in rice improvement. The divergence between indica and aus-boro ecotypes detected by SSRs in the present study had been observed in previous studies employing isozymes (15 loci), amplified fragment length polymorphisms (AFLPs; 179 bands), and randomly amplified polymorphic DNAs (RAPDs; 35 bands; GLASZMANN 1987 Down; ZHU et al. 1998 Down; PARSONS et al. 1999 Down). The mixing of aus and boro genotypes was noted in the isozyme and RAPD studies, but was not addressed in the AFLP analysis. The fact that 21 SSR markers give similar results to the other marker systems attests to the utility of multi-allelic, highly variable SSR markers for detecting population differentiation as well the depth of population structure in rice. It contrasts with an outcrossing species such as maize, with a different evolutionary history, where a greater number of markers may be required to detect population structure. It is interesting to note that the differentiation between aus and boro rices, which is meaningful in the farming system, is invisible with this low genomic resolution. This could indicate a recent divergence between aus and boro rices or continuing gene flow between them.

The frequency of nucleotide polymorphisms in this sample was 1 SNP per 100 bp. This is lower than that of maize, where the frequency of SNP polymorphism in US elite inbred germplasm was 1 SNP per 48 bp in noncoding regions and 1 SNP per 131 bp in coding regions (BHATTRAMAKKI et al. 2002 Down). The estimate for SNP frequency in rice in this study is more similar to preliminary data from sorghum, which, like rice, has low outcrossing rates (PEDERSEN et al. 1998 Down), where the estimated frequency is 1 SNP per 102 bp (average n = 25.45; M. HAMBLIN, personal communication). However, because this study included only a few indica's from outside of Bangladesh, and therefore represents a small sampling of their wide geographic range and no tropical or temperate japonicas or aromatic rices, the accessions included in this study do not represent the full diversity of rice. It is possible that different genomic regions and sampling will offer different views of both the frequency and the distribution of polymorphism and recombination.

Estimates of linkage disequilibrium are important as an indicator of how useful linkage-disequilibrium-based trait mapping approaches may be compared to other available methods on the basis of the tradeoff between population size and informativeness. If linkage disequilibrium declines rapidly, genome scans will require an excessive marker density, but the testing of candidate genes is feasible. If linkage disequilibrium is too large, resolution may be low, but genome scans are viable. The linkage disequilibrium decay at 100 kb observed in this study would require an average of one marker per centimorgan (1 cM = 200–300 kb; FENG et al. 2002 Down; SASAKI et al. 2002 Down), and these results suggest that linkage disequilibrium mapping strategies could provide greater resolution (because of the higher recombination rate) than primary quantitative trait locus (QTL) mapping, where populations with 200–300 individuals are typically surveyed with 150–200 markers and result in QTL typically 10–20 cM in size. However, to generate a whole-genome scan that captures the resolution offered by LD mapping, such a study would require the use of ~1500 well-distributed markers. Thus, most applications of LD mapping are likely to be limited to regions previously delimited by QTL analysis or by candidate gene studies. In these cases, association mapping offers the advantage of exploring the relationship between phenotype and a broad array of genotypic variants at a favorable level of resolution in a specified target region. Because linkage disequilibrium is likely to extend beyond a single gene in rice, the application differs greatly from maize where genes already known to be associated with a trait can be tested to identify the functional nucleotide polymorphisms (THORNSBERRY et al. 2001 Down). More studies will be required to determine if the extent of linkage disequilibrium reported here is typical of other subpopulations and loci in rice.

In this sample, significant linkage disequilibrium (r2 >= 0.1) persisted between sites up to 100 kb apart. This is the same order of magnitude as linkage disequilibrium observed at the FRIGIDA flowering time locus in A. thaliana, where significant linkage disequilibrium was detected between pairs of sites up to 250 kb apart (HAGENBLAD and NORDBORG 2002 Down; NORDBORG et al. 2002 Down). As expected, these estimates differ greatly from the limited linkage disequilibrium observed in outcrossing species like maize where linkage disequilibrium frequently decays at distances between 100 bp and 1.5 kb (REMINGTON et al. 2001 Down; TENAILLON et al. 2001 Down; THORNSBERRY et al. 2001 Down). In addition, it is possible that the xa5 locus is under selection and would therefore be predicted to have more extensive linkage disequilibrium than a locus evolving neutrally.

The resolution of the origin of xa5 and the allelic diversity for resistance was not possible with this data set. The xa5 phenotype was found predominantly within the genetically defined aus-boro subpopulation. However, the presence of the phenotype in a few accessions in the indica group raises the possibility of independent origins of this phenotype in different subpopulations, particularly when haplotypic data are considered. Within the aus-boro subpopulation one very common haplotype was associated with the xa5 reaction profile; however, very different haplotypes were associated with resistance in indica ecotypes.

Several lines of evidence suggest genetic heterogeneity for the resistance phenotype. For instance, some resistant, allele-tested accessions had haplotypes highly similar to susceptible accessions (compare haplotype 18 to 19 and haplotype 6 to 7). It is possible that the relevant differences lie in unsequenced regions and that recombination has not broken the linkage. These pairs of haplotypes could be useful for examining candidate genes for evidence of mutations because they would be expected to be highly similar at most positions. Another possibility is that susceptibility is being caused by another locus, because the Philippine Xoo races contain multiple avirulence (avr) proteins, which could interact with susceptibility alleles at other loci in the rice genome.

More evidence for genetic heterogeneity is that some non-allele-tested, resistant accessions from the presumed indica group originating in Southeast Asia have a haplotype that differs from the aus-boro resistant haplotype and is identical to some susceptible accessions. Because these accessions were not allele tested, it is possible that another locus confers the phenotype, a hypothesis that could be confirmed by genetic mapping. Alternately, it could be a different resistance allele at this locus; if the recessive nature of the gene is indicative of a knock-out mutation, the phenotype could be achieved by many possible nucleotide changes. Once again, it is also possible that the relevant mutation could be so recent that recombination has not occurred to sufficiently reduce linkage disequilibrium.

Genetic heterogeneity for a trait would require careful sampling if linkage disequilibrium and association mapping were to be employed. If alleles in rice have arisen after the diversification into subpopulations and their isolation has been enforced by limited gene flow, this situation would represent a violation of the common assumption for association mapping, the common disease common variant hypothesis, which proposes that common variants are responsible for the genetic risk for certain diseases (LANDER 1996 Down). This would reduce the power to detect the association between genotype and phenotype and suggests that larger sample sizes could be necessary. At this time, little information is available on the distribution of alleles in subpopulations of rice. In a study of the haplotype at the waxy locus that confers glutinous texture to rice, the glutinous haplotype was found mainly in temperate and tropical japonica's and in only a few indica accessions (OLSEN and PURUGGANAN 2002 Down). This trait would be expected to be under strong selection due to cultural preference, so one might expect limited gene flow. It is not known how this would differ for traits affecting biotic or abiotic stress resistance.

A similar example of genetic heterogeneity was found for the early flowering FRIGIDA locus in Arabidopsis. The early flowering haplotype in Central Asia differs from that found in the rest of the early-flowering accessions (HAGENBLAD and NORDBORG 2002 Down), and eight independent loss-of-function mutations at this locus conferring early flowering have been identified (LE CORRE et al. 2002 Down). Both rice and Arabidopsis are predominantly autogamous, and therefore the expectation of a single origin of a phenotype that occurs across subpopulations may be less plausible than in outcrossing species. This has implications for sampling in future linkage disequilibrium or association studies. Isolated populations, employed in the study of human diseases, may find their plant counterpart in the subpopulations of autogamous crop species, which can have the advantage of a greater likelihood of having a single origin for a phenotype (SHIFMAN and DARVASI 2001 Down). However, because linkage disequilibrium may extend beyond a single gene, studies will require large sample sizes to capture rare recombination events.


*  ACKNOWLEDGMENTS

The authors thank the International Rice Research Institute for providing rice accessions; Fumio Onishi for growth chamber assistance; two anonymous reviewers for valuable comments; Sharon Mitchell, Matthew Blair, Anjali Iyer, Alexandra Casa, Julie Ho, Martha Hamblin, Rebecca Nelson, and Ed Buckler for useful discussions; and Lois Swales for assistance with formatting the manuscript. A. Garris was supported by U.S. Department of Agriculture/Cooperative State Research Service competitive grant 97-35300-5101, representing Food and Agricultural Sciences National Needs Graduate Fellowship in Plant Biotechnology.

Manuscript received February 10, 2003; Accepted for publication May 16, 2003.


*  LITERATURE CITED
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

BHATTRAMAKKI, D., M. DOLAN, M. HANAFEY, R. WINELAND, and D. VASKE et al., 2002  Insertion-deletion polymorphisms in 3' regions of maize genes occur frequently and can be used as highly informative markers. Plant Mol. Biol. 48:539-547.[Medline]

BLAIR, M. W., A. J. GARRIS, A. S. AYER, B. CHAPMAN, and S. KRESOVICH et al., 2003  High resolution genetic mapping and candidate gene identification at the xa5 locus for bacterial blight resistance in rice (Oryza sativa L.). Theor. Appl. Genet. 107:62-73.[Medline]

BROWN, A. H. D., 1989  Core collections: a practical approach to genetic resources management. Genome 31:818-824.

BUCKLER, E. S. and J. M. THORNSBERRY, 2002  Plant molecular diversity and applications to genomics. Curr. Opin. Plant Biol. 5:107-111.[Medline]

BUSTO, G. A., T. OGAWA, N. ENDO, R. E. TABIEN, and R. IKEDA, 1990  Distribution of genes for resistance to bacterial blight of rice in Asian countries. Rice Genet. Newsl. 7:127.

CAVALLI-SFORZA, L. L. and A. W. F. EDWARDS, 1967  Phylogenetic analysis: models and estimation procedures. Am. J. Hum. Genet. 19:233-257.

CHEN, X., S. TEMNYKH, Y. XU, Y. G. CHO, and S. R. MCCOUCH, 1997  Development of a microsatellite framework map providing genome-wide coverage in rice, Oryza sativa L. Theor. Appl. Genet. 95:553-567.

COLOSI, J. C. and B. A. SCHAAL, 1993  Tissue grinding with ball-bearings and vortex mixer for DNA extraction. Nucleic Acids Res. 21:1051-1052.[Free Full Text]

FENG, Q., Y. ZHANG, P. HAO, S. WANG, and G. FU et al., 2002  Sequence and analysis of rice chromosome 4. Nature 420:316-320.[Medline]

GLASZMANN, J. C., 1987  Isozymes and classification of Asian rice varieties. Theor. Appl. Genet. 74:21-30.

HAGENBLAD, J. and M. NORDBORG, 2002  Sequence variation and haplotype structure surrounding the flowering time locus FRI in Arabidopsis thaliana. Genetics 161:289-298.[Abstract/Free Full Text]

HUDSON, R. R. and N. L. KAPLAN, 1985  Statistical properties of the number of recombination events in the history of a sample of DNA sequences. Genetics 111:147-164.[Abstract/Free Full Text]

KAUFFMANN, H., A. P. K. REDDY, S. P. Y. HSIEH, and S. D. MERCA, 1973  An improved technique for evaluating resistance of rice varieties to Xanthomonas oryzae.. Plant Disease Rep. 57:537-541.

KHUSH, G. S., 1997  Origin, dispersal, cultivation and variation of rice. Plant Mol. Biol. 35:25-34.[Medline]

LANDER, E. S., 1996  The new genomics: global views of biology. Science 274:536-539.[Free Full Text]

LE CORRE, V. L., F. ROUX, and X. REBOUD, 2002  DNA polymorphisms at the FRIGIDA gene in Arabidosis thaliana: extensive nonsynonymous variation is consistent with local selection for flowering time. Mol. Biol. Evol. 19:1261-1271.[Abstract/Free Full Text]

MEW, T. W., 1987  Current status and future prospects on bacterial blight of rice. Annu. Rev. Phytopathol. 25:359-382.

NORDBORG, M. and S. TAVARÉ, 2002  Linkage disequilibrium: what history has to tell us. Trends Genet. 18:83-90.[Medline]

NORDBORG, M., J. O. BOREVITZ, J. BERGELSON, C. C. BERRY, and J. CHORY et al., 2002  The extent of linkage disequilibrium in Arabidopsis thaliana.. Nat. Genet. 30:190-193.[Medline]

OLSEN, K. M. and M. D. PURUGGANAN, 2002  Molecular evidence on the origin and evolution of glutinous rice. Genetics 162:941-950.[Abstract/Free Full Text]

OLUFOWOTE, J. O., G. S. KHUSH, and H. E. KAUFFMAN, 1977  Inheritance of bacterial blight resistance in rice. Phytopathology 67:771-775.

PARSONS, B. J., H. J. NEWBURY, M. T. JACKSON, and B. V. FORD-LLOYD, 1999  The genetic structure and conservation of aus, aman and boro rices from Bangladesh. Genet. Res. Crop Evol. 46:587-598.

PEDERSEN, J. F., J. J. TOY, and B. JOHNSON, 1998  Natural outcrossing of sorghum and sudangrass in the central great plains. Crop Sci. 38:937-939.[Abstract/Free Full Text]

PETPISIT, V., G. S. KHUSH, and H. E. KAUFFMAN, 1977  Inheritance of resistance to bacterial blight in rice. Crop Sci. 17:551-554.[Abstract/Free Full Text]

PRITCHARD, J. K. and N. A. ROSENBERG, 1999  Use of unlinked genetic markers to detect population stratification in association studies. Am. J. Hum. Genet. 65:220-228.[Medline]

RAFALSKI, A., 2002  Applications of single nucleotide polymorphisms in crop genetics. Curr. Opin. Plant Biol. 5:107-111.

REMINGTON, D. L., J. M. THORNSBERRY, Y. MATSUOKA, L. M. WILSON, and S. R. WHITT et al., 2001  Structure of linkage disequilibrium and phenotypic associations in the maize genome. Proc. Natl. Acad. Sci. USA 98:11479-11484.[Abstract/Free Full Text]

SAITOU, N. and M. NEI, 1987  The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol. Biol. Evol. 4:406-425.[Abstract]

SASAKI, T., T. MATSUMOTO, K. YAMAMOTO, K. SAKATA, and T. BABA et al., 2002  The genomic sequence and structure of rice chromosome 1. Nature 420:312-316.[Medline]

SHIFMAN, S. and A. DARVASI, 2001  The value of isolated populations. Nat. Genet. 28:309-310.[Medline]

SIDHU, G. S., G. S. KHUSH, and T. W. MEW, 1978  Genetic analysis of resistance to bacterial blight in seventy cultivars of rice, Oryza sativa L. Theor. Appl. Genet. 53:105-111.

SINGH, R. J., G. S. KHUSH, and T. W. MEW, 1983  A new gene for resistance to bacterial blight in rice. Crop Sci 23:558-560.[Abstract/Free Full Text]

TAKAHASHI, N., 1997 Differentiation of ecotypes in cultivated rice. 1. Adaptation to environments and ecotypic differentiation, pp. 112–118 in Science of the Rice Plant, Vol. 3, Genetics, edited by T. MATSUO and K. HOSHIKAWA. Food and Agriculture Policy Research Center, Tokyo.

TEMNYKH, S., W. D. PARK, N. AYRES, S. CARTINHOUR, and N. HAUCK et al., 2000  Mapping and genome organization of microsatellite sequences in rice, Oryza sativa L.. Theor. Appl. Genet. 100:697-712.

TEMNYKH, S., G. DECLERCK, A. LUKASHOVA, L. LIPOVICH, and S. CARTINHOUR et al., 2001  Computational and experimental analysis of microsatellites in rice (Oryza sativa L.): frequency, length variation, transposon associations, and genetic marker potential. Genet. Res. 11:1441-1452.

TENAILLON, M. I., M. C. SAWKINS, A. D. LONG, R. L. GAUT, and J. F. DOEBLEY et al., 2001  Patterns of DNA sequence polymorphism along chromosome 1 of maize (Zea mays ssp. mays L.). Proc. Natl. Acad. Sci. USA 98:9161-9166.[Abstract/Free Full Text]

THOMPSON, J. D., D. G. HIGGINS, and T. J. GIBSON, 1994  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position specific gap penalties and weight matrix choice. Nucleic Acids Res. 22:4673-4680.[Abstract/Free Full Text]

THORNSBERRY, J. M., M. M. GOODMAN, J. DOEBLEY, S. KRESOVICH, and D. NIELSEN et al., 2001  Dwarf8 polymorphisms associate with variation in flowering time. Nat. Genet. 28:286-289.[Medline]

WRIGHT, S., 1969 Evolution and the Genetics of Populations, Vol. 2. University of Chicago Press, Chicago.

YANG, D., A. SANCHEZ, G. S. KHUSH, Y. ZHU, and N. HUANG, 1998  Construction of a BAC contig containing the xa5 locus in rice. Theor. Appl. Genet. 97:1120-1124.

YOSHIMURA, A., T. W. MEW, G. S. KHUSH, and T. OMURA, 1984  Genetics of bacterial blight resistance in a breeding line of rice. Phytopathology 74:773-777.

ZHU, J., M. D. GALE, S. QUARRIE, M. T. JACKSON, and G. J. BRYAN, 1998  AFLP markers for the study of rice biodiversity. Theor. Appl. Genet. 96:602-611.




This article has been cited by other articles:


Home page
jashsHome page
I. Simko and J. Hu
Population Structure in Cultivated Lettuce and Its Impact on Association Mapping
J. Amer. Soc. Hort. Sci., January 1, 2008; 133(1): 61 - 68.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
K. A. Mather, A. L. Caicedo, N. R. Polato, K. M. Olsen, S. McCouch, and M. D. Purugganan
The Extent of Linkage Disequilibrium in Rice (Oryza sativa L.)
Genetics, December 1, 2007; 177(4): 2223 - 2232.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
T. Pyhajarvi, M. R. Garcia-Gil, T. Knurr, M. Mikkonen, W. Wachowiak, and O. Savolainen
Demographic History Has Influenced Nucleotide Diversity in European Pinus sylvestris Populations
Genetics, November 1, 2007; 177(3): 1713 - 1724.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
U. Arunyawat, W. Stephan, and T. Stadler
Using Multilocus Sequence Data to Assess Population Structure, Natural Selection, and Linkage Disequilibrium in Wild Tomatoes
Mol. Biol. Evol., October 1, 2007; 24(10): 2310 - 2322.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
J. M. Kolkman, S. T. Berry, A. J. Leon, M. B. Slabaugh, S. Tang, W. Gao, D. K. Shintani, J. M. Burke, and S. J. Knapp
Single Nucleotide Polymorphisms and Linkage Disequilibrium in Sunflower
Genetics, September 1, 2007; 177(1): 457 - 468.
[Abstract] [Full Text] [PDF]


Home page
Crop Sci.Home page
S. Chao, W. Zhang, J. Dubcovsky, and M. Sorrells
Evaluation of Genetic Diversity and Genome-wide Linkage Disequilibrium among U.S. Wheat (Triticum aestivum L.) Germplasm Representing Different Market Classes
Crop Sci., May 31, 2007; 47(3): 1018 - 1030.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
D. L. Hyten, I.-Y. Choi, Q. Song, R. C. Shoemaker, R. L. Nelson, J. M. Costa, J. E. Specht, and P. B. Cregan
Highly Variable Patterns of Linkage Disequilibrium in Multiple Soybean Populations
Genetics, April 1, 2007; 175(4): 1937 - 1944.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
L.-B. Zhang and S. Ge
Multilocus Analysis of Nucleotide Variation and Speciation in Oryza officinalis and Its Close Relatives
Mol. Biol. Evol., March 1, 2007; 24(3): 769 - 783.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
Q. Zhu, X. Zheng, J. Luo, B. S. Gaut, and S. Ge
Multilocus Analysis of Nucleotide Variation of Oryza sativa and Its Wild Relatives: Severe Bottleneck during Domestication of Rice
Mol. Biol. Evol., March 1, 2007; 24(3): 875 - 888.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
S. Stracke, T. Presterl, N. Stein, D. Perovic, F. Ordon, and A. Graner
Effects of Introgression and Recombination on Haplotype Structure and Linkage Disequilibrium Surrounding a Locus Encoding Bymovirus Resistance in Barley
Genetics, February 1, 2007; 175(2): 805 - 817.
[Abstract] [Full Text] [PDF]


Home page
Crop Sci.Home page
A. M. Casa, S. E. Mitchell, J. D. Jensen, M. T. Hamblin, A. H. Paterson, C. F. Aquadro, and S. Kresovich
Evidence for a Selective Sweep on Chromosome 1 of Cultivated Sorghum
Crop Sci., November 1, 2006; 46(Supplement_1): S-27 - S-40.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
I. Simko, K. G. Haynes, and R. W. Jones
Assessment of Linkage Disequilibrium in Potato Genome With Single Nucleotide Polymorphism Markers
Genetics, August 1, 2006; 173(4): 2237 - 2245.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
K. L. McNally, R. Bruskiewich, D. Mackill, C. R. Buell, J. E. Leach, and H. Leung
Sequencing multiple and diverse rice varieties. Connecting whole-genome variation with phenotypes.
Plant Physiology, May 1, 2006; 141(1): 26 - 31.
[Full Text] [PDF]


Home page
GeneticsHome page
A. Liu and J. M. Burke
Patterns of Nucleotide Diversity in Wild and Cultivated Sunflower
Genetics, May 1, 2006; 173(1): 321 - 330.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
F. Breseghello and M. E. Sorrells
Association Mapping of Kernel Size and Milling Quality in Wheat (Triticum aestivum L.) Cultivars
Genetics, February 1, 2006; 172(2): 1165 - 1177.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
K. S. Caldwell, J. Russell, P. Langridge, and W. Powell
Extreme Population-Dependent Linkage Disequilibrium Detected in an Inbreeding Plant Species, Hordeum vulgare
Genetics, January 1, 2006; 172(1): 557 - 567.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
M. T. Hamblin, M. G. Salas Fernandez, A. M. Casa, S. E. Mitchell, A. H. Paterson, and S. Kresovich
Equilibrium Processes Cannot Explain High Levels of Short- and Medium-Range Linkage Disequilibrium in the Domesticated Grass Sorghum bicolor
Genetics, November 1, 2005; 171(3): 1247 - 1256.
[Abstract] [Full Text] [PDF]