- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Supplemental Material
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Shepard, K. A.
- Articles by Purugganan, M. D.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Shepard, K. A.
- Articles by Purugganan, M. D.
Molecular Population Genetics of the Arabidopsis CLAVATA2 Region: The Genomic Scale of Variation and Selection in a Selfing Species
Kristen A. Shepard1,a and Michael D. Puruggananaa Department of Genetics, North Carolina State University, Raleigh, North Carolina 27695
Corresponding author: Michael D. Purugganan, 3513 Gardner Hall, North Carolina State University, Raleigh, NC 27695., michaelp{at}unity.ncsu.edu (E-mail)
Communicating editor: M. AGUADÉ
| ABSTRACT |
|---|
The Arabidopsis thaliana CLAVATA2 (CLV2) gene encodes a leucine-rich repeat protein that regulates the development of the shoot meristem. The levels and patterns of nucleotide variation were assessed for CLV2 and 10 flanking genes that together span a 40-kb region of chromosome I. A total of 296 out of 7959 sequenced nucleotide sites were polymorphic. The mean levels of sequence diversity of the contiguous genes in this region are approximately twofold higher than those of other typical Arabidopsis nuclear loci. There is, however, wide variation in the levels and patterns of sequence variation among the 11 linked genes in this region, and adjacent genes appear to be subject to contrasting evolutionary forces. CLV2 has the highest levels of nucleotide variation in this region, a significant excess of intermediate frequency polymorphisms, and significant levels of intragenic linkage disequilibrium. Most alleles at CLV2 are found in one of three haplotype groups of moderate (>15%) frequency. These features suggest that CLV2 may harbor a balanced polymorphism.
BALANCED polymorphisms are maintained in populations by selective forces acting on alternative alleles of a locus (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
200 bp (![]()
In selfing species, the width of the genomic region of enhanced variation scales with the inverse of the population recombination parameter C = 4Ner', where Ne is the effective population size and r' is the selfing-reduced effective recombination rate (![]()
A. thaliana provides an excellent opportunity to empirically assess the genomic impact of balanced polymorphisms in a predominantly selfing plant species. Outcrossing rates in this weedy plant species are estimated to be as low as 1% (![]()
![]()
![]()
![]()
250 kb (![]()
![]()
![]()
![]()
Low effective recombination and long-range linkage disequilibrium in A. thaliana suggest that the region of enhanced variation associated with a balanced polymorphism could extend over several linked genes. This linkage may affect the rate and efficacy of selection on alternate alleles. Recent studies, however, contradict this prediction; the effects of balanced polymorphisms in the TFL1 gene (![]()
![]()
![]()
CLV2 is a meristem regulatory gene located near 89 cM on chromosome I. Loss-of-function mutations at CLV2 result in the accumulation of undifferentiated cells in vegetative, inflorescence, and floral meristems. The enlargement of these shoot meristems contributes to the formation of extra flowers and floral organs (![]()
20 leucine-rich repeats (LRR), a transmembrane region, and a short cytoplasmic domain (![]()
![]()
![]()
![]()
![]()
![]()
The initial isolation of CLV2 showed that this gene harbors a large amount of nucleotide diversity (![]()
| MATERIALS AND METHODS |
|---|
Isolation and sequencing of alleles:
A. thaliana ecotypes were obtained from single-seed propagated material provided by the Arabidopsis Biological Resource Center (ABRC; see Table 1). The Lisse-2 seed stock was from the population collection of P. H. Williams maintained at ABRC. A. lyrata seed from a Karhumaki, Russia, population was provided by O. Savolainen and Helmi Kuittinen.
|
Genomic DNA was isolated from young leaves of 920 A. thaliana ecotypes and one A. lyrata accession using the plant DNeasy mini kit (QIAGEN, Chatsworth, CA). PCR primers for 11 genes in this region were designed from the Col-0 genomic sequence [bacterial artificial chromosome (BAC) T8F5, GenBank accession no.
AC004512] using Primer3 (![]()
|
DNA fragments were purified using the QIAquick PCR purification kit or the QIAquick gel extraction kit (QIAGEN). A. thaliana samples were sequenced directly via cycle sequencing with BigDye terminators (Applied Biosystems) using the primers described in Supplementary Text I at http://www.genetics.org/supplemental/. Several singleton polymorphisms were confirmed with reamplification and sequencing. Amplified A. lyrata products were cloned into pCR4Blunt-TOPO vector using the Zero Blunt TOPO PCR cloning kit (Invitrogen). Plasmid miniprep DNA was isolated using the QIAprep miniprep kit (QIAGEN), and sequenced twice via cycle sequencing from both directions. DNA sequencing was conducted with a Prism 3700 96-capillary automated sequencer (Applied Biosystems). The PHRED and PHRAP functions (![]()
![]()
Molecular population genetic data analysis:
Sequences used in this study were visually aligned against the A. thaliana GenBank sequence for the Col-0 accession (no. AC004512). The variable length portions of microsatellites were excluded from the analysis. The A. lyrata ortholog was used as the outgroup. Interspecific divergence distances were estimated from silent sites with the Kimura two-parameter model using MEGA2.1 (![]()
![]()
(![]()
W (![]()
![]()
![]()
![]()
![]()
The Hudson-Kreitman-Aguadé (HKA) two-locus test (![]()
![]()
![]()
![]()
![]()
Previously published A. thaliana sequences of the following genes, which were available at the time of this study, were used in comparisons of nucleotide diversity: Adh (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
| RESULTS |
|---|
Nucleotide variation among linked genes in a 40-kb region of Arabidopsis chromosome I:
Fragments of 11 adjacent genes on chromosome I were sequenced from 9 to 12 A. thaliana accessions sampled primarily from Eurasia (Table 1 and Table 2). The sequenced regions spanned exons and (when present) introns within the coding region of each gene; fragments ranged from 277 to 939 bp, with a mean length of 724 bp/gene. Of the 7959 nucleotide sites sequenced for this study, 296 sites segregated for single nucleotide polymorphisms. Twenty-eight indel polymorphisms, ranging from 1 to 3.9 kb, were also observed in these sequences. Four indels, two in the serpin and two in the ARI/RING-like gene, are associated with simple sequence or microsatellite repeats in introns. Seven indels occur in coding regions. Tables of polymorphic sites are given in Supplementary Figures S1S7 at http://www.genetics.org/supplemental/.
Polymorphisms in the UBQ13, the MATH domain gene, and the serpin suggest that these loci may be pseudogenes. All sampled UBQ13 alleles contain a partial ubiquitin repeat followed by three or four complete repeats. We were unable to locate the rest of the repeat in the upstream genomic sequence of the Col-0 accession. The internal repeats appear to have undergone substantial recombination; because homology among these repeats was difficult to determine, analyses were restricted to the 5' flanking region, the partial repeat, and the first and last complete repeats. One allele of UBQ13 codes for a premature stop codon, while two alleles contain 3- or 12-bp deletions in coding sequence. The Col-0 allele contains a 3.9-kb insertion of mitochondrial DNA (![]()
Estimates of silent nucleotide diversity and divergence in the CLV2 region:
Nucleotide diversity at silent sites (third position of codons and noncoding regions) for these 11 genes was estimated from the average number of pairwise differences (
; ![]()
W; ![]()
span nearly one order of magnitude, from 0.0063 to 0.0579 (see Table 3A and Fig 1). Levels of
W show a comparable range, from 0.0075 to 0.0489 (Table 3A). CLV2 exhibits the greatest silent site diversity, with the highest
and the second highest
. The high value of
W observed at the putative antigen receptor can be attributed to a single allele from the Ita-0 accession that accounts for 7 of the 10 segregating sites in this gene. The lowest levels of
and
W were observed 3 kb upstream of CLV2 in the MATH domain gene and 17 kb downstream in the ARI/RING-like gene.
|
|
Overall, the CLV2 region exhibits elevated levels of silent site nucleotide diversity compared to other nuclear genes in A. thaliana. The mean values of
and
W for the 11 genes in the CLV2 region are 0.0219 ± 0.005 and 0.0241 ± 0.004, respectively. These mean diversity levels are considerably greater than those observed among 14 previously studied A. thaliana genes. For these other loci (see MATERIALS AND METHODS), the mean values of silent site
and
W are 0.009 ± 0.001 and 0.012 ± 0.002, twofold lower than those of the genes in the CLV2 region (see Fig 2). In contrast, the 11 genes in the CLV2 region display only slightly higher levels of nucleotide divergence between A. thaliana and the closely related species A. lyrata (Table 3A). The mean level of silent site sequence divergence, K2P, between these two species for 12 previously studied A. thaliana genes is 0.123 ± 0.007 substitutions/site. The mean nucleotide divergence level for the 11 linked genes in the CLV2 region is 0.138 ± 0.010 substitutions/site.
|
The HKA test (![]()
|
Selective forces among linked genes in the CLV2 region:
The frequency distribution of polymorphisms provides information on the relative roles of neutral drift vs. selection at specific loci. The skewness of frequency distributions for nucleotide polymorphisms in the sample or along branches in the gene genealogy can be evaluated with the Tajima (![]()
![]()
Among the 11 genes in the CLV2 region, 8 have negative values of Tajima's D and Fu and Li's D and D*, indicating an excess of low-frequency polymorphisms within these loci (Table 4A). The trend toward excess low-frequency polymorphism for most of the genes at the CLV2 region is similar to that observed for many other Arabidopsis nuclear genes (![]()
![]()
![]()
![]()
In contrast, both CLV2 and the TIR domain gene have consistently positive values of the Tajima and Fu and Li test statistics (Table 4A), but only the TIR domain gene was significantly positive (Fu and Li D* = +1.2984, P < 0.05; D = +1.6783, P < 0.01). Loci with significant positive values of these test statistics have rarely been observed in previous studies of A. thaliana. Positive values of these test statistics are associated with an excess of intermediate-frequency polymorphisms. These data suggest that both of these genes may be evolving non-neutrally in a pattern consistent with balancing selection, but the power of these tests is limited at such small sample sizes.
Since our results indicated that the CLV2 gene has the highest level of polymorphism among the 11 linked genes, we examined variation at this gene and its three closest neighbors in greater detail. We sequenced additional accessions at these loci to increase the number of sampled alleles to 1921. The results from this expanded data set are consistent with the patterns observed with the smaller data set. The levels of nucleotide variation, the directions of the Tajima's D and Fu and Li's D* and D tests statistics, and the results of the HKA tests against Adh are all comparable across the two data sets (Table 3 and Table 4). The only difference is that with larger sample sizes, the value of Tajima's D is now significant for CLV2 (D = +1.752, P < 0.05). This finding is consistent with previous analyses that indicated that augmenting sample sizes for sequenced alleles increases the power to detect significant deviations from the neutral-equilibrium model (![]()
The positive value of Tajima's D in CLV2 is associated with the presence of at least three distinct haplotype groups (I, II, and IV in Fig 3). These three haplogroups are found at moderate frequency, with the rarest haplogroup at
15% frequency. Also, one haplotype (III in Fig 3) may have arisen from a recombination event between alleles belonging to groups II and IV. Alternatively, haplotype III, obtained from the Ita-0 accession, may represent an additional allelic class; this accession also bears more divergent alleles of several other loci in this region.
|
Intragenic and intergenic linkage disequilibrium at the CLV2 region:
Linkage disequilibrium, the nonrandom association of allelic polymorphisms, was surveyed for nucleotide polymorphisms both within and between genes in the CLV2 region. The amount of linkage disequilibrium was estimated using the r2 statistic (![]()
Larger sample sizes increase the power of detecting significant linkage disequilibrium, and this is demonstrated for four genes (the MATH domain gene, CLV2, the serpin pseudogene, and the TIR domain gene), which were examined in the expanded sample set of 1921 ecotypes. The proportion of significant comparisons ranges from
1 to
30% of pairwise comparisons (Table 5). The levels of intragenic linkage disequilibrium can also be estimated using the ZnS statistic (![]()
|
The extent of disequilibrium between genes is evident in plots of r2 as a function of physical distance. Across the entire 40-kb region, strong linkage disequilibrium (r2 = 1) is observed even at distances of
25 kb in the smaller data set (Fig 4A). Using data from the expanded sample set, strong levels of intergenic disequilibrium are also evident among CLV2 and its nearest neighbors (Fig 4B). The distance plot shows strong linkage disequilibrium up to
6 kb associated with correlations among CLV2, the serpin pseudogene, and the TIR domain locus.
|
Amino acid replacements at CLV2:
In our sample of CLV2 alleles, 22 of the 54 nucleotide polymorphisms code for amino acid replacements (Fig 3A); 20 of the substitutions occur in the LRRs, while two are in the cysteine-pair region preceding the LRRs (Fig 5). Proteins in the four allele classes differ by 715 amino acids. Although the majority of these replacements are fairly conservative, two to five of the differences between allele classes are due to radical substitutions (Fig 3B). The amino acid substitutions observed in our data set probably encompass much of the variation present within the species. Comparisons of the full-length CLV2 sequence from the Col-0 (class I), Ws-0 (class II), and Ler-0 (class IV) ecotypes reveal only two additional amino acid replacements, one in the 18th LRR and one in the cysteine-pair region following the LRRs (![]()
|
| DISCUSSION |
|---|
Contrasting patterns of sequence variation across the A. thaliana CLV2 region:
Molecular population genetic analyses of the A. thaliana CLV2 region indicate that levels and patterns of nucleotide diversity can vary even among contiguous, closely linked genes. For example, although CLV2 has the highest level of nucleotide variation in this region (
= 0.0558), the MATH domain gene has the lowest (
= 0.0060)a nearly 10-fold reduction in diversity between adjacent genes. Similar patterns of differing nucleotide diversity levels among linked genes have also been observed in a 400-kb region around the FRI gene (![]()
![]()
![]()
There also appear to be dramatic changes in the patterns of nucleotide variation observed among neighboring loci in the CLV2 region. Both CLV2 and the TIR domain locus, for example, have positive levels of Tajima's D, consistent with an excess of intermediate frequency polymorphisms in the sampled alleles. These two loci, however, are surrounded by and interspersed with genes that display negative levels of Tajima's D, indicating an excess of low-frequency polymorphisms for these linked loci. These results suggest that levels and patterns of variation are remarkably gene-specific even among closely linked A. thaliana nuclear genes.
Linkage disequilibrium levels appear to be extensive across the CLV2 region. In this 40-kb region, disequilibrium is observed both intra- and intergenically, and strong disequilibrium can extend to
25 kb. There is also evidence for correlation of allele genealogies among some of the linked genes (K. A. SHEPARD, unpublished observations). This correlation in gene genealogies, however, is not observed between genes that are farther apart and can also disappear between adjacent loci. The CLV2 gene and the MATH domain locus immediately upstream, for example, display weaker correlation in genealogies among the sampled alleles (K. A. SHEPARD, unpublished observations).
Several of the genes in the CLV2 region appear to contain two or more distinct haplotype groups (see, for example, Figure S1 in Supplementary Information at http://www.genetics.org/supplemental/). The presence of two distinct allele groups, commonly referred to as allelic dimorphism, has been observed in previous studies of A. thaliana (![]()
![]()
![]()
![]()
![]()
![]()
![]()
The long-range decay of linkage disequilibrium is expected in A. thaliana, a predominantly selfing species with a reduced effective recombination rate. Unlike in D. melanogaster or Z. mays, where disequilibrium decays in scales of
1 kb, linkage disequilibrium in A. thaliana can persist up to 250 kb (![]()
![]()
![]()
Evidence for balancing selection in the CLV2 genomic region:
The reported high level of polymorphism at the CLV2 meristem regulatory gene (![]()
Three alternative scenarios may explain this pattern of diversity at the CLV2 gene. One possibility is a duplication at this locus, which could explain the distinct haplogroups, high variation, and intragenic linkage disequilibrium. There is no evidence, however, for a recent duplication of CLV2 or any of the genes flanking it in the Arabidopsis genome. Moreover, we find no evidence of duplication heterozygosity in different A. thaliana ecotypes (K. A. SHEPARD, unpublished observation). A second scenario is that contemporary or ancestral geographical subdivision can also result in the observed pattern. Detailed analysis of A. thaliana ecotypes using genome-wide markers, however, does not reveal any strong geographical subdivision within this species (![]()
![]()
![]()
![]()
![]()
![]()
![]()
The only other gene in this region that shows some evidence for balancing selection is the TIR domain gene located
4 kb downstream of CLV2. This locus has significantly positive Fu and Li and ZnS disequilibrium test statistics; unlike CLV2, however, this gene does not show significantly high intraspecific nucleotide variation compared to Adh (HKA test, P < 0.7). The pattern at the TIR domain gene may simply result from linkage with a balanced polymorphism at CLV2, as is suggested by the allele groups shared among these loci (see Fig 3 and S5 at http://www.genetics.org/supplemental/). Alternatively, balancing selection may be acting independently on the TIR domain gene. The sequence of this gene is similar to the TIR portion of the RPS4 disease-resistance gene (![]()
While levels of nucleotide variation are predicted to be highest immediately surrounding a balanced polymorphism, an elevated level of variation may also be expected in a more extended genomic region of a predominantly selfing species. This predicted pattern is also observed by the high level of nucleotide variation among the 11 linked genes in the CLV2 genomic region. There is a twofold increase in estimates of variation between loci in the CLV2 region and a set of 14 other A. thaliana genes. There is no accompanying increase in nucleotide divergence estimates for these genes between A. thaliana and A. lyrata, compared to previously studied loci. This suggests that the increase in intraspecific nucleotide variation in this region is not the result of an increase in the neutral mutation rate.
Our results, however, indicate that while a wide window of enhanced neutral variation surrounds the putative balanced polymorphisms in CLV2, significant effects of selection on levels and patterns of sequence diversity appear confined to genic scales. The localized nature of the effects of balanced polymorphisms in the predominantly selfing A. thaliana is paradoxical, although it has been observed at several loci. In the RPS5 disease-resistance locus, significantly enhanced variation is observed surrounding the sequence junction that harbors the RPS5 balanced indel polymorphism, but is not observed at adjacent loci within
10 kb (![]()
![]()
![]()
The CLV2 gene, and to some extent the TIR domain locus, are the only two genes that display departures from neutral-equilibrium predictions by several criteria: (i) significantly elevated levels of nucleotide variation, (ii) intermediate-frequency polymorphisms, and (iii) intragenic linkage disequlibrium. The other genes in the CLV2 region may also have been affected by selection at or near these loci, but do not retain consistent signatures of balancing or positive selective forces. This may reflect, in part, the relatively low power of some of the tests for selection (![]()
Functional consequences of the putative balanced polymorphism at CLV2:
The functional consequences of natural allelic differentiation at CLV2 remain unclear. The putatively balanced alleles at CLV2 are associated with a large number of replacement polymorphisms, with 715 amino acid changes differentiating different allele groups. The distribution of amino acid replacements within LRRs suggests that some of these substitutions could affect the function of the CLV2 protein. Extracellular plant LRRs are characterized by the consensus amino acid sequence LxxL{xxLxLxx}NxLxGxI-PxxLGx, where L may also be isoleucine, valine, or phenylalanine. Plant-specific LRRs have not yet been crystallized; however, structural analyses of nonplant proteins predict that each LRR consists of a ß-strand and an
-helix joined by loops. The alternating ß-strands and
-helices yield a horseshoe-shaped structure in which parallel ß-strands form a binding pocket for protein-protein interactions. The xxLxLxx motif forms a ß-strand/ ß-turn with buried leucine residues and solvent-exposed variable residues (![]()
In our CLV2 data set, three amino acid substitutions occur in the solvent-exposed residues in the ß-strand/ß-turn (Fig 5). Two of these mutations (Thr125
Ile and Arg148
Gly) are radical substitutions, while the third (Ile244
Val) is conservative. Recent studies of cytoplasmic LRR proteins that confer disease resistance in plants have highlighted the functional importance of variation in solvent-exposed LRR residues. Evidence for diversifying selection on these residues has been observed in comparisons of paralogous disease-resistance genes within several species (![]()
![]()
![]()
![]()
![]()
![]()
![]()
The majority of amino acid replacements in CLV2 are located in the interstrand regions of the LRRs. Of these 14 replacements, only 2 are predicted to reside in helical motifs, suggesting that the remainder are found in loops (Fig 5). Although the structure-function relationships in the interstrand regions are less understood, residues in loop regions can clearly affect LRR protein function. Studies of natural variation at RPS2, an A. thaliana disease-resistance gene, have shown that six amino acid differences between the Col-0 (resistant) and Po-1 (susceptible) alleles are sufficient to alter pathogen recognition (![]()
![]()
![]()
If, indeed, some of these replacement substitutions are maintained as balanced polymorphisms, the mechanism of selection is puzzling in light of what little is known about CLV2's role in plant development. Although there is compelling genetic evidence that the proteins encoded by the three CLAVATA genes act together to regulate shoot meristem growth, the exact constituents of and binding relationships among the receptor and ligand multimers are unclear. Of the three characterized CLAVATA genes, clv2 mutant alleles show the weakest shoot meristem phenotypes (![]()
![]()
![]()
![]()
We therefore propose two hypotheses that might explain the putative balancing selection on the CLV2 locus. First, CLV2 might act as a modulator of shoot meristem growth, with different alleles enhancing or reducing the strength of signaling through the CLAVATA complex. This modulation might be accomplished by variation in the accumulation of CLV1 protein in the plasma membrane or by alterations in the affinity of the complex for the multimeric CLV3 ligand. Such modulation could have direct effects on fitness-related traits such as flower number. Alternatively, balancing selection may act on pleiotropic functions of CLV2 that involve currently unidentified binding partners. Characterizing phenotypic, ecologically relevant variation associated with alleles at CLV2 will strengthen the argument of balancing selection at this locus.
| FOOTNOTES |
|---|
Sequence data from this article have been deposited with the EMBL/GenBank Data libraries under accession nos.
AF528566,
AF528713. ![]()
1 Present address: Department of Biological Sciences, Barnard College, Columbia University, New York, NY 10027. ![]()
| ACKNOWLEDGMENTS |
|---|
The authors thank Brandon Gaut, Ken Olsen, Mark Ungerer, Montserrat Aguadé, two anonymous reviewers, and members of the Purugganan laboratory for helpful comments, Outi Savolainen and Helmi Kuittinen for providing A. lyrata seed, and Juergen Kroymann for providing preprints of relevant manuscripts. The authors are also grateful to the NCSU Phytotron for providing growth facilities and the NCSU Genome Research Laboratory for sequencing facilities. This work was funded by a grant from the National Science Foundation Integrated Research Challenges in Environmental Biology program to M.D.P., J. Schmitt, and T.F.C. Mackay, and an Alfred P. Sloan Foundation Young Investigator Award to M.D.P.
Manuscript received July 16, 2002; Accepted for publication November 20, 2002.
| LITERATURE CITED |
|---|
ABBOT, R. J. and M. F. GOMEZ, 1989 Population genetic structure and outcrossing rate of Arabidopsis thaliana (L.) Heynh. Heredity 62:411-418.
AGUADÉ, M., 2001 Nucleotide sequence variation at two genes of the phenylpropanoid pathway, the FAH1 and F3H genes, in Arabidopsis thaliana.. Mol. Biol. Evol. 18:1-9.
BALDI, P., S. BRUNAK, P. FRASCONI, G. SODA, and G. POLLASTRI, 1999 Exploiting the past and the future in protein secondary structure prediction. Bioinformatics 15:937-946.
BANERJEE, D., X.-C. ZHANG, and A. F. BENT, 2001 The leucine-rich repeat domain can determine effective interaction between RPS2 and other host factors in Arabidopsis RPS2-mediated disease resistance. Genetics 158:439-450.
BITTNER-EDDY, P. D., I. R. CRUTE, E. B. HOLUB, and J. L. BEYNON, 2000 RPP13 is a simple locus in Arabidopsis thaliana for alleles that specify downy mildew resistance to different avirulence determinants in Peronospora parasitica.. Plant J. 21:177-188.[Medline]
BRAND, U., J. C. FLETCHER, M. HOBE, E. M. MEYEROWITZ, and R. SIMON, 2000 Dependence of stem cell fate in Arabidopsis on a feedback loop regulated by CLV3 activity. Science 289:617-619.
CAICEDO, A. L., B. A. SCHAAL, and B. N. KUNKEL, 1999 Diversity and molecular evolution of the RPS2 resistance gene in Arabidopsis thaliana.. Proc. Natl. Acad. Sci. USA 96:302-306.
CHARLESWORTH, B., M. NORDBORG, and D. CHARLESWORTH, 1997 The effects of local selection, balanced polymorphism and background selection on equilibrium patterns of genetic diversity in subdivided population. Genet. Res. 70:155-174.[Medline]
DODDS, P. N., G. J. LAWRENCE, and J. G. ELLIS, 2001 Six amino acid changes confined to the leucine-rich repeat beta-strand/beta-turn motif determine the difference between the P and P2 rust resistance specificities in flax. Plant Cell 13:163-178.
EWING, B. and P. GREEN, 1998 Base-calling of automated sequencer traces using Phred. II. Error probabilities. Genome Res. 8:186-194.
EWING, B., L. HILLIER, M. C. WENDL, and P. GREEN, 1998 Base-calling of automated sequencer traces using Phred. I. Accuracy assessment. Genome Res. 8:175-185.
FU, Y.-X. and W.-H. LI, 1993 Statistical tests of neutrality of mutations. Genetics 133:693-709.[Abstract]
GASSMANN, W., M. E. HINSCH, and B. J. STASKAWICZ, 1999 The Arabidopsis RPS4 bacterial-resistance gene is a member of the TIR-NBS-LRR family of disease-resistance genes. Plant J. 20:265-277.[Medline]
HAGENBLAD, J. and M. NORDBORG, 2002 Sequence variation and haplotype structure surrounding the flowering time locus FRI in Arabidopsis thaliana.. Genetics 161:289-298.
HAUBOLD, B., J. KROYMANN, A. RATZKA, T. MITCHELL-OLDS, and T. WIEHE, 2002 Recombination and gene conversion in a 170-kb genomic region of Arabidopsis thaliana. Genetics 161:1269-1278.
HILL, W. G. and A. ROBERTSON, 1968 Linkage disequilibrium in finite populations. Theor. Appl. Genet. 38:226-231.
HUDSON, R. R., M. KREITMAN, and M. AGUADÉ, 1987 A test of neutral molecular evolution based on nucleotide data. Genetics 116:153-159.
INNAN, H. and W. STEPHAN, 2000 The coalescent in an exponentially growing metapopulation and its application to Arabidopsis thaliana.. Genetics 155:2015-2019.
INNAN, H., F. TAJIMA, R. TERAUCHI, and N. T. MIYASHITA, 1996 Intragenic recombination in the Adh locus of the wild plant Arabidopsis thaliana. Genetics 143:1761-1770.[Abstract]
JEONG, S., A. E. TROTOCHAUD, and S. E. CLARK, 1999 The Arabidopsis CLAVATA2 gene encodes a receptor-like protein required for the stability of the CLAVATA1 receptor-like kinase. Plant Cell 11:1925-1933.
KAWABE, A. and N. T. MIYASHITA, 1999 DNA variation in the basic chitinase locus (ChiB) region of the wild plant Arabidopsis thaliana.. Genetics 153:1445-1453.
KAWABE, A., H. INNAN, R. TERAUCHI, and N. T. MIYASHITA, 1997 Nucleotide polymorphism in the acidic chitinase locus (ChiA) region of the wild plant Arabidopsis thaliana.. Mol. Biol. Evol. 14:1303-1315.[Abstract]
KAWABE, A., K. YAMANE, and N. T. MIYASHITA, 2000 DNA polymorphism at the cytosolic phosphoglucose isomerase (PgiC) locus of the wild plant Arabidopsis thaliana.. Genetics 156:1339-1347.
KAYES, J. M. and S. E. CLARK, 1998 CLAVATA2, a regulator of meristem and organ development in Arabidopsis.. Development 125:3843-3851.[Abstract]
KELLY, J. K., 1997 A test of neutrality based on interlocus associations. Genetics 146:1197-1206.[Abstract]
KOBE, B. and A. V. KAJAVA, 2001 The leucine-rich repeat as a protein recognition motif. Curr. Opin. Struct. Biol. 11:725-732.[Medline]
KREITMAN, M. and R. R. HUDSON, 1991 Inferring the evolutionary histories of the Adh and Adh-Dup loci in Drosophila melanogaster from patterns of polymorphism and divergence. Genetics 127:565-582.[Abstract]
KUITTINEN, H. and M. AGUADÉ, 2000 Nucleotide variation at the CHALCONE ISOMERASE locus in Arabidopsis thaliana. Genetics 155:863-872.
KUMAR, S., K. TAMURA, I. B. JAKOBSEN, and M. NEI, 2001 MEGA2: molecular evolutionary genetics analysis software. Bioinformatics 17:1244-1245.
LONG, A. D., R. F. LYMAN, C. H. LANGLEY, and T. F. C. MACKAY, 1998 Two sites in the Delta gene region contribute to naturally occurring variation in bristle number in Drosophila melanogaster.. Genetics 149:999-1017.
MCDOWELL, J. M., M. DHANDAYDHAM, T. A. LONG, M. G. M. AARTS, and S. GOFF et al., 1998 Intragenic recombination and diversifying selection contribute to the evolution of downy mildew resistance at the RPP8 locus of Arabidopsis. Plant Cell 10:1861-1874.
MEYERS, B. C., K. A. SHEN, P. ROHANI, B. S. GAUT, and R. W. MICHELMORE, 1998 Receptor-like genes in the major resistance locus of lettuce are subject to divergent selection. Plant Cell 11:1833-1846.
MIYASHITA, N. T., 2001 DNA variation in the 5' upstream region of the Adh locus of the wild plants Arabidopsis thaliana and Arabis gemmifera.. Mol. Biol. Evol. 18:164-171.
MIYASHITA, N. T., A. KAWABE, H. INNAN, and R. TERAUCHI, 1998 Intra- and interspecific DNA variation and codon bias of the alcohol dehydrogenase (Adh) locus in Arabis and Arabidopsis species. Mol. Biol. Evol. 15:1420-1429.
NORDBORG, M., B. CHARLESWORTH, and D. CHARLESWORTH, 1996 Increased levels of polymorphism surrounding selectively maintained sites in highly selfing species. Proc. R. Soc. Lond. Ser. B 263:1033-1039.
NORDBORG, M., J. O. BOREVITZ, J. BERGELSON, C. C. BERRY, and J. CHORY et al., 2002 The extent of linkage disequilibrium in Arabidopsis thaliana. Nat. Genet. 30:190-193.[Medline]
OLSEN, K. M., A. WOMACK, A. R. GARRETT, J. I. SUDDITH, and M. D. PURUGGANAN, 2002 Contrasting evolutionary forces in the Arabidopsis thaliana floral developmental pathway. Genetics 160:1641-1650.
PARNISKE, M., K. E. HAMMOND-KOSACK, C. GOLSTEIN, C. M. THOMAS, and D. A. JONES et al., 1997 Novel disease resistance specificities result from sequence exchange between tandemly repeated genes at the Cf-4/9 locus of tomato. Cell 91:821-832.[Medline]
PURUGGANAN, M. D. and J. I. SUDDITH, 1998 Molecular population genetics of the Arabidopsis CAULIFLOWER regulatory gene: nonneutral evolution and naturally occurring variation in floral homeotic function. Proc. Natl. Acad. Sci. USA 95:8130-8134.
PURUGGANAN, M. D. and J. I. SUDDITH, 1999 Molecular population genetics of floral homeotic loci: departures from the equilibrium-neutral model at the APETALA3 and PISTILLATA genes of Arabidopsis thaliana.. Genetics 151:839-848.




