Genetics, Vol. 161, 1269-1278, July 2002, Copyright © 2002

Recombination and Gene Conversion in a 170-kb Genomic Region of Arabidopsis thaliana

Bernhard Haubold2,a, Jürgen Kroymanna, Andreas Ratzkaa, Thomas Mitchell-Oldsa, and Thomas Wiehe3,a
a Max-Planck-Institut für Chemische Ökologie, Department of Genetics and Evolution, D-07745 Jena, Germany

Corresponding author: Bernhard Haubold, Waldhofer Str. 98, D-69123 Heidelberg, Germany., bernhard.haubold{at}lionbioscience.com (E-mail)

Communicating editor: S. W. SCHAEFFER


*  ABSTRACT
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Arabidopsis thaliana is a highly selfing plant that nevertheless appears to undergo substantial recombination. To reconcile its selfing habit with the observations of recombination, we have sampled the genetic diversity of A. thaliana at 14 loci of ~500 bp each, spread across 170 kb of genomic sequence centered on a QTL for resistance to herbivory. A total of 170 of the 6321 nucleotides surveyed were polymorphic, with 169 being biallelic. The mean silent genetic diversity ({pi}s) varied between 0.001 and 0.03. Pairwise linkage disequilibria between the polymorphisms were negatively correlated with distance, although this effect vanished when only pairs of polymorphisms with four haplotypes were included in the analysis. The absence of a consistent negative correlation between distance and linkage disequilibrium indicated that gene conversion might have played an important role in distributing genetic diversity throughout the region. We tested this by coalescent simulations and estimate that up to 90% of recombination is due to gene conversion.


GENOME projects facilitate evolutionary studies, which in turn help to interpret the information uncovered by large-scale sequencing (CHARLESWORTH et al. 2001 Down). As a consequence, interest in the population genetics of the model plant Arabidopsis thaliana has grown steadily over the past decade. Three central observations have emerged from the analyses of the seven or so loci that have been subjected to comparative sequencing in this cruciferous weed (KAWABE et al. 1997 Down; PURUGGANAN and SUDDITH 1998 Down, PURUGGANAN and SUDDITH 1999 Down; KUITTINEN and AGUADE 2000 Down; SAVOLAINEN et al. 2000 Down): (i) There is an excess of rare polymorphisms, (ii) a number of genes have alleles that fall into two distinct classes (allelic dimorphism), and (iii) there is more recombination than might be expected, given that A. thaliana is a selfer.

The excess of rare polymorphisms, often indicated by a negative value of Tajima's D, is perhaps the least surprising of these findings. Most structural genes are subject to purifying selection, leading to an excess of rare frequency segregating sites. The converse, i.e., an excess of genetic diversity ({pi}), is an infrequent but often highly significant exception, as seen, for example, at the Rpm1 resistance locus of A. thaliana (STAHL et al. 1999 Down).

There is a slight tension between the observation of an excess of rare polymorphisms and allelic dimorphism. Extreme cases of the latter correspond to a deficiency of rare polymorphisms, as observed at the Rpm1 locus (STAHL et al. 1999 Down), and may therefore be evidence for balancing selection. However, the reason for the apparent dimorphism may simply be that in a sample of n sequences the expected time required for the last two lineages to coalesce is equal to that taken by the first n - 2 sequences (KINGMAN 1982A Down, KINGMAN 1982B Down). In other words, even neutral genealogies tend to have deep splits, and since branch lengths are proportional to the number of segregating sites, apparent dimorphism might result from such a neutral process.

The most enigmatic observation concerns recombination. The outcrossing rate of A. thaliana has been estimated as 0.3% (ABBOT and GOMES 1989 Down), which is very low. On the other hand, MIYASHITA et al. 1999 Down found no significant linkage disequilibrium among 472 AFLP markers scored in 38 ecotypes. This contrasts with the situation in another well-studied selfing plant species, wild Barley (Hordeum spontaneum). Its outcrossing rate has been estimated as 1.6% (BROWN et al. 1978 Down) and in an extensive allozyme study 20 out of 28 populations investigated displayed significant genome-wide linkage disequilibrium (BROWN et al. 1980 Down). However, using an extension of the test for linkage disequilibrium applied to H. spontaneum (HAUBOLD et al. 1998 Down), SHARBEL et al. 2000 Down detected highly significant linkage disequilibrium among 79 amplified fragment length polymorphism (AFLP) loci scored in 142 ecotypes (henceforth referred to as accessions) of A. thaliana. Nevertheless, the extent of recombination in A. thaliana has remained unclear, prompting the present study.

We employed the genomic sequence of the Columbia accession of A. thaliana to sample the genetic diversity among 39 accessions at 14 loci of ~500 bp each in a region spanning 170 kb on chromosome 5. The region was chosen because a polymorphism for production of defensive metabolites maps within this interval (KROYMANN et al. 2001 Down).

In this study we examine the mode of recombination in this region. Depending on the way the Holliday Junction is resolved, recombination may result either in reciprocal recombination or in gene conversion (Fig 1). Reciprocal recombination affects a series of homologous loci downstream of the recombination break point. Gene conversion, on the other hand, leads to the alteration of single segments only. Therefore, recombination causes the decay of linkage disequilibrium with distance, while no such effect results from gene conversion if conversion tracts are short (WIEHE et al. 2000 Down). We have applied this idea to our data and discovered that a substantial input from gene conversion is likely.



View larger version (8K):
In this window
In a new window
Download PPT slide
 
Figure 1. The effects of reciprocal recombination (R) and gene conversion (C) on the distribution of genetic material.


*  MATERIALS AND METHODS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Plant material and DNA sequencing:
The 39 accessions used in this study are listed in Fig 3. Primers were designed using the published sequence of the accession Columbia and the software PRIMER3 (ROZEN and SKALETSKY 1998 Down). Total DNA was extracted from leaves of single plants, amplified, and sequenced directly on both strands. All primer pairs are shown in Table 1.



View larger version (17K):
In this window
In a new window
Download PPT slide
 
Figure 2. Silent genetic diversity, {pi}s (•), including 95% confidence intervals, and Tajima's D (*) across the 170-kb region studied. Tajima's D was significant at locus 6 (P = 0.01), perhaps indicating balancing selection. MRN17, T2007, and MYJ24 designate bacterial P1 clones of genomic A. thaliana DNA used in the Arabidopsis genome sequencing project (ARABIDOPSIS GENOME INITIATIVE 2000); MAM1 encodes a methylthioalkylmalate synthase involved in glucosinolate chain elongation (KROYMANN et al. 2001 Down); and MAML encodes a duplication of MAM1.



View larger version (63K):
In this window
In a new window
Download PPT slide
 
Figure 3. Prettyplot of all 170 polymorphic sites. Positions are indicated by numbers, which should be read top to bottom. At each site a dot indicates agreement with the nucleotide shown in the top row.


 
View this table:
In this window
In a new window

 
Table 1. Primers used to amplify the 14 loci investigated in this study

Data analysis: Alignment:
DNA sequences were aligned using PILEUP (GCG Wisconsin Package) and all computations were carried out after gap removal.

Nucleotide diversity:

Most loci in our sample contained coding as well as noncoding regions (Table 2). To compute the average number of silent substitutions between pairs of sequences ({pi}s) we included the third codon positions of the coding regions as well as the complete noncoding segment and applied

where n is the sample size, L the number of silent positions in the alignment, and mij denotes the number of mismatches between the ith and jth haplotype.


 
View this table:
In this window
In a new window

 
Table 2. Coding parts and annotations of the 14 loci investigated

Confidence intervals for {pi}s were estimated using the bootstrap procedure (EFRON 1979 Down) across taxa: Rows of the aligned data matrix were resampled with replacement and the average number of pairwise mismatches per nucleotide was recalculated 10,000 times. The resulting mismatch values were sorted, and the 2.5 and 97.5% quantiles were looked up in the sorted array.

Pairwise linkage disequilibrium:

We use the normalized linkage disequilibrium, D', to quantify pairwise linkage disequilibria (LEWONTIN 1964 Down). Consider two biallelic loci, and , and denote the probability of finding allele "1" at locus by p1 and at locus by q1. The frequencies of the possible four haplotypes are denoted P00, P10, P01, and P11. We define as the linkage disequilibrium. Then, the desired disequilibrium measure is

The correlation between D' and pairwise distance was tested using Mantel tests as described by MANLY 1994 Down(p. 72ff).

Now consider three loci, , , , where the distance between and is less than or equal to one-tenth of the distance between and . We then define the ratio

Its expectation is equal to 1 for gene conversion and <1 for reciprocal recombination (WIEHE et al. 2000 Down). The distribution of as determined either from real or simulated data is an indicator of the relative frequency of recombination and gene conversion. We further consider the sign of as a derived random variable:

and .

Simulating the distribution of :

We used a coalescent program distributed by HUDSON 2002 Down to generate gene samples. This program implements both reciprocal recombination as well as the model of gene conversion developed by WIUF and HEIN 2000 Down. The chosen input parameters corresponded to our data, that is, sample size = 39 and number of sites = 168,037, which is equal to the distance between the first and the last nucleotide in our data set. The mutation parameter {theta} was computed by scaling the value observed in our 14 loci, which cover only a fraction of the surveyed region, resulting in {theta} = 1320. The tract length for gene conversion was 300 bp, a number smaller than the mean conversion tract length of 352 bp found in Drosophila (HILLIKER et al. 1994 Down) and larger than the estimated tract length of 30 bp at the human leukocyte antigen (HLA) locus (PARHAM et al. 1995 Down; WIEHE et al. 2000 Down). From the simulated samples we excluded all mutations outside of our 14 loci and grouped the remaining polymorphisms into blocks, which (1) contained at least two polymorphisms (to exclude point mutations as a potential source of variation), (2) extended at most as far as the polymorphism pattern across the aligned haplotypes did not change, and (3) did not extend beyond locus boundaries.

An example of such a block is provided by positions 136,818–137,110 in our data set (Fig 3), which all belong to locus 10 and have an identical haplotype structure. Although the adjacent polymorphic position 138,531 also maintains the haplotype structure, it is not included in the block as it belongs to locus 11 (Fig 2). From triplets of such blocks of polymorphisms we computed as outlined above. Distances were defined as base pairs between block midpoints.

Multilocus disequilibrium:

Multilocus disequilibrium was investigated by treating each distinct sequence at the 14 loci as an allele and calculating the number of loci at which each pair of haplotypes differed. The observed variance of this "mismatch distribution," VD, was then compared to the variance expected under linkage equilibrium, Ve (BROWN et al. 1980 Down; HAUBOLD et al. 1998 Down). The ratio between these variances serves as a measure of the strength of multilocus association in the sample

where l is the number of loci and IsA is the standardized index of association (MAYNARD SMITH et al. 1993 Down; HUDSON 1994 Down). We used the software LIAN to calculate the IsA and to test its significance (HAUBOLD and HUDSON 2000 Down).


*  RESULTS
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Sequence data and genetic diversity:
A total of 39 accessions were sequenced at 14 loci distributed over a 170-kb region (Fig 2). After gap removal this amounted to 6321 nucleotides in 39 accessions. Of this data set, 170 sites distributed among 35 haplotypes were polymorphic (Fig 3). With the exception of one hypervariable position (46,750; Fig 3), all segregating sites had only two nucleotide states. In addition, there were three heterozygous positions in accession Kondara (37,062, 37,304, and 37,351; Fig 3). The hypervariable and heterozygous sites were removed from the computation of pairwise disequilibria.

Most of the loci were located within predicted genes and contained both protein-coding as well as noncoding parts (Table 2). However, locus 4 was entirely noncoding, while loci 3 and 10 consisted of coding sequence only. With the exception of loci 6 and 9, functions had been assigned to the investigated loci in the context of the Arabidopsis genome project (ARABIDOPSIS GENOME INITIATIVE 2000). These functions were diverse, ranging from putative alanyl-tRNA synthetase (locus 1) to histone (locus 3), peptidases (loci 7 and 14), and acetyl-CoA synthetase (locus 8; Table 2).

The genetic diversity varied by a factor of 30 between at locus 1 and at locus 3 across the region (Fig 2). To assess whether these diversity values were compatible with neutral equilibrium expectations, we investigated the frequency spectrum of the single-nucleotide polymorphisms using Tajima's D test statistic (TAJIMA 1989 Down). This test is based on the assumption that the data have not been subject to recombination. We explored this assumption by computing the minimum number of recombination events for each locus (Rm; HUDSON and KAPLAN 1985 Down). Only locus 7 showed evidence of a recombination event and with this background information we proceeded to calculate Tajima's D.

The only locus with a significant value of Tajima's D was locus 6 (D = 2.66, P = 0.01; Fig 2). Unfortunately, its function is unknown. Further, the signs of the test statistics showed no consistent pattern, with 9 out of the 14 loci having D < 0 and the rest D > 0 (Fig 2).

Multilocus linkage disequilibrium:
HANFSTINGL et al. 1994 Down hypothesized that recombination in A. thaliana was frequent enough to erode linkage disequilibrium between sites just 350 bp apart. Since all the loci investigated in our survey were >350 bp apart (Fig 2), we assessed the strength of association between these loci by calculating the standardized index of association, IsA, which is zero under linkage equilibrium (HUDSON 1994 Down). For our sample , a value significantly >0 (P < 10-4).

Phylogeny:
Given that there was strong linkage disequilibrium between the surveyed loci, we used the exploratory tool of statistical geometry to investigate the phylogeny of the genomic region (EIGEN et al. 1988 Down; MAYNARD SMITH 1989 Down). Statistical geometry proceeds by first generating a phylogeny on the basis of the parsimony criterion for each quartet of sequences in the sample. These phylogenies are averaged to generate the graph shown in Fig 4. Note that there are three ways in which this graph can be reduced to a conventional unrooted tree: Collapse dimensions X and Y of the central box, collapse dimensions Z and Y, or collapse dimensions Y and Z. In other words, a statistical geometry graph simultaneously represents the three unrooted trees that can be formed from four taxa. If no recombination has taken place, only one of these three possible trees should be supported by the data. High support for all three possible trees is indicated by a large central box.



View larger version (6K):
In this window
In a new window
Download PPT slide
 
Figure 4. Statistical geometry phylogeny for the combined nucleotide data. X, Y, and Z indicate the dimensions of the three-dimensional box from which the terminal branches (a–d) stick out. If the data were tree-like, the small Z-dimension as well as the larger Y-dimension would be zero. The bar indicates the number of substitutions per polymorphic site.

For our data the deviation from the ideal tree topology was considerable (Fig 4), and we did not attempt to further reconstruct the phylogenetic history of the region.

Disequilibrium as a function of distance:
Given that over a stretch of 170 kb the phylogeny of A. thaliana does not conform to a tree, reciprocal recombination or gene conversion has probably contributed considerably to the evolution of this species. Under reciprocal recombination the disequilibria between pairs of polymorphic sites are expected to fall off exponentially with distance. In contrast, gene conversion should generate no distance effect on disequilibria, if the average tract length is short.

We started our investigation of the relationship between distance and disequilibrium by grouping the single-nucleotide polymorphisms (SNPs) into 24 "blocks" as outlined in MATERIALS AND METHODS. Pairwise linkage disequilibria between these blocks were negatively correlated with distance (; Fig 5A). However, if only haplotype pairs with four alleles were included in the analysis, i.e., allele pairs where a recombination event could be detected, the negative correlation between distance and linkage disequilibrium turned positive (; Fig 5B). There is no neutral mechanism that results in a significant positive correlation between distance and disequilibrium. When we removed the one locus with a significant Tajima's D from the analysis (locus 6), the correlation between distance and disequilibrium vanished altogether (r = 0.07, P > 0.05; Fig 5C). This indicated that reciprocal recombination may not have been the primary mechanism for exchanging homologous DNA in the region.



View larger version (15K):
In this window
In a new window
Download PPT slide
 
Figure 5. Linkage disequilibrium as a function of distance. (A) All pairs of blocks of haplotypes included in the analysis. (B) Only those pairs of blocks are included where all four possible haplotypes were present, i.e., where a recombination event certainly has taken place. (C) Same as B, except that the nonneutral locus 6 was removed from the analysis.

To quantify the mode of recombination more directly, we applied a statistical test designed to distinguish between reciprocal recombination and gene conversion (WIEHE et al. 2000 Down).

Gene conversion vs. reciprocal recombination:
We carried out coalescent simulations with a recombination rate of one-tenth the rate of mutation, which appears to be a reasonable value given estimates in the literature (KUITTINEN and AGUADE 2000 Down) and the observation of genome-wide linkage disequilibrium (SHARBEL et al. 2000 Down). In our simulations we distributed this "effective" rate of recombination between reciprocal recombination and gene conversion. A graph of the mean value of sign , the sign of the random variable , as a function of the percentage of gene conversion returned a maximum-likelihood estimate of 90% conversion (Fig 6).



View larger version (18K):
In this window
In a new window
Download PPT slide
 
Figure 6. The mean of the test statistic sign as a function of the extent of gene conversion. Plotted are mean values (•) and 50% confidence intervals ({blacksquare}). All blocks of polymorphisms were included in the analysis and the corresponding maximum-likelihood estimate (MLE) for gene conversion is ~90%. See MATERIALS AND METHODS for details on the test statistic .


*  DISCUSSION
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

Completely asexual reproduction halves the rate of adaptation compared to panmixis and is therefore usually regarded as a rare exception, if it exists at all (FISHER 1930 Down/1999, p. 123). This may appear surprising, given the large number of selfing plant species and other asexual organisms, including bacteria. However, in most selfing plants inbreeding is not complete and even the existence of purely clonal bacterial populations has been doubted (FEIL et al. 2001 Down). Different accessions of A. thaliana can be crossed in the laboratory, which forms the basis of the large amount of classical mapping work carried out using this organism. However, in the wild A. thaliana is a selfer with a very low outcrossing rate of 0.3% (ABBOT and GOMES 1989 Down). Recent studies of this plant's molecular population genetics suggested that in spite of its selfing habit, it underwent recombination rather frequently (KUITTINEN and AGUADE 2000 Down), leading to a decay of linkage disequilibrium in worldwide samples over ~250 kb (NORDBORG et al. 2002 Down). In this study we contribute to the clarification of the apparent contradiction between selfing and the molecular data.

Nucleotide polymorphism:
The genetic diversity in the MAM region is highly variable (Fig 2). In 13 out of the 14 cases the polymorphisms do not contradict neutral expectations. The one exception (locus 6, Fig 2) is currently annotated as a gene of unknown function. Every new genome that is sequenced reveals a large number of predicted genes to which no function can be assigned. Given that sequencing is usually easier than elucidating a gene's function, comparative sequencing combined with tests of neutrality might point to those genes whose products are most relevant to an organism's biology.

Multilocus disequilibrium:
If multiple loci have been investigated, linkage disequilibrium can be assessed either by performing pairwise tests or by calculating the overall linkage disequilibrium. Pairwise tests are difficult to interpret, as they are not independent from each other. The test based on the mismatch distribution used in this study does not suffer from this uncertainty about its interpretation (HAUBOLD et al. 1998 Down). Moreover, it leads to the discovery of strong linkage disequilibrium not only in our data set, but also in a set of genome-wide AFLP markers (SHARBEL et al. 2000 Down). A lack of genome-wide linkage disequilibrium as suggested by MIYASHITA et al. 1999 Down would be hard to reconcile with the selfing habit of A. thaliana and previous findings in other selfing plant species (BROWN et al. 1980 Down).

Phylogeny:
The average phylogeny differed from an ideal tree topology, which indicated that there was substantial recombination in the region (Fig 4). This difference becomes more pronounced if the genome-wide AFLP data published by SHARBEL et al. 2000 Down is subjected to statistical geometry (Fig 7). This is not surprising, since disequilibrium decreases exponentially with distance. However, even in this situation the loci display significant genome-wide linkage disequilibrium (SHARBEL et al. 2000 Down). Having rejected the two extreme hypotheses of no recombination and of linkage equilibrium, we were interested in investigating the rate and mode of recombination.



View larger version (7K):
In this window
In a new window
Download PPT slide
 
Figure 7. Statistical geometry phylogeny for the 87 AFLP loci in 115 accessions in A. thaliana published by SHARBEL et al. 2000 Down. X, Y, and Z indicate the dimensions of the three-dimensional box from which the terminal branches (a–d) stick out. If the data were tree-like, the small Z-dimension as well as the larger Y-dimension would be zero. The bar indicates the number of substitutions per locus.

Disequilibrium as a function of distance:
It is clear that linkage disequilibrium should reflect genetic distance rather than physical distance. However, genetic positions are rather unreliable over short distances and hence we have used physical positions as a substitute (NORDBORG et al. 2002 Down).

In the MAM region linkage disequilibrium apparently decreases with distance (Fig 5A). However, a positive correlation with distance was observed when we analyzed only pairs of blocks displaying all four possible haplotypes (Fig 5B). The puzzle of finding a positive correlation was resolved when we removed the one locus with significant evidence for selection from the sample. The resulting data set showed no correlation between distance and disequilibrium (Fig 5C). It is clear that three haplotypes can be generated by mutation alone, while four haplotypes between two markers must be the result of recombination, assuming no recurrent mutation. Hence, Fig 5C shows a sample that has certainly been shaped by recombination, while in Fig 5A the pairs of positions may or may not have been affected by recombination. Nevertheless, under neutrality and reciprocal recombination the two samples should yield a similar decay of linkage disequilibrium with distance. This suggests that gene conversion has shaped the distribution of polymorphisms in this region.

Mode of recombination:
Gene conversion has been at the center of recent empirical and theoretical population genetic studies. LANGLEY et al. 2001 Down investigated the extent of linkage disequilibrium in the su(s) and su(wa) loci on the Drosophila melanogaster X chromosome that are located in a region of reduced crossing over. In spite of low reciprocal recombination, the authors observed a similar genomic scale of linkage disequilibrium at the su(s) and su(wa) loci as found in regions with normal rates of crossing over. This suggests that gene conversion is high in this region (LANGLEY et al. 2001 Down).

WIUF and HEIN 2000 Down have introduced gene conversion into coalescent models. These authors noted that there was no statistic available to assess the relative extent of recombination and gene conversion. Such a statistic, , has been provided by WIEHE et al. 2000 Down and we carried out coalescent simulations to explore the utility of when applied to a data set such as ours.

These simulations were based on the assumption of neutrality, which may not apply throughout the region, especially at locus 6 (Fig 2). Removal of this locus from the plot of linkage disequilibrium as a function of distance resulted in zero correlation between the two variables (Fig 5C), which would be expected with high gene conversion rates.

We show that with an effective recombination rate of one-tenth the rate of mutation and a 90% gene conversion rate the experimental data can be explained quite adequately (Fig 6). However, this calculation should be treated with caution, as the distribution of has a large variance and is far from normal. Nevertheless, this study demonstrates that there is no need to invoke a high rate of recombination to account for the experimental data; is sufficient. There is no contradiction between knowing that as a selfer A. thaliana must have a low rate of recombination and observing recombination events at the molecular level.


*  FOOTNOTES

Sequence data from this article have been deposited with the EMBL/GenBank Data Libraries under accession nos. AF471728–AF472273. Back
2 Present address: Lion Bioscience AG, Waldhofer Str. 98, D-69123 Heidelberg, Germany. Back
3 Present address: Freie Universität Berlin und Berliner Centrum für genombasierte Bioinformatik, Arnimallee 22, 14195 Berlin, Germany. Back


*  ACKNOWLEDGMENTS

We thank Richard Hudson for providing his gene conversion simulation code and comments. Thanks are also due to U. Priedemuth for help with data handling. This work was supported by the Max-Planck-Gesellschaft; the Bundesministerium für Forschung, Germany, BMBF grant 0312705A to T.W.; the U.S. National Science Foundation (grant DEB-9527725); and the European Union.

Manuscript received September 5, 2001; Accepted for publication April 22, 2002.


*  LITERATURE CITED
*TOP
*ABSTRACT
*MATERIALS AND METHODS
*RESULTS
*DISCUSSION
*LITERATURE CITED

ABBOT, R. J. and M. F. GOMES, 1989  Population genetic structure and outcrossing rate of Arabidopsis thaliana (L.) Heynh. Heredity 62:411-418.

Analysis of the genome sequence of the flowering plant Arabidopsis thaliana.. (2000) Nature 408:796-815.[Medline]

BROWN, A. H. D., D. ZOHARY, and E. NEVO, 1978  Outcrossing rates and heterozygosity in natural populations of Hordeum spontaneum Koch in Israel. Heredity 41:49-62.

BROWN, A. H. D., M. W. FELDMAN, and E. NEVO, 1980  Multilocus structure of natural populations of Hordeum spontaneum. Genetics 96:523-536.[Abstract/Free Full Text]

CHARLESWORTH, D., B. CHARLESWORTH, and G. A. T. MCVEAN, 2001  Genome sequences and evolutionary biology, a two-way interaction. Trends Ecol. Evol. 16:235-242.[Medline]

EFRON, B., 1979  Bootstrap methods: another look at the Jackknife. Ann. Stat. 7:1-26.

EIGEN, M., R. WINKLER-OSWATITSCH, and A. DRESS, 1988  Statistical geometry in sequence space: a method of quantitative comparative sequence analysis. Proc. Natl. Acad. Sci. USA 85:5913-5917.[Abstract/Free Full Text]

FEIL, E. J., E. C. HOLMES, D. E. BESSEN, M.-S. CHAN, and N. P. DAY et al., 2001  Recombination within natural populations of pathogenic bacteria: short-term empirical estimates and long-term phylogenetic consequences. Proc. Natl. Acad. Sci. USA 98:182-187.[Abstract/Free Full Text]

FISHER, R. A., 1930/1999 The Genetical Theory of Natural Selection, variorum edition. Oxford University Press, Oxford.

HANFSTINGL, U., A. BERRY, E. A. KELLOGG, I. T. COSTA, III, and W. RÜDIGER et al., 1994  Haplotypic divergence coupled with lack of diversity at the Arabidopsis thaliana alcohol dehydrogenase locus: roles for both balancing and directional selection? Genetics 138:811-828.[Abstract]

HAUBOLD, B. and R. R. HUDSON, 2000  Lian 3.0: detecting linkage disequilibrium in multilocus data. Bioinformatics 16:847-848.[Abstract/Free Full Text]

HAUBOLD, B., M. TRAVISANO, P. B. RAINEY, and R. R. HUDSON, 1998  Detecting linkage disequilibrium in bacterial populations. Genetics 150:1341-1348.[Abstract/Free Full Text]

HILLIKER, A. J., G. HARAUZ, M. GRAY, S. H. CLARK, and A. CHOVNICK, 1994  Meiotic gene conversion tract length distribution within the rosy locus of Drosophila melanogaster. Genetics 137:1019-1026.[Abstract]

HUDSON, R. R., 1994  Analytical results concerning linkage disequilibrium in models with genetic transformation and conjugation. J. Evol. Biol. 7:535-548.

HUDSON, R. R., 2002  Generating samples under a Wright-Fisher neutral model of genetic variation. Bioinformatics 18:337-338.[Abstract/Free Full Text]

HUDSON, R. R. and N. L. KAPLAN, 1985  Statistical properties of the number of recombination events in the history of a sample of DNA sequences. Genetics 111:147-164.[Abstract/Free Full Text]

KAWABE, A., H. INNAN, R. TERAUCHI, and T. MIYASHITA, 1997  Nucleotide polymorphism in the acidic chitinase locus (ChiA) region of the wild plant Arabidopsis thaliana. Mol. Biol. Evol. 14:1303-1315.[Abstract]

KINGMAN, J. F. C., 1982a  The coalescent. Stoch. Proc. Appl. 13:235-248.

KINGMAN, J. F. C., 1982b  On the genealogy of large populations. J. Appl. Prob. 19A:27-43.

KROYMANN, J., S. TEXTOR, J. TOKUHISA, K. FALK, and S. BARTRAM et al., 2001  A gene controlling variation in Arabidopsis thaliana glucosinolate composition is part of the methionine chain elongation pathway. Plant Physiol. 127:1077-1088.[Abstract/Free Full Text]

KUITTINEN, H. and M. AGUADÉ, 2000  Nucleotide variation at the CHALCONE ISOMERASE locus in Arabidopsis thaliana.. Genetics 155:863-872.[Abstract/Free Full Text]

LANGLEY, C. H., B. P. LAZZARO, W. PHILLIPS, E. HEIKKINEN, and J. M. BRAVERMAN, 2001  Linkage disequilibria and the site frequency spectra in the su(s) and su(wa) regions of the Drosophila melanogaster X chromosome. Genetics 156:1837-1852.[Abstract/Free Full Text]

LEWONTIN, R. C., 1964  The interaction of selection and linkage. I. General considerations; heterotic models. Genetics 49:49-67.[Free Full Text]

MANLY, B. F. J., 1994 Multivariate Statistical Methods: A Primer, Ed. 2. Chapman & Hall, London.

MAYNARD SMITH, J., 1989  Trees, bundles or nets? Trends Ecol. Evol. 4:302-304.

MAYNARD SMITH, J., N. H. SMITH, C. G. DOWSON, and B. G. SPRATT, 1993  How clonal are bacteria? Proc. Natl. Acad. Sci. USA 90:4384-4388.[Abstract/Free Full Text]

MIYASHITA, N. T., A. KAWABE, and H. INNAN, 1999  DNA variation in the wild plant Arabidopsis thaliana revealed by amplified fragment length polymorphism analysis. Genetics 152:1723-1731.[Abstract/Free Full Text]

NORDBORG, M., J. O. BOREVITZ, J. BERGELSON, C. C. BERRY, and J. CHORY et al., 2002  The extent of linkage disequilibrium in Arabidopsis thaliana.. Nat. Genet. 30:190-193.[Medline]

PARHAM, P., E. J. ADAMS, and K. L. ARNETT, 1995  The origins of HLA-A,B,C polymorphism. Immunol. Rev. 143:141-180.[Medline]

PURUGGANAN, M. D. and J. I. SUDDITH, 1998  Molecular population genetics of the Arabidopsis CAULIFLOWER regulatory gene: non-neutral evolution and naturally occurring variation in floral homeotic function. Proc. Natl. Acad. Sci. USA 95:8130-8134.[Abstract/Free Full Text]

PURUGGANAN, M. D. and J. I. SUDDITH, 1999  Molecular population genetics of floral homeotic loci: departures from the equilibrium-neutral model at the APETALA3 and PISTILLA genes of Arabidopsis thaliana. Genetics 151:839-848.[Abstract/Free Full Text]

ROZEN, S., and H. SKALETSKY, 1998 Primer3. Code available at http://www-genome.wi.mit.edu/genome_software/other/primer3.html.

SAVOLAINEN, O., C. H. LANGLEY, B. P. LAZZARO, and H. FRÉVILLE, 2000  Contrasting patterns of nucleotide polymorphism at the alcohol dehydrogenase locus in the outcrossing Arabidopsis lyrata and the selfing Arabidopsis thaliana.. Mol. Biol. Evol. 17:645-655.[Abstract/Free Full Text]

SHARBEL, T. F., B. HAUBOLD, and T. MITCHELL-OLDS, 2000  Genetic isolation by distance in Arabidopsis thaliana: biogeography and postglacial colonization of Europe. Mol. Ecol. 9:2109-2118.[Medline]

STAHL, E. A., G. DWYER, R. MAURICIO, M. KREITMAN, and J. BERGELSON, 1999  Dynamics of disease polymorphism at the Rpm1 locus of Arabidopsis.. Nature 400:667-671.[Medline]

TAJIMA, F., 1989  Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 123:585-595.[Abstract/Free Full Text]

WIEHE, T., J. MOUNTAIN, P. PARHAM, and M. SLATKIN, 2000  Distinguishing recombination and intragenic gene conversion by linkage disequilibrium patterns. Genet. Res. 75:61-73.[Medline]

WIUF, C. and J. HEIN, 2000  The coalescent with gene conversion. Genetics 155:451-462.[Abstract/Free Full Text]




This article has been cited by other articles:


Home page
GeneticsHome page
A. Barriere and M.-A. Felix
Temporal Dynamics and Linkage Disequilibrium in Natural Caenorhabditis elegans Populations
Genetics, June 1, 2007; 176(2): 999 - 1011.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
P. L. Morrell, D. M. Toleno, K. E. Lundy, and M. T. Clegg
Estimating the Contribution of Mutation, Recombination and Gene Conversion in the Generation of Haplotypic Diversity
Genetics, July 1, 2006; 173(3): 1705 - 1723.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
V. Plagnol, B. Padhukasahasram, J. D. Wall, P. Marjoram, and M. Nordborg
Relative Influences of Crossing Over and Gene Conversion on the Pattern of Linkage Disequilibrium in Arabidopsis thaliana
Genetics, April 1, 2006; 172(4): 2441 - 2448.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
K. S. Caldwell, J. Russell, P. Langridge, and W. Powell
Extreme Population-Dependent Linkage Disequilibrium Detected in an Inbreeding Plant Species, Hordeum vulgare
Genetics, January 1, 2006; 172(1): 557 - 567.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
M. T. Hamblin, M. G. Salas Fernandez, A. M. Casa, S. E. Mitchell, A. H. Paterson, and S. Kresovich
Equilibrium Processes Cannot Explain High Levels of Short- and Medium-Range Linkage Disequilibrium in the Domesticated Grass Sorghum bicolor
Genetics, November 1, 2005; 171(3): 1247 - 1256.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
D. Benovoy, R. T. Morris, A. Morin, and G. Drouin
Ectopic Gene Conversions Increase the G + C Content of Duplicated Yeast and Arabidopsis Genes
Mol. Biol. Evol., September 1, 2005; 22(9): 1865 - 1868.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
K. J. Schmid, S. Ramos-Onsins, H. Ringys-Beckstein, B. Weisshaar, and T. Mitchell-Olds
A Multilocus Sequence Survey in Arabidopsis thaliana Reveals a Genome-Wide Departure From a Neutral Model of DNA Sequence Polymorphism
Genetics, March 1, 2005; 169(3): 1601 - 1615.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
S. I. Wright and B. S. Gaut
Molecular Population Genetics and the Search for Adaptive Evolution in Plants
Mol. Biol. Evol., March 1, 2005; 22(3): 506 - 519.
[Abstract] [Full Text] [PDF]


Home page
Plant CellHome page
H. Kuang, S.-S. Woo, B. C. Meyers, E. Nevo, and R. W. Michelmore
Multiple Genetic Processes Result in Heterogeneous Rates of Evolution within the Major Cluster Disease Resistance Genes in Lettuce
PLANT CELL, November 1, 2004; 16(11): 2870 - 2894.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
J. D. Wall
Estimating Recombination Rates Using Three-Site Likelihoods
Genetics, July 1, 2004; 167(3): 1461 - 1473.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
P. Wenzl, J. Carling, D. Kudrna, D. Jaccoud, E. Huttner, A. Kleinhofs, and A. Kilian
Diversity Arrays Technology (DArT) for whole-genome profiling of barley
PNAS, June 29, 2004; 101(26): 9915 - 9920.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
M. J. Clauss and T. Mitchell-Olds
Functional Divergence in Tandemly Duplicated Arabidopsis thaliana Trypsin Inhibitor Genes
Genetics, March 1, 2004; 166(3): 1419 - 1436.
[Abstract] [Full Text] [PDF]


Home page
Plant CellHome page
J. Molinier, G. Ries, S. Bonhoeffer, and B. Hohn
Interchromatid and Interhomolog Recombination in Arabidopsis thaliana
PLANT CELL, February 1, 2004; 16(2): 342 - 352.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
J. Kroymann, S. Donnerhacke, D. Schnabelrauch, and T. Mitchell-Olds
Evolutionary dynamics of an Arabidopsis insect resistance quantitative trait locus
PNAS, November 25, 2003; 100(suppl_2): 14587 - 14592.
[Abstract] [Full Text]


Home page
Genome ResHome page
K. J. Schmid, T. R. Sorensen, R. Stracke, O. Torjek, T. Altmann, T. Mitchell-Olds, and B. Weisshaar
Large-Scale Identification and Analysis of Genome-Wide Single-Nucleotide Polymorphisms for Mapping in Arabidopsis thaliana
Genome Res., June 1, 2003; 13(6): 1250 - 1257.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
L. Huang, S. A. Brooks, W. Li, J. P. Fellers, H. N. Trick, and B. S. Gill
Map-Based Cloning of Leaf Rust Resistance Gene Lr21 From the Large and Polyploid Genome of Bread Wheat
Genetics, June 1, 2003; 164(2): 655 - 664.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
J. O. Borevitz and M. Nordborg
The Impact of Genomics on the Study of Natural Variation in Arabidopsis
Plant Physiology, June 1, 2003; 132(2): 718 - 725.
[Full Text] [PDF]


Home page
GeneticsHome page
K. A. Shepard and M. D. Purugganan
Molecular Population Genetics of the Arabidopsis CLAVATA2 Region: The Genomic Scale of Variation and Selection in a Selfing Species
Genetics, March 1, 2003; 163(3): 1083 - 1095.
[Abstract] [Full Text] [PDF]