SEVERAL predictions of the genomic pattern of nucleotide polymorphism include the local recombination rate, r, as a parameter. In typical population genetic models, the parameter r, for two specified loci (or sites), is properly defined as the probability that a randomly selected gamete produced by a double heterozygote is a recombinant. In a given region, r can be measured directly through genetic crosses. This is typically done by observing markers spaced several centimorgans apart, in which case recombinants are produced almost exclusively by classical crossing-over. Over small intervals it may be assumed that crossing-over events are equally likely to occur at any point between two markers, and the probability of observing more than one can be neglected. Thus r, to a reasonable approximation, increases linearly over short physical distances. The problem we wish to draw attention to is that this model is often extrapolated to distances far too small to ignore the added effect of gene conversion events on the overall probability of producing a recombinant.
In fungi and Drosophila, a common feature of models of homologous recombination is that Holliday junctions are resolved either as gene conversion alone or as gene conversion with the accompanying exchange of flanking markers (Carpenter 1984). Thus some fraction of genetic exchanges will involve the transfer of short tracts of information from one gamete to another (i.e., gene conversion) without concurrent crossing-over. For clarity, we will refer to gene conversion without crossing-over as “gene conversion,” and gene conversion accompanied by crossing-over as “crossing-over.” Here we show that, at intragenic distances, gene conversion, rather than crossing-over, is likely to be the dominant force that breaks up associations among sites. We discuss implications for population genetic predictions for the behaviour of neutral sites closely linked to a site under balancing or directional selection.
A crude model: It is intuitive that gene conversion should increase the rate of exchange for closely linked sites, but should have negligible effects for more distant sites. Assume that over very short intervals, the probability of crossing-over increases linearly with d, the physical distance in base pairs between two sites. Thus the probability of producing a recombinant by crossing-over equals ρd, where ρ is the probability of a crossing-over event between two given sites per base pair per generation. We suggest that the following crude model describes the added effect of gene conversion events on the rate of exchange between two sites. Define γ as the probability per generation that a given site is included in a gene conversion tract of length L base pairs, where L is taken to be fixed. The probability per generation that a gene conversion tract will produce a recombinant for two markers d base pairs apart will then be roughly 2γd/L, when d < L. Thus, incorporating both crossing-over and gene conversion, r depends on d as follows:
Some implications: Several attempts have recently been made to interpret estimated levels of linkage disequilibrium between polymorphic sites at the intragenic level in the context of competing population genetic models (e.g., Schaeffer and Miller 1993; Miyashitaet al. 1993; Begun and Aquadro 1995). Ohta and Kimura (1971) showed that in a neutral finite population at equilibrium,
The expected pairwise coalescence time, E(T2), as a function of physical distance for neutral sites linked to a balanced polymorphism (Equation 3) under various models of recombination: r = ρd for crossing-over only, r = 2γd/L for gene conversion only and see Equation 1 for crossing-over + gene conversion. The parameters used are N = 106, ρ = 10−8, γ = 10−5 and L = 350.
Another implication concerns our ability to detect balancing selection. If some form of balancing selection acts on a given polymorphism for a sufficiently long time, a build-up of differences between allelic classes, centered on the selected site, is expected. Consider the expected pairwise coalescence time for a neutral site linked to a two-allele polymorphism with both alleles maintained at equal frequencies, given by
Strong signals of balancing selection have been observed at MHC loci in humans (Hughes and Nei 1988) and at plant self-incompatibility loci (see Charlesworth and Guttman 1997 for a review). In D. melanogaster, a few loci with polymorphisms thought to be maintained by selection have been investigated, but so far, only the Adh locus shows significant evidence for balancing selection (Begun and Aquadro 1994; Eaneset al. 1996; Hudsonet al. 1987; Hudson and Kaplan 1988; Kreitman and Hudson 1991). This has led to speculation that either balancing selection is not a prevalent force maintaining variation or that it only rarely acts long enough for the equilibrium pattern predicted by Equation 3 to develop (Kreitman and Akashi 1995; Hudson 1996). In light of Figure 1, we suggest an alternative explanation: Perhaps the rate of gene conversion in Drosophila is simply too high, given the population size, for this phenomenon to be readily observed. The striking signal seen at Adh remains an enigma as it is larger than expected even under a model that does not include gene conversion (Hudson and Kaplan 1988). Possible explanations include past geographic subdivision or the association of Adh alleles with the common polymorphic rearrangement, In(2L)t .
In conclusion, methods for detecting balancing selection and positive directional selection that rely on Nr being sufficiently small may not have much power in organisms such as Drosophila. An exception may be genomic regions where gene conversion is likely to be suppressed, such as near an inversion breakpoint (see, for instance, Hasson and Eanes 1996). As Hudson and Kaplan (1988) point out, picking an organism with smaller N may not help, because such organisms are expected to harbor less polymorphism, which reduces our power to detect selection. A better solution may be to study organisms with lower r, such as partially selfing organisms, where the effective rate of exchange is reduced because most individuals are homozygous (Nordborget al. 1996).
Acknowledgments
We thank Joy Bergelson, Marty Kreitman, Molly Przeworski, and Jeff Wall for helpful discussions. Comments from Brian Charlesworth, Richard Hudson, Steven Orzack, Baharé Rashidi, and an anonymous reviewer significantly improved this manuscript. P.A. holds a Postgraduate Scholarship from the National Science and Engineering Research Council of Canada.
Footnotes
-
Communicating editor: R. R. Hudson
- Received July 12, 1997.
- Accepted November 24, 1997.
- Copyright © 1998 by the Genetics Society of America