- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Email this article to a friend
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Navarro, A.
- Articles by Barton, N. H.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Navarro, A.
- Articles by Barton, N. H.
The Effects of Multilocus Balancing Selection on Neutral Variability
Arcadio Navarroa and Nick H. Bartonaa Institute of Cell, Animal and Population Biology, University of Edinburgh, Edinburgh EH9 3JT, United Kingdom
Corresponding author: Arcadio Navarro, Animal and Population Biology, University of Edinburgh, W. Mains Rd., Edinburgh EH9 3JT, Scotland., arcadi{at}holyrood.ed.ac.uk (E-mail)
Communicating editor: N. TAKAHATA
| ABSTRACT |
|---|
We studied the effect of multilocus balancing selection on neutral nucleotide variability at linked sites by simulating a model where diallelic polymorphisms are maintained at an arbitrary number of selected loci by means of symmetric overdominance. Different combinations of alleles define different genetic backgrounds that subdivide the population and strongly affect variability. Several multilocus fitness regimes with different degrees of epistasis and gametic disequilibrium are allowed. Analytical results based on a multilocus extension of the structured coalescent predict that the expected linked neutral diversity increases exponentially with the number of selected loci and can become extremely large. Our simulation results show that although variability increases with the number of genetic backgrounds that are maintained in the population, it is reduced by random fluctuations in the frequencies of those backgrounds and does not reach high levels even in very large populations. We also show that previous results on balancing selection in single-locus systems do not extend to the multilocus scenario in a straightforward way. Different patterns of linkage disequilibrium and of the frequency spectrum of neutral mutations are expected under different degrees of epistasis. Interestingly, the power to detect balancing selection using deviations from a neutral distribution of allele frequencies seems to be diminished under the fitness regime that leads to the largest increase of variability over the neutral case. This and other results are discussed in the light of data from the Mhc.
MORE than 50 complete genomes are currently accessible on public databases. Within the next few years, this quantity is expected to increase by at least an order of magnitude (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Both single-locus and multilocus results rely on the similarity between a genetically subdivided population and a spatially subdivided one. In the same way that natural populations can be spatially structured into local demes, they can also be genetically structured into diverse "genetic backgrounds." Genetic backgrounds are defined as combinations of variants from different selected sites in a chromosome. Thus, they can be thought of as, for example, haplotypes, defined by different combinations of selected alleles from different genes, or as alleles, defined by combinations of variants from different selected sites in a gene. The genetic structure produced by selected backgrounds influences diversity at neutral loci in an analogous way to spatial structure. Just as with fluctuating deme sizes (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
The main purpose of this work is to study the levels and patterns of neutral variability to be expected under different multilocus balancing selection regimes. Because epistasis is a key component of any multilocus system, we hope to suggest ways in which the study of neutral variability may allow us to distinguish between different kinds of interactions among selected loci. To do this, we have simulated the effects on linked neutral variability of balancing selection acting on a set of diallelic loci. Simulation results are compared with predictions obtained by the method proposed by ![]()
![]()
| METHODS OF COMPUTER SIMULATION |
|---|
We simulate a Wright-Fisher diploid population of size N whose life cycle consists of drift, selection, and recombination. In every run, we consider chromosomes formed by a number of selected loci, each segregating for two alleles, and a neutral locus lying among them. Genetic backgrounds are defined by combinations of alleles at different loci, so n loci produce 2n potential backgrounds. Selected loci are assumed to be spaced at equal intervals along the genetic map, the recombination rate between any two adjacent selected loci being r (Fig 1). The neutral locus can be in any position, either at an extreme (Fig 1A) or within the set of selected loci (Fig 1B). It recombines with the selected loci, but there is no intragenic recombination. Mutations are generated at the neutral locus with rate µ immediately after recombination and according to the infinite sites model; i.e., each mutation occurs at a new site. Also, it is important to stress that, all along this work, we use terms such as "locus" and "alleles" for the sake of simplicity. A genetic background is an abstract entity and can also be defined by combinations of variants within a gene, a single exon, or even a noncoding sequence.
|
The coalescent approach proposed by ![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
To investigate these scenarios, we used two different fitness functions in our simulations. First, possible interactions among loci were studied in a series of runs where fitnesses were computed according to the n-locus symmetric viability model. For simplicity, we assumed that all loci contribute equally to fitness and that selection acts only on the proportion of heterozygous loci of an individual. The fitness of an individual is given by
![]() |
(1) |
where h is the proportion of heterozygous loci in a given individual (0
h
1) and
is the strength of selection. Epistasis enters the function by means of k, a parameter that allows for different selective regimes. Fig 2 shows three different selection schemes and two different selection strengths. If k = 1 (Fig 2A, straight lines), there is additive selection on heterozygosity. Selection acts on each locus individually and tends to maximize average heterozygosity. Linkage disequilibrium becomes important with epistasis, because the average fitness of the population will then depend on its variance in heterozygosity (![]()
![]()
0. In other words, selection favors maximum linkage disequilibrium, either positive or negative, because in that case the population tends to be formed by individuals that are either completely heterozygous or completely homozygous. With k < 1 (Fig 2, convex lines), there is negative epistasis, so selection favors decreased variance in heterozygosity and D = 0 (![]()
|
Also, to study the case of independent selective effects of each locus we ran a series of simulations in which fitnesses were multiplicative across loci (with heterozygotes having a fitness of 1 and homozygotes of 1 - s). The fitness of an individual was given by
![]() |
(2) |
where h is again the proportion of heterozygous loci and i is the total number of loci. A multiplicative fitness scheme favors D
0 (![]()
![]()
0) it converges to additive selection on heterozygosity.
For every set of parameters, an initial population was randomly generated assuming equal allele frequencies and D = 0. The population was run to drift-selection equilibrium and then several diversity measures were taken for the neutral locus. The probability of identity between two randomly chosen alleles, f, was computed for the whole population. ![]()
![]() |
(3) |
![]() |
(4) |
(![]()
is the neutral evolution parameter and n is the sample size. The mean number of nucleotide differences is proportional to the pairwise coalescence time and identities can be used as the moment-generating function of coalescence times (![]()
![]()
Single-locus balancing selection has been shown to increase the number of heterozygotes, that is, to decrease f (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]() |
(5) |
where
![]() |
(6) |
Positive values of Tajima's D reflect a deficiency of homozygotes and are usually associated with stable population subdivision, either spatial or genetical (![]()
![]()
Every variability measure value plotted in the figures is the average of 10 runs. The program was tested by using it to compute some well-known population genetics quantities, such as expected times for the fixation or loss of a neutral or a selected allele or patterns of decay of gametic disequilibrium. The results obtained agreed with the literature.
| RESULTS |
|---|
Identities:
The coalescent approach developed by ![]()
![]()
![]()
Fig 3 and Fig 4 show the way in which the probability of identity changes at a neutral locus located at the extreme of a set of selected loci, as the number of selected loci increases, for different selective regimes and population sizes. Analytical predictions obtained from Equations 30 and 33 in ![]()
|
|
When k = 1 (additive selection, Fig 3C and Fig D) or k > 1 (positive epistasis, Fig 3E), multilocus balancing selection fails to boost variability beyond the effect of a single diallelic locus. A similar divergence between multilocus analytical predictions and simulation results is registered if fitnesses are multiplicative across loci (compare Fig 4, a and c, with low recombination). If selection is additive (k = 1) this discrepancy is due to drift. The number of possible backgrounds increases exponentially with the number of loci and, as discussed by ![]()
D = 0. Therefore, unless N is very large and/or selection is strong enough to make drift negligible, additive selection fails to maintain a large number of backgrounds at stable frequencies. In a finite population, drift generates random associations among selected loci and, hence, the assumptions of the extended coalescent of ![]()
|
When epistasis is positive (k > 1, Fig 3E) or fitnesses are multiplicative (Fig 4C), there is also a discrepancy between simulated and theoretically predicted results, but this is only apparent. As we have already mentioned, under such fitness schemes selection favors linkage disequilibrium among selected loci, so that the equilibrium population is dominated by two complementary genotypes (![]()
![]()
Identities and recombination:
The results in Fig 4 suggest that, independently of the number of loci and the selection regime, neutral variability does not increase for markers at an extreme of the set of selected loci when Nr > 1 (r > 10-3 in the figure) between the neutral marker and the closest selected locus. The effect of recombination is further studied in Fig 6 Fig 7 Fig 8. Fig 6 plots the identities for a neutral locus lying at the extreme of a group of selected loci (either two or seven) under negative epistasis and for different recombination values. As expected, simulated and predicted results diverge when the number of loci is large, but, independently of this discrepancy, the effect of the set of selected loci on neutral variability dissipates when recombination between them and the neutral locus is >1/N (Nr > 1). The same threshold is found for other population sizes (not shown).
|
|
|
In all the results presented up to now, the neutral locus lies at an extreme of the map. We now study the chromosomal regions between selected loci. Fig 7 and Fig 8 show theoretically predicted and simulated identities for a neutral locus at different relative positions between two selected loci. Predicted identities for the single-selected locus case are also shown in Fig 7. Selected loci are at positions 0 and 1 and the recombination fraction between them can change. With negative epistasis and intermediate or low recombination (r
10-3, Fig 7A), maximum neutral variability (which is always found in regions closely linked to the selected loci) is increased beyond the one-locus (two backgrounds) limit, because the population is more subdivided with two loci (four backgrounds) than with a single one. Variability is increased in a wider region of the chromosome than expected for a single locus alone. The reason is simple: a higher level of variability is attained and it decays over a longer distance. Still, high recombination rates (r > 10-3) preclude any relevant multilocus effect and the predicted values are almost identical for one as for two selected loci (Fig 7B).
Results for multiplicative fitnesses are shown in Fig 8. Results for positive epistasis (not shown) are qualitatively equivalent. As expected, when recombination is high and selection is moderate (Fig 8A with
) only the individual effect of each locus is relevant. If recombination is low (Fig 8A with
) maximum neutral variability near selected loci is still the same as with high recombination. Variability is not increased beyond the one-locus limit because the population is dominated by only two backgrounds. Moving away from the extremes of the set of selected loci, variability decays at the same rate as in the single-locus case (not shown). In segments between selected loci, however, a second kind of multilocus effect is detected. Variability is increased over a much larger region than expected for a single locus. This is due to the fact that selection generates linkage disequilibrium between selected loci. Crossing over between the two selected loci, which would allow neutral alleles to recombine away, breaks linkage disequilibrium and generates gametes that are eliminated by selection. Thus, the effective recombination rate is reduced in regions between selected loci and, if selection is strong enough, variability can be increased even when recombination is high (Fig 8B with
). Simulated and analytical results fit quite well because coalescent predictions can be calculated taking into account the exact haplotype frequencies at equilibrium. Just as previously shown in Fig 3E and Fig 4A, the extended coalescent needs only information about the frequencies of the selectively relevant haplotypes and not about the kind of selection producing them.
Coalescence times and the frequency spectrum:
To complement the information provided by identities, we considered two more variability measures: d and S (see METHODS OF COMPUTER SIMULATION). Analytical predictions for d can be obtained by using the identities provided by ![]()
![]() |
(7) |
![]()
![]() |
(8) |
where U are all the possible sets of loci and
U are the total recombination rates between the limits of any given set (for details, see Equations 2633 in ![]()
![]() |
(9) |
The mean number of nucleotide differences between pairs of randomly chosen alleles is given by
![]() |
(10) |
This shows that, when r
0, d
. The reason is that under an infinite sites model with no recombination, different genetic backgrounds eventually accumulate infinite differences. Variances can be calculated using the same method.
Fig 9 shows changes in the values of d and S as the number of selected loci increases, for different selective regimes. In general, these two summary statistics behave as expected. Positive epistasis (k > 1) increases variability, but not beyond the one-locus two-backgrounds limit, so analytical results and simulations coincide independently of the number of selected loci. With negative epistasis (k < 1), many backgrounds are maintained by selection and neutral variability is increased beyond the single-locus limit, even though these backgrounds undergo strong fluctuations (Fig 5). In this case simulations fit the analytical predictions until the number of selected loci is large enough for background frequency fluctuations to sweep variability away. When the number of selected loci is large, fluctuations can be so strong that the timescale of background loss and recovery becomes very small. In that case, d and S can be smaller in systems with a large number of loci than when the number of loci is intermediate (compare, for example, d values for three loci with values for eight or nine loci in Fig 9A). Note that although d and S become quite low, the corresponding identities do not undergo such a radical change (Fig 3) because, first, they can fluctuate only between 0 and 1 and, second, they converge very quickly to their equilibrium values (see below).
|
By comparing Fig 9A and Fig 9B, it can be seen that d and S increase at different rates with increasing number of backgrounds. These different rates result in a change in the shape of the frequency spectrum with increasing number of loci. When there is positive epistasis (k > 1) Tajima's D is highly positive and remains so with increasing number of loci (Fig 10). In this case, balancing selection makes the frequency spectrum even and an excess of variants at average frequencies is detected, just as expected from previous results for the one-locus two-background case. With negative epistasis (k < 1) the situation is different. Fig 10 shows that Tajima's D gets less positive with increasing number of loci. Paradoxically, although variability is maximized with negative epistasis, the power to detect deviations from a neutral distribution of allele frequencies is decreased. There are two reasons for this behavior. First, with so many neutral variants segregating in the population, it becomes easy to find several low frequency alleles in a sample. Second, background frequency fluctuations act as selective sweeps or, in terms of the analogy with spatially subdivided populations, as extinction-recolonization events. Such events are known to make Tajima's D negative because low frequency variants tend to accumulate while mutation restores variability in a recently lost and recovered background.
|
| DISCUSSION |
|---|
The multilocus coalescent and balancing selection:
We have shown that in a multilocus system the ways in which selected loci interact are factors as important as population size, recombination, or the strength of selection, because they determine a key factor: the extent to which the population is subdivided, i.e., the number of genetic backgrounds that are maintained in the population. Different kinds of epistasis allow for different degrees of population subdivision and, thus, for different degrees of diversity, both selected and neutral. In general, balancing selection acting on groups of loci generates two related kinds of multilocus effects. First, variability at sites closely linked to each of the selected loci can be enhanced beyond the expectations for a single-locus system. This effect is produced when the population is highly subdivided, that is, when a large number of backgrounds are maintained. In our model, this is achieved under negative epistasis (Fig 3, a and b, for example). This variability increment dissipates quickly as the neutral locus moves away from the set of selected loci and completely disappears when Nr > 1 (Fig 6 and Fig 7). Second, when multiple selected loci are involved, the variability enhancement extends to a larger section of the map. With negative epistasis this extension affects all neutral variability linked to a set of selected loci and is a trivial consequence of the higher levels of variability reached near each of the selected loci (Fig 6 and Fig 7A). In contrast, with multiplicative fitness or positive epistasis (Fig 7B and Fig 8), the extension is directly due to selection and affects only variability in regions between selected loci. Under the latter selective regimes, the population reaches an equilibrium in which there is maximum linkage disequilibrium and only two backgrounds dominate. As a consequence, selection opposes the homogenizing effect of recombination because crossing over between two selected loci produces unfit gametes that selection tends to eliminate. The rate of decay of molecular diversity remains the same as in the single-locus two-backgrounds case for neutral markers outside the set of selected loci. In contrast, diversity is enhanced at neutral markers located in regions among the set, even if they are a long way from the selected loci themselves (see Fig 8 with
). This mechanism has been already described by ![]()
Under negative epistasis, simulations show that, when the number of backgrounds is large relative to the population size, variability stops increasing with the addition of more selected loci (Fig 3A, Fig 3B, and Fig 4B). It is clear that the multilocus coalescent expectations of a huge increase in variability (![]()
![]()
and k, together with N, r, and the number of loci in the system determine how many backgrounds can be maintained in the population and how stable their frequencies will be. As N and
increase, and as k decreases, the more subdivision and the more variability the population can harbor. On the other hand, the larger the number of loci, the less intense will be selection on each individual locus, and because the number of backgrounds grows exponentially with the number of loci, selection will be even weaker on each individual background. For example, in Fig 3, a and b, selection is still very efficient in maintaining allelic frequencies when the number of selected loci is greater than five, but it cannot keep haplotypic frequencies stable against drift and, thus, background frequencies fluctuate.
The nature of multilocus balancing selection equilibria has been studied in detail (cf. ![]()
, k, N, and r, would summarize the amount of genetic subdivision in an equilibrium population and use it to predict the amount of neutral variability that this population can sustain. An obvious candidate for such a statistic is the effective number of selected backgrounds, ne, which can be defined by analogy to the effective number of alleles,
, where pi are the frequencies of the i backgrounds present in the population (![]()
13. With the same number of selected loci but under an additive fitness scheme, ne is only
4 (Fig 3C, f = 0.43). Variability, therefore, seems to increase with the effective number of backgrounds. Yet, the relationship is far from simple. For example, ne
4 is achieved with two selected loci in a system under negative epistasis (Fig 3B), but in this case the probability of identity (f = 0.24) is only one-half as big as the corresponding 4-background value in the additivity case. This implies higher variability with the same effective number of backgrounds. This is due to the fact that fluctuations are much stronger with five selected loci and additivity than with two selected loci and negative epistasis. The degree of population subdivision may be similar in both cases, but ne contains no information about fluctuations, which are what really matters. Unfortunately, as discussed in ![]()
![]()
Variability patterns associated with multilocus balancing selection:
One of the main purposes of this study was to investigate the levels and patterns of neutral variability to be expected under multilocus balancing selection and, conversely, to suggest ways in which the study of neutral variability may allow us to distinguish between different types of fitness regimes. We focused on qualitative results because of the sheer complexity of the problem and because we expected that a preliminary exploration of the parameter space would allow us to detect general patterns that would not depend on the exact nature of selection or on the details of the model. We have been able to do so and found that, although multilocus balancing selection always increases variability, very different patterns are expected under different parameter values. This challenges the traditional consensus view, on the basis of results from single-locus studies, that the footprints of balancing selection are high variability, strong linkage disequilibrium, and uniformly high allele frequencies across sites (![]()
![]()
Under positive epistasis, variability patterns resemble the intuitions provided by single-locus studies. Large "islands" of high variability and high linkage disequilibrium are expected to be found within the region spanned by the selected loci. Variability is expected to be high and roughly constant along the intervening region, even if this region is highly recombining, and to decay for markers at the extremes of the set of selected loci. In sharp contrast, with negative epistasis the effect of multilocus balancing selection depends strongly on recombination. In that case, multilocus balancing selection tends to produce peaks of higher variability around each of the selected loci (not shown) and there is a threshold of recombination beyond which no effect on neutral variability is expected (Nr > 1). Actually, the multilocus threshold coincides with the single-locus one, as previously described (![]()
![]()
![]()
Data on multilocus balancing selection:
Can our results help us to understand the patterns of DNA variability found in natural populations? An initial point can be made on the issue of the overall amount of balancing selection in the genome. Even ignoring the important problems of the expected distribution of epistatic effects and of the possibility of unequal contributions to fitness of different loci, it is clear that if multilocus balancing selection were common, a considerable increase of variability beyond the neutral expectations would be predicted in regions of low recombination. In fact, the opposite correlation has been found in a variety of organisms (![]()
![]()
![]()
![]()
![]()
![]()
![]()
Nevertheless, an increasing number of cases of multilocus balancing selection are being reported. Some examples are self-incompatibility systems (cf. in ![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Although the Mhc is usually modeled as a single-locus multiple-allele system (see, for example, ![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
0. The estimates of Ns obtained by ![]()
![]()
![]()
= 1, so N
= 102103). However, the kind of interactions involved in the Mhc is a far more complex issue.
In the coding regions of Mhc genes, there is an overall enhancement of variability, sometimes accompanied by linkage disequilibrium both within and among genes (![]()
![]()
![]()
104 (![]()
1). It is clear that intragenic recombination is below the threshold, but the whole Mhc region spans
4 Mb on chromosome 6 (![]()
![]()
10-4. Those are the two genes for which the best evidence of linkage disequilibrium has been found (![]()
![]()
![]()
4 x 10-3, Nr
4), so some of the variability within these two loci is beyond the threshold of action of multilocus selection if epistasis is such that it tends to make linkage disequilibrium low. This would seem to be the case because, although HLA-DPB1 has high variability, it is the locus showing less evidence of linkage disequilibrium with any other HLA loci. Interestingly, its frequency spectrum does not present significant deviations from neutrality in the available studies (![]()
![]()
All this evidence suggests that studying regions between Mhc loci, and not only the Mhc loci themselves, is a potentially fruitful strategy to ascertain the kind of selection involved. Unfortunately, only a few such studies are available (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Several simplifying assumptions underlie our model. First, although the analytical approach used by ![]()
![]()
![]()
10-5/kb (![]()
103 N generations. Assuming a generation time of
10 years this gives an average pairwise coalescence time of
108 years. This shows that the populations cannot possibly be at equilibrium. Even if a species survived to such an amount of time, turnover of alleles by mutations at the selected sites would preclude equilibrium. Some of the patterns we have detected, particularly the ones referring to frequency spectrum, are stronger in populations that have not reached equilibrium. Deeper insight is to be gained by the study of nonequilibrium multilocus systems. Such analysis is currently under way.
| ACKNOWLEDGMENTS |
|---|
We thank P. Andolfatto, P. Awadalla, B. Charlesworth, D. Charlesworth, F. Depaulis, S. Otto, J. Rozas, and three anonymous reviewers for valuable discussion and criticism. A.N. is grateful to F. Depaulis, whose comments were particularly helpful (and extremely funny), and to D. Charlesworth, whose ideas made this work readable. This work was supported by Biotechnology and Biological Sciences Research Council/Engineering and Physical Sciences Research Council.
Manuscript received October 22, 2001; Accepted for publication February 22, 2002.
| LITERATURE CITED |
|---|
AQUADRO, C. F., and D. J. BEGUN, 1993 Evidence for and implications of genetic hitch-hiking in the Drosophila genome, pp. 159178 in Mechanisms of Molecular Evolution, edited by N. TAKAHATA and A. G. CLARK. Sinauer Press, Sunderland, MA.
AQUADRO, C. F., D. J. BEGUN and E. C. KINDAHL, 1994 Selection, recombination and DNA polymorphism in Drosophila, pp. 4656 in Non-Neutral Evolution, edited by B. GOLDING. Chapman & Hall, New York.
BARTON, N., 2000 Genetic hitchhicking. Philos. Trans. R. Soc. Lond. B 355:1-10.
BARTON, N. H. and A. NAVARRO, 2002 Extending the coalescent to multilocus systems: the case of balancing selection. Genet. Res. 79:129-139[Medline].
BARTON, N. H. and M. SHPAK, 2000 The stability of symmetrical solutions to polygenic models. Theor. Popul. Biol. 57:249-263[Medline].
BECK, S. and J. TROWSDALE, 2000 The human major histoincompatibility complex: lessons from the DNA sequence. Annu. Rev. Genomics Hum. Genet. 1:117-138[Medline].
BERGELSON, J., M. KREITMAN, E. A. STAHL, and D. TIAN, 2001 Evolutionary dynamics of plant R-genes. Science 292:2281-2285
BERNAL, A., U. EAR, and N. KYRPIDES, 2001 Genomes OnLine Database (GOLD): a monitor of genome projects world-wide. Nucleic Acids Res. 29:126-127
CHARLESWORTH, B., 1994 The effect of background selection against deleterious mutations on weakly selected, linked variants. Genet. Res. 63:213-228[Medline].
CHARLESWORTH, B., M. T. MORGAN, and D. CHARLESWORTH, 1993 The effect of deleterious mutations on neutral molecular variation. Genetics 134:1289-1303[Abstract].
CHARLESWORTH, D. and P. AWADALLA, 1998 Flowering plant self-incompatibility: the molecular population genetics of Brassica S-loci. Heredity 81:1-9.
CHRISTIANSEN, F. B., 1987 The deviation from linkage equilibrium with multiple loci varying in a stepping-stone cline. J. Genet. 66:45-67.
CHRISTIANSEN, F. B., 1988 Epistasis in the multiple locus symmetric viability model. J. Math. Biol. 26:595-618[Medline].
CHRISTIANSEN, F. B., 2000 Population Genetics of Multiple Loci. John Wiley & Sons, New York.
CROW, J. F., and M. KIMURA, 1970 An Introduction to Population Genetics Theory. Harper & Row, New York.
CULLEN, M., J. NOBLE, H. ERLICH, K. THORPE, and S. BECK et al., 1997 Characterization of recombination in the HLA class II region. Am. J. Hum. Genet. 60:397-407[Medline].
DVORAK, J., M. C. LUO, and Z. L. YANG, 1998 Restriction fragment length polymorphism and divergence in the genomic regions of high and low recombination in self-fertilizing and cross-fertilizing aegilops species. Genetics 148:423-434
EWENS, W. J., 1979 Mathematical Population Genetics. Springer Verlag, Berlin.
FRANKLIN, I. and R. C. LEWONTIN, 1970 Is the gene the unit of selection? Genetics 65:701-734.
FU, Y. X., 1997 Statistical tests of neutrality of mutations against population growth, hitchhiking and background selection. Genetics 147:915-925[Abstract].
GAUDIERI, S., J. K. KULSKI, R. L. DAWKINS, and T. GOJOBORI, 1999 Extensive nucleotide variability within a 370 kb sequence from the central region of the major histocompatibility complex. Gene 238:157-161[Medline].
GAUDIERI, S., R. L. DAWKINS, K. HABARA, J. K. KULSKI, and T. GOJOBORI, 2000 SNP profile within the human Major Histoincompatibility Complex reveals an extreme and interrupted level of nucleotide diversity. Genome Res. 10:1579-1586
GUILLAUDEUX, T., M. JANER, G. K. S. WONG, T. SPIES, and D. E. GERAGHTY, 1998 The complete genomic sequence of 424,015 bp at the centromeric end of the HLA class I region: gene content and polymorphism. Proc. Natl. Acad. Sci. USA 95:9494-9499
HEY, J., 1991 The structure of genealogies and the distribution of fixed differences between DNA sequence samples from natural populations. Genetics 128:831-840[Abstract].
HOGSTRAND, K. and J. BOHME, 1999 Gene conversion can create new MHC alleles. Immunol. Rev. 167:305-317[Medline].
HORTON, R., D. NIBLETT, S. MILNE, S. PALMER, and B. TUBBY et al., 1998 Large-scale sequence comparisons reveal unusually high levels of variation in the HLA-DQB1 locus in the class II region of the human MHC. J. Mol. Biol. 282:71-87[Medline].
HUDSON, R., 1990 Gene genealogies and the coalescent process. Oxf. Surv. Evol. Biol. 7:1-44.
HUDSON, R. B. and N. L. KAPLAN, 1988 The coalescent process in models with selection and recombination. Genetics 120:831-840
HUDSON, R. R., and N. L. KAPLAN, 1994 Gene trees with background selection, pp. 140153 in Non-neutral Evolution: Theories and Molecular Data, edited by G. B. GOLDING. Chapman & Hall, London/New York.
HUDSON, R. R. and N. L. KAPLAN, 1995 The coalescent process with background selection. Philos. Trans. R. Soc. B 349:19-23[Medline].
HUGHES, A. L. and M. YEAGER, 1997 Molecular evolution of the vertebrate immune system. BioEssays 19:777-786[Medline].
HUGHES, A. L. and M. YEAGER, 1998 Natural selection and the evolutionaty history of major histocompatibility complex loci. Front. Biosci. 3:509-516.
KAPLAN, N. L., T. DARDEN, and R. B. HUDSON, 1988 The coalescent process in models with selection. Genetics 120:819-829
KAPLAN, N. L., R. R. HUDSON, and C. H. LANGLEY, 1989 The hitch-hiking effect revisited. Genetics 123:887-899
KARLIN, S. and H. AVNI, 1981 Analysis of central equilibria in multilocus systems. Theor. Popul. Biol. 20:241-280.
KARLIN, S. and U. LIBERMAN, 1979 Central equilibria in multilocus systems. II. Bisexual generalized nonepistatic selection schemes. Genetics 91:799-816
KELLY, J. K. and M. J. WADE, 2000 Molecular evolution near a two-locus balanced polymorphism. J. Theor. Biol. 204:83-101[Medline].
LEWONTIN, R. C. and K. KOJIMA, 1960 The evolutionary dynamics of complex polymorphisms. Evolution 14:450-472.
LYTTLE, T. W., 1991 Segregation distorters. Annu. Rev. Genet. 25:511-557[Medline].
MARUYAMA, T., 1972 Rate of decrease of genetic variability in a two-dimensional continuous population of finite size. Genetics 70:639-651
MEYER, D. and G. THOMSON, 2001 How selection shapes variation on the human major histoincompatibility complex: a review. Ann. Hum. Genet. 65:1-26[Medline].
NACHMAN, M. W., 1997 Patterns of DNA variability at X-linked loci in Mus domesticus. Genetics 147:1303-1316[Abstract].
NAGYLAKI, T., 1982 Geographical invariance in population genetics. J. Theor. Biol. 99:159-172[Medline].
NORDBORG, M., 1997 Structured coalescent processes on different time scales. Genetics 146:1501-1514[Abstract].
NORDBORG, M., 2001 Coalescent theory, pp. 179212 in Handbook of Statistical Genetics, edited by D. BALDING, M. BISHOP and C. CANNINGS. Wiley & Sons, New York.
O'HUIGIN, C., Y. SATTA, A. HAUSMANN, R. L. DAWKINS, and J. KLEIN, 2000 The implications of intergenic polymorphism for major histocompatibility complex evolution. Genetics 156:867-877
PALOPOLI, M. F., 2000 Genetic partners in crime: evolution of a coadapted gene complex that specializes in sperm sabotage, pp. 113126 in Epistasis and the Evolutionary Process, edited by E. D. BRODIE, M. U. WADE and J. WOLF. Oxford University Press, London.
PRZEWORSKI, M., R. R. HUDSON, and A. DI RIENZO, 2000 Adjusting the focus on human variation. Trends Genet. 16:296-302[Medline].
SANCHEZ-MAZAS, A., S. DJOULAH, M. BUSSON, I. LE MONNIER DE GOUVILLE, and J. C. POIRIER et al., 2000 A linkage disequilibrium map of the Mhc region based on the analysis of 14 loci haplotypes in 50 French families. Eur. J. Hum. Genet. 8:33-41[Medline].
SATTA, Y., 1997 Effect of intra-locus recombination on HLA polymorphism. Hereditas 127:105-112[Medline].
SATTA, Y., G. O'HUIGIN, N. TAKAHATA, and J. KLEIN, 1994 Intensity of natural selection at the major histocompatibility complex loci. Proc. Natl. Acad. Sci. USA 91:7184-7188
SATTA, Y., Y. J. LI, and N. TAKAHATA, 1998 The neutral theory and natural selection in the HLA region. Front. Biosci. 27:459-467.
SLATKIN, M. and C. A. MUIRHEAD, 2000 A method for estimating the intensity of overdominant selection from the distribution of allele frequencies. Genetics 156:2119-2126
STEPHAN, W. and C. H. LANGLEY, 1998 DNA polymorphism in Lycopersicon and crossing-over per physical length. Genetics 150:1585-1593
STEPHAN, W., T. H. WIEHE, and M. LENZ, 1992 The effect of strongly selected substitutions on neutral polymorphism: analytical results based on diffusion theory. Theor. Popul. Biol. 41:237-254.
STROBECK, C., 1983 Expected linkage disequilibrium for a neutral locus linked to a chromosome rearrangement. Genetics 103:545-555
SVED, J. A., 1983 Does natural selection increase or decrease variability at linked loci? Genetics 105:239-340
TAJIMA, F., 1989 Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 123:585-595
TAKAHATA, N. and M. NEI, 1990 Allelic genealogy under overdominant and frequency-dependent selection and polymorphism of MHC loci. Genetics 124:967-978[Abstract].
TAKAHATA, N. and Y. SATTA, 1998 Footprints of intragenic recombination at HLA loci. Immunogenetics 47:430-441[Medline].
TAKAHATA, N., Y. SATTA, and J. KLEIN, 1992 Polymorphism and balancing selection at major histocompatibility loci. Genetics 130:925-938[Abstract].
WHITLOCK, M. C. and N. H. BARTON, 1997 The effective size of a subdivided population. Genetics 146:427-441[Abstract].
This article has been cited by other articles:
![]() |
J. F. Storz and J. K. Kelly Effects of Spatially Varying Selection on Nucleotide Diversity and Linkage Disequilibrium: Insights From Deer Mouse Globin Genes Genetics, September 1, 2008; 180(1): 367 - 379. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Ferrer-Admetlla, E. Bosch, M. Sikora, T. Marques-Bonet, A. Ramirez-Soriano, A. Muntasell, A. Navarro, R. Lazarus, F. Calafell, J. Bertranpetit, et al. Balancing Selection Is the Main Force Shaping the Evolution of Innate Immunity Genes J. Immunol., July 15, 2008; 181(2): 1315 - 1322. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. F. Storz, M. Baze, J. L. Waite, F. G. Hoffmann, J. C. Opazo, and J. P. Hayes Complex Signatures of Selection and Gene Conversion in the Duplicated Globin Genes of House Mice Genetics, September 1, 2007; 177(1): 481 - 500. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Kamau, B. Charlesworth, and D. Charlesworth Linkage Disequilibrium and Recombination Rate Estimates in the Self-Incompatibility Region of Arabidopsis lyrata Genetics, August 1, 2007; 176(4): 2357 - 2369. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Kawabe, S. Nasuda, and D. Charlesworth Duplication of Centromeric Histone H3 (HTR12) Gene in Arabidopsis halleri and A. lyrata, Plant Species With Multiple Centromeric Satellite Sequences Genetics, December 1, 2006; 174(4): 2021 - 2032. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Innan Modified Hudson-Kreitman-Aguade Test and Two-Dimensional Evaluation of Neutrality Tests Genetics, July 1, 2006; 173(3): 1725 - 1733. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Kuang, S.-S. Woo, B. C. Meyers, E. Nevo, and R. W. Michelmore Multiple Genetic Processes Result in Heterogeneous Rates of Evolution within the Major Cluster Disease Resistance Genes in Lettuce PLANT CELL, November 1, 2004; 16(11): 2870 - 2894. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Takebayashi, E. Newbigin, and M. K. Uyenoyama Maximum-Likelihood Estimation of Rates of Recombination Within Mating-Type Regions Genetics, August 1, 2004; 167(4): 2097 - 2109. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. J. Clauss and T. Mitchell-Olds Functional Divergence in Tandemly Duplicated Arabidopsis thaliana Trypsin Inhibitor Genes Genetics, March 1, 2004; 166(3): 1419 - 1436. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Turelli and N. H. Barton Polygenic Variation Maintained by Balancing Selection: Pleiotropy, Sex-Dependent Allelic Effects and G x E Interactions Genetics, February 1, 2004; 166(2): 1053 - 1079. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Charlesworth, C. Bartolome, M. H. Schierup, and B. K. Mable Haplotype Structure of the Stigmatic Self-Incompatibility Gene in Natural Populations of Arabidopsis lyrata Mol. Biol. Evol., November 1, 2003; 20(11): 1741 - 1753. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Nordborg and H. Innan The Genealogy of Sequences Containing Multiple Sites Subject to Strong Selection in a Subdivided Population Genetics, March 1, 2003; 163(3): 1201 - 1213. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. BERTRANPETIT, F. CALAFELL, D. COMAS, A. GONZALEZ-NEIRA, and A. NAVARRO Structure of Linkage Disequilibrium in Humans: Genome Factors and Population Stratification Cold Spring Harb Symp Quant Biol, January 1, 2003; 68(0): 79 - 88. [Abstract] [PDF] |
||||
- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Email this article to a friend
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Navarro, A.
- Articles by Barton, N. H.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Navarro, A.
- Articles by Barton, N. H.











, and
. (a)
. (b)
. (c)
. (d)
. (e)
. (In this and the following figures, bars show one standard error.)

. (a) Predicted values with several backgrounds (solid lines) or only two backgrounds (dashed lines). () r = 10-5. (
) r = 10-4. (
) r = 10-3. (
) r = 10-2. (b) Negative epistasis,
) Neutral. (
) r = 10-5. (



. (a)
. (b) 
) predicted r = 1-4, (




, sample size n = 10. (a) Simulated and predicted values of the average number of pairwise differences. (b) Simulated values of the number of segregating sites.
, sample size n = 10.



