| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Corresponding author: Mikkel H. Schierup, Bioinformatics Research Center (BiRC), Department of Ecology and Genetics, University of Aarhus, Ny Munkegade, Bldg. 540, 8000 Aarhus C., Denmark., mikkel.schierup{at}biology.au.dk (E-mail)
Communicating editor: N. TAKAHATA
| ABSTRACT |
|---|
Using a coalescent model of multiallelic balancing selection with recombination, the genealogical process as a function of recombinational distance from a site under selection is investigated. We find that the shape of the phylogenetic tree is independent of the distance to the site under selection. Only the timescale changes from the value predicted by Takahata's allelic genealogy at the site under selection, converging with increasing recombination to the timescale of the neutral coalescent. However, if nucleotide sequences are simulated over a recombining region containing a site under balancing selection, a phylogenetic tree constructed while ignoring such recombination is strongly affected. This is true even for small rates of recombination. Published studies of multiallelic balancing selection, i.e., the major histocompatibility complex (MHC) of vertebrates, gametophytic and sporophytic self-incompatibility of plants, and incompatibility of fungi, all observe allelic genealogies with unexpected shapes. We conclude that small absolute levels of recombination are compatible with these observed distortions of the shape of the allelic genealogy, suggesting a possible cause of these observations. Furthermore, we illustrate that the variance in the coalescent with recombination process makes it difficult to locate sites under selection and to estimate the selection coefficient from levels of variability.
LOCI under multiallelic balancing selection are the most polymorphic genes known in eukaryotes. These systems include gametophytic (![]()
![]()
![]()
![]()
![]()
The MHC data in particular have stimulated the analysis of models that are consistent with these striking levels of polymorphism. Overdominant selection with (close to) equal selection coefficient is sufficient (and appears necessary) to explain the data (![]()
![]()
![]()
Population genetics theory has successfully explained some aspects of these polymorphisms. Nevertheless, one important aspect of the pattern of polymorphism in these systems is not yet well understood. This is the shape of the phylogenetic tree of the alleles. ![]()
![]()
![]()
![]()
![]()
![]()
However, when these ratios are applied to real sequence data of functionally different alleles, they show significant deviations from coalescent expectations (![]()
![]()
![]()
![]()
![]()
|
TAKAHATA's (1990) allelic genealogy is an infinite alleles model that treats alleles as entities that cannot be broken up by recombination. It is thus an important assumption for application of this theory to sequence data that recombination does not occur. However, intragenic recombination/gene conversion has been reported within genes of the MHC (![]()
![]()
![]()
![]()
![]()
Here, the expected effects of recombination on the shape of the genealogy are investigated. We simulate a simple model of multiallelic selection with recombination using an extension of HUDSON's (1983) algorithm for the coalescent with recombination. The main assumption is that variation in a single nonrecombining spot on the sequence is subject to selection. This spot could either be a single nucleotide site or a collection of adjacent nucleotides forming a specificity-determining region.
First, we investigate the genealogy as a function of the recombination distance from the spot under selection. At the spot under selection, we expect a neutral-shaped genealogy of alleles with an extended timescale, which depends on the number of allelic classes and the mutation rate to new specificities (![]()
![]()
![]()
Second, we quantify the shape of the "average" genealogical tree of a sample of whole sequences subject to recombination. With recombination a single genealogical tree does not normally describe the sequence variation since different parts of the sequence have different histories. Previous investigations of allelic genealogies are, however, based on phylogenetic trees, and it is therefore of interest to investigate the expected shape of a phylogenetic tree of sequences even when recombination occurs. Biases introduced by ignoring recombination can then be quantified. To investigate this question we simulate samples of nucleotide sequences assuming a given amount of recombination and a specified substitution model for the linked neutral nucleotides. We describe how much recombination is needed before the shape of the phylogenetic tree is distorted, compare these results with the published studies, and conclude that relatively small amounts of recombination are compatible with the deviations from the expected shape of genealogical trees observed in the data sets.
| MODEL |
|---|
The model is an extension of HUDSON's (1983) coalescent with recombination, here allowing for a simple form of symmetrical balancing selection. It is reminiscent of the process formulated by ![]()
|
There are n sequences sampled and the diploid population size is N. The continuous time approximation scales time in 2N generations. Recombination can happen with the same probability over the sequence determined by the overall recombination parameter
= 4Nr, which is the number of recombination events in a sequence in 4N generations, with r thus being the probability of a recombination event in a single sequence in a single generation.
To model strong balancing selection we assume that M distinct allelic classes are kept in equal frequencies in the population. An allelic class is also termed a specificity as in studies of self-incompatibility or the MHC. The turnover process of specificities follows ![]()
![]()
Each of the n sampled sequences is associated at its left endpoint with one of the allelic classes. A given point at a given sequence can change its associated specificity if either the specificity is changed by an allelic turnover event or if recombination occurs between the selected site and the focal point. Two sequences can coalesce only when they have the same specificity (Fig 1, top).
Assume that there are M specificities in the population and that we sample n sequences of different specificities n
M. This corresponds to the situation where an investigator sequences only one copy of each specificity, but not all existing specificities are necessarily sampled. The coalescent process with recombination and selection can then be approximated by three independent exponentially distributed waiting times, namely coalescence, recombination, and allelic turnover (see Fig 1). A sample of sequences is followed backward in time until all parts of each sampled sequence (the ancestral material) have found a most recent common ancestor.
Coalescence:
The intensity of coalescence is given by Ct = M
Mi=1(
), where ni is the number of copies of specificity i, since coalescences can only happen within a given specificity that each has an effective size of 2N/M. If a coalescent event happens, an allelic class i is chosen with probability proportional to ni(ni - 1)/2, two random sequences from class i are merged into one ancestral sequence with the same specificity i, and ni is decreased by 1. Note that initially, since we sample at most one sequence from each specificity, ni = 1 for all specificities sampled and C0 = 0. Thus, coalescence can only happen once recombination has shifted ancestral material to other specificities, making ni > 1 for at least one i.
Recombination:
The intensity of recombination Rt at a given point in time is determined by the amount of ancestral material to the sample plus any material "trapped" by blocks of ancestral material (![]()
, i.e., the number of sequences times their lengths. If recombination happens, the recombination point is picked uniformly over this length of sequence. A recombination event breaks up the sequence in a left and a right segment (Fig 1). The left segment retains its allelic class. The right segment is assigned an allelic class among the other existing classes. In the case of self-incompatibility, this class is chosen randomly among the M - 1 allelic classes distinct from the class of the recombining sequence. For overdominance with selection coefficient s, the present class is chosen with relative weight 1 - s, corresponding to selection against homozygotes of strength s.
Allelic turnover:
The intensity of allelic turnover is determined by Q, which is independent of the time t by definition. If an allelic turnover event happens, an allelic class, say i, is chosen randomly with equal probability among the M allelic classes. The ni sequences from this class are then made to coalesce instantly and the resultant sequence has its specificity changed to one of the other M - 1 allelic classes at random (Fig 1). Viewed forward in time this corresponds to a new specificity arising by mutation followed by its (almost) immediate increase in frequency due to the strong selection favoring rare specificities. Then a new allelic class is introduced to keep the number constant. Initially, this new allelic class does not carry ancestral material, but recombination events can transfer ancestral material to the class (see Fig 1). Note that if this happens, the material between the point of selection and the left border of ancestral material is also added to the trapped material part of the recombination intensity because a recombination event here would change the specificity associated with the ancestral material and thus the coalescent history. If the allelic turnover rate is small, the coalescent process of the left endpoint of the sequence (the point under selection) is dominated by the allelic turnover process alone, and, according to ![]()
)/Q, which is likely to be much longer than the neutral value of D = 2(
) when Q < 1.
Since the three events are independent and exponentially distributed, the intensity of any event to happen is exponentially distributed with parameter Ct + Rt + Q and given that an event happens, the probability that it is a coalescent event, say, is Ct/(Ct + Rt + Q). The process is simulated from starting conditions by determining the time of the first event by drawing a random number from an exponential distribution with mean 1/(Ct + Rt + Q), then determining the type of the event, and finally updating the intensities of the three events according to the rules above. The process is continued until all parts of the sequences have found a common ancestor. For the left endpoint of the sequences the time until a common ancestor is primarily determined by the allelic turnover process, whereas recombination plays an increasingly important role the farther a point is away from the left endpoint.
The process results in a set of correlated trees relating the samples along the sequence. In contrast to the neutral coalescent with recombination these trees are not taken from the same distribution since their expected branch lengths depend on the distance to the left endpoint where specificities are determined. During a single stochastic realization of the process we stored all information on topology and branch length for each of these trees. From these we (a) investigate the coalescent process as a function of distance from the point of selection and (b) simulate and subsequently analyze nucleotide sequences under this process.
| STATISTICS |
|---|
To characterize the shape of the phylogenetic trees we used five quantities calculated from branch lengths. These are
From these, four ratios,

and

can be defined, where an =
n-1i=1 (
) and bn
n-1i=2 (
) (![]()
Viewed as ratios of means, all four ratios are scaled to have an expected mean of one under the neutral coalescent, and simulations have shown that their means as ratios are also close to one (![]()
![]()
![]()
We also calculated the time between subsequent coalescence events. Under the neutral coalescent with i sequences, the mean waiting times Fi to the next coalescence are independent and exponentially distributed with mean 2/(i(i - 1)). Thus, Gi = Fii
2 are exponentially distributed with mean 1, and plotting Gi as a function of i can visualize systematic deviations from neutral expectations.
These measures can all be calculated from the branch length of the true trees over the sampled sequences in a single realization of the coalescent with recombination and balancing selection process. They can also be calculated from phylogenetic trees reconstructed from nucleotide sequences simulated under the model (see below).
Genealogical structure over a gene:
The model was used to simulate genealogical histories. Each of n sampled genes was assigned a unique specificity among the M possibilities. Models were simulated and analyzed where either all specificities were sampled (n = M) or just a subset was sampled (n < M). One run of the program generates a set of trees with branch lengths over the set of genes. Such a set is a single outcome of the stochastic process and is termed a "history." We sampled a given history at 1000 points spaced as a logarithmic function of
and calculated the various statistics at each point. Mean and standard deviations for a given set of parameters were then found over many (>15,000) recorded histories. The statistics were then plotted as a function of the recombination distance from the site under selection. A total length of
= 100 was investigated, which means that 100 recombinations are expected between the endpoints of a gene in 4N generations.
Nucleotide sequences:
Simulation of nucleotide sequences followed ![]()
![]()
![]()
Sequences were simulated with a single allelic turnover rate at the site under selection but with different levels of recombination over the sequence. Again, sequences were initially assigned unique specificities, equivalently to sampling sequences with different specificities only, as is done in published studies of these systems. Each set of sequences was subsequently run though DNAdist and Kitsch programs of PHYLIP (![]()
| RESULTS |
|---|
Genealogical structure over gene:
Fig 2 shows results for four of the basic quantities for two different allelic turnover rates to new specificities [Q = 0.01 (solid line) and Q = 0.1 (dotted-dashed line)] for the sample size n = 30 genes and M = 30 specificities. The values of each quantity for
= 0 are as expected from TAKAHATA's (1990) theory (marked on y-axis), which predicts coalescence times proportional to the square number of specificities and to 1/Q (e.g., D = M(M - 1)(1 - 1/n)/Q). Selection can be seen to greatly increase expected coalescence times close to the site under selection, but as
increases, each quantity approaches the value expected under KINGMAN's (1982) coalescent (marked on right vertical axis). Each of the four quantities shows, as expected, a monotonic decrease with distance from the point of selection. We stress two observations. First, even though the two values of Q correspond to a 10-fold difference in fS, the graphs become (almost) indistinguishable when
> 0.02. Thus, differences in selection intensities only have an effect extremely close to the point of selection (corresponding perhaps only to a couple of nucleotides). Second, the recombination distance needed to approach the neutral coalescent depends approximately linearly on the number of allelic classes as shown previously for the pairwise divergence times (![]()
= 1 the effect of selection on levels of diversity is negligible for M = 2 (shown by ![]()
= 10 when M = 30.
|
Fig 3 shows the four ratios for the same runs as in Fig 2. For
= 100, ratios from all models are expected to converge to a value very close to one as expected from the neutral coalescent. Similarly, the ratios are expected to converge to one for
= 0 (![]()
|
Several different combinations of parameters M, n, Q (including cases where n < M, i.e., not all specificities sampled), and selection coefficients against homozygotes were investigated and found to yield the same conclusions (results not shown).
Expected phylogenetic tree for sequence sets with selection and recombination:
Again, we present results for 30 sequences sampled, each from a distinct specificity (M = 30), and the turnover rate to a new specificity was Q = 0.1. The recombination rate here is the value of
used when simulating the sequences and not the recombination distance from the site under selection as in the previous section. The idea here is to show how various amounts of recombination will affect inferences that are based on sequences from different specificities.
Fig 4 shows S (the length of terminal branches), T (the length of the tree), D (the height of the tree), and P (the average pairwise difference) as a function of
. All statistics except S show a monotonic decrease as a function of
. This is expected when Q is constant, because parts of the sequence are less influenced by selection when
increases, lowering the average (compare Fig 1). The statistic S, the length of the terminal branches, increases slightly for small recombination rates, indicating that the shape of the genealogical tree is changed when recombination occurs. Fig 5 shows this to be true. Even very small rates of recombination (
0.1) have a large effect on the four ratios. The terminal branches are relatively longer than expected (when compared with the height and total length of the tree, i.e., both RSD and RST > 1), and the average pairwise distance is smaller than expected from the total branch length (RPT < 1). This pattern is similar to that previously reported for neutrally evolving sequences (![]()
![]()
|
|
Again, the inferences drawn from these results were found to be very robust to changing values of the different parameters, including modeling different strengths of balancing selection (results not shown).
The scaled internode distances, Gi, reveal a more detailed picture of the shape of the genealogy. Fig 6 shows these as functions of the coalescence events for four different recombination rates. For
= 0, the line is horizontal as expected from theory (![]()
increases, the recent coalescence times are too long relative to the coalescence times close to the root. This again reflects the long terminal branches (Fig 5).
|
Variation over a single set of sequences:
Fig 2D shows the average expected diversity P as a function of the recombination distance from the spot under selection. However, the variation around these means is enormous, mainly because of the inherent variation in the coalescent with recombination process. To visualize this variance we chose three random data sets for each of three different amounts of recombination. Sequence diversity was then calculated in a sliding window (Fig 7). Variation between runs is very large as can be seen by comparing the three replicates for a given value of
(Fig 7). Increasing the rate of recombination makes it easier to see where selection is acting. When
= 0.01 it appears virtually impossible to pinpoint the spot under selection (which by definition is at the left endpoint) from sequence diversity pattern and even for
= 0.1 there are spurious peaks of diversity separated from the site of selection by low diversity regions. This reflects that the coalescent with recombination process determines the history of blocks of nucleotides and when
< 0.1 the size of such blocks is large. Therefore, detection of selection through regions of overall hypervariability of both synonymous and nonsynonymous substitutions is likely to be successful only if
> 1 for the sequence under study.
|
| DISCUSSION |
|---|
We have investigated a very simple model of multiallelic balancing selection with recombination. The allelic genealogy and the neutral coalescent have the same genealogical structure, differing only in timescale (![]()
= 1, recombination shifts a given nucleotide among the different specificities in much the same way as a change in specificity caused by an allelic turnover event but at a higher rate. The process is therefore similar to an allelic genealogy with increasing allelic turnover rate as one moves farther away from the site under selection. To be able to compare with experimental data of these systems, we simulated sequences under the same model and reconstructed phylogenies. Results show that very little recombination significantly changes the shape of the inferred genealogy.
Limitations of the model:
Some of the simplifications in the model need consideration.
In cases where specificity is determined by nucleotides dispersed in the sequence of interest, it is presently unclear how good an approximation our model is. We have not investigated models of this situation mainly because recombination between such nucleotides would then contribute to the creation of new specificities and a very complex mutation process would have to be modeled. The current understanding of the determination of specificities does not allow us to choose among the many possible models of such interaction of sites. However, we believe that the qualitative effects we observe would also be preserved under many realistic models where several interspersed positions determine specificity.
Comparison with experimental data sets:
With these limitations in mind, we compare our results with observed values of the four ratios in different incompatibility systems (Table 1). Clearly, the observed pattern is very close to the results of simulations when sequences are allowed to recombine at a very low absolute rate. In fact, values of Table 1 correspond to recombination rates in the range of
= 0.0010.1 in Fig 4.
Indirect evidence that gene exchange occurs between alleles has been reported in each of the four types of self-recognition analyzed in Table 1. In sporophytic SI, ![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
To further investigate whether recombination has happened in the data sets of Table 1 we also applied two recent tests of recombination. The informative sites test (![]()
![]()
We hypothesize that if recombination occurs at sufficiently high rates then it is a possible explanation for the "long terminal branches" (![]()
![]()
![]()
Recombination rates in the range
= 0.0010.1 are normally too small to investigate by direct methods. Assuming 20 sequences are sampled,
= 0.1 corresponds to 0.3 recombination events in the whole sequence set during 2N generations. As an example, in Drosophila melanogaster,
= 0.1 corresponds to just 1.2 bp of a gene in a region of normal recombination if we assume the effective population size is N = 106 and the per generation recombination rate is 2 x 10-8 between adjacent nucleotides. Thus it is possible that recombination can be severely reduced in self-incompatibility genes (![]()
= 0.1 is unlikely to be detected through direct observation of segregation. The reason for the large effect of such low rates of recombination on the structure of the allelic genealogy is that balancing selection slows the coalescent process of the alleles (Fig 2 and Fig 3) and thereby extends the time interval where recombination may have an effect. In this sense, the closer to the selected site, the higher the effective recombination rate, which is expected to be fS times the neutral expectation very close to the selected site. Thus, recombination in the ancestral material to the sampled genes is expected to happen with an inflated frequency close to the site under selection.
It remains to be determined whether recombination rates in the different self-recognition systems are on the order of
= 0.0010.1 that one would expect if recombination alone should explain the long terminal branches in these systems. Because of the strong selection,
= 0.001 and 0.01 corresponds to
40 and 150 recombination events, respectively, in the history of the sample (values recorded in the simulations). Such levels of recombination should in principle be detectable for neutrally evolving sequences, but, for balancing selection, the selected position acts as an apparent "recombination hot spot" as discussed above. This invalidates application of current estimation methods of recombination rates to the data sets of Table 1. Even for a neutrally evolving gene, estimation of the recombination rate is already a formidable task (see, e.g., ![]()
A further feature of each of the systems of Table 1 is that the alignments of alleles contain hypervariable regions. For example, in sporophytic SI of Brassicaceae three such regions have been described (![]()
![]()
> 1, see Fig 7). The reason is the very large variance in the time to the most recent common ancestor over a sequence in the coalescent with recombination process. Stronger evidence for selection in hypervariable regions would be that only the nonsynonymous substitution rate is elevated but this is difficult to test because the amount of synonymous substitution in most of these systems is very close to saturation.
If recombination is indeed happening at a rate
> 0.01 in some self-recognition systems, then some of the conclusions based on allelic phylogenies of self-incompatibility systems and MHC systems should be carefully reconsidered. The level of trans-specific evolution (TSE; i.e., polymorphism shared between species) is very high in both MHC (![]()
![]()
![]()
![]()
![]()
![]()
![]()
A final consequence of recombination is that the intensity of selection is very difficult to estimate. Fig 2 showed that stronger selection affects only a very minor part of the sequence because the rest of the sequence "escapes" the balancing selection through recombination. Thus, that selection is acting can be inferred from an increased level of polymorphism but the strength and location of selection cannot be determined with accuracy when recombination occurs.
| ACKNOWLEDGMENTS |
|---|
M.H.S. thanks D. Charlesworth and P. Awadalla for continued discussions. X. Vekemans, F. B. Christiansen, Marcy K. Uyenoyama, and two anonymous referees made useful suggestions about the manuscript. T. Christensen is thanked for computer programming. The study was supported by grants nos. 9701412 and 1262 from the Danish Natural Sciences Research Council and by the Basic Research in Computer Science Centre of the Danish National Research Foundation.
Manuscript received May 23, 2001; Accepted for publication October 1, 2001.
| LITERATURE CITED |
|---|
ANDERSON, M. A., E. C. CORNISH, S.-L. MAU, E. G. WILLIAMS, and R. HOGGART et al., 1986 Cloning of cDNA for a stylar glycoprotein associated with expression of self-incompatibility in Nicotiana alata.. Nature 321:38-44.
AWADALLA, P. and D. CHARLESWORTH, 1999 Recombination and selection at Brassica self-incompatibility loci. Genetics 152:413-425
AWADALLA, P., A. EYRE-WALKER, and J. M. SMITH, 1999 Linkage disequilibrium and recombination in hominid mitochondrial DNA. Science 286:2524-2525
AYALA, F. J., 1995 The myth of Eve: molecular biology and human origins. Science 270:1930-1936
BERGSTROM, T. F., A. JOSEFSSON, H. A. ERLICH, and U. GYLLENSTEN, 1998 Recent origin of HLA-DRB1 alleles and implications for human evolution. Nat. Genet. 18:237-242[Medline].
CASSELMAN, A. L., J. VREBALOV, J. A. CONNER, A. SINGHAL, and J. GIOVANNONI et al., 2000 Determining the physical limits of the Brassica S locus by recombinational analysis. Plant Cell 12:23-33
DWYER, K. G., M. A. BALENT, J. B. NASRALLAH, and M. E. NASRALLAH, 1991 DNA sequences of self-incompatibility genes from Brassica campestris and B. oleracea: polymorphism predating speciation. Plant Mol. Biol. 16:481-486[Medline].
EMERSON, S., 1939 A preliminary survey of the Oenothera organensis population. Genetics 24:524-537
FELSENSTEIN, J., 1995 PHYLIP (Phylogeny Inference Package) Version 3.572. Distributed over the Worldwide Web, Seattle.
GRIFFITHS, R. C. and P. MARJORAM, 1996 Ancestral inference from samples of DNA sequences with recombination. J. Comput. Biol. 3:479-502[Medline].
HUDSON, R. R., 1983 Properties of a neutral allele model with intragenic recombination. Theor. Popul. Biol. 23:183-201[Medline].
HUDSON, R. R. and N. L. KAPLAN, 1988 The coalescent process in models with selection and recombination. Genetics 120:831-840
HUGHES, A. L. and M. NEI, 1988 Pattern of nucleotide substitution at major histocompatibility complex class I loci reveals overdominant selection. Nature 335:167-170[Medline].
JUKES, T. H., and C. R. CANTOR, 1969 Evolution of protein molecules, pp. 21123 in Mammalian Protein Metabolism, edited by H. N. MUNRO. Academic Press, New York.
KINGMAN, J. F. C., 1982 The coalescent. Stoch. Proc. Appl. 13:235-248.
KUSABA, M., T. NISHIO, Y. SATTA, K. HINATA, and D. OCKENDON, 1997 Striking sequence similarity in inter- and intra-specific comparisons of class I SLG alleles from Brassica oleracea and Brassica campestris: implications for the evolution and recognition mechanism. Proc. Natl. Acad. Sci. USA 94:7673-7678
MAY, G. and E. MATZKE, 1995 Recombination and variation at the a mating-type of Coprinus-cinereus. Mol. Biol. Evol. 12:794-802.
MAY, G., F. SHAW, H. BADRANE, and X. VEKEMANS, 1999 The signature of balancing selection: fungal mating compatibility gene evolution. Proc. Natl. Acad. Sci. USA 96:9172-9177
NISHIO, T. and M. KUSABA, 2000 Sequence diversity of SLG and SRK in Brassica oleracea L. Ann. Bot. 85(Suppl. A):141-146
RICHMAN, A. D. and J. R. KOHN, 1999 Self-incompatibility alleles from Physalis: implications for historical inference from balanced genetic polymorphisms. Proc. Natl. Acad. Sci. USA 96:168-172
RICHMAN, A. D., T.-H. KAO, S. W. SCHAEFFER, and M. K. UYENOYAMA, 1995 S-allele sequence diversity in natural populations of Solanum carolinense (Horsenettle). Heredity 75:405-415.
RICHMAN, A. D., M. K. UYENOYAMA, and J. R. KOHN, 1996 S-allele diversity in a natural population of Physalis crassifolia (Solanaceae) (ground cherry) assessed by RT-PCR. Heredity 76:497-505.
SCHIERUP, M. H. and J. HEIN, 2000 Consequences of recombination on traditional phylogenetic analysis. Genetics 156:879-891
SCHIERUP, M. H., X. VEKEMANS, and F. B. CHRISTIANSEN, 1997 Evolutionary dynamics of sporophytic self-incompatibility alleles in plants. Genetics 147:835-846[Abstract].
SCHIERUP, M. H., X. VEKEMANS, and F. B. CHRISTIANSEN, 1998 Allelic genealogies in sporophytic self-incompatibility systems in plants. Genetics 150:1187-1198
SCHIERUP, M. H., B. K. MABLE, P. AWADALLA, and D. CHARLESWORTH, 2001 Identification and characterization of a polymorphic receptor kinase gene linked to the self-incompatibility locus of Arabidopsis lyrata.. Genetics 158:387-399
SEDDON, J. M. and P. R. BAVERSTOCK, 1999 Variation on islands: major histocompatibility complex (MHC) polymorphism in populations of the Australian bush rat. Mol. Ecol. 8:2071-2079[Medline].
SIMS, T. L., 1993 Genetic regulation of self-incompatibility. Crit. Rev. Plant Sci. 12:129-167.
SWOFFORD, D. L., 2000 PAUP*. Phylogenetic Analysis Using Parsimony (*and Other Methods), Version 4. Sinauer Associates, Sunderland, MA.
TAKAHATA, N., 1990 A simple genealogical structure of strongly balanced allelic lines and transspecies evolution of polymorphism. Proc. Natl. Acad. Sci. USA 87:2419-2423
TAKAHATA, N. and M. NEI, 1990 Allelic genealogy under overdominant and frequency-dependent selection and polymorphism of major histocompatibility complex loci. Genetics 124:967-978[Abstract].
TAKAHATA, N. and Y. SATTA, 1998 Footprints of intragenic recombination at HLA loci. Immunogenetics 47:430-441[Medline].
TAKAHATA, N., Y. SATTA, and J. KLEIN, 1992 Polymorphism and balancing selection at major histocompatibility complex loci. Genetics 130:925-938[Abstract].
THOMPSON, J. D., T. J. GIBSON, F. PLEWNIAK, F. JEANMOUGIN, and D. G. HIGGINS, 1997 The ClustalX windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 24:4876-4882.
UYENOYAMA, M. K., 1997 Genealogical structure among alleles regulating self-incompatibility in natural populations of flowering plants. Genetics 147:1389-1400[Abstract].
VEKEMANS, X. and M. SLATKIN, 1994 Gene and allelic genealogies at a gametophytic self-incompatibility locus. Genetics 137:1157-1165[Abstract].
WALL, J. D., 2000 A comparison of estimators of the population recombination rate. Mol. Biol. Evol. 17:156-163
WANG, X., A. L. HUGHES, T. TSUKAMOTO, T. ANDO, and T.-H. KAO, 2001 Evidence that intragenic recombination contributes to allelic diversity of the S-RNAse gene at the self-incompatibility(S) locus in Petunia inflata.. Plant Physiol. 125:1012-1022
WIUF, C. and J. HEIN, 1997 On the number of ancestors to a DNA sequence. Genetics 147:1459-1468[Abstract].
WOROBEY, M., 2001 A novel approach to detecting and measuring recombination: new insights into evolution in viruses, bacteria, and mitochondria. Mol. Biol. Evol. 18:1425-1434
This article has been cited by other articles:
![]() |
S. Takuno, R. Fujimoto, T. Sugimura, K. Sato, S. Okamoto, S.-L. Zhang, and T. Nishio Effects of Recombination on Hitchhiking Diversity in the Brassica Self-incompatibility Locus Complex Genetics, October 1, 2007; 177(2): 949 - 958. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. A. Loisel, M. V. Rockman, G. A. Wray, J. Altmann, and S. C. Alberts Ancient polymorphism and functional variation in the primate MHC-DQA1 5' cis-regulatory region PNAS, October 31, 2006; 103(44): 16331 - 16336. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Innan Modified Hudson-Kreitman-Aguade Test and Two-Dimensional Evaluation of Neutrality Tests Genetics, July 1, 2006; 173(3): 1725 - 1733. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Kuang, S.-S. Woo, B. C. Meyers, E. Nevo, and R. W. Michelmore Multiple Genetic Processes Result in Heterogeneous Rates of Evolution within the Major Cluster Disease Resistance Genes in Lettuce PLANT CELL, November 1, 2004; 16(11): 2870 - 2894. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Charlesworth, C. Bartolome, M. H. Schierup, and B. K. Mable Haplotype Structure of the Stigmatic Self-Incompatibility Gene in Natural Populations of Arabidopsis lyrata Mol. Biol. Evol., November 1, 2003; 20(11): 1741 - 1753. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. D. Richman, L. G. Herrera, and D. Nash Evolution of MHC Class II E{beta} Diversity Within the Genus Peromyscus Genetics, May 1, 2003; 164(1): 289 - 297. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||