- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Email this article to a friend
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Ohashi, J.
- Articles by Tokunaga, K.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Ohashi, J.
- Articles by Tokunaga, K.
Sojourn Times and Substitution Rate at Overdominant and Linked Neutral Loci
Jun Ohashia and Katsushi Tokunagaaa Department of Human Genetics, University of Tokyo, Graduate School of Medicine, Tokyo 113-0033, Japan
Corresponding author: Jun Ohashi, Department of Human Genetics, Graduate School of Medicine, University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-0033, Japan., juno{at}m.u-tokyo.ac.jp (E-mail)
Communicating editor: A. G. CLARK
| ABSTRACT |
|---|
The sojourn times until fixation of an overdominant allele were investigated based on the diffusion equation. Furthermore, the rate of accumulation of mutations, or the substitution rate, was predicted from the mean extinction time of a common overdominant allele. The substitution rate calculated theoretically agreed well with that determined by computer simulation. Overdominant selection enhances the polymorphism at linked loci, while its effect on the sojourn times and the substitution rate at linked loci has not been studied yet. To solve these problems, a model that assumed two linked loci, each with infinite alleles, was examined by computer simulation. A decrease in the recombination rate between two loci markedly changed the distribution of sojourn times of a neutral allele. Although overdominant selection obviously increased the sojourn times and the polymorphism at a linked locus, the rate of nucleotide substitution at the neutral locus was not influenced significantly even if complete linkage was assumed. These results suggest that, in regions containing overdominant genes, linked neutral loci will exhibit elevated levels of polymorphism, but their rate of molecular evolution remains that predicted by neutral theory.
SEVERAL outstanding characteristics of overdominant alleles, such as the allele frequency distribution, the homozygosity, the maintenance of polymorphisms, the rate of gene substitution, and the allelic genealogy have been investigated based on the infinite-allele model (e.g., ![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
It is known that overdominant selection at a particular locus enhances the polymorphism at a linked nonoverdominant locus or region (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
| MODELS |
|---|
Throughout this article, the infinite-allele model of ![]()
Analysis of the overdominant allele:
In this study, the sojourn times are calculated based on the process until extinction of the allele in question. Since the sojourn times of infinite neutral alleles have already been determined by ![]()
x[y] be the total number of generations in which the gene frequency is y, in the case that the initial allele frequency is x. Then
x[y] is represented as (Equation 7 in ![]()
![]() |
(1) |
![]() |
(2) |
where G(p) = exp {-2
p(
)dp}. M
p and V
p are, respectively, the mean and the variance of the change in the gene frequency p per generation. Equation 1 and Equation 2 consider a possibility for temporary fixation of the allele. Under the infinite-allele model and symmetric overdominant selection, M
p and V
p are given by -up -
and
respectively, where J (=
p2k) is the sum of the squares of the frequencies of all alleles and N is the size of a diploid population (![]()
![]()
![]()
![]()
, we obtain
![]() |
(3) |
If we substitute V
p and (3) into (1) and (2), then we obtain
![]() |
(4) |
![]() |
(5) |
Numerical calculations are required to compute the integral equation in (4) and (5) unless -4N(u + A - AJ) and -4NA are special values.
To calculate the sojourn times, J needs to be evaluated before the calculation. For the moment, let us focus on the computation of J as well as the equilibrium distribution of the number of alleles. Let
(p)dp be the expected number of alleles whose frequency is in the range of p to p + dp. It is given by ![]()
![]()
![]()
![]() |
(6) |
Equation 6 has a local maximum at pmax =
and a local minimum at pmin =
, if 0 < 1 - a + b < 2b, where we put a = 4N(u + A - AJ) - 1 and b = 4NA. Since the sum of allele frequencies is equal to unity, the relation
![]() |
(7) |
is satisfied. J, which satisfied (7), was calculated for several parameter sets of N, u, and s (Table 1). Once J is obtained, it can be evaluated by
![]() |
(8) |
because (8) gives the mean homozygosity. Although values of J from (8) are not quite identical to those of J from (7) due to the approximation, values of J from (7) are numerically close to those from (8) (data not shown). ![]()
![]()
(p):
![]() |
(9) |
|
They used -up - sp(p - J) and p/2N as M
p and V
p, respectively. As N increases, the difference between J from (6) and (9) becomes small (Table 1). In this study, J from (6) are used in the following calculations. When A
s,
[y] and
(p) are functions of Nu and Ns. As a consequence, the results of theoretical calculation and simulation for the particular parameters N, u, and s, can be regarded as the results for other parameter values chosen appropriately.
Fig 1 shows the distribution of the sojourn times of a new mutant allele, or
1/2N[y], based on (6) and (9). Here we assumed that N = 1000 and u = 10-5. The mean extinction time of this allele is given by
1

[y]dy. We can see that the extinction time of a particular mutant allele (s > 0) is mostly determined by the duration of its stay at the middle frequency class.
|
To check the accuracy of our computation, simulation experiments were carried out. In the simulation, two genes are randomly chosen from a population without replacement, and the genotype is determined. If the individual is a homozygote, then we compare the homozygote disadvantage, s, with the random number that is generated in the range of 01. When the former is smaller than the latter, this individual can transmit one gene to the next generation. If the genotype is heterozygous, one of two genes is transmitted randomly to the next generation. This procedure is repeated until 2N genes are transmitted. Mutation events are considered after the transmission of the genes, and the parental allele and the time of mutation are recorded at the same time. After reaching equilibrium, the data for 2000 different alleles were recorded, and the mean sojourn times were calculated on the basis of this record (Fig 2). The results of theoretical expectation and simulation were in reasonable agreement with each other. When s = 0.1, (6) gives a better approximation than (9). In contrast, when s = 0.01, the theoretical curve from (9) is in fair agreement with the simulation results.
|
In this study, an allele whose frequency is >pmin is defined as a common allele and pmax is regarded as the mean frequency of a common allele at equilibrium. Fig 3 shows the sojourn times of a common allele with an initial frequency of pmax under overdominant selection (i.e.,
pmax[y]). Unlike the sojourn times of a newly arisen mutant allele, the mean extinction time of a common allele, tc =
1
pmax [y]dy, decreased as s increased, as expected from (6) in ![]()
|
The nucleotide substitution is caused by allelic turnover of common alleles under overdominant selection (![]()
![]()
![]()
![]()
, as
![]() |
(10) |
where we measure
in units of 1/u generations. Equation 10 is not useful for estimating
, compared to TAKAHATA's (1990) formula,
![]() |
(11) |
However, (10) agreed well with the results of the simulation (Table 2).
|
Analysis of the neutral allele at the linked locus:
Unlike the single locus, it is difficult to treat the two linked loci with infinite alleles. To examine the effect of overdominant selection on the linked neutral gene, a simulation approach was used. The recombination rate between the overdominant locus and a neutral locus was denoted by r.
Among the overdominant loci, the major histocompatibility complex (MHC) loci, especially the human MHC, or human leukocyte antigen (HLA) loci that are located at 6p21.3, have been investigated intensively. On the basis of the HLA data, the effective size of the human population is considered to be ~104105 (see ![]()
![]()
![]()
|
|
For the same parameter sets, the mean rate of accumulation of neutral mutations, the number of alleles, and the oldest allelic divergence time among alleles that existed at the end of the simulation were examined for a linked neutral locus (Table 3). The number of alleles and the oldest allelic divergence time increased as r decreased, mainly because the mean coalescence time of two linked neutral genes increased (![]()
|
| DISCUSSION |
|---|
Theoretical calculations in this study are dependent on
(p) because the equilibrium homozygosity, J, is required to be known prior to the calculations. Although several one-dimensional frequency spectrum approximations have been obtained (![]()
![]()
![]()
![]()
![]()
![]()
The sojourn times or the frequency spectrum at the neutral locus that locates around the HLA locus are affected by natural selection operating at the HLA. Our results suggest that genes are influenced by HLA if the loci are located within 0.1 cM (r = 10-3) from HLA, although this genetic distance is overestimated because the population size is not taken into consideration here. Recently, a high degree of polymorphism of CYP21 was reported (![]()
![]()
![]()
Our results offer the key to understanding the synonymous substitutions in the ARS of HLA genes. The ARS of the HLA class II gene is composed of 16 amino acid residues, whose sites are codons 9, 11, 13, 28, 30, 37, 38, 57, 61, 67, 70, 71, 74, 78, 82, and 86, in the second exon. Since one-fifth of the nucleotide changes are thought to be synonymous, and the mutation rate is 2 x 10-8 per site per generation for humans, the synonymous mutation rate in the ARS is ~0.2 x 10-6 per gene per generation for the HLA class II loci (0.2 x 10-6
1/5 x 48 x 2 x 10-8). If we suppose that the long-term breeding size in the human lineage is 104 (case I) or 105 (case II), the synonymous mutation rate is equivalent to 0.4 x 10-5 or 0.4 x 10-4, respectively, in our simulation in which N = 500 is assumed. Likewise, the nonsynonymous mutation rate is assumed to be 1.6 x 10-5 (case I) or 1.6 x 10-4 (case II). The obtained synonymous substitution and nonsynonymous substitution rates are shown in Table 4. Here, we use substitution rate as a rate of accumulation of mutations. The number of synonymous changes in the ARS between any pair of alleles was larger for r = 0 than for r = 0.5 (data not shown), while the synonymous substitution rate was approximately equal to the synonymous mutation rate. Whether the synonymous substitution rate at the HLA locus is increased by the overdominant selection can be evaluated by the interlocus comparisons of HLA genes. ![]()
![]()
![]()
![]()
|
| ACKNOWLEDGMENTS |
|---|
We express our great thanks to Dr. Naoyuki Takahata for his interest and helpful comments. We are grateful to Dr. Andrew G. Clark for reviewing the manuscript and providing helpful suggestions and to the anonymous reviewers for their insightful comments. Thanks to Dr. Minato Nakazawa and Mr. Jun Hagihara for help in carrying out simulation experiments. This study was supported by a Grant-in-Aid for Scientific Research from the Japanese Ministry of Education, Science, Sports, and Culture.
Manuscript received June 30, 1999; Accepted for publication February 14, 2000.
| LITERATURE CITED |
|---|
CARGILL, M., D. ALTSHULER, J. IRELAND, P. SKLAR, and K. ARDLIE et al., 1999 Characterization of single-nucleotide polymorphisms in coding regions of human genes. Nat. Genet. 22:231-238[Medline].
EWENS, W. J., 1964 The maintenance of alleles by mutation. Genetics 50:891-898
HUDSON, R. R. and N. L. KAPLAN, 1988 The coalescent process in models with selection and recombination. Genetics 120:831-840
HUGHES, A. L. and M. NEI, 1988 Pattern of nucleotide substitution at major histocompatibility complex class I loci reveals overdominant selection. Nature 335:167-170[Medline].
HUGHES, A. L. and M. NEI, 1989 Nucleotide substitution at major histocompatibility complex class II loci: evidence for overdominant selection. Proc. Natl. Acad. Sci. USA 86:958-962
KAPLAN, N. L., T. DARDEN, and R. R. HUDSON, 1988 The coalescent process in models with selection. Genetics 120:819-829
KIMURA, M., 1968 Evolutionary rate at the molecular level. Nature 217:624-626[Medline].
KIMURA, M. and J. F. CROW, 1964 The number of alleles that can be maintained in a finite population. Genetics 49:725-738
LI, W.-H., 1978 Maintenance of genetic variability under the joint effect of mutation, selection and random drift. Genetics 90:349-382
MARUYAMA, T., 1974 The age of an allele in a finite population. Genet. Res. 23:137-143[Medline].
MARUYAMA, T. and M. KIMURA, 1975 Moments for sum of an arbitrary function of gene frequency along a stochastic path of gene frequency change. Proc. Natl. Acad. Sci. USA 72:1602-1604
MARUYAMA, T. and M. NEI, 1981 Genetic variability maintained by mutation and overdominant selection in finite populations. Genetics 98:441-459
NAGYLAKI, T., 1974 The moments of stochastic integrals and the distribution of sojourn times. Proc. Natl. Acad. Sci. USA 71:746-749
OHTA, T. and M. KIMURA, 1970 Development of associative overdominance through linkage disequilibrium in finite populations. Genet. Res. 16:165-177[Medline].
OHTA, T. and M. KIMURA, 1971 Behavior of neutral mutants influenced by associated overdominant loci in finite populations. Genetics 69:247-260
SATTA, Y., 1997 Effects of intra-locus recombination on HLA polymorphism. Hereditas 127:105-112[Medline].
SATTA, Y., C. O'HUIGIN, N. TAKAHATA, and J. KLEIN, 1994 Intensity of natural selection at the major histocompatibility complex loci. Proc. Natl. Acad. Sci. USA 91:7184-7188
TAKAHATA, N., 1990 A simple genealogical structure of strongly balanced allelic lines and trans-species evolution of polymorphism. Proc. Natl. Acad. Sci. USA 87:2419-2423
TAKAHATA, N. and M. NEI, 1990 Allelic genealogy under overdominant and frequency-dependent selection and polymorphism of major histocompatibility complex loci. Genetics 124:967-978[Abstract].
TAKAHATA, N. and Y. SATTA, 1998 Footprints of intragenic recombination at HLA loci. Immunogenetics 47:430-441[Medline].
TAKAHATA, N., Y. SATTA, and J. KLEIN, 1992 Polymorphism and balancing selection at major histocompatibility complex loci. Genetics 130:925-938[Abstract].
THOMSON, G., 1977 The effect of a selected locus on linked neutral loci. Genetics 85:753-788
WATTERSON, G. A., 1977 Heterosis or neutrality? Genetics 85:789-814
WRIGHT, S., 1938 The distribution of gene frequencies under irreversible mutation. Proc. Natl. Acad. Sci. USA 24:253-259
WRIGHT, S., 1966 Polyallelic random drift in relation to evolution. Proc. Natl. Acad. Sci. USA 55:1074-1081
YOKOYAMA, S. and M. NEI, 1979 Population dynamics of sex-determining alleles in honey bees and self-incompatibility alleles in plants. Genetics 91:609-626
- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Email this article to a friend
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Ohashi, J.
- Articles by Tokunaga, K.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Ohashi, J.
- Articles by Tokunaga, K.
















