Genetics, Vol. 148, 1333-1340, March 1998, Copyright © 1998

Conditions for Positive and Negative Correlations Between Fitness and Heterozygosity in Equilibrium Populations

Hong-Wen Denga and Yun-Xin Fub
a Osteoporosis Research Center, Creighton University, Omaha, Nebraska 68131
b Human Genetics Center, SPH, University of Texas, Houston, Texas 77225

Corresponding author: Hong-Wen Deng, Osteoporosis Research Center, Creighton University, 601 N. 30th St., Suite 6787, Omaha, NE 68131, deng{at}creighton.edu (E-mail).

Communicating editor: Z-B. ZENG


*  ABSTRACT
*TOP
*ABSTRACT
*THEORY
*DISCUSSION
*LITERATURE CITED

The past decades have witnessed extensive efforts to correlate fitness traits with genomic heterozygosity. While positive correlations are revealed in most of the organisms studied, results of no/negative correlations are not uncommon. There has been little effort to reveal the genetic causes of these negative correlations. The positive correlations are regarded either as evidence for functional overdominance in large, randomly mating populations at equilibrium, or the results of populations at disequilibrium under dominance. More often, the positive correlations are viewed as a phenomenon of heterosis, so that it cannot possibly occur under within-locus additive allelic effects. Here we give exact genetic conditions that give rise to positive and negative correlations in populations at Hardy-Weinberg and linkage equilibria, thus offering a genetic explanation for the observed negative correlations. Our results demonstrate that the above interpretations concerning the positive correlations are not complete or even necessary. Such a positive correlation can result under dominance and potentially under additivity, even in populations where associated overdominance due to linked alleles at different loci is not significant. Additionally, negative correlations and heterosis can co-occur in a single population. Although our emphasis is on equilibrium populations and for biallelic genetic systems, the basic conclusions are generalized to non-equilibrium populations and for multi-allelic situations.


DURING the past three decades, numerous efforts have attempted to correlate fitness (or related characters) with genomic heterozygosity as reflected by molecular marker heterozygosity in natural populations (MITTON and GRANT 1984 Down; ALLENDORF and LEARY 1986 Down; ZOUROS and FOLTZ 1987 Down; LYNCH and WALSH 1997). While a number of studies have found no or negative correlations (e.g., GAINS et al. 1978 Down; PIERCE and MITTON 1982 Down), positive correlations have been revealed in most organisms studied. There has been little effort to search for satisfactory genetic explanations for the negative correlations, and it is generally regarded that with a parametric slope of nearly zero, negative correlations can result by chance. On the other hand, the more commonly observed positive correlations have stimulated a great deal of research interest and there have been some distinct genetic explanations for them.

One explanation is that (MITTON and GRANT 1984 Down; SMOUSE 1986 Down; ZOUROS and FOLTZ 1987 Down) overdominance is the cause, and the positive correlation is a piece of evidence for overdominance underlying the fitness loci. This is especially true in large randomly mating populations in which linkage disequilibrium is not significant. The basis for this view is that with overdominance, one would expect individual fitness to increase with the fraction of the genome that is heterozygous. This fraction should be correlated with the number of heterozygous molecular marker loci if they are tightly linked to polymorphic loci underlying fitness or themselves influence fitness. Under dominance, a correlation between multilocus heterozygosity and fitness cannot arise except when there is a correlation between multilocus homozygosity and the level of inbreeding. Such an association is unlikely to be pronounced in large, randomly mating populations. However, the overdominance argument encounters some difficulties. For example, there are several findings of heterozygote deficiency in populations showing positive correlations between heterozygosity and fitness-related traits (GAFFNEY et al. 1990 Down; LYNCH and WALSH 1997), which should not be expected in large randomly mating populations with the overdominance hypothesis.

Another explanation is that the populations studied may not be strictly panmictic but instead have local inbreeding. Nonrandom mating may cause correlations between homozygosity in the genome, even with unlinked loci (HALDANE 1949 Down; OHTA and COCKERHAM 1974 Down; HOULE 1994 Down). The allozyme heterozygosity may thus be correlated with an individual level of inbreeding, and hence the associations between heterozygosity and fitness are largely the consequence of variation in the level of inbreeding among individuals (LEDIG et al. 1983 Down; STRAUSS 1986 Down). However, this explanation was not supported by a few studies (e.g., LEARY et al. 1987 Down), and large studies in marine bivalves normally exclude inbreeding as an explanation for the observed positive correlation (GAFFNEY et al. 1990 Down). In randomly mating populations, if linkage disequilibrium is present, regardless of its causes, linked deleterious alleles with dominance at different loci will likely cause associated overdominance (HOULE 1989 Down). This also may explain the positive correlation. However, despite a large number of studies, evidence for linkage disequilibrium in large randomly mating populations remains controversial (e.g., BARKER 1979 Down; LEWONTIN 1985 Down; SMIT-MACBRIDE et al. 1988 Down; HOULE 1989 Down; ZAPATA and ALVAREZ 1992, ZAPATA and ALVAREZ 1993; LYNCH and DENG 1994 Down; DENG and LYNCH 1996A Down).

The correlation approach for fitness and genomic heterozygosity and the different explanations for the observed data are highly relevant to one fundamental and long-standing issue in population genetics: How is genetic variation maintained in natural populations? Overdominance essentially encompasses all forms of balancing selection at the allelic level and dominance is compatible with mutation-selection balance. The following two inferences are common to the above two genetic explanations. First, the positive correlation reflects the phenomenon of heterosis. Hence in large randomly mating populations, it cannot possibly exist under within-locus additive allelic effects. In addition, heterosis should be incompatible with the negative correlations and they cannot co-occur within individual populations. Second, in populations at genetic equilibria, the positive correlations cannot exist with dominance. These concepts have been widely held among the researchers in this field. However, are they always true? Throughout, unless otherwise specified, (genetic) equilibria refer to Hardy-Weinberg and/or linkage equilibria.

Employing a multilocus biallelic model, and by theoretical analyses supplemented by computer simulations, we show that these two concepts are not true. Moreover, we demonstrate that negative correlations between fitness and genomic heterozygosity are not unexpected, and we give explicit genetic conditions for both positive and negative correlations to occur. Our focus is on equilibrium populations. However, the conclusions derived for multiple loci under equilibrium are generalized to nonequilibrium populations and multi-allelic systems through one locus model.

There are extensive data existing for the discovered correlations. There are also some potential limitations of the correlation approach for fitness and genomic heterozygosity (see DISCUSSION). Therefore, our focus here is to show when correlations do indeed exist in equilibrium populations, and how to interpret them when they are detected. Hence, some practical issues are not dealt with here, such as what sample sizes are needed and how many loci need to be assayed in order to detect a correlation when it indeed exists. Some of these, or related practical problems, have been addressed before (e.g., MITTON and PIERCE 1980 Down; CHAKRABORTY 1981 Down).


*  THEORY
*TOP
*ABSTRACT
*THEORY
*DISCUSSION
*LITERATURE CITED

Consider a simplified situation, in which there are N polymorphic loci underlying fitness, each having two alleles A and a. The allelic effects across loci may vary so that the equilibrium frequencies for the ith locus is pi and qi, respectively. Let the three genotypic values be:

Then for the ith locus, hi < 0.0 implies overdominance, hi = 0.5 implies additivity, 0 < hi < 1.0 (hi != 0.5) implies dominance and hi > 1.0 implies underdominance. For the time being, we will assume hi = h, si = s, and that mutation-selection balance has been established so that qi = q where qi is the frequency of the less fit allele a at the ith locus. We will, later in the discussion section, consider the situation where hi , si and qi vary across loci. qi may vary across loci either due to variable hi and si in populations at mutation-selection balance, or in populations experiencing recent expansion after a population bottleneck (even with constant h and s), where genetic disequilibria could be negligible but mutation-selection balance has not been reached.

The multiplicative fitness function is biologically plausible by direct and indirect evidence (MORTON et al. 1956 Down; CROW 1986 Down; FU and RITLAND 1996 Down) and will be assumed throughout. The fitness W(n1,n2) of an individual is totally determined by the number of heterozygous (n1) and homozygous (n2) loci for a allele in the genome, regardless of the specific genotypes at particular loci:

(1)
where 0 <= n1 + n2 <= N.

Under random mating and linkage equilibrium, genomic genotypes for individuals, as determined by n1 and n2, follow a trinomial distribution:

(2)

Conditional on having n1 heterozygous loci, the probability of having n2 homozygous loci for the a allele in the genome is a binomial distribution:

(3)

Thus, conditional on having n1 heterozygous loci in the genome, the expected fitness of a genotype [E(W/n1)] is:

(4)

Note that the quantities and in the above equation are, respectively, the probabilities of being aa and AA genotypes, conditional on that the genotype is homozygous. + is the weighted mean fitness of the two homozygotes, with the weight being their conditional probabilities as above.

Equation 4 can be re-written as:

(5)

It is then easy to see that E(W/n1) is a monotonically increasing function of n1, if

(6)
otherwise it is a monotonically decreasing function of n1. Let

(7)
then Equation 6 is equivalent to

(8)

In other words, if the heterozygote fitness is larger than the weighted mean fitness of the two homozygotes (Equation 6), or if h satisfies Equation 8, then the expected fitness of a genotype increases monotonically with the number of heterozygous loci (n1) in the genome; otherwise it decreases monotonically with n1. This result may be easier to understand when we only consider, later in this section, the simplest case of a single locus with two alleles.

Therefore, both positive and negative relationships between fitness and genomic heterozygosity can exist. Which one exists critically depends on the above condition (Equation 6 or Equation 8) under our assumptions of random mating, no significant linkage disequilibrium, and multiplicative fitness function. Figure 1 depicts the parameter space of h and q that gives rise to positive and negative correlations between fitness and genomic heterozygosity. Note that Figure 1 does not imply any true relationship of h and q. It just graphically depicts the outcome regions of the relationship of fitness and heterozygosity given the true relationship of h and q. It can be seen that negative correlations are possible under a large range of parameter space of h and q. In particular, when q is small (<0.1), negative correlations are almost always expected under dominance and additivity. Additionally, positive correlations could potentially exist under additivity, if q is larger than 0.5 (see DISCUSSION).



View larger version (8K):
In this window
In a new window
Download PPT slide
 
Figure 1. —Parameter spaces of h (Y-axis) and q (X-axis) for positive and negative correlations to exist in population at genetic equilibria. The curve is based on Equation 7. Region A indicates the parameter space where both population inbreeding depression (heterosis) and a negative correlation can occur.

To corroborate our analytical derivations, computer simulations were performed. The population was at Hardy-Weinberg and linkage equilibria. A total of 1000 polymorphic genomic loci of constant effects h and s with the same equilibrium allele frequencies p and q were assumed to be underlying fitness. At each locus, the genotype of an individual was determined by a uniform random variable {xi}(0 < {xi} < 1.0). Genotype AA was chosen if {xi} < p2, else Aa if {xi} < p2 + 2pq, otherwise aa was chosen. An individual's fitness was determined by Equation 1. One hundred genotypes were sampled from the population in our simulations. As stated in the introduction, our objective here was to find the genetic conditions for the existence of a true correlation. Additionally, the purpose of our simulation was to corroborate our analytical results. Therefore, the genotypic values of the 100 sampled genotypes were assumed to be measured without error, and are plotted in Figure 2. Regression analysis was performed; the regression slope was positive and highly significant (P = 0.0006). The simulation corroborates very well with our analytical derivations, i.e., the regression line based on the simulated genotypic data coincides with the expected theoretical line (Figure 2).



View larger version (20K):
In this window
In a new window
Download PPT slide
 
Figure 2. —A positive relationship detected between fitness (Y-axis) and the number (n1) of heterozygous loci in the genome (X-axis) under dominance. The solid line is the regression line based on the 100 simulated data points, the dashed line is the expected line by Equation 5. For data simulated in this figure, q = 0.3, s = 0.01, h = 0.01 and hc = 0.155.

The above conclusion, which was derived from a multiple-locus system where linkage equilibrium and mutation-selection balance were assumed, may be made more intuitive and general. In the rest of this section, we are going to progressively relax the assumptions concerning the genetic systems. First, we consider the simplest case of one locus with two alleles A and a as defined before. The essential question is then: Which has a higher fitness, a heterozygote or a homozygote? With the heterozygote, the fitness is 1 - hs. With a homozygote, it can be either AA or aa with respective frequencies in the population under Hardy-Weinberg equilibrium being p2 and q2. Thus the expected fitness of a homozygote in the population is

which is

Therefore, if

(9)
a heterozygote has a higher expected fitness than a homozygote. Otherwise, if h > hc, a heterozygote has lower expected fitness than a homozygote. If q is such that hc > 0.5, then the heterozygote has a higher expected fitness than a homozygote regardless of allelic effects being overdominant (h < 0), dominant (0 < h < 1.0, h != 0.5) or additive (h = 0.5). Note that Equation 9 is actually equivalent to Equation 7 and Equation 8. This is exactly the condition we found earlier when we considered multiple loci under a multiplicative fitness function (Equation 6, Equation 7 and Equation 8), where we assumed constant h, s, q, linkage equilibrium and mutation-selection balance. However, in the one locus case, no assumption needs to be made about the linkage disequilibrium and mutation-selection balance. Therefore, regardless of linkage disequilibrium and mutation-selection balance, as long as Equation 9 is satisfied at each locus, a positive correlation will exist.

Now, let us relax our assumptions even further and assume a general population where even Hardy-Weinberg equilibrium may not hold. A population resulting from the mixing of different populations may represent such a scenario. The essential question still is: At each polymorphic locus, which has a higher fitness, a heterozygote or a homozygote? For a biallelic locus as above, let us denote the genotype frequencies as PAA, PAa, and Paa respectively. The expected fitness of a genotype conditional on that it is homozygous is:

Again, this is the weighted fitness of the homozygotes, with the weight being their conditional genotype frequencies. Therefore, if

a positive correlation will exist. The above inequality is equivalent to

(10)

Note there is no assumption concerning the (Hardy-Weinberg/linkage disequilibrium) equilibrium in the above derivation. It can be easily seen that if Hardy-Weinberg equilibrium is assumed, Equation 10 can be reduced to Equation 9. If h > hc , a negative correlation will exist.

All the above analyses are for biallelic genetic systems, which are applicable for many allozyme loci and restriction fragment polymorphisms (RFLPs). However, some allozyme loci have more than two alleles and the increasingly employed micro-satellite marker loci are even more polymorphic. In the following, we are going to give the general genetic conditions of the correlation relationships for multi-allelic systems. We will use the tri-allelic genetic system as an example; extensions to genetic systems with more alleles are straightforward and can be obtained similarly.

Let the genotypic values and frequencies of a tri-allelic locus be:

The essential question is again the same as before: Which has a higher fitness, a heterozygote or a homozygote? The expected fitness of a genotype given that it is homozygous is:

and the expected fitness of a genotype given that it is heterozygous is:

Let = . That is, is the weighted homozygous effect of the less fit alleles, with the weight being the conditional probabilities

of being a specific less fit homozygeous genotype, given that the genotype is homozygous.

Let = . That is, is the weighted dominance coefficient, with the weight being the product of the corresponding selection coefficients (s1, s2 and s3) and the conditional probabilities

of being a specific heterozygous genotype, given that the genotype is heterozygous.

Therefore, if

(11)
a positive correlation is expected for fitness and heterozygosity for this tri-allelic locus. Otherwise, a negative correlation is expected. The results can be extended easily to loci with more than three alleles to show that the conclusions are the same. Note there is no assumption concerning the equilibrium in the above derivation. It can be easily seen that the result, derived from a single locus, is applicable to multiple loci under additive and multiplicative fitness functions. It can also be easily seen that with the definition of and as above, Equation 11 can be reduced to Equation 10 for the biallelic situations by just setting PCC = 0, PAC = 0, PBC = 0 (since only two alleles A and B exist at the locus).


*  DISCUSSION
*TOP
*ABSTRACT
*THEORY
*DISCUSSION
*LITERATURE CITED

Under the simplified model of two alleles at each locus, in randomly mating populations with genetic equilibria, a positive correlation between genomic heterozygosity and fitness with dominance seems to be counterintuitive and has not been revealed before (Figure 1 and Figure 2). A more counterintuitive conclusion is that a positive correlation could potentially exist even under within-locus additive allelic effects, without any dominance or overdominance in equilibrium populations (Figure 1, Equation 6Equation 7Equation 8Equation 9). However, our analyses clearly demonstrate that these are entirely possible for a range of parameter space under Hardy-Weinberg and linkage equilibria. Importantly, negative correlations between fitness and genomic heterozygosity are actually not unexpected under a wide range of plausible parameter space of h and q in equilibrium populations (Figure 1). Additionally, in nonequilibrium populations, a positive correlation can also exist with dominance, overdominance, and additive allelic effects if Equation 10 holds at polymorphic loci; otherwise a negative correlation may exist. Furthermore, with no assumption about the genetic equilibrium, the exact genetic conditions for the positive and negative correlations for general multi-allelic systems are also given. All of these results are new in that they give genetic conditions for the correlations, which directly link the genetic effect h (and s) with the population property of gene (or genotype) frequencies.

The conditions for these counterintuitive phenomena to exist do not seem to be prohibitive given some level of biological knowledge. For a positive correlation to exist, q does not have to be very common in the dominance case (Figure 1). In the additive case, in order for hc > 0.5 so that a positive correlation could possibly exist (Equation 6, Equation 8 and Equation 9), q has to be greater than 0.5 in populations at genetic equilibrium or PAA < Paa in populations at disequilibrium (Equation 10). Are these entirely impossible? Our knowledge is very limited on fitness effects at polymorphic loci such as those revealed by molecular markers (KIMURA 1983 Down; NEI 1987 Down; LI 1997 Down). For example, how is the extensive polymorphism in natural populations maintained? What is the difference in the fitness effects of different polymorphisms? How much genetic variation of fitness can polymorphism at a particular single locus explain? What is the mutation rate at a locus? Our knowledge about the population history, such as population dynamics and population admixture, is also very limited. Therefore, the possibility of q > 0.5 cannot be entirely ruled out in at least three conceivable situations. The first is in populations that have been large for long enough generations, so that a mutation-selection balance has been approximately reached. For instance, for slightly deleterious mutations (OHTA 1973 Down, OHTA 1974 Down), it is not unreasonable to assume s = 0.0001. Even in the most prohibitive case of additive effects, with mutation-selection balance, the locus mutation rate u is inferred to be on the order of 1.0E-5 (CROW and KIMURA 1970 Down) in order for q > 0.5. This is roughly on the order of the few mutation rates inferred for allozyme loci (HARTL and CLARK 1989 Down; MAYNARD-SMITH 1989 Down). For instance, SCHLEGER and DICKIE 1971 Down estimated that, for five loci tested in the mouse, the average mutation rate is 1.1E-5. Since this is only for visible mutants, it is likely to be a lower bound. It is noted that by inferring the order of u, reversible mutations are ignored. This may be partially justified by the fact that mutations at many of its nucleotide sites within a locus may be slightly deleterious; however, in a mutant allele, only reversible mutations at those few mutated nucleotide sites can restore the original wild-type allele.

The second situation where q may exceed 0.5 may be in populations experiencing recent expansion, where mutation-selection balance is not established yet but Hardy-Weinberg and linkage disequilibria are not significant. We focus on considering the conditions for different correlations under Hardy-Weinberg and linkage equilibria. In order to assume constant q under constant h and s across loci, mutation-selection balance is assumed when we derive Equation 8 for the case of multiple loci. However, mutation-selection may not be an essential assumption for our conclusions. This can be easily seen, since the same basic conclusion (Equation 8) is derived for the single locus case (Equation 9) without assuming mutation-selection balance. It is known that Hardy-Weinberg equilibrium can be established by just one generation of random mating; linkage disequilibrium decays at a rate of r. r is the recombination rate between two loci at disequilibrium. However, to reach a mutation-selection balance, roughly, for those mutants with s greater than 10/N e, where Ne is the effective population size, the population has to have an annual population size in excess of Ne for a time span (in generations) of at least a few Ne (KIMURA et al. 1963 Down; LYNCH et al. 1995 Down; DENG and LYNCH 1996B Down). Therefore, it takes a much longer time for a population to reach mutation-selection balance than to reach approximate genetic equilibria. So, it is possible for a population to be approximately in genetic equilibria without reaching mutation-selection balance. The third scenario is that for non-equilibrium populations due to recent population admixture, we can have PAA < Paa fairly easily if two populations mix and the larger proportion is from the population homozygous for a.

The observed negative correlations between fitness traits and heterozygosity in a number of studies (GAINS et al. 1978 Down; PIERCE and MITTON 1982 Down; MITTON and GRANT 1984 Down; ALLENDORF and LEARY 1986 Down; ZOUROS and FOLTZE 1987; LYNCH and WALSH 1997) have not attracted much attention. Hence, there has been hardly any satisfactory genetic explanation for them. Researchers have generally attributed these negative correlations to statistical artifacts and commented that with a parametric slope of nearly zero, negative correlations can result by chance. However, we show here that the negative correlations may indeed result if, at each locus, heterozygote fitness is smaller than the weighted mean fitness of homozygote, or h > hc. As noted in Figure 1 and Equation 6Equation 7Equation 8Equation 9Equation 10, for a di-allelic system, the negative correlations can actually occur under a wide range of parameter space of h and q in equilibrium and nonequilibrium populations. This conclusion contradicts that of TURELLI and GINZBURG 1983 Down for multi-allelic systems. By numerical analyses via computer simulations, they concluded that in populations with little linkage disequilibrium, average fitness always increases with genomic heterozygosity. The different conclusions of ours and that of TURELLI and GINZBURG 1983 Down are not due to the genetic systems under study (bi-allelic vs. multi-allelic), since we clearly show that the negative correlations can result under multi-allelic systems (Equation 11). TURELLI and GINZBURG 1983 Down did not study the influence of the detailed genetic effect h (and s) in association with gene (or genotype) frequencies on the correlations.

The potentially common negative correlations are not inconsistent with the widely observed inbreeding depression. For a single locus with two alleles, the necessary and sufficient condition for population inbreeding depression to occur is that the heterozygote has fitness greater than the arithmetic mean fitness of the two corresponding homozygotes, not weighted by their population frequencies as in Equation 6 and Equation 10 (CROW and KIMURA 1970 Down; FALCONER and MACKAY 1996 Down). Therefore, the widely observed inbreeding depression does necessarily imply that the negative correlation should be rare. In fact, when q is small, the negative correlation should be more likely unless h is also very small (Figure 1). Please note, underdominance (where h > 1) may be an alternative and sufficient explanation for the negative correlations. However, it is not a necessary one, since h does not have to be greater than 1.0 if hc < 1. As long as h > hc , a negative correlation will exist. Figure 1 depicts the parameter space where inbreeding depression and negative correlations can co-occur in a single population. Because of this potential co-occurrence of inbreeding depression (heterosis) and negative correlations, the positive correlations may not be interpreted as equivalent to heterosis.

It should be pointed out that although the derivation for multiple-loci in the THEORY section is based on the assumptions of constant effects (h and s) and multiplicative fitness function, these assumptions are not essential for our main conclusions. In the THEORY section, for the one locus model, we also showed the same basic results as the multilocus model. If hi and si are variable across loci, each locus will have its own peculiar qi , and thus a peculiar hic , at mutation-selection balance. At the ith locus, hic is determined by the equilibrium qi (Equation 7), which in turn depends on the specific allelic effects hi and si for equilibrium populations at mutation-selection balance (CROW and KIMURA 1970 Down). Under either multiplicative or additive fitness functions across loci, as long as at each locus, hi < hic , being a heterozygote has higher expected fitness than being a homozygote; thus, a positive correlation between fitness and genomic heterozygosity will result. Similar conclusions hold for the situations of negative correlations. In cases where hi < hic at some loci and hi > hic at other loci, no correlation or either correlation could exist, which depends on the fitness effects and the number of loci with hi < hic relative to those with hi > hic.

The explanation for the commonly observed positive correlations between fitness (or its related traits such as developmental stability) and molecular marker loci may be complex. In populations that are not strictly panmictic with local inbreeding present, the genomic heterozygosity may be correlated with individual levels of inbreeding. Therefore, the associations between heterozygosity and fitness are largely the consequence of variation in the level of inbreeding among individuals (LEDIG et al. 1983 Down; STRAUSS 1986 Down). In randomly mating populations, if linkage disequilibrium is present (whether due to random genetic drift, selection or other causes), linked deleterious alleles under dominance at different loci will likely cause associated overdominance (HOULE 1989 Down, HOULE 1994 Down). This may also explain the positive correlations. However, we have particularly shown, via analytical approaches supplemented by simulations, two novel results for populations at genetic equilibria. The first is that the positive correlation can result with dominance. The second is that the positive correlation may not necessarily always reflect the phenomenon of heterosis, for the two reasons argued earlier: (1) the positive correlation may potentially exist even under within-locus additive allelic effects, and (2) the possible co-occurrence of inbreeding depression (heterosis) and negative correlations implies that the positive correlations may not be interpreted as equivalent to heterosis. In populations at genetic equilibria, functional overdominance may be an alternative and sufficient explanation (SMOUSE 1986 Down); however, it is not a necessary one. Therefore, even for populations at equilibria, the positive correlation observed cannot by itself be evidence for overdominance; it may not even be evidence for heterosis expected with dominance/over-dominance either, since it could potentially exist under pure additive allelic effects.

An implication of our results is that there is probably a limitation of the correlation approach for distinguishing the genetic mechanisms responsible for the maintenance of genetic variability. This is because of the following reasons. First, in equilibrium populations, a positive correlation can be explained by both dominance (0 < h < 1.0) and overdominance (h < 0) as long as h < hc (Equation 6Equation 7Equation 8Equation 9Equation 10). Second, the negative correlations and heterosis can co-occur under the same genetic conditions (Figure 1). Third, even for the same genetic effect (i.e., the same h) in one species, both correlations could be revealed in different populations if these populations have different genotype frequencies due to different population origins and histories (Equation 10). The limitation of the correlation approach in inferring the mechanisms responsible for the maintenance of genetic variability was also pointed out before on different grounds (e.g., HOULE 1994 Down; FU and RITLAND 1996 Down).

A potential application of the theoretical result here is that, for diallelic makers (such as those from RFLP), inference of the upper/lower bounds of h may be made given significant positive/negative correlations being found (Equation 6Equation 7Equation 8Equation 9Equation 10). h is an important genetic parameter in population and evolutionary genetics and has been difficult to estimate (even for its bounds, DENG 1997; DENG et al. 1997), especially for those organisms for which controlled breeding is difficult. Whereas, using Equation 6Equation 7Equation 8Equation 9Equation 10, the bounds of h may be estimated without controlled breeding with the application of the traditional correlation approach for fitness and genomic heterozygosity in natural populations.


*  ACKNOWLEDGMENTS

We thank Drs. D. CHARLESWORTH, M. LYNCH, R. CHAKROBARTY, and M. JOHNSON for discussions and comments. We are also grateful to Dr. Z-B. ZENG and three anonymous reviewers for their helpful comments that improved the paper. H.-W. DENG would like to thank Dr. M. LYNCH for years of advice, and Dr. D. HEDGECOCK for providing support to attend the conference "Genetic and Physiological Basis of Heterosis," which stimulated the development of this work. The work was in part supported by a FIRST AWARD from National Institutes of Health to Dr. Y.-X. FU. H.-W. DENG was supported by a grant from Health Future Foundation to Drs. R. RECKER and D. KIMMEL while finalizing this paper.

Manuscript received August 12, 1997; Accepted for publication November 7, 1997.


*  LITERATURE CITED
*TOP
*ABSTRACT
*THEORY
*DISCUSSION
*LITERATURE CITED

ALLENDORF, F. W., and R. F. LEARY. 1986 Heterozygosity and fitness in natural populations of animals, pp. 57–76 in Conservation Biology: The Science of Scarcity and Diversity, edited by M. E. SOULE. Sinauer Associates, Sunderland, MA.

BARKER, J. S. F., 1979  Interlocus interaction: a review of experimental evidence. Theor. Popul. Biol. 16:323-346[Medline].

CHAKRABORTY, R., 1981  The distribution of the number of heterozygous loci in an individual in natural populations. Genetics 98:461-466[Free Full Text].

CROW, J. F., 1986 Basic Concepts in Population, Quantitative and Evolutionary Genetics. W. H. Freeman, San Francisco.

CROW, J., and M. KIMURA, 1970 An Introduction to Population Genetics Theory. Harper & Row, New York.

DENG, H.-W. and M. LYNCH, 1996a  Change of genetic architecture in response to sex. Genetics 143:203-212[Abstract].

DENG, H.-W. and M. LYNCH, 1996b  Estimation of deleterious-mutation parameters in nature populations. Genetics 144:349-360[Abstract].

DENG, H.-W., 1998  Estimating coefficient of (over)dominance from inbreeding data and discriminating dominance and overdominance hypothesis for heterosis. Genetics in press.

DENG, H.-W., Y.-X. FU, and M. LYNCH, 1998  Inferring the major genomic mode of dominance and overdominance. Genetics in press.

FALCONER, D. S., and T. F. C. MACKAY, 1996 Introduction to Quantitative Genetics. Longman, New York.

FU, Y.-B. and K. RITLAND, 1996  Marker-based inference about epistasis for gene influencing inbreeding depression. Genetics 144:339-348[Abstract].

GAINS, M. R., L. R. MCCLENAGHAN, JR., and R. K. ROSE, 1978  Temporal patterns of allozymic variation in fluctuating populations of Microtus ochrogaster.. Evolution 32:723-739.

GAFFNEY, P. M., T. M. SCOTT, R. K. KOEHN, and W. J. DIEHL, 1990  Interrerlationships of heterozygosity, growth rate and heterozygote deficiencies in the coot clam, Mulinia lateralis.. Genetics 124:687-699[Abstract].

HALDANE, J. B. S., 1949  The association of characters as a result of inbreeding and linkage. Annals of Eugenics 15:15-23.

HARTL, D. L., and A. G. CLARK, 1989 Principles of Population Genetics. Sinauer Associates, Sunderland, MA.

HOULE, D., 1989  Allozyme-associated heterosis in Drosophila melanogaster.. Genetics 123:789-801[Abstract/Free Full Text].

HOULE, D., 1994  Adaptive distance and the genetic basis of heterosis. Evolution 48:1410-1417.

KIMURA, M., 1983 The Neutral Theory of Molecular Evolution. Cambridge University Press, Cambridge.

KIMURA, M., T. MARUYAMA, and J. F. CROW, 1963  The mutation load in small populations. Genetics 48:1303-1312[Free Full Text].

LEARY, R. F., F. W. ALLENDORF, and K. L. KNUDSEN, 1987  Differences in inbreeding coefficients do not explain the association between heterozygosity at allozyme loci and developmental stability in rainbow trout. Evolution 41:1413-1415.

LEDIG, F. T., R. P. GURIES, and B. A. BONEFELD, 1983  The relation of growth to heterozygosity in pitch pine. Evolution 37:1227-1238.

LEWONTIN, R. C., 1985  Population genetics. Annu. Rev. Genet. 19:81-102[Medline].

LI, W.-H., 1997 Molecular Evolution. Sinauer Associates, Sunderland, MA.

LYNCH, M. and H.-W. DENG, 1994  Genetic slippage in response to sex. Am. Nat. 144:242-261.

LYNCH, M., and B. WALSH, 1998 Genetics and Data Analysis of Quantitative Traits. Sinauer Associates, Sunderland, MA.

LYNCH, M., J. CONERY, and R. BURGER, 1995  Mutation accumulation and the extinction of small populations. Am. Nat. 146:489-518.

MAYNARD-SMITH, J., 1989 Evolutionary Genetics. Oxford University Press, Oxford.

MITTON, J. B. and M. C. GRANT, 1984  Associations among protein heterozygosity, growth rate, and developmental homeostasis. Ann. Rev. of Ecol. and Syst. 15:479-499.

MITTON, J. B. and B. A. PIERCE, 1980  The distribution of individual heterozygosity in natural populations. Genetics 95:1043-1054[Abstract/Free Full Text].

MORTON, N. E., J. F. CROW, and H. J. MULLER, 1956  An estimate of the mutational damage in man from data on consanguineous marriages. Proc. Natl. Acad. Sci. USA 42:855-863[Free Full Text].

NEI, M., 1987 Molecular Evolutionary Genetics. Columbia Univ. Press, New York.

OHTA, T. and C. C. COCKERHAM, 1974  Detrimental genes with partial selfing and effect on a neutral locus. Genet. Res. 23:191-200[Medline].

OHTA, T., 1973  Slightly deleterious substitutions in evolution. Nature 246:96-98[Medline].

OHTA, T., 1974  Mutation pressure as the main cause of molecular evolution and polymorphism. Nature 252:351-354[Medline].

PIERCE, B. A. and J. B. MITTON, 1982  Allozyme heterozygosity and growth in the tiger salamander, Ambystoma tigrinum.. J. Hered. 73:250-253[Abstract/Free Full Text].

SCHLEGER, G. and M. M. DICKIE, 1971  Natural mutation rates in the house mouse. Estimates for five specific loci and dominant mutations. Mutat. Res. 11:89-96[Medline].

SMIT-MACBRIDE, Z., A. MOYA, and F. J. AYALA, 1988  Linkage disequilibria in natural and experimental populations of Drosophila melanogaster.. Genetics 123:1043-1051.

SMOUSE, P. E., 1986  The fitness consequences of multiple-locus heterozygosity under the multiplicative overdominance and inbreeding depression models. Evolution 40:946-958.

STRAUSS, S. H., 1986  Heterosis at allozyme loci under inbreeding and crossbreeding in Pinus attenuata.. Genetics 113:115-134[Abstract/Free Full Text].

TURELLI, M. and L. V. GINZBURG, 1983  Should individual fitness increase with heterozygosity? Genetics 104:191-209[Abstract/Free Full Text].

ZAPATA, C. and G. ALVEREZ, 1992  On the detection of nonrandom association between DNA polymorphisms in natural populations of Drosophila.. Mol. Biol. Evol. 10:823-841[Abstract].

ZAPATA, C. and G. ALVEREZ, 1993  The detection of gametic disequilibria between allozyme loci in natural populations of Drosophila.. Evolution 46:1900-1917.

ZOUROS, E. and D. W. FOLTZ, 1987  The use of allelic isozyme variation for the study of heterosis. Isozymes 13:1-59[Medline].




This article has been cited by other articles:


Home page
GeneticsHome page
H.-W. Deng
Characterization of Deleterious Mutations in Outcrossing Populations
Genetics, October 1, 1998; 150(2): 945 - 956.
[Abstract] [Full Text]