- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Li, J.-L.
- Articles by Deng, H.-W.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Li, J.-L.
- Articles by Deng, H.-W.
The Effect of Overdominance on Characterizing Deleterious Mutations in Large Natural Populations
Jin-Long Lia, Jian Lia, and Hong-Wen Dengaa Osteoporosis Research Center and Department of Biomedical Sciences, Creighton University, Omaha, Nebraska 68131
Corresponding author: Hong-Wen Deng, Osteoporosis Research Center, Creighton University, 601 N. 30th St., Suite 6787, Omaha, NE 68131., deng{at}creighton.edu (E-mail)
Communicating editor: M. A. ASMUSSEN
| ABSTRACT |
|---|
Alternatives to the mutation-accumulation approach have been developed to characterize deleterious genomic mutations. However, they all depend on the assumption that the standing genetic variation in natural populations is solely due to mutation-selection (M-S) balance and therefore that overdominance does not contribute to heterosis. Despite tremendous efforts, the extent to which this assumption is valid is unknown. With different degrees of violation of the M-S balance assumption in large equilibrium populations, we investigated the statistical properties and the robustness of these alternative methods in the presence of overdominance. We found that for dominant mutations, estimates for U (genomic mutation rate) will be biased upward and those for
(mean dominance coefficient) and
(mean selection coefficient), biased downward when additional overdominant mutations are present. However, the degree of bias is generally moderate and depends largely on the magnitude of the contribution of overdominant mutations to heterosis or genetic variation. This renders the estimates of U and
not always biased under variable mutation effects that, when working alone, cause U and
to be underestimated. The contributions to heterosis and genetic variation from overdominant mutations are monotonic but not linearly proportional to each other. Our results not only provide a basis for the correct inference of deleterious mutation parameters from natural populations, but also alleviate the biggest concern in applying the new approaches, thus paving the way for reliably estimating properties of deleterious mutations.
THE genome of any organism is subject to continuous bombardment of mutations, the majority of which are deleterious. Numerous theories based on the assumptions of deleterious genomic mutations have been developed to explain some fundamental phenomena in biology. These phenomena include (but are not limited to) the evolution of sex and recombination (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
For the rest of the Introduction, the following definitions and distinctions between dominance and overdominance are in order. For a locus with alleles A and a, let the three genotypic values of fitness be

Here, h is the dominance coefficient, where h < 0 implies overdominance, h = 0.5 implies additivity, and 0
h
1.0 (h
0.5) implies dominance. Note that we use "dominant" or "dominance" to refer to cases of complete dominance and partial dominance. Mutations with (over)dominant effects are referred to as (over)dominant mutations. Deleterious genomic mutations generally refer to dominant mutations. Dominance is compatible with mutation-selection (M-S) balance; overdominance essentially encompasses all kinds of balancing selection at the allelic level (![]()
Three essential parameters of deleterious genomic mutations are (1) the genomic mutation rate (U), (2) the mean selection coefficient (
), and (3) the mean dominance coefficient (
). For the three essential parameters, there are now three approaches for estimation:
- The mutation-accumulation (M-A) approach (
BATEMAN 1959 ;
MUKAI 1964 ;
MUKAI et al. 1972 ): This technique estimates U and s. Most estimates have come from this approach applied to Drosophila melanogaster (
MUKAI 1979 ;
CROW and SIMMONS 1983 ;
KEIGHTLEY 1994 ,
KEIGHTLEY 1996 ) and have been very hard to acquire, requiring large and long-term M-A and special chromosomal constructs or inbred/asexual lines. The data from M-A can also be analyzed by the maximum-likelihood method (
KEIGHTLEY 1994 ) or the minimum-distance method (
GARCIA-DORADO 1997 ).
- The inbreeding depression approach (
MORTON et al. 1956 ;
CHARLESWORTH et al. 1990 ): Requiring a
value that must be assumed or that cannot be estimated without bias (CABALLERO et al. 1997 ;
DENG and FU 1998 ;
DENG 1998A ), this technique per se estimates U only. In the highly selfing annual plants Leavenworthia (
CHARLESWORTH et al. 1994 ) and Amsinckia (
JOHNSTON and SCHOEN 1995 ), U estimates from this approach are in line with earlier ones from M-A in Drosophila, suggesting high deleterious genomic mutation rates.
- The fitness moments approach (
DENG and LYNCH 1996 ,
DENG and LYNCH 1997 ;
DENG 1998B ): This approach estimates U, h, and s. For two outcrossing species of cyclical parthenogenetic Daphnia (a freshwater microcrustacean), preliminary estimates by this approach generally agree with earlier ones from other species (
DENG and LYNCH 1997 ) and those from the direct M-A approach in Daphnia (
LYNCH 1985 ;
LYNCH et al. 1998 ).
The last two approaches depend on the change in mean (and genetic variance) of fitness traits upon only one generation of mating in large selfing or outcrossing populations. In comparison, the first approach is much more time-consuming and requires many generations of M-A. None of the current experimental designs and statistical methods can estimate mutation parameters without bias. Under a number of biologically plausible conditions, the statistical properties of the above three approaches were compared (![]()
An essential assumption common to the last two approaches is that all the genetic variation in the study population is maintained under M-S equilibrium. Accordingly, changes in the mean and genetic variance of fitness (or its components) upon inbreeding or outcrossing are solely due to deleterious dominant mutations maintained by M-S balance. Even in large populations, despite tremendous efforts (e.g., ![]()
![]()
![]()
![]()
![]()
![]()
![]()
The robustness of the approaches applied to natural populations has been investigated under a range of biologically plausible conditions, such as variable and/or epistatic mutation effects, etc. (![]()
![]()
![]()
![]()
![]()
are underestimated and
is overestimated. The direction and the magnitude of the bias revealed may provide a numerical basis for the close inference of deleterious genomic mutations. However, estimation under violation of the M-S balance assumption has never been investigated. It is intuitive that violation of the M-S balance assumption will result in biased estimates (![]()
The M-S balance assumption can be violated in several scenarios, such as in small populations subject to random genetic drift or in large populations subject to balancing selection due to functional overdominance and/or fluctuating selection at the allelic level. Each scenario deserves careful consideration and thus separate treatment. The two approaches applicable to natural populations were originally devised for large populations at approximate equilibrium. Hence, we investigate estimation in large natural populations with genetic variance maintained by either M-S balance or balancing selection, and with inbreeding depression caused by either dominant or overdominant mutations. The study is conducted by computer simulations using algorithms we devised previously (![]()
![]()
![]()
The experimental designs to characterize deleterious genomic mutations are different depending on the study population's mating type (![]()
![]()
![]()
In this article, we first outline the simulations and develop the associated analytical derivations in outcrossing and selfing populations. Then we present the simulation results in these two types of populations for the fitness moments approach and the inbreeding depression approach for both constant and variable mutation effects. Finally, we discuss the implications of our current results for characterizing deleterious genomic mutations from natural populations.
| SIMULATIONS |
|---|
The direction and the magnitude of the bias under balancing selection with overdominance are of particular interest to geneticists. To focus on this, we assume that genotypic values are measured accurately. In reality, this would require that each genotype be clonally replicated and assayed a large number of times. Ignoring measurement error for genotypic values reduces the sampling error of estimates, but is unlikely to bias either the estimation or the comparison of the techniques, assuming that the same number of genotypes would be handled experimentally. This is supported by our previous investigations (![]()
![]()
![]()
![]()
![]()
![]()
![]()
Large outcrossing populations at equilibrium are constructed with some dominant loci maintained under M-S balance and other overdominant loci maintained by balancing selection. In large selfing populations, overdominance does not contribute to the maintenance of genetic variability (because of constant exposure to the homozygous state under selfing), and mutations of overdominant effects are also maintained by M-S balance (![]()
![]()
Mutation effects on fitness across all loci are assumed to be multiplicative throughout, an assumption that is biologically plausible (![]()
![]()
![]()
![]()
![]()
![]()
![]()
Outcrossing populations:
Loci of constant dominant mutation effects mixed with overdominant loci:
At dominant loci at M-S balance, the number of mutations per individual (after selection, all in the heterozygous state) is Poisson distributed with an expectation of
=
(![]()
![]()
![]()

where hi and si are the dominance and selection coefficients of the ith locus with mutations. They are assumed to be constant initially and made variable later. Wmax is the fitness of a genotype that is free of segregating deleterious genomic mutations in the experimental environment where the measurements are taken. This parameter serves as a scaling factor so that fitness can be on any scale instead of from 0 to 1.
For a genotype sampled from the selfed progeny generation, the fitness is

where n1 and n2 are, respectively, the numbers of loci with mutations in heterozygous and homozygous states. n1 and n2 are determined from two levels of random sampling: (1) A number (n) of loci is randomly determined from the Poisson distribution with mean
=
. (2) Each of these n loci has a probability of 1/4 in the selfed progeny of being homozygous for the normal A allele, a probability of 1/2 of being a heterozygote Aa, and a probability of 1/4 of being a homozygote aa. After the genotypic status of each locus is determined as above, n1 and n2 are, respectively, the sum of loci heterozygous and homozygous for mutations.
Now consider the overall individual fitness with N polymorphic overdominant loci in the genome in addition to those dominant loci at M-S balance. At an overdominant locus with effect ho < 0 and so in large populations, the equilibrium frequency of the more fit allele B is p =
(![]()
. With N such additional overdominant polymorphic loci in the population, the overall fitness of a random parental individual now becomes

where n3 and n4 are, respectively, the numbers of overdominant loci with genotypes Bb and bb in the genome of this individual, and n is defined earlier for the dominant loci. n3 and n4 are determined by random sampling, N times, from the trinomial distribution, with genotypes Bb and bb having the frequencies of 2pq and q2, respectively. Recall that the population is assumed to be randomly mating. Different N values are assumed in simulations.
Upon selfing, the overall fitness of a selfed progeny whose parent has n3 overdominant loci with the Bb genotype and n4 overdominant loci with the bb genotype, is

where the term in the brackets has been explained earlier for the dominant loci. n5 and n6 are the number of loci in the selfed progeny heterozygous (Bb) and homozygous (bb) at the n3 heterozygous overdominant loci of the parent. They are determined by random segregation during selfing of the parent, in the same manner as n1 and n2.
Once the desired samples of K individuals from the parent and selfed progeny generations are simulated, we estimate the parameters of deleterious genomic mutations on the basis of the assumption of pure dominant mutations maintained under M-S balance (![]()
o and
2o denote the mean and genetic variance of fitness in the outcrossed parental generation, respectively; and let
s and
2s denote the corresponding values among the selfed progeny generation, respectively. These can be computed easily from simulated data (![]()
![]() |
(1) |
Then
![]() |
(2a) |
![]() |
(2b) |
![]() |
(2c) |
If a value of h is assumed by external knowledge or estimated by other experimental designs and estimation methods, U can then be estimated from the change in the mean fitness upon selfing by Equation 2b (![]()
![]()
![]()
![]() |
(3) |
For investigating the inbreeding depression approach in outcrossing populations, a set of M selfing families, each having S selfed progeny, is simulated as above to estimate h (Equation 3). This set of simulated selfing families and another set of L selfing families (each with one selfing parent and one selfed offspring) are employed to estimate inbreeding depression and then U (Equation 2b). Unless otherwise specified, M = 10, S = 40, and L = 20. This choice of parameters is shown to be efficient on the basis of our previous investigation (![]()
![]()
Two aspects of overdominant mutations concern geneticists most and are directly relevant to characterizing deleterious genomic mutations from natural populations. One is the contribution of dominant mutations to heterosis (the mean fitness of the outcrossed generation to the inbred generation) relative to that of overdominant mutations. The other is the magnitude of genetic variation due to dominant mutations maintained under M-S balance relative to that due to overdominant mutations maintained by balancing selection.
The contribution to the total heterosis upon selfing from the dominant mutations can be measured by the index
![]() |
(4) |
The index
plays an important role. Compared with a similar index (
, constructed on the original fitness scale) of ![]()
here represents the proportion of heterosis on the log fitness scale that is attributable to dominant mutations. Therefore,
ranges from 0 to 1. If
= 1, the sole cause of heterosis is dominance; if
= 0, it is overdominance. The smaller the
, the larger the contribution to heterosis from overdominant mutations.
To measure the magnitude of genetic variation from dominant mutations maintained under M-S balance relative to that from overdominant mutations maintained by balancing selection, we define the index
![]() |
(5) |
Dominant loci with variable mutation effects mixed with overdominant loci:
Deleterious mutation effects hi and si, across loci are unlikely to be constant. For example, si may vary anywhere from 0 (neutral mutation) to 1 (lethal mutation). The rate of mutations with different effects may also vary so that mutations of smaller effects may occur at higher rates. To evaluate the direction and the magnitude of bias introduced jointly by variable mutation effects and overdominant mutations, as in ![]()
![]() |
(6a) |
Also we let
![]() |
(6b) |
By Equation 6b, hi and si are correlated. These are in rough accordance with the few available data (![]()
![]()
![]()
![]()
![]()
= 0.36 when
= 0.03, h
0.5 as s
0, and h
0.0 as s
1.0, all in rough accordance with the data (![]()
![]()
(Equation 4) and ß (Equation 6aEquation 6b) can be constructed using the results in APPENDIX A.
In simulations, we divide the entire range of s (0 - 1) into 100 discrete classes of width 0.01. Within each class, mutations have constant effects (hi and si). Each individual from the outcrossed parental generation in the simulation is assigned a number (ni) of heterozygous mutations from the ith of these classes by drawing from a Poisson distribution with expectation Upi/(hisi), where pi is the density of the mutational distribution in the ith class. For an individual from the selfed progeny generation, ni's are first determined as above. Then for each of the ni loci, the genotype is, as before, determined by randomly sampling from the trinomial probabilities so that probabilities for different genotypes are 1/4 for AA, 1/2 for Aa, and 1/4 for aa, respectively (due to random segregation during selfing of parents). This discrete treatment closely approximates the continuous distribution of mutation effects (H.-W. DENG, unpublished data).
Selfing populations:
To estimate deleterious genomic mutations, selfed individuals from natural selfing populations are crossed randomly to obtain outcrossed progeny. In selfing populations, new mutations in the genome most likely follow a Poisson distribution, whether they involve dominant or overdominant mutations. In highly selfing populations, mutant alleles will be maintained by M-S balance, regardless of their (over)dominance (![]()
o and constant effects ho and so. If the genomic mutation rate to the overdominant (but less fit) allele a is Uo, it can be easily shown that at M-S equilibrium,
o =
(![]()
![]()
In each situation, a variable number K of individuals is randomly sampled from the selfed parental and outcrossed progeny generations, respectively. For a genotype with n dominant and n7 overdominant mutations [randomly determined from the Poisson distribution with mean U/(2s) and Uo/(2so), respectively] from the selfed parental generation, the fitness is

For an outcrossed progeny resulting from crossing two selfed parents (with nf, n7 and nm, n8 homozygous loci for dominant and overdominant mutations, respectively, where the subscript f indicates female parent and m the male parent), its fitness is

hi and si are the dominance and selection coefficients of the ith locus with dominant mutations. They are assumed to be constant initially and made variable later.
In selfing populations, the indices
and ß defined in Equation 4 and Equation 5 can be constructed from the derivations in Appendix 1 for the constant and variable dominant mutation effects, respectively. In simulated populations, the genome contains both dominant and overdominant loci, all at M-S equilibrium. In the parental generation, the number of homozygous dominant loci in each individual is determined by random sampling from a Poisson distribution of mean U/(2s), and the number for the overdominant loci is determined from a Poisson distribution with mean
o [=
] (![]()
![]()
![]()
![]()
Once the desired samples of K individuals from the selfed parent and the outcrossed progeny generations are simulated, the estimation developed on the basis of the assumption of pure dominant mutants maintained under M-S balance (![]()
o,
2o and
s,
2s be the mean and genetic variance of the fitness in the outcrossed progeny and selfed parental generations, respectively. Let x, y, z be defined as in Equation 1; then
![]() |
(7a) |
![]() |
(7b) |
![]() |
(7c) |
To apply the inbreeding depression approach to estimate U (![]()
![]()

Once h is estimated, U can be estimated by Equation 7b. A sample of 200 outcrossing families, each consisting of two selfed parents and one outcrossed offspring, is simulated to implement the inbreeding depression approach. The total number of genotypes employed in the experiment is 600.
In simulations, we arbitrarily let Wmax = 1, as the values of Wmax do not influence the estimation for the mutation parameters (![]()
= ß = 0), U = 1.0,
= 0.36, and
= 0.03, which are close to the most often cited values estimated by ![]()
![]()
= 0.010.05) and experimental designs have also been performed. The results are similar and thus not presented. Because almost all the results are biased, the MSE is presented together with one standard deviation (SD) computed over the repeated simulations.
| RESULTS |
|---|
Outcrossing populations
Constant dominant mutation effects:
The fitness moments approach (Table 1):
With only deleterious dominant loci in the genome (N = 0 and
= ß = 1), the estimates for U, h, and s are unbiased. Recall that N is the number of polymorphic overdominant loci in the population and
and ß are, respectively, the proportion of heterosis and genetic variation on the log fitness scale that is attributable to dominance mutations. With overdominant loci coexisting in the genome with deleterious dominant loci (N > 0 and 0 <
, ß < 1), Û (^ indicates an estimated value) is an overestimate, while h and s are underestimated. The degree of bias increases with increasing contributions from overdominance to heterosis (decreasing
) and to the standing genetic variation in the population (decreasing ß). Generally, the bias is not dramatic so that estimates of the upper bound of U and lower bounds of h and s can be obtained, and these estimates are close to the true parameter values. All the sampling errors are quite small. Even with only overdominant mutations in the genome (
= ß = 0), estimates of U,
, and
can still be obtained, although the parameter values do not exist for the dominant mutations. In this case, it is not incorrect to treat Û as an upper limit for the true U of zero. This is understandable, because, upon selfing (or outcrossing in selfing populations), overdominant mutations will also cause mean and genetic variance of fitness to change, similar to those changes caused by dominant mutations. This will be similar in every case, and thus will not be repeated. The estimation bias is relatively more sensitive to a change of ho than to a change of so. With a larger absolute value of ho, the degree of bias increases.
|
The inbreeding depression approach (Table 2):
With N = 0 and
= ß = 1, the estimates for U and h are nearly unbiased. With N > 0 and 0 <
, ß < 1, U is generally overestimated, while h is underestimated. The degree of bias generally increases with decreasing
and ß. Compared with the fitness moments approach, the bias is larger for
and smaller for Û. The smaller bias of Û is largely due to the greatly underestimated
. This can be understood from Equation 2b or Figure 1 in ![]()
. However, the estimation of U suffers from large sampling errors, even though the number of genotypes employed (450) is larger than that for the fitness moments approach (400). When both sampling error and bias are considered, the estimation of U by the inbreeding depression approach is generally worse than that by the fitness moments approach, as reflected by the larger MSE. The statistical properties (mean and sampling variance) of Û are relatively unstable with changes of a and ß. This instability is largely due to the relatively small sample size employed. When overdominance contributes importantly to the heterosis and standing genetic variation in natural populations (with small
and ß), Û is unacceptable even as an estimate for the upper limit because of the large sampling error.
estimated by DENG's (1998b) method can serve well as a lower bound of the true h as evidenced by its small sampling error.
|
|
Variable dominant mutation effects:
The fitness moments approach (Table 3):
With N = 0 and
= ß = 1, U and
are underestimated and
is overestimated. With N > 0 and 0 <
, ß < 1,
is always biased downward, and the magnitude of bias and sampling variance do not change much with changing
and ß. The degree of bias is relatively small so that
0.67
. The small bias and sampling variance of
render it an ideal estimate of the lower limit for the true
, and it is close to the true parameter value. The bias of Û and
changes so that Û and
are not always biased. When
and ß are relative large, so that overdominance does not contribute substantially to the heterosis and to the genetic variation in the population, U and
are both underestimated. When
and ß gradually decrease, so that overdominance contributes more to the heterosis and the standing genetic variation in the population, Û and
become unbiased and then overestimated. For the same magnitude of
or ß, with different parameters ho and so, the degree of bias for Û,
, and
is different. This is also true throughout this study and is not repeated.
|
It should be noted that with different ho and so parameters for overdominant mutations, the same
may correspond to a different ß. This can be inferred from the corresponding Equation 4 and Equation 5 and those in Appendix 1 and B. It is also evident in every case as can be seen from the numerical values of Table 1 Table 2 Table 3 Table 4 Table 5 Table 6 Table 7 Table 8 for outcrossing and selfing populations and for constant and variable mutation effects. To illustrate the monotonic but nonlinear relationship between
and ß, Figure 1 plots the values of
and ß for constant and variable mutation effects in both outcrossing and selfing populations.
|
|
|
|
|
The inbreeding depression approach (Table 4):
With N = 0 and
= ß = 1, the estimates for U and
are both biased downward. With N > 0 and 0 <
, ß < 1, U is generally underestimated when
and ß are relatively large and is only overestimated when
and ß are quite small. However, the sampling variance of Û is usually large. On the other hand,
is always biased downward and the sampling variance is miniscule. With decreasing a and ß, the degree of bias of
increases.
can serve reasonably well as a lower bound of the true
.
Selfing populations
Constant dominant mutation effects:
The fitness moments approach (Table 5):
With N = 0 and
= ß = 1, the estimates for U, h, and s are unbiased. With N > 0 and 0 <
, ß < 1, U is overestimated, while h and s are underestimated. The degree of bias increases with decreasing
and ß. However, the bias is not so dramatic that the upper bound of U and lower bounds of h and s can be estimated, and that they are not wildly far away from the true parameter values. The estimation bias is not very sensitive to changes in ho and so, especially for
and
.
The inbreeding depression approach (Table 6):
With N = 0 and
= ß = 1, the estimates for U and h are nearly unbiased. With N > 0 and 0 <
, ß < 1, U is generally overestimated, while h is underestimated. The degree of bias generally increases with decreasing
and ß. Compared with the fitness moments approach, the bias is larger for
and smaller for Û. The smaller bias of Û is largely due to the greatly underestimated
. This can be understood from Equation 2b or Figure 1 in ![]()
. Compared with outcrossing populations under constant mutation effects with a comparable sample size of genotypes, the sampling error for Û is relatively small, and hence Û can serve well as an estimate for the upper limit.
estimated by Mukai's method (![]()
Variable dominant mutation effects:
The fitness moments approach (Table 7):
With N = 0 and
= ß = 1, the estimates for U and
are biased downward and the estimates for
are biased upward. With N > 0 and 0 <
, ß < 1,
is always biased downward, and the magnitude of the bias increases slightly with decreasing
and ß, while its sampling variance remains relatively stable.
ranges from 0.77
to 0.5
. The relatively small bias and sampling variance of
render it an ideal estimate of the lower limit for
. The direction and the magnitude of the bias of Û and
change so that Û and
are not always biased. When
and ß are relatively large, so that overdominance does not contribute substantially to the heterosis and the standing genetic variation in the population, U and
are both underestimates. When
and ß gradually decrease, Û and
become unbiased and then overestimated. However, for Û and
to become biased upward,
and ß need to be quite small (
< ~0.56, ß < 0.84) so that overdominance contributes substantially to heterosis and the standing genetic variation in the populations.
The inbreeding depression approach (Table 8):
With N = 0 and
= ß = 1, the estimates for U and
are biased. With N > 0 and 0 <
, ß < 1, U is generally underestimated when
and ß are relatively large and is only overestimated when
and ß are quite small. It should be noted that, as with the case for the outcrossing populations, when overdominant mutations are present but do not contribute substantially to heterosis and genetic variation, the bias of Û is smaller than under dominant mutations. This is because the directions of estimation bias caused by overdominant mutations and variable effects of dominant mutations are opposite and they cancel each other, resulting in smaller (or no) bias. The extent of the bias depends on the parameters under estimation and
and ß parameter values. The sampling variance of Û is small.
is always biased downward and the sampling variance is miniscule. With decreasing
and ß, the degree of bias of
increases.
can serve well as a lower bound of the true
.
| DISCUSSION |
|---|
Using extensive simulations, we investigated the effect of overdominant mutations on characterizing deleterious dominant mutations by the two existing estimation approaches (![]()
![]()
![]()
![]()
![]()
and
biased downward by overdominant mutations. However, the degree of bias is generally moderate and depends on the magnitude of the contribution of overdominant mutations to heterosis or genetic variation. This renders the estimates of U and
not invariably biased under variable mutation effects, which when working independently will almost always cause U and
to be underestimated. We also note that the contributions to heterosis and genetic variation from overdominant mutations are monotonic but not linearly proportional to each other. Our results may not only provide a basis for correct inferences about deleterious mutations from natural populations, but may also alleviate the biggest concern and obstacle in applying the inbreeding depression and fitness moments approaches, thus paving the way for efficiently characterizing deleterious genomic mutations from large natural populations.
Although it is intuitive that the two approaches will yield biased estimates (![]()
unbiased. It has been stipulated (![]()
![]()
Our simulation results not only reveal the robustness and statistical properties of the current approaches to characterize deleterious dominant mutations in natural populations, but also shed light on the relative efficiencies of the different approaches in different populations. Although the relative efficiencies of all the three available approaches (as outlined in the Introduction) were investigated earlier (![]()
. The inbreeding depression approach is sometimes better for the estimation of U; however, the better estimation is achieved because of a greatly biased estimation of
. Therefore, it is not the original inbreeding depression approach per se that achieves the better estimation for U. It is actually the greatly underestimated
by the estimation methods chosen that leads to the less biased U in the inbreeding depression approach. Therefore, the estimation of U by the inbreeding depression approach would greatly depend on the methods chosen for the estimation of
. With less biased estimates or assumed values for
, simulation results not shown here indicate that the U estimation by the inbreeding depression approach is much worse statistically than that of the fitness moments approach.
The issue of dominance and overdominance has been under debate for decades in genetics (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
and ß, provides a basis for investigating a number of other genetic issues related to the contribution of dominant and overdominant mutations to inbreeding and the standing genetic variation in natural populations.
It has long been recognized that, when dominant and overdominant mutations coexist, the heterosis and standing genetic variation will be affected by both. However, the disproportional contributions of overdominant mutations to heterosis and to standing genetic variation have not been documented before. This phenomenon may form a basis for discerning the relative importance of dominant and overdominant mutations in the genome. Studies have been initiated along this line of research. It is worthy of note that, for overdominant mutations to contribute relatively importantly to the standing genetic variation, a substantial proportion of heterosis must be caused by overdominant mutations. This is especially true when overdominant mutations contribute to less than half of the heterosis (
> 0.5; Figure 1).
For any theory to be of great significance, its underlying assumptions must be examined closely and the important parameters must be estimated. There is no doubt that any genome is subject to continuous bombardment of deleterious genomic mutations. However, no amount of theoretical argument can resolve the issues concerning the importance of deleterious genomic mutations without the important parameters being estimated. Indisputably, characterizing deleterious genomic mutations is extremely important. However, even if the importance is realized by more and more scientists and revealed in more and more biological aspects, the estimates are astonishingly few and thus are imperatively needed (![]()
![]()
![]()
![]()
![]()
![]()
![]()
| ACKNOWLEDGMENTS |
|---|
H.-W. Deng thanks Professor M. Lynch for years of advice, continuous encouragement, and support. We are very grateful to Professor Marjorie A. Asmussen and three anonymous reviewers for their extremely careful comments that helped to improve the article. We thank Drs. Robert R. Recker and Mark Johnson and Ms. Carolyn Meeks for careful editing of the manuscript. The work was partially supported by a grant from National Institutes of Health (R01 AR45349) and a Health Future Foundation grant from Creighton University, Nebraska, and by graduate student tuition waiver to J.-L.L. and J.L. from the Department of Biomedical Sciences of Creighton University.
Manuscript received April 20, 1998; Accepted for publication October 30, 1998.
| APPENDIX 1 |
|---|













with Deng's method and U with the inbreeding depression approach to characterize variable dominant mutations in the presence of overdominant mutations in outcrossing populations