- THIS ARTICLE
- Full Text (PDF)
-
All Versions of this Article:
genetics.106.067173v1
176/2/1371 most recent - Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Goudet, J.
- Articles by Martin, G.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Goudet, J.
- Articles by Martin, G.
Originally published as Genetics Published Articles Ahead of Print on March 4, 2007.
Genetics, Vol. 176, 1371-1374, June 2007, Copyright © 2007
doi:10.1534/genetics.106.067173
Under Neutrality, QST
FST When There Is Dominance in an Island Model
Jérôme Goudet*,
,1 and
Guillaume Martin*
* Department of Ecology and Evolution, UNIL, CH-1015 Lausanne, Switzerland and
Department of Genetics, University of Cambridge, Cambridge CB2 3EH, United Kingdom
1 Corresponding author: Department of Ecology and Evolution, Biophore, UNIL-SORGE, University of Lausanne, CH-1015 Lausanne, Switzerland.
E-mail: jerome.goudet{at}unil.ch
TO test whether quantitative traits are under directional or homogenizing selection, it is common practice to compare population differentiation estimates at molecular markers (FST) (WRIGHT 1951) and quantitative traits (QST) (SPITZE 1993). If the trait is neutral and its determinism additive, then theory predicts that QST = FST, while QST > FST is predicted under directional selection for different local optima, and QST < FST is predicted under homogenizing selection (MERILA and CRNOKRAK 2001). GOUDET and BÜCHI (2006) recently evaluated the effects of dominance, inbreeding, and sampling design on QST for neutral traits. Under dominance, GOUDET and BÜCHI (2006) found that (1) dominance decreases on average the value of QST relative to FST (i.e., QST FST
0), (2) the magnitude of the contrast QST FST increases with population differentiation (i.e., with increasing FST), and (3) dominance is unlikely to lead to QST FST > 0.
In a recent letter to GENETICS, LOPEZ-FANJUL et al. (2007) questioned the evidence leading to these claims. In particular, they criticized GOUDET and BÜCHI (2006) for using averages over allele frequencies and dominance deviations. Here, taking an analytical approach similar to that used in LOPEZ-FANJUL et al. (2007), we first show that under an island model, the result QST
FST with dominance obtained by GOUDET and BÜCHI (2006) is strictly true over all allelic frequencies and dominance deviations. We then argue that independently of the underlying population structure, averaging over allele frequencies and dominance deviations is pertinent, since quantitative traits are polygenic, and this is what empiricists study when they estimate QST. We conclude by emphasizing that the major problem faced by empiricists is not the slight negative bias in QST due to nonadditive effects, but the very large variance in this quantity, particularly when the number of samples is small.
As LOPEZ-FANJUL et al. (2007) and GOUDET and BÜCHI (2006) used different parameterizations, some clarification might be useful. GOUDET and BÜCHI (2006) used a, d, and a, while LOPEZ-FANJUL et al. (2007) used 1 s, 1 hs, and 1 for the genotypic values of AA, AB, and BB, respectively. These two notation schemes are equivalent when a = 1 (and therefore s = 2) and
.
To obtain the expectation of
and
, four quantities are needed: gene diversities within [
] and over all [
] populations, where q represents the frequency of the recessive allele and np the number of populations, as well as the additive variance within (VAW) and between (VB) populations. Under strict additivity, these four quantities are functions of the first and second moments of the distribution of allele frequencies only. However, under dominance (when
), VAW and VB also depend on the third and fourth moments of this distribution. As we see below, the difference between LOPEZ-FANJUL et al. (2007) results and those of GOUDET and BÜCHI (2006) stems partly from the assumed distribution of allele frequencies.
We consider exactly the same genetical setup as in LOPEZ-FANJUL et al. (2007): a biallelic locus with dominance h, additive effect s, and allele frequency q. Using their notation, the mean for the trait, in any given population, is given by M = 1 2qhs q2s(1 2h). The variance of trait mean among populations is given by VB = E(M2) E(M)2, where E denotes expectation with respect to the distribution of q among populations. This turns out to be a polynomial function of the first four moments of allelic frequencies. The additive variance in a given population is given by VAW = 2
q(1 q) [where
= s(h + (1 2h)q is the average effect; LOPEZ-FANJUL et al. 2003]. With
the expectation of this additive variance among populations, E(VAW), is also a polynomial function of the first four moments of allele frequencies.
The expectations of these quantities are obtained by replacing q, q2, q3, and q4 in their expressions by the first, second, third, and fourth moments of allele frequency distribution (just as in LOPEZ-FANJUL et al. 2007). In their letter, LOPEZ-FANJUL et al. (2007) consider a specific model where isolated lines diverge by drift from an infinitely large panmictic population (pure drift model, PDM), while GOUDET and BÜCHI (2006) considered the classical island model (IM) at equilibrium between migration and drift. For the infinite island model at equilibrium, and for a biallelic locus, the distribution of allelic frequencies follows a beta distribution with parameter 4Nm,
![]() |
is the average allele frequency among populations (for the multiallelic locus equivalent, the pertinent distribution is a Dirichlet, as used in GOUDET and BÜCHI 2006; see RANNALA and HARTIGAN 1996).
The first four moments of the beta distribution can be expressed in terms of
as follows:
![]() | (1) |
Under the PDM, the moments of the allele frequency distribution are given by Equations 14 in LOPEZ-FANJUL et al. (2002). Note that the two models yield the same first and second moments, but differ in their third and fourth moments, which also influence VB and VAW when
.
Figure 1A portrays the dynamics of the first four allele frequency moments of the two models, assuming an overall allele frequency of q = 0.9. This frequency was chosen because this is the frequency that gave rise to the largest positive difference between QST and FST in LOPEZ-FANJUL et al. (2007) (see Figure 1 of their article). The discrepancies between the third and fourth moments are barely notable in Figure 1A, but when the differences between these moments are plotted against Ft (Figure 1B), we can clearly see that the expected third and fourth moments of allelic frequency are smaller in the IM than in the PDM. Figure 1C shows that these tenuous differences lead to a qualitative change in the difference QST FST. While in the PDM (dotted lines) this difference can be positive or negative, it is always negative in the IM (for any s and any h < 0.5). The formulas used by LOPEZ-FANJUL et al. (2007) therefore do not apply to an island model (at least at equilibrium), contrary to what they claim. And, since none of the overall allelic frequencies give rise to a positive difference between QST and FST in the IM, it can be concluded that the results of GOUDET and BÜCHI (2006) are true both for a single diallelic locus under the IM and for more realistic multilocus traits, at least under the IM and the PDM. GOUDET and BÜCHI (2006) were therefore correct in identifying the population model as a likely reason for the discrepancies between their results and earlier work by LOPEZ-FANJUL et al. (2003). Finally, Figure 1D confirms that simulations of a single diallelic locus with overall recessive frequency of 0.9 and h = 0 give results entirely consistent with the theoretical expectations under an IM.
|
LOPEZ-FANJUL et al. (2007) also assert that contrary to the results of GOUDET and BÜCHI (2006), the difference between QST and FST is largest around FST = 0.5 and decreases thereafter. It is clear that for extremely large population differentiation (e.g., FST
1) QST and FST should be equal whatever the dominance deviation. This is because when populations are entirely differentiated, no heterozygotes remain, and hence dominance is not expressed. Figure 1, C and D, shows that in the IM the maximum difference between QST and FST due to dominance is obtained for differentiation values of
0.70.8 rather than 0.5. A cursory survey of the literature shows that FST
0.7 is seldom reported. Accordingly, the simulation scheme in GOUDET and BÜCHI (2006) did not investigate FST-values much larger than 0.8.
Although it was just shown that under an island model dominance always leads to QST
FST, one might be worried that this result depends on the underlying model of population structure. Indeed, the IM (which assumes an equilibrium between migration and drift) is by no means more realistic than the PDM (which assumes no migration), for which dominance does give rise to the pattern QST > FST for certain allelic frequencies. It is essential to realize that all these results are the expectation of the difference between QST and FST for a single diallelic locus (in the analytical section of GOUDET and BÜCHI 2006, p. 1339 gives the expression of QST for a single diallelic locus independently of the underlying model of population structure, from which it is possible to derive the conditions under which QST > FST). As soon as several loci are considered, and because the parameter space leading to QST > FST is so narrow, the general consequence of dominance deviations is QST
FST. LOPEZ-FANJUL et al. (2007) criticized GOUDET and BÜCHI (2006) for using a far too wide parameter space in their simulations. First, as shown above for the IM, QST
FST independently of the distribution of s and h. For the PDM, restricting simulation scenarios within what is known about allelic frequencies and dominance deviations of recessive deleterious alleles (
, low q) also leads to QST
FST. The conditions leading to QST > FST for a dominant trait are rather unlikely: over and above it occurring only for a specific population model (PDM), it also requires the frequencies of the recessive alleles at most loci coding for the trait to be large. While this might happen for an isolated locus, it is extremely unlikely that it will occur for any real traits (for which, for a neutral trait, we expect a symmetric distribution of q among loci). Thus, while for a single diallelic locus the relation between QST and FST depends on the underlying model of population structure, for a polygenic trait the different models of population structures consistently lead to QST
FST.
It is perhaps unfortunate that LOPEZ-FANJUL et al. (2007) drew attention only to one of the conclusions of GOUDET and BÜCHI (2006), namely the slight bias in QST due to dominance, and did not take note of the rest of the article, where GOUDET and BÜCHI (2006) quantified the variance in QST under several experimental designs. From Figure 1 (and Figure 2 of LOPEZ-FANJUL et al. 2007), we see that the expected bias in QST seldom exceeds 10% of the value of FST. This would matter if the variance of QST was small. But GOUDET and BÜCHI (2006) found that unless the number of populations analyzed to estimate QST is very large (e.g., >20), only extremely large differences between QST and FST (certainly >10% of the value of FST) are likely to be statistically significant.
The slight effect that dominance might have on QST is therefore unlikely to lead to a spurious inference of selection, and the large variance of QST is certainly more worrisome for the prospect of identifying traits under selection.
GOUDET, J., and L. BÜCHI, 2006 The effects of dominance, regular inbreeding and sampling design on QST, an estimator of population differentiation for quantitative traits. Genetics 172: 13371347.
LOPEZ-FANJUL, C., A. FERNANDEZ and M. TORO, 2002 The effect of epistasis on the excess of the additive and non-additive variance after population bottlenecks. Evolution 56: 865876.[CrossRef][Medline]
LOPEZ-FANJUL, C., A. FERNANDEZ and M. TORO, 2003 The effect of neutral nonadditive gene action on the quantitative index of population divergence. Genetics 164: 16271633.
LOPEZ-FANJUL, C., A. FERNANDEZ and M. A. TORO, 2007 The effect of dominance on the use of the QST FST contrast to detect natural selection on quantitative traits. Genetics 176: 725727.
MERILA, J., and P. CRNOKRAK, 2001 Comparison of genetic differentiation at marker loci and quantitative traits. J. Evol. Biol. 14: 892903.[CrossRef]
RANNALA, B., and J. HARTIGAN, 1996 Estimating gene flow in island populations. Genet. Res. 67: 147158.[Medline]
SPITZE, K., 1993 Population structure in Daphnia obtusa: quantitative genetics and allozyme variation. Genetics 135: 367374.[Abstract]
WRIGHT, S., 1937a The distribution of gene frequencies in populations. Proc. Natl. Acad. Sci. USA 23: 307320.
WRIGHT, S., 1937b The distributions of gene frequencies in populations. Science 85: 504.
WRIGHT, S., 1951 The genetic structure of populations. Ann. Eugen. 15: 323354.
Communicating editor: P. PHILLIPS
- THIS ARTICLE
- Full Text (PDF)
-
All Versions of this Article:
genetics.106.067173v1
176/2/1371 most recent - Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Goudet, J.
- Articles by Martin, G.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Goudet, J.
- Articles by Martin, G.


