Genetics, Vol. 157, 885-897, February 2001, Copyright © 2001

Population Admixture: Detection by Hardy-Weinberg Test and Its Quantitative Effects on Linkage-Disequilibrium Methods for Localizing Genes Underlying Complex Traits

Hong-Wen Denga,b,c, Wei-Min Chena,b, and Robert R. Reckera
a Osteoporosis Research Center, Creighton University, Omaha, Nebraska 68131
b Department of Biomedical Sciences, Creighton University, Omaha, Nebraska 68131
c Laboratory of Molecular and Statistical Genetics, Hunan Normal University, ChangSha, Hunan 410081, People's Republic of China

Corresponding author: Hong-Wen Deng, Osteoporosis Research Ctr., Creighton University, 601 N. 30th St., Omaha, NE 68131., deng{at}creighton.edu (E-mail)

Communicating editor: M. A. ASMUSSEN


*  ABSTRACT
*TOP
*ABSTRACT
*THEORY AND METHODS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*APPENDIX C
*LITERATURE CITED

In association studies searching for genes underlying complex traits, the results are often inconsistent, and population admixture has been recognized qualitatively as one major potential cause. Hardy-Weinberg equilibrium (HWE) is often employed to test for population admixture; however, its power is generally unknown. Through analytical and simulation approaches, we quantify the power of the HWE test for population admixture and the effects of population admixture on increasing the type I error rate of association studies under various scenarios of population differentiation and admixture. We found that (1) the power of the HWE test for detecting population admixture is usually small; (2) population admixture seriously elevates type I error rate for detecting genes underlying complex traits, the extent of which depends on the degrees of population differentiation and admixture; (3) HWE testing for population admixture should be performed with random samples or only with controls at the candidate genes, or the test can be performed for combined samples of cases and controls at marker loci that are not linked to the disease; (4) testing HWE for population admixture generally reduces false positive association findings of genes underlying complex traits but the effect is small; and (5) with population admixture, a linkage disequilibrium method that employs cases only is more robust and yields many fewer false positive findings than conventional case-control analyses. Therefore, unless random samples are carefully selected from one homogeneous population, admixture is always a legitimate concern for positive findings in association studies except for the analyses that deliberately control population admixture.


COMPLEX traits refer to diseases and quantitative traits with complex and multiple genetic and environmental determinations. Association studies that depend on linkage disequilibrium between markers and genes underlying complex traits have helped to decipher some genetic basis of variation of quantitative traits and the differential susceptibility to complex diseases (e.g., CHAGNON et al. 1998 Down). In association studies, usually, case-control analyses have been employed for complex diseases by comparing genotype or allele frequencies of candidate genes in unrelated cases and controls (e.g., BLUM et al. 1990 Down, BLUM et al. 1991 Down; HOLDEN 1994 Down). For quantitative traits, analyses of variance are usually conducted for random individuals to test the difference of the trait means among different genotypes or alleles (e.g., BOERWINKLE et al. 1986 Down, BOERWINKLE et al. 1987 Down; DENG et al. 1999 Down; PAGE and AMOS 1999 Down). However, despite extensive efforts, the results of independent association studies often fail to reach consensus and result in controversy. Such examples are the association between the dopamine D2 receptor gene and alcoholism (BLUM et al. 1990 Down, BLUM et al. 1991 Down; GELERNTER et al. 1993 Down; PATO et al. 1993 Down; HOLDEN 1994 Down) and the association between vitamin D receptor genotypes and bone mass (MORRISON et al. 1994 Down; EISMAN 1995 Down; PEACOCK 1995 Down; GONG et al. 1999 Down).

One of the most important causes that may underlie the inconsistent results from association studies is population admixture (CHAKRABORTY and SMOUSE 1988 Down; LANDER and SCHORK 1994 Down; WEIR 1996 Down; DENG and CHEN 2000 Down). If a population is composed of a recent admixture of different ethnic groups that differ in marker allele frequencies and disease frequencies (or the quantitative trait means), spurious associations may result between the marker genotypes (or alleles) and the complex traits. However, although the qualitative effect of population admixture has long been well recognized, the quantitative effects of population admixture under various degrees of admixture and population differentiation in the marker allele frequencies and disease frequencies have rarely been systematically investigated and are largely unknown. Investigation of such detailed effects is necessary to assess quantitatively the impact of population admixture on association studies and the general utility of the association study approach to identifing genes underlying complex traits. The results from such quantitative studies may also be useful in assessing the robustness of the results from association studies in relation to the samples employed.

Family-based analyses such as the transmission disequilibrium test (TDT; SPIELMAN et al. 1993 Down) have been developed specifically to control for population admixture in association studies to identify genes underlying complex traits. However, compared with the case-control studies that employ random samples of individual cases and controls, the samples for the family-based studies such as the TDT are generally much more difficult to obtain. This is particularly true in light of the fact that only nuclear families (parents and children) with at least one parent heterozygous at marker loci are eligible for the TDT type analyses. Therefore, case-control studies are still commonly used (e.g., DENG et al. 1999 Down) and advocated (e.g., RISCH and TENG 1998 Down) with the hope that carefully selected and tested (through Hardy-Weinberg equilibrium, see below) samples may come from a homogeneous population and thus population admixture is not a concern.

It is well known that population admixture can lead to deviation of genotype frequencies from what are expected on the basis of the Hardy-Weinberg (HW) law (CROW and KIMURA 1970 Down). It has been proposed that the HW equilibrium (HWE) test should be routinely performed at the candidate gene(s) as a method for assessing the potential population admixture (TIRET and CAMBIEN 1995 Down) in association studies with an aim of effectively reducing false positive findings of genes underlying complex traits. Testing HWE at candidate gene(s) is also a common practice in association studies to provide the evidence that population admixture is weak or absent (e.g., DENG et al. 1999 Down). However, the critical questions are the following: What is the power of the HWE test in detecting population admixture? How do various degrees of population admixture and population differentiation affect the power of the HWE test? What samples and markers should be employed to test the HWE for population admixture? Most importantly, how useful is the HWE test in association studies for reducing the rate of false positive findings of genes underlying complex traits? Additionally, the HWE test is also an important and general tool in population and evolutionary genetics in validating the assumptions of the HWE, such as random mating (e.g., HEBERT 1987 Down; LYNCH and SPITZE 1994 Down; DENG and LYNCH 1996 Down), although the usefulness and the power of this important tool is generally unknown.

In this article, through analytical and/or computer simulation approaches, first we quantify the power of the HWE test under various degrees of population differentiation (as reflected by different population allele and disease frequencies) and various degrees of population admixture (as reflected by the different proportions that populations admix). Second, we quantify the effects of various degrees of population differentiation and population admixture on the outcome of association studies. Two types of analyses for complex diseases [a conventional one that employs cases and controls and a recently developed one (FEDER et al. 1996 Down; NIELSEN et al. 1999 Down; DENG et al. 2000 Down) that employs cases only] are investigated. Third, we examine choices of samples and markers for the HWE test to detect population admixture. Finally, we investigate the utility of testing HWE for population admixture in association studies for reducing the error rate of false positive findings of genes underlying complex traits.


*  THEORY AND METHODS
*TOP
*ABSTRACT
*THEORY AND METHODS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*APPENDIX C
*LITERATURE CITED

In this section, we first present our theoretical investigation and then outline our simulation methods. For simplicity, we focus our investigation on association studies of complex diseases in a population (P) admixed of two differentiated large subpopulations (P1 and P2). In the P1 and P2 populations, HWE holds at a marker locus in which alleles can be classified into two classes, M and m. The frequencies of M in P1 are f1 and in P2 are f2. The disease prevalences are, respectively, {phi}1 in population P1 and {phi}2 in P2. The disease and the marker locus are not associated by any cause in the P1 and P2 populations. A proportion k of individuals in population P come from population P1; the rest (1 - k) come from P2. The frequencies of the M allele (f) and the disease ({phi}) in population P are then, respectively, f = kf1 + (1 - k)f2 and {phi} = k{phi}1 + (1 - k){phi}2.

The power of HWE test for population admixture at marker loci:
To focus our investigation on the power of the HWE test for population admixture, we assume that in population P, HW disequilibrium is entirely due to the population admixture. The HW disequilibrium can be measured by the deviations of genotype frequencies from those expected under the HWE (WEIR 1996 Down),

(1a)


(1b)


(1c)

where {Delta}f = f2 - f1. k(1 - k) can serve as a measure for the degree of population admixture. The larger the k(1 - k), the larger the degree of population admixture. k(1 - k) is maximized when k = 0.5. The {chi}2-test is often employed to test for HWE (WEIR 1996 Down). The test statistic is

(2)

which has 1 d.f. N is the sample size, and ~ indicates estimated frequencies of genotypes (MM, Mm, or mm) or alleles (M or m) from the sample. Under the alternative hypothesis that there is population admixture, the {chi}2HW statistic follows a noncentral {chi}2-distribution with 1 d.f. and the noncentrality parameter

(3a)

where D's are defined in Equation 1aEquation 1bEquation 1c. Substituting D's from Equation 1aEquation 1bEquation 1c into Equation 3a, we have

(3b)

With {lambda}HW given in Equation 3b, we can compute the power ({eta}) of the {chi}2HW-test under various degrees of population differentiation {Delta}f and various degrees of population admixture k (Appendix A). Some numerical results substantiated in later computer simulations are given in Table 1 Table 2 Table 3 Table 4.


 
View this table:
In this window
In a new window

 
Table 1. Sample sizes required to detect population admixture with 90% power by the HWE test under different admixtures (k) for two populations with allele M frequencies indicated as f


 
View this table:
In this window
In a new window

 
Table 2. Sample sizes required to detect population admixture with 80% power by the HWE test under different admixtures (k) for two populations with allele M frequencies indicated as f


 
View this table:
In this window
In a new window

 
Table 3. The power to detect population admixture with 200 individuals sampled from a population under different admixtures (k) with two differentiated populations


 
View this table:
In this window
In a new window

 
Table 4. The power to detect population admixture with 400 individuals sampled from a population under different admixtures (k) with two differentiated populations

Differentiation between populations can be measured by various indices in population genetics (CROW 1983 Down; HARTL and CLARK 1989 Down). One frequently employed index is the GST (NEI 1975 Down; CROW 1983 Down), which measures the relative reduction of heterozygosity (H) due to isolation of differentiated populations—the well-known Wahlund phenomenon (HARTL and CLARK 1989 Down). It can be shown (Appendix B) that

(4)

where N is the sample size for the {chi}2HW-test. Equation 4 establishes a direct relationship between the power of the HWE test for population admixture and a classical measure (GST) of the degree of population differentiation. Apparently, the larger the population subdivision as reflected by the GST, the higher the power for a sample to detect population subdivision by the HW test.

The effects of admixture of differentiated populations on the outcome of association studies:
To focus on quantifying the effects of admixture on association studies, we assume that the marker locus does not underlie the disease susceptibility in populations P1 and P2 and any association between the marker locus and the disease in population P will be entirely due to the admixture of the two differentiated populations.

Two types of tests are investigated, both depending on the basis that the marker locus is a disease gene per se or that it is in linkage disequilibrium with a disease gene. The first one is the {chi}2-test employed in the conventional case-control studies (WEIR 1996 Down) to test for the association between frequencies of marker alleles and diseases. The null hypothesis is that the distributions of marker allele frequencies are the same in the cases (individuals with the disease) and controls (individuals without the disease). The test statistic is

(5)

where M|D is the allele M frequency in cases (D) and M|C is the allele M frequency in controls (C). m|D and m|C are similarly defined for allele m. This {chi}2-test has 1 d.f. N is the sample size, and ~ indicates an estimated value from the sample. With population admixture, {chi}2CC approximately follows a noncentral {chi}2-distribution as corroborated later in our computer simulations. In the admixed population, pM|D, pM|C, pm|C, pm|C can be derived in terms of k, f1, f2, {phi}1, {phi}2, f, and {phi} (Appendix C). With these, we can obtain the noncentrality parameter of the {chi}2CC-statistic:

(6a)

If none of the terms of k, (1 - k), {Delta}f, or {Delta}{phi} is equal to 0, i.e., if there is population admixture for two populations differentiated in both allele and disease frequencies, then

(6b)

It can be seen qualitatively that the larger the term k(1 - k){Delta}f{Delta}{phi}, the larger the {lambda}CC. The magnitude of the noncentrality parameter {lambda}CC determines the power to detect association between marker alleles and the disease due to admixture of the two differentiated populations P1 and P2. {lambda}CC may help us understand intuitively the effects of population admixture and population differentiation on case-control analyses of association studies.

The null hypothesis to be tested in association studies for disease genes is that the marker alleles are not causally associated with the disease; i.e., the marker locus and a disease gene are not linked. For this null hypothesis, the power of the {chi}2CC-test in the admixed population under the condition of no causal relationship between the marker alleles and the disease is, in fact, the type I error ({epsilon}) due to population admixture and the statistical sampling error (a prespecified significance level for the {chi}2CC-test, {alpha}). With a given noncentrality parameter, the power under a specific set of parameters is computed in the same way as detailed in the previous section. The dependency of {epsilon} on various parameters of population differentiation and admixture is depicted in Fig 1. The difference of {epsilon} and {alpha} is the inflated type I error due solely to the admixture of differentiated populations.



View larger version (26K):
In this window
In a new window
Download PPT slide
 
Figure 1. The rate of false positive findings of association between a marker locus and a disease ({epsilon}) with the {chi}2CC-test under various degrees of population differentiation (f2 and {phi}2) and admixture (k). A total of 200 cases and 200 controls are employed in the statistical tests; f1 = 0.2, {phi}1 = 0.01. The solid lines indicate {epsilon} unadjusted for the HWE test, and the dashed lines are for {epsilon} adjusted for the HWE test for population admixture. Squares, {phi}2 = 0.06; circles, {phi}2 = 0.04; triangles, {phi}2 = 0.02.

The second analysis investigated is developed by FEDER et al. 1996 Down and NIELSEN et al. 1999 Down. DENG et al. 2000 Down extended this method for fine-mapping QTL. For a complex disease, this method can be employed to test the association of a marker locus and a disease by testing for HWE in the cases only (NIELSEN et al. 1999 Down). The power of this test and several other linkage disequilibrium tests has been compared (DENG et al. 2000 Down). Here, we investigate the effects of population admixture on the type I error rate of this test of HWE ({chi}2HW-D) in cases (D) for finding the causal association between a marker locus and a disease. The test statistic is

(7)

which has 1 d.f. ND is the sample size of cases. With the derivations in Appendix C, we can obtain the noncentrality parameter of the {chi}2HW-D statistic in the presence of population admixture:

(8)

It can be seen, by calculus, that {lambda}HW - D maximizes, with respect to {psi}, when {psi} = - k . The maximum of {lambda}HW - D with respect to {psi} is

{lambda}HW-D (max) is useful in that it may allow us to characterize maximum type I error in association studies with the {chi}2HW-D-test, irrespective of the population differentiation in the disease frequencies in populations P1 and P2.

The dependency of the type I error of the {chi}2HW-D-test ({epsilon}, for the null hypothesis of no causal association between the disease and the marker) on various parameters of population differentiation and admixture is computed, on the basis of {lambda}HW-D for the {chi}2HW-D-statistic, in a way similar to that described in APPENDIX A. The results are depicted in Fig 2.



View larger version (23K):
In this window
In a new window
Download PPT slide
 
Figure 2. The rate of false positive findings of association between a marker locus and a disease ({epsilon}) with the {chi}2HW-D-test under various degrees of population differentiation (f2 and {phi}2) and admixture (k). A total of 400 cases are employed. For other parameters, see the legend to Fig 1.

From Equation 3aEquation 3b and Equation 8, we can obtain the following relationship between the noncentrality parameter for the test of HWE in random population samples for detecting admixture and that in the cases for only detecting linkage disequilibrium between a marker locus and disease genes. Assume that both tests employ the same sample sizes,

where {psi} = .

Choice of population samples and marker loci for the HWE test:
Ideally, random samples (for any locus) or marker loci not associated with the disease (for any sample) should be employed to detect population admixture ({chi}2HW-test, Equation 2). For association studies (cases and controls for the {chi}2CC-test and cases only for the {chi}2HW-D-test), what samples at hand should be employed for the HWE test to detect population admixture are not entirely clear and have not been formally investigated. Apparently, we cannot use the cases only, as the HWE test in cases is a test for linkage disequilibrium when the whole population is randomly mating (FEDER et al. 1996 Down; NIELSEN et al. 1999 Down; DENG et al. 2000 Down). Should we employ all the combined samples of cases and controls to maximize the power to detect the HW disequilibrium due to population admixture? Or should we employ only controls?

To investigate these questions, we performed two types of simulations. The first type is to investigate the power to detect HW disequilibrium with cases and/or controls in large randomly mating populations, when the marker locus is at or closely linked to a disease susceptibility locus. For the null hypotheses of no population admixture, this power is in fact the rate of false positive findings (type I error rate, {epsilon}) for HW disequilibrium that is due to nonrandom choices of samples (cases and/or controls) and the statistical sampling error (a prespecified significance level, {alpha}). The simulation procedures are detailed in NIELSEN et al. 1999 Down and DENG et al. 2000 Down. Briefly, evolving populations segregating for a biallelic disease locus and biallelic marker loci are simulated. We consider a set of marker loci that are positioned at every 0.20 cM and span 0–2 cM on one side of the disease locus and a marker locus that is not linked to the disease locus (with the additive or recessive model, Fig 3, a and b). The disease prevalence in the population is 0.08. In simulations, recombinations between the QTL and marker loci are independent; i.e., there is no interference. The recombination rate is obtained from the physical distance between the disease locus and the marker locus using Haldane's map function (OTT 1991 Down). Under a specific genetic model, the population started at the 0th generation with complete association between allele A1 at the disease locus (with frequency 0.1) and a marker allele M (with frequency 0.2). Then the population evolved for 50 generations under random mating and genetic drift. The population size is 15,000. The genetic drift under such a population size is extremely small (CROW and KIMURA 1970 Down; DENG et al. 2000 Down). At the end of the simulation, 200 cases and 200 controls are sampled from the population. Then the {chi}2HW-test (Equation 2) is applied to the 200 controls and the combined sample of 200 controls and 200 cases. The population is sampled and tested 5000 times, and the proportion of the significant tests is the power to detect HW disequilibrium when there is no population admixture ({epsilon}). Thus this proportion provides an estimate of the type I error ({epsilon}) of the null hypothesis of no population admixture. The simulation is repeated 100 times, and the mean and standard deviation of {epsilon} is computed and depicted in Fig 3, a and b.



View larger version (22K):
In this window
In a new window
Download PPT slide
 
Figure 3. The rate ({epsilon}) of the false positive findings of HW disequilibrium in randomly mating populations in marker loci in linkage disequilibrium with a disease locus (a and b) and the power to detect ({eta}) population admixture by HWE test at a candidate gene locus (c and d) in admixed populations. (a) Recessive genetic model at the disease locus. The penetrances for the three genotypes A1A1, A1A2, A2A2, are, respectively, 1.00, 0.07, and 0.07. (b) Additive genetic model at the disease locus. The penetrances for the three genotypes A1A1, A1A2, A2A2, are, respectively, 0.98, 0.49, and 0.00. In a and b, data plotted are the mean and 1 SD, the solid squares are data for the tests with combined samples of 200 cases and 200 controls, and the open circles are for the tests with 200 controls. x-axes in a and b are expected average linkage disequilibrium coefficient between a marker locus and the disease locus defined as D = fMD - fMfD, where fMD is the haplotype frequency of the marker allele M and disease allele D, and fM and fD are the marker allele M frequency and disease allele D frequencies, respectively. In c and d, the solid and open squares are, respectively, for the tests with random samples of 200 and 400 individuals; the solid and open circles are, respectively, for the tests with 200 random cases and 200 random controls and for the tests with 200 random controls. In simulations for c, f1 = 0.1, f2 = 0.3, {phi}1 = 0.10, {phi}2 = 0.07. In simulations for d, k = 0.5, f = 0.2, {phi}1 = 0.10, {phi}2 = 0.07.

The second type of simulations is to compare the power to detect population admixture with controls only and that with both cases and controls in an admixed population. A population P admixed with P1 (with f1 = 0.1, {phi}1 = 0.1) and P2 (with f1 = 0.3, {phi}2 = 0.3) is simulated with various k in Fig 3C. In Fig 3D, a population P (f = 0.2) is simulated from admixture (k = 0.5) of two populations P1 and P2 that have allele M frequencies that differ by {Delta}f. In the P1 and P2 populations, the disease and the marker are not associated by any means. A total of 200 controls and 200 cases are sampled from the P population. Then the {chi}2HW-test (Equation 2) is applied to the 200 controls and to the combined sample of 200 controls and 200 cases. The population is sampled and tested 50,000 times, and the proportion of the significant tests is the power to detect HW disequilibrium due to population admixture ({eta}).

Testing HWE for population admixture in reducing false positive findings in association studies:
In the first two sections, through the analytical approach, we study separately the power of the HWE test for population admixture and the effects of admixture of differentiated populations on elevating the type I error rate in association studies. In this section, through computer simulations, we investigate the effect of testing HWE at candidate genes for population admixture in association studies, a practice (e.g., DENG et al. 1999 Down) and a recommendation (TIRET and CAMBIEN 1995 Down) for reducing false positive findings due to admixture. We also corroborate with our simulations the analytical results obtained in the first two sections.

A population (P) admixed of two differentiated populations (P1 and P2) is simulated. A proportion k of individuals of the population P comes from P1 and the rest from P2. In the P1 and P2 populations, HWE holds at the marker locus in which alleles can be classified into two classes M and m. The frequencies of M in P1 are f1 and in P2 are f2. The disease prevalences are, respectively, {phi}1 in population P1 and {phi}2 in P2. The disease and the marker locus are independent in the P1 and P2 populations. For a specific parameter set, a sample of 3N individuals is simulated from the P population, with 2N cases and N controls. The HWE test (Equation 2) is performed to detect population admixture with the N controls only (see RESULTS, Choices of population samples and marker loci for HWE test). If the test is significant, further testing of association between the marker and the disease will not be pursued to avoid confounding of association results due to admixture. If the test is not significant, i.e., if the test fails to reveal population admixture, tests of association that employ N random cases and N controls (Equation 5) and those that employ 2N cases (Equation 7) are conducted. This sampling scheme ensures that the test of HWE is performed on the basis of the same sample of controls for the test employing cases and controls (Equation 5) and the tests that are based on cases only (Equation 7). It also ensures that the two tests of associations have the same sample sizes of 2N so that the comparison of false positive findings of these two tests will not be confounded by the different sample sizes employed. N = 200 in our investigations. Note that this sampling design is used for the purpose of simulations only and is in no way intended as a design in collecting data in practice.

The proportion of the simulations with significant associations and a nonsignificant HWE test is the type I error of the association study approach that is aided with the HWE test to guard against confounding from population admixture. This type I error includes both the specified type I error rate in the statistical testing ({alpha}) and that inflated due to population admixture that failed to be revealed by the HWE test. For comparison, the type I error rate of association studies with and without (computed as indicated in the second section) the aid of the HWE test for population admixture is contrasted in Fig 1 and Fig 2. In simulations, we corroborated that the results for the power or type I error based upon the analytical approach in the first two sections are accurate (to avoid repetitiveness the results are not shown).


*  RESULTS
*TOP
*ABSTRACT
*THEORY AND METHODS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*APPENDIX C
*LITERATURE CITED

The sample size required and the power of the HWE test for population admixture (Table 1 Table 2 Table 3 Table 4):
It can be seen from Table 1 and Table 2 that the sample size (n) required to detect population admixture by the HWE test is generally quite large, except when the degree of population admixture is large (i.e., k ~ 0.5) and the differentiation of populations P1 and P2 is large (i.e., when {Delta}f is large). Generally speaking, when {Delta}f = 0.2 (i.e., the frequencies of the allele M differ by 0.2 in the populations P1 and P2), n required is >2000 even with the largest degree of population admixture (k = 0.5) and n is >20,000 if the degree of population admixture is small (k = 0.1). These sample sizes well exceed those feasible and typically employed in association studies. When {Delta}f gets larger and k gets closer to 0.5, n gets smaller. Generally speaking, only when {Delta}f > 0.4, and when k > 0.2, can the population admixture be detected by the sample sizes typically employed in association studies (<1000).

For samples sizes 200 and 400 that are typically feasible, Table 3 and Table 4 list the power to detect population admixture via the HWE test under various degrees of population differentiation and admixture. It can be seen that the power depends on both k and {Delta}f. Generally speaking, if {Delta}f < 0.2, there is little power to detect population admixture via the HWE test regardless of k. Only when {Delta}f is quite large (>0.4) and k > 0.2 is the power relatively high. When {Delta}f = 0.8, the power is almost always 100%. However, {Delta}f > 0.4 is probably rather rare in natural populations, especially in humans for candidate genes.

The effects of admixture of differentiated populations on the outcome of association studies ( Fig 1 and Fig 2):
It can be seen (Fig 1) that when {Delta}f and {Delta}{phi} increase, the false findings of association studies (the type I error rate, {epsilon}) increase rapidly for the {chi}2CC-test that employs both cases and controls. When {Delta}f = 0, irrespective of the magnitude of {Delta}{phi}, {epsilon} remains the same magnitude of a prespecified type I error rate {alpha} (0.05). Data not shown indicate that when {Delta}{phi} = 0, {epsilon} remains relatively stable at {alpha} (0.05) for the case-control analyses (Equation 5), regardless of the magnitude of {Delta}f. This is consistent with the analytical prediction based on the noncentrality parameter of the test statistic (Equation 6a). These results are consistent with qualitative analyses of the noncentrality parameters of these tests (Equation 6a and Equation 8).

Noticeable (Fig 2) is the fact that, over a range of {Delta}f, {Delta}{phi}, and k, {epsilon} remains fairly stable and close to the specified significance level {alpha} (0.05) for the {chi}2HW-D test (Equation 7) that employs cases only. Generally speaking, for the same sample sizes employed, the {chi}2HW-D test has much smaller {epsilon} (Fig 1 and Fig 2) than the {chi}2CC-test under the same parameters.

Choice of population samples and marker loci for HWE test ( Fig 3):
In large randomly mating populations (Fig 3, a and b), if the marker locus is in linkage disequilibrium with the disease locus due to linkage, testing HWE with both the cases and controls (selected for population association studies) will result in false positive findings of population admixture at a rate ({epsilon}) higher than the specified statistical type I error rate {alpha} (0.05). {epsilon} increases dramatically with increasing levels of linkage disequilibrium. When the marker locus is not linked to a disease locus, {epsilon} remains at the specified {alpha} of 0.05. However, if tested only in controls, {epsilon} remains at the level close to 0.05 whether the locus is linked to a disease locus or not.

In an admixed population P (Fig 3C and Fig D), if the marker locus is not linked to a disease locus, combining cases and controls for the HW test will generally have higher power to detect population admixture than testing in the controls alone, due to larger sample sizes. Testing the HWE with controls only has similar power to the testing with random samples of the same sizes. Testing HWE with combined sample sizes of cases and controls has similar or slightly greater power to detect population admixture than the test employing random samples of the same sizes. This is probably due to the elevated level of HW disequilibrium in cases due to population admixture. Although the marker is not linked to a disease locus in subpopulations P1 and P2, linkage disequilibrium between the marker and the disease is created upon admixture of P1 and P2 that differ in disease and marker frequencies. Such linkage disequilibrium leads to the elevated level of HW disequilibrium in cases.

Testing HWE for population admixture in association studies ( Fig 1 and Fig 2):
By contrasting the {epsilon}'s for the association studies that do and do not employ the HWE test for population admixture, it can be seen easily (Fig 1 and Fig 2) that those employing the HWE test will suffer reduced levels of {epsilon}. However, the reduction of {epsilon} is generally small by accepting only those significant associations in samples with a nonsignificant HWE test. Therefore, the utility of testing HWE in reducing false positive findings due to population admixture is generally limited. This is consistent with earlier results on the limited power of the HWE test for population admixture.


*  DISCUSSION
*TOP
*ABSTRACT
*THEORY AND METHODS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*APPENDIX C
*LITERATURE CITED

With random population samples, extensive association studies have been conducted to search for genes underlying complex traits through linkage disequilibrium of these genes with markers. It is well known (CHAKRABORTY and SMOUSE 1988 Down; LANDER and SCHORK 1994 Down; WEIR 1996 Down) that if there is population stratification, spurious association may result between marker loci and complex traits in association studies. Although the qualitative effects of population stratification have long been recognized, the detailed quantitative effects of various degrees of population stratification on various linkage disequilibrium methods have seldom, if ever, been investigated. It is a usual practice (e.g., DENG et al. 1999 Down) and it is suggested (TIRET and CAMBIEN 1995 Down) to use the HWE test at candidate genes for population admixture in association studies with an aim to guard against false positive findings of markers with diseases.

Through analytical and computer simulation approaches, we quantified the power of the HW test for population admixture and the effects of population admixture on increasing the false positive findings (type I error, {epsilon}) in association studies under various scenarios of population admixture and population differentiation. We found that (1) the power of the HWE test for detecting population admixture is usually small, even with large samples, unless the degrees of population admixture and population differentiation are rather large; (2) population admixture seriously elevates {epsilon} for detecting genes underlying complex traits, the extent depending on the degrees of population admixture and population association; (3) HWE testing for population admixture should be performed with random samples, or only with controls at candidate genes, or the test may be performed for combined samples of cases and controls at marker loci that are not linked to the diseases under study; (4) testing HWE for population admixture generally reduces false positive findings of genes underlying complex traits but the effect is generally small due to the limited power to detect population admixture by the HWE test; and (5) compared with the conventional case-control analyses ({chi}2CC-test, Equation 5) in association studies for complex diseases, the {chi}2HW-D-test (Equation 7) that employs only cases is more robust and yields much smaller {epsilon}.

In this study, we focus on studying {epsilon} in a common practice (e.g., TIRET and CAMBIEN 1995 Down; DENG et al. 1999 Down) in association studies where the HWE test is employed at candidate gene(s) to guard against spurious association due to population admixture. Although, as revealed here, such a practice has some minor effects on decreasing {epsilon} in detecting disease genes in the presence of population admixture, it is also intuitive that in the absence of population admixture, such a practice will decrease the power to detect diseases genes. This is simply because the spurious population admixture will be detected by HWE tests that are entirely due to sampling error (at a rate specified by the level of the test significance {alpha}) in the absence of population admixture. Such spurious findings of population admixture may erroneously halt the testing for disease loci at the candidate genes. Recently, PRITCHARD and ROSENBERG 1999 Down proposed employing a series of marker loci unlinked to the candidate genes to test for population admixture/stratification through contingency table {chi}2-tests in cases and controls as a means to effectively reduce {epsilon} in disease gene searching in association studies. Although combining a series of unlinked marker loci may increase the power to detect population admixture in the HWE test, the increase in power requires that each marker locus included in analyses is differentiated in subpopulations—a valuable piece of information that is generally unknown for markers in most admixed populations. Including markers that are not differentiated among subpopulations will generally decrease the power to detect population admixture. Most importantly, it is the differentiation of disease frequencies and allele frequencies at candidate genes in subpopulations of an admixed population that affects {epsilon} in disease gene testing at candidate genes. This can be easily seen analytically via the noncentrality parameters of association test statistics (Equation 6aEquation 6b and Equation 8). Various loci across the human genome may be differentiated to various degrees in subpopulations of an admixed population. If the candidate genes are not differentiated, but the unlinked marker loci selected are differentiated in subpopulations and population admixture is detected by HWE tests at these unlinked marker loci, we will suffer substantial loss of power by stopping to test candidate loci for detecting disease genes. On the other hand, if the marker loci are not differentiated but the candidate genes are differentiated in subpopulations of an admixed population, we will still suffer inflated {epsilon} due to admixture of subpopulations differentiated at the candidate genes to be tested. The above problem may also undermine the usefulness of the approach of PRITCHARD et al. 2000A Down, PRITCHARD et al. 2000B Down for inferring population structure using multilocus genotype data to perform association studies of candidate genes. By applying the Bayesian approach, DEVLIN and ROEDER 1999 Down developed a genomic control method for single nucleotide polymorphism (SNP) data densely sampled from the whole genome in case-control studies. This method is supposed to be able to, in whole genome case-control studies with SNP, control the type I error rate to desired levels by choosing appropriate tuning parameters in implementation.

PRITCHARD and ROSENBERG 1999 Down focus on the situation where there is no prior reason to suspect population admixture, and association studies have been conducted and positive results have been generated. This is because, as they correctly pointed out, case-control studies are often criticized under such circumstances for potential confounding effects of possible population admixture. Therefore, they suggest genotyping additional unlinked markers to test for population admixture in the presence of positive association results. Our study starts from a slightly different angle. Our study is stimulated by the general practice and suggestion (TIRET and CAMBIEN 1995 Down) that testing population admixture via the HWE test should proceed at the candidate gene before association tests and by the general perception that this procedure can effectively reduce the type I error due to population admixture. Testing HWE for population admixture at a candidate gene conditional on a significant case-control test may be of limited use in reducing the type I error due to population admixture in disease gene identification. This is because of the small power of the HWE test at a single candidate gene in detecting population admixture as demonstrated by the results here.

HWE is a fundamental topic in population genetics. Issues related to HWE have been subjected to extensive studies and have various applications in many research areas. Examples are the propositions of various tests of HWE (e.g., LOUIS and DEMPSTER 1987 Down; HERNANDEZ and WEIR 1989 Down; EGUCHI and MATSUURA 1990 Down; GUO and THOMPSON 1992 Down) and HWE tests in stratified populations (e.g., NAM 1997 Down); characterization of HW disequilibrium (SHOEMAKER et al. 1998 Down); and testing of genes underlying complex traits through the HWE test in extreme samples of populations (NIELSEN et al. 1999 Down; DENG et al. 2000 Down). SCHAID and JACOBSEN 1999 Down proposed testing for disease genes in association studies by correcting the existent HW disequilibrium to avoid the inflated type I error due to population admixture/stratification; however, we (DENG and CHEN 2000 Down) found that their correction approach is generally not feasible in practice.

CHAKRABORTY and SMOUSE 1988 Down and BRISCOE et al. 1994 Down found that the level of linkage disequilibrium in a population P admixed of P1 and P2 populations for two marker loci is D = k(1 - k){Delta}f{Delta}p, where k and {Delta}f are defined earlier and {Delta}p is the difference of the allele frequency of the second locus. The two loci are assumed to be in linkage equilibrium in the P1 and P2 populations. In this study, we assume that a locus and a disease are not associated in the P1 and P2 populations. The association in the P population is entirely due to the "disequilibrium" between the marker locus and the disease created by admixture. The degree of such disequilibria may be measured as D' = k(1 - k){Delta}f{Delta}{phi}. It is noted that the power to detect an association between the marker and the disease created by admixture critically depends on D' as reflected by Equation 6a and Equation 6b for the {chi}2CC-test. However, the disequilibrium due to population admixture between a marker locus and the disease may not have the same effects on different association studies, as is demonstrated by our simulation results for the two tests examined ({chi}2CC- and {chi}2HW-D-tests). This is also apparent from the noncentrality parameters found for the two test statistics (Equation 6a, Equation 6b, and Equation 8). Different from the {chi}2CC-test, the power of the test does not have a direct relationship with {Delta}{phi} for the {chi}2HW-D-test.

Population association studies that depend on linkage and strong linkage disequilibrium between marker loci and loci underlying complex traits have been conducted extensively and have helped in deciphering some genetic bases of complex traits (e.g., CHAGNON et al. 1998 Down). Population association studies have advantages such as being powerful and relatively easy to recruit study subjects. However, the results generated so far from population association studies are largely inconsistent and controversial. Quantitative studies of the detailed mechanisms of various potential causes underlying the inconsistent results may not only provide a basis for correct implementation of association studies but also form a basis on which to correctly interpret the significant results obtained under various designs and analyses. For example, it is noted that association studies in cases only (Equation 7) may be more robust in identifying genes underlying complex traits than the conventional case-control analyses (Equation 5). In addition, the HWE test may reduce the false positive findings, but the effects are small. Therefore, while association studies can be a useful tool with which to generate hypotheses in gene identification, the hypotheses may also need to be substantiated by methods (SPIELMAN et al. 1993 Down; ALLISON 1997 Down; XIONG et al. 1998 Down) that are robust to population admixture, unless the samples are known from a homogeneous population.

Finally, it should be pointed out that although we examine the detection of HW disequilibrium due to population admixture in the context of localizing genes underlying complex diseases, some issues investigated here should be of general interest in genetics. For example, it is noted here for the first time that the degree of population differentiation as measured by GST has a direct relationship with the noncentrality parameter (and thus the power) of the test to detect HW disequilibrium (Equation 4). In addition, it is a general practice in population and evolutionary genetics to test for HW disequilibrium as a means to substantiate the assumptions for HW equilibrium (such as population admixture, inbreeding, and assortative mating). Nonsignificant results are generally interpreted as an indication of random mating in the study populations (e.g., HEBERT 1987 Down; LYNCH and SPITZE 1994 Down; DENG and LYNCH 1996 Down). However, such a practice may not be reliable in that the test has limited power in detecting deviation from HW equilibrium due to population migration, etc., as demonstrated here. Therefore, the practice that employs the HW disequilibrium test to substantiate the assumptions of HW equilibrium may need to be treated with caution unless the sample size is very large (e.g., >1000).


*  ACKNOWLEDGMENTS

We are grateful to Professor Asmussen and the two anonymous reviewers for providing careful comments that helped to improve the manuscript. This study was partially supported by grants from the National Institutes of Health, the Health Future Foundation, and HuNan Normal University and by a graduate student tuition waiver to Wei-Min Chen from Creighton University.

Manuscript received January 4, 2000; Accepted for publication October 30, 2000.


*  APPENDIX A
*TOP
*ABSTRACT
*THEORY AND METHODS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*APPENDIX C
*LITERATURE CITED

COMPUTATION OF THE STATISTICAL POWER BASED ON THE NONCENTRALITY PARAMETER OF THE {chi}2HW-STATISTIC
The power ({eta}) of the {chi}2HW-test under various degrees of population differentiation {Delta}f and various degrees of population admixture k can be computed as

(A1)

where f(x, 1, {lambda}) is the p.d.f. of a noncentral {chi}2-distribution with 1 d.f. and the noncentrality parameter {lambda}HW defined in Equation 3b. {chi}2{alpha} is the critical value (quantile of order 1 - {alpha}) so that for the central {chi}2-distribution [with p.d.f. f(x, 1, 0)], which holds under the null hypothesis of HWE, the following relationship holds:

Thus {alpha} specifies the type I error (or significance level) of the HWE test for population admixture. For a specified {alpha}, {chi}2{alpha} can be found in most statistics books. Therefore, with a certain sample size N, {eta} of the test for HWE under various {Delta}f and k can be computed by Equation 3aEquation 3b and Equation A1. In the power computations, PEARSON's (1959) method (JOHNSON et al. 1995 Down, p. 462) is employed to approximate the c.d.f. of noncentral {chi}2-distributions, which requires the c.d.f. of central {chi}2-distributions that are approximated by the LING 1978 Down method (when d.f. > 1.5) or the WILSON and HILFERTY 1931 Down method (when d.f. < 1.5; JOHNSON et al. 1994 Down, pp. 426 and 437). Under a specified {eta}, the sample size N required to detect population admixture with the HWE test can be simply obtained by Equation 3aEquation 3b and Equation A1 with the aid of the tables for noncentral {chi}2-distributions (WEIR 1996 Down, p. 382).


*  APPENDIX B
*TOP
*ABSTRACT
*THEORY AND METHODS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*APPENDIX C
*LITERATURE CITED

THE RELATIONSHIP OF {lambda}HW WITH GST
GST = , where HT is the heterozygosity if all the isolated populations were converted into a single randomly mating population. S measures the average heterozygosity of isolated subpopulations. In a population P admixed of populations P1 and P2 with a proportion k from P1 and (1 - k) from P2, for a locus with two alleles M and m with frequencies of M being f1 in P1 and f2 in P2 and the frequency of M in P is f, HT = 1 - f2 - (1 - f)2, where f is defined in the text. If the average heterozygosity of P1 and P2 is computed by weighting the heterozygosity in P1 and P2, respectively, by their relative contributions to population P, S = 2kf1(1 - f1) + 2(1 - k)f2(1 - f2), then

By the above equation and Equation 3a, we have

where N is the sample size for the {chi}2HW-test.


*  APPENDIX C
*TOP
*ABSTRACT
*THEORY AND METHODS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*APPENDIX C
*LITERATURE CITED

FREQUENCIES OF MARKER ALLELES AND GENOTYPES IN CASES AND CONTROLS IN AN ADMIXED POPULATION
Assume that the marker locus is not causally associated with the disease and assume that the marker genotypes (or alleles) and the disease are not associated in populations P1 and P2; the association between the marker and the disease in population P is then due entirely to the admixture. In population P, the expected frequency of the allele M in cases is

(C1)

where Pr(M|P1, D) = Pr(M|P1) due to the independence of the marker allele and disease within populations of P1 and P2,

Similarly,

In population P, the expected frequency of genotype MM in cases is

(C2)

Therefore, from Equations C1 and C2, and after some algebra simplification, we have

Similarly, we have


*  LITERATURE CITED
*TOP
*ABSTRACT
*THEORY AND METHODS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*APPENDIX C
*LITERATURE CITED

ALLISON, D. B., 1997  Transmission-disequilibrium tests for quantitative traits. Am. J. Hum. Genet. 60:676-690[Medline].

BLUM, K., E. P. NOBEL, P. J. SHERIDAN, A. MONTGOMERY, and T. RITCHIE et al., 1990  Allelic association of human dopamine D2 receptor gene in alcoholism. J. Am. Med. Assoc. 263:2055-2060[Abstract/Free Full Text].

BLUM, K., E. P. NOBLE, P. J. SHERIDAN, O. FINLEY, and A. MONTGOMERY et al., 1991  Association of the A1 allele of the D2 Dopamine receptor gene with severe alcoholism. Alcohol 8:409-416[Medline].

BOERWINKLE, E., R. CHAKRABORTY, and C. F. SING, 1986  The use of measured genotype information in the analysis of quantitative phenotypes in man. Ann. Hum. Genet. 50:181-194[Medline].

BOERWINKLE, E., S. VISCIKIS, D. WELSH, J. STEINMETZ, and S. M. HAMASH et al., 1987  The use of measured genotype information in the analysis of quantitative phenotypes in man. II. The role of the apolipoprotein E polymorphisms in determining levels, variability, and covariability of cholesterol, betalipoprotein, and triglycerides in a sample of unrelated individuals. Am. J. Med. Genet. 27:567-582[Medline].

BRISCOE, D., J. C. STEPHENS, and S. J. O'BRIEN, 1994  Linkage disequilibrium in admixed populations: applications in gene mapping. J. Hered. 85:59-63[Abstract/Free Full Text].

CHAGNON, Y. C., L. PERUSSE, and C. BOUCHARD, 1998  The human obesity gene map: the 1997 update. Obes. Res. 5:76-92.

CHAKRABORTY, R. and P. SMOUSE, 1988  Recombination in haplotypes leads to biased estimates of admixture proportions in human populations. Proc. Natl. Acad. Sci. USA 85:3071-3074[Abstract/Free Full Text].

CROW, J. F., 1983 Basic Concepts in Population, Quantitative, and Evolutionary Genetics. Freeman, New York.

CROW, J., and M. KIMURA, 1970 An Introduction to Population Genetics Theory. Harper & Row, New York.

DENG, H.-W. and W. M. CHEN, 2000  Re: "biased tests of association: comparison of allele frequencies when departing from Hardy-Weinberg proportions.". Am. J. Epidemiol. 151:335-357[Free Full Text].

DENG, H.-W. and M. LYNCH, 1996  Change of genetic architecture in response to sex. Genetics 143:203-212[Abstract].

DENG, H.-W., J. LI, J. L. LI, M. JOHNSON, and G. GONG et al., 1999  Association of VDR and estrogen receptor genotypes with bone mass in postmenopausal caucasian women: different conclusions with different analyses and the implications. Osteoporos. Int. 9:499-507[Medline].

DENG, H.-W., W. M. CHEN, and R. R. RECKER, 2000  QTL fine mapping in extreme samples from populations. Am. J. Hum. Genet. 66:1027-1045[Medline].

DEVLIN, B. and K. ROEDER, 1999  Genomic control for association studies. Biometrics 55:997-1004[Medline].

EGUCHI, S. and M. MATSUURA, 1990  Testing the Hardy-Weinberg equilibrium in the HLA system. Biometrics 46:415-426[Medline].

EISMAN, J. A., 1995  Vitamin D receptor gene alleles and osteoporosis: an affirmative view. J. Bone Miner. Res. 10:1289-1293[Medline].

FEDER, J. N., A. GNIRKE, W. THOMAS, and Z. TSUCHIHASI, 1996  A novel MHC class I-like gene is mutated in patients with hereditary haemochromatosis. Nat. Genet. 13:399-408[Medline].

GELERNTER, J., D. GOLDMAN, and N. RISCH, 1993  The A1 allele at the D2 dopamine receptor gene and alcoholism. J. Am. Med. Assoc. 269:1673-1677[Abstract/Free Full Text].

GONG, G., S. STERN, S. C. CHENG, F. FONG, and N. MORDESON et al., 1999  On the association of bone mass density and Vitamin-D genotype polymorphisms. Osteoporos. Int. 9:55-64[Medline].

GUO, S. W. and E. A. THOMPSON, 1992  Performing the exact test of Hardy-Weinberg proportion for multiple alleles. Biometrics 48:361-372[Medline].

HARTL, D. L., and A. G. CLARK, 1989 Principles of Population Genetics. Sinauer Associates, Sunderland, MA.

HEBERT, P. D. N., 1987  Genetics of Daphnia. Mem. Ist. Ital. Idrobiol. 45:439-460.

HERNANDEZ, J. L. and B. S. WEIR, 1989  A disequilibrium coefficient approach to Hardy-Weinberg testing. Biometrics 45:53-70[Medline].

HOLDEN, C., 1994  A cautionary genetic tale: the sobering story of D2. Science 264:1696-1697[Free Full Text].

JOHNSON, N. L., S. KOTZ and N. BALAKRISHNAN, 1994 Continuous Univariate Distributions I. John Wiley & Sons, New York.

JOHNSON, N. L., S. KOTZ and N. BALAKRISHNAN, 1995 Continuous Univariate Distributions II. John Wiley & Sons, New York.

LANDER, E. S. and N. J. SCHORK, 1994  Genetic dissection of complex traits. Science 265:2037-2048[Abstract/Free Full Text].

LOUIS, E. J. and E. R. DEMPSTER, 1987  An exact test for Hardy-Weinberg and multiple alleles. Biometrics 43:805-811[Medline].

LING, R. F., 1978  A study of the accuracy of some approximations for t, {chi}2, and F tail probabilities. J. Am. Stat. Assoc. 73:274-283.

LYNCH, M., and K. SPITZE, 1994 Evolutionary genetics of Daphnia, pp. 109–128 in Ecological Genetics, edited by L. REAL. Princeton University Press, Princeton, NJ.

MORRISON, N. A., J. C. QI, A. TOKITA, P. J. KELLY, and L. CROFTS et al., 1994  Prediction of bone density from vitamin D receptor alleles. Nature 367:284-287[Medline].

NAM, J. M., 1997  Testing a genetic equilibrium across strata. Ann. Hum. Genet. 61:163-170[Medline].

NEI, M., 1975 Molecular Population Genetics and Evolution. North-Holland/American Elsevier, Amsterdam.

NIELSEN, D. M., M. G. EHM, and B. S. WEIR, 1999  Detecting marker-disease association by testing for Hardy-Weinberg disequilibrium at a marker locus. Am. J. Hum. Genet. 63:1531-1540.

OTT, J., 1991 Analysis of Human Genetic Linkage. Johns Hopkins University Press, Baltimore.

PAGE, G. P. and C. I. AMOS, 1999  Comparison of linkage-disequilibrium methods for localization of genes influencing quantitative traits in humans. Am. J. Hum. Genet. 64:1194-1205[Medline].

PATO, C. N., F. MACCIARDI, M. T. PATO, M. VERGA, and J. L. KENNEDY, 1993  Review of the putative association of dopamine D2 receptor and alcoholism: a meta-analysis. Am. J. Med. Genet. 48:78-82[Medline].

PEACOCK, M., 1995  Vitamin D receptor gene alleles and osteoporosis: a contrasting view. J. Bone Miner. Res. 10:1294-1297[Medline].

PEARSON, E. S., 1959  Note on an approximation to the distribution of noncentral {chi}2. Biometrika 46:364.

PRITCHARD, J. K. and N. A. ROSENBERG, 1999  Use of unlinked genetics markers to detect population stratification in association studies. Am. J. Hum. Genet. 65:220-228[Medline].

PRITCHARD, J. K., M. STEPHENS, and P. DONNELLY, 2000a  Inference of population admixture using multilocus genotype data. Genetics 155:945-959[Abstract/Free Full Text].

PRITCHARD, J. K., M. STEPHENS, N. A. ROSENBERG, and P. DONNELLY, 2000b  Association mapping in structured populations. Am. J. Hum. Genet. 67:170-181[Medline].

RISCH, N. and J. TENG, 1998  The relative power of family-based and case-control designs for linkage disequilibrium studies of complex human diseases. I. DNA polling. Genome Res. 8:1273-1288[Abstract/Free Full Text].

SCHAID, D. J. and S. J. JACOBSEN, 1999  Biased tests of association: comparisons of allele frequencies when departing from Hardy-Weinberg proportions. Am. J. Epidemiol. 149:706-711[Abstract/Free Full Text].

SHOEMAKER, J., I. PAINTER, and B. S. WEIR, 1998  A Bayesian characterization of Hardy-Weinberg disequilibrium. Genetics 149:2079-2088[Abstract/Free Full Text].

SPIELMAN, R. S., R. E. MCGINNIS, and W. J. EWENS, 1993  Transmission test for linkage disequilibrium: the insulin gene region and insulin-dependent diabetes mellitus (IDDM). Am. J. Hum. Genet. 52:506-516[Medline].

TIRET, L. and F. CAMBIEN, 1995  Letter: Departure from Hardy-Weinberg equilibrium should be systematically tested in studies of association between genetic markers and disease. Circulation 92:3364-3365.

WEIR, B. S., 1996 Genetic Data Analysis II. Sinauer, Sunderland, MA.

WILSON, E. B. and M. M. HILFERTY, 1931  The distribution of chi-square. Proc. Natl. Acad. Sci. USA 17:684-688[Free Full Text].

XIONG, M. M., J. KRUSHKAL, and E. BOERWINKLE, 1998  TDT statistics for mapping quantitative trait loci. Ann. Hum. Genet. 62:431-452[Medline].




This article has been cited by other articles:


Home page
Hum Mol GenetHome page
K.-D. Yu, G.-H. Di, W.-T. Yuan, L. Fan, J. Wu, Z. Hu, Z.-Z. Shen, Y. Zheng, W. Huang, and Z.-M. Shao
Functional polymorphisms, altered gene expression and genetic association link NRH:quinone oxidoreductase 2 to breast cancer with wild-type p53
Hum. Mol. Genet., July 1, 2009; 18(13): 2502 - 2517.
[Abstract] [Full Text] [PDF]


Home page
FASEB J.Home page
K.-D. Yu, G.-H. Di, L. Fan, J. Wu, Z. Hu, Z.-Z. Shen, W. Huang, and Z.-M. Shao
A functional polymorphism in the promoter region of GSTM1 implies a complex role for GSTM1 in breast cancer
FASEB J, July 1, 2009; 23(7): 2274 - 2287.
[Abstract] [Full Text] [PDF]


Home page
GutHome page
H-L Wong, W-P Koh, N M Probst-Hensch, D Van den Berg, M C Yu, and S A Ingles
Insulin-like growth factor-1 promoter polymorphisms and colorectal cancer: a functional genomics approach
Gut, August 1, 2008; 57(8): 1090 - 1096.
[Abstract] [Full Text] [PDF]


Home page
HypertensionHome page
K. J. Meyers, T. H. Mosley, E. Fox, E. Boerwinkle, D. K. Arnett, R. B. Devereux, and S. L.R. Kardia
Genetic Variations Associated With Echocardiographic Left Ventricular Traits in Hypertensive Blacks
Hypertension, May 1, 2007; 49(5): 992 - 999.
[Abstract] [Full Text] [PDF]


Home page
Int J EpidemiolHome page
A. Evans, V. Salomaa, S. Kulathinal, K. Asplund, F. Cambien, M. Ferrario, M. Perola, L. Peltonen, D. Shields, H. Tunstall-Pedoe, et al.
MORGAM (an international pooling of cardiovascular cohorts)
Int. J. Epidemiol., February 1, 2005; 34(1): 21 - 27.
[Full Text] [PDF]


Home page
Cancer Epidemiol. Biomarkers Prev.Home page
H.-L. Wong, K. DeLellis, N. Probst-Hensch, W.-P. Koh, D. Van Den Berg, H.-P. Lee, M. C. Yu, and S. A. Ingles
A New Single Nucleotide Polymorphism in the Insulin-Like Growth Factor I Regulatory Region Associates with Colorectal Cancer Risk in Singapore Chinese
Cancer Epidemiol. Biomarkers Prev., January 1, 2005; 14(1): 144 - 151.
[Abstract] [Full Text] [PDF]


Home page
HypertensionHome page
M. van Onna, A. A. Kroon, A. J.H.M. Houben, D. Koster, M. P.A. Zeegers, L. H.G. Henskens, A. W. Plat, H. E.J.H. Stoffers, and P. W. de Leeuw
Genetic Risk of Atherosclerotic Renal Artery Disease: The Candidate Gene Approach in a Renal Angiography Cohort
Hypertension, October 1, 2004; 44(4): 448 - 453.
[Abstract] [Full Text] [PDF]


Home page
Physiol. GenomicsHome page
J. Kaput and R. L. Rodriguez
Nutritional genomics: the next frontier in the postgenomic era
Physiol Genomics, January 15, 2004; 16(2): 166 - 177.
[Abstract] [Full Text] [PDF]


Home page
Cancer Epidemiol. Biomarkers Prev.Home page
A. Koushik, R. W. Platt, and E. L. Franco
p53 Codon 72 Polymorphism and Cervical Neoplasia: A Meta-Analysis Review
Cancer Epidemiol. Biomarkers Prev., January 1, 2004; 13(1): 11 - 22.
[Abstract] [Full Text] [PDF]


Home page
Int J EpidemiolHome page
W.-C. Lee
Detecting population stratification using a panel of single nucleotide polymorphisms
Int. J. Epidemiol., December 1, 2003; 32(6): 1120 - 1120.
[Full Text] [PDF]


Home page
GeneticsHome page
J. Hernandez-Sanchez, P. Visscher, G. Plastow, and C. Haley
Candidate Gene Analysis for Quantitative Traits Using the Transmission Disequilibrium Test: The Example of the Melanocortin 4-Receptor in Pigs
Genetics, June 1, 2003; 164(2): 637 - 644.
[Abstract] [Full Text] [PDF]


Home page
J. Clin. Endocrinol. Metab.Home page
H.-W. Deng, F.-H. Xu, Q.-Y. Huang, H. Shen, H. Deng, T. Conway, Y.-J. Liu, Y.-Z. Liu, J.-L. Li, H.-T. Zhang, et al.
A Whole-Genome Linkage Scan Suggests Several Genomic Regions Potentially Containing Quantitative Trait Loci for Osteoporosis
J. Clin. Endocrinol. Metab., November 1, 2002; 87(11): 5151 - 5159.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
H.-W. Deng
Population Admixture May Appear to Mask, Change or Reverse Genetic Effects of Genes Underlying Complex Traits
Genetics, November 1, 2001; 159(3): 1319 - 1323.
[Abstract] [Full Text] [PDF]