Genetics, Vol. 164, 637-644, June 2003, Copyright © 2003

Candidate Gene Analysis for Quantitative Traits Using the Transmission Disequilibrium Test: The Example of the Melanocortin 4-Receptor in Pigs

Jules Hernández-Sáncheza, Peter Visscherb, Graham Plastowc, and Chris Haleya
a Roslin Institute, Roslin, Midlothian EH25 9PS, Scotland, United Kingdom,
b Institute of Cell, Animal and Population Biology, University of Edinburgh, Edinburgh EH9 3JG, Scotland, United Kingdom
c Sygen International, Berkeley, California 94710

Corresponding author: Jules Hernández-Sánchez, Roslin, Midlothian, EH25 9PS, Scotland, United Kingdom., jules.hernandez{at}bbsrc.ac.uk (E-mail)

Communicating editor: M. A. F. NOOR


*  ABSTRACT
*TOP
*ABSTRACT
*METHODS
*RESULTS
*DISCUSSION
*APPENDIX
*LITERATURE CITED

Population-wide associations between loci due to linkage disequilibrium can be used to map quantitative trait loci (QTL) with high resolution. However, spurious associations between markers and QTL can also arise as a consequence of population stratification. Statistical methods that cannot differentiate between loci associations due to linkage disequilibria from those caused in other ways can render false-positive results. The transmission-disequilibrium test (TDT) is a robust test for detecting QTL. The TDT exploits within-family associations that are not affected by population stratification. However, some TDTs are formulated in a rigid form, with reduced potential applications. In this study we generalize TDT using mixed linear models to allow greater statistical flexibility. Allelic effects are estimated with two independent parameters: one exploiting the robust within-family information and the other the potentially biased between-family information. A significant difference between these two parameters can be used as evidence for spurious association. This methodology was then used to test the effects of the fourth melanocortin receptor (MC4R) on production traits in the pig. The new analyses supported the previously reported results; i.e., the studied polymorphism is either causal or in very strong linkage disequilibrium with the causal mutation, and provided no evidence for spurious association.


POPULATION-wide associations between loci due to linkage disequilibrium can be used in high-resolution mapping of quantitative trait loci (QTL). However, spurious associations between markers and QTL can also arise as a consequence of population stratification, for example due to admixture of two different populations. Associations between a genotype and a trait have been frequently tested, after corrections, with simple one-way ANOVA models, e.g., testing mean genotype differences directly. However, these types of analyses are prone to false-positive results due to confounding effects of population stratification/admixture (e.g., DENG et al. 2001 Down). SPIELMAN et al. 1993 Down developed an allele-trait association test called the transmission disequilibrium test (TDT), which is robust to these confounding effects. Different TDTs have since been developed for dichotomous traits (SCHAID 1996 Down; HORVATH and LAIRD 1998 Down; LUNETTA et al. 2000 Down; MARTIN et al. 2000 Down; ZHAO et al. 2000 Down) and for quantitative traits (ALLISON 1997 Down; RABINOWITZ 1997 Down; SZYDA et al. 1998 Down). However, most of these TDTs have been formulated in a rather rigid form, hence reducing the scope for further statistical modeling. In addition, previously described TDTs make use of within-family information only.

This study presents a generalized version of TDT to obtain robust estimates of genetic effects within the statistically more flexible mixed-linear model context. This approach allows maximum-likelihood estimates of genetic effects to be obtained via residual maximum likelihood (REML). Additive allele substitution effects were estimated with two independent regression coefficients (b), the within-families coefficient bTD (transmission disequilibrium) and the between-families coefficient bPD (population disequilibrium). Moreover, the rejection of the null hypothesis bTD = bPD provides evidence for stratification/admixture and hence can be used to guard against false-positive results.

The analysis of the fourth melanocortin receptor (MC4R) locus on pig chromosome 1 constitutes a practical demonstration of this method. The interaction between melanocortins and their receptors (MC3R and MC4R) at the hypothalamus is one of the main neuro-endocrinological pathways controlling energy balance (WARDLAW 2001 Down). In humans, different allelic variants of both MC4R and MC3R have been associated with obesity (VAISSE et al. 1998 Down; YEO et al. 1998 Down; HINNEY et al. 1999 Down; LI et al. 2000 Down). In pigs, the seventh transmembrane region of the MC4R locus contains a mutation at codon 298 that causes a change of aspartic acid for asparagine, i.e., Asp298Asn (KIM et al. 1999 Down). This region is highly conserved across all four types of melanocortin receptors in humans (GANTZ et al. 1993 Down), and it is also very conserved between pigs and humans (KIM et al. 2000 Down). The Asp298Asn mutation has been associated with fatter and faster-growing pigs, having significant effects on back fat, days to 110 kg, test daily gain, and daily food intake in a study involving four different commercial pig lines from nucleus breeding farms (KIM et al. 1999 Down, KIM et al. 2000 Down). However, the original analyses were performed with methods that are potentially biased in the presence of population stratification. Here we analyze an augmented data set using the new methodology to confirm and extend the original findings.


*  METHODS
*TOP
*ABSTRACT
*METHODS
*RESULTS
*DISCUSSION
*APPENDIX
*LITERATURE CITED

Data:
Performance traits were recorded on four different commercial PIC pig lines in the same farm over a 5-year period (1993–1998). The traits were lifetime daily gain (LDG), test daily gain (TDG), daily food intake (DFI), and back fat depth (BF) at the 10th rib. All pigs were performance tested for growth over a fixed period of 12 weeks, during which they were fed ad lib. and weighed at the beginning (on-test) and at the end (off-test) of that period. TDG was calculated as off-test weight minus on-test weight divided by the number of days on test. LDG was calculated as off-test weight minus one (assumed the average birth weight) divided by the age of the pig (in days) at off-test. BF was measured ultrasonically in real time at off-test, and it was normalized with the natural log transformation. DFI was electronically recorded for some pigs over the testing period. The sample sizes by line and sex are given in Table 1. In this data set, there were 726 extra records of BF, 574 of TDG, and 44 of DFI, with respect to the data set analyzed by KIM et al. 2000 Down. The Asp298Asn substitution mutation is located within a TaqI restriction enzyme recognition site (KIM et al. 1999 Down), which was used to generate a codominant restriction fragment length polymorphism (RFLP) to distinguish all three genotypic classes (KIM et al. 2000 Down).


 
View this table:
In this window
In a new window

 
Table 1. Number of pigs (males/females) used in the analyses within each trait, line, and MC4R genotype

Statistical models:
The effects of the Asp298Asn mutation on pig production traits were estimated with the models used in KIM et al. 2000 Down, which are not robust to population stratification/admixture, and with new robust models. The latter models were a combination of the former models and a TDT (RABINOWITZ 1997 Down). The polygenic variance was estimated, fitting sire as a random factor. This model is computationally faster than fitting each animal as a random factor, and the two models were very similar in terms of variance components estimation (results not shown).

ANOVA: The original models included sex, batch, line, and genotype as fixed factors and sire as random effect. BF records were analyzed both in the original scale and in the log-transformed scale. The skewness and kurtosis of the untransformed distribution of BF were 0.88 (±0.049) and 1.96 (±0.099), respectively. After transformation the distribution of BF became more normal (skewness = -0.03 ± 0.049, kurtosis = 0.32 ± 0.099). All two-way interactions between fixed factors were also included in the analyses. Nonsignificant factors were dropped out of the models using a backward elimination procedure. The coefficients for genotypes can be found in column A in Table 2. The analyses were performed with the REML procedure in GENSTAT (PAYNE et al. 2001 Down). This method of estimating allele effects is not robust to stratification/admixture (HERNANDEZ-SANCHEZ et al. 2002B Down). We refer to this as the ANOVA method.


 
View this table:
In this window
In a new window

 
Table 2. Parameterization of covariates A, TD, and PD given family genotypes

Batches as random: There were 54 batch means to estimate when they were fitted as fixed effects in the models. To avoid this unnecessary loss of degrees of freedom, batches were fitted as a random term where direct comparisons were being made with the TDT, and their effect was accounted for with cubic splines. This procedure was feasible because all trait means followed a yearly cycle when plotted against batches. The correction uses up only 2 d.f.: one in fitting a linear regression across all batches and a second in estimating the residual variance around the previous line. The software used to run models with batch as random splines was ASREML (GILMOUR et al. 2001 Down). This approach was also implemented in the TDT analyses (see below).

TDT: A robust analysis of the Asp298Asn mutation was performed with the same models but substituting genotype for two independent fixed covariates. One of these two covariates was based on a TDT (RABINOWITZ 1997 Down). Given a biallelic marker, each individual's genotype received a coefficient equal to H{male}(T{male} - 1/2) + H{female}(T{female} - 1/2); where H{male} = 1 if the individual's sire was heterozygous, or 0 otherwise; T{male} = 1 if the sire had transmitted allele 1 to the individual, or 0 otherwise; and likewise, H{female} and T{female} for the individual's dam. These coefficients can be found in column TD in Table 2. The slope of this covariate, bTD, is a robust estimate of additive substitution effects of alleles at the locus. Allelic effects were also estimated via a second regression coefficient sensitive to the effects of population structure, bPD (L. L. G. JANSS, personal communication). The appropriate coefficients to estimate bPD were obtained by subtracting column A from column TD in Table 2. We refer to this as the TDT method.

Generating parental genotype data:
The TDT method requires parental genotype data, which in the MC4R data set were mostly missing. Missing parental genotypes were generated using Gibbs sampling (e.g., WANG et al. 1994 Down; SORENSEN and GIANOLA 2002 Down). Gibbs sampling was equivalent to integrating over all genotype probabilities of parents with missing genotypes. Missing parental genotypes were sampled conditional on genotypes of progeny and other relatives. The sampling algorithm has been reported previously in the literature (GUO and THOMPSON 1992 Down; JANSS et al. 1995 Down). Convergence was reached after 103 realizations after a burn-in period of 100 realizations. The autocorrelation in Gibbs sampling was minimized by sampling 1 realization every 50 consecutive ones; hence a total of 50 x 104 realizations were generated. The actual integration over missing genotype probabilities was carried out by analyzing the MC4R data set after each sampled realization and averaging results (i.e., P values) across all realizations.

Simulation:
A simulation study was carried out to investigate the properties of bPD and bTD. Population stratification was generated by sampling from two separated populations with different allele frequencies and analyzing the data jointly. Each population was characterized by: (1) 15 unrelated full-sib families and four offspring per family (60 progeny in total); (2) random mating; (3) a QTL and a linked neutral marker, both biallelic and with allele frequency fixed to 0.9 in one population, and frequencies, at both loci, of 0.9, 0.7, 0.5, 0.3, and 0.1 in the other population; (4) recombination rate between loci of c = 0 or 1/2; (5) standardized linkage disequilibrium in parents (D' = D/Dmax; LEWONTIN 1988 Down) of either 0 or 1; (6) residual variance of 1 and polygenic variance 0; and (7) QTL explaining 5% (and in some cases 10%) of the total phenotypic variance. There were no interpopulation matings and all analyses were performed at the marker locus.


*  RESULTS
*TOP
*ABSTRACT
*METHODS
*RESULTS
*DISCUSSION
*APPENDIX
*LITERATURE CITED

Properties of bPD and bTD in analyses of simulated data:
The power of estimating genetic effects through bPD and bTD in the simulated data is shown in Fig 1, where F-ratios are plotted against the level of stratification S (allele frequency difference between populations) across four different scenarios. Each dot in Fig 1 is the average of 100 replicates. The significance threshold is based on the tabulated nominal 5% threshold and is shown as a straight line. Fig 1A shows the results after analyzing a marker totally unlinked (c = 1/2) and with no association in the population (D' = 0) to a QTL. In this situation, any significant effect is a type I error or due to bias. The FTD (i.e., the F-ratio testing whether a significant amount of the total variation is explained by bTD) is approximately 1 across all S values, which indicates that the marker did not have a significant effect on the trait. On the contrary, FPD (i.e., the corresponding F-ratio test for bPD) appeared positively correlated to the level of S. The effect was significant when S ≥ 0.6. In this case, spurious disequilibrium increases as stratification increases, and bPD cannot distinguish between this sort of disequilibrium and disequilibrium due to linkage.



View larger version (0K):
In this window
In a new window
Download PPT slide
 
Figure A1. (A) Expected mean squares (EMS) for bTD, estimated with a simple linear regression (see text), when frequencies of allele A in two different populations are P1 and P2. (B) EMS for bPD.

Fig 1B shows the results after analyzing a marker totally unlinked (c = 0.5) but in complete disequlibrium (D' = 1) in the parents. Here, FPD is always >FTD, and this difference increases with S. Moreover, the value of FPD was >1 (~2) even without stratification (S = 0). This can be explained by considering that, on average, the level of disequilibrium among offspring was one-half, because D' is expected to be halved every generation assuming no linkage and random mating. This feature suggests that bPD could detect an effect given sufficient linkage disequilibrium between a marker and a QTL, even if these two loci are totally unlinked, whereas bTD needs the joint occurrence of linkage and linkage disquilibrium to estimate an effect.

Fig 1C shows how FPD rapidly increases for S > 0.4, while FTD remains constant and equal to one regardless of S. The marker was totally linked to the QTL locus in this set of simulations (c = 0). Despite having simulated no linkage disequilibrium (D' = 0) within each population, mixing two populations with different allele frequencies will cause haplotype frequencies to depart from the expected equilibrium frequencies. In this simple scenario, the fact that FTD = 1 across all values of S even when c = 0 demonstrates the robustness of the test based on bTD.

Finally, Fig 1D shows the effect of effectively analyzing the QTL itself (D' = 1 and c = 0). The power of estimating bPD increases monotonically with S, a fact consistently observed in all previous graphs. However, the power of estimating bTD reaches its maximum at intermediate levels of S. This is so because intermediate allele frequencies [e.g., at 0.5, represented by S = 0.9 (population 1) - 0.5 (population 2) = 0.4 in Fig 1D] are associated with a higher proportion of heterozygous parents, providing better information to estimate bTD.

Analyses of MC4R data:
ANOVA method: Results were obtained from analyses of the data from the four lines separately and from an overall analysis of the combined data from all lines. Striking differences between genotypes were detected for BF (P < 0.001), TDG (P < 0.001), and LDG (P < 0.001) in the overall (i.e., all lines together) analysis (Table 3). Batches were treated as fixed effects in these analyses to allow us to compare results directly with the findings of KIM et al. 2000 Down. Genotypic differences within all pig lines were confirmed for BF (ranging from P < 0.001 in lines A and C to P < 0.02 in line B) and also within some lines for TDG (ranging from P < 0.001 in line C to P > 0.8 in line B) and LDG (ranging from P < 0.01 in line C to P > 0.5 in line D). No significant effect of the Asp298Asn substitution mutation on DFI was found in this analysis, in spite of the significant difference of 0.17 kg between both homozygotes (P < 0.01) reported by KIM et al. 2000 Down. In this study, the estimated difference in DFI between the two homozygous genotypes, although not significant, was in the same direction as that previously reported at 0.1 kg [standard error of the difference (SED) = 0.063] in the overall analysis.


 
View this table:
In this window
In a new window

 
Table 3. Means by genotype class within and across lines (overall)

TDT method: Only results from the overall analyses of the combined data set are shown. The estimation of bTD and bPD via REML was done on the basis of 103 replications, where each replicate used a different population of parental genotypes generated with Gibbs sampling. The average P values across these analyses are shown in Table 4. The ASREML software does not perform hypothesis testing for fixed effects in the model although it provides t-values and residual degrees of freedom. The P values associated with genotypic contrasts were obtained from the t-distribution. Two independent null hypotheses were of interest: H10, bPD = 0, and H20, bTD = 0; both were rejected for all traits except DFI [H10, (BF) P < 0.0001, (TDG) P < 0.01, (LDG) P < 0.01, and (DFI) P > 0.5; H20, (BF) P < 0.0001, (TDG) P < 0.001, (LDG) P < 0.01, and (DFI) P > 0.2]. For the sake of comparison, Table 4 also incorporates results from the ANOVA method (reported as regression coefficients rather than genotype means as in Table 3). Both the ANOVA and the TDT methods yielded similar results, although the former was a slightly more powerful analysis (the standard errors were generally smaller). Results in Table 3 and Table 4 are of similar magnitude.


 
View this table:
In this window
In a new window

 
Table 4. Average effect of allele substitution for each trait using TDT (PD, TD) and ANOVA


*  DISCUSSION
*TOP
*ABSTRACT
*METHODS
*RESULTS
*DISCUSSION
*APPENDIX
*LITERATURE CITED

The study demonstrated that the mean additive effect of allele substitution calculated with a one-way ANOVA model can be decomposed into the within- and between-family effects. These two effects can be estimated via a flexible REML analysis (PATTERSON and THOMPSON 1971 Down) as the regression coefficients bTD and bPD, respectively, using a mixed-linear model that can also incorporate other fixed and random effects.

Parental genotypes are needed to estimate bTD and bPD. This information was not available for the MC4R data analyzed and was generated via Gibbs sampling (1000 realizations). The new methodology was tested via simulation and real data analysis of the effect of the MC4R gene on pig production traits.

The simulation results can be summarized in three main points. First, bTD extracts information from the within- and bPD from the between-family genetic variances, {sigma}2WF and {sigma}2BF, respectively. Second, bTD is robust and bPD is biased in the presence of population admixture/stratification. Third, there is generally more power to detect a significant bPD != 0 than a significant bTD != 0.

Population stratification/admixture increases {sigma}2BF and not {sigma}2WF. As a consequence, estimates of bPD may be biased in the presence of stratification/admixture, whereas estimates of bTD are not as the simulation results demonstrate. Tests such as the TDT are robust because they exploit only {sigma}2WF. However, the one-way ANOVA model uses both {sigma}2WF and {sigma}2BF, from which a pooled estimate between bPD and bTD is obtained, and because of the latter component of variance, this pooled estimate of allelic effects may be biased if there is population admixture/stratification. In spite of this potential bias producing false-positive results (e.g., Fig 1A and Fig B), robust methods such as the TDT have seldom been used in animal breeding (some exceptions are BINK et al. 2000 Down and HERNANDEZ-SANCHEZ et al. 2002A Down). This may be because TDT is viewed as a not very powerful method (e.g., Fig 1D, with severe limitations in some circumstances, e.g., low frequency of heterozygous parents).

The fact that bTD and bPD exploit different sources of information can be intuitively appreciated by inspecting the coefficients in Table 2. First, the coefficients required in the estimation of bTD are weights given to all individuals with records (e.g., offspring) according to their genotypes and to the genotypes of their parents (i.e., family type). Hence, bTD is the slope of the regression of phenotypes onto explanatory variables that combine both offspring's genotypes and family type. Second, the coefficients required in the estimation of bPD can alternatively be obtained as , where Gij = 1/2, 0, or -1/2 if the genotype of the jth parent in the ith family is 11, 12, or 22, respectively. Therefore bPD is the slope of the regression of phenotypes onto family type.

More explicitly, let us model yij, the phenotype of the jth individual having the ith QTL genotype, as , where µ is the population mean, gi is the effect of the ith QTL genotype on the jth offspring, and aij and eij are the polygenic and residual random terms drawn from two independent normal distributions with zero means and variances {sigma}2A and {sigma}2e, respectively. Let there be random mating, no population stratification, and only additive genetic effects at the QTL locus. Under these circumstances, the total additive genetic variance splits equally between and within families; therefore and , where , and {sigma}2Q is the variance due to the QTL. If a statistical model explains all {sigma}2Q, then the additive genetic effects of the QTL will be fully accounted for. However, if the model estimates additive effects only through either bTD or bPD, then {sigma}2Q will be only partially explained. For example, if bTD is the only estimator of QTL effect, then and as only within-family variation can be used to estimate bTD. If, on the other hand, bPD is the only estimator of QTL effect, then and , as only between-family variation can be used to estimate bPD. These changes in the within- and between-family variances due to estimation of bTD or bPD were validated via computer simulations (data not shown).

The situation is more complex when there is stratification within a population. The Appendix shows how the proportion of the total phenotypic variation explained, fitting a linear regression, differed when either bTD or bPD was estimated in the presence of population stratification.

Simulation results showed that testing the null hypothesis bPD = 0 rather than bTD = 0 tends to produce higher F-ratios when it is not true (Fig 1). Nevertheless, the analysis of MC4R data showed the opposite effect (i.e., t-tests for bTD were always higher than those for bPD in Table 4). This result is possible when there is no admixture/stratification, because bTD is expected to be equivalent to bPD. Moreover, in simulations, the only between-family component was one-half of the total QTL variance, whereas in reality other factors (e.g., litter and sow effects) may also increase {sigma}2BF. The analysis of real data showed that bTD was very similar to bPD across all traits and, furthermore, that both estimates were also similar to allelic effects obtained with ANOVA (Table 4). This suggests that there was no significant stratification/admixture in the population and that there were equivalent amounts of genetic information between and within families.

Where estimates of bTD and bPD differ, a t-test could be used to test the null hypothesis bTD = bPD. If no evidence to reject the null hypothesis were found, i.e., no evidence for admixture/stratification in the population, then a more powerful one-way ANOVA could be safely implemented within the mixed-linear model to estimate allele effects. Otherwise, robust approaches such as the one proposed here, i.e., estimating bTD or other TDTs, should be considered as the only reliable methods of analysis.

A complication of this approach applied to many real data sets, including the MC4R data used as an example, is generating missing parental genotypes. A Gibbs sampling technique to generate 103 replicates was readily implemented (and we know they were accurate enough because 10 times more replicates did not affect the outcome) in this simple scenario: i.e., a single biallelic marker, a maximum of three generations, and few missing data. Other robust tests that do not require parental genotypes, e.g., sib-TDTs (e.g., SCHAID and ROWLAND 1999 Down; SPIELMAN and EWENS 1998 Down), were found to be less powerful than the method outlined here in additional analyses of simulated data [e.g., Fsib = 5.5 vs. FTD-PD = 7.7 (based on 1000 replicates)]. Analyzing more complex or unbalanced data sets will probably demand more realizations of the Gibbs sampler and it will present an additional problem: testing fixed effects in REML, because the asymptotic properties of the t-test or the Wald test could not be guaranteed (see KENWARD and ROGER 1997 Down; WELHAM and THOMPSON 1997 Down; ELSTON 1998 Down). If few fixed effects are fitted in the model, then maximum likelihood rather than REML can be used to test allele effects via likelihood-ratio tests.

There is strong evidence from both this study and KIM et al. 2000 Down that the substitution-mutation Asp298Asn in the MC4R gene affects production traits in the pig, e.g., BF and growth (TDG, LDG). These effects were reestimated with extra records with respect to the original work. Significant effects were found for BF, TDG, and LDG across all lines (P < 0.001), although not for all pig lines (ranging from P < 0.001 to P > 0.8).

DFI was the only trait not significant in this study (P > 0.2) that was significant in the study of KIM et al. 2000 Down, even after using their statistical models. The lack of effect on DFI was unexpected given previously reported results (P < 0.01) and other reported associations of MCR with appetite and feeding behavior in macaques (KOEGLER et al. 2001 Down), mice (BUTLER et al. 2000 Down), and layer chicks (TACHIBANA et al. 2001 Down). One possible explanation for not detecting an effect on DFI is the lack of power due to a small data set (n = 275). Although there were less data (n = 231) in the study of KIM et al. 2000 Down than in this study, the difference is small and additional statistical noise may have been introduced through factors such as data structure and/or chance. An independent study that used a larger data set (n = 619) found a significant (P < 0.05) additive gene effect of 0.075 kg/day on DFI (G. PLASTOW, personal communication).

Further experiments are needed to ascertain whether the Asp298Asn substitution mutation at the MC4R locus is causative. For example, one could test the effect of this and other polymorphisms on the same exon/gene in different segregating populations. Final proof will require molecular experiments, e.g., testing whether different molecular pathways are activated in independent cell cultures having different Asp298Asn variants and/or creating mouse models for testing this polymorphism on identical genetic backgrounds.

This study demonstrates that TDT can be implemented within the REML framework. As a guideline, to maximize the power in an association analysis, one should test associations with ANOVA (within a REML or maximum-likelihood framework) only after having checked that bTD and bPD are not significantly different from each other, i.e., making sure no false-positive results are being caused by population stratification.


*  ACKNOWLEDGMENTS

We thank Dr. R. Pong-Wong for his advice on Gibbs sampling; Dr. O. Southwood, D. Waddington, and anonymous referees for helpful comments that improved earlier versions of this manuscript; and especially Dr. L. L. G. Janss for sharing his ideas with us. We acknowledge Kwan Suk Kim and Max Rothschild (Iowa State University) for providing the MC4R data set and the Biotechnology and Biological Sciences Research Council and PIC International Group for funding this project.

Manuscript received August 15, 2002; Accepted for publication January 23, 2003.


*  APPENDIX
*TOP
*ABSTRACT
*METHODS
*RESULTS
*DISCUSSION
*APPENDIX
*LITERATURE CITED

IMPACT OF STRATIFICATION ON bTD AND bPD
Population stratification affects both the estimation of the effect via bPD and the power of that estimation (SE of bPD). We assume the simplest scenario where a population is divided into two subpopulations of equal size, where mating is at random within each subpopulation, and there is no mating across subpopulations.

The expected mean square of a linear model that regresses phenotypes onto a single explanatory variable X is , where B is the expected regression parameter and {sigma}2e is the residual variance (SOKAL and ROHLF 1995 Down). When n phenotypes are simulated with gene effect (B) and , then . Let us assume that pi is the frequency of allele i at the trait locus in subpopulation 1 and qi is the equivalent frequency in subpopulation 2. Furthermore, let us assume that p11 = (p1)2 is the frequency of genotype 11 in subpopulation 1 and q11 is the frequency of the equivalent genotype in subpopulation 2. It can be shown that E[PD] = (p1 - p2 + q1 - q2)/2 and that . The same process is followed to develop the expected mean squares when using TD; thus , and . These two predictions were very similar to simulation results (not shown). We can see the effect of stratification on E[MSR]TD and E[MSR]PD for n = 100 in Fig 1A and Fig B. Fig 1A takes a hill-type shape (seen from above) where the highest point is at the center and gradually decays in all directions away from the center. Fig 1B takes a valley-type shape (seen from above) where the lowest points are on the diagonal (in fact, Fig 1A and Fig 1B are identical on the diagonal) and quickly rising away from the diagonal. If there is no stratification, e.g., on the diagonal passing through points p1 = q1, then E[MSR]TD = E[MSR]PD, and thus both regression lines are identical. However, when there is stratification (e.g., p1 != q1), E[MSR]PD increases and E[MSR]TD decreases. The effect on E[MSR]TD is not due to stratification itself but rather to a reduction of the information content due to a decrease in the frequency of heterozygous genotypes. Although we have shown the effect of analyzing the trait locus itself, this effect is transferred to a marker as a function of linkage disequilibrium between both loci.


*  LITERATURE CITED
*TOP
*ABSTRACT
*METHODS
*RESULTS
*DISCUSSION
*APPENDIX
*LITERATURE CITED

ALLISON, D. B., 1997  Transmission-disequilibrium tests for quantitative traits. Am. J. Hum. Genet. 60:676-690.[Medline]

BINK, M. C., M. F. TE PAS, F. L. HARDENS, and L. L. JANSS, 2000  A transmission/disequilibrium test approach to screen for quantitative trait loci in two selected lines of large white pigs. Genet. Res. 75:115-121.[Medline]

BUTLER, A. A., R. A. KESTERSON, K. KHONG, M. J. CULLEN, and M. A. PELLEYMOUNTER et al., 2000  A unique metabolic syndrome causes obesity in the melanocortin-3 receptor-deficient mouse. Endocrinology 141:3518-3521.[Abstract/Free Full Text]

DENG, H. W., W. M. CHEN, and R. R. RECKER, 2001  Population admixture: detection by Hardy-Weinberg test and its quantitative effects on linkage-disequilibrium methods for localizing genes underlying complex traits. Genetics 157:885-897.[Abstract/Free Full Text]

ELSTON, D. A., 1998  Estimation of denominator degrees of freedom of F-distributions for assessing Wald statistics for fixed-effect factors in unbalanced mixed models. Biometrics 54:1085-1096.

GANTZ, I., H. MIWA, Y. KONDA, Y. SHIMOTO, and T. TASHIRO et al., 1993  Molecular cloning, expression and gene localization of a 4th melanocortin receptor. J. Biol. Chem. 268:15174-15179.[Abstract/Free Full Text]

GILMOUR, A. R., B. R. CULLIS, S. J. WELHAM and R. THOMPSON, 2001 ASREML Reference Manual. ftp.res.bbsrc.ac.uk in pub/aar.

GUO, S. W. and E. A. THOMPSON, 1992  A Monte Carlo method for combined segregation and linkage analysis. Am. J. Hum. Genet. 51:1111-1126.[Medline]

HERNÁNDEZ-SÁNCHEZ, J., D. WADDINGTON, P. WIENER, C. S. HALEY, and J. L. WILLIAMS, 2002a  Genome-wide search for markers associated with bovine spongiform encephalopathy. Mamm. Genome 13:164-168.[Medline]

HERNÁNDEZ-SÁNCHEZ, J., C. S. HALEY and P. M. VISSCHER, 2002b Power of association and transmission disequilibrium tests. World Congress of Genetics Applied to Livestock Production, Communication 21–24, Montpellier, France.

HINNEY, A., A. SCHMIDT, K. NOTTEBOM, O. HEIBULT, and I. BECKER et al., 1999  Several mutations in the melanocortin-4 receptor gene including a nonsense and a frameshift mutation associated with dominantly inherited obesity in humans. J. Clin. Endocrinol. Metab. 84:1483-1486.[Abstract/Free Full Text]

HORVATH, S. and N. M. LAIRD, 1998  A discordant-sibship test for disequilibrium and linkage: no need for parental data. Am. J. Hum. Genet. 63:1886-1897.[Medline]

JANSS, L. L. G., R. THOMPSON, and J. A. M. VAN ARENDONK, 1995  Application of Gibbs sampling for inference in a mixed major gene-polygenic inheritance model in animal populations. Theor. Appl. Genet. 91:1137-1147.

KENWARD, M. G. and J. H. ROGER, 1997  Small sample inference for fixed effects from restricted maximum likelihood. Biometrics 53:983-997.[Medline]

KIM, K. S., H. J. LARSEN, and M. F. ROTHSCHILD, 1999  Rapid communication: linkage and physical mapping of the porcine melanocortin-4 receptor (MC4R) gene. J. Anim. Sci. 78:791.

KIM, K. S., N. LARSEN, T. SHORT, G. PLASTOW, and M. F. ROTHSCHILD, 2000  A missense variant of the porcine melanocortin-4 receptor (MC4R) gene is associated with fatness, growth and feed intake traits. Mamm. Genome 11:131-135.[Medline]

KOEGLER, F. H., K. L. GROVE, A. SHIFFMACHER, M. S. SMITH, and J. L. CAMERON, 2001  Central melanocortin receptors mediate changes in food intake in the rhesus macaque. Endocrinology 142:2586-2592.[Abstract/Free Full Text]

LEWONTIN, R. C., 1988  On measures of gametic disequilibrium. Genetics 120:849-852.[Abstract/Free Full Text]

LI, W. D., E. J. JOO, E. B. FURLONG, M. GALVIN, and K. ABEL et al., 2000  Melanocortin 3 receptor (MC3R) gene variants in extremely obese women. Int. J. Obes. Metab. Dis. 24:206-210.

LUNETTA, K. L., S. V. FARAONE, J. BIEDERMAN, and N. M. LAIRD, 2000  Family-based tests of association and linkage that use unaffected sibs, covariates, and interactions. Am. J. Hum. Genet. 66:605-614.[Medline]

MARTIN, E. R., S. A. MONKS, L. L. WARREN, and N. L. KAPLAN, 2000  A test for linkage and association in general pedigrees: the pedigree disequilibrium test. Am. J. Hum. Genet. 67:146-154.[Medline]

PATTERSON, H. D. and R. THOMPSON, 1971  Recovery of inter-block information when block sizes are unequal. Biometrika 58:545-554.[Abstract/Free Full Text]

PAYNE, R. W., P. W. LANE, P. G. N. DIGBY, S. A. HARDING, P. K. LEECH et al., 2001 Reference Manual. GENSTAT 5 Release 4.2, Oxford University Press, Oxford.

RABINOWITZ, D., 1997  A transmission disequilibrium test for quantitative trait loci. Hum. Hered. 47:342-350.[Medline]

SCHAID, D. J., 1996  General score tests for associations of genetic markers with disease using cases and their parents. Genet. Epidemiol. 13:423-449.[Medline]

SCHAID, D. J. and C. M. ROWLAND, 1999  Quantitative trait transmission disequilibrium test: allowance for missing parents. Genet. Epidemiol. 17:S307-S312.

SOKAL, R. R., and R. J. ROHLF, 1995 Biometry. W. H. Freeman, New York.

SORENSEN, D., and D. GIANOLA, 2002 Likelihood, Bayesian, and MCMC Methods in Quantitative Genetics. Springer Verlag, New York.

SPIELMAN, R. S. and W. J. EWENS, 1998  A sibship test for linkage in the presence of association: the sib transmission/disequilibrium test. Am. J. Hum. Genet. 62:450-458.[Medline]

SPIELMAN, R. S., R. E. MCGINNIS, and W. J. EWENS, 1993  Transmission test for linkage disequilibrium: the insulin gene region and insulin-dependent diabetes mellitus (IDDM). Am. J. Hum. Genet. 52:506-516.[Medline]

SZYDA, J., Z. LIU and V. WILD, 1998 Application of the transmission-disequilibrium test to detection of major genes, 49th Annual Meeting of the European Association for Animal Production, Warsaw.

TACHIBANA, T., K. SUGAHARA, A. OHGUSHI, R. ANDO, and S. KAWAKAMI et al., 2001  Intracerebroventricular injection of agouti-related protein attenuates the anorexigenic effect of alpha-melanocyte stimulating hormone in neonatal chicks. Neurosci. Lett. 305:131-134.[Medline]

VAISSE, C., K. CLEMENT, B. GUY-GRAND, and P. FROGUEL, 1998  A frameshift mutation in human MC4R is associated with a dominant form of obesity. Nat. Genet. 20:113-114.[Medline]

WANG, C. S., J. J. RUTLEDGE, and D. GIANOLA, 1994  Bayesian analysis of mixed linear models via Gibbs sampling with an application to litter size in Iberian pigs. Genet. Sel. Evol. 26:91-115.

WARDLAW, S. L., 2001  Obesity as a neuroendocrine disease: lessons to be learned from proopiomelanocortin and melanocortin receptor mutations in mice and men. J. Clin. Endocrinol. Metab. 86:1442-1446.[Free Full Text]

WELHAM, S. J. and R. THOMPSON, 1997  Likelihood ratio tests for fixed model terms using residual maximum likelihood. J. R. Stat. Soc. Ser. B 59:701-714.

YEO, G. S., I. S. FAROOQUI, S. AMINIAN, D. J. HALSALL, and R. G. STANHOPE et al., 1998  A frameshift mutation in MC4R associated with dominantly inherited human obesity. Nat. Genet. 20:111-112.[Medline]

ZHAO, H., S. ZHANG, K. R. MERIKANGAS, M. TRIXLER, and D. B. WILDENAUER et al., 2000  Transmission/disequilibrium tests using multiple tightly linked markers. Am. J. Hum. Genet. 67:936-946.[Medline]




This article has been cited by other articles:


Home page
ScienceHome page
J. Gratten, A. J. Wilson, A. F. McRae, D. Beraldi, P. M. Visscher, J. M. Pemberton, and J. Slate
A Localized Negative Genetic Correlation Constrains Microevolution of Coat Color in Wild Sheep
Science, January 18, 2008; 319(5861): 318 - 320.
[Abstract] [Full Text] [PDF]


Home page
J ANIM SCIHome page
H. A. M. van der Steen, G. F. W. Prall, and G. S. Plastow
Application of genomics to the pork industry
J Anim Sci, June 1, 2005; 83(13_suppl): E1 - 8.
[Abstract] [Full Text] [PDF]