Genetics, Vol. 178, 1709-1723, March 2008, Copyright © 2008
doi:10.1534/genetics.107.080101

Efficient Control of Population Structure in Model Organism Association Mapping

* Department of Computer Science and Engineering and {dagger} Bioinformatics Program, University of California, San Diego, California 92093, {ddagger} Broad Institute of Harvard and MIT, Cambridge, Massachusetts 02141, § Center for Human Genetic Research, Massachusetts General Hospital, Boston, Massachusetts 02114, ** Microsoft Research, Redmond, Washington 98052 and {dagger}{dagger} Department of Computer Science and Department of Human Genetics, University of California, Los Angeles, California 90095

1 Corresponding author: Department of Computer Science and Department of Human Genetics, Mail Code 1596, 3532-J Boelter Hall, University of California, Los Angeles, CA 90095-1596.
E-mail: eeskin{at}cs.ucla.edu

Genomewide association mapping in model organisms such as inbred mouse strains is a promising approach for the identification of risk factors related to human diseases. However, genetic association studies in inbred model organisms are confronted by the problem of complex population structure among strains. This induces inflated false positive rates, which cannot be corrected using standard approaches applied in human association studies such as genomic control or structured association. Recent studies demonstrated that mixed models successfully correct for the genetic relatedness in association mapping in maize and Arabidopsis panel data sets. However, the currently available mixed-model methods suffer from computational inefficiency. In this article, we propose a new method, efficient mixed-model association (EMMA), which corrects for population structure and genetic relatedness in model organism association mapping. Our method takes advantage of the specific nature of the optimization problem in applying mixed models for association mapping, which allows us to substantially increase the computational speed and reliability of the results. We applied EMMA to in silico whole-genome association mapping of inbred mouse strains involving hundreds of thousands of SNPs, in addition to Arabidopsis and maize data sets. We also performed extensive simulation studies to estimate the statistical power of EMMA under various SNP effects, varying degrees of population structure, and differing numbers of multiple measurements per strain. Despite the limited power of inbred mouse association mapping due to the limited number of available inbred strains, we are able to identify significantly associated SNPs, which fall into known QTL or genes identified through previous studies while avoiding an inflation of false positives. An R package implementation and webserver of our EMMA method are publicly available.


Related articles in Genetics:

ISSUE HIGHLIGHTS

Genetics 2008 178: NP. [Full Text]  



This article has been cited by other articles:


Home page
IBMS BoneKEyHome page
C. L. Ackert-Bicknell and C. J. Rosen
The Future of Mouse Genetics in Osteoporosis Research
IBMS BoneKEy, June 1, 2009; 6(6): 200 - 209.
[Abstract] [Full Text] [PDF]


Home page
Crop Sci.Home page
C. H. Sneller, D. E. Mather, and S. Crepieux
Analytical Approaches and Population Types for Finding and Utilizing QTL in Complex Plant Populations
Crop Sci., March 17, 2009; 49(2): 363 - 380.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
D. M. Gatti, A. A. Shabalin, T.-C. Lam, F. A. Wright, I. Rusyn, and A. B. Nobel
FastMap: Fast eQTL mapping in homozygous populations
Bioinformatics, February 15, 2009; 25(4): 482 - 489.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
H. M. Kang, C. Ye, and E. Eskin
Accurate Discovery of Expression Quantitative Trait Loci Under Confounding From Spurious and Genuine Regulatory Hotspots
Genetics, December 1, 2008; 180(4): 1909 - 1925.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
M. Li, C. Reilly, and T. Hanson
A semiparametric test to detect associations between quantitative traits and candidate genes in structured populations
Bioinformatics, October 15, 2008; 24(20): 2356 - 2362.
[Abstract] [Full Text] [PDF]