- THIS ARTICLE
- Full Text
- Full Text (PDF)
- Supporting Information
-
All Versions of this Article:
genetics.109.100362v1
182/1/79 most recent - Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Email this article to a friend
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Chun, H.
- Articles by Keles, S.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Chun, H.
- Articles by Keles, S.
Originally published as Genetics Published Articles Ahead of Print on March 6, 2009.
Genetics, Vol. 182, 79-90, May 2009, Copyright © 2009
doi:10.1534/genetics.109.100362
Expression Quantitative Trait Loci Mapping With Multivariate Sparse Partial Least Squares Regression
Hyonho Chun* and
Sündüz Kele
*,
,1
* Department of Statistics and
Department of Biostatistics and Medical Informatics, University of Wisconsin, Madison, Wisconsin 53705
1 Corresponding author: Department of Statistics, University of Wisconsin, 1300 University Ave., Madison, WI 53705.
E-mail: keles{at}stat.wisc.edu
Expression quantitative trait loci (eQTL) mapping concerns finding genomic variation to elucidate variation of expression traits. This problem poses significant challenges due to high dimensionality of both the gene expression and the genomic marker data. We propose a multivariate response regression approach with simultaneous variable selection and dimension reduction for the eQTL mapping problem. Transcripts with similar expression are clustered into groups, and their expression profiles are viewed as a multivariate response. Then, we employ our recently developed sparse partial least-squares regression methodology to select markers associated with each cluster of genes. We demonstrate with extensive simulations that our eQTL mapping with multivariate response sparse partial least-squares regression (M-SPLS eQTL) method overcomes the issue of multiple transcript- or marker-specific analyses, thereby avoiding potential elevation of type I error. Additionally, joint analysis of multiple transcripts by multivariate response regression increases power for detecting weak linkages. We illustrate that M-SPLS eQTL compares competitively with other approaches and has a number of significant advantages, including the ability to handle highly correlated genotype data and computational efficiency. We provide an application of this methodology to a mouse data set concerning obesity and diabetes.