Genetics, Vol. 164, 1597-1606, August 2003, Copyright © 2003

Combining Gene Expression and Molecular Marker Information for Mapping Complex Trait Genes: A Simulation Study

Miguel Pérez-Encisoa, Miguel A. Torob, Michel Tenenhausc, and Daniel Gianolad
a Station d'Amélioration Génétique des Animaux, INRA, BP 27, 31326 Castanet-Tolosan, France,
b Departamento de Mejora Genética Animal, INIA, 28040 Madrid, Spain,
c HEC School of Management, 78352 Jouy-en-Josas, France
d Department of Animal Sciences, University of Wisconsin, Madison, Wisconsin 53706

Corresponding author: Miguel Pérez-Enciso, BP 27, 31326 Castanet-Tolosan, France., mperez{at}toulouse.inra.fr (E-mail)

Communicating editor: J. B. WALSH

A method for mapping complex trait genes using cDNA microarray and molecular marker data jointly is presented and illustrated via simulation. We introduce a novel approach for simulating phenotypes and genotypes conditionally on real, publicly available, microarray data. The model assumes an underlying continuous latent variable (liability) related to some measured cDNA expression levels. Partial least-squares logistic regression is used to estimate the liability under several scenarios where the level of gene interaction, the gene effect, and the number of cDNA levels affecting liability are varied. The results suggest that: (1) the usefulness of microarray data for gene mapping increases when both the number of cDNA levels in the underlying liability and the QTL effect decrease and when genes are coexpressed; (2) the correlation between estimated and true liability is large, at least under our simulation settings; (3) it is unlikely that cDNA clones identified as significant with partial least squares (or with some other technique) are the true responsible cDNAs, especially as the number of clones in the liability increases; (4) the number of putatively significant cDNA levels increases critically if cDNAs are coexpressed in a cluster (however, the proportion of true causal cDNAs within the significant ones is similar to that in a no-coexpression scenario); and (5) data reduction is needed to smooth out the variability encountered in expression levels when these are analyzed individually.





This article has been cited by other articles:


Home page
GeneticsHome page
P. Schliekelman
Statistical Power of Expression Quantitative Trait Loci for Mapping of Complex Trait Loci in Natural Populations
Genetics, April 1, 2008; 178(4): 2201 - 2216.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
A.-L. Boulesteix and K. Strimmer
Partial least squares: a versatile tool for the analysis of high-dimensional genomic data
Brief Bioinform, January 1, 2007; 8(1): 32 - 44.
[Abstract] [Full Text] [PDF]