Originally published as Genetics Published Articles Ahead of Print on March 18, 2009.
Genetics, Vol. 182, 355-364, May 2009, Copyright © 2009
doi:10.1534/genetics.108.098277
Factors Affecting Accuracy From Genomic Selection in Populations Derived From Multiple Inbred Lines: A Barley Case Study
Shengqiang Zhong*, Jack C. M. Dekkers, Rohan L. Fernando and Jean-Luc Jannink1
* Department of Agronomy, Iowa State University, Ames, Iowa 50011,
Department of Animal Science and Center for Integrated Animal Genomics, Iowa State University, Ames, Iowa 50011 and
U.S. Department of Agriculture–Agricultural Research Service, Robert W. Holley Center for Agriculture and Health, Ithaca, New York 14853
1 Corresponding author: USDA-ARS, Robert W. Holley Center for Agriculture and Health, Ithaca, NY 14853-2901.
E-mail: jean-luc.jannink{at}ars.usda.gov
We compared the accuracies of four genomic-selection prediction methods as affected by marker density, level of linkage disequilibrium (LD), quantitative trait locus (QTL) number, sample size, and level of replication in populations generated from multiple inbred lines. Marker data on 42 two-row spring barley inbred lines were used to simulate high and low LD populations from multiple inbred line crosses: the first included many small full-sib families and the second was derived from five generations of random mating. True breeding values (TBV) were simulated on the basis of 20 or 80 additive QTL. Methods used to derive genomic estimated breeding values (GEBV) were random regression best linear unbiased prediction (RR–BLUP), Bayes-B, a Bayesian shrinkage regression method, and BLUP from a mixed model analysis using a relationship matrix calculated from marker data. Using the best methods, accuracies of GEBV were comparable to accuracies from phenotype for predicting TBV without requiring the time and expense of field evaluation. We identified a trade-off between a method's ability to capture marker-QTL LD vs. marker-based relatedness of individuals. The Bayesian shrinkage regression method primarily captured LD, the BLUP methods captured relationships, while Bayes-B captured both. Under most of the study scenarios, mixed-model analysis using a marker-derived relationship matrix (BLUP) was more accurate than methods that directly estimated marker effects, suggesting that relationship information was more valuable than LD information. When markers were in strong LD with large-effect QTL, or when predictions were made on individuals several generations removed from the training data set, however, the ranking of method performance was reversed and BLUP had the lowest accuracy.
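As a rough illustration of the quantities named in the abstract, the sketch below simulates inbred lines, builds TBV from 20 additive QTL, fits a ridge-regression (RR-BLUP-style) model to marker data, and reports accuracy as the correlation between GEBV and TBV in lines held out of training. It is a toy under stated assumptions, not the authors' simulation or software: the eight random founder lines, the mosaic recombination scheme, the sample sizes, the fixed shrinkage parameter `lam`, and every function name are hypothetical choices made for this example.

```python
import random

def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[piv] = m[piv], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = m[r][n] - sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = s / m[r][r]
    return x

def correlation(u, v):
    """Pearson correlation; used here as the accuracy of GEBV for TBV."""
    n = len(u)
    mu, mv = sum(u) / n, sum(v) / n
    cov = sum((a - mu) * (b - mv) for a, b in zip(u, v))
    var_u = sum((a - mu) ** 2 for a in u)
    var_v = sum((b - mv) ** 2 for b in v)
    return cov / (var_u * var_v) ** 0.5

def simulate_lines(rng, n_lines, n_loci, n_founders=8, switch_prob=0.05):
    """Assumed toy population: each inbred line is a mosaic of two founders."""
    founders = [[rng.choice((-1, 1)) for _ in range(n_loci)]
                for _ in range(n_founders)]
    lines = []
    for _ in range(n_lines):
        pair = rng.sample(founders, 2)
        cur = rng.randrange(2)
        geno = []
        for locus in range(n_loci):
            if rng.random() < switch_prob:  # crossover between adjacent loci
                cur = 1 - cur
            geno.append(pair[cur][locus])
        lines.append(geno)
    return lines

def rr_blup(Z, y, lam):
    """Ridge marker effects: beta = Zc' (Zc Zc' + lam I)^-1 (y - mean(y)).

    Zc Zc' acts as an unscaled marker-based relationship matrix.
    """
    n, p = len(Z), len(Z[0])
    mean_y = sum(y) / n
    col_means = [sum(row[k] for row in Z) / n for k in range(p)]
    Zc = [[row[k] - col_means[k] for k in range(p)] for row in Z]
    yc = [v - mean_y for v in y]
    K = [[sum(a * b for a, b in zip(Zc[i], Zc[j])) + (lam if i == j else 0.0)
          for j in range(n)] for i in range(n)]
    alpha = solve(K, yc)
    beta = [sum(Zc[i][k] * alpha[i] for i in range(n)) for k in range(p)]
    return mean_y, col_means, beta

def run(seed=1, n_train=60, n_test=40, n_markers=100, n_qtl=20):
    rng = random.Random(seed)
    n_loci = n_markers + n_qtl
    lines = simulate_lines(rng, n_train + n_test, n_loci)
    qtl = set(rng.sample(range(n_loci), n_qtl))
    effects = {q: rng.gauss(0, 1) for q in qtl}          # additive QTL effects
    markers = [k for k in range(n_loci) if k not in qtl]  # QTL are not observed
    tbv = [sum(g[q] * e for q, e in effects.items()) for g in lines]
    mean_t = sum(tbv) / len(tbv)
    sd = (sum((t - mean_t) ** 2 for t in tbv) / len(tbv)) ** 0.5
    pheno = [t + rng.gauss(0, sd) for t in tbv]  # heritability near 0.5
    Z = [[g[k] for k in markers] for g in lines]
    # lam is an assumed constant, not an estimated variance ratio.
    mean_y, col_means, beta = rr_blup(Z[:n_train], pheno[:n_train],
                                      lam=float(n_markers))
    gebv = [mean_y + sum((z - m) * b for z, m, b in zip(row, col_means, beta))
            for row in Z[n_train:]]
    return correlation(gebv, tbv[n_train:]), beta

if __name__ == "__main__":
    accuracy, _ = run()
    print("accuracy (corr of GEBV with TBV):", round(accuracy, 3))
```

Solving the n-by-n system rather than the marker-by-marker one keeps the example small when markers outnumber lines. The two forms give the same ridge estimates, which is one way to see why marker-effect and relationship-matrix approaches are closely linked.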