Originally published as Genetics Published Articles Ahead of Print on March 18, 2009.

Genetics, Vol. 182, 355-364, May 2009, Copyright © 2009
doi:10.1534/genetics.108.098277

Factors Affecting Accuracy From Genomic Selection in Populations Derived From Multiple Inbred Lines: A Barley Case Study

* Department of Agronomy, Iowa State University, Ames, Iowa 50011, † Department of Animal Science and Center for Integrated Animal Genomics, Iowa State University, Ames, Iowa 50011 and ‡ U.S. Department of Agriculture–Agricultural Research Service, Robert W. Holley Center for Agriculture and Health, Ithaca, New York 14853

1 Corresponding author: USDA-ARS, Robert W. Holley Center for Agriculture and Health, Ithaca, NY 14853-2901.
E-mail: jean-luc.jannink@ars.usda.gov

We compared the accuracies of four genomic-selection prediction methods as affected by marker density, level of linkage disequilibrium (LD), quantitative trait locus (QTL) number, sample size, and level of replication in populations generated from multiple inbred lines. Marker data on 42 two-row spring barley inbred lines were used to simulate high and low LD populations from multiple inbred line crosses: the first included many small full-sib families and the second was derived from five generations of random mating. True breeding values (TBV) were simulated on the basis of 20 or 80 additive QTL. Methods used to derive genomic estimated breeding values (GEBV) were random regression best linear unbiased prediction (RR–BLUP), Bayes-B, a Bayesian shrinkage regression method, and BLUP from a mixed model analysis using a relationship matrix calculated from marker data. Using the best methods, accuracies of GEBV were comparable to accuracies from phenotype for predicting TBV without requiring the time and expense of field evaluation. We identified a trade-off between a method's ability to capture marker-QTL LD vs. marker-based relatedness of individuals. The Bayesian shrinkage regression method primarily captured LD, the BLUP methods captured relationships, while Bayes-B captured both. Under most of the study scenarios, mixed-model analysis using a marker-derived relationship matrix (BLUP) was more accurate than methods that directly estimated marker effects, suggesting that relationship information was more valuable than LD information. When markers were in strong LD with large-effect QTL, or when predictions were made on individuals several generations removed from the training data set, however, the ranking of method performance was reversed and BLUP had the lowest accuracy.
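As a rough illustration of one of the methods named above, the following is a minimal sketch of RR–BLUP written in its relationship-style (dual) form, where predictions depend on the markers only through the cross-products between lines. It is not the authors' implementation. Everything in it is an assumption made for illustration: the function names, the independent ±1 marker simulation (which carries no realistic LD or family structure), the QTL being a subset of the markers, the heritability of about 0.5, and the shrinkage parameter `lam`. Accuracy is taken here as the correlation between predicted values and TBV in lines that were not in the training set.

```python
import random

def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for r in reversed(range(n)):
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x

def dot(u, v):
    return sum(p * q for p, q in zip(u, v))

def rr_blup_predict(z_train, y_train, z_new, lam):
    """Ridge-regression GEBV for new lines from marker cross-products.

    Solves (Z Z' + lam I) alpha = y - mean(y) on the training lines, then
    predicts mean(y) + Z_new Z' alpha. This equals estimating one shrunken
    effect per marker and summing those effects over each new genotype.
    """
    mean_y = sum(y_train) / len(y_train)
    y_c = [y - mean_y for y in y_train]
    k = [[dot(zi, zj) + (lam if i == j else 0.0)
          for j, zj in enumerate(z_train)]
         for i, zi in enumerate(z_train)]
    alpha = solve(k, y_c)
    return [mean_y + sum(dot(z, zt) * a for zt, a in zip(z_train, alpha))
            for z in z_new]

def pearson(x, y):
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5

def demo(seed=1, n_train=60, n_val=40, n_markers=100, n_qtl=20):
    """Toy simulation (all settings hypothetical, not the study design)."""
    rng = random.Random(seed)
    n = n_train + n_val
    # Inbred lines are homozygous, so each marker is coded -1 or +1.
    z = [[rng.choice((-1.0, 1.0)) for _ in range(n_markers)] for _ in range(n)]
    # Assumption: additive QTL are a random subset of the markers.
    effects = {j: rng.gauss(0.0, 1.0) for j in rng.sample(range(n_markers), n_qtl)}
    tbv = [sum(e * row[j] for j, e in effects.items()) for row in z]
    # Noise SD equal to the genetic SD gives heritability of roughly 0.5.
    sd_g = sum(e * e for e in effects.values()) ** 0.5
    pheno = [g + rng.gauss(0.0, sd_g) for g in tbv]
    # lam = n_markers matches equal marker variances at heritability 0.5.
    gebv = rr_blup_predict(z[:n_train], pheno[:n_train], z[n_train:],
                           lam=float(n_markers))
    acc_gebv = pearson(gebv, tbv[n_train:])
    acc_pheno = pearson(pheno[n_train:], tbv[n_train:])
    return acc_gebv, acc_pheno

if __name__ == "__main__":
    acc_gebv, acc_pheno = demo()
    print("accuracy of GEBV (no phenotypes on validation lines):", acc_gebv)
    print("accuracy of own phenotype on validation lines:", acc_pheno)
```

The two numbers returned by `demo` mirror the comparison the abstract draws between GEBV accuracy and accuracy from phenotype, although a toy simulation with unlinked markers says nothing about how the methods rank under the LD and family structures studied here.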



