Abstract
Intense structuring of plant breeding populations challenges the design of the training set (TS) in genomic selection (GS). An important open question is how the TS should be constructed from multiple related or unrelated small biparental families to predict progeny from individual crosses. Here, we used a set of five interconnected maize (Zea mays L.) populations of doubled-haploid (DH) lines derived from four parents to systematically investigate how the composition of the TS affects the prediction accuracy for lines from individual crosses. A total of 635 DH lines genotyped with 16,741 polymorphic SNPs were evaluated for five traits including Gibberella ear rot severity and three kernel yield component traits. The populations showed a genomic similarity pattern, which reflects the crossing scheme with a clear separation of full sibs, half sibs, and unrelated groups. Prediction accuracies within full-sib families of DH lines followed closely theoretical expectations, accounting for the influence of sample size and heritability of the trait. Prediction accuracies declined by 42% if full-sib DH lines were replaced by half-sib DH lines, but statistically significantly better results could be achieved if half-sib DH lines were available from both instead of only one parent of the validation population. Once both parents of the validation population were represented in the TS, including more crosses with a constant TS size did not increase accuracies. Unrelated crosses showing opposite linkage phases with the validation population resulted in negative or reduced prediction accuracies, if used alone or in combination with related families, respectively. We suggest identifying and excluding such crosses from the TS. Moreover, the observed variability among populations and traits suggests that these uncertainties must be taken into account in models optimizing the allocation of resources in GS.
GENOMIC prediction or selection, initially proposed and rapidly implemented in animal breeding (Meuwissen et al. 2001), is increasingly applied in plant breeding (Bernardo and Yu 2009; Lorenz et al. 2011; Morrell et al. 2011). However, with more investigations in plant breeding applications, new challenges emerge, mainly as the result of the greater possibilities of genetic manipulation and reproduction modes in plants compared to animals. Genomic predictions within diverse populations (Crossa et al. 2010; Riedelsheimer et al. 2012a,b; Windhausen et al. 2012) largely overlap with the scenarios in animal breeding. Predicting crossbred performance in animals also has some similarities with the prediction of maize hybrids from fully homozygous inbred lines drawn from genetically distant heterotic pools (Technow et al. 2012). Apart from using mixtures of close and distant relatives, little overlap with the field of animal breeding however exists in the case of multiple related or unrelated biparental families of inbred lines with a size ≤200. The latter situation is investigated in this study, because it covers the most relevant population structure encountered in practical plant breeding programs of line and hybrid cultivars.
In maize breeding, the doubled-haploid (DH) technology reduced dramatically the time necessary to obtain fully homozygous inbred lines (Prigge and Melchinger 2012), enabling the generation of a huge number of inbred lines every year. These DH lines typically originate from distinct crosses of related parents from one heterotic pool. Theoretical and experimental results showed that the mean of DH lines derived from an F1 cross can be reliably predicted from the mean of the two parent lines and this applies to both per se and testcross performance (Melchinger 1987; Melchinger et al. 2005). Traditional pedigree-based approaches are, however, unable to predict the performance of individual DH lines within a full-sib family, because all of them have the same expected relationship. With the availability of high-density SNP data, it is now possible to calculate the realized relationship that is distributed around the expected value due to Mendelian sampling (Hayes et al. 2009a; Hill and Weir 2011). Since differences in the mean performance of populations are in most cases known a priori based on information about the parents, we did not follow the approach of previous studies (Albrecht et al. 2011; Rutkoski et al. 2012; Schulz-Streeck et al. 2012), which analyzed genomic predictions of DH lines across multiple families. Instead, our focus was on genomic prediction of DH lines from individual biparental populations.
For implementing genomic prediction in plant breeding programs, it is crucial to know how the design of the training set (TS) affects the achievable accuracy for predicting a single biparental population. Given a set of related populations, two questions arise: (i) If the TS composes only a fraction of the DH lines from a biparental cross, what is the accuracy of genomic prediction of the remaining full-sib DH lines? (ii) How does the composition of the TS in terms of related or unrelated families affect the accuracy of genomic prediction of DH lines from a cross not included in the TS?
The first question has been investigated from a theoretical point of view (Daetwyler et al. 2008, 2010; Goddard and Hayes 2009; Meuwissen 2009) and with low-density marker panels also empirically in maize (Lorenzana and Bernardo 2009). It is still unclear, however, how well theoretical expectations fit observed prediction accuracies in biparental maize populations, especially concerning their robustness across several traits and populations. Here we compare theoretical with empirical results for five important traits that so far have not been investigated in the field of genomic prediction: Giberella ear rot (GER) severity caused by the fungus Fusarium graminearum, the content of one of its major mycotoxines, deoxynivalenol (DON), and three kernel yield component traits: ear length, kernel rows, and kernels per row.
The second question is important because knowing the predictability of progeny of crosses based on existing related material is assumed to be one of the key elements for a better allocation of resources in plant breeding. This is especially true considering the high costs of phenotyping compared to automatable genotyping platforms. If multiple related or unrelated populations are combined into one TS, analytical solutions of the expected accuracy are complicated by the intense substructuring in the TS. However, it is evident from previous studies that besides the heritability of the trait and the TS size, relatedness between the TS and the validation population (VP) and the extent to which linkage disequilibrium (LD) with quantitative trait loci (QTL) is exploited across populations are the main factors (Goddard et al. 2006; Habier et al. 2007, 2010; Gianola et al. 2009; Clark et al. 2012). Nevertheless, a detailed understanding how the TS composition influences the predictability of individual crosses is still lacking.
From an applied point of view, several specific questions arise concerning the construction of the TS to predict progeny of a cross with related material: Is it favorable to have more crosses included in the TS? Does it matter if related material is available only for one parent of the VP? Does a mixture of related and unrelated crosses in the TS improve the prediction accuracy or can it even have negative effects? The objective of this study was to answer these questions empirically by changing the composition of the TS in a precisely defined manner to gain insights into the factors influencing the genomic prediction accuracy of individual crosses. Thus, the specific objectives of this study were to assess and compare the following prediction problems, while keeping the size of the TS constant: (i) prediction using full-sib DH lines, (ii) prediction using one to four half-sib families of DH lines that have together either only one or both parents in common with the parents of the VP, and (iii) prediction using one unrelated population of DH lines alone or in combination with one to three half-sib families of DH lines.
Materials and Methods
Populations
The genetic material consisted of five DH populations described in detail by Martin et al. (2012). Four European flint inbred lines from the maize breeding program of the University of Hohenheim were used as parents to develop the populations (Figure 1). Two of the four lines were resistant to GER (P1 = UH007, P2 = UH006) and two were susceptible to GER (P3 = UH009, P4 = D152). The five populations could be clearly separated in a principal component analysis on SNP data with no overlap in the first three dimensions (Supporting Information, Figure S1).
Crossing scheme to obtain the five populations.
Field trials, trait recording, and phenotypic analysis
Field trials of all lines were conducted at two locations in Southwest Germany, using adjacent 10 × 20 α-lattice designs with two replications. Phenotyping of GER and DON was performed in 2008 and 2009 as described previously (Martin et al. 2011). Briefly, artificial silk channel inoculation was performed with an aggressive isolate of F. graminearum on six (2008) or eight (2009) primary ears 5–6 days after silk emergence. At physiological maturity, inoculated ears were dehusked manually and GER was visually recorded as the area covered by mycelium. After drying to an approximate moisture content of 14%, 100 g of ground grain was taken to record spectral data, using near-infrared spectroscopy (NIRS). DON concentration was determined by applying the NIRS calibration model developed by Bolduan et al. (2009). Primary ears from five noninoculated plants of each plot were used in 2009 to record yield component traits ear length, kernel rows, and kernels per row.
Standard lattice analyses of phenotypic traits were performed using the software PLABSTAT (Utz 2005) as described previously (Martin et al. 2011), including an arcsine square-root transformation of GER severity and a natural logarithm transformation of DON content. Heritability (h2) was estimated on a line-mean basis with 95% confidence intervals calculated according to Knapp and Bridges (1987). Both phenotypic and genotypic data can be found in File S1 and were available for the following numbers of lines: 131 (π1,2 = P1 × P2), 204 (π1,3 = P1 × P3), 161 (π1,4 = P1 × P4), 96 (π2,3 = P2 × P3), and 43 (π2,4 = P2 × P4).
Genotyping and genomic similarity measures
All inbred lines were genotyped using the Illumina MaizeSNP50 array containing 56,110 SNPs (Ganal et al. 2011). A total of 16,741 SNPs had a call rate of at least 0.95 and an across-populations minor allele frequency of at least 0.05 and hence were used for all further analysis. Imputation of the 0.3% missing alleles was performed using the expectation-maximization (EM) algorithm (Poland et al. 2012) implemented in the R-package rrBLUP (Endelman 2011). Identity-by-state (IBS) genetic similarities among all n lines were calculated aswhere m is the number of SNPs, J is a matrix of 1’s, and Z is a matrix of size n × m with elements zij reflecting the SNP genotype of the ith line at the jth marker locus, where zij is −1, 0, or +1 for the aa, Aa, and AA genotypes (Astle and Balding 2009).
As a second measure for genetic similarity, we calculated the Pearson correlation coefficients among rows of Z. This genomic correlation measure, denoted sGC, gives values ranging from −1 to +1 in the case of different or the same alleles for all SNPs, respectively, and captures the correlation pattern of the individual DH lines based on their SNP alleles.
To investigate the persistence of linkage phases across the DH populations, we considered consecutive marker pairs with ≈10-Mb distance. Following Technow et al. (2012), linkage phase similarity (sLP) between two populations was then calculated as the proportion of marker pairs with the same linkage phase (reflected by the sign of the r statistic) in both populations.
Genomic prediction model
Genomic prediction was performed using ridge-regression best linear unbiased prediction (RR-BLUP) (Piepho 2009) implemented in the R package rrBLUP (Endelman 2011). RR-BLUP assumes normally distributed genetic effects and has been found to be robust toward trait architecture in elite maize with a high level of LD (Riedelsheimer et al. 2012b). The model was fitted using the equivalent and computationally efficient genomic (G-)BLUP formulation (Hayes et al. 2009a) with a realized relationship matrix computed according to “method I” of VanRaden (2008).
Genomic prediction cross-validation schemes
Prediction accuracies were always calculated as the Pearson correlation between observed and predicted trait values divided by the square root of the heritability of the trait in the VP.
To answer the questions stated in the objectives of this article, we varied the composition of the TS in terms of (i) relatedness to the VP ranging from full sibs (FS) and half sibs (HS) to unrelated (UR), assuming that the parents are unrelated, (ii) number of families (1–4) from which genotypes for the TS are sampled, and (iii) number of parents of the VP that are also a parent in at least one of the families in the TS. A summary of all 11 TS compositions (1A–4B) is given in Table 1. It is important to note that the VP always consisted of the DH lines from one cross with variable size.
The size of the TS ranged from 12 to a maximum of 168 in steps of 12. If the TS consisted of multiple families, sampling was done with equal proportion from each family. To obtain stable mean estimates, 100 repetitions of the sampling procedure for constructing the TS were performed. If the TS could be constructed in multiple combinations of families, the average over all combinations is reported.
Comparing actual with expected within-population prediction accuracy
We used the formula suggested by Daetwyler et al. (2010) to compare the observed with the expected within-population prediction accuracy. It was suggested to approximate for highly polygenic quantitative traits, , where NT is the number of genotypes in the training set and Me is the effective number of loci. Following Meuwissen and Goddard (2010), Me was estimated as Me = 2NeL, where L is the genome size in morgans. L = 16.34 M was adopted from a previous linkage-mapping study, using population π1,2 (Martin et al. 2011). The effective population size (Ne) was calculated using the harmonic mean approximation for two generations (Hartl and Clark 1997, p. 291), resulting in Ne = 3.94, 3.96, 3.95, 3.92, and 3.82 for populations π1,2, π1,3, π1,4, π2,3, and π3,4, respectively.
Model-based comparison of TS compositions
A model-based approach was used to test for statistical differences among different TS compositions. The total TS size was fixed to 84, which was the maximum possible size for which balanced comparisons of TS compositions were possible. The observed prediction accuracies were modeled with the linear mixed modelwhere yijk is the obtained prediction accuracy averaged over all repetitions and possible combinations for constructing the TS; pi is the ith VP with i = 1, … , 5; tj is the jth trait with j = 1, … , 5; ck is the kth TS composition with k = 1, … , 11; and the residual error
. We assumed ck to be a fixed effect and populations and traits to be random draws from normal distributions; e.g.,
,
. Diagnostic plots were used to detect possible violations of the model assumptions. Variance components were estimated using restricted maximum likelihood (REML) and tested for being >0, using an exact restricted likelihood-ratio test as implemented in the R package RLRsim (Scheipl et al. 2007). All pairwise comparisons between compositions of TSs were tested using a two-sided t-test with a P-value adjustment using the Tukey method.
Results
Population characteristics and phenotypic traits
IBS similarity (sIBS) values among parents ranged from 0.324 to 0.522 (19.8% of possible range), and genomic correlation (sGC) values showed a broader range between −0.055 and −0.503 (27.9% of possible range, Table S1). Average IBS similarities of DH lines within populations correlated almost perfectly (r = 0.999) with the IBS similarities of the corresponding parents.
The populations showed an IBS similarity pattern, which reflects the crossing scheme with a clear separation of FS, HS, and UR relationship groups (Figure 2). IBS similarities among FS, HS, and UR relatedness groups showed expected differences in mean values (Figure S2). Since actual IBS sharing between two genotypes is the outcome of a stochastic process due to Mendelian sampling, expected variation in IBS values was observed with substantial overlap between UR and HS as well as HS and FS families. The average sIBS value for HS was close to the average between UR and FS (Figure S2). Across all pairs of lines, genomic correlations were strongly correlated with IBS similarities (r = 0.92) with clearly separable densities for FS, HS, and UR families.
Heatmap showing IBS similarities (red, above diagonal) and genomic correlations (blue, below diagonal) among all genotypes. The numbers in the blocks refer to average IBS similarities within and between populations. Average genomic correlations in comparisons with linkage phase similarities can be found in Table S2. FS, full sibs; HS, half sibs; UR, unrelated.
Linkage phase similarities (sLP) between populations were highly correlated (r = 0.91) with average sGC between populations (Table S2). Average sIBS values of pairs of populations were also highly correlated with average sLP (r = 0.93) and sGC values (r = 0.89).
Heritabilities were high for all traits and stable over most populations but showed in general large confidence intervals especially for yield component traits (Table 2). The average difference between the highest and lowest heritability for each trait was 0.18 with a coefficient of variation ranging from 6.7 to 11.1%.
Genotypic variances () of the traits showed considerable variation among populations and correlations with average sIBS ranged from −0.17 for kernels per row to −0.61 for DON content (Table 2).
Prediction using full sibs from the same cross (1A)
A nonlinear increase in prediction accuracy with increasing size of the TS was observed for all traits within all populations (Figure 3). The higher the initial accuracy obtained with the smallest TS size of 12, the faster was the gain in additional prediction accuracy, which leveled off with increasing TS size. At a TS size of 84, the difference among traits between highest and lowest prediction accuracy ranged from 0.20 for π2,3 to 0.39 for π1,4. At this TS size, Spearman’s rank correlation between accuracies and heritabilities was found to be 0.45 (P = 0.044). Within the evaluated range in TS size, only the prediction accuracies for kernel rows in π1,3 reached the level of phenotypic selection accuracy (square root of heritability). For all except the smallest population (π2,4), a good agreement between observed and theoretically expected within-population prediction accuracy (Figure 4) was found. However, some traits were systematically over- or underestimated.
(A–D) Prediction accuracies for within-population prediction using full sibs (1A). Shown are the average values for all traits obtained from 100 repetitions. The dashed lines indicate phenotypic accuracies calculated as the square root of the heritability.
(A and B) Observed vs. expected within-population prediction accuracy using full sibs (1A) averaged over (A) all populations except the smallest one (π2,4) and (B) traits. The expected prediction accuracy was calculated using the formula suggested by Daetwyler et al. (2010), assuming an average genome size L = 16.34 M and effective number of loci Me = 2NeL. Effective population size (Ne) was calculated using the harmonic mean approximation for two generations.
Comparison of TS compositions
Averaged over traits, all TS compositions without FS (2B–4B) showed initial accuracies and slopes markedly lower than with FS (Figure 5). General trends were observed when comparing scenarios 1A (1 FS) with 1B (1 HS) and 1C (1 UR). Prediction accuracies obtained with one HS family were for all traits, all populations, and all sample sizes lower than prediction with full sibs (Figure 5, Figure S4, Figure S5, Figure S6, Figure S7, and Figure S8). If the TS comprised one UR population, the lowest prediction accuracies for all TS sizes were observed in 12 of 20 possible trait × population combinations and in 7 combinations, this yielded even negative accuracies with a decreasing trend with increasing TS size (Figure S4B, Figure S4E, Figure S5C, Figure S6E, Figure S7C, Figure S7D, and Figure S8E). Most pronounced was this observation in population π2,4 for traits GER (Figure S4E) and kernels per row (Figure S8E).
(A–E) Prediction accuracies for individual validation populations (VP) depending on the composition of the training set (TS) and the total TS size. Shown are the mean values over all traits, repetitions, and possible combinations to construct the TS. The dashed lines indicates the TS size at which a model-based statistical comparison of TS compositions was performed. See Table 1 for details about the different TS compositions. Results for individual traits can be found in Figure S4, Figure S5, Figure S6, Figure S7, and Figure S8.
A good agreement was found with the assumptions underlying the model-based statistical testing of the influence of the TS composition on prediction accuracy (Figure S3). Quantile–quantile (Q-Q) plots showed one outlier for factor VP. Setting all factors of the model as random, REML-based estimated variance components revealed that factors VP and trait explained together 32.4%, whereas factor TS composition explained 41.7% (Table 3).
Least-squares estimates of prediction accuracies obtained with the different TS compositions with a fixed maximum possible TS size of 84 showed large 95% confidence intervals across traits and populations (Table 4) with the highest estimate obtained for full sibs. Pairwise comparisons between TS compositions were significant in several specific cases (Table 4 and Table S3). Using prediction with FS (1A) as a baseline, prediction using one HS family (1B) declined significantly from 0.59 to 0.25 and dropped significantly to almost zero (0.05) with one UR family. Having both parents of the VP represented in the TS resulted in significantly better prediction accuracies than having only one common parent between the VP and the crosses involved in the TS. With two populations (comparison 2A – 2B), the increase was 36.3% and with three populations (comparison 3B – 3C) it was 38.7%. Once both parents were represented in the TS, no significant differences were found if the TS consisted of more than two families. If an UR family was part of a TS involving otherwise one, two, or three HS families, no significant change in accuracy was observed. However, replacing a HS family with an UR family led to a significant reduction in prediction accuracy, if both parents of the VP were no longer represented in the TS (comparison 2A – 2C).
Discussion
The objective of this study was to systematically investigate the influence of the composition of the TS on the genomic prediction accuracies in an interconnected biparental population scenario. Independent of the TS composition, we focused on predicting DH lines from a single cross and not across different crosses. Reasoning for this lies in the well-established finding that the mean of DH lines derived from an F1 cross can be reliably predicted from the mean of the two parent lines (Melchinger 1987; Melchinger et al. 2005). In addition, it can be shown that population means estimated this way can already accurately predict DH lines across populations. Let yij be the performance of DH line j from cross i. Then, regressing estimated population means (ci) on yij yields the modelwhere µ is the overall mean and dij is the deviation from yij to ci. The total variation among all DH lines (
) can be decomposed into variation between crosses (
) and among DH lines within crosses (
). Assuming that the parents of the crosses are unrelated, quantitative genetic theory yields
and
, where
is the additive genetic variance of an outbred reference population. Assuming a purely additive model and that the parent means are perfectly known, the correlation between ci and yij can be derived as
.Thus, if population means estimated from the parents are used to predict DH lines from multiple crosses, a prediction accuracy of 0.71 can already be expected. This correlation is used in scenarios where prediction is evaluated across populations but not if DH lines are predicted within a single cross, which was the focus of this study. To disentangle the factors influencing prediction accuracies of progeny from a genotyped but not a phenotyped cross, TSs were designed in 11 defined ways. Inferences about the influence of the TS design were drawn from a model-based approach while keeping the total TS size constant.
Prediction using full sibs (1A)
For most trait × population combinations, full-sib prediction did not reach phenotypic accuracy measured as the square root of heritability (Figure 3). However, correlated response to selection also depends on selection cycle length and the intensity of selection (Falconer and Mackay 1996). The latter can be increased markedly by shifting resources from phenotyping toward production and genotyping of many more DH lines than under traditional phenotypic selection.
For full sibs, prediction accuracy was strongly dependent on TS size and moderately dependent on trait heritability. This was expected from theoretical simulations (Meuwissen 2009), as well as from empirical work in wheat (Heffner et al. 2011a,b), barley (Lorenz et al. 2012), and oats (Asoro et al. 2011). Within an individual biparental population, genomic prediction information can come from (i) variation in relatedness (i.e., Mendelian sampling) captured by the SNPs and (ii) cosegregation of loci linked to QTL (linkage information). Artifacts originating from population structure can be assumed to be absent, which is empirically supported by (i) the homogeneous data clouds of the populations in the principal components analysis (Figure S1) and (ii) the unstructured distribution of IBS values (Figure 2 and Figure S2). Hence, within populations, it should be possible to model the expected prediction accuracy () by population and trait parameters.
In dairy cattle, deterministic predictions could be successfully used as a guide for the required TS size (Hayes et al. 2009b). Our results support this also for within-population prediction using full sibs (Figure 4). One possible reason for the systematic over- and underestimations of traits lies in deviations from the polygenic genetic architecture, which is assumed in the applied deterministic formula. Indeed, genome partitioning of genetic variance indicates a less polygenic architecture for certain trait × population combinations (Figure S9). On the other side, the robustness of RR-BLUP toward the underlying number of QTL in a high LD scenario (Riedelsheimer et al. 2012b) suggests that differences in the number of underlying QTL play a less important role here. A more likely explanation lies in the imprecise heritability estimates reflected by their large confidence intervals (Table 4). This was especially true for the yield component traits that were evaluated in only 1 year and showed the strongest deviation from the expected values. This result emphasizes the need for high-quality phenotypic data for the TS for genomic prediction.
Nevertheless, for highly heritable traits, the high correlations between actual and observed values suggest that within biparental populations, prediction accuracies can be predicted well using information of (i) training set size, (ii) heritability, and (iii) effective number of loci. Given that good heritability estimates are available, it seems possible to use deterministic formulas to optimize resource allocation between TSs and VPs for prediction using full sibs.
Notably, the minimum training set sizes required to achieve reasonable prediction accuracies within biparental maize DH populations are by at least two orders of magnitude smaller than those required in animal breeding (Goddard and Hayes 2009). This results mostly because of the smaller genome size (in morgans) and the much smaller Ne of biparental populations compared to, e.g., cattle for which Ne ≈ 100 (Sørensen et al. 2005). Thus, results from simulation or empirical studies in animal breeding should therefore be examined carefully before drawing inferences for situations in plant breeding.
Influence of TS composition
Prediction within populations using full sibs is appealing because of (i) the high accuracies achievable with a moderate TS size and (ii) the good agreement of theoretical expectations with empirical results. However, in an ongoing breeding program, phenotypic and genotypic data are available for multiple related and/or unrelated populations often before the actual cross is being made. With the current body of theoretical knowledge in the field of genomic prediction, it is not possible to answer the question of how the composition of the TS affects the genomic prediction of other crosses, which is needed to help breeders in their decision-making process. We therefore followed an empirical model-based approach with statistical testing for significant differences between TS compositions, while keeping the TS size, and hence total resource capacity, constant. TS size had to be fixed to a rather small number because 84 was the maximum possible size for which balanced comparisons among all TS compositions were possible. However, relative differences between TS compositions for which larger TS sizes were possible remained relatively constant beyond this value because prediction accuracies began to level off at this TS size (Figure 5).
The strong decline in prediction accuracy from FS (1A) to HS (1B) and to UR (1C) was expected from theory (Habier et al. 2007) as well as from recent empirical studies emphasizing that the level of relatedness between genotypes in the TS and the VP has a strong impact on prediction accuracies (Habier et al. 2010; Clark et al. 2012). Having multiple related HS families, one important question is whether resources should be spent to increase the number of crosses or the number of DH lines per cross. Here, we found no statistical difference between different numbers of involved crosses once both parents of the VP are parents of at least one cross involved in the TS. In addition, increasing the number of HS families with one parent in common with the VP did not compensate for the lack of representation of the second parent of the VP. In a genomic prediction scenario with HS families, our results suggest that an optimal TS should (i) be of sufficient total size and (ii) have both parents of the VP represented. The number of families used to generate such a TS, however, does not seem to have a strong impact (Table 4). Nevertheless, real or simulated data sets with many more HS families should be analyzed to investigate this question in greater detail.
One interesting finding of our study was the negative accuracies obtained with UR families for certain traits, especially in case of π2,4 as the VP. We note here that the effect of an UR family could not be evaluated for population π1,2 because its UR family π3,4 is not present in our study. It is also important that the term “unrelated” was used to distinguish this type of cross from half sibs and full sibs. UR families still showed a substantial IBS similarity with the VP (Table S2) and were therefore not unrelated in a population genetic sense. However, since prediction accuracies became increasingly negative with increasing TS size, the UR family must have provided a negative prediction signal coming from opposite linkage phases with important QTL in the VP. Genome partitioning of genetic variance following Riedelsheimer et al. (2012b) suggests that important QTL for the measured traits are indeed especially present in population π2,4 (Figure S9). This also indicates that causal variants might not segregate in all DH populations. In addition, linkage phase similarity measured as sLP between population π2,4 and its UR family was among the lowest ones of all combinations and genomic correlation measured as sGC was negative (−0.233, Table S2). If such an UR family is combined with one HS family to a mixed TS, the negative predictive signal coming from the UR family could even counteract the positive signal coming from the HS family (decreasing curve of TS composition 2C compared to 1B in Figure S4E). Therefore it is recommended to exclude such families from the TS. This finding is also in agreement with a simulation study showing that increasing accuracies by combining different populations into a single TS requires persistence of marker–QTL LD across populations (de Roos et al. 2009).
The strong correlation (0.89 < r < 0.93) among sIBS, sGC, and sLP suggests that any of these measures can be used to estimate genetic similarity between populations to identify populations with negative predictive signal. However, we found the sGC measure appealing because of its simplicity, applicability to individual genotypes, and defined range of values that fall between −1 and +1. This measure might also be suitable in identifying outlier genotypes in the TS or the VP. In the case of monogenic or oligogenic genetic architecture with known positions of major QTL, a similarity measure that takes linkage phases in the vicinity of these QTL into account might be even more accurate than a simple genome-wide measure. Future simulation studies with large sets of related populations and known QTL positions and effects could shed more light on this issue.
Replacing a HS family with an UR family led to significant reductions in prediction accuracy only if one parent of the VP was no longer represented in the TS (comparison 2A – 2C); otherwise the difference was not significant. With a constant size of the TS, a drop in accuracy might be expected if the TS is diluted with unrelated genotypes, especially if they show reverse linkage phases. However, since all line crosses in our study originated from only four parents, the DH lines from every UR family were necessarily HS to both HS families included in the TS. For a single locus with additive effect a, letThen, it follows µAA – µBB = 2a2 + 2a4 – 2a3 = 2a1. Thus, with the assumption of no epistasis, additive effects in one population can be derived from a linear combination of the additive effects in the HS families of each parent and the UR family. In a general setting with more than four parents, the inclusion of an UR family might, in contrast to our results, reduce prediction accuracy even if both parents are represented. Careful investigation of the linkage phase similarities (or genomic correlations) between UR families and the VP should therefore be carried out before such families are included in the TS.
Influence of population and trait on TS composition
Systematic population and trait effects accompany the described contrasts among the TS compositions. Variability among traits was substantially higher than variability among VPs (Table 3) and both VP and trait effects accounted for 34.9%, which was of similar magnitude compared to the variation introduced due to different TS compositions (41.7%). This was not unexpected as previous studies also found large variability in prediction accuracy attributable to traits (corrected for their heritability) and populations (Ornella et al. 2012).
Although differences between TS compositions could be quantified and tested, the population and trait variability hampered a direct quantification of interchangeabilities, e.g., how many HS from one or more families are needed to obtain the same prediction accuracy as with a certain number of FS. For example, whereas 48 HS from two families were sufficient to replace 24 FS in the TS for predicting GER severity in population π1,4 (Figure S4C), 72 HS were needed for population π1,3 (Figure S4B), and 168 HS were barely enough in population π1,2 (Figure S4A). In addition, these numbers were in most cases not transferable to other traits. This variability among traits and populations has important implications for resource optimizations. If such calculations are based on only one cross or trait, results can be misleading. Therefore, any optimization calculation should be based on a large set of crosses and results for one trait should be extrapolated to another trait only with utmost caution. Further research should focus on elucidating in more detail the reasons underlying the variability of the prediction accuracy among traits and populations. Possible factors to be investigated with very large sets of simulated or real populations and traits include differences in genetic architecture with different linkage phases between TSs and VPs in regions harboring important QTL for the trait of interest.
Conclusions
From the analysis of our set of five interconnected biparental DH populations, we draw the following conclusions:
Within-population prediction is feasible for highly heritable traits with moderate training set sizes and agrees on average well with theoretical expectations.
Prediction accuracies decline strongly if FS are replaced by HS, but statistically significantly better results can be achieved if HS families are available from both instead of only one parent of the VP.
Once both parents are represented, it is not favorable to increase the number of crosses.
Unrelated families with opposite linkage phase similarities with the VP can have negative prediction accuracies or reduced prediction accuracies if combined with related material. Such crosses should be identified using linkage phase or genomic correlation measures and excluded from the TS.
The observed variability among populations and traits requires resource optimization that incorporates knowledge about the variability of parameters.
Acknowledgments
This research was funded by the German Federal Ministry of Education and Research within the Agro-ClustEr “Synbreed—Synergistic plant and animal breeding” (grant 0315528D) and by the Deutsche Forschungsgemeinschaft research grant ME 2260/6-1. Funding to J.L.J. and M.E.S. in support of this research came from a Bill and Melinda Gates Foundation grant “Genomic selection: the next frontier for rapid genetic gains in maize and wheat.”
Footnotes
Communicating editor: D. J. de Koning
- Received February 8, 2013.
- Accepted March 25, 2013.
- Copyright © 2013 by the Genetics Society of America