- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Moreau, L.
- Articles by Gallais, A.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Moreau, L.
- Articles by Gallais, A.
Marker-Assisted Selection Efficiency in Populations of Finite Size
Laurence Moreaua, Alain Charcosseta, Frédéric Hospitala, and André Gallaisa,ba Station de Génétique Végétale (Institut National de la Recherche Agronomique, Université Paris-Sud, Institut National Agronomique Paris-Grignon), Ferme du Moulon 91190 Gif Sur Yvette, France,
b Institut National Agronomique Paris-Grignon, 75231 Paris Cedex 05, France
Corresponding author: Laurence Moreau, Station de Génétique Végétale (INRA-UPS-INAPG), Ferme du Moulon, 91190 Gif-Sur-Yvette, France, moreau{at}moulon.inra.fr (E-mail).
Communicating editor: B. S. WEIR
| ABSTRACT |
|---|
The efficiency of marker-assisted selection (MAS) based on an index incorporating both phenotypic and molecular information is evaluated with an analytical approach that takes into account the size of the experiment. We consider the case of a population derived from a cross between two homozygous lines, which is commonly used in plant breeding, and we study the relative efficiency of MAS compared with selection based only on phenotype in the first cycle of selection. It is shown that the selection of the markers included in the index leads to an overestimation of the effects associated with these markers. Taking this bias into account, we study the influence of several parameters, including experiment size and heritability, on MAS efficiency. Even if MAS appears to be most interesting for low heritabilities, we point out the existence of an optimal heritability (~0.2) below which the low power of quantitative trait loci detection and the bias caused by the selection of markers reduce the efficiency. In this situation, increasing the power of detection by using a higher probability of type I error can improve MAS efficiency. This approach, validated by simulations, gives results that are generally consistent with those previously obtained by simulations using a more sophisticated biological model than ours. Thus, though developed from a simple genetic model, our approach may be a useful tool to optimize the experimental means for more complex genetic situations.
THE development of highly polymorphic molecular markers has opened a new era for genetics and selection. Most traits of economic importance are quantitative. The use of molecular markers enables one to identify and map quantitative trait loci (QTLs) that are involved in the variation of such traits. For the last six years, the opportunity to use markers in breeding programs to improve the efficiency of the selection of quantitative traits has received extensive attention. ![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Other published results on the efficiency of this method or related methods are based on simulations (![]()
![]()
![]()
![]()
![]()
![]()
![]()
Extending the preliminary analysis of ![]()
| THEORY |
|---|
Lande and Thompson's analytical approach when population size is infinite:
![]()
![]()
![]() |
(1) |
i is the predicted genetic value of the offspring of the individual i in a population assumed to be of nearly infinite size, Pi is the phenotype of the individual i or the average performance of its progeny if the trait is evaluated by progeny test of selfed or crossed origin (e.g., cross with a tester for hybrid breeding), Mi is the sum of the effects on the character associated with the markers and is called "molecular score," and bp and bm are the weight coefficients of the index. Only significant effects are included in Mi. Because the aim is to predict the genetic value of the offspring, only additive effects associated with markers have to be considered.
![]() |
(2) |
2A and
2P are, respectively, the additive genetic and phenotypic variances of the trait in the population to be selected and evaluated by progeny tests or per se value,
2M is the additive genetic variance associated with the markers, h2 is the narrow sense heritability of the trait in the testing system considered (the proportion of the total phenotypic variance caused by additive effects of all QTLs), R2P is the proportion of phenotypic variation associated with additive effects accounted for by markers, and R2A is the proportion of additive genetic variance associated with markers (R2P = h2R2A) .
Denoting Gi, the genetic value of the offspring of the individual i, we then have: Gi =
Ai, where Ai is the additive genetic value of i.
Assuming that
is normally distributed and that the selection is conducted on both sexes, the genetic advance (
GMAS) obtained with MAS after one generation is given by
![]() |
(3) |
Assuming that P is distributed normally, the genetic advance in the next generation under conventional phenotypic selection is
The relative efficiency (RE) of MAS compared with phenotypic selection can be defined as the ratio of the genetic advance under MAS to the genetic advance under phenotypic selection with the same intensity of selection
![]() |
(4) |
When the size of the population is infinite, there is no error in the estimation of the weight coefficients nor of the effects associated with markers. In that case, the relative efficiency of MAS depends only on the heritability of the trait and on the proportion of phenotypic variation associated with markers. When the size of the population is finite, the weight coefficients and the effects associated with markers are estimated with a possible error. This experimental error leads to a smaller efficiency than expected under the assumption that parameters are known. Moreover, as mentioned by ![]()
Genetic model used to study the case of a population of finite size:
Consider a reference population of infinite size derived from a cross between two inbred lines. Such populations are currently used to search for QTLs in plant species, and the most commonly used are F2, backcross progenies (BC), recombinant inbred lines (RIL), or doubled haploids (DH). Consider a normally distributed quantitative trait that is influenced by numerous (l) unlinked QTLs with no epistasis. Each QTL is supposed to be linked to a single marker. The observed rate of recombination between a QTL and its linked marker, r, is assumed to be the same for all the marker-QTL pairs. It is assumed that this situation provides a relevant approximation for several markers in the vicinity of each QTL, r being the smallest recombination rate between the QTL and the markers. With these assumptions, the additive genetic variance associated with markers is (1 - 2r)2
2A . The parameter m2 = (1 - 2r)2 is the fraction of the total additive genetic variance truly associated with markers. It is the maximum percentage of genetic variance that can be detected with this set of markers. In addition to the l markers that are linked with a QTL, Nm - l markers unlinked to any QTL are considered (Nm is the total number of markers). We also assume that all the markers are unlinked.
To simplify this approach, we will consider, in a first step, that all QTL effects are equal. Experimental results concerning QTL detection, however, show that the distribution of QTL effects is generally not uniform. Many authors (e.g., ![]()
![]()
Relative efficiency of MAS in a given experiment:
In a given MAS experiment, we consider N individuals randomly sampled from the reference population. Marker-QTL associations are detected in this sample by a simple linear regression of phenotypes on marker types, with a given probability
of type I error. In ![]()
p and
m). Because we only consider additive effects and populations derived from a cross between two inbred lines, the molecular score M is defined as
![]() |
(5) |
q is a dummy variable taking a value of -1 if the individual i has one parental marker type, 1 if i has the other parental marker type, and 0 if i is heterozygous. In the reference population, we assume that there is no segregation distortion. The expected frequency for each marker type is 0.5 for BC, RIL, and DH. For F2 populations, the frequencies are 0.25 for the parental marker types and 0.5 for the heterozygotes.
Like ![]()
![]()
^2q is the estimated additive variance accounted for by marker q. It should be noted that, in principle, the estimated effects associated with markers are not independent because they are estimated from the same sample. Nevertheless, the covariance between the estimated effects of unlinked markers should be negligible for large samples (N > 100). This assumption will be validated by simulations (see below). The percentage of phenotypic variance associated with the additive effects of markers (R2P ) can be estimated by the sum of the adjusted Rsquares (
2q ) at each marker significantly associated with a QTL with a probability
of type I error.
2q , is defined by
Then,
![]() |
(6) |
The use of the adjusted Rsquare leads to a smaller estimation of the percentage of variance that is accounted for by markers. Thus, it reduces the weight given to the markers in the index of selection.
Following APPENDIX A, Equation 4 becomes
![]() |
(7) |
is a parameter that depends only on the type of population considered (
= 1 for DH or RILs,
= 0.5 for F2 and BC). Equation 7 depends on the estimated effects associated with markers that are variable from one sample to another. Two samples of the same size can possibly lead to different estimated values and then different REs. The expected RE must then be evaluated over all the possible results that can be obtained from a population of N individuals.
Expected RE over all possible experiments of same size:
For a given experiment, the association between a given marker and a QTL is tested with Fisher's test. As developed in APPENDIX B, all the parameters estimated at a given marker are functions of the F statistics. Since only markers significantly associated with a QTL are taken into account in the selection index, the expectations of the estimated parameters are obtained by using a truncated F distribution: only F values that are equal or superior to a critical F value need to be considered. It results that the expectations of the estimated parameters are not equal to their true values, but are overestimated. Even if a marker is not linked to a QTL, the expectation of the estimated variance accounted for by this marker is not zero. As shown in APPENDIX B, the use of truncated F distributions allows us to obtain the expected RE over all the results that can be obtained after sampling N individuals from a given reference population.
Numerical applications:
The formulas described above show that only m2 (related to r), h2, l, N, Nm,
, and the QTL effect distribution affect the RE of MAS for a given population type. In the numerical applications, we suppose that the population is composed of doubled haploids. Three sizes of experiment (100, 300, or 500), five type I error risks
(1, 5, 10, 20, or 30%), different numbers of QTLs (five or 10), different QTL effects distributions (QTLs with equal effect or QTLs effects following an approximate geometric distribution), and 30 markers are considered. For these parameters, the relative efficiency of MAS is determined when m2 varies from 0 to 1 and when h2 varies from 0.05 to 1.
Validation of this approach by simulations:
In previous formulations, some assumptions were made concerning (1) the independence between the parameters estimated at different markers and (2) the expression of the expected efficiency. To validate these assumptions, simulations were performed with conditions as close as possible to those of the theoretical approach. We simulated a population of N = 300 DH (or any population where there are only two classes of genotypes at each locus). The narrow sense heritability of the trait of interest was h2, and the additive variance associated with markers was m2
2A . We considered a total of 30 unlinked markers. This was chosen to roughly correspond to 10 chromosomes with three nearly independent markers on each. Marker selection was made by a simple regression of phenotypes on marker types with a probability
of type I error. The marker effects and the weight coefficients of the selection index were estimated subsequently as described above. A second population with the same genetic parameters was simulated, and individuals were selected using the estimations made from the first population. One-tenth of the individuals were selected based on either (1) their index value (MAS) or (2) their phenotypic value (PHE). The relative efficiency is computed from Equation 4. For each set of parameters (N, m2, h2, and
), simulations were replicated 100 times. Over 100 simulations, the averages of the percentage of phenotypic variance accounted for by the markers (in the first population) and of the RE were compared with the values predicted by the analytical approach. Using the standard error, a confidence interval at the 5% level was determined for the average values obtained by simulations. When the analytical result was included in this interval, we concluded that the two approaches were not significantly different. In the analytical approach, the bias caused by the selection of the markers was taken into account. To investigate the importance of this bias, the analytical results obtained by considering that this bias could be neglected in the formulae (i.e., with no false detection and no overestimation of the effects of QTLs) were also given and compared with simulation results.
Comparisons between simulations and analytical results were made for 24 conditions [two
probabilities of type I error (5% and 20%), three heritabilities (0.15, 0.3, and 0.45), and four m2 parameters (0.30, 0.5, 0.7, and 0.9)].
To simplify the formulation of the expected RE of MAS, it was assumed in the analytic approach that selection is performed in the reference population, and not in the sample used to estimate marker-QTL associations, as would be the case in true experiments. To validate this assumption, simulations conducted with only one population instead of two were performed, and the results were compared with those obtained with two populations. Only minor differences were observed between results of simulations conducted with one or two populations (results not presented). The maximum difference in RE was ~0.1, but the differences were generally ~0.01. Hence, our approach seems to be a good approximation of the realistic situation.
| RESULTS |
|---|
Validation of the analytical approach by simulations:
The analytical results obtained with bias of selection taken into account or neglected are compared to the simulation results (Table 1 and Table 2). The estimation of the percentage of variance associated with markers obtained by the analytical approach is included in the confidence interval at the 5% level of significance of the average value found on 100 simulations (Table 1). Taking into account selection bias through truncated F distribution is, therefore, a valid way to predict the estimated percentage of variance associated with markers. For low m2 and heritabilities, the bias in the estimation of the variance associated with markers is important, the estimated value being at least twice the true value. Detections of false QTLs and overestimations of the effects accounted for by markers explain these differences. For the RE of MAS (Table 2), results of simulations are not significantly different from the analytical ones when selection bias is taken into account. Not considering selection bias leads to overestimating the actual relative efficiency of MAS. This overestimation is especially important for low heritabilities, low m2 and for high
. Simulation results therefore show that the bias caused by the selection of the markers can be important. Since this bias cannot be easily corrected in true experiments, it is important to consider it in the evaluation of the RE of MAS.
|
|
Influence of biological parameters (h2, l, and QTL effects distribution) on the RE of MAS:
We can distinguish two groups of parameters: those that depend on the biology of the trait (number and effects of the QTLs) and those that depend only on experimental conditions (the size N of the population, the distance between markers and QTLs, and the probability
of type I error). The heritability of the trait results from both the biology of the trait and experimental conditions because it can be increased by using replications of genotypes (in the case of plants) or performances of relatives to reduce the experimental error.
Figure 1 shows the domains of RE of MAS compared with selection based on phenotype in function of r and h2 for N = 100, 300, or 500, as well as for two numbers of QTL effects: 5 or 10 of equal effects. The different domains are separated by lines corresponding to REs varying from 1 to 2.75 by 0.25. It is seen that for a given N and with
= 5%, domains with the highest RE correspond to low heritabilities. When the heritability is high, genotypic values are well estimated by the phenotype, the weight given to the markers in the selection index is low, and MAS tends to be equivalent to phenotypic selection. Nevertheless, with an
risk level of 5%, the RE decreases for very low heritabilities (<0.15). At such heritabilities, the power of detection is small and the effects are estimated poorly. Since MAS can only be efficient if the number of detected QTLs is high enough, there is an optimal heritability that varies slightly with m2 but is around 0.150.2 for N = 300 and five QTLs of equal effects in the model.
|
The RE of MAS for given N and
values decreases as the number of QTLs increases (see the comparison between five QTLs vs. 10 QTLs of equal effects in Figure 1). When there are many QTLs, the individual effect (r 2q ) of a given QTL is small, so the power of detection becomes low. The RE also depends on the distribution of the QTL effects.
The RE obtained with the same number of QTLs, but three different QTL effect distributions are compared in Figure 2: (1) 10 QTLs, all with the same r 2q of 10%, (2) 10 QTLs with three having an individual r 2q of 15.37% and seven with an r 2q of 7.7%, and (3) 10 QTLs with one QTL having an r 2q of 33.3%, three QTLs with an r 2q of 16.8%, and six QTLs with an r 2q of 2.7%. Model 3 has been chosen to be close to a geometric distribution, as used by ![]()
= 5%, and m2 = 0.7) that distribution 3 leads to a better efficiency of MAS than distribution 2, which is slightly better than distribution 1 when the heritability is <0.3. Then, for a given number of QTLs, equal effects lead to a lower RE of MAS compared with an "L" distribution or geometric distribution, as assumed by ![]()
![]()
![]()
|
|
Influence of experimental parameters (N, m2, number of markers, and
) on RE:
Figure 1 shows that N is an important parameter. Obviously, the RE of MAS increases with N. When N is large, the power of detection and the accuracy of the estimation of the marker-associated effect are increased. Therefore, MAS seems to be interesting only for populations of >100 or 200 individuals. Increasing N is more important when the trait is controlled by a high number of QTLs; the efficiency domains with N = 300 and l = 5 are very close to those obtained with N = 600 and l = 10 (results are not presented) if we consider in both cases an equal percentage of markers linked with a real QTL (i.e., Nm = 30 for l = 5 and Nm = 60 for l = 10). This can be related to the fact that if each marker accounts for a small part of the phenotypic variance (i.e., if m2h2/l << 1), the noncentrality parameter of the F distribution is close to Nh2m2/l. Thus, with a given percentage of noninformative markers and a given probability of type I error, the noncentrality parameter appears to be the key parameter that explains the relative efficiency of MAS. This is no longer true for a same total number of markers because there are more noninformative markers when l = 5 than when l = 10. This leads to a lower relative efficiency when l = 5 and N = 300 than when l = 10 and N = 600 because of a higher risk of false QTL detections. Nevertheless, the efficiency domains obtained with a given population size, N, and a given number of QTLs, l, are approximately equal to the efficiency domains that would be obtained with 2l QTLs of equal effect and a population size of 2N.
Figure 1 shows that the RE increases as the distance between markers and QTLs decreases. Obviously, if the markers are the QTLs themselves (m2 = 1 or r = 0), selecting on markers is equivalent to selecting on QTLs. The marker-QTL distance in our model cannot be directly interpreted as a density of markers in a real genetic map because, in the latter case, markers that are linked on the same chromosome are not independent. If the marker density is not too high, then it can be assumed that the correlations between linked markers are not strong enough to greatly modify the results, and r can be roughly related to the density of the markers. If we assume the absence of interference, for the DH population, r can be connected with the distance d between markers and QTLs by the mapping function of ![]()
Increasing
is also a way to increase the power of detection, but it results in an increased risk of detecting false QTLs. This parameter therefore has contradictory effects on RE. Figure 3 (with m2 = 0.7 and N = 300) shows that the effect of
depends on the heritability. When the heritability is low, increasing
leads to a better RE. It is the opposite when the heritability is high. Thus, for low heritabilities, the gain in power of detection largely compensates the risk of false detections. As a consequence, the heritability optimum observed with
= 0.05% disappears for higher
values. Table 3, however, shows that the effect of
also depends on the population size and m2. If m2 is low (e.g., for instance 0.5) and if the population size is small (N = 100), RE decreases as
increases. In this situation, even when the heritability is low, the markers that are truly associated with QTLs account for only a minor part of the genetic variation, and the gain in the power of detection does not compensate the risk of false detections. When the size of the population is large (>500) and m2 is high, when
increases, the relative increase of RE tends to be smaller because the power of detection is high enough, even with low
. In true experiments, m2 should generally be >0.5 because of the marker density now available for many species, and the size of the population will often be <500 because of resource limitations; thus, the choice of
may be important to maximize the gain of MAS.
|
| DISCUSSION |
|---|
One of the aspects of this study is to take into account the bias caused by the selection of the markers included in the index. This bias affects the estimation of the weight coefficients of the selection index and also the estimation of the additive effect of each marker. In true experiments, it is difficult to avoid this bias. ![]()
![]()
It was shown that the RE of MAS depends on the genetics of the trait of interest, experimental characteristics, and options concerning data analyses. ![]()
![]()
We demonstrated that even if the RE is generally higher for low heritabilities, when the population size is finite, there is an optimal heritability below which the RE decreases. This was mentioned by ![]()
![]()
We showed that RE increases with population size. ![]()
![]()
![]()
![]()
We pointed out that MAS is more efficient when the distance between markers and QTLs is small. Such a result was also observed by ![]()
![]()
![]()
![]()
In our approach, marker-QTL associations are supposed to be detected by a simple regression. It is now acknowledged that the methods based on the simultaneous use of numerous markers (nearby markers or markers linked to other QTLs) can improve the power of detection of a given marker-QTL association and the precision of the estimated QTL effects (see ![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
In spite of the assumptions made, our analytical results are consistent with results of simulations conducted in this study, justifying the statistical approach. Moreover, our results are also generally consistent with the results obtained by means of simulations by other authors using more sophisticated biological models.
The problem of MAS profitability:
Our results show that MAS can be more efficient than selection based only on phenotype in a large range of situations, as long as the size of the population is at least 200, the heritability of the trait is between 0.05 and 0.5, and the markers are relatively close to the QTLs. While the expected RE of MAS is higher for low heritabilities (0.10.2), however, our simulations show that the frequency of experiments that lead to a worse genetic gain with MAS than with phenotypic selection (RE < 1) is higher (e.g., with five QTLs of equal effects, N = 300, m2 = 0.5,
= 5%, and h2 taking the values 0.15, 0.3, and 0.45, the number of simulations where RE < 1 are, respectively, 13, 9, and 5). This was studied in more detail by ![]()
Nevertheless, even if MAS is more efficient than phenotypic selection, this method is expensive because of the genotyping. Doing several replications of each genotype or using complementary information coming from relatives can be a way to increase the heritability and improve the efficiency of phenotypic selection. Situations can be found where additional replications may be less expensive than obtaining marker data and yet lead to the same genetic advance. For instance, with N = 300, m2 = 0.8, and 10 QTLs of equal effects, the maximum RE of MAS is obtained for a h2 of 0.15 and is ~1.54. In this case, phenotypic selection can provide the same genetic gain as MAS by using three replications of each genotype instead of one. If the genotyping is more expensive than adding two extra replications (which is presently the case for most traits and most markers), MAS is not profitable. New marker techniques based on PCR can reduce the cost of MAS. Our analytical approach can be used to predict the expected genetic gain for a large range of parameters and then to define the best strategy of allocating resources for a given cost of genotyping. This point is currently being investigated in our laboratory.
This problem of cost is complex because even in situations where MAS is not more profitable than other methods of selection in the first generation of selection, it allows one to select the individuals based on their marker types in the second generation. There is no need to evaluate their phenotype and no need to determine all the marker types; it is only necessary to evaluate the genotype at markers linked to QTLs. If the selection can be performed on immature individuals, the second selection cycle can be achieved in a single generation. This possibility of adding one selection cycle based only on markers was not considered in our analytical approach, but simulations (![]()
| ACKNOWLEDGMENTS |
|---|
We thank the anonymous reviewers for their helpful comments on an earlier version of this article.
Manuscript received January 21, 1997; Accepted for publication November 12, 1997.
| APPENDIX A |
|---|
The expected genetic gain of MAS is defined by Equation 3 in the text. The phenotype Pi of the individual i can be written as the sum of its additive genetic value, Ai, and a residual term, Ei, which includes an environmental error term and a genetic term corresponding to nonadditive effects
The genetic value of the offspring of i is equal to 1/2Ai.
In the case of a population of finite size, if the selection is made on the two sexes, it follows that
![]() |
(A1) |
The marker type
q is independent of E. The estimated additive effect âq associated with marker q has been estimated in the sample. It is a constant at the reference population level. Then,
l ,
q) = 0 except those involving a marker and its linked QTL. Denoting
= var(
q), the formula can then be simplified as
![]() |
(A2) |
The parameter
depends only on the population type:
= 1 for HD or RIL;
= 0.5 for F2 and
=
for BC. The variance of the molecular score is computed as
Because markers are independent, all cov(
q,
q') = 0. Then,
![]() |
(A3) |
Using Equation A2 and Equation A3 and Equation 4 in the text, Equation A1 leads to the formula in Equation 7 in the text.
| APPENDIX B |
|---|
For a given experiment, the association between a given marker and a QTL is tested with a Fisher test. If there is no QTL near the marker, then the statistic follows a central F distribution. The probability of declaring the effect significant (false positive) is equal to the
type I level risk. If there is a QTL near the marker, the statistic follows a noncentral F distribution. The probability of detecting it is the power of detection. Following the approach described by ![]()
![]()
> Fcrit
,C-1,N-1,0] where C - 1 is the number of independent estimated parameters (C - 1 = 1 for the simple regression), Fcrit
,C-1,N-1,0 is the critical value from a central F distribution used to test QTL effect with a type I error of
, and FC-1,N-1,
is a random variable from a noncentral F distribution (
0). Following ![]()
= (N - 1)
. The power of detection can then be deduced from the true effect accounted for by the marker.
Because all the markers are independent, among the lj markers truly linked to QTLs having the same effect (aj), we consider that the number of markers detected to be associated with a QTL follows a binomial law B(pj, lj), where pj is the power of detection. There are as many different binomial laws as the number of different QTL effects (z) included in the model. In the same way, among the markers unlinked with QTLs, the number of markers erroneously detected to be associated with a QTL follows a B(
, Nm-l) binomial law, where l =
is the number of markers truly linked to a QTL. Because all the markers are independent, we consider that all these binomial laws are independent. All the possible results of a QTL detection can be divided according to the number of true QTLs with the jth effect (ntj) and false QTLs (nf) detected. It results that
![]() |
(B1) |
In Equation 7, three terms depend on estimations coming from the experiment 
2q, 
, and
âq
. Since
![]() |
(B2) |
The expected relative efficiency can be approximated from a Taylor series by
![]() |
(B3) |
To test the presence of a QTL near a marker, Fisher's test is performed. The value of the statistic, F, is connected with the estimated effects associated with markers
![]() |
(B4) |
![]() |
(B5) |
(if we suppose that the sign of the effect associated with a marker is always well estimated, i.e., aq and âq have the same sign).
The value of the estimated effect at a given marker varies from one experiment to the other but its expected value over all the possible experiments equals the true effect because the estimator is unbiased. Nevertheless, only markers with a significant effect are introduced in the selection index. As mentioned by ![]()
![]()
2q and
are functions of F noted
2q(F) and
(F) . Then, noting g(F ), the density function of an F distribution (a central distribution if there is no QTL near the marker or a noncentral distribution if the marker is linked to a QTL),
![]() |
(B6) |
Because the expected effect associated with a marker, considering truncated F distribution, is not null, we can associate an expected effect for false-positive markers. This expected effect because all the more important as
becomes smaller. For the same reason, the effect of a marker that is truly linked to a QTL is overestimated. The overestimation becomes all the bigger as the power of detection (and
) becomes smaller.
![]() |
(B7) |
| LITERATURE CITED |
|---|
BEAVIS, W. D., 1994 The power and deceit of QTL experiments: lessons from comparative QTL studies. Proceedings of the 49th Annual Corn and Sorghum Research Conference. American Seed Trade Association, 49: 250265.
CHARCOSSET, A. and A. GALLAIS, 1996 Estimation of the contribution of quantitative trait loci (QTL) to the variance of a quantitative trait by means of genetic markers. Theor. Appl. Genet. 93:1193-1201.
EDWARDS, M. D., T. HELENTJARIS, S. WRIGHT, and C. W. STUBER, 1992 Molecular-marker-facilitated investigations of quantitative trait loci in maize. 4. Analysis based on genome saturation with isozyme and restriction fragment length polymorphism markers. Theor. Appl. Genet. 83:765-774.
EDWARDS, M. D. and N. J. PAGE, 1994 Evaluation of marker-assisted selection through computer simulation. Theor. Appl. Genet. 88:376-382.
FREI, O., C. STUBER, and M. GOODMAN, 1986 Y ield manipulation from selection on allozyme genotypes in a composite of elite corn lines. Crop Sci. 26:917-921
GALLAIS, A., and A. CHARCOSSET, 1994 Efficiency of marker-assisted selection, pp. 9198 in Biometrics in Plant Breeding: Applications of Molecular Markers. Proceedings from the Ninth Meeting of the EUCARPIA Section Biometrics in Plant Breeding, edited by J. W. VAN OOIJEN and J. JANSEN. Wageningen, The Netherlands.
GIMELFARB, A. and R. LANDE, 1994a Simulation of marker-assisted selection in hybrid populations. Genet. Res. 63:39-47[Medline].
GIMELFARB, A. and R. LANDE, 1994b Simulation of marker-assisted selection for non-additive traits. Genet. Res. 64:127-136[Medline].
GIMELFARB, A. and R. LANDE, 1995 Marker-assisted selection and marker-QTL associations in hybrid populations. Theor. Appl. Genet. 91:522-528.
HALDANE, J. B. S., 1919 The combination of linkage values and the calculation of distance between loci of linked factors. J. Genet. 8:299-309.
HOSPITAL, F., L. MOREAU, F. LACOUDRE, A. CHARCOSSET, and A. GALLAIS, 1997 More on the efficiency of marker-assisted selection. Theor. Appl. Genet. 95:1181-1189.
KNAPP, S. J., W. W. STROUP, and W. M. ROSS, 1985 Exact confidence intervals for heritability on a progeny mean basis. Crop Sci. 25:192-194
KNAPP, S. J. and W. C. BRIDGES, 1990 Using molecular markers to estimate quantitative trait locus parameters: power and genetic variances for unreplicated and replicated progeny. Genetics 126:769-777[Abstract].
LANDE, R. and R. THOMPSON, 1990 Efficiency of marker-assisted selection in the improvement of quantitative traits. Genetics 124:743-756[Abstract].
LANDER, E. S. and D. BOTSTEIN, 1989 Mapping mendelian factors underlying quantitative traits using RFLP linkage maps. Genetics 124:185-199.
JANSEN, R. C. and P. STAM, 1994 High resolution of quantitative traits into multiple loci via interval mapping. Genetics 136:1447-1455[Abstract].
PATERSON, A. H., S. DAMON, J. D. HEWITT, D. ZAMIR, and H. D. RABINOWITCH et al., 1991 Mendelian factors underlying quantitative traits in tomato: comparison across species, generations and environments. Genetics 127:181-197[Abstract].
SOLLER, M. and T. BRODY, 1976 On the power of experimental designs for the detection of linkage between marker and quantitative loci in crosses between inbred lines. Theor. Appl. Genet. 47:35-39.
STROMBERG, L. D., J. W. DUDLEY, and G. K. RUFENER, 1994 Comparing conventional early generation selection with molecular marker-assisted selection in Maize. Crop Sci. 34:1221-1225
STUBER, C., M. GOODMAN, and R. MOLL, 1982 Improvement of yield and ear number resulting from selection at allozyme loci in maize population. Crop Sci. 22:737-740
STUBER, C., and M. D. EDWARDS, 1986 Genotypic selection for improvement of quantitative traits in corn using molecular marker loci. Proceedings of the 41st Annual Corn and Sorghum Research Conference, American Seed Trade Association, 41: 7083.
STUBER, C., and P. SISCO, 1992 Marker-facilitated transfer of QTL alleles between elite inbred lines and responses in hybrids, Proceedings of the 46th Annual Corn and Sorghum Research Conference, American Seed Trade Association, 46: 104113.
WHITTAKER, J. C., R. N. CURNOW, C. S. HALEY, and R. THOMPSON, 1995 Using marker-maps in marker-assisted selection. Genet. Res. 66:255-265[Medline].
ZENG, Z. B., 1994 Precision mapping of quantitative trait loci. Genetics 136:1457-1468[Abstract].
ZHANG, W. and C. SMITH, 1992 Computer simulation of marker-assisted selection utilizing linkage disequilibrium. Theor. Appl. Genet. 83:813-820.
ZHANG, W. and C. SMITH, 1993 Simulation of marker-assisted selection utilizing linkage disequilibrium: the effect of several additional factors. Theor. Appl. Genet. 86:492-496.
This article has been cited by other articles:
![]() |
K. K. Jena and D. J. Mackill Molecular Markers and Their Use in Marker-Assisted Selection in Rice Crop Sci., July 1, 2008; 48(4): 1266 - 1276. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Zhong and J.-L. Jannink Using Quantitative Trait Loci Results to Discriminate Among Crosses on the Basis of Their Progeny Mean and Variance Genetics, September 1, 2007; 177(1): 567 - 576. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. I |





















