- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Email this article to a friend
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Piepho, H.-P.
- Articles by Gauch, H. G.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Piepho, H.-P.
- Articles by Gauch, H. G., , Jr.
Marker Pair Selection for Mapping Quantitative Trait Loci
Hans-Peter Piephoa and Hugh G. Gauch, Jr.ba Institut für Nutzpflanzenkunde, Universität Kassel, 37213 Witzenhausen, Germany
b Department of Plant Breeding, College of Agriculture and Life Sciences, Cornell University, Ithaca, New York 14583
Corresponding author: Hans-Peter Piepho, Institut für Nutzpflanzenkunde, Universität Kassel, Steinstrasse 19, 37213 Witzenhausen, Germany., piepho{at}wiz.uni-kassel.de (E-mail)
Communicating editor: C. HALEY
| ABSTRACT |
|---|
Mapping of quantitative trait loci (QTL) for backcross and F2 populations may be set up as a multiple linear regression problem, where marker types are the regressor variables. It has been shown previously that flanking markers absorb all information on isolated QTL. Therefore, selection of pairs of markers flanking QTL is useful as a direct approach to QTL detection. Alternatively, selected pairs of flanking markers can be used as cofactors in composite interval mapping (CIM). Overfitting is a serious problem, especially if the number of regressor variables is large. We suggest a procedure denoted as marker pair selection (MPS) that uses model selection criteria for multiple linear regression. Markers enter the model in pairs, which reduces the number of models to be considered, thus alleviating the problem of overfitting and increasing the chances of detecting QTL. MPS entails an exhaustive search per chromosome to maximize the chance of finding the best-fitting models. A simulation study is conducted to study the merits of different model selection criteria for MPS. On the basis of our results, we recommend the Schwarz Bayesian criterion (SBC) for use in practice.
PROCEDURES for detecting multiple quantitative trait loci (QTL) are of growing interest to plant breeders and geneticists. The currently most widely used methods are interval mapping (IM; ![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
A peculiarity of multiple regression for QTL mapping is that there is no single true model, because there is no fixed set of markers. If we drop a pair of flanking markers from the analysis, the dropped pair can be replaced by adjacent markers. Similarly, if a different marker system is used, marker loci will change, but still the flanking markers will absorb the QTL effects, leading to a different model conditional on the markers. Thus, the term "true model" has to be used with this peculiarity in mind.
We believe that multiple LR testing (equivalently F-testing if linear least squares is used) for model selection is problematic for several reasons. Most importantly, multiple likelihood-ratio (LR) tests without adjustments are known to tend to overfitting (![]()
= 5% level in a forward selection procedure can easily give a true significance level (false positive rate) >50% (![]()
One reaction to the problems connected with multiple LR testing is to consider a Bayesian framework (see, e.g., ![]()
![]()
![]()
![]()
![]()
![]()
Our suggested procedure is denoted as marker pair selection (MPS). Instead of implementing a standard model selection procedure, we exploit knowledge of the genetic mechanisms underlying the data. Our MPS procedure has three distinctive features: (i) markers are selected in adjacent pairs to increase the chance of selecting flanking markers while reducing the risk of selecting nonflanking markers; (ii) an exhaustive search per chromosome is used in place of simple forward selection, which increases the chance of finding the best-fitting model; and (iii) a model selection criterion such as SBC is employed to select the final model among a sequence of models.
| MATERIALS AND METHODS |
|---|
In this section, we describe the model for mapping QTL in a backcross population as well as the method for parameter estimation. We then develop an MPS algorithm on the basis of a modified forward selection procedure, which generates a sequence of models with an increasing number of markers. From this sequence, a best-fitting model may be selected according to one of the criteria given in Table 2. The performance of MPS is studied by means of simulation.
|
|
Model and parameter estimation:
Consider a backcross mLmLqqmRmR x MLmLQqMRmR, where Mi and mi (i = L, R) denote the left and right flanking marker alleles, while Q, q are the QTL alleles. The recombination frequency between left and right markers is denoted as
, while rL and rR are the recombination frequencies of the markers and the QTL. Let Yj be the phenotypic value (e.g., yield) of an individual in the backcross population. Then, conditional on the QTL genotype, we write
![]() |
(1) |
and
![]() |
(2) |
where
is an intercept term and
is the allele substitution effect for the QTL. Let xL = 1 when an individual of the backcross population has the genotype MLmL at the left flanking marker and xL = 0 when the genotype is mLmL. The dummy variable xR for the right flanking marker is similarly defined. Let g = 0 if the QTL genotype is qq and g = 1 if it is Qq. Assuming no crossover interference, the expectation of g conditional on the flanking markers and QTL position is as given in Table 1 (![]()
![]() |
(3) |
Note that this model is nonlinear in the parameters
and rL. It can be shown, however, that E(g|rL, xL, xR) is linear in xL and xR, i.e.,

with
![]() |
(4) |
(compare to ![]()
+ a
, ßL = 
, and ßR = 
, the expectation for Yj, conditional on the markers, can be written
![]() |
(5) |
Dividing ßL by ßR and rearranging shows that rL is a root of the quadratic
![]() |
(6) |
For rL
(0, 0.5), the only feasible solution is
![]() |
(7) |
Back substitution of this solution into ßL = 
yields
![]() |
(8) |
This shows that a single regression on the adjacent marker covariates xL and xR suffices to estimate
and rL. The model may be extended to cover more than one QTL in a straightforward manner. To estimate the QTL effects and position, we just apply (7) and (8) to the pairs of markers corresponding to the putative QTL in question. This implies the model
![]() |
(9) |
H is the number of QTL and ß0 =
+
hah
h, where ah and
h are the coefficient a and the genetic effect for the hth QTL. Application of this model for estimation of QTL effects and position requires QTL to be isolated; i.e., no pair of QTL shares a common flanking marker. For estimation in the case of nonisolated QTL see ![]()
Model selection criteria:
We use criteria in Table 2 to select the best-fitting model among candidate models. A very thorough and concise review of these criteria can be found in ![]()
![]()
![]()
Consistent criteria are designed for cases where the true model has low dimension and is assumed to be among the candidate models. A consistent criterion identifies the correct model asymptotically (as sample size increases) with probability one. Examples are SBC, HQ (![]()
![]()
![]()
In case there are more markers than observations, the full model is not estimable, and hence Mallows' Cp and GM are not applicable due to lack of an error variance estimate based on the full model. We might continue the forward selection until the error variance estimate stabilizes, but this raises the problem of determining when stabilization has taken place. Incidentally, sequential F-testing will not work for our procedure, since models in the sequence are not necessarily nested.
Subset selection of markers:
In what follows, we first point out the need to select adjacent pairs of markers rather than individual markers. We then make a few remarks regarding applicability of standard subset selection procedures to our problem. Finally, suggestions are given for modifications exploiting the biology of the problem at hand and the procedure is described in algorithmic form.
The effect and position of a QTL can be estimated from the regression coefficients of two flanking markers. A subset selection procedure can be used to find markers, which are likely to flank a QTL. If one marker is selected, we will also have to include one of the adjacent markers, because two flanking markers are needed in the estimation procedure. ![]()
![]()
Since we are in a multiple regression framework, standard procedures for subset selection could be used, such as forward selection, etc. (![]()
![]()
![]()
![]()
![]()
![]()
![]()
In this article, we propose a modified forward selection strategy based on an article by ![]()
![]()
ALGORITHM 1: Make the following definitions: ic is a counter for the number of marker pairs selected for the cth chromosome; k is the total number of marker pairs in the current model; C is the total number of chromosomes; RSSmin is the smallest residual sum of squares of sign-consistent models of order k found so far; Mk is the selected model of order k.
- For each chromosome set ic = 0. Set k = 0. Fit the model with just an intercept and record the residual sum of squares (RSStotal). Record this model as M0.
- Set k
k + 1. Set RSSmin = RSStotal (from step 1). For c = 1 to C do the following: From the current model drop the ic marker pairs from the cth chromosome (but keep all pairs from other chromosomes) and do an exhaustive search for models with ic + 1 marker pairs from the cth chromosome. Consider entry of a set of ic + 1 pairs of markers only if the resulting model is sign consistent. For a current model that is sign consistent, compute the residual sum of squares (RSScurrent). If RSScurrent < RSSmin then set RSSmin = RSScurrent, set cmin = c, and record the current model as Mk. - If in step 2 no sign-consistent model of order k can be found, stop. Else set ic
ic + 1 for chromosome cmin and go back to step 2. - Apply a model selection criterion to select the best-fitting model in the sequence of models Mk (k = 0, 1, 2 ... ) generated by steps 1, 2, and 3.
A remark regarding step 2 is in order. If a sign inconsistency is observed for a pair of markers to be entered, this suggests that the pair may not flank a QTL. Thus, such pairs should not be considered. Checking sign consistency upon entry does not, however, prevent a sign change in an entered pair later in the model-building process. If, while other pairs are being added, a sign change occurs in a pair from another chromosome, that pair may be a false positive, suggesting there is an increasing risk of detecting false positives and that the selection procedure should be terminated. Therefore, we stop the selection process when no sign-consistent model of order k is found.
We should point out that it is impossible that different orders of chromosomes lead to different results with Algorithm 1. This is because step 2 tries to add a pair of markers on each chromosome. In step 3 the algorithm then chooses the one chromosome for which addition of a pair gives the best fit. This will be the same chromosome, regardless of the order in which chromosomes are tried.
Note that in the model sequence obtained from Algorithm 1, the best model with k pairs does not necessarily contain all markers that are in the best model with k - 1 pairs or less. An important reason for allowing the implicit drop of one or two markers during each step of the model-building process is that there may be two adjacent QTL on the same chromosome with the same sign of the associated genetic effect. The pair of markers selected first is likely to lie between the two QTL. If left in the model, a ghost QTL will be detected. Allowing a pair to be dropped from the model during model building reduces the risk of detecting ghost QTL. For a chromosome with six markers and two QTL in the intervals (2, 3) and (5, 6) the model sequence may look like the hypothetical example shown in Table 3. The first pair tries to explain as much of the phenotypic variation as possible. However, only marker 3 is a flanking marker. Marker 4 is included because it accounts for the QTL in the interval (5, 6). In the next step, marker 4 is dropped while the flanking pairs (2, 3) and (5, 6) enter. SBC selects the four-marker model as fitting best (smallest value of criterion), while the full model fits slightly worse. Were simple forward selection applied to the above example, we would first select the pair (3, 4), and this would remain in the model throughout. Thus, there is no more flexibility to end up with the "true" marker model (2, 3, 5, 6), and a ghost QTL will be detected.
|
Simulation study:
We simulated BC1 populations for various settings. The number of chromosomes ranged from 12 to 20, while the number of QTL was between zero and five. Equal spacing of markers (10 or 20 cM) along a 100-cM chromosome and absence of interference were assumed. The number of crossovers per chromosome was simulated according to a Poisson distribution with parameter equal to the length of the chromosome in morgans, which is in accordance with Haldane's mapping function. For each setting, we performed 100 simulation runs. Assuming Poisson sampling, the standard error for an expected count µ (e.g., number of false positives) is (µ/100)0.5, e.g., 0.2 for µ = 4. Due to high positive correlation among statistics of the same type as computed for different selection criteria (number of false positives, etc.), the accuracy of comparisons was deemed reasonable. Algorithm 1 was used, allowing a maximum of two QTL per chromosome to limit the computational burden of the exhaustive search. This does not imply, however, that such limitation is needed in practical applications where there is only one sequence of models to be generated instead of 100 or more in simulations. A QTL was considered as detected when an estimated QTL position was within 15 cM of the true QTL position. While the 15-cM margin is somewhat arbitrary, rankings of model selection criteria according to different performance measures were rather insensitive to changes in the margin. If the hth QTL is detected,
h is the estimate of the hth QTL effect based on (8). Otherwise
h = 0. As an aggregate measure of bias we computed
![]() |
(10) |
where
h is an estimate of the hth QTL effect. For each model selection criterion we counted the number of correctly detected QTL as well as the number of false positives. From these counts we computed the fraction of correct detections among all detections. If for a given QTL there were more than one pair yielding a QTL position estimate within 15 cM of the true QTL position, only the pair with position estimate closest to the true QTL was considered as a detecting pair. All pairs of markers not detecting any QTL were considered as false positives.
We considered 14 examples with different QTL numbers, positions, and effect sizes (Table 4). Heritabilities were computed as described in the Appendix Example 1 is adapted from ![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
|
If markers are densely spaced, it may happen that two adjacent markers are perfectly correlated, so that the design matrix for a model that includes these two markers is not of full column rank. If two markers are perfectly correlated, there is no information as to the position and effect of a QTL between the markers and the approach of ![]()
![]()
| RESULTS |
|---|
The number of detected QTL is usually quite stable across criteria (Table 5). SBC tends to select the simplest models and thus the average number of correctly detected QTL is usually smaller than for other criteria, but the difference is very small most of the time, except for more extreme cases such as examples 9 and 10, where the difference is somewhat more pronounced. HQc and FPE4 also tend to select simpler models. In all cases investigated, SBC clearly has the smallest false positive rate (Table 6) and the most favorable percentage of correct detections among all detections (Table 7), often followed by HQc and FPE4. For these two types of counts, SBC is generally markedly superior to some other quite popular criteria such as s2 and Cp. For example, with example 2, SBC has an average number of 0.63 false detections, while s2 and Cp have 6.06 and 7.36 false detections, respectively. SBC is followed by HQc (1.07) and FPE4 (1.31) in this example. In the example with no QTL (example 12), SBC picks the correct model (model with no markers) 92% of the time, which is by far better than any other criterion. Only FPE4 and HQc come anywhere near this figure (59 and 74%, respectively). It should be noted that all criteria select from the same sequence of models. The difficult task is to strike the right balance between underfitting and overfitting, i.e., to find Ockham's Hill (![]()
|
|
|
Examples 9 and 10, which were chosen mainly to see how the criteria performing best in most cases would perform under circumstances very favorable to other criteria such as AIC, are extreme cases in many respects. The effects are all equal and not tapered as in many of the other examples. In contrast to other examples, SBC has a markedly smaller number of correct detections in example 9 (Table 5), so the fact that it still has the most favorable rate of correct detections relative to the total number of detections does not have an unambiguous interpretation. If we are more concerned about false positives, SBC is clearly favorable, while other criteria fare better regarding the number of correct detections.
Examples 13 and 14 have the same QTL as example 7, but are different in that the number of markers exceeds the number of individuals. Thus, there is a larger potential for overfitting. Note that the criteria Cp and GM are not applicable because the number of markers exceeds the sample size. While for both examples 13 and 14 the number of correct detections is about the same for all criteria, SBC is the clear winner in terms of the ratio of true detections among all detections (Table 7). For many of the other criteria, the number of false detections (Table 6) increases dramatically for examples 13 and 14 compared to example 7, showing that the problem of overfitting increases with the number of markers. SBC is the only criterion for which the number of false positives does not change markedly relative to example 7.
A comparison of examples 2, 3, 4, and 5 in Table 6 and Table 7 shows that all criteria select simpler models as
2 increases and as sample size decreases. Increasing the sample size from 200 (example 2) to 500 (example 5) results in a mild increase in the number of correct detections (Table 5) and in the proportion of correct detections among all detections (Table 7). Reducing marker spacing (compare examples 1 and 2 and examples 7 and 14) increased the number of false positives and reduced the proportion of correct detections, indicating that the risk of overfitting increases with the number of markers. Note, however, that in example 2 the number of correct detections is also increased relative to example 1.
Bias, as assessed by the overall measure SSE(
), is comparable for all selection criteria (Table 8). The only exception to this rule is SBC, which due to its tendency to select simpler models than other criteria has a notably smaller number of detected QTL and so has somewhat larger aggregate bias than other criteria in some examples, mainly due to undetected QTL. Bias decreases with smaller variance
2 (examples 24 and examples 9 and 10). This corroborates the finding of ![]()
![]()
![]()
![]()
|
We summarize the results as follows: Since the number of correct detections is usually quite constant across criteria, we think that the number of false positives and the fraction of correct detections are the most meaningful performance measures. From the overall picture of simulation results, SBC emerges as the best criterion, with FPE4 and HQc as the closest competitors. While SBC, HQc, and FPE4 tend to find slightly fewer QTL than other criteria, they do much better in avoiding the risk of detecting spurious QTL.
| DISCUSSION |
|---|
Features of MPS:
MPS is a new procedure that addresses the goal of finding as many QTL as possible, while limiting the risk of detecting spurious QTL. It contains three important building blocks specifically designed to achieve this goal: (i) selection of marker pairs, (ii) augmentation of a forward selection procedure by an exhaustive search per chromosome, and (iii) application of a model selection criterion to select the final model from a sequence of models. None of these building blocks is in itself new. The novelty here is the way in which these components are integrated into a single algorithm and how they are applied to QTL mapping, exploiting our knowledge of the underlying biology. The two main differences between MPS and CIM are the way in which cofactors are selected and how the final model is selected. MPS implicitly uses marker pairs of other QTL as cofactors, while conventional CIM can use a wide variety of ways in which cofactors are selected (forward selection, using the best five markers, using two markers per chromosome, etc.). MPS uses criteria such as SBC to select a model, while CIM uses multiple LR tests.
MPS can be used as a stand-alone procedure for detecting QTL and estimating their effect and position in BC1 populations, or equivalently for recombinant inbred and doubled haploid lines. It is also applicable for F2 populations, if one is interested only in additive effects, but not in dominance effects (![]()
The equivalence of IM/CIM as applied to adjacent marker pairs and the approach by ![]()
(0,
) and this minimum is smaller than the RSS at rL = 0 and rL =
. In our experience this case will be the rule in real applications. As pointed out by a referee, if IM/CIM maps a true QTL exactly at a marker, it is possible that with the approach of ![]()
Instead of an exhaustive search per chromosome as implemented in our Algorithm 1, we could adopt a simple forward selection procedure, possibly improved by some measures to exploit knowledge of the biology. For example, if at one step markers 2 and 3 have been selected on a chromosome, it is sensible to allow the pair (1, 4) to be selected in subsequent steps, providing pairs (1, 2) and (3, 4) lead to regression estimates of the same sign for a pair. This makes sure that the "correct" pairs can be selected in case there are two isolated QTL in the intervals (1, 2) and (3, 4) on the same chromosome, and at an earlier step pair (2, 3) was selected. Also, we could allow a selected pair of markers to move one position to the left or to the right as more marker pairs are being added. Thus, e.g., having selected the pair (2, 3) at some stage, we would allow this pair to be replaced by (1, 2) or (3, 4) later in the selection process, if this improves the fit. One can think of more modifications of simple forward selection. In fact, the modified algorithm may become fairly complicated and unrealistic to program when one attempts to cover all the possible QTL and marker configurations that may occur in reality. While our partially exhaustive search is computationally more demanding, it has the virtue of simplicity and at the same time covers many of the features lacking in a simple forward selection algorithm.
We observed that occasionally MPS selects more than one pair of markers for a large QTL, leading to overfitting of that QTL. We could augment our Algorithm 1 by a step that tries to reduce the model whenever there are two or more pairs of markers on the same chromosome. Our investigations (results not shown) suggest that this modification will slightly increase efficiency when in fact there is only one QTL on the chromosome, while it may deteriorate performance of the algorithm in case there is more than one QTL. We have not included such modifications in our simulation study for simplicity.
Comparison of MPS to other procedures:
In conventional CIM, cofactors are usually selected on the basis of simple forward selection, with markers entering the model individually rather than in pairs. Often, the selection is semiautomated or fully automated, with no check of whether or not the selected cofactors match QTL detected later in the CIM scan across the chromosome. Such a check would be useful as a guard against overfitting. It has been observed that inclusion of too many cofactors that are not associated with a QTL will reduce power to identify QTL relative to IM (![]()
![]()
A more detailed comparison of CIM and MPS would be rewarding, but is beyond the scope of this article. For CIM there are many parameters that would have to be considered in simulations: window size, definition of critical threshold, definition of when a LOD peak detects a QTL and when it must be considered as a "sub-peak" of another detecting peak, selection of cofactors, ML, or least squares, etc. In fact, CIM could be modified by taking up some or all of the ingredients that make up MPS, i.e., selecting markers in pairs, requiring sign consistency, using SBC or some other criterion instead of LOD thresholds, doing an exhaustive search per chromosome to select cofactors, etc. Thus, a detailed comparison will have to be fairly extensive and should include various blends of MPS and CIM. Such a study is left for future work.
Recently, ![]()
Further remarks on model selection:
Procedures for mapping QTL aim at finding as many true QTL as possible, while avoiding the risk of detecting spurious QTL. Significance testing as is commonly used for IM, CIM, and MIM is not necessarily the best strategy to achieve this goal (![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
![]()
Model selection criteria are based on a philosophy that is essentially different from that underlying significance testing (![]()
![]()
In this article we have not used computer-intensive methods of model selection, such as leave-d-out cross validation and bootstrapping (![]()
![]()
![]()
![]()
| ACKNOWLEDGMENTS |
|---|
We thank John Whittaker and three anonymous reviewers for helpful comments. This article was written while the first author was visiting the Department of Biometrics and the Department of Plant Breeding, College of Agriculture and Life Sciences, Cornell University, Ithaca, New York. Support of the Heisenberg Programm of the Deutsche Forschungsgemeinschaft is gratefully acknowledged.
Manuscript received January 27, 2000; Accepted for publication October 6, 2000.
| APPENDIX |
|---|
Let z = g1
1 + g2
2, where
1 and
2 are additive genetic effects of two QTL and g1 and g2 are coded 0 and 1 depending on the genotype at the QTL. For the nonrecombinant genotypes we have either g1 = g2 = 0 or g1 = g2 = 1. For the recombinant genotypes g1 = 0 and g2 = 1 or g1 = 1 and g2 = 0. Let r be the recombination fraction between the two QTL. It can be shown that
![]() |
(A1) |
(see ![]()
21 +
22). Thus, the contribution to the total genetic variance of the hth QTL is
, provided it is independent of all other QTL in the genome. The joint contribution of two linked QTL, which are independent of all other QTL, is given in (A1). Since in our simulations there are no more than two QTL in a linkage group, these results suffice to compute the total genetic variance and the heritability.
| LITERATURE CITED |
|---|
AKAIKE, H., 1973 Information theory and an extension of the maximum likelihood principle, pp. 267281 in 2nd International Symposium on Information Theory, edited by B. N. PETRIV and F. CSAKI. Aakademia Kiado, Budapest.
ALLEN, D. M., 1974 The relationship between variable selection and data augmentation and a method for prediction. Technometrics 16:125-127.
BEAVIS, W. D., 1994 The power and deceit of QTL experiments: lessons from comparative QTL studies, pp. 250266 in Report of the Forty-Ninth Annual Corn and Sorghum Research Conference, edited by D. B. WILKINSON. American Seed Trade Association, Washington, DC.
BHANSALI, R. J. and D. Y. DOWNHAM, 1977 Some properties of the order of an autoregressive model selected by a generalized Akaike's EPF criterion. Biometrika 64:547-551
BREIMAN, L. and D. FREEDMAN, 1983 How many variables should be entered in a regression equation? J. Am. Stat. Assoc. 78:131-136.
BUCKLAND, S. T., K. P. BURNHAM, and N. H. AUGUSTIN, 1997 Model selection: an integral part of inference. Biometrics 53:603-618.
BURNHAM, K. P., and D. R. ANDERSON, 1998 Model Selection and Inference. Springer, New York.
CHURCHILL, G. A. and R. W. DOERGE, 1994 Empirical threshold values for quantitative trait mapping. Genetics 138:963-971[Abstract].
DARVASI, A., A. WEINREB, V. MINKE, J. I. WELLER, and M. SOLLER, 1993 Detecting marker-QTL linkage and estimating QTL gene effect and map location using a saturated genetic map. Genetics 134:943-951[Abstract].
DOERGE, R. W. and G. A. CHURCHILL, 1996 Permutation tests for multiple loci affecting a quantitative character. Genetics 142:285-294[Abstract].
DRAPER, N. R., 1995 Assessment and propagation of model uncertainty. J. R. Stat. Soc. B 57:45-97.
DRAPER, N. R., and H. SMITH, 1998 Applied Regression Analysis. Wiley, New York.
DUPUIS, J. and D. SIEGMUND, 1999 Statistical methods for mapping quantitative trait loci from a dense set of markers. Genetics 151:373-386
EFROYMSON, M. A., 1960 Multiple regression analysis, pp. 191203 in Mathematical Methods for Digital Computers, edited by A. RALSTON and H. S. WILF. Wiley, New York.
GABRIEL, K. R., and F. C. PUN, 1979 Binary prediction of weather events with several predictors, pp. 248253 in 6th Conference on Probability and Statistics in Atmospheric Sciences. American Meteorological Society, Boston, MA.
GELFAND, A. E. and S. K. GHOSH, 1998 Model choice: a minimum posterior predictive loss approach. Biometrika 85:1-11
GEWEKE, J. and R. MEESE, 1981 Estimating regression models of finite but unknown order. Int. Econ. Rev. 22:55-70.
GOFFINET, B. and B. MANGIN, 1998 Comparing methods to detect more than one QTL on a chromosome. Theor. Appl. Genet. 96:628-633.
HALEY, C. S. and S. A. KNOTT, 1992 A simple regression method for mapping quantitative trait loci in line crosses using flanking markers. Heredity 69:315-324[Medline].
HANNAN, E. J. and B. G. QUINN, 1979 The determination of the order of an autoregression. J. R. Stat. Soc. B 41:190-195.
HJORTH, J. S. U., 1994 Computer Intensive Statistical Methods. Validation, Model Selection and Bootstrap. Chapman and Hall, London.
HURVITCH, C. M. and C.-L. TSAI, 1989 Regression and time series model selection in small samples. Biometrika 76:297-307
JANSEN, P. C., 1993 Interval mapping of multiple quantitative trait loci. Genetics 135:205-211[Abstract].
JANSEN, R. C. and P. STAM, 1994 High resolution of quantitative traits into multiple loci via interval mapping. Genetics 136:1447-1455[Abstract].
KAO, C. H., Z-B. ZENG, and R. D. TEASDALE, 1999 Multiple interval mapping for quantitative trait loci. Genetics 152:1203-1216
KEARSEY, M. J. and A. G. L. FARQUHAR, 1998 QTL analysis in plants: where are we now? Heredity 80:137-142.
LANDER, E. S. and D. BOTSTEIN, 1989 Mapping Mendelian factors underlying quantitative traits using RFLP linkage maps. Genetics 121:185-199
LANDER, E. S. and D. BOTSTEIN, 1994 Corrigendum. Genetics 136:705.
LEBRETON, C. M. and P. M. VISSCHER, 1998 Empirical nonparametric bootstrap strategies in quantitative trait loci mapping: conditioning on the genetic model. Genetics 148:525-535
MACKAY, D. J. K., 1992 Bayesian interpolation. Neural Comput. 4:415-447.
MALLOWS, C. L., 1973 Some comments on Cp. Technometrics 15:661-675.
MARTINEZ, O. and R. N. CURNOW, 1992 Estimating the locations and the sizes of the effects of quantitative trait loci using flanking markers. Theor. Appl. Genet. 85:480-488.
MCQUARRIE, A. D. R., and C.-L. TSAI, 1998 Regression and Time Series Model Selection. World Scientific Publishers, Singapore.
MELCHINGER, A. E., H. F. UTZ, and C. C. SCHÖN, 1998 Quantitative trait loci (QTL) mapping using different testers and independent population samples in maize reveals low power of QTL detection and large bias in estimates of QTL effects. Genetics 149:383-403
MILLER, A. J., 1990 Subset Selection in Regression. Chapman & Hall, London.
PIEPHO, H. P., 2000 Optimal marker density for interval mapping in a backcross population. Heredity 84:437-440.
REBAÏ, A., B. GOFFINET, and B. MANGIN, 1994 Approximate thresholds of interval mapping tests for QTL detection. Genetics 138:235-240[Abstract].
REBAÏ, A., B. GOFFINET, and B. MANGIN, 1995 Comparing power of different methods for QTL detection. Biometrics 51:87-99[Medline].
SARI-GORLA, M., T. CALINSKI, Z. KACZMAREK, and P. KRAJEWSKI, 1997 Detecting QTL x environment interaction in maize by a least squares interval mapping method. Heredity 78:146-157.
SCHWARZ, G., 1978 Estimating the dimension of a model. Ann. Stat. 6:461-464.
SHAO, J., 1996 Bootstrap model selection. J. Am. Stat. Assoc. 91:655-665.
SILLANPÄÄ, M. J. and E. ARJAS, 1998 Bayesian mapping of multiple quantitative trait loci from incomplete inbred line cross data. Genetics 148:1373-1388
SOUTHEY, B. R. and R. L. FERNANDO, 1998 Controlling the proportion of false positives among significant results in QTL detection. Proc. 6th World Congr. Genet. Appl. Livest. Prod. 26:221-224.
STAM, P., 1991 Some aspects of QTL analysis, pp. 2331 in Proceedings of the VIIIth Meeting of the Eucarpia Section Biometrics in Plant Breeding, edited by J. PESEK, M. HERMAN and J. HARTMANN. Brno, Czechoslovakia.
UTZ, H. F., and A. E. MELCHINGER, 1994 Comparison of different approaches to interval mapping of quantitative trait loci, pp. 195204 in Biometrics in Plant Breeding: Application of Molecular Markers, Proceedings of the Ninth Meeting of the EUCARPIA Section Biometrics in Plant Breeding, edited by J. W. VAN OOIJEN and J. JANSEN. CPRO-DLO, Wageningen, The Netherlands.
WEIR, B., 1996 Genetic Data Analysis II. Sinauer, Sunderland, MA.
WEISBERG, S., 1985 Applied Linear Regression. Wiley, New York.
WHITTAKER, J. C., R. THOMPSON, and P. M. VISSCHER, 1996 On the mapping of QTL by regression of phenotype on marker-type. Heredity 77:23-32.
ZENG, Z.-B., 1993 Theoretical basis of separation of multiple linked gene effects on mapping quantitative trait loci. Proc. Natl. Acad. Sci. USA 90:10972-10976
ZENG, Z.-B., 1994 Precision mapping of quantitative trait loci. Genetics 136:1457-1466[Abstract].
This article has been cited by other articles:
![]() |
H. P. Piepho Ridge Regression and Extensions for Genomewide Selection in Maize Crop Sci., June 26, 2009; 49(4): 1165 - 1176. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Zhang, H. Li, Z. Li, and J. Wang Interactions Between Markers Can Be Caused by the Dominance Effect of Quantitative Trait Loci Genetics, October 1, 2008; 180(2): 1177 - 1190. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Yang, W. Wu, and J. Zhu Mapping Interspecific Genetic Architecture in a Host-Parasite Interaction System Genetics, March 1, 2008; 178(3): 1737 - 1743. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Kusterer, H.-P. Piepho, H. F. Utz, C. C. Schon, J. Muminovic, R. C. Meyer, T. Altmann, and A. E. Melchinger Heterosis for Biomass-Related Traits in Arabidopsis Investigated by Quantitative Trait Loci Analysis of the Triple Testcross Design With Recombinant Inbred Lines Genetics, November 1, 2007; 177(3): 1839 - 1850. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. A. Robertson-Hoyt, C. E. Kleinschmidt, D. G. White, G. A. Payne, C. M. Maragos, and J. B. Holland Relationships of Resistance to Fusarium Ear Rot and Fumonisin Contamination with Agronomic Performance of Maize Crop Sci., September 1, 2007; 47(5): 1770 - 1778. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Yang, J. Zhu, and R. W. Williams Mapping the genetic architecture of complex traits in experimental populations Bioinformatics, June 15, 2007; 23(12): 1527 - 1536. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Stich, J. Yu, A. E. Melchinger, H.-P. Piepho, H. F. Utz, H. P. Maurer, and E. S. Buckler Power to Detect Higher-Order Epistatic Interactions in a Metabolic Pathway Using a New Mapping Strategy Genetics, May 1, 2007; 176(1): 563 - 570. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. J. Balint-Kurti, J. C. Zwonitzer, R. J. Wisser, M. L. Carson, M. A. Oropeza-Rosas, J. B. Holland, and S. J. Szalma Precise Mapping of Quantitative Trait Loci for Resistance to Southern Leaf Blight, Caused by Cochliobolus heterostrophus Race O, and Flowering Time Using Advanced Intercross Maize Lines Genetics, May 1, 2007; 176(1): 645 - 657. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Li, G. Ye, and J. Wang A Modified Algorithm for the Improvement of Composite Interval Mapping Genetics, January 1, 2007; 175(1): 361 - 374. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Baierl, M. Bogdan, F. Frommlet, and A. Futschik On Locating Multiple Interacting Quantitative Trait Loci in Intercross Designs Genetics, July 1, 2006; 173(3): 1693 - 1703. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. A. Robertson-Hoyt, M. P. Jines, P. J. Balint-Kurti, C. E. Kleinschmidt, D. G. White, G. A. Payne, C. M. Maragos, T. L. Molnar, and J. B. Holland QTL Mapping for Fusarium Ear Rot and Fumonisin Contamination Resistance in Two Maize Populations Crop Sci., June 20, 2006; 46(4): 1734 - 1743. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Mihaljevic, H. F. Utz, and A. E. Melchinger No Evidence for Epistasis in Hybrid and Per Se Performance of Elite European Flint Maize Inbreds from Generation Means and QTL Analyses Crop Sci., October 27, 2005; 45(6): 2605 - 2613. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Wang, Y.-M. Zhang, X. Li, G. L. Masinde, S. Mohan, D. J. Baylink, and S. Xu Bayesian Shrinkage Estimation of Quantitative Trait Loci Parameters Genetics, May 1, 2005; 170(1): 465 - 480. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Zhang, K. L. Montooth, M. T. Wells, A. G. Clark, and D. Zhang Mapping Multiple Quantitative Trait Loci by Bayesian Classification Genetics, April 1, 2005; 169(4): 2305 - 2318. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. J. Sillanpaa and M. Bhattacharjee Bayesian Association-Based Fine Mapping in Small Chromosomal Segments Genetics, January 1, 2005; 169(1): 427 - 439. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Bogdan, J. K. Ghosh, and R. W. Doerge Modifying the Schwarz Bayesian Information Criterion to Locate Multiple Interacting Quantitative Trait Loci Genetics, June 1, 2004; 167(2): 989 - 999. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Yi, V. George, and D. B. Allison Stochastic Search Variable Selection for Identifying Multiple Quantitative Trait Loci Genetics, July 1, 2003; 164(3): 1129 - 1138. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. T. G. Hwang and D. Nettleton Investigating the Probability of Sign Inconsistency in the Regression Coefficients of Markers Flanking Quantitative Trait Loci Genetics, April 1, 2002; 160(4): 1697 - 1705. [Abstract] [Full Text] [PDF] |
||||
- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Email this article to a friend
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via HighWire
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Piepho, H.-P.
- Articles by Gauch, H. G.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Piepho, H.-P.
- Articles by Gauch, H. G., , Jr.













