- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Related articles in Genetics
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Churchill, G. A.
- Articles by Doerge, R. W.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Churchill, G. A.
- Articles by Doerge, R. W.
Genetics, Vol. 178, 609-610, January 2008, Copyright © 2008
doi:10.1534/genetics.107.074609
Naive Application of Permutation Testing Leads to Inflated Type I Error Rates
G. A. Churchill* and
R. W. Doerge
,1
* The Jackson Laboratory, Bar Harbor, Maine 04609 and
Department of Statistics, Purdue University, West Lafayette, Indiana 47907
1 Corresponding author: 150 N. University St., Purdue University, W. Lafayette, IN 47907.
E-mail: doerge{at}purdue.edu
Failure to account for family structure within populations or in complex mating designs via uninformed applications of permutation testing will lead to inflated type I error rates. Careful consideration of the design factors is essential since some situations allow several valid permutation strategies, and the choice that maximizes statistical power will not always be intuitive.
WHEN any statistical method is applied incorrectly, misleading conclusions are to be expected. With this in mind, we are motivated by both the application of a simple permutation test and the message conveyed by ZOU et al. (2006) in which the performance of a permutation test is assessed. Zou et al. address a problem in QTL mapping analysis of recombinant inbred lines (RILs), which has been previously addressed by BELKNAP (1998), concerning the use of strain means vs. individual level data models. Although the essentially equivalent performance of these two models is well known anecdotally, Zou et al. focus on the performance of a simple permutation application which unfortunately is applied incorrectly in the context of the full data model.
The concept of permutation, as proposed by FISHER (1935) and as applied to QTL mapping by CHURCHILL and DOERGE (1994), relies on exchangeability. In simple experimental designs, such as an intercross or a backcross mapping population, the individual units can safely be assumed to be exchangeable. In more complex designs it is often, but not always, possible to identify exchangeable units and thus to construct a permutation test. It is important to realize that several other designs exist in which a simple permutation test has been or could readily be misapplied, for example, the advanced intercross (AI) and heterogeneous stock (HS) breeding designs. In both the AI and HS designs, the animals that are assayed may number in the hundreds or even thousands, but these animals are often the progeny of a penultimate generation consisting of a much smaller number of lineages. Failure to account for the family structure in these populations by naive application of permutation testing will lead to inflated type I error rates.
Permutation tests require relatively few assumptions and can be applied in a wide variety of settings. However, they are not without potential pitfalls. Correlation structure, whether it is known from the design or hidden due to unaccounted factors, can produce misleading results. The optimal choice of permutation strategy requires careful consideration of the experimental design. Design factors may be fixed or random, nested or crossed, and it is these features that determine which strategy should be used. To construct a permutation test, one must decide which units are to be permuted, whether the permutations should be restricted, and whether it is best to permute raw data or residuals. The implications of these choices have been exhaustively characterized in all combinations by ANDERSON and TER BRAAK (2003). Their results sometimes run counter to our intuitions. In general they find that permutation of residuals under a reduced model has the best power while still controlling type I errors. However, this does not apply as a universal recommendation.
Using the notation of ZOU et al. (2006) with the explicit addition of the random line effect, Bi, the ANOVA model for the RI line experiment considered by Zou et al. can be written as
![]() |
In summary, there are many ways to construct a permutation test. Careful consideration of the nature of the design factors is essential to making the correct choice. In some cases there are several valid permutation strategies and the choice that maximizes power will not always be intuitively obvious.
ANDERSON, M. J., and C. J. F. TER BRAAK, 2003 Permutation tests for multi-factorial analysis of variance. J. Statist. Comput. Simul. 73(2): 85–113.
BELKNAP, J. K., 1998 Effect of within-strain sample size on QTL detection and mapping using recombinant inbred mouse strains. Behav. Genet. 28(1): 29–38.[CrossRef][Medline]
CHURCHILL, G. A., and R. W. DOERGE, 1994 Empirical threshold values for quantitative trait mapping. Genetics 138: 963–971.[Abstract]
FISHER, R. A., 1935 The Design of Experiments, Ed. 3. Oliver & Boyd, London.
ZOU, F., Z. XU and T. VISION, 2006 Assessing the significance of quantitative trait loci in replicable mapping populations. Genetics 174: 1063–1068.
Communicating editor: L. MCINTYRE
Related articles in Genetics:
ISSUE HIGHLIGHTS
Genetics 2008 178: NP.
- THIS ARTICLE
-
Abstract
- Full Text (PDF)
- Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Related articles in Genetics
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Churchill, G. A.
- Articles by Doerge, R. W.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Churchill, G. A.
- Articles by Doerge, R. W.
