Originally published as Genetics Published Articles Ahead of Print on February 4, 2007.

Genetics, Vol. 175, 1999-2008, April 2007, Copyright © 2007
doi:10.1534/genetics.106.067868

Prospects for Association Mapping in Classical Inbred Mouse Strains

Laboratory of Genetics, University of Wisconsin, Madison, Wisconsin 53706

1 Corresponding author: 2428 Genetics/Biotechnology, 425-G Henry Mall, University of Wisconsin, Madison, WI 53706.
E-mail: payseur{at}wisc.edu

Manuscript received November 5, 2006. Accepted for publication January 23, 2007.

ABSTRACT

The collection of classical inbred mouse strains displays heritable variation in a large number of complex traits. Many generations of historical recombination have contributed to the panel of classical strain genomes, raising the possibility that quantitative trait loci could be located with high resolution by correlating strain genotypes and phenotypes. Although this association mapping framework has been successful in several empirical applications, its expected performance remains unclear. We used computer simulations based on a publicly available, dense single-nucleotide polymorphism (SNP) map to measure the power and false-positive rate of association mapping on a genomic scale across 30 commonly used classical inbred strains. Expected power is (i) often low for phenotypic effect sizes that are realistic for complex traits, (ii) highly variable across the genome, and (iii) correlated with linkage disequilibrium, aspects of the allele frequency distribution, and haplotype characteristics, as predicted by theory. Simulations also demonstrate clear potential for spurious associations to be generated by unequal relatedness among the strains. These findings suggest that association mapping in the classical strains is best applied in combination with other procedures, such as QTL mapping.


CLASSICAL inbred mouse strains provide powerful model systems for dissecting the genetic basis of complex phenotypes. The collection of widely available strains displays dramatic genetic variation in many quantitative traits, and the association of phenotypes with molecular markers in controlled crosses can reveal chromosomal regions that contain the causal loci. This strategy, quantitative trait locus (QTL) mapping, provides essential information about the genetic basis of complex phenotypes, including locus positions, effect sizes, and modes of action. However, standard QTL designs involve only one generation of recombination, so that phenotypic variation is typically associated with large genomic regions. This low level of mapping resolution has left the genes underlying most mouse QTL unidentified (FLINT et al. 2005). Populations of lines formed by additional generations of recombination, including recombinant inbred lines, advanced intercross lines, and heterogeneous stocks, allow finer mapping resolution (MOTT et al. 2000; WILLIAMS et al. 2001; CHURCHILL et al. 2004; YALCIN et al. 2005; VALDAR et al. 2006), but narrowing the resulting genomic intervals to small numbers of contributing genes still constitutes a formidable challenge.

The recent ability to genotype strains at markers from across the genome and the low resolution of most crossing studies has led some investigators to pursue an alternative approach to mapping complex trait variation. In this method (originally referred to as "in silico mapping"), genotypes and phenotypes from groups of classical inbred strains are compared to identify genomic regions that correlate with phenotypic variation (GRUPE et al. 2001; PLETCHER et al. 2004). Because the collective genomes of classical strains have experienced many generations of recombination during their histories, loci can be located with higher precision than in typical crossing designs. Additionally, in contrast to F2 or backcross experiments, where each mouse has a unique genome, large numbers of animals with the same genotype can be measured, increasing the precision and accuracy of phenotypic estimates. Finally, the classical strains need be genotyped only once, accelerating the identification of genotype–phenotype associations.

Researchers have successfully applied this approach to fine map loci underlying complex trait variation in the classical strains. Some studies have used association mapping to narrow genomic intervals previously determined to contribute to phenotypic variation (through crosses), including metastasis (PARK et al. 2003), blood pressure (DIPETRILLO et al. 2004), and plasma cholesterol (WANG et al. 2004; CERVINO et al. 2005). Other studies have conducted association mapping on a genomewide scale and recovered associations that overlap with strong candidate genes (GRUPE et al. 2001; LIAO et al. 2004; PLETCHER et al. 2004; WANG et al. 2005; LIU et al. 2006).

Despite these successful applications, serious concerns about the validity of this method as a general approach for dissecting the genetic basis of complex traits have been raised (CHESLER et al. 2001; DARVASI 2001; CUPPEN 2005). The number of available strains is small, suggesting that the ability to detect contributions from loci with small to moderate effects might be compromised. Additionally, the classical strains have a complex history that includes several forms of nonrandom mating: admixture between divergent natural populations, inbreeding, and other biases (SILVER 1995; BECK et al. 2000; WADE and DALY 2005). This history has led to unequal relatedness among the strains, a phenomenon with the potential to produce correlations between genotype and phenotype in the absence of QTL (LANDER and SCHORK 1994; EWENS and SPIELMAN 1995; PRITCHARD and ROSENBERG 1999; RISCH 2000; CERVINO et al. 2005; MHYRE et al. 2005). The history of the strains also affects allele frequency spectra, patterns of linkage disequilibrium, and the distribution of haplotypes across the genome. The contributions of these variables to the performance of association mapping in the classical strains have not been examined. Here, we use computer simulations based on publicly available SNP genotypes to quantify the expected performance of this approach across classical strain genomes.


MATERIALS AND METHODS

Strain and marker selection:

We selected genotypes for all single-nucleotide polymorphisms (SNPs) (n = 70,656) that had complete data for and were variable across 30 classical strains (Table 1) from the Inbred Laboratory Mouse Haplotype Map ("HapMap") (http://www.broad.mit.edu/personal/claire/MouseHapMap/Inbred.htm; February 2006 version; WADE and DALY 2005), which features the most dense genotype information currently available. These strains were selected on the basis of two criteria: (i) inclusion in the HapMap and (ii) designation as priority strains by the Mouse Phenome Project (BOGUE and GRUBB 2004; GRUBB et al. 2004; http://aretha.jax.org/pub-cgi/phenome/mpdcgi?rtn=docs/home), a large-scale effort to collect and collate phenotypic information for the classical strains. Although the strain sets examined in empirical studies might vary on the basis of the phenotype of interest, we reasoned that the availability of genotypic and phenotypic information for this strain collection makes it a likely focus for association mapping studies. We did not consider wild-derived strains because their divergent evolutionary histories have strong potential to generate spurious associations between genotype and phenotype (MHYRE et al. 2005).


View this table:
In this window
In a new window

 
TABLE 1

Classical inbred mouse strains used for simulations of association mapping

 

Power simulations:

Phenotypes for individual strains were created from SNP genotypes. To simulate an additive QTL mapping to a particular genomic region, one SNP was designated as the causal locus. Genotypes at this SNP were recoded as 0 or 1 (the additive effect, a), and the variance across this set of recoded genotypes was assumed to be the additive genetic variance (Va). Strain phenotypes determined by particular fractions of variance at this locus (proportion of variance explained, PVE, analogous to a locus-specific heritability) were generated by calculating the variance contributed by effects outside this locus, which could include those from the environment or loci mapping elsewhere in the genome, as

Formula
drawing a random effect from a normal distribution (mean = 0; variance = Ve), and adding this effect to a for each strain. In separate simulations, we considered the following PVE values: 0.05, 0.10, 0.15, 0.20, 0.25, 0.30, 0.35, 0.40, 0.45, and 0.50.

We assigned marker haplotypes in a genomic window by removing the causal SNP and concatenating contiguous sets of SNPs flanking the causal SNP. In separate tests, we considered haplotypes composed of two, three, four, five, six, and seven marker SNPs. Tests of association were performed by comparing phenotypes across haplotypic classes using a one-way ANOVA. P-values were estimated by comparison of the observed F-test to a distribution of 1000 (chosen to allow exploration of many parameter values across the genome) F-tests obtained by randomly permuting phenotypes across strains. For each genomic window, 1000 separate phenotypic simulations were performed. Power was estimated as the fraction of simulations with P < 0.05. We followed this procedure for each window in the genome (allowing each SNP to act as the causal SNP). Pseudocode for the power simulation procedure could be written as:

 For each window size (number of SNPs in haplotype)
  For each SNP in the genome
   For each PVE
    For 1000 replicates
     Remove causal SNP
     Obtain P-value for ANOVA association test
    Loop
   Loop
  Loop
 Loop

By calculating power separately for each genomic window, our approach did not directly simulate genomic scans for genotype–phenotype associations. Several factors motivated this decision. First, performing full genomic scans and accounting for these tests in each simulation replicate would be computationally demanding and would drastically reduce the number of genomic regions and parameter combinations (PVE and number of markers) that could be explored. Second, the power of genomic scans depends critically on the manner in which corrections for multiple testing are performed. The best method for adjusting for the very large number of tests performed in genomic association testing is not currently obvious and is an active area of research (PLETCHER et al. 2004; HIRSCHHORN and DALY 2005; BALDING 2006; MCCLURG et al. 2006). Third, our power estimates describe important variation among genomic regions and should be directly relevant to the increasingly common approach of finely localizing associations in the classical strains that were first identified with another method, such as QTL mapping. Finally, our general conclusion that association mapping in the classical strains is underpowered is conservative: accounting for multiple tests would only further reduce power.

False-positive simulations:

The false-positive rate of association mapping could be estimated by setting phenotypic effects to zero and performing association tests. Instead, we sought to model the more realistic scenario of a phenotype that does not map to the genomic region being tested, but still displays heritable variation across the classical strains in a manner that reflects their unequal relatedness.

To accomplish this goal, we used genomic similarity between strains to create phenotypes. Our measure of similarity was the fraction of SNPs from across the genome at which each of the 30 strains differed from a common strain, WSB/EiJ. We selected this strain because (i) it was included in the Inbred Strain HapMap, (ii) it is fairly closely related to the tested classical strains (which are predominantly descended from Mus domesticus; WADE and DALY 2005), and (iii) it was not included in the strain set used for the power simulations. This simplified approach assumes that two very closely related strains will show more similar genomic SNP distances from WSB/EiJ than will two strains with longer divergence times.

To simulate phenotypes, the variance in this distance was treated as the total additive genetic variance, Va. For each simulation, phenotypes with particular heritabilities—but no locus-specific effects—were generated from this (constant) Va using the approach described above. Association testing was then conducted separately for each genomic window. Phenotypes with total heritabilities of 0.4, 0.6, and 0.8 (assumed to come from loci outside the window under consideration) were considered in separate simulations. We selected these heritabilities to span a realistic range for complex phenotypes likely to be subject to association mapping in the classical strains. We considered haplotypes composed of two, three, and four neighboring SNPs in separate simulations. This approach recognizes that the potential for unequal relatedness among strains to produce spurious genotype–phenotype associations varies among genomic regions and allows this potential to be quantitatively measured.

Measuring marker characteristics:

We examined correlations between power and several measures of genotypic variation, including linkage disequilibria, allele frequencies, haplotype number, and haplotype diversity. Linkage disequilibrium was estimated as the average value of R2 (HILL and ROBERTSON 1968) or D' (LEWONTIN 1964) among all pairs of marker SNPs in a genomic window. Haplotype diversity across the k observed haplotypes was measured as

Formula
where pi is the frequency of the ith haplotype (NEI 1987). The size of each genomic interval was calculated as the difference between the physical positions of the first and last markers in a window.


RESULTS

Power:

Genomic distributions of power to detect genotype–phenotype associations across the classical strains show several patterns (Table 2). First, as expected, average power increases steadily with increasing PVE (Figure 1). Second, power is generally low for PVE values < ~0.25 (Figure 1). For example, assuming a PVE of 0.10, the average power across the genome using 3-SNP haplotypes is only 0.266 and the maximum is just 0.464. Third, similar power distributions are observed when haplotypes are composed of different numbers of marker SNPs (within a given PVE level; Figure 2). This pattern indicates that haplotype block sizes generally exceed the window sizes considered here, suggesting that the inbred strain HapMap has sufficient SNP density to capture most common haplotypic variation among the classical strains. Finally, power varies substantially among genomic locations (Figure 1). Some genomic windows retain little power to detect associations, even when PVE is large, while other regions display consistently higher power.


View this table:
In this window
In a new window

 
TABLE 2

Summary statistics of power to associate genotypes with complex trait variation in classical inbred strains

 

Figure 1
View larger version (13K):
In this window
In a new window
Download PPT slide
 
FIGURE 1.—

Genomic distributions of power for association tests using 3-SNP haplotypes.

 

Figure 2
View larger version (11K):
In this window
In a new window
Download PPT slide
 
FIGURE 2.—

Genomic distributions of power for association tests when the causal variant has PVE = 0.10. The label above each histogram is the number of contiguous marker SNPs combined to construct haplotypes. Note that the scale of the y-axis differs from that in Figure 1.

 
To understand the determinants of variation in power across classical strain genomes, we compared power to several characteristics of the SNP markers. Here, we focus on results assuming PVE = 0.10, which is a reasonable effect size for QTL underlying many quantitative traits. Similar results were observed for other PVE values.

The power to associate marker genotypes with complex trait variation is expected to increase with linkage disequilibrium. Positive correlations between power and average pairwise linkage disequilibrium support this prediction for the classical strains (Figure 3; Table 3). R2 shows stronger correlations with power than does D'; this difference is predicted by theory (DEVLIN and RISCH 1995; PRITCHARD and PRZEWORSKI 2001).


Figure 3
View larger version (30K):
In this window
In a new window
Download PPT slide
 
FIGURE 3.—

Scatterplots of power vs. linkage disequilibrium. PVE = 0.10 and association tests were performed with 3-SNP haplotypes.

 

View this table:
In this window
In a new window

 
TABLE 3

Spearman's correlations between power and genotypic characteristics

 
Increased power is also expected when allele frequencies at marker and causal SNPs are more similar (ZONDERVAN and CARDON 2004). Accordingly, power is negatively correlated with the absolute value of the difference between the causal SNP minor allele frequency and the average minor allele frequency of the marker SNPs (Figure 4; Table 3). These correlations are similar in magnitude to observed correlations between power and linkage disequilibrium.


Figure 4
View larger version (26K):
In this window
In a new window
Download PPT slide
 
FIGURE 4.—

Scatterplot of power vs. the absolute value of the difference between causal SNP minor allele frequency and the average minor allele frequency of the marker SNPs. PVE = 0.10 and association tests were performed with 3-SNP haplotypes.

 
Additionally, power should depend on the distribution of marker haplotypes. As the number of haplotypes grows, phenotypes for each haplotypic class are estimated (less accurately) from fewer strains and the number of degrees of freedom for the between-class test in the ANOVA increases. As predicted, genomic regions with fewer haplotypes exhibit significantly greater power (Table 3). This pattern underlies the apparently bimodal power distributions observed for lower PVE values (Figure 1): genomic regions harboring only two haplotypes show clear increases in power relative to regions with higher haplotype numbers. Power is also negatively correlated with haplotype diversity (Table 3), suggesting effects of both haplotype number and frequency.

Finally, the amount of historical recombination in an interval will differ across the genome because of variation in recombination rate and SNP density. As a result, power should be related to the size of the genomic interval. This prediction is supported by the results: window size and power are negatively correlated (Table 3).

Linkage disequilibrium, the difference between causal and marker SNP frequencies, haplotype number, haplotype diversity, and window size all contribute to the power to detect genotype–phenotype associations. We used multiple regression to (i) determine whether each variable influences power when effects of the other variables have been taken into account and (ii) estimate how much of the variation in power can be explained by combining these measures of marker variation. To satisfy assumptions of linear regression, all proportions, including power, R2, D', differences in marker-causal SNP frequency, and haplotype diversity were arcsine-square-root transformed prior to analysis. We selected the set of variables that explain the most genomic variation in power using the step function in the R statistical software (IHAKA and GENTLEMAN 1996), with default settings. We report the results of analyses using 3-SNP marker haplotypes (similar patterns were seen with alternative marker numbers). Both forward and backward stepwise procedures selected all variables (P < 0.05), indicating that each measure contributes independently to power (despite strong intercorrelations between variables). Although this pattern was consistent across PVE values, the relative contributions of each variable differed. Total adjusted R2-values decreased with increasing PVE, ranging from 0.562 (PVE = 0.05) to 0.215 (PVE = 0.5). Potential issues with stepwise multiple regression suggest that these estimates should be viewed with caution (MCCULLAGH and NELDER 1989). Nevertheless, the conclusion that combinations of the measured genotypic variables provide predictive information about power seems warranted.

Because different groups of strains have different histories, strain choice is expected to affect the power of association mapping. As a preliminary investigation into the effects of strain choice on method performance, we measured power across all intervals of chromosome 1 using 10 randomly selected subsets of 20 strains (from the original 30). Power is reduced relative to the 30-strain set across all 20-strain subsets and parameter combinations (results not shown). Pairs of strain sets show correlations in power (Table 4), suggesting that haplotype structure is reasonably conserved. However, correlations are moderate in magnitude and vary among strain set pairs (Table 4), indicating that associations identified in one strain set are not necessarily expected to replicate in a different group of strains.


View this table:
In this window
In a new window

 
TABLE 4

Pairwise Spearman's correlations between power estimates across chromosome 1 intervals for 10 randomly selected sets of 20 classical strains each

 

False-positive rates:

Results from simulations of phenotypes with heritable variation across the strains but without locus-specific effects reveal two key patterns (Figure 5). First, there is clear potential for the generation of spurious genotype–phenotype associations in the classical strains and this potential varies substantially across the genome. Although most genomic regions show acceptable false-positive rates of ≤0.05, many regions show higher rates, including false-positive frequencies of 1 when trait heritability is 0.8. For example, the mean false-positive rate assuming 3-SNP haplotypes and a heritability of 0.8 is 0.257. Second, the risk of false positives is higher for traits with higher heritabilities.


Figure 5
View larger version (8K):
In this window
In a new window
Download PPT slide
 
FIGURE 5.—

Genomic distributions of the false-positive rate for association tests using 3-SNP haplotypes. Heritability is the fraction of phenotypic variance due to additive genetic variance contributed by loci outside the tested window.

 


DISCUSSION

Reduced power:

Association mapping in classical strains of mice can localize QTL of large effect to narrow genomic intervals. Our estimates suggest that, on average, a locus with PVE of 0.35 can be detected with ~80% power using available SNP maps. Therefore, association mapping in the classical strains will often be useful for dissecting phenotypes with simpler genetic bases. However, our results point to the generally low power of this approach for finding loci underlying complex traits. First, most loci that contribute to the genetic basis of complex traits are expected to explain <35% of the phenotypic variance and will therefore often go undetected. For example, recovering loci with PVE of 0.10, which could be interpreted as a relatively large effect size for some phenotypes, with an average power of ~0.25 is simply not adequate for a general approach to mapping QTL. Second, these estimates have not been adjusted for multiple testing, suggesting that further reductions in power will accompany genomic scans for genotype–phenotype associations. Finally, our simulations assumed a simple additive model of quantitative variation. Although the inbred nature of the strains precludes dominance from contributing to phenotypic variance, associations generated by epistatic interactions will be even more challenging to find.

The low average power of association mapping in the classical strains is primarily attributable to the small number of strains (DARVASI 2001). In fact, empirical studies applying this approach have typically used fewer than the 30 strains assumed here. For comparison, standard QTL mapping usually employs hundreds of mice and association mapping in humans often involves thousands of individuals. Although estimates of genetic effects are improved by using inbred strains (in which measurement error and environmental contributions can be minimized), finding loci with small to moderate genetic effects on complex traits will generally require larger sample sizes using any strategy (LYNCH and WALSH 1998; DARVASI 2001; HIRSCHHORN and DALY 2005). The PVE attributable to a detected association may also be overestimated as a consequence of the small number of strains (BEAVIS 1994).

Although average power is low, there is substantial variation across the genome, showing that the performance of association mapping crucially depends on factors other than the number of strains. The histories of the classical strains are responsible for this genomic variation in power. Evolutionary forces including mutation, recombination, selection, and drift have shaped genomic patterns of genotypic and phenotypic variation and affected the ability to correlate individual genomic regions with complex traits among the strains. Although the relative contributions of these and other processes to current diversity are unclear, combinations of genotypic characteristics provide information about the relative power across the genome. Genomic regions with high linkage disequilibrium and (accordingly) limited haplotype diversity offer higher power. Additionally, the emphasis of current SNP maps on common variation translates into a higher likelihood of detecting QTL with intermediate frequencies across the strains (such QTL are also easier to detect with small sample sizes). As a result, this approach focuses more attention on variation ultimately derived from the wild ancestors of the classical strains and less on mutations that have arisen since their founding.

Potential for spurious associations:

The potential for population structure to generate spurious associations between genotype and phenotype has long been recognized in human and plant genetics. The problem is especially important for the classical mouse strains, which have histories involving admixture between divergent populations and patently nonrandom mating (SILVER 1995; BECK et al. 2000; WADE and DALY 2005). We illustrated this effect with a simple model assuming that closely related strains have more similar phenotypes and that the genetic variants responsible for phenotypic variation are not located in the genomic interval being tested. Although more elaborate models could be envisioned, this approach captures the primary cause of false-positive associations: unequal relatedness among the strains. Our results suggest a nontrivial risk of spurious associations across the classical strains.

Fortunately, several methods have been developed that adjust for the effects of population structure on association mapping using patterns of variation at unlinked markers (DEVLIN and ROEDER 1999; PRITCHARD et al. 2000; REICH and GOLDSTEIN 2001; YU et al. 2006). A particularly promising approach for the classical strains accounts for the effects of unequal relatedness on multiple levels (YU et al. 2006). Now that genomewide SNP and microsatellite polymorphism data exist for the classical strains, these strategies can begin to be applied. Efforts to minimize spurious associations will also benefit from full resequencing surveys of the classical strains, which will produce patterns of variation that are free from ascertainment and that better reflect population structure and phylogenetic history among the strains.

Future prospects:

The recent accumulation of genomewide polymorphism data (WILTSHIRE et al. 2003; CERVINO et al. 2005; WADE and DALY 2005) and phenotypic measurements (BOGUE and GRUBB 2004; GRUBB et al. 2004) for the classical strains is an exciting development in mouse genetics. Although many loci underlying complex trait variation will be missed by association mapping with the current strain set, the higher mapping resolution of those loci that are found should continue to motivate the development and application of this approach.

In addition to accounting for the effects of population structure, several possibilities exist for improving the performance of association mapping across the classical strains. Haplotype delineation using diversity (PATIL et al. 2001; ZHANG et al. 2002), linkage disequilibrium (GABRIEL et al. 2002), and information theoretic (ANDERSON and NOVEMBRE 2003) criteria should increase power. Detailed investigations of haplotype block structure in the classical strains will also be required to measure the expected level of mapping resolution. Useful information on genomic haplotype structure across the classical strains is rapidly accumulating (WADE et al. 2002; WILTSHIRE et al. 2003; FRAZER et al. 2004; IDERAABDULLAH et al. 2004; YALCIN et al. 2004; CUPPEN 2005; WADE and DALY 2005; ZHANG et al. 2005). Because classical strain genomes are ultimately derived from wild representatives of M. domesticus, M. musculus, M. castaneus, and M. molossinus (WADE et al. 2002; WADE and DALY 2005), detailed comparisons to patterns of haplotype diversity in natural populations of house mice would enable association mapping in the classical strains. Additionally, the application of model-based association procedures that better incorporate information on strain history in each genomic region could improve power (YALCIN et al. 2005; CERVINO et al. 2007) and continued examination of different statistical methods is warranted (MCCLURG et al. 2006).

Combining association mapping with other approaches, including QTL mapping, is also a fertile area for future research (CERVINO et al. 2005, 2007; DIPETRILLO et al. 2005). The ability of association mapping to refine genomic regions identified by QTL mapping will depend on the relationships of the crossed strains to the remaining strains; therefore, strain selection in QTL mapping could be influenced by patterns of haplotypic diversity across the larger strain set. The expected joint resolution and power of QTL mapping and association mapping deserves modeling attention. Method development might be informed by advances in combining linkage and association mapping in human genetics (ABECASIS et al. 2000; JUNG et al. 2005; WANG and ELSTON 2006).


ACKNOWLEDGEMENTS
We are very grateful to Miron Livny, Zachary Miller, and the Condor Project at the University of Wisconsin for providing access to a large computer cluster and clustering software that made the simulations possible. We thank Mark Daly for making the mouse genotype data freely available. We thank Andrew Clark and Kristi Montooth for initially suggesting the simulation scheme. Karl Broman, Phillip McClurg, and Tim Wiltshire provided useful discussion during the course of the project. Bruce Walsh and two anonymous reviewers provided helpful comments on the manuscript.


LITERATURE CITED

ABECASIS, G. R., L. R. CARDON and W. O. COOKSON, 2000 A general test of association for quantitative traits in nuclear families. Am. J. Hum. Genet. 66: 279–292.[CrossRef][Medline]

ANDERSON, E. C., and J. NOVEMBRE, 2003 Finding haplotype block boundaries by using the minimum-description-length principle. Am. J. Hum. Genet. 73: 336–354.[CrossRef][Medline]

BALDING, D. J., 2006 A tutorial on statistical methods for population association studies. Nat. Rev. Genet. 7: 781–791.[CrossRef][Medline]

BEAVIS, W. D., 1994 The power and deceit of QTL experiments: lessons from comparative QTL studies. Proceedings of the Forty-Ninth Annual Corn and Sorghum Industry Research Conference, American Seed Trade Association, Washington, DC, pp. 250–266.

BECK, J. A., S. LLOYD, M. HAFEZPARAST, M. LENNON-PIERCE, J. T. EPPIG et al., 2000 Genealogies of mouse inbred strains. Nat. Genet. 24: 23–25.[CrossRef][Medline]

BOGUE, M. A., and S. C. GRUBB, 2004 The Mouse Phenome Project. Genetica 122: 71–74.[CrossRef][Medline]

CERVINO, A. C., G. LI, S. EDWARDS, J. ZHU, C. LAURIE et al., 2005 Integrating QTL and high-density SNP analyses in mice to identify Insig2 as a susceptibility gene for plasma cholesterol levels. Genomics 86: 505–517.[CrossRef][Medline]

CERVINO, A. C., A. DARVASI, M. FALLAHI, C. C. MADER and N. F. TSINOREMAS, 2007 An integrated in silico gene mapping strategy in inbred mice. Genetics 175: 321–333.[Abstract/Free Full Text]

CHESLER, E. J., S. L. RODRIGUEZ-ZAS and J. S. MOGIL, 2001 In silico mapping of mouse quantitative trait loci. Science 294: 2423.[CrossRef][Medline]

CHURCHILL, G. A., D. C. AIREY, H. ALLAYEE, J. M. ANGEL, A. D. ATTIE et al., 2004 The Collaborative Cross, a community resource for the genetic analysis of complex traits. Nat. Genet. 36: 1133–1137.[CrossRef][Medline]

CUPPEN, E., 2005 Haplotype-based genetics in mice and rats. Trends Genet. 21: 318–322.[CrossRef][Medline]

DARVASI, A., 2001 In silico mapping of mouse quantitative trait loci. Science 294: 2423.[CrossRef][Medline]

DEVLIN, B., and N. RISCH, 1995 A comparison of linkage disequilibrium measures for fine-scale mapping. Genomics 29: 311–322.[CrossRef][Medline]

DEVLIN, B., and K. ROEDER, 1999 Genomic control for association studies. Biometrics 55: 997–1004.[CrossRef][Medline]

DIPETRILLO, K., S. W. TSAIH, S. SHEEHAN, C. JOHNS, P. KELMENSON et al., 2004 Genetic analysis of blood pressure in C3H/HeJ and SWR/J mice. Physiol. Genomics 17: 215–220.[Abstract/Free Full Text]

DIPETRILLO, K., X. WANG, I. M. STYLIANOU and B. PAIGEN, 2005 Bioinformatics toolbox for narrowing rodent quantitative trait loci. Trends Genet. 21: 683–692.[CrossRef][Medline]

EWENS, W. J., and R. S. SPIELMAN, 1995 The transmission/disequilibrium test: history, subdivision, and admixture. Am. J. Hum. Genet. 57: 455–464.[Medline]

FLINT, J., W. VALDAR, S. SHIFMAN and R. MOTT, 2005 Strategies for mapping and cloning quantitative trait genes in rodents. Nat. Rev. Genet. 6: 271–286.[CrossRef][Medline]

FRAZER, K. A., C. M. WADE, D. A. HINDS, N. PATIL, D. R. COX et al., 2004 Segmental phylogenetic relationships of inbred mouse strains revealed by fine-scale analysis of sequence variation across 4.6 mb of mouse genome. Genome Res. 14: 1493–1500.[Abstract/Free Full Text]

GABRIEL, S. B., S. F. SCHAFFNER, H. NGUYEN, J. M. MOORE, J. ROY et al., 2002 The structure of haplotype blocks in the human genome. Science 296: 2225–2229.[Abstract/Free Full Text]

GRUBB, S. C., G. A. CHURCHILL and M. A. BOGUE, 2004 A collaborative database of inbred mouse strain characteristics. Bioinformatics 20: 2857–2859.[Abstract/Free Full Text]

GRUPE, A., S. GERMER, J. USUKA, D. AUD, J. K. BELKNAP et al., 2001 In silico mapping of complex disease-related traits in mice. Science 292: 1915–1918.[Abstract/Free Full Text]

HILL, W. G., and A. ROBERTSON, 1968 Linkage disequilibrium in finite populations. Theor. Appl. Genet. 38: 226–231.[CrossRef]

HIRSCHHORN, J. N., and M. J. DALY, 2005 Genome-wide association studies for common diseases and complex traits. Nat. Rev. Genet. 6: 95–108.[Medline]

IDERAABDULLAH, F. Y., E. DE LA CASA-ESPERON, T. A. BELL, D. A. DETWILER, T. MAGNUSON et al., 2004 Genetic and haplotype diversity among wild-derived mouse inbred strains. Genome Res. 14: 1880–1887.[Abstract/Free Full Text]

IHAKA, R., and R. GENTLEMAN, 1996 R: a language for data analysis and graphics. J. Comp. Graph. Stat. 5: 299–314.[CrossRef]

JUNG, J., R. FAN and L. JIN, 2005 Combined linkage and association mapping of quantitative trait loci by multiple markers. Genetics 170: 881–898.[Abstract/Free Full Text]

LANDER, E. S., and N. J. SCHORK, 1994 Genetic dissection of complex traits. Science 265: 2037–2048.[Abstract/Free Full Text]

LEWONTIN, R. C., 1964 The interaction of selection and linkage. I. General considerations; heterotic models. Genetics 49: 49–67.[Free Full Text]

LIAO, G., J. WANG, J. GUO, J. ALLARD, J. CHENG et al., 2004 In silico genetics: identification of a functional element regulating H2-Ealpha gene expression. Science 306: 690–695.[Abstract/Free Full Text]

LIU, P., Y. WANG, H. VIKIS, A. MACIAG, D. WANG et al., 2006 Candidate lung tumor susceptibility genes identified through whole-genome association analyses in inbred mice. Nat. Genet. 38: 888–895.[CrossRef][Medline]

LYNCH, M., and B. WALSH, 1998 Genetics and Analysis of Quantitative Traits. Sinauer Associates, Sunderland, MA.

MCCLURG, P., M. T. PLETCHER, T. WILTSHIRE and A. I. SU, 2006 Comparative analysis of haplotype association algorithms. BMC Bioinform. 7: 61.[CrossRef][Medline]

MCCULLAGH, P., and J. A. NELDER, 1989 Generalized linear models. Chapman & Hall, New York.

MHYRE, T. R., E. J. CHESLER, M. THIRUCHELVAM, C. LUNGU, D. A. CORY-SLECHTA et al., 2005 Heritability, correlations and in silico mapping of locomotor behavior and neurochemistry in inbred strains of mice. Genes Brain Behav. 4: 209–228.[CrossRef][Medline]

MOTT, R., C. J. TALBOT, M. G. TURRI, A. C. COLLINS and J. FLINT, 2000 A method for fine mapping quantitative trait loci in outbred animal stocks. Proc. Natl. Acad. Sci. USA 97: 12649–12654.[Abstract/Free Full Text]

NEI, M., 1987 Molecular Evolutionary Genetics. Columbia University Press, New York.

PARK, Y. G., R. CLIFFORD, K. H. BUETOW and K. W. HUNTER, 2003 Multiple cross and inbred strain haplotype mapping of complex-trait candidate genes. Genome Res. 13: 118–121.[Abstract/Free Full Text]

PATIL, N., A. J. BERNO, D. A. HINDS, W. A. BARRETT, J. M. DOSHI et al., 2001 Blocks of limited haplotype diversity revealed by high-resolution scanning of human chromosome 21. Science 294: 1719–1723.[Abstract/Free Full Text]

PLETCHER, M. T., P. MCCLURG, S. BATALOV, A. I. SU, S. W. BARNES et al., 2004 Use of a dense single nucleotide polymorphism map for in silico mapping in the mouse. PLoS Biol. 2: e393.[CrossRef][Medline]

PRITCHARD, J. K., and M. PRZEWORSKI, 2001 Linkage disequilibrium in humans: models and data. Am. J. Hum. Genet. 69: 1–14.[CrossRef][Medline]

PRITCHARD, J. K., and N. A. ROSENBERG, 1999 Use of unlinked genetic markers to detect population stratification in association studies. Am. J. Hum. Genet. 65: 220–228.[CrossRef][Medline]

PRITCHARD, J. K., M. STEPHENS and P. DONNELLY, 2000 Inference of population structure using multilocus genotype data. Genetics 155: 945–959.[Abstract/Free Full Text]

REICH, D. E., and D. B. GOLDSTEIN, 2001 Detecting association in a case-control study while correcting for population stratification. Genet. Epidemiol. 20: 4–16.[CrossRef][Medline]

RISCH, N. J., 2000 Searching for genetic determinants in the new millennium. Nature 405: 847–856.[CrossRef][Medline]

SILVER, L. M., 1995 Mouse Genetics. Oxford University Press, New York.

VALDAR, W., L. C. SOLBERG, D. GAUGUIER, S. BURNETT, P. KLENERMAN et al., 2006 Genome-wide genetic association of complex traits in heterogeneous stock mice. Nat. Genet. 38: 879–887.[CrossRef][Medline]

WADE, C. M., and M. J. DALY, 2005 Genetic variation in laboratory mice. Nat. Genet. 37: 1175–1180.[CrossRef][Medline]

WADE, C. M., E. J. KULBOKAS 3RD, A. W. KIRBY, M. C. ZODY, J. C. MULLIKIN et al., 2002 The mosaic structure of variation in the laboratory mouse genome. Nature 420: 574–578.[CrossRef][Medline]

WANG, J., G. LIAO, J. USUKA and G. PELTZ, 2005 Computational genetics: From mouse to human? Trends Genet. 21: 526–532.[CrossRef][Medline]

WANG, T., and R. C. ELSTON, 2006 A quantitative linkage score for an association study following a linkage analysis. BMC Genet. 7: 5.[CrossRef][Medline]

WANG, X., R. KORSTANJE, D. HIGGINS and B. PAIGEN, 2004 Haplotype analysis in multiple crosses to identify a QTL gene. Genome Res. 14: 1767–1772.[Abstract/Free Full Text]

WILLIAMS, R. W., J. GU, S. QI and L. LU, 2001 The genetic structure of recombinant inbred mice: high-resolution consensus maps for complex trait analysis. Genome Biol. 2: RESEARCH0046.[Medline]

WILTSHIRE, T., M. T. PLETCHER, S. BATALOV, S. W. BARNES, L. M. TARANTINO et al., 2003 Genome-wide single-nucleotide polymorphism analysis defines haplotype patterns in mouse. Proc. Natl. Acad. Sci. USA 100: 3380–3385.[Abstract/Free Full Text]

YALCIN, B., J. FLINT and R. MOTT, 2005 Using progenitor strain information to identify quantitative trait nucleotides in outbred mice. Genetics 171: 673–681.[Abstract/Free Full Text]

YALCIN, B., J. FULLERTON, S. MILLER, D. A. KEAYS, S. BRADY et al., 2004 Unexpected complexity in the haplotypes of commonly used inbred strains of laboratory mice. Proc. Natl. Acad. Sci. USA 101: 9734–9739.[Abstract/Free Full Text]

YU, J., G. PRESSOIR, W. H. BRIGGS, I. VROH BI, M. YAMASAKI et al., 2006 A unified mixed-model method for association mapping that accounts for multiple levels of relatedness. Nat. Genet. 38: 203–208.[CrossRef][Medline]

ZHANG, J., K. W. HUNTER, M. GANDOLPH, W. L. ROWE, R. P. FINNEY et al., 2005 A high-resolution multistrain haplotype analysis of laboratory mouse genome reveals three distinctive genetic variation patterns. Genome Res. 15: 241–249.[Abstract/Free Full Text]

ZHANG, K., M. DENG, T. CHEN, M. S. WATERMAN and F. SUN, 2002 A dynamic programming algorithm for haplotype block partitioning. Proc. Natl. Acad. Sci. USA 99: 7335–7339.[Abstract/Free Full Text]

ZONDERVAN, K. T., and L. R. CARDON, 2004 The complex interplay among factors that influence allelic association. Nat. Rev. Genet. 5: 89–100.[Medline]

Communicating editor: J. B. WALSH




This article has been cited by other articles:


Home page
IBMS BoneKEyHome page
C. L. Ackert-Bicknell and C. J. Rosen
The Future of Mouse Genetics in Osteoporosis Research
IBMS BoneKEy, June 1, 2009; 6(6): 200 - 209.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
S. L. Burgess-Herbert, A. Cox, S.-W. Tsaih, and B. Paigen
Practical Applications of the Bioinformatics Toolbox for Narrowing Quantitative Trait Loci
Genetics, December 1, 2008; 180(4): 2227 - 2235.
[Abstract] [Full Text] [PDF]


Home page
Integr. Comp. Biol.Home page
C. J. Vinyard and B. A. Payseur
Of "mice" and mammals: utilizing classical inbred mice to study the genetic architecture of function and performance in mammals
Integr. Comp. Biol., June 24, 2008; (2008) icn063v1.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
P. Jones, K. Chase, A. Martin, P. Davern, E. A. Ostrander, and K. G. Lark
Single-Nucleotide-Polymorphism-Based Association Mapping of Dog Stereotypes
Genetics, June 1, 2008; 179(2): 1033 - 1044.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
Z. Trachtulec, C. Vlcek, O. Mihola, S. Gregorova, V. Fotopulosova, and J. Forejt
Fine Haplotype Structure of a Chromosome 17 Region in the Laboratory and Wild Mouse
Genetics, March 1, 2008; 178(3): 1777 - 1784.
[Abstract] [Full Text] [PDF]