Skip to main content
  • Facebook
  • Twitter
  • YouTube
  • LinkedIn
  • Google Plus
  • Other GSA Resources
    • Genetics Society of America
    • G3: Genes | Genomes | Genetics
    • Genes to Genomes: The GSA Blog
    • GSA Conferences
    • GeneticsCareers.org

Institution: Massachusetts Inst of Technol MIT Libs

  • Log in
Genetics

Main menu

  • HOME
  • ISSUES
    • Current Issue
    • Early Online
    • Archive
  • ABOUT
    • About the journal
    • Why publish with us?
    • Editorial board
    • Early Career Reviewers
    • Contact us
  • SERIES
    • All Series
    • Genomic Prediction
    • Multiparental Populations
    • FlyBook
    • WormBook
    • YeastBook
  • ARTICLE TYPES
    • About Article Types
    • Commentaries
    • Editorials
    • GSA Honors and Awards
    • Methods, Technology & Resources
    • Perspectives
    • Primers
    • Reviews
    • Toolbox Reviews
  • PUBLISH & REVIEW
    • Scope & publication policies
    • Submission & review process
    • Article types
    • Prepare your manuscript
    • Submit your manuscript
    • After acceptance
    • Guidelines for reviewers
  • SUBSCRIBE
    • Why subscribe?
    • For institutions
    • For individuals
    • Email alerts
    • RSS feeds
  • Other GSA Resources
    • Genetics Society of America
    • G3: Genes | Genomes | Genetics
    • Genes to Genomes: The GSA Blog
    • GSA Conferences
    • GeneticsCareers.org

User menu

  • Log out

Search

  • Advanced search
Genetics

Advanced Search

  • HOME
  • ISSUES
    • Current Issue
    • Early Online
    • Archive
  • ABOUT
    • About the journal
    • Why publish with us?
    • Editorial board
    • Early Career Reviewers
    • Contact us
  • SERIES
    • All Series
    • Genomic Prediction
    • Multiparental Populations
    • FlyBook
    • WormBook
    • YeastBook
  • ARTICLE TYPES
    • About Article Types
    • Commentaries
    • Editorials
    • GSA Honors and Awards
    • Methods, Technology & Resources
    • Perspectives
    • Primers
    • Reviews
    • Toolbox Reviews
  • PUBLISH & REVIEW
    • Scope & publication policies
    • Submission & review process
    • Article types
    • Prepare your manuscript
    • Submit your manuscript
    • After acceptance
    • Guidelines for reviewers
  • SUBSCRIBE
    • Why subscribe?
    • For institutions
    • For individuals
    • Email alerts
    • RSS feeds
Previous ArticleNext Article

Fine Mapping of Quantitative Trait Loci Using Linkage Disequilibria With Closely Linked Marker Loci

T. H. E. Meuwissen and M. E. Goddard
Genetics May 1, 2000 vol. 155 no. 1 421-430
T. H. E. Meuwissen
*Research Institute of Animal Science and Health, Box 65, 8200 AB Lelystad, The Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: t.h.e.meuwissen@id.dlo.nl
M. E. Goddard
†Institute of Land and Food Resources, University of Melbourne, Parkville, Victoria 3052, Australia
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Article
  • Figures & Data
  • Info & Metrics
Loading

Abstract

A multimarker linkage disequilibrium mapping method was developed for the fine mapping of quantitative trait loci (QTL) using a dense marker map. The method compares the expected covariances between haplotype effects given a postulated QTL position to the covariances that are found in the data. The expected covariances between the haplotype effects are proportional to the probability that the QTL position is identical by descent (IBD) given the marker haplotype information, which is calculated using the genedropping method. Simulation results showed that a QTL was correctly positioned within a region of 3, 1.5, or 0.75 cM in 70, 62, and 68%, respectively, of the replicates using markers spaced at intervals of 1, 0.5, and 0.25 cM, respectively. These results were rather insensitive to the number of generations since the QTL occurred and to the effective population size, except that 10 generations yielded rather poor estimates of the QTL position. The position estimates of this multimarker disequilibrium mapping method were more accurate than those from a single marker transmission disequilibrium test. A general approach for identifying QTL is suggested, where several stages of disequilibrium mapping are used with increasingly dense marker spacing.

LINKAGE disequilibrium mapping has been successful in mapping genetical disorders (e.g., Hästbackaet al. 1992). The method attempts to find a chromosomal region that is identical by descent (IBD) among the diseased individuals, since such a region may carry the disease gene. The IBD region is detected by closely linked marker loci that carry identical alleles at this region in the diseased individuals (e.g., Pritchardet al. 1991; Houwenet al. 1994). Hence, simultaneous linkage disequilibria between several closely linked markers and the disease gene are detected. The size of the IBD region decreases with the number of meioses since the disease mutation occurred and may be small, which leads to the detection of a small region that contains the disease gene.

A linkage analysis is often used for the mapping of quantitative trait loci (QTL), where the inheritance of chromosomal regions within the data set is traced by markers (see Hoescheleet al. 1997, for a review). The region whose inheritance explains most of the variance of the phenotypic records indicates the most likely position of the QTL. To position the QTL, linkage mapping uses only the recombinations that occurred within the data set, which typically contains two to three generations. With closely linked markers, there will be few recombinations between adjacent markers during these two to three generations and hence a dense marker map will provide little extra information about the position of the QTL, unless the number of individuals per generation is very large (Darvasiet al. 1993). In a high-resolution mapping experiment, even with the use of recombinant inbred lines, Long et al. (1995) could map only QTL affecting bristle numbers in Drosophila to regions of ~5–10 cM using linkage analysis. Linkage disequilibrium mapping uses all recombinations since the mutation occurred, which increases the precision of the estimate of the position. Linkage disequilibrium mapping methods seem, therefore, more useful for precise estimation of QTL positions, while linkage mapping is more useful for a genome-wide scan for QTL.

Linkage disequilibrium mapping methods for QTL that consider several markers have not been proposed in the literature, mainly because the carriers of the mutant QTL allele cannot be identified. This is the case unless the effect of the mutation is very large and the methods for qualitative traits such as disease genes can be applied. An additional complication is that the allele, which represents the most recent QTL mutation, is unknown. Hence, even if we could identify the QTL alleles, it is unknown which QTL allele should contain the identical marker alleles in a chromosome region. However, calculation of linkage disequilibria between QTL and marker loci does not require knowledge about which mutation is most recent and, therefore, can be considered for the positioning of the QTL.

The linkage disequilibrium between a single marker and a QTL can be measured by estimating the effect of the marker on the quantitative trait in a regression analysis. This approach is extended to multiple marker loci by estimating the effect of marker haplotypes on the quantitative trait. The marker haplotypes that have identical marker alleles in a region surrounding the QTL are expected to show similar haplotype effects, since the identical markers indicate that the region is IBD and thus the haplotypes are expected to carry similar QTL alleles. In statistical terms, similar haplotype effects imply that the covariance between the haplotype effects is high. Whether two marker haplotypes have identical alleles in a region surrounding the QTL depends on the position of the QTL, and hence, the covariance between the haplotype effects depends on the position of the QTL. This dependence of haplotype covariances on the position of the QTL is illustrated in Table 1 for a simple situation, where marker haplotypes identify QTL alleles with certainty; i.e., correlations among the haplotypes are either 0 or 1. The general principle, however, also holds under less simple conditions. The covariances between haplotype effects can thus be used to position the QTL. The aim of this article is to formally develop this method for the estimation of the QTL position using the linkage disequilibria between several closely linked markers and the QTL. The accuracy of the method in finding the correct QTL position is investigated mainly in the context of livestock populations but the method (and results) can clearly be applied also to other types of populations. Last, the precision of the method is compared to that of a single marker transmission disequilibrium test (TDT) analysis (Rabinowitz 1997).

View this table:
  • View inline
  • View popup
  • Download powerpoint
TABLE 1

Correlation matrix of haplotype effects

METHODS

General: It is assumed here that a linkage mapping study has narrowed the position of the QTL down to a 5- to 20-cM region. Within this region, there are many markers available, with a between-marker spacing of typically 0.25–1 cM. Linkage disequilibrium mapping is used to find the most likely marker bracket, i.e., the region between two adjacent markers, that contains the QTL. In principle the method can also be used to estimate the position of the QTL within a marker bracket, but it seems that there is too little information for such a precise estimate.

Maximum-likelihood estimation of QTL position: It is assumed that phenotypic records from only the last generation of individuals are used in the analysis. Marker data are available on this generation and their parents so that for many individuals it is known which of the marker alleles is paternally and which is maternally derived; i.e., the linkage phases of the marker alleles are known and marker haplotypes can be constructed. For example, when the individual has genotype M1M2/N1N2 and its parents have M1M3/N1N3 and M2M4/N2N4, the linkage phase of the individual is M1N1/M2N2. If the linkage phase is not known, the haplotype is missing and the individual is excluded from the analysis. However, in the case of highly polymorphic closely linked markers (such that double recombinations are unlikely), the correct linkage phase can often be assigned with a high probability.

The phenotypic records of the last generation are modeled by y=Xb+Zh+e, (1) where y is the vector of records; b is the vector of fixed (nuisance) effects for which the data are to be corrected; h = (q × 1) vector of random effects of the haplotypes; e is the vector of residuals; and X and Z are known incidence matrices for the effects in b and h, respectively. The variance of the residuals is Var(e)=σe2R , where R is assumed here to be an identity matrix, but in general R can account for covariances between residuals, which may be due to background genes and family relationships. The variance of the haplotype effects is Var(h)=σh2Hp , where the matrix Hp yields the (co)variances of the haplotype effects up to proportionality and subscript p indicates that Hp depends on the assumed position of the QTL. The dimension of Hp is q*q, where q is the number of different haplotypes in the data.

Assuming multivariate normality, the residual loglikelihood of the data under the above model is L(Hp,σh2,σe2)∝−0.5[ln(∣V∣)+ln(∣X′V−1X∣)+(y−Xb^)′V−1(y−Xb^)] (2) (Patterson and Thompson 1971), where V=Var(y)=[ZHpZ′σh2+Rσe2] , and b^ is the generalized least-squares estimate of b. The term ln(|X′V−1X|) corrects for the fact that fixed effects are estimated instead of known and is redundant when fixed effects are absent or fixed-effect classes are large; i.e., estimation of fixed effects is accurate. Given a QTL position, p, i.e., given Hp, this likelihood is maximized to obtain estimates of the variance components σ^h2 and σ^e2 (see, e.g., Henderson 1984). Algorithms that require the inverse of the Hp matrix should be avoided for the residual maximum likelihood (REML) estimation of the variance components, because Hp will be (close to) singular when the distances between the markers become small; i.e., methods that require the inverse of Hp may be numerically unstable.

The likelihood L(Hp,σ^h2,σ^e2) can be calculated for every position of the QTL. The maximum-likelihood estimate of the QTL is the position where L(Hp,σ^h2,σ^e2) is highest. To calculate L(Hp,σ^h2,σ^e2) , we need to obtain the (co)variance matrix of the haplotype effects, Hp, given the position of the QTL.

Calculation of Hp: The covariance between two haplotypes effects, hi and hj, is Cov(hi,hj)=Prob(IBD∣marker haplotypes)×σh2, where Prob(IBD|marker haplotypes) is the probability that the QTL locus is IBD given the marker haplotypes. The probability that a locus is IBD given the haplotypes surrounding the locus may be obtained from using the coalescence process (Hudson 1985, 1993), but this proved complex for multiple markers and the IBD probabilities were difficult to obtain. Here, we used the gene-dropping method (Macclueret al. 1986) to obtain the IBD probabilities given the haplotype information.

In the genedropping method, markers and a putative QTL are simulated in a base generation. All 2Ne base generation QTL alleles, which are called founder alleles, have a unique number. The next NG descendant generations are simulated by choosing at random parents from the previous generation and letting their Ne offspring inherit haplotypes or recombinant haplotypes according to Mendel's rules and the recombination probabilities. The parents of each progeny are randomly sampled such that the effective population size is Ne. Because the founder QTL alleles have unique numbers, any two QTL alleles with the same number in generation NG are IBD. NG is the number of generations that passed since the mutation occurred.

A problem with this application of the genedropping method is that after NG generations, we will not obtain the same haplotypes as in our data set, because there are too many possible haplotypes. For instance, with 10 biallelic markers there are 1024 possible haplotypes and hence 1024 × 1024 covariances to be estimated. Fortunately many of these covariances are expected to be equal and so can be grouped together and a single value is estimated for the group. To assess the probability that haplotypes i and j contain QTL alleles that are IBD, we move along the chromosome away from the QTL locus and find a marker that has, say, allele M1 for haplotype i and allele M2 for haplotype j. The haplotypes are clearly not IBD at this marker locus, M. Hence, if there was an IBD region around the QTL, it has ended before marker M. Any equality or nonequality of alleles at markers further away from the QTL than locus M does not affect the probability that the QTL locus is IBD. Let Nl denote the number of markers for which two haplotypes have identical alleles, if we start at the QTL position and count toward the left until the first nonidentical marker alleles occur (Nl = 0, 1, 2, …). Similarly, let Nr denote the number of markers that carry identical alleles on the right side of the QTL until the first nonidentical marker alleles occur. Now we can classify all haplotype pairs into groups with equal (Nl,Nr), which are expected to have equal probabilities of being IBD at the marker locus. For instance, the probability that haplotypes (11111Q11111) and (22211Q111222) are IBD at the QTL is the same as the probability that haplotypes (22222Q22222) and (111122Q22211) share a IBD QTL, because both pairs have two identical markers to the left of the QTL and three to the right of the QTL [i.e., (Nl,Nr) = (2, 3)].

View this table:
  • View inline
  • View popup
TABLE 2

Details of the simulations

With the genedropping method the IBD probabilities of a pair of haplotypes can be estimated within each genedrop by dividing the number of times the QTL locus was IBD by the total number of times the haplotype pair was found. The estimates of the IBD probabilities of the haplotype pairs that belong to the same (Nl,Nr) group are averaged within a genedrop, and these averages are accumulated across 100,000 repeated gene-drops to obtain the estimates of the IBD probabilities for every haplotype combination group (Nl,Nr).

Testing the linkage disequilibrium mapping method: An analysis of simulated data, where the correct position of the QTL is known, is used to test the proposed mapping method. Details of the simulation are provided in Table 2. The simulation of the base and later generations was as with the genedropping method (see previous section). In the last generation, at random, one founder QTL allele is chosen among the group of surviving founder QTL alleles, and this allele obtains a value of 1 while all others obtain a value of 0. The latter results in a frequency of the positive QTL allele that is >0 (otherwise there was no polymorphic QTL to detect). Because the QTL was previously detected by linkage analysis study, it seems reasonable to assume that the frequency of the positive QTL allele is >0.1. The requirement that the QTL allele frequency is >0.1 is studied by choosing at random one founder allele among the group of founder alleles with a frequency >0.1 and giving this chosen founder allele an effect of 1. Marker haplotypes were known without error in the simulation study. One record per individual of the last generation was obtained by adding to the sum of both QTL allele effects of the individual an environmental effect that was sampled from N(0, 1). In the last generation there were 100 or 500 instead of Ne individuals to avoid confounding between the amount of information in the data set and the effective size of the population.

RESULTS

Number of generations since mutation and effective size are 100: Table 3 shows the estimated positions of the QTL, when the QTL mutation occurred 100 generations ago and the effective population size was 100, which was also used to calculate Hp. In 31–35 out of 50 replicated simulations, the estimated position was in the correct marker bracket or the neighboring one. This number decreased somewhat with a decreasing size of the marker brackets. However, with smaller bracket sizes the position estimates expressed in centimorgans became more precise. It should be noted also that the prior estimate of the QTL position was more precise with the smaller bracket sizes. Prior to the analysis, it was assumed known that the QTL was somewhere between the 10 markers, which span a region of 9, 4.5, and 2.25 cM if the distances between the markers are 1, 0.5, and 0.25 cM, respectively. The requirement that the allele frequency of the QTL in the last generation exceeds 0.1, which accounts for the fact that detected QTL are probably not rare, increased the precision of the estimates somewhat.

The estimated position was more often four brackets away from the correct position than three (Table 3), which seems counterintuitive. However, a close examination of some replicates whose estimates deviated four brackets from the correct position showed that the QTL carrying haplotypes often had identical alleles at markers 1–5 (or 6–10). Hence, the QTL appears to be somewhere between markers 1 and 5 (or 6 and 10). The method predicts the highest covariances between marker haplotypes, which share alleles at markers 1–5 when the QTL is in the leftmost bracket. Hence, the QTL is predicted in the leftmost bracket, which deviates four brackets from the correct bracket. The situation that all QTL carrying haplotypes have identical marker alleles at positions 1–5 occurs more often when there are few QTL carrying haplotypes, e.g., in replicates with a low frequency of the QTL (Table 3).

View this table:
  • View inline
  • View popup
TABLE 3

Precision of QTL position estimates

Figure 1 shows the average log-likelihood curve of the 50 replicated simulations as a deviation from the base likelihood, where the base likelihood is calculated using Equation 2 but with σh2=0 , i.e., without fitting haplotype effects. The average likelihood shows a bell-shaped curve with the peak at the QTL position. The symmetry of the curve suggests that the estimation of the QTL position is approximately unbiased. The maximum of the likelihood increases with smaller marker bracket sizes and the curve becomes more peaked. The likelihood at the brackets that are adjacent to the QTL-carrying bracket is, however, not much lower than that at the QTL-carrying bracket, which indicates that there will be little information to distinguish the correct bracket from its neighboring brackets. The difference in likelihood between the model with a QTL and the model without a QTL anywhere in the marked region is evidence for the existence of the QTL. Since this evidence will usually be independent of the linkage analysis that originally mapped the QTL, this likelihood-ratio test, based on linkage disequilibrium, will help to confirm the existence of the QTL.

In Table 3, the actual number of individuals, and thus phenotypic records, was equal to the effective number (100). In practice, the actual number of individuals may be much larger than the effective number. Hence, the phenotypic number of records can also exceed the effective number of animals, which results in more records per haplotype, i.e., more accurate estimates of haplotype effects. If there are 500 animals with records in the last generation, the number of position estimates that are in the correct or a neighboring bracket is increased to 38–39 out of 50 replicated simulations when the bracket size is 1 or 0.5 cM (Table 4). However, if the bracket size is 0.25 cM, the mapping precision is not increased compared to the situation with 100 recorded animals. The latter may be because there are too few (detectable) recombinations close to the QTL in the case of 0.25-cM brackets, such that an increased accuracy of the estimation of the effect of a haplotype hardly improves the accuracy of the position estimates.

Figure 1.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 1.

The log-likelihood of a QTL minus that of having no QTL averaged over 50 replicated simulations, with marker bracket sizes of 1, 0.5, and 0.25 cM. The number of generations since the QTL mutation and the effective population size were both 100.

(▪) 1-cM brackets, (♦) 0.5-cM brackets, (○) 0.25-cM brackets, (▴) QTL position.

View this table:
  • View inline
  • View popup
TABLE 4

Precision of QTL position estimates when the number of records was increased

Different numbers of generations since the mutation: Table 5 shows the fractions of the QTL position estimates that are in the correct or in the neighboring bracket when the time since the mutation is varied (NG = 200, 100, 50, or 10). Generally, a NG of 100 yielded the most precise QTL estimates. The QTL estimates with NG = 200 were slightly less precise probably because the inbreeding was getting slightly too high; i.e., the number of segregating haplotypes was reduced. After 200 generations at an effective size of 100 the inbreeding coefficient is 0.63. A NG of 50 yielded QTL position estimates when the bracket size was 0.25 cM much less precise than the higher NG values. This is probably because there have been too few recombinations between adjacent markers at this small bracket size. When NG is only 10 generations the position estimates become no better than chance, which is probably again due to too few recombinations.

In previous simulations it was assumed that the number of generations since the QTL mutation occurred was known when calculating Hp, whereas this parameter is unknown in practice. However, when Hp is calculated assuming a NG of 100, the precision of the QTL position estimates decreases only slightly (Table 5). In fact, for NG = 10 the precision is higher when NG = 100 instead of the true value is used to calculate Hp. It seems counter-intuitive that using a wrong NG for estimating Hp results in more precise QTL estimates here. However, this is probably because the contrasts between the haplotype effects that result in estimating the QTL position are much more pronounced in the Hp matrix with NG = 100 than in that with NG = 10, which may lead to little covariance between different haplotypes since recombination probabilities are small. Hence, the Hp assuming NG = 100 may have contrasted the small differences between haplotypes that result in QTL position estimates more, while the differences in likelihood with an Hp matrix assuming NG = 10 may be more affected by sampling errors on the estimates of Hp matrices at different positions. In general, however, Table 5 shows that assuming NG = 100 when estimating Hp results in close to optimal position estimates.

Different numbers of effective population sizes: Table 6 shows the results when the effective population size, Ne, was either 50 or 200. The results at Ne equal to 50 or 200 are similar to those at the Ne of 100, except at the marker bracket size of 0.25 cM, where the Ne = 50 or 200 schemes yielded somewhat less precise estimates. At Ne = 50 genetic drift is high, which makes the frequency of large chromosomal segments drift rapidly before recombination can reduce their sizes. The latter hampers mainly the mapping precision when small bracket sizes are used. At Ne = 200 there is little genetic drift of haplotypes, such that a steady-state linkage disequilibrium due to drift may not have occurred yet. The establishing of a steady-state linkage disequilibrium is probably slower when recombination frequencies are small; hence, the mapping precision is more reduced at small marker bracket sizes.

View this table:
  • View inline
  • View popup
TABLE 5

Precision of QTL position estimates with varying numbers of generations since the mutation

View this table:
  • View inline
  • View popup
TABLE 6

Precision of QTL position estimates with varying effective population sizes

Also in Table 6, a situation in which a QTL mutation occurred 1030 generations ago is investigated, where the effective size was 1000 during the first 1000 generations and 100 during the last 30 generations. In livestock populations this may be realistic, where the recent bottleneck is due to the introduction of herd books. The bottleneck seems to have favored the mapping precision at the small bracket size of 0.25 cM while the precision at the larger bracket sizes is reduced. The long time period since the mutation probably resulted in a small IBD region around the QTL, such that the further away markers will show too little linkage disequilibrium with the QTL (although this linkage disequilibrium is increased by the recent bottleneck).

Comparison to single-marker disequilibrium mapping: Table 7 compares position estimates when the last generation consists of a half-sib family structure, which is obtained by mating 10 randomly chosen males each to 10 different females. This half-sib family structure was also analyzed using the transmission disequilibrium test (Rabinowitz 1997). The TDT was applied by fitting a model with a fixed half-sib family effect and the effect of a single marker to the data, where the half-sib family effect ensures that the TDT uses only within-half-sib family deviations for the estimation of the marker effect. All markers were fitted in turn and the marker that resulted in the highest likelihood of the data was expected to be closest to the QTL. The half-sib family structure yields somewhat less precise QTL position estimation than the unstructured population of Table 3. This may be in part due to the use of the Hp matrices of the unstructured population for the analysis in Table 7. A more correct analysis with the multimarker method would require the use of a Hp matrix within and across half-sib families. Despite the use of the slightly wrong Hp matrices, the multimarker method yielded a 13–33% higher probability of positioning the QTL in the correct or an adjacent bracket than the TDT. It may be noted that the position estimates of the TDT method are at the marker position instead of in a bracket, which complicates an exact comparison of the precision of the position estimates somewhat. In the comparison of Table 7, the TDT position estimates that are at the boundaries of the brackets that are next to the QTL bracket are also counted as within these brackets.

View this table:
  • View inline
  • View popup
TABLE 7

Precision of QTL position estimates when single or multiple markers are used

DISCUSSION

Relations with other methods and extensions: A multiple marker-QTL linkage disequilibrium mapping method that often positioned a QTL within one marker bracket from its correct position was presented, where the size of the brackets was 1, 0.5, or 0.25 cM. Hence, the QTL is probably in the estimated bracket or in the bracket next to it, which implies a region of 3, 1.5, or 0.75 cM, respectively. Reducing the size of the marker brackets can thus reduce the size of this region (see also Figure 1). The method may be seen as an extension of the multipoint linkage disequilibrium mapping method for discrete traits (Terwilliger 1995) toward QTL. However, Terwilliger's discrete trait mapping method was a full maximum-likelihood method, whereas the present QTL mapping method is approximately maximum likelihood, due to the multivariate normality assumption involved in likelihood (2). This assumption implies that the method uses only the first two moments of the data for the estimation of the QTL position, which may be satisfactory in many situations because the higher moments of the data are often unknown and often contain little extra information.

The method presented in this article is also an extension of the single-marker-QTL TDT method (Rabinowitz 1997) toward using all markers simultaneously for the estimation of the QTL positions. This extension to more markers increased the accuracy of the position estimates substantially (Table 7). In the comparison of Table 7 the TDT method had a disadvantage because it used only within-half-sib family deviations for the estimation of the marker effects. This avoids family effects causing bias in the estimated marker effects in the TDT. In the simulations there were no family effects other than those due to the QTL, but in practice these would be likely to occur. Hence, in practice, the multimarker mapping method also has to correct for the effects of polygenetic and environmental family effects. Possible environmental family effects are easily included in the fixed-effect structure of model (1). A polygenic term should be included in the random effects of model (1) to account for the background genes.

A disadvantage of the multimarker mapping method in the comparison of Table 7 is that the Hp matrices were not adapted to the half-sib family structure. Estimation of the Hp matrices within and across families would overcome this problem. However, in practice the family structure will often be more complex than that of full- or half-sib families. This may be overcome by fitting haplotype × individual effects in model (1); i.e., there will be two haplotype effects estimated for every individual, and the IBD probabilities have to be calculated conditional on both the markers and the pedigree. The latter is an extension of the model of Fernando and Grossman (1989), who fitted two QTL effects per individual and used a covariance matrix of these QTL effects that was proportional to IBD probabilities. In the model of Fernando and Grossman, identity by descent, however, occurred only when a common ancestor was found within the marked pedigree, not before pedigree recording started. The latter makes Fernando and Grossman's method useful for linkage analysis mapping (see Hoescheleet al. 1997). We are in the process of combining the current IBD probabilities with those of Fernando and Grossman, which will combine the information from linkage disequilibrium mapping with that of linkage analysis.

The information from linkage disequilibria: Hill and Weir (1994) showed that the variance of the linkage disequilibrium between a closely linked marker and a QTL is large, such that the disequilibrium cannot be used for the precise mapping of the QTL. The situation investigated here differs from that of Hill and Weir in two respects. First, we use the disequilibria between all markers and the QTL simultaneously, which seems to avoid the problem of a high variability of a single linkage disequilibrium. Second, Hill and Weir assumed that the linkage disequilibrium was caused by random genetic drift, whereas, in the present study, the disequilibrium was initially created by the occurrence of a mutation in a single haplotype, which is probably less variable. Hence, the present population was not (yet) in equilibrium. Kaplan et al. (1995) investigated linkage disequilibrium mapping of disease genes and concluded that this method can be very precise in locating a gene in nonequilibrium populations.

There are three factors that limit the accuracy with which the method can map a QTL. First, if there are too few animals, the haplotype effects are estimated too poorly. Second, if chromosome segments surrounding the QTL are too big, the linkage disequilibrium will be uniform throughout the marker region. In that case we will be able to detect the presence of the QTL but not position it accurately. Third, if the chromosome segments surrounding the QTL are smaller than the marker brackets, then we may not detect any disequilibrium between the markers and the QTL. The chromosome segments will be large if either NG or Ne is small. If NG is small, there will be too few opportunities for recombination to break up the chromosome segment that contained the original QTL mutation. If Ne is small, then all existing copies of a QTL allele are likely to trace back to a common ancestor a small number of generations ago, again limiting the opportunities for recombination.

The situation with a population at a size of 1000 individuals for 1000 generations and a size of 100 individuals for the last 30 generations (Table 7) showed a high fraction of correct marker brackets at a bracket size of 0.25 cM while this fraction reduced when the bracket size became bigger. This suggests that the linkage disequilibrium between the markers, which were distanced at 1 cM, eroded during the 1000 generations at the high population size, while the 30 generations at a population size of 100 were not long enough to regenerate these linkage disequilibria.

The initial linkage disequilibrium due to the new mutation breaks down at a rate of (1 − r) per generation, where r is the recombination between the marker and the QTL. When the distance between the marker and the QTL is 1 or 0.25 cM, then 37 or 78%, respectively, of the disequilibrium remains after 100 generations. After 200 generations these figures are 14 and 60%, respectively. Hence, after many generations the initial disequilibrium disappears unless the marker is very close to the QTL. However, in Table 6, marker brackets of 1 cM still achieved a reasonable accuracy even when the QTL mutation occurred 200 generations ago. This suggests that the disequilibrium due to random drift was also used by the method to estimate the position of the QTL. Hence, disequilibrium due to finite size is building up as that caused by the initial mutation is eroding. Goddard (1991) showed that finite population size alone could generate substantial linkage disequilibrium between a QTL and a marker haplotype and this may explain why the method is not highly sensitive to the number of generations since the mutation.

Conditions affecting mapping precision: The results in Tables 6 and 7 showed that it is difficult to predict which population size should be used for the mapping of a QTL, especially since the age of the mutation is generally unknown. The combination of 100 generations at an effective size of 100 seemed to result in good QTL position estimates for the marker bracket sizes that were investigated here. However, other effective population sizes and numbers of generations since the mutation did not reduce the precision markedly, except that 10 generations seem to be too short to yield good QTL position estimates with linkage disequilibrium mapping. Mutations that occurred just 10 generations ago might not be so likely, but the QTL may have been introduced into the population by migration of an individual. Multiple mutations at the QTL can also generate substantial linkage disequilibrium in multimarker haplotypes because every mutation will be introduced in a more or less unique haplotype, and its effect will cause a covariance pattern as indicated by the Hp matrix. Hence, the precision of the multimarker linkage disequilibrium mapping method will hardly be affected by multiple QTL mutations if the haplotypes surrounding the mutations are sufficiently unique.

Mutations at the markers were ignored here, but their effect is similar to that of recombination between the markers; namely, the haplotype that is associated with the QTL is altered. Since mutations occur much less frequently than recombination, it seems sufficient to account for only the recombination between the markers. Also, the statistical model for the estimation of the haplotype effects assumes that the effects of the paternal and maternal haplotypes are additive. This does not imply that the effect of the QTL alleles needs to be additive, but only the average effect of the haplotypes across all genotypes will be used by the analysis. Strong dominance deviations may reduce or increase this average effect and can therefore affect the power of the analysis. This reduction or increase of the average effects depends also on the frequencies of the QTL alleles (Falconer 1989).

Another effect that limits the precision of estimates of QTL position is the calculation of Hp. If the distance between the markers is small, the fraction of the replicates where recombination between two particular markers occurs also becomes small, which implies that many replicates are needed. In the present simulations 100,000 replicated simulations were sufficient to estimate IBD probabilities for marker bracket sizes down to 0.25 cM. Although modern computers are fast, the number of replicated simulations needed to calculate Hp may limit the minimum distance between the markers. Accurate deterministic methods to assess Hp, which may be based on coalescence processes (Hudson 1985, 1993), will be useful to improve the precision of the present fine mapping technique. On the other hand, the estimation of Hp by simulation is probably more flexible than a deterministic prediction and thus can be easily adjusted to any known (changes in) structure of the real population.

Tables 3 and 4 show that the accuracy of the estimates of QTL positions is only moderately increased when there are more records available in a situation with equal effective population size. The benefit from increasing the number of animals would probably increase if the size of the QTL effect relative to the error variance was reduced. However, the results show that in some cases the structure of the population is the limiting factor in the precision of position estimation, e.g., too few recombinations since the QTL occurred.

A multi-stage approach to QTL mapping: A practical method for estimating the position of a QTL would be to start first with a genome-wide scan for QTL using linkage analysis methods. This would result in a region of, say, 20 cM, which most likely contains the QTL. Next, this region will be covered with markers that are 1 cM apart, and the present linkage disequilibrium mapping method will probably narrow the position of the QTL down to 3 cM. It will also be useful to test for the existence of the QTL using the likelihood-ratio test to confirm that there is a QTL in this region. This linkage disequilibrium-based test uses different information than the tests that were performed in the linkage analysis. In the following step, this 3-cM region is covered by markers that are 0.25 cM apart, and the position of the QTL is narrowed down to a 0.75-cM region. Within this region, even smaller marker brackets may be used to narrow down further the position of the QTL. Finally, comparative mapping (O'Brienet al. 1993), evolutionary tree mapping (Templetonet al. 1987), and/or positional cloning (e.g., Wallaceet al. 1990) may be used to identify the QTL.

Footnotes

  • Communicating editor: C. Haley

  • Received August 24, 1999.
  • Accepted January 14, 2000.
  • Copyright © 2000 by the Genetics Society of America

LITERATURE CITED

  1. ↵
    1. Darvasi A.,
    2. Weinreb A.,
    3. Minke V.,
    4. Weller J. I.,
    5. Soller M.
    , 1993 Detecting marker-QTL linkage and estimating QTL gene effect and map location using a saturated genetic map. Genetics 134: 943–951.
    OpenUrlAbstract/FREE Full Text
  2. ↵
    1. Falconer D. S.
    , 1989 An Introduction to Quantitative Genetics. Longman Group, Essex, United Kingdom.
  3. ↵
    1. Fernando R. L.,
    2. Grossman M.
    , 1989 Marker-assisted selection using best linear unbiased prediction. Genet. Sel. Evol. 21: 246–477.
    OpenUrl
  4. ↵
    1. Goddard M. E.
    , 1991 Mapping genes for quantitative traits using linkage disequilibrium. Genet. Sel. Evol. 23 (Suppl. 1): 131s–134s.
    OpenUrlCrossRef
  5. ↵
    1. Hastbacka J.,
    2. De La Chapelle A.,
    3. Kaitila I.,
    4. Sistonen P.,
    5. Waever A.,
    6. et al.
    , 1992 Linkage disequilibrium mapping in isolated founder populations: diastrophic dysplasia in Finland. Nat. Genet. 2: 204–211.
    OpenUrlCrossRefPubMedWeb of Science
  6. ↵
    1. Henderson C. R.
    , 1984 Applications of Linear Models in Animal Breeding. University of Guelph, Guelph, Ontario, Canada.
  7. ↵
    1. Hill W. G.,
    2. Weir B. S.
    , 1994 Maximum-likelihood estimation of gene location by linkage disequilibrium. Am. J. Hum. Genet. 54: 705–714.
    OpenUrlPubMedWeb of Science
  8. ↵
    1. Hoeschele I.,
    2. Uimari P.,
    3. Grignola F. E.,
    4. Zhang Q.,
    5. Gage K. M.
    , 1997 Advances in statistical methods to map quantitative trait loci in outbred populations. Genetics 147: 1445–1457.
    OpenUrlAbstract/FREE Full Text
  9. ↵
    1. Houwen R. H. L.,
    2. Baharloo S.,
    3. Blankenship K.,
    4. Raeymaekers P.,
    5. Juyn J.,
    6. et al.
    , 1994 Genome screening by searching for shared segments: mapping a gene for benign recurrent intrahepatic cholestasis. Nat. Genet. 8: 380–386.
    OpenUrlCrossRefPubMedWeb of Science
  10. ↵
    1. Hudson R. R.
    , 1985 The sampling distribution of linkage disequilibrium under an infinite alleles model without selection. Genetics 109: 611–631.
    OpenUrlAbstract/FREE Full Text
  11. ↵
    1. Hudson R. R.
    , 1993 The how and why of generating gene genealogies, pp. 23–36 in Mechanics of Molecular Evolution, edited by Takahata N., Clarck A. G.. Sinauer, Sunderland, MA.
  12. ↵
    1. Kaplan N. L.,
    2. Hill W. G.,
    3. Weir B. S.
    , 1995 Likelihood methods for locating disease genes in nonequilibrium populations. Am. J. Hum. Genet. 56: 18–32.
    OpenUrlPubMedWeb of Science
  13. ↵
    1. Long A. D.,
    2. Mullaney S. L.,
    3. Reid L. A.,
    4. Fry J. D.,
    5. Langley C. H.,
    6. et al.
    , 1995 High resolution mapping of genetic factors affecting abdominal bristle number in Drosophila melanogaster. Genetics 139: 1273–1291.
    OpenUrlAbstract/FREE Full Text
  14. ↵
    1. Maccluer J. W.,
    2. Vandeberg J. L.,
    3. Read B.,
    4. Ryder O. A.
    , 1986 Pedigree analysis by computer simulation. Zoo Biol. 5: 147–160.
  15. ↵
    1. O'Brien S. J.,
    2. Womack J. E.,
    3. Lyons L. A.,
    4. Moore K. J.,
    5. Jenkins N. A.,
    6. et al.
    , 1993 Anchored reference loci for comparative genome mapping in mammals. Nat. Genet. 3: 103–112.
    OpenUrlCrossRefPubMedWeb of Science
  16. ↵
    1. Patterson H. D.,
    2. Thompson R.
    , 1971 Recovery of inter-block information when block sizes are equal. Biometrika 58: 545–554.
    OpenUrlAbstract/FREE Full Text
  17. ↵
    1. Pritchard C.,
    2. Cox D. R.,
    3. Myers R. M.
    , 1991 The end in sight for Huntington disease? Am. J. Hum. Genet. 49: 1–6.
    OpenUrlPubMedWeb of Science
  18. ↵
    1. Rabinowitz D.
    , 1997 A transmission disequilibrium test for quantitative trait loci. Hum. Hered. 47: 342–350.
    OpenUrlCrossRefPubMedWeb of Science
  19. ↵
    1. Templeton A. R.,
    2. Boerwinkle E.,
    3. Sing C. F.
    , 1987 A cladistic analysis of phenotypic associations with haplotypes inferred from restriction endonuclease mapping. I. Basic theory and an analysis of alcohol dehydrogenase activity in Drosophila. Genetics 117: 343–351.
    OpenUrlAbstract/FREE Full Text
  20. ↵
    1. Terwilliger J. D.
    , 1995 A powerful likelihood method for the analysis of linkage disequilibrium between trait loci and one or more polymorphic marker loci. Am. J. Hum. Genet. 56: 777–787.
    OpenUrlPubMedWeb of Science
  21. ↵
    1. Wallace M. R.,
    2. Marchuck D. A.,
    3. Anderson L. B.,
    4. Letcher R.,
    5. Odeh H. M.,
    6. et al.
    , 1990 Type 1 neurofibromatosis gene: identification of a large transcript disrupted in three NF1 patients. Science 249: 181–186.
    OpenUrlAbstract/FREE Full Text
View Abstract
Previous ArticleNext Article
Back to top

PUBLICATION INFORMATION

Volume 155 Issue 1, May 2000

Genetics: 155 (1)

ARTICLE CLASSIFICATION

INVESTIGATIONS
View this article with LENS
Email

Thank you for sharing this Genetics article.

NOTE: We request your email address only to inform the recipient that it was you who recommended this article, and that it is not junk mail. We do not retain these email addresses.

Enter multiple addresses on separate lines or separate them with commas.
Fine Mapping of Quantitative Trait Loci Using Linkage Disequilibria With Closely Linked Marker Loci
(Your Name) has forwarded a page to you from Genetics
(Your Name) thought you would be interested in this article in Genetics.
Print
Alerts
Enter your email below to set up alert notifications for new article, or to manage your existing alerts.
SIGN UP OR SIGN IN WITH YOUR EMAIL
View PDF
Share

Fine Mapping of Quantitative Trait Loci Using Linkage Disequilibria With Closely Linked Marker Loci

T. H. E. Meuwissen and M. E. Goddard
Genetics May 1, 2000 vol. 155 no. 1 421-430
T. H. E. Meuwissen
*Research Institute of Animal Science and Health, Box 65, 8200 AB Lelystad, The Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: t.h.e.meuwissen@id.dlo.nl
M. E. Goddard
†Institute of Land and Food Resources, University of Melbourne, Parkville, Victoria 3052, Australia
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
del.icio.us logo Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
Citation

Fine Mapping of Quantitative Trait Loci Using Linkage Disequilibria With Closely Linked Marker Loci

T. H. E. Meuwissen and M. E. Goddard
Genetics May 1, 2000 vol. 155 no. 1 421-430
T. H. E. Meuwissen
*Research Institute of Animal Science and Health, Box 65, 8200 AB Lelystad, The Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: t.h.e.meuwissen@id.dlo.nl
M. E. Goddard
†Institute of Land and Food Resources, University of Melbourne, Parkville, Victoria 3052, Australia
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero

Related Articles

Cited By

More in this TOC Section

  • Comparative Genomics and Transcriptomics To Analyze Fruiting Body Development in Filamentous Ascomycetes
  • The Fate of Deleterious Variants in a Barley Genomic Prediction Population
  • Historical Introgressions from a Wild Relative of Modern Cassava Improved Important Traits and May Be Under Balancing Selection
Show more Investigations
  • Top
  • Article
    • Abstract
    • METHODS
    • RESULTS
    • DISCUSSION
    • Footnotes
    • LITERATURE CITED
  • Figures & Data
  • Info & Metrics

GSA

The Genetics Society of America (GSA), founded in 1931, is the professional membership organization for scientific researchers and educators in the field of genetics. Our members work to advance knowledge in the basic mechanisms of inheritance, from the molecular to the population level.

Online ISSN: 1943-2631

  • For Authors
  • For Reviewers
  • For Subscribers
  • Submit a Manuscript
  • Editorial Board
  • Press Releases

SPPA Logo

GET CONNECTED

RSS  Subscribe with RSS.

email  Subscribe via email. Sign up to receive alert notifications of new articles.

  • Facebook
  • Twitter
  • YouTube
  • LinkedIn
  • Google Plus

Copyright © 2019 by the Genetics Society of America

  • About GENETICS
  • Terms of use
  • Advertising
  • Permissions
  • Contact us
  • International access