Skip to main content
  • Facebook
  • Twitter
  • YouTube
  • LinkedIn
  • Google Plus
  • Other GSA Resources
    • Genetics Society of America
    • G3: Genes | Genomes | Genetics
    • Genes to Genomes: The GSA Blog
    • GSA Conferences
    • GeneticsCareers.org
  • Log in
Genetics

Main menu

  • HOME
  • ISSUES
    • Current Issue
    • Early Online
    • Archive
  • ABOUT
    • About the journal
    • Why publish with us?
    • Editorial board
    • Early Career Reviewers
    • Contact us
  • SERIES
    • All Series
    • Genomic Prediction
    • Multiparental Populations
    • FlyBook
    • WormBook
    • YeastBook
  • ARTICLE TYPES
    • About Article Types
    • Commentaries
    • Editorials
    • GSA Honors and Awards
    • Methods, Technology & Resources
    • Perspectives
    • Primers
    • Reviews
    • Toolbox Reviews
  • PUBLISH & REVIEW
    • Scope & publication policies
    • Submission & review process
    • Article types
    • Prepare your manuscript
    • Submit your manuscript
    • After acceptance
    • Guidelines for reviewers
  • SUBSCRIBE
    • Why subscribe?
    • For institutions
    • For individuals
    • Email alerts
    • RSS feeds
  • Other GSA Resources
    • Genetics Society of America
    • G3: Genes | Genomes | Genetics
    • Genes to Genomes: The GSA Blog
    • GSA Conferences
    • GeneticsCareers.org

User menu

  • Log out

Search

  • Advanced search
Genetics

Advanced Search

  • HOME
  • ISSUES
    • Current Issue
    • Early Online
    • Archive
  • ABOUT
    • About the journal
    • Why publish with us?
    • Editorial board
    • Early Career Reviewers
    • Contact us
  • SERIES
    • All Series
    • Genomic Prediction
    • Multiparental Populations
    • FlyBook
    • WormBook
    • YeastBook
  • ARTICLE TYPES
    • About Article Types
    • Commentaries
    • Editorials
    • GSA Honors and Awards
    • Methods, Technology & Resources
    • Perspectives
    • Primers
    • Reviews
    • Toolbox Reviews
  • PUBLISH & REVIEW
    • Scope & publication policies
    • Submission & review process
    • Article types
    • Prepare your manuscript
    • Submit your manuscript
    • After acceptance
    • Guidelines for reviewers
  • SUBSCRIBE
    • Why subscribe?
    • For institutions
    • For individuals
    • Email alerts
    • RSS feeds
Previous ArticleNext Article

Bayesian Mapping of Multiple Quantitative Trait Loci From Incomplete Outbred Offspring Data

Mikko J. Sillanpää and Elja Arjas
Genetics April 1, 1999 vol. 151 no. 4 1605-1619
Mikko J. Sillanpää
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Elja Arjas
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Article
  • Figures & Data
  • Info & Metrics
Loading

Abstract

A general fine-scale Bayesian quantitative trait locus (QTL) mapping method for outcrossing species is presented. It is suitable for an analysis of complete and incomplete data from experimental designs of F2 families or backcrosses. The amount of genotyping of parents and grandparents is optional, as well as the assumption that the QTL alleles in the crossed lines are fixed. Grandparental origin indicators are used, but without forgetting the original genotype or allelic origin information. The method treats the number of QTL in the analyzed chromosome as a random variable and allows some QTL effects from other chromosomes to be taken into account in a composite interval mapping manner. A block-update of ordered genotypes (haplotypes) of the whole family is sampled once in each marker locus during every round of the Markov Chain Monte Carlo algorithm used in the numerical estimation. As a byproduct, the method gives the posterior distributions for linkage phases in the family and therefore it can also be used as a haplotyping algorithm. The Bayesian method is tested and compared with two frequentist methods using simulated data sets, considering two different parental crosses and three different levels of available parental information. The method is implemented as a software package and is freely available under the name Multimapper/outbred at URL http://www.rni.helsinki.fi/~mjs/.

INBRED line cross designs are routinely used for quantitative trait locus (QTL) mapping in experimental organisms, because then full heterozygosity and perfect coupling between alleles in the QTL and in nearby marker loci are found in all F1 individuals. Furthermore, the biallelic nature of the design suits well the tradition in genetics, where QTL are treated as biallelic and all different heterozygous QTL effects are considered jointly as a dominance effect. Depending on the organism, an attempt to produce inbred lines is not always practical or even possible (Haleyet al. 1994); then methods developed for outbred designs are to be used.

Presently, there are QTL mapping methods suitable for the analysis of outbred populations (for a review see Hoescheleet al. 1997) as well as general pedigrees (e.g., Heath 1997). However, an application of general pedigree analysis methods tends to be statistically inefficient and might actually not be possible for data arising from controlled outcrossing experiments. Therefore “design-specific” methods are needed. Their main advantages over general purpose pedigree methods are (1) incorporation of design-specific properties (such as a control of maximum number of possible QTL genotypes) into the analysis; and (2) background variation can be controlled by marker covariates, instead of using polygenic components or unlinked QTL.

When interval mapping, where a putative QTL is placed somewhere between markers, is applied to outbred offspring data, linkage phases (haplotypes) of parents must be considered. They are needed to determine whether paternally (or maternally) derived alleles at two neighboring loci are of the same grandparental origin or not. For a comparison, note that the grandparental line origin of alleles found in inbred line-cross offspring is automatically known in all marker positions.

Haley et al. (1994) presented a QTL mapping method for outbred line-cross data (F2) concerning two divergent breeding populations, where a fixation of different influential QTL alleles in different grandparental lines was assumed. Their method requires genotyped grandparents to establish haplotypes for parents. They reduce the allelic space by using grandparental origin indicators instead of original marker alleles (see also Lander and Green 1987; Thompson 1994; Kruglyaket al. 1995; where a grandparental origin indicator is a binary digit in the inheritance vector). This work was extended to four segregating alleles by Knott et al. (1997). Somewhat earlier Maliepaard and Van Ooijen (1994) and Jansen (1996) presented more general algorithms for outcrossing experiments. Their methods assume neither a fixation of QTL in the crossed grandparental lines nor the availability of grandparental genotypes, but instead require known haplotypes for parents. The method of Jansen (1996) was recently generalized by Jansen et al. (1998) to more complex populations, where parental haplotypes were not required to be known in advance. In this method, the full genotypic and allelic origin information is considered in all founders, but only segregation indicators (i.e., grandparental origins) are used in nonfounders. No information is lost because the actual allelic forms for nonfounders can be traced from the pedigree by following each gene flow backward. Moreover, this treatment was shown to lead to more efficient mixing of the sampler than did methods in which the genotypes for nonfounders also are stored. The same idea was mentioned by Thompson (1994), and it was used by Sobel and Lange (1996) for descent graphs in pedigree analysis. Jansen et al. (1998) tested many different models.

Recently we presented a Bayesian QTL mapping method from incomplete inbred line-cross data (Sillanpää and Arjas 1998). This article also contains numerous references to other Bayesian works on QTL mapping. In this framework, the number of influential QTL in the analyzed chromosome is treated as an unobserved random variable, and then the algorithmic ideas of Green (1995) are applied to deal with the varying dimension of the parameter space. We used an idea similar to composite interval mapping (Jansen 1993; Zeng 1993, 1994; Jansen and Stam 1994; Kao and Zeng 1997) to account for the influence of some QTL in other chromosomes. We also advocated the use of the posterior QTL intensity as a new probabilistic summary measure for the inference. Now we generalize this approach to cover also backcross and F2 (full-sib) offspring data, or multiple F2 families from outcrossing experiments. In the method, the assumption concerning the fixation of QTL alleles in the crossed lines, as well as the degree in which the haplotypes or genotypes in parents or in grandparents are known, are optional. The assumption concerning fixation of QTL, together with the design (BC or F2), determines the maximal number of QTL genotypes that can segregate in a family structure. We assume that the offspring are at least partly genotyped and that corresponding quantitative phenotypic measurements from the trait are available. If the parents and/or grandparents are not genotyped, we use information from progeny to impute consistent multiple random haplotypes for the parents, following a Markov Chain Monte Carlo (MCMC) scheme. We also use grandparental origin indicators as in Haley et al. (1994), but the coding is redone for each haplotype arrangement (imputation) in parents. As a byproduct, this approach produces the linkage-phase distributions for each offspring and their parents. Therefore it can also be used for haplotyping, in data with at least partially genotyped parents (see discussion).

If the F2 family sizes in the studied plant or animal organism are relatively small, one has to combine information from several families. A complication arising from family pooling is that there will then typically be a large number of founders and therefore possible QTL alleles in the data. (Note also that the applicability of marker covariates needs to be considered.) To keep the maximum number of QTL genotypes low (≤4) in the combined data, one can assume one of the following alternatives: (1) Grandparents in each family have been drawn from the same two gene pools (lines), in which case they all represent two different QTL alleles in each trait locus. (Fixation of different QTL alleles in these two lines has been assumed.) (2) All families to be combined are related and share the same two grandparents, i.e., all parents belong to the same F1 generation. (Fixation of different QTL alleles in the two lines is again assumed.) (3) All families in the (combined) data are related and share the same four grandparents (numbered from 1 to 4) in such a way that one parent in each family is always progeny of grandparents 1 and 2 and the other parent is always progeny of grandparents 3 and 4; parents descending from grandparents 1 and 4, or 2 and 3, are excluded. Fixation of different QTL alleles in all four grandparental lines and that these lines show somewhat different phenotypic values has been assumed. If these assumptions are met, the resulting offspring population will have four different QTL alleles segregating in each trait locus.

In the following, we focus mainly on data from a onefamily experiment. Our model is described next, followed by the results from simulation experiments and a discussion. In two appendixes, parameter estimation and summary measures for statistical inference are considered.

MODEL

We use the notation of Sillanpää and Arjas (1998) for the following entities: phenotype vector (y), the number of offspring individuals (Nind), the number of QTL (Nqtl), QTL location vector (l), QTL genotype matrix (χ), the number of background controls (Nbc), incomplete and complete background control genotype information including parents (Xo and X *o), the number of QTL genotypes (Ngen), QTL genotypic effect (regression coefficient) vectors (b1, b2,..., bNqtl), genotypic effects for background controls (C), residual variance (σ2), fixed marker map m, and consistency between complete and incomplete information (A* ∼ A).

Let I = (Ii) be the indicator vector, where element Ii = 1{yi observed} takes the value one or zero depending on whether yi is observed or not. Let H* and H be the corresponding complete and incomplete (observed) haplotype information (genotype + allelic origin information:paternal/maternal) in the marker positions. In each case, we indicate the split between maternally and paternally inherited haplotypes by writing H* = (H*F, H*M) and H = (HF, HM). Here H* and H are taken to be (Nind + 2) × N matrices, where N is the number of markers in the considered chromosome. Note that incomplete haplotype information often covers complete genotypic information but not the allelic origin.

In the chosen experimental design, let α = (α1,..., αNgen) be the vector containing all possible QTL genotypes at any locus, so that their actual allelic forms are unknown. These QTL genotypes correspond to combinations of QTL alleles that were present in the crossed grandparents (founders) and that were transmitted to the F parents. Let γk=(γk1,…,γkNgenbc(k)) be the vector containing all possible background control genotypes and let Ngenbc(k) be their number (maximally four) at the kth background control. Let B = (Bi), where Bi is a vector of covariates (e.g., age, sex, or treatment) for offspring i. Let ρ be a vector of regression coefficients of these covariates (including also class means if some covariate is a classification variable). In case there is no individual control, we let all Bi reduce to Bi = 1 and ρ to a common regression intercept ρ = a. Here we consider only the case where no covariate values are missing.

We consider the following composite interval mapping (Kao and Zeng 1997) model for y: yi=ρ′Bi+∑q=1Nqtl∑j=1Ngenbqj1{xqi=αj}+∑k=1Nbc∑j=1Ngenbc(k)ckj1{xik∗=γkj}+ei. (1) Here 1{xqi=αj} and 1{Xik∗=γkj} are indicator variables (cf. Sillanpää and Arjas 1998). For contrast parameterization, we can impose constraints bq1 = 0 and ck1 = 0 here for all values of q and k (see appendix b).

We use the shorthand notation δ = (b1,..., bNqtl, σ2, ρ, C) and θ=(δ,χ,1,H∗,Xo∗,Nqtl) . Under natural conditional independence assumptions (cf. Sillanpää and Arjas 1998) the joint prior density function for θ can be presented in the product form p(θ∣m)=p(H∗∣m)p(Nqtl∣m)p(l∣m,Nqtl)×p(χ∣H∗,l,m,Nqtl)p(δ∣Nqtl)p(Xo∗). (2) The posterior density of θ is then proportional to the right-hand side of p(θ∣y,I,H,Xo,m)∝p(θ∣m)p(y,I,H,Xo∣θ,m)=p(θ∣m)p(y∣θ,I,m)1{H∗∼H,Xo∗∼Xo}, (3) where p(y | θ, I, m) is the likelihood function (normal density) constructed from those independent residuals ei in (1) in which the observation indicator Ii = 1. Here the complete background control genotypes are determined uniquely from Xo∗ .

The ingredients of the prior density (2) are specified as follows. Denote complete haplotype information at the marker positions of the ith offspring by Hi∗ , and similarly that of male and female parents by HM∗ and HF∗ . We consider the simple product form prior for the complete haplotypes in the family: p(H∗∣m)=p(HF∗∣m)p(HM∗∣m)∏i=1Nindp(Hi∗∣HF∗,HM∗,m) . Furthermore, for each offspring i we can further factorize the prior and compute it as the product p(Hi∗∣HF∗,HM∗,m)=p(Hi∗F∣HF∗,m)p(Hi∗M∣HM∗,m)1{Hi∗F∼HF∗,Hi∗M∼HM∗}=p(G1,iF(H∗))∏s=1N−1[p(Gs+1,iF(H∗)∣Gs,iF(H∗))]×p(G1,iM(H∗))∏s=1N−1[p(Gs+1,iM(H∗)∣Gs,iM(H∗))]×1{Hi∗F∼HF∗,Hi∗M∼HM∗}. (4) Here, Gs,iF(x)(Gs,iM(x)) is a function of haplotype information x, and it determines the grandparental origin of the maternal (paternal) allele of individual i at marker locus s. The probabilities p(G1,iM(H∗)−F)=p(G1,iM(H∗)=M)=p(G1,iF(H∗)=F)=p(G1,iF(H∗)=M)=12 are the prior probabilities of different grandparental origins under Mendelian segregation for paternally and maternally inherited alleles at marker locus 1 in offspring i. When only maternally inherited alleles of offspring i are considered, then p(Gs+1,iF(H∗)∣Gs,iF(H∗))=rs,s+11{Gs+1,iF(H∗)≠Gs,iF(H∗)}+(1−rs,s+1)1{Gs+1,iF(H∗)=Gs,iF(H∗)} (5) is the conditional probability that in individual i the marker at position s + 1 is of grandparental origin Gs+1,iF(H∗) provided that the marker at position s has grandparental origin Gs,iF(H∗) . Here rs,s+1 is the recombination fraction between the markers s and s + 1. The structure of p(Gs+1,iM(H∗)∣Gs,iM(H∗)) derived for paternally inherited alleles is similar.

Let the complete background control marker information in parents F and M be Xo,F∗ and Xo,M∗ , respectively. We assume the following prior form for background control genotypes in the other chromosomes: p(Xo∗)=p(Xo,F∗)p(Xo,M∗)∏i=1Nindp(Xo,i∗∣Xo,F∗,Xo,M∗) , where p(Xo,i∗∣Xo,F∗,Xo,M∗)∝p(Xo,i∗)1{Xo,i∗∼Xo,F∗,Xo,i∗∼Xo,M∗} . We also assume marker independence and that all (consistent) genotypes are a priori equally likely.

The prior distribution of the number of QTL is assumed to be truncated Poisson (see Sillanpää and Arjas 1998). For all QTL locations, we assume the uniform prior distribution on the considered chromosome. The prior for QTL genotype coefficients is assumed to be normal with zero mean and zero correlation, the variance being a hyperparameter specified by the analyst.

As in Sillanpää and Arjas (1998), we use the term object to represent any marker or QTL in the considered chromosome and the term flanking object (of the QTL q) to represent any combination of two entities [markers and/or QTL: 1,..., (q – 1)] having their loci closest to the QTL q. Now, denote by Hi,LR∗q the complete (grandparental origin-coded) haplotype of the left and the right flanking object of the qth QTL in offspring i. We denote by rq = (rq1, rq2)′ the resulting recombination fractions between the QTL at lq and the corresponding flanking object (after an application of Haldane's map function). Here we assume that the recombination rates in male and female meioses are the same (even though an extension to different recombination fractions would be straightforward; see Haleyet al. 1994). The prior distribution of ordered genotypes of QTL is now assumed to have the product form p(χ∣H∗,l,m,Nqtl)=∏q=1Nqtlp(xq∣x1,…,xq−1,H∗,l,m)=∏q=1Nqtl∏i=1Nindp(xqi∣Hi,LR∗q,rq) . Note that QTL are not automatically (conditionally) independent from each other (see Sillanpää and Arjas 1998).

View this table:
  • View inline
  • View popup
TABLE 1

The number of possible alleles and genotypes at a QTL in outbred linecross designs (backcross and F2 intercross with and without assumed fixation of QTL alleles in different lines)

Figure 1.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 1.

—Genotypes of two marker positions in an F2 full-sib family with three offspring (an example). Haplotypes of all individuals are known. Arrows indicate how alleles are coded with respect to their grandparental “line” origins. The paternal (maternal) haplotype of offspring becomes a sequence of lines 1 and 2 (3 and 4). For an illustration, suppose that there is a QTL (q) between the markers and that each haplotype of the parents contains a different QTL allele (also numbered from 1 to 4) in that locus. The four QTL genotypes are then combinations of parental chromosomes and they show the following correspondence here: 13 = (A–C, E–F), 14 = (A–C, A–G), 23 = (B–D, E–F), and 24 = (B–D, A–G). Denote the recombination fraction between the flanking markers by rLR, and that between the QTL and the left (right) flanking marker by rq1 (rq2). By applying Equation 6, the probability of QTL genotype 13 occurring in offspring 1 is given by ((1 – rq1)rq2 ×rq1rq2)/(rLR(1 – rLR)), that of 14 by ((1 – rq1)rq2 × (1 – rq1)(1 – rq2))/(rLR(1 – rLR)), that of 23 by (rq1(1 – rq2) × rq1rq2)/(rLR (1 – rLR)), and that of 24 by (rq1(1 – rq2) × (1 – rq1) (1 – rq2))/(rLR(1 – rLR)).

The QTL analysis of the offspring is done in terms of parental haplotypes. The numbers of possible QTL alleles and QTL genotypes in BC and F2 designs are found in Table 1. Given the QTL genotype vector α = (α1,..., αNgen), the prior probabilities for s = 1,..., Ngen are calculated from the equation p(xqi=αs∣Hi,LR∗q,rq)=p(xqi=αs∣HiL∗q,rq)×p(Hi,R∗q∣xqi=αs,rq)p(HiR∗q∣HiL∗q,rq) (6) Here HiL∗q=(GL(q),iF(H∗),GL(q),iM(H∗)) , and HiR∗q=(GR(q),iF(H∗),GR(q),iM(H∗)) are the left- and right-ordered flanking object (QTL or marker) genotypes in the grandparental origin form. Haplotype coding and the evaluation of the probability in (6) in F2 and backcross designs are illustrated in Figures 1 and 2.

Figure 2.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 2.

—Genotypes at two marker positions in a backcross family with one offspring (an example). Haplotypes of all individuals are known. Arrows indicate how alleles are coded with respect to their grandparental origins. Paternal parent of backcross progenies is also one of the grandparents, i.e., code 3 means {1 or 2}. In case fixed (grandparental) lines are assumed, codes 2 and 3 can be replaced by 1, and thus paternal meioses are not considered in the QTL genotype probability calculations at all, i.e., if an offspring chromosome inherited from the nonfounder parent, here the maternal parent, is of type 1–2, then for a QTL between the markers the genotype probabilities of types 1X and 2X are given by (1 – rq1)rq2/rLR and rq1(1 – rq2)/rLR, respectively; here X indicates the other (not considered) allele.

SIMULATION ANALYSIS

To test the performance of this method, an outcrossing F2 population consisting of Nind = 200 offspring was generated by a simulation program provided by J. W. Van Ooijen (Centre for Biometry Wageningen, CPRO-DLO, The Netherlands). We considered two 100-cM long chromosomes, both having 11 evenly spaced markers, at every 10 cM. The simulated trait had a genetic (QTL) variance 4.47 and a phenotypic variance 6.35, resulting in heritability 0.7. Two sets of parental crosses were generated: In the first set the parental mating type was fully informative (AB × CD) at all marker loci, and in the second set the degree of informativeness, as well as the corresponding linkage phases, varied from locus to locus. The simulated true underlying parental cross in the second set is shown in Figure 3; it is underlying in the sense that after the simulation this information was “forgotten” and not used in the Bayesian analyses (as explained below). The genotype-specific phenotype effects and the locations of the three simulated QTL can be found from Table 2. All haplotypic assignments in the offspring were assumed unknown. In the statistical analyses, three specifications regarding the amount of parental information were considered: (1) All genotypes and haplotypic assignments in parents were assumed known; (2) all genotypes were assumed known but their phases unknown in parents; and (3) all parental and grandparental marker information was assumed unknown (missing). The performance of our method was compared to that of “all-markers” interval mapping (IM; Maliepaard and Van Ooijen 1994) and to multiple QTL mapping with two background controls [MQM/02; both implemented in the MAPQTL program of Van Ooijen and Maliepaard 1996; MAPQTL (tm) version 3.0; CPRO-DLO, Wageningen, The Netherlands]. Note that in the IM and MQM methods the genotypes and the linkage phases in parents must be known.

Figure 3.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 3.

—Parental cross (maternal parent × paternal parent) in two simulated chromosomes.

In addition, the simulated data in which each QTL had four alleles were analyzed (in cases 1 and 3), having incorrectly assumed fixed grandparental lines (where grandfathers were assumed to originate from the same line). This was done to see how this erroneous assumption influences the results.

View this table:
  • View inline
  • View popup
TABLE 2

The locations and the individual phenotypic effects (eff{·}) of different heterozygous genotypes of the three simulated QTL

View this table:
  • View inline
  • View popup
TABLE 3

The ranges R(·) of the proposal distributions for different parameters, the corresponding proposal probabilities, the numbers of iterations, and the indices of the background control markers from other chromosomes, which were used in the simulation analyses

In all Bayesian analyses described here, our C-program implementing a Metropolis-Hastings chain was run 5,000,000 cycles in a Pentium II/266MHz computer. No values were deleted because of burn-in, but the chain was thinned so that only every fifth iteration was saved, resulting in 1,000,000 sampled values for each parameter. After a preprocessing stage (see appendix a), background controls were chosen. When analyzing a real data set, they can be determined by a single marker regression or by performing several analyses. Here, however, we simply chose marker 3 in chromosome 1 and marker 4 in chromosome 2 as background controls. Very likely, a few reanalyses would have led to the same conclusion. As no covariates (age, sex, etc.) were used, there was a common intercept (ρ = a and Bi = 1 for all i). The running times, in circumstances where there was practically no other load in the computer, varied around 9 hr. The initial value for the number of QTL was three, and the corresponding locations were 20.0 cM, 50.0 cM, and 80.0 cM. The Poisson mean (hyperparameter) was set to λ = 2 and the maximum number of QTL (in the analyzed chromosome) to three. The residual standard deviation was chosen to be uniform over the range [0.0, 2.55], the right endpoint being equal to the phenotypic standard deviation estimate from the data. The prior of the intercept was taken to be uniform on [–13, 13], those of the QTL genotypic regression coefficients were independent normal distributions with mean zero and variance 100, and the prior of the background control genotypic regression coefficients was uniform on [–13, 13]. Finally, the prior of the QTL locations was uniform over [0, 100]. The control parameter values used in the final analyses are given in Table 3. The proposal distribution for the genotypic effects (coefficients) was chosen to be N(0, 0.5) in cases where the addition of a new QTL to the model was proposed.

Figure 4.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 4.

—Results from the estimation when all markers are fully informative. (Top) Graphs of the posterior QTL intensity in chromosome 1 when all parental genotypic information is known (left), and when only parental genotypes (but not linkage phases) are known (right). Here the histogram corresponds to the (approximate) posterior QTL intensity over the chromosome, with binlength 1 cM. On the top left, the results from interval mapping (IM, solid line) and multiple QTL mapping with two background controls (MQM/02, broken line) are shown. The left (right) y-axis corresponds to the posterior QTL intensity (LOD score). Note the logarithmic scale of the LOD score. From the remaining eight panels, four (on the left) describe phenotypic effect estimates of different genotypes in chromosome 1 with all parental genotypic information known, and four (on the right) those in chromosome 1 when only parental genotypes are available. The solid line is the pointwise posterior median, and the gray lines the 2.5 and 97.5% quantiles of the posterior distribution, of the phenotypic effect of a putative QTL. (*) The simulated true QTL. (o) The labeling corresponding to best fit of the phenotypic effects of the QTL to their estimates. Shaded regions are suggested credible intervals for QTL localization. The estimated phenotypic effects are reliable only in these regions.

Figure 5.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 5.

—Results from the estimation when marker information varies from marker to marker. (Top) Graphs of the posterior QTL intensity in chromosome 1 when all parental genotypic information is known (left), and when only parental genotypes (but not linkage phases) are known (right). In these panels, the histogram corresponds to the (approximate) posterior QTL intensity over the chromosome, with binlength 1 cM. (top left) The results from interval mapping (IM, solid line) and multiple QTL mapping with two background controls (MQM/02, broken line) are shown. The left (right) y-axis corresponds to the posterior QTL intensity (LOD score). Note the logarithmic scale of the LOD score. From the remaining eight panels, four (on the left) describe phenotypic effect estimates of different genotypes in chromosome 1 with all parental genotypic information known, and four (on the right) those in chromosome 1 when only parental genotypes are available. The solid line is the pointwise posterior median, and the gray lines the 2.5 and 97.5% quantiles of the posterior distribution, of the phenotypic effect of a putative QTL. (*) The simulated true QTL. (o) The labeling corresponding to best fit of the phenotypic effects of the QTL to their estimates. Shaded regions are suggested credible intervals for QTL localization. The estimated phenotypic effects are reliable only in these regions.

View this table:
  • View inline
  • View popup
TABLE 4

The posterior distribution of the number of QTL and its expectation in four analyses of chromosome 1

In the IM and MQM/02 analyses, walking speed was set to 0.5 cM, which is the smallest admissible value in the MAPQTL software. We used the same background controls in MQM/02 as in the Bayesian analyses.

RESULTS

The Bayesian posterior QTL intensities (see appendix b) in chromosome 1, when all parental information was present (case 1) or when parental linkage phases were absent (case 2), are shown in Figure 4 (top) when all markers are fully informative, and Figure 5 (top) when marker information varies from marker to marker. The curves consisting of the pointwise medians and the 2.5 and 97.5% quantiles of the posterior distribution of the phenotypic effects of the four genotypes, as functions of the putative QTL location, are shown in the same figures when all parental information is present (left), or when parental linkage phases are unknown (right). Approximate posterior distributions of the number of QTL in chromosome 1, obtained from these four different analyses, are shown in Table 4. The analyses where all parental information was absent (case 3) are not summarized in figures or in tables. This is because in theory case 3 is not fully identifiable, resulting in probabilistic summary measures (the posterior QTL intensity and the posterior distribution of the number of QTL) that are not unique. These problems are described and considered more in the discussion.

Table 5 gives a brief summary of our findings concerning the localization of QTL as suggested by the QTL intensities in Figures 4 and 5. The table makes direct reference to (approximate) posterior probabilities that a particular chromosomal region I of high QTL-intensity concentration contains a given number of QTL. Also the corresponding posterior expectations are calculated. The analyses support quite strongly the hypothesis of two QTL in chromosome 1.

In the analyses where all markers were fully informative (Figure 4, top), the two posterior QTL-intensity graphs (from cases 1 and 2) became nearly identical, regardless of whether parental linkage phase information was available or not. Both posterior QTL-intensity graphs were nicely concentrated around the left QTL at 32.7 cM. The graphs surrounding the right (weaker) QTL at 58 cM were much wider, and there was also some bias to the left. However, the true simulated QTL is still inside the regions [41 cM, 60 cM] and [41 cM, 63 cM] of elevated posterior QTL intensities. In this case (Figure 5, top left), the MQM analysis performed well in both QTL localizations in chromosome 1, but the IM analysis managed to localize only the left QTL. (Note that the posterior QTL-intensity graphs covering the regions [41 cM, 60 cM] and [41 cM, 63 cM] are multimodal. This is apparently the same phenomenon that is typical to the LOD-score curve at marker points: often there is more evidence, because of marker genotyping, against placing a putative QTL exactly at a marker locus than against placing it somewhere nearby.) The graph leaves somewhat uncertain why, of the two modes, the one that is farther away from the true simulated QTL at 58 cM ended up being higher in the first case.

View this table:
  • View inline
  • View popup
TABLE 5

Approximate (posterior) probability Embedded Image that a given chromosomal area Embedded Image contains at least one QTL, calculated for different areas Embedded Image in four analyses

It can be seen from Figure 5 that the nonconstant marker information analysis (case 1) results in high posterior QTL intensities surrounding both simulated QTL in chromosome 1. The IM and MQM analyses localized quite well the “left” QTL at 32.7 cM, but localization of the “right” QTL at 58 cM was poor with both methods. Somewhat surprisingly, in the Bayesian method, the left, more influential, QTL was not localized as accurately as the right QTL when linkage phases were available in parents. This may be a consequence of the fact that there is a highly informative marker very close to the right QTL, whereas this is not the case with the left QTL (see Table 6). As could be expected, the localization was somewhat less accurate when the parental genotypes or their linkage phases were not available.

View this table:
  • View inline
  • View popup
TABLE 6

Estimated informativeness of different marker loci of the simulated data set, with two degrees of parental genotype information

Consider next the estimation of the phenotypic effects, indicated by asterisks in Figures 4 and 5. As could be expected, the estimation was most successful in the case (displayed in Figure 4, left) where marker information was complete and where complete parental information was available. In the case of nonconstant marker information, but still assuming complete knowledge of the parental genotypes and linkage phases, the estimates were somewhat less accurate, with some of the true values being just outside the 95% credible boundaries (Figure 5, left). When analyzing real data, the true labeling [i.e., assigning of the QTL genotypes (13, 14, 23, 24) to the true grandparental alleles] of the phenotypic effects is almost always unknown (except for the QTL genes that have been positionally cloned). If parental genotype and/or linkage phase information are missing, the labeling of the genotypic effects according to the grandparental origin of the alleles also becomes nonunique in the simulated case. For this reason, when comparing the phenotypic effect estimates with the true values used in the simulation, we have to make sure that each estimate is matched correctly with a combination of two grandparental QTL alleles. Such reassignment of the QTL genotypes is indicated on the right-hand side of Figures 4 and 5 by circles. In chromosome 1, note that the genotype labels are not consistent with each other in case 2.

View this table:
  • View inline
  • View popup
TABLE 7

Point estimates and their support regions from different analyses

The performance of the IM and MQM methods in the estimation of the phenotypic coefficients of the putative QTL was not particularly good. Moreover, they do not provide confidence intervals for such point estimates. Confidence intervals would have to be determined separately, for example, by employing bootstrap techniques.

The point estimates of QTL locations and their support regions are summarized in Table 7 for four different analyses of chromosome 1.

When considering chromosome 2 (which was analyzed only in cases 1 and 3), the posterior QTL-intensity graphs (see Figure 6) were all nicely concentrated around the simulated true QTL at 41.2 cM, regardless of whether the markers were fully informative or not. Also, the IM and MQM methods were able to localize the QTL at 41.2 cM quite well.

The performance of the analyses (cases 1 and 3), when it was incorrectly assumed that the grandparental lines are fixed (pictures not shown), was quite poor in chromosome 1. The only exception was the case where all parental information was available and all markers were fully informative. Then the simulated QTL at 32.7 cM was localized rather well, and there was also some indication of QTL activity around the QTL at 58 cM. Assuming fixation in the situation where all markers were fully informative but where all parental information was absent, only the latter QTL resulted in a high (but broad) QTL-intensity concentration.

DISCUSSION

We have presented here a Bayesian procedure for mapping multiple QTL from incomplete outbred offspring data, thus extending our earlier method (Sillanpää and Arjas 1998) to a more general experimental design. A test version of the software (written in C language) is available at http://www.rni.helsinki.fi/~mjs/. The method is capable of handling situations where marker information from parents and/or grandparents is missing in varying degrees, as well as cases where some of the marker information from the offspring is unavailable. In contrast to Sillanpää and Arjas (1998), the present model was not overparameterized, because this did not seem to improve the mixing properties of the sampler.

Following Sillanpää and Arjas (1998), we use the posterior QTL intensity as a probabilistic summary measure for the localization of QTL. During the MCMC sampling, we do not restrict the order of the QTL in any way to label them. If order-based labeling is preferred, it can be established afterward from the MCMC realizations. This is an alternative to imposing constraints on the MCMC simulation as was done, e.g., in Satagopan et al. (1996), Satagopan and Yandell (1996), Richardson and Green (1997), and in Uimari and Hoeschele (1997).

We tested the performance of our method by using simulated F2 data sets (two informativeness levels), with varying degrees of parental marker information (three levels). It seems intuitively plausible, and it also became clear from our simulations, that the availability of parental linkage phase information is more important in the case where the markers are not fully informative. The situation where also a part of the offspring marker genotypes is missing was not considered in the test analyses.

Figure 6.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 6.

—The posterior QTL intensity in chromosome 2 when all parental haplotype information is known and the marker information varies from marker to marker. The results from interval mapping (IM, solid line) and multiple QTL mapping with two background controls (MQM/02, broken line) are shown. The histogram corresponds to the (approximate) posterior QTL intensity over the chromosome, with binlength 1 cM. The left (right) y-axis corresponds to the posterior QTL intensity (LOD score). (*) The simulated true QTL.

Standardization of the phenotypic data is recommended before applying Bayesian QTL mapping in practice. Then the same proposal windows and other control parameters can be applied to different data sets, instead of performing separate test trials for each. Another advantage is that the numerical accuracy may be improved because computers' ability to store floating point numbers is maximal when dealing with numbers between zero and one.

The marker covariates can be chosen by an application of simple linear regression at each marker (putative QTL) position, omitting individuals whose genotype at that locus was unknown (because data augmentation would need linkage phase information). In doing so, one should pay attention to how much information a potential covariate marker carries and how many missing values there are. If an interesting region does not contain any fully informative markers, one can often find two closely linked markers such that each marker alone is informative only with respect to one (and a different) parent.

Parental mating type is usually not constant in outcrossing experiments. Thus a systematic application of some index describing the proportion of informative meioses locally present in the data will help the analyst to quantify the possibility of localizing a QTL in different areas of the considered chromosome. One such measure is displayed in Table 6. The influence of marker informativeness (cf. marker polymorphism in Kruglyak 1997) can be seen clearly from our simulation analysis (Table 6 and Figures 4 and 5) where, in the uninformative areas, intensity graphs are much more spread out, or even biased in some direction.

The phenotypic effects can be estimated reliably only in chromosomal regions in which the posterior QTL intensity is sufficiently high. As an alternative to the locationwise posterior densities for phenotypic effects shown in Figures 4 and 5, the posterior density can be constructed as an expectation over several pointwise values (of phenotypic effects), each being associated with a putative QTL location within a particular region of high posterior QTL intensity. One such posterior density is shown in Figure 7.

There appear to be two possible philosophies about how the indexing of QTL genotypes should be interpreted. Considering QTL genotype 13, for example, the first interpretation says that lines 1 and 3 are names for the parental haplotypes. In this case the remaining uncertainty concerning linkage phase is in how the grandparental alleles are assigned to these haplotypes. According to the second interpretation, lines 1 and 3 are names for the grandparental lines (alleles), and uncertainty is in the assignment of the parental haplotypes to these lines. Obviously, these two ways of thinking lead to different results only when there is some uncertainty in the parental linkage phases. We have adopted here the first interpretation, even though the second one is in some sense more fundamental in the context of QTL mapping.

We stress that in situations where all parental information is missing (case 3) it will be problematic to assign unique grandparental origins to the estimated phenotype effects. In this situation, both parents have symmetric pairs of haplotype configurations that are a posteriori equally likely to be the correct underlying mating structure. As a consequence, under these circumstances the correspondence between QTL genotypes (13, 14, 23, and 24; cf. Figure 1) and their grandparental alleles is not unique. In our program, the assignment can actually change from one iteration cycle to another within one MCMC run, let alone in different runs. (In practice such changes are rare because of the strong local dependence between offspring and their parents and between adjacent loci.) In case 3, the parental phase reconstruction can actually change suddenly in some region of the chromosome to a symmetrical mating type. (This can only be checked from the simulated data.) Also the resulting posterior QTL-intensity curves can differ in such regions in different MCMC runs.

Figure 7.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 7.

—Approximate posterior distribution of the phenotypic effect of genotype 13 of the left QTL (chromosome 1), determined from the interval from 27 cM to 39 cM. All parental information was available and all markers were fully informative.

In cases 1 and 2, the very strong local dependency structure between parents and offspring and between adjacent loci will in practice prevent such phase transitions during the same MCMC run. Therefore, to avoid problems of this kind, we strongly recommend that at least one of the parents should be genotyped in several marker loci along the chromosome, as equidistant as is possible.

Locally, of course, if there is a fully informative (reference) marker, in case 3 we can also avoid such identifiability problems and averaging in estimation by fixing the assignments (segregation indicators) arbitrarily at the reference marker and then using the fact that, as long as the genetic distance from the marker is short, haplotype assignment can be made in a way that is with high probability consistent with that chosen at the reference marker locus. If this informative marker is near a contemplated QTL, this technique will also facilitate the estimation of the corresponding phenotypic effects, by keeping the four haplotypic assignments (and thus the corresponding QTL allele combinations) apart. A more negative aspect of this technique is that it works only locally, as simultaneous haplotype assignments at two or more marker positions might not agree with the true haplotype configuration. As a consequence, the estimation would need a new MCMC run for each such local assignment.

Acknowledgments

M.S. thanks Matti Taskinen for his advice in the programming work, and Päivi Hurme and Outi Savolainen for many useful discussions about the designs. We are grateful to Johan Van Ooijen for providing his simulation program, which was used to generate test data sets, and to Pekka Uimari and three anonymous referees for their constructive comments on the manuscript. This work was supported by a research grant (no. 38352) from the Academy of Finland, and by the ComBi Graduate School.

APPENDIX A: PREPROCESSING AND PARAMETER ESTIMATION

Before the actual statistical analysis, the data go through a preprocessing stage. In this process, we infer as much of the marker genotype and linkage phase information as is possible by direct logical deduction from known parts of the family structure. The deduction rules applied here (sequentially until there are no new assignments) are similar to the genotyping rules of Wijsman (1987). If grandparental genotypes are present, these deduction rules are first applied to the grandparents and parents, and then to the parents and offspring. In this process, sets of consistent parental mating types are determined for each marker (see below) and they are later repeatedly applied for the estimation.

Let us consider a multi-allelic marker in the chromosome to be analyzed, where, after the logical deductions, the genotypes of the parents are still unknown. Further, consider the genotype or complete haplotype imputations for both parents by updating them one at a time. In such situations, when the genotype of one parent has been imputed, some offspring genotypes may in fact uniquely determine the genotype of the other parent. To avoid this and to make the sampler work more efficiently, genotypes of parents are considered jointly, and they have to form a pair that is consistent with the offspring genotypes. Therefore, we go through all possible allele combinations in parents, one at a time at each marker locus, and check whether any of them is inconsistent with the offspring genotypes. All inconsistent pairs are eliminated. In a backcross, one needs to check an additional consistency in genotypes of related parents.

Sometimes a block-update is preferred over a single-site-update in MCMC applications to pedigrees (Kong 1991; Jansset al. 1995; Heath 1997; Jensen and Kong 1997). This is because a local dependence resulting from inheritance constraints can be so strong that the sampler in practice will be reducible during the available time if single-site updating dynamics are used (see Sheehan and Thomas 1993; Linet al. 1994; Lin 1995; Jensen and Sheehan 1998). Even biallelic loci can be practically reducible in some designs; see Janss et al. (1995). Single outbred family (F2 design) with many offspring is an extreme example of this kind of strong dependence structure. Therefore, haplotypes for the entire family are updated as one block (Step 2 below) at each marker. (In some cases, due to the dependency between adjacent loci, good mixing properties of the sampler may be difficult to achieve, even when block-updating is applied within one locus.)

In the following, we describe only those parts of the estimation algorithm that are different from those in Sillanpää and Arjas (1998; see also the graphical representation of the model therein):

Step 2. The following is repeated for each marker, j = 1,..., N: A new ordered genotype proposal (familyblock) at the jth position is constructed as follows:

  1. If one or both genotypes in parents are unknown, a consistent pair of genotypes is proposed. Each consistent genotype-pair is considered as equally likely.

  2. If unknown, their allelic origins are also proposed considering each configuration as equally likely.

  3. Incomplete offspring genotypes are completed by taking one allele (with equal transmission probabilities) from each parent. These transmissions simultaneously specify the allelic origins and the grandparental origins, which are then updated accordingly.

  4. Unknown allelic origins of known offspring genotypes are determined by using deduction. Origins of a homozygote can be assigned randomly, and an offspring allele not found in one parent must originate from the other parent. If some origins are left uncertain, they are proposed with equal probabilities.

  5. Grandparental origins are determined for offspring alleles having a heterozygous parent, but are randomly assigned for alleles inherited from homozygotes.

The family-block proposal H(j)∗new is accepted, separately for each marker j, with probability min{1,p(χ=χ(t)∣H∗(t,new(j)),l(t),m,Nqtl(t))×∏i=1Nindfj,i(H∗(t,j−1),H∗(t,new(j)))∕[p(χ=χ(t)∣H∗(t,j−1),l(t),m,Nqtl(t))×∏i=1Nindfj,i(H∗(t,j−1),H∗(t,j−1))]}. If the proposals for marker j are accepted, then H(j)∗(t)=H(j)∗new , and otherwise H(j)∗(t)=H(j)∗(t−1) . Here the notation H(j)∗(t) refers to the family-block haplotype in the jth marker in the tth round, while vector H∗(t,new(j))=(H(1)∗(t),…,H(j−1)∗(t)H(j)∗new,H(j+1)∗(t−1),…,H(N)∗(t−1)) , vector H∗(t,j)=(H(1)∗(t),…,H(j)∗(t)H(j+1)∗(t−1),…,H(N)∗(t−1)) , and function fj,i(H1∗,H2∗)={p(Gj+1,iF(H1∗)∣Gj,iF(H2∗))×p(Gj,iF(H2∗)∣Gj−1,iF(H1∗))×p(Gj+1,iM(H1∗)∣Gj,iM(H2∗))×p(Gj,iM(H2∗)∣Gj−1,iM(H1∗))} .

Step 3. Random walk proposals for regression parameters are generated in three different blocks: (1) mean, environmental covariates, and residual standard deviation; (2) all QTL genotypic coefficients; and (3) all background control coefficients. Denote by L1 (L2) the likelihood and by p1 (p2) the normal density prior for the QTL genotypic coefficients evaluated at the new (old) values. The proposals are accepted separately for each block with probability min{1, L1 × p1/(L2 × p2)}. If accepted, then δ(t) = δnew, and otherwise δ(t) =δ(t – 1). (In block 3, the acceptance ratio is evaluated separately for each background control.)

Step 4. Imputation for the missing background control markers is done as in Sillanpää and Arjas (1998) except for the following: A consistent genotype pair is first proposed for the parents. Then all offspring with a missing genotype in the corresponding background control position are completed by sampling alleles according to Mendelian transmission probabilities.

APPENDIX B

As in Sillanpää and Arjas (1998), we divide the chromosome into bins Δ1, Δ2,..., ΔNbins, where λ is the approximate posterior QTL intensity on interval Δj, obtained from the Monte Carlo simulation of Ncycs iteration cycles. In a backcross or an F2 intercross, let Djx^(d)=∑k=1Ncycs∑q=1Nqtl(k)1{lq(k)∈Δj,bqx(k)−μq(k)≤d}∑k=1Ncycs∑q=1Nqtl(k)1{lq(k)∈Δj} (7) be the empirical estimator of c.d.f. Djx(d) associated with the phenotypic effect of heterozygous QTL genotype x at a putative QTL in bin Δj and μq(k)=∑x=1Ngenbqx(k)∕Ngen . If fixation of QTL alleles in different grandparental lines is assumed, we can use distribution functions similar to those presented for F2 in Sillanpää and Arjas (1998).

Footnotes

  • Communicating editor: Z-B. Zeng

  • Received July 6, 1998.
  • Accepted December 28, 1998.
  • Copyright © 1999 by the Genetics Society of America

LITERATURE CITED

  1. ↵
    1. Green P. J.
    , 1995 Reversible jump Markov Chain Monte Carlo computation and Bayesian model determination. Biometrika 82: 711–732.
    OpenUrlAbstract/FREE Full Text
  2. ↵
    1. Haley C. S.,
    2. Knott S. A.,
    3. Elsen J.-M.
    , 1994 Mapping quantitative trait loci in crosses between outbred lines using least squares. Genetics 136: 1195–1207.
    OpenUrlAbstract/FREE Full Text
  3. ↵
    1. Heath S. C.
    , 1997 Markov chain Monte Carlo segregation and linkage analysis for oligogenic models. Am. J. Hum. Genet. 61: 748–760.
    OpenUrlPubMedWeb of Science
  4. ↵
    1. Hoeschele I.,
    2. Uimari P.,
    3. Grignola F. E.,
    4. Zhang Q.,
    5. Gage K. M.
    ,1997 Advances in statistical methods to map quantitative trait loci in outbred populations. Genetics 147: 1445–1457.
    OpenUrlAbstract/FREE Full Text
  5. ↵
    1. Jansen R. C.
    , 1993 Interval mapping of multiple quantitative trait loci. Genetics 135: 205–211.
    OpenUrlAbstract/FREE Full Text
  6. ↵
    1. Jansen R. C.
    , 1996 A general Monte Carlo method for mapping multiple quantitative trait loci. Genetics 142: 305–311.
    OpenUrlAbstract/FREE Full Text
  7. ↵
    1. Jansen R. C.,
    2. Stam P.
    , 1994 High resolution of quantitative traits into multiple loci via interval mapping. Genetics 136: 1447–1455.
    OpenUrlAbstract/FREE Full Text
  8. ↵
    1. Jansen R. C.,
    2. Johnson D. L.,
    3. Van Arendonk J. A. M.
    , 1998 A mixture model approach to the mapping of quantitative trait loci in complex populations with an application to multiple cattle families. Genetics 148: 391–399.
    OpenUrlAbstract/FREE Full Text
  9. ↵
    1. Janss L. L.,
    2. Thompson G. R.,
    3. Van Arendonk J. A. M.
    , 1995 Application of Gibbs sampling for inference in a mixed major genepolygenic inheritance model in animal populations. Theor. Appl. Genet. 91: 1137–1147.
    OpenUrlPubMedWeb of Science
  10. ↵
    1. Jensen C. S.,
    2. Kong A.
    , 1997 Blocking Gibbs sampling for linkage analysis in large pedigrees with many loops. Manuscript available at MCMC preprint service (http://www.stats.bris.ac.uk/MCMC/).
  11. ↵
    1. Jensen C. S.,
    2. Sheehan N.
    , 1998 Problems with determination of noncommunicating classes for Monte Carlo Markov Chain applications in pedigree analysis. Biometrics 54: 416–425.
    OpenUrlCrossRefPubMed
  12. ↵
    1. Kao C.-H.,
    2. Zeng Z.-B.
    , 1997 General formulas for obtaining the MLEs and the asymptotic variance-covariance matrix in mapping quantitative trait loci when using the EM algorithm. Biometrics 53: 653–665.
    OpenUrlCrossRefPubMed
  13. ↵
    1. Knott S. A.,
    2. Neale D. B.,
    3. Sewell M. M.,
    4. Haley C. S.
    , 1997 Multiple marker mapping of quantitative trait loci in an outbred pedigree of loblolly pine. Theor. Appl. Genet. 94: 810–820.
    OpenUrlCrossRefWeb of Science
  14. ↵
    1. Kong A.
    , 1991 Analysis of pedigree data using methods combining peeling and Gibbs sampling, pp. 379–385 in Computer Science and Statistics Proceedings of the 23rd Symposium on the Interface, edited by Keramidas E. M., Kaufman S. M.. Interface Foundation, Fairfax Station, VA.
  15. ↵
    1. Kruglyak L.
    , 1997 The use of a genetic map of biallelic markers in linkage studies. Nat. Genet. 17: 21–24.
    OpenUrlCrossRefPubMedWeb of Science
  16. ↵
    1. Kruglyak L.,
    2. Daly M. J.,
    3. Lander E. S.
    , 1995 Rapid multipoint linkage analysis of recessive traits in nuclear families, including homozygosity mapping. Am. J. Hum. Genet. 56: 519–527.
    OpenUrlPubMedWeb of Science
  17. ↵
    1. Lander E. S.,
    2. Green P.
    , 1987 Construction of multilocus genetic linkage maps in humans. Proc. Natl. Acad. Sci. USA 84: 2363–2367.
    OpenUrlAbstract/FREE Full Text
  18. ↵
    1. Lin S.
    , 1995 A scheme for constructing an irreducible Markov Chain for pedigree data. Biometrics 51: 318–322.
    OpenUrlCrossRefPubMedWeb of Science
  19. ↵
    1. Lin S.,
    2. Thompson E.,
    3. Wijsman E.
    , 1994 Finding noncommunicating sets for Markov Chain Monte Carlo estimation on pedigrees. Am. J. Hum. Genet. 54: 695–704.
    OpenUrlPubMed
  20. ↵
    1. Maliepaard C.,
    2. Van Ooijen J. W.
    , 1994 QTL mapping in a full-sib family of an outcrossing species, pp. 140–146 in Biometrics in Plant Breeding: Applications of Molecular Markers, edited by Van Ooijen J. W., Jansen J.. CPRO-DLO, Wageningen, The Netherlands.
  21. ↵
    1. Richardson S.,
    2. Green P. J.
    , 1997 On Bayesian analysis of mixtures with an unknown number of components. J. R. Stat. Soc. Ser. B 59: 731–792.
    OpenUrlCrossRef
    1. Satagopan J. M.,
    2. Yandell B. S.
    , 1996 Estimating the number of quantitative trait loci via Bayesian model determination. Special Contributed Paper Session on Genetic Analysis of Quantitative Traits and Complex Diseases, Biometric Section, Joint Statistical Meetings, Chicago, IL (available at ftp://ftp.stat.wisc.edu/pub/yandell/revjump.html/).
  22. ↵
    1. Satagopan J. M.,
    2. Yandell B. S.,
    3. Newton M. A.,
    4. Osborn T. C.
    , 1996 A Bayesian approach to detect quantitative trait loci using Markov Chain Monte Carlo. Genetics 144: 805–816.
    OpenUrlAbstract/FREE Full Text
  23. ↵
    1. Sheehan N.,
    2. Thomas A.
    , 1993 On the irreducibility of a Markov chain defined on a space of genotype configurations by a sampling scheme. Biometrics 49: 163–175.
    OpenUrlCrossRefPubMedWeb of Science
  24. ↵
    1. Sillanpää M. J.,
    2. Arjas E.
    , 1998 Bayesian mapping of multiple quantitative trait loci from incomplete inbred line cross data. Genetics 148: 1373–1388.
    OpenUrlAbstract/FREE Full Text
  25. ↵
    1. Sobel E.,
    2. Lange K.
    , 1996 Descent graphs in pedigree analysis: application to haplotyping, location scores, and marker-sharing statistics. Am. J. Hum. Genet. 58: 1323–1337.
    OpenUrlPubMedWeb of Science
  26. ↵
    1. Thompson E. A.
    , 1994 Monte Carlo likelihood in genetic mapping. Stat. Sci. 9: 355–366.
    OpenUrlCrossRefWeb of Science
  27. ↵
    1. Uimari P.,
    2. Hoeschele I.
    , 1997 Mapping linked quantitative trait loci using Bayesian analysis and Markov chain Monte Carlo algorithms. Genetics 146: 735–743.
    OpenUrlAbstract/FREE Full Text
  28. ↵
    1. Van Ooijen J. W.,
    2. Maliepaard C.
    , 1996 Plant Genome IV. Abstract at: http://probe.nalusda.gov:8000/otherdocs/pg/pg4/abstracts/p316.html/.
  29. ↵
    1. Wijsman E. M.
    , 1987 A deductive method of haplotype analysis in pedigrees. Am. J. Hum. Genet. 41: 356–373.
    OpenUrlPubMedWeb of Science
  30. ↵
    1. Zeng Z-B.
    , 1993 Theoretical basis for separation of multiple linked gene effects in mapping quantitative trait loci. Proc. Natl. Acad. Sci. USA 90: 10972–10976.
    OpenUrlAbstract/FREE Full Text
  31. ↵
    1. Zeng Z-B.
    , 1994 Precision mapping of quantitative trait loci. Genetics 136: 1457–1468.
    OpenUrlAbstract/FREE Full Text
View Abstract
Previous ArticleNext Article
Back to top

PUBLICATION INFORMATION

Volume 151 Issue 4, April 1999

Genetics: 151 (4)

ARTICLE CLASSIFICATION

INVESTIGATIONS
View this article with LENS
Email

Thank you for sharing this Genetics article.

NOTE: We request your email address only to inform the recipient that it was you who recommended this article, and that it is not junk mail. We do not retain these email addresses.

Enter multiple addresses on separate lines or separate them with commas.
Bayesian Mapping of Multiple Quantitative Trait Loci From Incomplete Outbred Offspring Data
(Your Name) has forwarded a page to you from Genetics
(Your Name) thought you would be interested in this article in Genetics.
Print
Alerts
Enter your email below to set up alert notifications for new article, or to manage your existing alerts.
SIGN UP OR SIGN IN WITH YOUR EMAIL
View PDF
Share

Bayesian Mapping of Multiple Quantitative Trait Loci From Incomplete Outbred Offspring Data

Mikko J. Sillanpää and Elja Arjas
Genetics April 1, 1999 vol. 151 no. 4 1605-1619
Mikko J. Sillanpää
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Elja Arjas
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
del.icio.us logo Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
Citation

Bayesian Mapping of Multiple Quantitative Trait Loci From Incomplete Outbred Offspring Data

Mikko J. Sillanpää and Elja Arjas
Genetics April 1, 1999 vol. 151 no. 4 1605-1619
Mikko J. Sillanpää
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Elja Arjas
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero

Related Articles

Cited By

More in this TOC Section

  • Selective Sweep at a QTL in a Randomly Fluctuating Environment
  • Comparison of the Full Distribution of Fitness Effects of New Amino Acid Mutations Across Great Apes
  • Broad Promotes Neuroepithelial Stem Cell Differentiation in the Drosophila Optic Lobe
Show more Investigations
  • Top
  • Article
    • Abstract
    • MODEL
    • SIMULATION ANALYSIS
    • RESULTS
    • DISCUSSION
    • Acknowledgments
    • APPENDIX A: PREPROCESSING AND PARAMETER ESTIMATION
    • APPENDIX B
    • Footnotes
    • LITERATURE CITED
  • Figures & Data
  • Info & Metrics

GSA

The Genetics Society of America (GSA), founded in 1931, is the professional membership organization for scientific researchers and educators in the field of genetics. Our members work to advance knowledge in the basic mechanisms of inheritance, from the molecular to the population level.

Online ISSN: 1943-2631

  • For Authors
  • For Reviewers
  • For Subscribers
  • Submit a Manuscript
  • Editorial Board
  • Press Releases

SPPA Logo

GET CONNECTED

RSS  Subscribe with RSS.

email  Subscribe via email. Sign up to receive alert notifications of new articles.

  • Facebook
  • Twitter
  • YouTube
  • LinkedIn
  • Google Plus

Copyright © 2019 by the Genetics Society of America

  • About GENETICS
  • Terms of use
  • Advertising
  • Permissions
  • Contact us
  • International access