Genetics, Vol. 157, 1357-1367, March 2001, Copyright © 2001

Biased Estimation of the Recombination Fraction Using Half-Sib Families and Informative Offspring

L. Gomez-Rayaa
a AKVAFORSK, Institute for Aquaculture Research, N-1432 Ås, Norway

Corresponding author: L. Gomez-Raya, Area de Produccio Animal, Av. Alcalde Rovira Roure, 177, 25198 Lleida, Spain., luis.gomez{at}irta.es (E-mail)

Communicating editor: J. B. WALSH


*  ABSTRACT
*TOP
*ABSTRACT
*THEORY AND METHODS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*APPENDIX C
*APPENDIX D
*LITERATURE CITED

A maximum-likelihood method to estimate the recombination fraction and its sampling variance using informative and noninformative half-sib offspring is derived. Estimates of the recombination fraction are biased up to 20 cM when noninformative offspring are discarded. In certain scenarios, the sampling variance can be increased or reduced up to fivefold due to the bias in estimating the recombination fraction and the LOD score can be reduced up to 5 units when discarding noninformative offspring. Comparison of the estimates of recombination fraction, map distance, and LOD score when constructing a genetic map with 251 two-point linkage analyses and six families of Norwegian cattle was carried out to evaluate the implications of discarding noninformative offspring in practical situations. The average discrepancies in absolute value (average difference when using and neglecting noninformative offspring) were 0.0146, 1.64 cM, and 2.61 for the recombination fraction, map distance, and the LOD score, respectively. A method for simultaneous estimation of allele frequencies in the dam population and a transmission disequilibrium parameter is proposed. This method might account for the bias in estimating allele frequencies in the dam population when the half-sib offspring is selected for production traits.


A large effort has been made to construct genetic maps of markers for many farm animals. The traditional approach has been to use reference families in which one or more families consisting of parents and offspring are genotyped for genetic markers with as much coverage as possible of the genome. In cattle, the resource families have been consisting of full-sibs (BARENDSE et al. 1997 Down), mixed full- and half-sib groups (KAPPES et al. 1997 Down), or half-sibs (MA et al. 1996 Down). Increasing interest in the construction of genetic maps aimed to map quantitative trait loci (QTL) in cattle has focused on the development of male genetic maps in different populations (GEORGES et al. 1995 Down; MA et al. 1996 Down; VAGE et al. 2000 Down). Those genetic maps have been used for QTL mapping by typing sires and their half-sib sons and using records of production in the granddaughters in the so-called granddaughter design (WELLER et al. 1990 Down). Usually DNA from sires and half-sib offspring is available. However, DNA from dams is usually not available because they are culled before the time of typing is started. Noninformative half-sib offspring will be produced in this situation. For example, let a sire be heterozygous Aa at a codominant marker. Alleles segregating in the dam population are A, a, and a*, where a* is any other allele segregating in the population and different from the alleles of the sire. Five genotypes of half-sib offspring are possible: AA, Aa*, Aa, aa*, and aa. Tracing of sire alleles in offspring AA, aa, Aa*, and aa* is straightforward. However, tracing of the sire alleles in offspring with the same genotype as their sire, Aa, is not possible. Therefore, offspring with the same genotype as their sire are noninformative.

Methods for linkage analysis have either ignored offspring with genotype Aa (VILKKI et al. 1997 Down; MA et al. 1996 Down) or made use of estimates of allele frequency in the dam population with a single sire family at the time (GEORGES et al. 1995 Down).

The method proposed by GEORGES et al. 1995 Down to estimate allele frequencies in the dam population also assumed that either allele in a heterozygous sire segregates with equal probability in spite of the fact that those sons have been highly selected for production traits. Consequently, a higher frequency of the alleles associated to those traits among the selected offspring is expected.

More recently, consensus maps are being developed for each of the cattle chromosomes using data provided by different labs with different reference families (CASAS et al. 1999 Down; GU et al. 2000 Down). The linkage analysis is carried out using the software CRIMAP, which ignores noninformative half-sib offspring.

It is the purpose of this article to show that discarding noninformative offspring leads to biased estimates of the recombination fraction and, therefore, of the map distances. It is also shown that neglecting the dam contribution affects both sampling variance of the estimates of the recombination fraction and the LOD score.

The order of this article is (1) to describe a maximum-likelihood method to estimate the recombination fraction and its sampling variance assuming known allele frequencies in the dam population, (2) to show the magnitude of the bias in the estimates of the recombination fraction when discarding noninformative offspring, (3) to show the effect of discarding noninformative offspring on the sampling variance of the recombination fraction and on the LOD scores, (4) to describe a maximum-likelihood method for estimating both a transmission disequilibrium parameter and allele frequencies in the dam population accounting for selection in the half-sib offspring, and (5) to evaluate the discrepancies in estimating the recombination fraction, its sampling variance, map distance, and LOD score when using or discarding noninformative offspring in the construction of a genetic map with six large half-sib families of Norwegian cattle.


*  THEORY AND METHODS
*TOP
*ABSTRACT
*THEORY AND METHODS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*APPENDIX C
*APPENDIX D
*LITERATURE CITED

Maximum-likelihood estimation using informative and noninformative half-sib offspring:
Let a sire be heterozygous at two codominant markers with alleles A and a at the first marker and with alleles B and b at the second marker. Alleles a* and b* represent any other allele segregating in the dam population different from the alleles of the sire at the first and the second marker, respectively. Assume that linkage phase is AB/ab and that the true recombination fraction is c. Offspring genotypes produced after mating the sire with different dams and having one offspring from each mating can be classified according to the gamete inherited from the sire: AB, Ab, aB, ab, Ax, ax, xB, xb, and xx. In this notation, x refers to noninformative offspring at the corresponding marker. For example, offspring with genotype xx are noninformative at both markers. Table 1 shows the type of gametes inherited from sire to noninformative offspring given the gametes from sire and dam. The expected frequency of each genotype can be computed by multiplying the frequencies of sire and dam gametes and adding over all possible combinations for each type of gamete. The gamete frequencies for each type of gamete can be used in the maximum-likelihood estimation of the recombination fraction. The maximum-likelihood equation is

(1)

where

with ni being the gametic counts of offspring inheriting gamete with genotype i from sire (i = AB, Ab, aB, ab, Ax, ax, Bx, bx, xx). The values of {phi}i are given in Table 2. A considerable amount of data would be necessary to estimate accurately the 10 parameters of the above equation. Assuming both linkage equilibrium at each pair of alleles and known allele frequencies in the dam population, the number of parameters to be estimated reduces to the recombination fraction. Methods to estimate allele frequencies in the dam population are addressed later.


 
View this table:
In this window
In a new window

 
Table 1. Types of noninformative offspring produced after mating a sire to different dams for all possible gamete combinations from parents


 
View this table:
In this window
In a new window

 
Table 2. Gamete probabilities and their first derivative with respect to c

The maximum-likelihood equation assuming both linkage equilibrium and known allele frequencies in the dam population is

(2)

where {phi}i are the gamete probabilities for gamete i (i = AB, Ab, aB, ab, Ax, ax, Bx, bx, xx) with values given in Table 2.

A solution to the maximum-likelihood equation can be obtained using the grid search method or methods using derivatives such as Newton-Raphson (Appendix A). The latter methods allow computation of the approximate sampling variance of the estimates of the recombination fraction

where

and ni and {phi}i are the gametic counts and the probability of genotype i, respectively. The first derivatives of {phi}i with respect to c are given in Table 2.

In general, the linkage phase is unknown and offspring from several sires are available. The joint-likelihood equation using both phases and multiple-sire families is

where nf is the number of sire families and Li,1(c) is the likelihood for family i assuming linkage phase of the sire AB/ab as given in Equation 2. The likelihood for family i assuming linkage phase of the sire Ab/aB is Li,2(c). The latter likelihood can be constructed following the same rules as for Equation 2 but accounting for linkage phase Ab/aB. The two linkage phases are assumed to be equally frequent in the above equation. Assuming linkage phase is known for each sire, the approximate sampling variance of the recombination fraction is

where Li(c) is the likelihood of sire family i.

Bias in the estimation of the recombination fraction using informative offspring:
The maximum-likelihood estimate of the recombination fraction when noninformative offspring are discarded (c*) is simply the frequency of informative recombinants divided by the sum of frequencies of all informative offspring (Appendix B). From Table 1, the expected frequency of informative recombinants is

where ß = [(1 - fa)(1 - fB) + (1 - fA)(1 - fb)]. Similarly, the expected frequency of informative nonrecombinants is

where {alpha} = [(1 - fa)(1 - fb) + (1 - fA)(1 - fB)].

Consequently, the expected recombination fraction when discarding noninformative offspring is

(3)

The bias in the estimates of the recombination fraction when discarding noninformative offspring is bias = c - E(c*). Bias in the estimation of the recombination fraction is null in two trivial situations: (1) when the true recombination fraction is 0, and (2) when fA = fa or fB = fb. In the first situation, E(c*) = = = 0. In the second situation {alpha} = ß and consequently E(c*) = cß/(cß + (1 - c)ß) = c. Fig 1 illustrates the amount of bias for some particular cases.



View larger version (19K):
In this window
In a new window
Download PPT slide
 
Figure 1. Bias in the estimates of the recombination fraction when discarding noninformative offspring for varying values of the recombination fraction (c). The following scenarios were considered regarding allele frequencies in the dam population: (a) fA = 0.8, fB = 0.8, fa = 0.1, fb = 0.1; (b) fA = 0.4, fB = 0.4, fa = 0.1, fb = 0.1; (c) fA = 0.4, fB = 0.1, fa = 0.1, fb = 0.4; and (d) fA = 0.8, fB = 0.1, fa =0.1, fb = 0.8.

Variance of the estimates of the recombination fraction using informative offspring:
Discarding noninformative offspring also affects the sampling variance of the estimates of the recombination fraction. It is possible to evaluate the relation between both estimates using Fisher's approximation. The estimate of the sampling variance of the recombination fraction when using only informative offspring is

(4)

and Var(c) is the sampling variance of the estimate of the recombination fraction using all information (informative and noninformative offspring).

Monte Carlo simulation:
Analytic evaluation of the changes in the LOD scores by neglecting noninformative offspring is difficult because of the large number of combinations of the nine possible genotypes of offspring. A computer simulation was carried out in which one sire family was simulated with 50 offspring (average number of sons used in the construction of a genetic map of Norwegian cattle; VAGE et al. 2000 Down). Linkage phase and allele frequencies in the dam population were assumed to be known. The transmission of either allele at each marker from sire to offspring was assumed to be 1/2. Offspring from the sire was simulated using random drawings from a uniform distribution. If a drawing was in the interval between 0 and {phi}AB then the gametic count for AB was increased by one. If the drawing was between {phi}AB and ({phi}AB + {phi}Ab) then the gametic count for genotype Ab was increased by one. This process was carried out until all 50 offspring were assigned to one of the nine possible gametes according to their probability ({phi}i; i = AB, Ab, aB, ab, Ax, ax, Bx, bx, xx) and then accumulated in their corresponding gametic count. This approach is equivalent to simulating each of the markers having a Mendelian inheritance to his sons but it is computationally less demanding. Gametic counts were used to estimate recombination fraction and to compute LOD scores (log10[L(c)/L(1/2)]) using all offspring (informative and noninformative) and when discarding noninformative offspring. Each simulation set was replicated 10,000 times. Simulated recombination fractions were 0.05, 0.15, or 0.25. Results are given in Table 4.


 
View this table:
In this window
In a new window

 
Table 3. Frequency of offspring genotypes in the estimation of allele frequencies in the dam population


 
View this table:
In this window
In a new window

 
Table 4. Monte Carlo simulation results

Estimation of allele frequency in the dam population:
The above maximum-likelihood method to estimate recombination fraction using half-sib families assumed that allele frequencies in the dam population are known. In practice, allele frequencies might be estimated from the same data. A general maximum-likelihood estimation of allele frequencies in the dam population allowing estimation of the transmission disequilibrium of the sire alleles among offspring is described in this section. This disequilibrium would arise if the marker is linked to loci affecting a quantitative trait under selection. Selected offspring is the more frequent situation in cattle since bulls with lower estimated breeding values are culled after progeny testing.

Let a heterozygous sire, Aa, produce half-sib offspring assuming that the frequency of alleles A, a, and a* in the dam population are fA, fa, and fa*, respectively. As in the previous sections, allele a* corresponds to any allele segregating in the dam population different from the alleles of that particular sire. The gametes inherited from sire and dam to offspring, the offspring genotypes, and their frequencies are given in Table 3. In the notation of the table, the transmission parameter (v) is the probability of transmitting one of the sire alleles to his offspring.

The likelihood equation to be maximized is

where K is a constant and ni is the count for genotype i (i = AA, Aa, aa, Aa*, aa*). Solutions can be obtained using the grid search method after maximizing the above equation for v and fA. The maximum-likelihood solution for fa* is fa* = , where nT = nAa* + naa* + naa + nAa + nAA.

Two particular cases can be considered in the above equation. The first is when both alleles have the same probability of being transmitted to the offspring; i.e., v = . The likelihood equation becomes

which has an explicit solution

Using the same arguments a solution for fa is obtained:

The derivation is given in Appendix C. This is the situation considered by GEORGES et al. 1995 Down.

The second case is when reliable estimates of allele frequencies in the dam population are available and they do not need to be estimated. The explicit solution for the transmission parameter is

where A = [(nAA + nAa*)(1 - fA - fa*) - (naa + naa*)fA + (1 - 2fA - fa*)nAa], B = [(nAa + nAa*) fA], and C = -[(naa + naa*)(1 - fA - fa*)]. The derivation is given in Appendix D.

Comparison of a genetic map of Norwegian cattle constructed when using only informative offspring and when using all offspring:
The previous sections addressed the impact of discarding noninformative offspring on linkage analysis results. However, the assumption of known frequencies at the alleles segregating in the dam population was necessary. It is of interest to evaluate, in practice, the effect of discarding noninformative offspring in the construction of genetic maps. A genetic map of Norwegian cattle was constructed using six half-sib families with an average of 50 informative and noninformative offspring per family (VAGE et al. 2000 Down). The map covers all 29 autosomal chromosomes. More information about the map can be found in the web site http://www.nlh.no/Institutt/IHF/Genkartstorfe/.

Comparison of estimates of the recombination fraction using all (AO) vs. only informative offspring (IO) was carried out by computing (1) accumulated discrepancy in absolute value of the recombination fraction, {sum}n1ai=1|cIO(i) - cAO(i)|, where nla is the number of two-point linkage analyses performed, cIO(i) and cAO(i) are the estimates of the recombination fraction obtained in linkage analysis i when using only informative offspring and when using all offspring, respectively; and (2) accumulated net discrepancy, {sum}n1ai=1(cIO(i) - cAO(i)). The latter allows us to investigate if the recombination fractions are either increasing or decreasing in a systematic fashion and, therefore, enlarging or reducing genetic maps constructed when using only informative offspring. The same parameters were used to evaluate map distances, sampling variances of the recombination fraction, and LOD scores. A total of 251 two-point linkage analyses were carried out and the order of loci was as described by VAGE et al. 2000 Down. A list of the markers per autosomal chromosome is given in Table 5. Blood groups from the map of VAGE et al. 2000 Down were not used since the linkage analysis methodology described in this article applies only to codominant markers. For simplicity, estimates of allele frequency at each marker in the dam population were computed assuming that both alleles at each marker have the same probability of being transmitted from the sire to their sons.


 
View this table:
In this window
In a new window

 
Table 5. Comparison of 251 two-point linkage analyses when using and neglecting noninformative offspring in the construction of a genetic map of Norwegian cattle


*  RESULTS
*TOP
*ABSTRACT
*THEORY AND METHODS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*APPENDIX C
*APPENDIX D
*LITERATURE CITED

Examples of the magnitude of the bias in the estimation of the recombination fraction when ignoring the dam's contribution are depicted in Fig 1. The bias is very severe (up to 20 cM) when allele frequencies are very different at each of the two markers. Bias increases with the value of the recombination fraction and can be positive or negative depending on which sort of gamete (recombinant or nonrecombinant) is more frequently produced among noninformative offspring.

Fig 2 shows the effect of discarding noninformative offspring on the sampling variances of the estimates of the recombination fraction. The component {gamma} (Equation 4) increases or decreases as much as five times the sampling variance of the estimates of the recombination fraction when allele frequencies are very different at the markers. The value of {gamma} is up to 17% larger or smaller for moderate differences between allele frequencies and increases with smaller recombination fractions.



View larger version (17K):
In this window
In a new window
Download PPT slide
 
Figure 2. Factor affecting the sampling variance ({gamma}) of the estimates of the recombination fraction when discarding noninformative offspring for varying values of the recombination fraction (c). The following scenarios were considered regarding allele frequencies in the dam population: (a) fA = 0.8, fB = 0.8, fa = 0.1, fb = 0.1; (b) fA = 0.4, fB = 0.4, fa = 0.1, fb = 0.1; (c) fA = 0.4, fB = 0.1, fa = 0.1, fb = 0.4; and (d) fA = 0.8, fB = 0.1, fa = 0.1, fb = 0.8.

The results of the simulation experiment are given in Table 4. The average estimates of the recombination fraction over replicates when discarding noninformative offspring () were very similar to their predicted values (*). Table 4 also shows that maximum-likelihood estimation using AO allows unbiased estimation of the recombination fraction. The use of all offspring may increase LOD scores >5 units. As expected, there is not any bias in the estimates of the recombination fraction when the frequencies of alleles within each marker are identical (Table 4).

In practice, a large variety of situations may occur with respect to allele frequencies at linked markers in the dam population. A comparison of the values of linkage parameters in practical scenarios is needed to evaluate the impact of using only informative offspring in the construction of a genetic map. Discrepancies in absolute value for the recombination fraction, map distance, and LOD score for each of the autosomal chromosomes are given in Table 5. The highest average discrepancy for the recombination fraction corresponds to chromosome 3 (0.28). The average discrepancy of the genetic distance for the 13 intervals in that chromosome is 3.2 cM. The average discrepancies in absolute value considering the entire genome are 0.0146, 1.64 cM, and 2.61 for the recombination fraction, map distance, and LOD score, respectively.

It is also of interest to know if the use of only informative half-sib offspring alters the length of the genetic map or how the loss of information affects the variances of the estimates of the recombination fraction. Table 6 shows the accumulated net discrepancies of the recombination fraction, map distance, average variance of the estimate of the recombination fraction, and number of linkage analyses with significant LOD scores (with values >3) when using only informative offspring and when using all information. The genetic map constructed using only informative offspring has a reduced length (2806.9 cM) when compared to a genetic map constructed using all offspring (2897.4 cM). However, this reduction in length is only ~3%. The variance of the estimates of the recombination fraction is increased when only informative offspring are used (0.057), showing the loss of information by excluding noninformative offspring. Results from 22 linkage analyses were not significant (LOD score <3) when using only informative offspring but were significant (LOD score >3) when using all offspring.


 
View this table:
In this window
In a new window

 
Table 6. Accumulated genome linkage results when using and neglecting noninformative offspring in the construction of a genetic map of Norwegian cattle


*  DISCUSSION
*TOP
*ABSTRACT
*THEORY AND METHODS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*APPENDIX C
*APPENDIX D
*LITERATURE CITED

Linkage analysis using half-sib families is a powerful tool for accurate estimation of recombination fractions and map distances. This is because the widespread use of artificial insemination in cattle allows a very large number of male meioses to be available for linkage analysis. However, the maximum-likelihood method must make use of both informative and noninformative offspring to yield unbiased estimates of map distances. If only informative offspring are used in the linkage analysis then a severe bias may occur in the estimation of the recombination fraction. The magnitude of the bias depends on the allele frequency in the dam population being used. In particular cases, the bias can be up to 20 cM. The average bias shown in the construction of a genetic map of Norwegian cattle was 1.64 cM. One of the main reasons to construct genetic maps using half-sib families is their further use for QTL mapping in the granddaughter design. The results of this article indicate that genetic maps constructed using only informative offspring will be inaccurate, which may reduce power for QTL mapping. The amount of bias is also considerable with today's efforts aimed at fine structural mapping of already mapped QTL.

Discarding the information of noninformative half-sib offspring also increases the sampling variance of the estimates of the recombination fraction and reduces the LOD score. The relation between sampling variances of estimates using all offspring and only informative offspring was described by the term {gamma}. It is the increase or decrease in the sampling variance due to bias when estimating the recombination fraction. In extreme cases, {gamma} could take values of >5 indicating the large impact of neglecting noninformative offspring on the sampling variance of the estimates of the recombination fraction. Another factor affecting the sampling variances not represented in {gamma} is the amount of data available for the linkage analysis. Discarding noninformative offspring would reduce the amount of data available for the analysis and, consequently, the sampling variance of the estimates of the recombination fraction. This fact is likely contributing to the observed results of the sampling variance of the recombination fraction when constructing a genetic map of Norwegian cattle. The average sampling variance was 0.057 and 0.045 when using only informative and all offspring, respectively. In general, the use of the noninformative offspring increases the amount of information available for the analysis and, consequently, the LOD scores. The effect of using noninformative offspring on the LOD score is related to the magnitude of the allele frequencies in the dam population in two ways. First, higher frequencies at the alleles represented in the sire reduce the proportion of informative offspring and, therefore, the amount of information in the linkage analysis. Second, if the proportion of recombinants among noninformative offspring is higher than the proportion of recombinants produced by the sire then the estimates of the recombination fraction are biased downward. The smaller the recombination fraction is, the higher the LOD score becomes. The latter can be observed in the simulation results when allele frequencies were fA = 0.8, fa = 0.1, fB = 0.1, and fb = 0.8, yielding average LOD scores of 1.69 and 7.22 when using only informative and all offspring, respectively.

GEORGES et al. 1995 Down carried out linkage analysis using one single half-sib family and making use of estimates of allele frequencies in the dam population. However, their method to estimate allele frequencies assumed that the transmission parameter was 1/2, i.e., equal probability of transmission for either allele from the sire to his half-sib sons. The American cattle population is selected for production traits and hence the sample taken for their linkage analysis corresponded to highly selected sons. Consequently, it is expected that markers linked to quantitative trait loci for production traits would not segregate 50:50 among offspring. The use of those estimates may yield biased estimates of the allele frequencies, which might also yield biased estimates of the recombination fraction. In this situation, the proposed maximum-likelihood approach to simultaneously estimate dam allele frequencies and the transmission parameter should be the method of choice. In addition, this maximum-likelihood approach could be used to carry out hypothesis testing for the transmission parameter being different from 1/2 as a way to identify areas of the genome being effectively changed under selection. In fact, preliminary results using the genotyping information of Norwegian cattle yielded more significant results for the transmission parameter than would be expected by chance. Norwegian cattle is a dual-purpose breed in which the young bulls, tested and selected for growth performance, were used to construct the genetic map assuming a probability 1/2 of transmission of either allele from sire to offspring (VAGE et al. 2000 Down).

The increasing amount of male genetic maps generated in different cattle populations in the world (e.g., VILKKI et al. 1997 Down; GEORGES et al. 1995 Down; MA et al. 1996 Down; VAGE et al. 2000 Down) provides a source for genomic studies such as homogeneity of the recombination across families. This is also important for QTL mapping because of the usual assumption of all sires having the same recombination fraction. However, a maximum-likelihood method using both informative and noninformative offspring should be used to avoid a different bias in the estimation of the recombination fraction for each of the sire families since the sires are likely carriers of different alleles segregating at a different frequency in the dam population. Analysis of homogeneity of the recombination fraction could then be performed using the available male genetic maps.

There is an international initiative to develop consensus maps for each of the cattle chromosomes. Consensus maps for chromosomes 4 and 7 have been already published (CASAS et al. 1999 Down; GU et al. 2000 Down). Consensus maps are being constructed using data from different cattle populations and CRIMAP software, which makes use of only the informative offspring when analyzing half-sib families. It can be concluded from the results of this article that genetic distances in consensus maps are likely biased.

Maximum-likelihood methods using half-sib families have been discussed in the context of two-point linkage analyses. In practice, the methods can be extended to include any number of loci. The use of only informative offspring in multipoint linkage analysis may yield the wrong order of loci. Caution must be taken in using consensus maps, which ignore noninformative offspring in half-sib families.


*  ACKNOWLEDGMENTS

Genotype information from Norwegian cattle was provided by Helge Klungland, Dag Inge Våge, Ingrid Olsaker, and Sigbjørn Lien. Biological material for typing was provided by GENO (breeding organization of Norwegian cattle). This work has been supported by the Norwegian Research Council (NFR) project number 130162/130, titled "Strategic QTL Research Plan for Disease Resistance in Atlantic Salmon and Cattle."

Manuscript received June 15, 2000; Accepted for publication November 20, 2000.


*  APPENDIX A
*TOP
*ABSTRACT
*THEORY AND METHODS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*APPENDIX C
*APPENDIX D
*LITERATURE CITED

Solution to maximum-likelihood equations using all offspring and the Newton-Raphson method:
The maximum-likelihood equation (Equation 2) assuming linkage disequilibrium and known allele frequencies in the dam population is L(c) = K({phi}AB)nAB({phi}Ab)nAb({phi}aB)naB({phi}ab)nab x ({phi}Ax)nAx({phi}ax)nax({phi}xB)nxB({phi}xb)nxb({phi}xx)nxx, where {phi}i are the gamete probabilities for gamete i (i = AB, Ab, aB, ab, Ax, ax, Bx, bx, xx) with values given in Table 2.

Taking the natural logarithm to both sides of the likelihood equation and dropping the constant term from the equation,

The first derivative of the likelihood equation is

where i refers to the nine possible gametes inherited from the sire (i = AB, Ab, aB, ab, Ax, ax, xB, xb, and xx). The value of the first derivatives of {phi}i with respect to c is given in Table 2.

The second derivative of the likelihood equation is

The Newton-Raphson solutions are obtained by iterating on the equation,

where c' is either a first guess or a computed value of the recombination fraction in the previous iteration.

The approximate sampling variance of the estimate of the recombination fraction is Var(c) {approx} 1/-[{partial}2ln L(c)/{partial}c2]c=c.


*  APPENDIX B
*TOP
*ABSTRACT
*THEORY AND METHODS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*APPENDIX C
*APPENDIX D
*LITERATURE CITED

Maximum-likelihood solution using only informative offspring:
Assuming linkage phase AB/ab in the sire, let nAB, naB, nAb, and nab be the observed counts for gametes AB, Ab, aB, and ab inherited from the sire. The maximum-likelihood equation discarding noninformative offspring is

The first derivative of natural logarithm of the likelihood is

Making the above expression equal to 0, a solution is obtained for the recombination fraction:

That is, the maximum-likelihood estimate is the number of recombinants divided by the total number of informative offspring. The approximate sampling variance of the estimates is Var(c) {approx} , where


*  APPENDIX C
*TOP
*ABSTRACT
*THEORY AND METHODS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*APPENDIX C
*APPENDIX D
*LITERATURE CITED

Solution to the maximum-likelihood equation to estimate allele frequency in the dam population when either allele has the same probability of transmission from sire to sons (v = ):
The likelihood equation is

Taking the natural logarithm to both sides of the above equation after dropping the constant term leads to

Taking the derivative of ln L(fA) with respect to fA leads to

The solution to the natural logarithm of the maximum likelihood is obtained by equating to 0 the above expression and by solving for fA the resulting equation

where fa* = (nAa* + nAa*)/nT. The solution for fa is obtained using the same arguments as for fA,


*  APPENDIX D
*TOP
*ABSTRACT
*THEORY AND METHODS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*APPENDIX C
*APPENDIX D
*LITERATURE CITED

Estimation of the transmission parameter assuming known allele frequencies in the dam population:
The maximum-likelihood equation is

Taking the natural logarithm to both sides of the above equation after dropping the constant term leads to

Taking the derivative of the natural logarithm of the likelihood with respect to v,

Equating the above equation to 0 leads to

The above equation reduces to

where A = [(nAA + nAa*)(1 - fA - fa*) - (naa + naa*)fA + (1 - 2fA - fa*)nAa], B = [(nAa + nAa*)fA], and C = -[(naa + naa*)(1 - fA - fa*)].

After rearranging this equation a quadratic is obtained,

with one solution within the parameter space:


*  LITERATURE CITED
*TOP
*ABSTRACT
*THEORY AND METHODS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*APPENDIX C
*APPENDIX D
*LITERATURE CITED

BARENDSE, W., D. VAIMAN, S. J. KEMP, Y. SUGIMOTO, and S. M. ARMITAGE et al., 1997  A medium density genetic linkage map of the bovine genome. Mamm. Genome 8:21-28[Medline].

CASAS, E., W. BARENDSE, J. E. BEEVER, B. M. BURNS, and S. K. DAVIS et al., 1999  Bovine chromosome 4 workshop: consensus and comprehensive linkage maps. Anim. Genet. 30:375-377[Medline].

GEORGES, M., D. NIELSEN, M. MACKINNON, A. MISHRA, and R. OKIMOTO et al., 1995  Mapping quantitative trait loci controlling milk production in dairy cattle by exploiting progeny testing. Genetics 139:907-920[Abstract].

GU, Z., L. GOMEZ-RAYA, D. I. VÅGE, K. ELO, and W. BARENDSE et al., 2000  Consensus and comprehensive linkage maps of bovine chromosome 7 (BTA7). Anim. Genet. 31:206-209[Medline].

KAPPES, M. S., J. W. KEELE, R. S. STONE, R. A. MCGAW, and T. S. SONSTEGARD et al., 1997  A second-generation linkage map of the bovine genome. Genome Res. 7:235-249[Abstract/Free Full Text].

MA, R. Z., J. E. BEEVER, Y. DA, C. A. GREEN, and I. RUSS et al., 1996  A male linkage map of the cattle (Bos taurus) genome. J. Hered. 87:261-271[Abstract/Free Full Text].

GE, D. I., I. OLSAKER, H. KLUNGLAND, L. GOMEZ-RAYA, and S. LIEN, 2000  A male genetic map designed for QTL mapping in Norwegian Cattle. Acta Agric. Scand. Sect. A, Anim. Sci. 50:56-63.

VILKKI, H. J., D. J. DE KONING, K. ELO, R. VELMALA, and A. MAKI-TANILA, 1997  Multiple marker mapping of quantitative trait loci of Finnish dairy cattle by regression. J. Dairy Sci. 80:198-204[Abstract/Free Full Text].

WELLER, J. I., Y. KASHI, and M. SOLLER, 1990  Power of daughter and granddaughter designs for determining linkage between marker loci and quantitative trait loci in dairy cattle. J. Dairy Sci. 73:2525-2537[Abstract].




This article has been cited by other articles:


Home page
J ANIM SCIHome page
L. Gomez-Raya
Inferring unknown genotypes of sires at codominant deoxyribonucleic acid markers in half-sib families
J Anim Sci, June 1, 2009; 87(6): 1872 - 1882.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
L. Gomez-Raya, H. G. Olsen, F. Lingaas, H. Klungland, D. I. Vage, I. Olsaker, S. B. Talle, M. Aasland, and S. Lien
The Use of Genetic Markers to Measure Genomic Response to Selection in Livestock
Genetics, November 1, 2002; 162(3): 1381 - 1388.
[Abstract] [Full Text] [PDF]


Home page
J ANIM SCIHome page
Y. Da, J. Garbe, N. London, and J. Xu
Linkage analysis using direct and indirect counting and relative efficiencies for codominant and dominant loci
J Anim Sci, October 1, 2002; 80(10): 2528 - 2539.
[Abstract] [Full Text] [PDF]