- THIS ARTICLE
-
Abstract
- Full Text (PDF)
-
All Versions of this Article:
genetics.107.077784v1
177/2/1117 most recent - Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Email this article to a friend
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Chamberlain, A. J.
- Articles by Goddard, M. E.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Chamberlain, A. J.
- Articles by Goddard, M. E.
Originally published as Genetics Published Articles Ahead of Print on August 24, 2007.
Genetics, Vol. 177, 1117-1123, October 2007, Copyright © 2007
doi:10.1534/genetics.107.077784
The Number of Loci That Affect Milk Production Traits in Dairy Cattle
Amanda Jane Chamberlain*,
,1,
Helen Clare McPartlan* and
Michael Edward Goddard*,
* Animal Genetics and Genomics Platform, Department of Primary Industries Victoria, Attwood, 3049, Victoria, Australia and
Land and Food Resources, University of Melbourne, Parkville, 3052, Victoria, Australia
1 Corresponding author: Department of Primary Industries Victoria, 475 Mickleham Rd., Attwood, 3049, Victoria, Australia.
E-mail: amanda.chamberlain{at}dpi.vic.gov.au
We have used the results of an experiment mapping quantitative trait loci (QTL) affecting milk yield and composition to estimate the total number of QTL affecting these traits. We did this by estimating the number of segregating QTL within a half-sib daughter design using logic similar to that used to estimate the "false discovery rate" (FDR). In a half-sib daughter design with six sire families we estimate that the average sire was heterozygous for
5 QTL per trait. Also, in most cases only one sire was heterozygous for any one QTL; therefore at least 30 QTL were likely to be segregating for these milk production traits in this Holstein population.
MILK production and composition are classical quantitative traits, often described by the infinitesimal model, which assumes that the number of loci is infinitely large, each with an infinitely small effect (LYNCH and WALSH 1998). However, QTL mapping experiments have shown that there are not really an infinite number of loci and that some genes have a moderate effect on quantitative traits, i.e., DGAT1 on chromosome 14 (GRISART et al. 2004) and GHR on chromosome 20 (BLOTT et al. 2003). In fact, THODAY and THOMPSON (1976) showed that a normal distribution of phenotypes in a segregating population can be the result of as few as three independently segregating loci. The infinitesimal model produces good predictions of short-term selection response, but for prediction of long-term response, and especially for predicting the value of marker-assisted selection, it would be useful to know the number of real QTL underlying a typical quantitative trait. This would improve our knowledge of the genetics of quantitative traits, which would improve our prediction of the results of natural and artificial selection, as well as planning for marker-assisted selection.
The classical method of WRIGHT (1968) for estimating the number of genes underlying a quantitative trait is simple and has been widely used. However, due to the failure of many of its assumptions it is seriously biased. LANDE (1981), ZENG et al. (1990), and ZENG (1992) have all applied variations of this estimator to predict the number of loci having an effect on quantitative traits. However, they have all failed to correct all the incorrect assumptions of Wright. Because of this Wright's estimator has been considered unsuitable for general use, although it may be used to give an estimate of the minimum number of genes. The availability of molecular data has enabled new methods for estimating the number of genes or QTL. OTTO and JONES (2000) described a QTL-based estimator for the total number of loci. Other groups have estimated the number of QTL from looking at a number of QTL mapping studies (DOEBLEY et al. 1990; DOEBLEY and STEC 1991; BEAVIS 1994). All of these studies used only significant QTL and did not account for false positives or false negatives.
When many hypotheses are tested, as is the case in whole-genome scans for QTL, the concept of false discovery rate (FDR) has been introduced as an alternative to classical significance testing (BENJAMINI and HOCHBERG 1995; WELLER et al. 1997). The logic behind the FDR provides an estimate of the total number of genuinely false null hypotheses, which is the number of real QTL. In this article the logic behind the FDR was used, not to determine which QTL were significant, but to estimate across a whole-genome scan the number of QTL that were segregating per trait. We did this by estimating the number of segregating QTL within a half-sib daughter design.
100 high and 100 low Australian Selection Index (ASI) daughters. ASI is an economic index of milk, fat, and protein yields. These animals were genotyped for 130 microsatellite markers distributed across 21 chromosomes. Linkage analysis was performed using QTL express (SEATON et al. 2002). Phenotypes used in the linkage analysis, provided by the Australian Dairy Herd Improvement Scheme, were deregressed Australian breeding values (ABVs) for protein yield, fat yield, milk yield, protein percentage, fat percentage, fat percentage corrected for protein percentage (F% – P%), and ASI, where F% – P% = F% – 1.43P% and ASI = (3.8 x protein) + (0.9 x fat) – (0.048 x milk). Analyses were fixed at marker midpoints throughout the genome, and only positions >15 cM apart were chosen for this analysis. The final data set consisted of a total of 89 chromosomal positions, for between one and six sires, a total of 410 QTL tests. A total of seven traits were tested, resulting in 2870 data points. Chromosome, trait, family, and estimate of QTL effect, along with its standard error and absolute t-value, were extracted from the raw data for all of these positions. The absolute t-values were converted to a P-value. Three analyses were performed on this data set. First, the number of heterozygous QTL per sire per trait was estimated. Second, the number of heterozygous QTL per sire per trait was corrected for artifacts caused by linkage between adjacent chromosome segments. Third, the number of heterozygous sires for any given QTL was estimated. In each case, the data set is modeled in terms of a parameter and maximum likelihood (ML) used to estimate this parameter. In some cases the ML estimate was obtained by equating the expected and observed proportions and in other cases by numerically maximizing the likelihood in Excel (Microsoft).
Estimation of the proportion of heterozygous QTL:
Because many significance tests are performed, we expect to find some false positive QTL. We can correct for these when estimating the total number of real QTL, if the QTL that are really heterozygous are assumed to be always significant. Then, the expected proportion of marker brackets that contain an apparently significant QTL (x) can be predicted as
![]() |
This expected proportion of significant marker brackets was equated to the observed proportion (
) and the equation solved to estimate
as
![]() |
The estimates of the numbers of real (
) and false (
) QTL were calculated as
![]() |
![]() |
![]() |
, because there were six sires and seven traits.
Estimation of the proportion of heterozygous QTL, accounting for shadow QTL:
Positions that were at least 15 cM apart were used to minimize the probability that a QTL caused a significant result in two or more adjacent marker intervals. However, this may still have occurred, causing some QTL to have been double counted and so overestimating the number of QTL. Significant, but false, QTL caused by a real QTL in an adjacent marker bracket are referred to as shadow QTL. If there were no shadow QTL one would expect the significance tests in two adjacent marker brackets to be independent. To test whether any shadow QTL occurred, which would result in QTL being double counted, it was determined for all positions on a chromosome, bar the first, whether the QTL was significant at that position (b) and the position before it (a). Thus pairs of marker brackets in the same sire were classified into four possible categories, i.e., yes yes, yes no, no yes, or no no, where yes meant a significant QTL. The observed frequencies of these classes were calculated, as were the expected frequencies (assuming independence between adjacent marker brackets) as
![]() |
![]() |
![]() |
![]() |
2-test the independence of the observed data was tested to determine whether in fact QTL had been double counted.
Once double counting was confirmed a new model for determining the proportion of heterozygous QTL was used, accounting not only for false positives but also for shadow QTL. Let q be the probability of a heterozygous QTL and p be the significance level. It was assumed that the QTL at positions a and b occurred independently of each other and the probabilities of each of the heterozygous QTL status scenarios would be calculated as
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
and
that maximized the log L subject to the constraints that 0 <
< 1, 0 <
< 1, 0 < P(ab) < 1, and
.
The effect of a heterozygous QTL on a trait is the difference between the effects of the two chromosomes and can be arbitrarily labeled as positive (+yes) or negative (–yes). When there are two QTL on adjacent segments the same chromosome can carry the positive allele for both QTL, here labeled as +yes +yes, or the positive allele at one QTL and the negative allele at the other QTL, here labeled as +yes –yes. If the QTL are in linkage equilibrium, these two possibilities are equally likely, so that
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
and
in the same way as described above.
The two methods for estimating
and
(i.e., the method that distinguishes adjacent brackets with significant effects in opposite directions and the method that does not) represent opposite extremes in the models that could be used for the occurrence of real heterozygous QTL and shadow QTL. Therefore these estimates might be expected to bracket the correct estimates of q and s.
Calculation of the number of heterozygous sires for any given QTL:
This estimate of
, the proportion of marker brackets that contained a heterozygous QTL, did not determine the number of QTL that were segregating over the six sires for each trait. This is because it did not determine if QTL heterozygous in one sire are the same as those that were heterozygous in other sires. The purpose of the analysis described here was to do that.
Let ni be the number of chromosome positions (i.e., marker brackets) where i sires had an apparently significant QTL at the desired significance threshold (p). Since there were six sires in total, observed values of n0, n1, n2, n3, n4, n5, and n6 were calculated, and
was equal to the number of chromosome positions analyzed for all six sires. Let hi be the probability that there were i sires that were really heterozygous at any given QTL, i.e., h0, h1, h2, h3, h4, h5, and h6. Let xi be the probability that i sires were significant, assuming that all sires that were really heterozygous were significant, so that
![]() |
![]() |
![]() |
![]() |
![]() |
–
that maximized the log L subject to the constraints that 0 < hi < 1,
, and
,
, and
were set to 0, because it was unlikely that so many sires were really heterozygous for any one QTL.
We were also able to determine the expected number of heterozygous sires, when at least one of n sires is heterozygous, under a neutral selection model (CROW and KIMURA 1970) as
![]() |
250 for intervals >0.5. Therefore it can be concluded that there were in fact some QTL that were really heterozygous.
|
Estimation of the proportion of heterozygous QTL:
Table 2 presents the results estimating the proportion of heterozygous QTL (
), as well as the FDR, combining all sires and all traits, i.e., all 2870 data points. Table 2 shows that as the significance threshold became more stringent, the proportion of observed significant QTL (
) was reduced, as well as the estimate of the proportion of heterozygous QTL (
). The real number of heterozygous QTL (
), for each significance threshold, could be calculated using the formula
, and this enabled the calculation of the number of QTL segregating per sire per trait (Table 2). It is clear from Table 2 that the estimated number of real heterozygous QTL plateaus as the significance threshold becomes less stringent.
|
Estimating the proportion of heterozygous QTL, accounting for false positives and shadow QTL:
The
2-test for independence of significant QTL at adjacent chromosome positions was highly significant. Therefore it was concluded that some QTL had been double counted. Results for estimating the proportion of heterozygous QTL, accounting for false positives and shadow QTL, using a significance threshold of P < 0.1 are presented in Tables 3 and 4, where Table 4 distinguishes between two QTL of the same orientation and different orientation. From Tables 3 and 4 it can be seen that the proportion of heterozygous QTL lay somewhere between 5 and 8% and the proportion of shadow QTL, given a real QTL in the adjacent marker bracket, between 66 and 69%.
|
|
Tables 3 and 4 also present the results for estimating the proportion of heterozygous QTL, accounting for false positives and shadow QTL, using a significance threshold of P < 0.5. From Tables 3 and 4 it can be seen that the proportion of heterozygous QTL lay somewhere between 8 and 27% and the proportion of shadow QTL between 77 and 80%.
Estimation of the number of heterozygous sires for any given QTL:
Results from the ML analysis estimating the number of heterozygous sires for any given QTL, using the observations of significant (P < 0.5) QTL among the 49 marker brackets tested in all six sires, are presented in Table 5. It shows the number of QTL observed to be significant in i sires (ni) and the estimated proportion of QTL really heterozygous for i sires (hi). Sixteen percent of chromosome positions were not real heterozygous QTL in any of the six sires (
), and 84% of chromosome positions were real heterozygous QTL in only one of the six sires (
). These estimates changed very little when the constraint that
,
, and
should be zero was removed, so that
= 19%,
= 80%, and
= 1%.
|
Table 5 also shows the results using a more stringent significance threshold of P < 0.1. Again most chromosome positions were real heterozygous QTL in only one of the six sires (55%), although a larger proportion of chromosome positions were not real heterozygous QTL in any of the six sires (36%), and 8% of positions were real heterozygous QTL in three sires. These estimates did not change at all when the constraint that
,
, and
should be zero was removed. Under a neutral selection model the expected number of heterozygous sires, when at least one of six sires is heterozygous, is 1.99.
9 QTL segregating per sire per trait. This estimate of 9 might be an underestimate because the 89 marker brackets used did not cover the whole genome; only 21 of the 30 chromosomes were covered. Also, as stated previously, P < 0.5 was chosen to maximize the power of the experiment and minimize false negatives. However, the power was still <1. Therefore our estimate of 9 could be underestimated. Although this number does agrees well with the estimate of 10 heterozygous QTL per sire per trait estimated by HAYES and GODDARD (2001), using a meta-analysis of all the published data for dairy cattle.
This estimate, along with that of HAYES and GODDARD (2001), did not correct for shadow QTL. It was possible that a QTL caused a significant test statistic in an adjacent marker bracket leading to a shadow QTL, and therefore some QTL may have been double counted. In this study midpoints 15 cM apart were used to avoid double counting QTL; however, a significant
2-test revealed that double counting had occurred. An alternative, but less likely, explanation is that QTL really do occur more commonly than expected in adjacent marker brackets. Therefore the proportion of shadows was then estimated and the proportion of heterozygous QTL (
) reestimated, accounting for both false positives and shadow QTL (Tables 3 and 4). Using a significance threshold of P < 0.1 the proportion of heterozygous QTL was found to be between 5 and 8% (Tables 3 and 4), whereas previously, not accounting for shadow QTL, it was 9% (Table 2, P < 0.1). Using a significance threshold of P < 0.5, so as to avoid missing QTL, the proportion of heterozygous QTL was found to be between 8 and 27% (Tables 3 and 4), when previously it was 13% (Table 2, P < 0.5). The estimate of 27% heterozygous QTL was inflated because, at a significance threshold of P < 0.5, the number of observations that fell into the +yes –yes class was increased, compared with P < 0.1. The assumption that observations that fell into the +yes –yes class must indicate two QTL being heterozygous resulted in the estimated number of QTL being inflated by false positives. However, using the P < 0.1 significance threshold, there were very few of these +yes –yes observations because estimated effects at adjacent positions were positively correlated. Therefore the estimate of the proportion of heterozygous QTL (
) was somewhere between 5 and 13% (best estimate of 8%), and the number of QTL per sire per trait between three and nine (best estimate five). A weakness of this analysis is that it does not account for multiple QTL within a marker bracket. Therefore, if there are indeed multiple QTL within a marker bracket, the number of QTL will have been underestimated. We have also assumed the presence of QTL in adjacent brackets is independent. Clustering of QTL at distances >15 cM is possible but seems unlikely.
From the above analysis it was not known if the QTL that were segregating in one sire were also segregating in other sires. The ML estimation of the number of heterozygous sires for any given QTL (Table 5) indicates that most QTL were heterozygous only in one of the six sires, whether P < 0.1 or P < 0.5 was used.
Other articles have reported the average number of grandsires that appear to be heterozygous for significant QTL. Their estimates range from one in two to one in six (BOICHARD 1997; ZHANG et al. 1998; HEYEN et al. 1999; DE KONING et al. 2001; BENNEWITZ et al. 2003, 2004; BOICHARD et al. 2003). These articles conducted stringent marker–trait tests for QTL, summing over all sires/grandsires, for daughter and granddaughter designs, respectively. These tests are more likely to be significant if several sires/grandsires are heterozygous for the QTL. Therefore QTL that are heterozygous in only one of the sires/grandsires are generally not detected. This is evident in the articles mentioned above, in which QTL were rarely detected where less than two grandsires were heterozygous, particularly for QTL that were significant at the genomewide and experimentwide levels. Therefore, these articles have possibly overestimated the number of heterozygous grandsires because they have not detected QTL that are heterozygous in only one grandsire. This bias has been avoided in the method presented here, because it has used sire–marker–trait tests, opposed to marker–trait tests, to determine the presence of a QTL and therefore a heterozygous sire. However, allowing for the bias, the estimates of one in two to one in six grandsires being heterozygous for the detected QTL are not so different from the estimate of 1.0–1.25 (the average number of heterozygous sires where a QTL is detected) in six sires obtained here. This estimate is below that expected from a neutral selection model (1.99). Because the power to detect heterozygous QTL is <1 we will have underestimated the number of marker brackets with multiple heterozygous sires. However, we will also have underestimated the number of marker brackets with only one heterozygous sire, and so our estimate of the average number of heterozygous sires (1.0–1.25) should not be badly biased. Therefore the simplest explanation is that selection has decreased the frequency of heterozygotes for QTL affecting milk production traits. This would be the expected consequence of natural selection acting against mutant alleles and causing lower frequency of mutant alleles than expected under the neutral model.
Assuming the estimate of one in six sires applies to all positions in the genome, not just those used in the analysis presented here, there were between 18 (6 x 3) and 54 (6 x 9), best estimate of 30, QTL per trait segregating in the six sires. As the power of the experiment increases and the number of sires tested increases, the number of QTL detected increases, so these estimates should be regarded as minimum estimates. HAYES and GODDARD (2001) adjusted the estimate of LANDE (1981), to allow for unequal allelic effects, and the number of loci affecting a quantitative trait in an outbred population worked out to be 30, agreeing with the results presented here.
Since ASI is an index of three traits, it might be assumed that ASI should have three times as many QTL as component traits milk, fat, and protein yields. However, some QTL affect all three traits and some affect the components in a manner that leads to (almost) no effect on ASI. Therefore the number of QTL affecting ASI is less than three times the number affecting milk, fat, and protein yields. In addition, a QTL with a small but detectable effect on one of the three component traits may have such a small effect on ASI that it is no longer detectable.
BEAVIS, W. D., 1994 The power and deceit of QTL experiments: lessons from comparative QTL studies, pp. 250–266 in 49th Annual Corn and Sorghum Industry Research Conference. American Seed Trade Association, Washington, DC.
BENJAMINI, Y., and Y. HOCHBERG, 1995 Controlling the false discovery rate - a practical and powerful approach to multiple testing. J. R. Stat. Soc. Ser. B Methodol. 57: 289–300.
BENNEWITZ, J., N. REINSCH, C. GROHS, H. LEVEZIEL, A. MALAFOSSE et al., 2003 Combined analysis of data from two granddaughter designs: a simple strategy for QTL confirmation and increasing experimental power in dairy cattle. Genet. Sel. Evol. 35: 319–338.[CrossRef][Medline]
BENNEWITZ, J., N. REINSCH, V. GUIARD, S. FRITZ, H. THOMSEN et al., 2004 Multiple quantitative trait loci mapping with cofactors and application of alternative variants of the false discovery rate in an enlarged granddaughter design. Genetics 168: 1019–1027.
BLOTT, S., J. J. KIM, S. MOISIO, A. SCHMIDT-KUNTZEL, A. CORNET et al., 2003 Molecular dissection of a quantitative trait locus: a phenylalanine-to-tyrosine substitution in the transmembrane domain of the bovine growth hormone receptor is associated with a major effect on milk yield and composition. Genetics 163: 253–266.
BOICHARD, D., and M. D. BISHOP, 1997 Detection of QTLs influencing milk production and mastitis resistance with a granddaughter design in Holstein cattle. 48th Annual Meeting of the EAAP, Commission on Animal Genetics, Vienna, pp. 1–5.
BOICHARD, D., C. GROHS, F. BOURGEOIS, F. CERQUEIRA, R. FAUGERAS et al., 2003 Detection of genes influencing economic traits in three French dairy cattle breeds. Genet. Sel. Evol. 35: 77–101.[CrossRef][Medline]
BOVENHUIS, H., and C. SCHROOTEN, 2002 Quantitative trait loci for milk production traits in dairy cattle. 7th World Congress on Genetics Applied to Livestock Production, Montpellier, France, Vol. 31, pp. 27–34.
CROW, J. F., and M. KIMURA, 1970 An Introduction to Population Genetics Theory. Harper & Row, New York/Evanston, IL/London.
DE KONING, D. J., N. F. SCHULMAN, K. ELO, S. MOISIO, R. KINOS et al., 2001 Mapping of multiple quantitative trait loci by simple regression in half-sib designs. J. Anim. Sci. 79: 616–622.
DOEBLEY, J., and A. STEC, 1991 Genetic analysis of the morphological differences between maize and teosinte. Genetics 129: 285–295.[Abstract]
DOEBLEY, J., A. STEC, J. WENDEL and M. EDWARDS, 1990 Genetic and morphological analysis of a maize teosinte F2 population—implications for the origin of maize. Proc. Natl. Acad. Sci. USA 87: 9888–9892.
GRISART, B., F. FARNIR, L. KARIM, N. CAMBISANO, J. J. KIM et al., 2004 Genetic and functional confirmation of the causality of the DGAT1 K232A quantitative trait nucleotide in affecting milk yield and composition. Proc. Natl. Acad. Sci. USA 101: 2398–2403.
HAYES, B., and M. E. GODDARD, 2001 The distribution of the effects of genes affecting quantitative traits in livestock. Genet. Sel. Evol. 33: 209–229.[CrossRef][Medline]
HEYEN, D. W., J. I. WELLER, M. RON, M. BAND, J. E. BEEVER et al., 1999 A genome scan for QTL influencing milk production and health traits in dairy cattle. Physiol. Genomics 1: 165–175.
LANDE, R., 1981 The minimum number of genes contributing to quantitative variation between and within populations. Genetics 99: 541–553.
LYNCH, M., and B. WALSH, 1998 Genetics and Analysis of Quantitative Traits. Sinauer Associates, Sunderland, MA.
MOSIG, M. O., E. LIPKIN, G. KHUTORESKAYA, E. TCHOURZYNA, M. SOLLER et al., 2001 A whole genome scan for quantitative trait loci affecting milk protein percentage in Israeli-Holstein cattle, by means of selective milk DNA pooling in a daughter design, using an adjusted false discovery rate criterion. Genetics 157: 1683–1698.
OTTO, S. P., and C. D. JONES, 2000 Detecting the undetected: estimating the total number of loci underlying a quantitative trait. Genetics 156: 2093–2107.
SEATON, G., C. S. HALEY, S. A. KNOTT, M. KEARSEY and P. M. VISSCHER, 2002 QTL express: mapping quantitative trait loci in simple and complex pedigrees. Bioinformatics 18: 339–340.
THODAY, J. M., and J. N. THOMPSON, 1976 Number of segregating genes implied by continuous variation. Genetica 46: 335–344.[CrossRef]
WELLER, J. I., J. Z. SONG, Y. I. RONIN and A. B. KOROL, 1997 Designs and solutions to multiple trait comparisons. Anim. Biotechnol. 8: 107–122.
WRIGHT, S., 1968 Evolution and the Genetics of Populations. University of Chicago Press, Chicago.
ZENG, Z. B., 1992 Correcting the bias of Wright estimates of the number of genes affecting a quantitative character—a further improved method. Genetics 131: 987–1001.[Abstract]
ZENG, Z. B., D. HOULE and C. C. COCKERHAM, 1990 How informative is Wright estimator of the number of genes affecting a quantitative character. Genetics 126: 235–247.[Abstract]
ZHANG, Q., D. BOICHARD, I. HOESCHELE, C. ERNST, A. EGGEN et al., 1998 Mapping quantitative trait loci for milk production and health of dairy cattle in a large outbred pedigree. Genetics 149: 1959–1973.
Communicating editor: C. HALEY
- THIS ARTICLE
-
Abstract
- Full Text (PDF)
-
All Versions of this Article:
genetics.107.077784v1
177/2/1117 most recent - Alert me when this article is cited
- Alert me if a correction is posted
- SERVICES
- Email this article to a friend
- Similar articles in this journal
- Similar articles in PubMed
- Alert me to new issues of the journal
- Download to citation manager
- Reprints & Permissions
- CITING ARTICLES
- Citing Articles via Google Scholar
- GOOGLE SCHOLAR
- Articles by Chamberlain, A. J.
- Articles by Goddard, M. E.
- Search for Related Content
- PUBMED
- PubMed Citation
- Articles by Chamberlain, A. J.
- Articles by Goddard, M. E.

































