| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Genetics, Vol. 174, 455-464, September 2006, Copyright © 2006
doi:10.1534/genetics.106.058966
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Centre for Integrative Genetics and Department of Animal and Aquacultural Sciences, Norwegian University of Life Sciences, N-1432 Ås, Norway
2 Corresponding author: 475 Mickelham Rd., Attwood 3049, Melbourne, Victoria, Australia.
E-mail: ben.hayes{at}dpi.vic.gov.au
| ABSTRACT |
|---|
|
|
|---|
S1-, ß-,
S2-, and
-casein, are coded by the loci CSN1S1, CSN2, CSN1S2, and CSN3, respectively, located within a 250-kb segment of caprine chromosome 6 (MARTIN et al. 2002a). A number of genetic variants of the casein genes that affect milk production traits have been described. CSN1S1 is the most complex and is highly polymorphic (MARTIN et al. 2002b). As many as 18 different alleles have been identified for this gene (LEROUX et al. 2003; DEVOLD 2004), including alleles with "strong" effects, "medium" effects, and "weak" effects and "null" alleles associated with no synthesis of the protein (LEROUX et al. 1990; NEVEU et al. 2002). Goats carrying the strong variants have been reported to produce milk with a significantly higher casein, total protein, and fat content than goats carrying the weak variants (MANFREDI et al. 1993; REMEUF 1993; BARBIERI et al. 1995). However, reports of the effects of these alleles on milk yield are somewhat contradictory; for example, MAHE et al. (1994) reported that genetic variants of CSN1S1 had no effects on milk yield. A unique deletion in CSN1S1 (here termed the 04 allele) has been reported in Norwegian goats and is present in the Norwegian population at high frequency [0.86 (ÅDNØY et al. 2003)]. In most other European breeds the null alleles are at low frequency.
At least five variants of CSN2 are described, including a null allele associated with no protein expression (MAHE and GROSCLAUDE 1993; GALLIANO et al. 2004). Likewise at least five different alleles have been described for CSN1S2 (RECIO et al. 1997; MARTIN et al. 1999; ERHARDT et al. 2002). PRINZENBERG et al. (2005) recently described alleles of CSN3, including two new alleles and a new nomenclature for the 16 previously described alleles. The influence of CSN3 on milk production traits still remains to be evaluated.
While the mutations described above can dramatically affect the levels of expression of the genes they are found in, effects on total protein production are in some cases less pronounced. One hypothesis is that when expression of one casein gene is downregulated, the others can be upregulated to compensate (LEROUX et al. 2003). BOVENHUIS et al. (1992) suggested that the conflicting results of the effects of mutations in casein genes for cattle at least might be due to linkage between mutations in different caseins, as well as the different statistical models used in the analyses. They proposed a multigene model as an alternative to single-gene models. As mutations with effects on quantitative traits such as milk production can occur in exons, introns (e.g., ANDERSSON and GEORGES 2004), promoters, and other regulatory sequences (e.g., HOOGENDOORN et al. 2003), it is possible that the functional mutation(s) will not be within the set of mutations (such as single-nucleotide polymorphisms, SNPs) genotyped in the data set. However, the functional mutation(s) will have occurred in an ancestral chromosome segment, copies of which persist in the current generation of animals. These identical-by-descent (IBD) chromosome segments can be identified in the current population by unique haplotypes of SNP or marker alleles. This suggests investigating the effects of haplotypes of the mutation alleles across the casein loci on protein production as an alternative to considering the genes in isolation.
In this article, we investigate effects of haplotypes of polymorphisms in the casein genes on production traits in the Norwegian dairy goat population. We first sequenced fragments of all four of the casein loci in seven Norwegian dairy bucks to detect additional polymorphisms in the casein loci. We also sequenced fragments of the promoters for these genes. We then genotyped the 436 bucks representative of the commercial population, for these new SNPs as well as previously reported mutations, and constructed haplotypes within each casein locus. The pattern of linkage disequilibrium between the haplotypes suggested preferential recombination between CSN2 and CSN1S2; this was supported by analysis of linkage disequilibrium between individual markers. The haplotypes were found to have large effects on the milk production traits. The possibility of using these haplotypes in haplotype-assisted selection (HAS) is discussed.
| METHODS |
|---|
|
|
|---|
Primers for amplification and resequencing are given in supplemental material at http://www.genetics.org/supplemental/. Samples were sequenced using Dye terminator chemistry and an ABI3730 sequencer (Applied Biosystems, Foster City, CA). For the identification of SNPs a pipeline based on the phred, phrap, and polyphred programs was used as described by OLSEN et al. (2005). Contig assembly and putative SNPs were visually inspected using consed (GORDON et al. 1998) before assays were constructed and SNPs were genotyped using matrix-assisted laser desorption/ionization time-of-flight mass spectroscopy (MALDI-TOF MS) (Sequenom, San Diego). Assays for the genotyping are provided as supplemental information (http://www.genetics.org/supplemental/). For simplicity, polymorphisms are labeled SNP1SNP39, in order along the chromosome segment containing the caseins. Three polymorphisms in exon 12 of CSN1S1 were genotyped in the same assay and coded as alleles 1, 3, and 6. Sequences for the three polymorphisms in exon 12 are as follows:
Allele 1 is characterized by a single-point deletion coding for very low levels of mRNA (our unpublished data) and a truncated protein undetectable by IEF of milk samples (VEGARUD et al. 1989). So far this deletion is reported only in Norwegian goats with a surprisingly high frequency of 0.86 (ÅDNØY et al. 2003).
Haplotype construction:
To construct haplotypes from SNP genotypes of 436 bucks, two different programs were used in sequence. SimWalk (SOBEL and LANGE 1996) uses pedigree information to reconstruct haplotypes, while PHASE (STEPHENS et al. 2001) uses linkage disequilibrium and allele frequencies. Sufficient pedigree for successful haplotype construction with SimWalk was available only for a subset of the data, including 240 bucks belonging to half-sib families (common sire) of at least six individuals. The identified haplotypes of these 240 bucks were then assumed to be phase-known genotypes in the PHASE program, along with the phase-unknown genotypes of the rest of the 196 Norwegian bucks. Haplotypes were predicted within each casein locus. Haplotypes with a frequency of <1% were omitted from the data set.
Level of linkage disequilibrium:
Once the haplotypes were constructed, we estimated the level of linkage disequilibrium between all pairs of loci using the r2-statistic (HUDSON 1985). The result was visualized using the Haploview program (BARRET et al. 2005).
To determine if there were differences in intragenic and intergenic levels of linkage disequilibrium (LD) and to determine if levels of LD were significantly lower between pairs of markers flanking a potential site of preferential recombination suggested from the visualization of results (between CSN2 and CSN1S2), we fitted the following model to the r2-values between all pairs of markers,
![]() |
is the estimate of r2 between markers i and j; xij is an indicator variable that takes the value of 0 if both markers are in the same gene and 1 if the markers are in a different gene; zij is an indicator variable that takes the value of 1 if both markers are in either CSN1 or CSN2, 2 if both markers are in CSN1S2 or CSN3, and 3 otherwise; and
, where cij is the distance between markers i and j in megabases, and N is a parameter reflecting the effective population size (e.g., SVED 1971). We ran the above model in ASREML (GILMOUR et al. 1999) with values of N from 1 to 25,000. The parameter estimates were taken from the model with the value of N that maximized the log likelihood.
Estimation of haplotype and SNP effects:
The effects of the haplotypes on the buck's daughter-yield deviations (DYDs) were calculated for kilograms and percentage of fat, protein, and lactose, in addition to kilograms of milk. DYDs were calculated using data from the Norwegian Dairy Goat Control. Milk production records for each goat were first corrected for the effects of days in milk (DIM), lactation number, herd-year-test day (hy-td), and permanent environment (p-env), calculated from all goat control records, using the model
![]() |
![]() |
,
,
, and
are variance components estimated simultaneously with the effects, I is the identity matrix, and A is the additive genetic relationship matrix, including 173,179 animals.
Of the bucks with reliable haplotypes, only 207 had daughters in the goat control. These daughters had 29,032 test-day records for milk, 18,465 test-day records for protein, 18,246 test-day records for fat, and 18,600 records for lactose. The DYDs for the 207 bucks were calculated by averaging the daughters' corrected milk records. Next we estimated the effect of the haplotypes on the DYDs for the seven traits. Weighted analyses were performed in ASREML (GILMOUR et al. 1999) by the model
![]() |
![]() |
is the phenotypic variance (BOVENHUIS and MEUWISSEN 1996). However, for protein percentage the weight statement in the ASREML analyses had to be removed, due to the fact that there was no error variance left after fitting the haplotype and the buck. The (co)variance matrix was
![]() |
The additive genetic relationship matrix A included 2270 animals from six generations. A likelihood-ratio test was performed to evaluate if the haplotypes had significant effects on the milk production traits. Let L0 be the likelihood value for the model under H0, where the haplotype for a particular casein is omitted from the model. Then L1 is the likelihood value for the alternative model, that is, when all haplotypes are included in the model. The test statistic was defined as
. The haplotype effects for a particular casein locus were taken as significant if the test statistic was >2.71 (ALMASY and BLANGERO 1998). Interactions between casein locus haplotypes were also fitted if the haplotypes for more than one casein locus were significant.
In addition to testing effects of haplotypes, ASREML was used to test if the individual SNPs had significant effect on the milk production traits, using the following model for each of the 39 SNPs,
![]() |
| RESULTS |
|---|
|
|
|---|
|
|
possible haplotypes) indicates considerable LD in the segment of chromosome containing the SNPs. LD between pairs of loci varied from complete disequilibrium to almost no disequilibrium (Figure 1). As distances between loci increased, both the variability and the level of LD declined. Regions of high LD were not equally spread across the chromosome segment. LD was much higher between SNPs in CSN1S1 and CSN2 and SNPs in CSN1S2 and CSN3 than between SNPs in CSN2 and CSN1S2.
|
|
|
|
|
Effects of individual SNPs:
The significance of effects of the SNPs on four milk production traits is shown in Figure 4. None of the SNPs were significant at the 5% chromosome segmentwise threshold. At the 10% threshold, only two SNPs were significantSNP31 had a significant effect on protein percentage and SNP15 had a significant effect on lactose percentage. In general two areas of the chromosome segment appeared to have some effect on the traits: a cluster of SNPs in CSN3 and SNP14 in CSN1S1.
|
Haplotype tagging:
Only 11 of the 39 SNPs were required to capture all the information contained in the haplotypes according to the SNPtagger software (KE and CARDON 2003). This reflects the extensive LD in this chromosome segment. The SNPs required are given in supplemental information (http://www.genetics.org/supplemental/).
| DISCUSSION |
|---|
|
|
|---|
Both the haplotype analysis and the analysis of effects of individual SNPs were consistent in indicating two casein loci with effects on milk production traits in Norwegian goats: CSN1S1 and CSN3. The analyses of the effects of the individual SNPs indicated that two sites on the chromosome segment containing the caseins were having suggestive effects on the milk production traits: SNP14 in CSN1S1 with effects on protein and fat percentage and kilograms of fat, SN15 on lactose percentage, and a cluster of SNPs in the promoter of CSN3 with effects on protein and fat percentage and on kilograms of milk and lactose. The SNP14 deletion mutation in CSN1S1 leads to very low gene expression (our unpublished data) and is found at a high frequency, 0.86, in the Norwegian dairy goat population (ÅDNØY et al. 2003). The high frequency of this mutation, which decreases dry matter yield, is difficult to explain in light of the fact that the breeding goal for this goat population is an increased dry matter content in milk. One explanation could be that the founders of the Norwegian goat population, the number of which was likely to be small, carried the deletion at high frequency. Other deletions resulting in null or low levels of expression of CSN1 have been reported, as reviewed in NEVEU et al. (2005). NEVEU et al. (2005) and others proposed that lack of CSN1S1 disrupts the intracellular transport of caseins, leading to accumulation of caseins in the cisternae, which in turn disturbs the whole secretion process, including lipids. Our observation of reduced fat kilograms in the presence of the SNP14 deletion adds further weight to this hypothesis. Haplotype 1 of CSN1S1 carried the SNP14 deletion. However, while the SNP14 effect was sufficient to explain the effect of the CSN1S1 haplotypes on fat kilograms, it was not sufficient to explain the effect of these haplotypes on protein percentage. One explanation would be that this haplotype is in linkage disequilibrium with an as yet undetected mutation that is causing the other portion of the effect on protein percentage.
The effect of mutations in CSN3 on production traits in goats has not been previously reported. In our single SNP analysis, a cluster of SNPs in the promoter region of CSN3 had suggestive effects on protein percentage and fat percentage. However, the effect of the haplotypes of these SNPs had a higher test statistic. This suggests that none of the SNPs we have detected in this gene are the causative mutation, rather the SNPs, and even more so the haplotypes, are in linkage disequilibrium with the true causative mutation. The SNPs in CSN3 are in very strong linkage disequilibrium.
The variability in LD between the SNPs (r2 ranging between 1 and almost 0), particularly between those only tenss of bases apart, was striking. Mechanisms such as gene conversion have been proposed to explain the high variability between very closely spaced SNPs (e.g., (FRISSE et al. 2001). We found that the level of linkage disequilibrium for pairs of markers within each casein locus was higher than for pairs of markers in different loci, even though a correction was made for declining linkage disequilibrium with increasing distance between a pair of markers. This finding concurs with observations of reduced recombination in genic regions compared with that in nongenic regions (e.g., MYERS et al. 2005).
LD was not evenly spread across the chromosome segment containing the caseinshigh levels of LD were observed at either end of the segment, with low levels of LD in the middle of the segment. Levels of linkage disequilibrium for marker pairs spanning CSN2CSN1S2 were significantly lower than those for marker pairs located within the two segments, even when a correction was made for declining LD with distance. Preferential recombination in the region of the chromosome segment containing the caseins would ensure continuous generation of new combinations of casein gene alleles. There has been a previous report of recombination generating new alleles in caprine caseins (BEVILACQUA et al. 2002), although the proposed site of recombination was within the CSNS1 locus.
Milk from Norwegian dairy goats is used almost entirely for cheese production. Farmers are paid for kilograms of milk, but with a bonus for increased dry matter content. However, many farmers exceed their quota, so they would receive extra returns only with increased dry matter percentage. As we have identified haplotypes that increase protein percentage and fat percentage and decrease milk volume, for example, haplotype 4 in CSN1S1 and haplotype 2 in CSN3, HAS would seem to have potential in Norwegian dairy goats, particularly as such haplotypes appear to be at only moderate frequency in the population. The cost of HAS would be greatly reduced by the use of the 11 tagging SNPs, rather than the entire set of 39 SNPs.
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
| FOOTNOTES |
|---|
| LITERATURE CITED |
|---|
|
|
|---|
ÅDNØY, T., G. VEGARUD, T. G. DEVOLD, R. NORDBØ, I. COLBJØRNSEN et al., 2003 Effects of the 0- and F-alleles of alpha S1 casein in two farms of northern Norway. Proceedings of the International Workshop on Major Genes and QTL in Sheep and Goat, Toulouse, France.
ALMASY, L., and J. BLANGERO, 1998 Multipoint quantitative-trait linkage analysis in general pedigrees. Am. J. Hum. Genet. 62: 11981211.[CrossRef][Medline]
ANDERSSON, L., and M. GEORGES, 2004 Domestic-animal genomics: deciphering the genetics of complex traits. Nat. Rev. Genet. 5: 202212.[CrossRef][Medline]
BARBIERI, M. E., E. MANFREDI, J. M. ELSEN, G. RICORDEAU, J. BOUILLON et al., 1995 Effects of the alpha(S1)-casein locus on dairy performances and genetic-parameters of alpine goats. Genet. Sel. Evol. 27: 437450.[CrossRef]
BARRETT, J. C., B. FRY, J. MALLER and M. J. DALY, 2005 Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics 21: 263265.
BEVILACQUA, C., P. FERRANTI, G. GARRO, C. VELTRI, R. LAGONIGRO et al., 2002 Interallelic recombination is probably responsible for the occurrence of a new alpha(s1)-casein variant found in the goat species. Eur. J. Biochem. 269: 12931303.[Medline]
BOVENHUIS, H., and T. H. E. MEUWISSEN, 1996 Detection and mapping of quantitative trait loci. Workshop, Animal Genetics and Breeding, University of New England, Armidale, New South Wales, Australia.
BOVENHUIS, H., J. A. M. VANARENDONK and S. KORVER, 1992 Associations between milk protein polymorphisms and milk-production traits. J. Dairy Sci. 75: 25492559.[Abstract]
DEVOLD, T. G., 2004 Effect of milk polymorphism on protein composition in milk from Norwegian breed of dairy goat and Norwegian dairy cattle. D.Sc. Thesis, Norwegian University of Life Sciences, Ås, Norway.
ERHARDT, G., S. JAGER, E. BUDELLI and A. CAROLI, 2002 Genetic polymorphism of goat alpha(S2)-casein (CSN1S2) and evidence for a further allele. Milchwissenschaft 57: 137140.
FRISSE, L., R. R. HUDSON, A. BARTOSZEWICZ, J. D. WALL, J. DONFACK et al., 2001 Gene conversion and different population histories may explain the contrast between polymorphism and linkage disequilibrium levels. Am. J. Hum. Genet. 69: 831843.[CrossRef][Medline]
GALLIANO, F., R. SALETTI, V. CUNSOLO, S. FOTI, D. MARLETTA et al., 2004 Identification and characterization of a new beta-casein variant in goat milk by high-performance liquid chromatography with electrospray ionization mass spectrometry and matrix-assisted laser desorption/ionization mass spectrometry. Rapid Commun. Mass Spectrom. 18: 19721982.[CrossRef][Medline]
GILMOUR, A. R., B. R. CULLIS, S. J. WELHAM and R. THOMPSON, 1999 ASREML reference manual. NSW Agric. Biometric Bull. 3: 210.
GORDON, D., C. ABAJIAN and P. GREEN, 1998 Consed: a graphical tool for sequence finishing. Genome Res. 8: 195202.
HAYES, B. J., P. E. VISSCHER, H. MCPARTLAN and M. E. GODDARD, 2003 A novel multi-locus measure of linkage disequilibrium and its use to estimate past effective population size. Genome Res. 13: 635643.
HOOGENDOORN, B., S. L. COLEMAN, C. A. GUY, K. SMITH, T. BOWEN et al., 2003 Functional analysis of human promoter polymorphisms. Hum. Mol. Genet. 12: 22492254.
HUDSON, R. R., 1985 The sampling distribution of linkage disequilibrium under an infinite allele model without selection. Genetics 109: 611631.
KE, X. Y., and L. R. CARDON, 2003 Efficient selective screening of haplotype tag SNPs. Bioinformatics 19: 287288.
LEROUX, C., P. MARTIN, M. F. MAHE, H. LEVEZIEL and J. C. MERCIER, 1990 Restriction-fragment-length-polymorphism identification of goat alpha-S1-casein allelesa potential tool in selection of individuals carrying alleles associated with a high-level protein-synthesis. Anim. Genet. 21: 341351.[Medline]
LEROUX, C., F. LE PROVOST, E. PETIT, L. BERNARD, Y. CHILLIARD et al., 2003 Real-time RT-PCR and cDNA macroarray to study the impact of the genetic polymorphism at the alpha(s1)-casein locus on the expression of genes in the goat mammary gland during lactation. Reprod. Nutr. Dev. 43: 459469.[Medline]
MAHE, M. F., and F. GROSCLAUDE, 1993 Polymorphism of beta-casein in the Creole goat of Guadeloupeevidence for a null allele. Genet. Sel. Evol. 25: 403408.[CrossRef]
MAHE, M. F., E. MANFREDI, G. RICORDEAU, A. PIACERE and F. GROSCLAUDE, 1994 Effects of the alpha-S1-casein polymorphism on goat dairy performancesa within-sire analysis of alpine bucks. Genet. Sel. Evol. 26: 151157.[CrossRef]
MANFREDI, E., M. E. BARBIERI, J. BOUILLON, A. PIACERE, M. F. MAHE et al., 1993 Effects of alpha(S1) casein variants on dairy performance in goats. Lait 73: 567572.[CrossRef]
MARTIN, P., M. OLLIVIER-BOUSQUET and F. GROSCLAUDE, 1999 Genetic polymorphism of caseins: a tool to investigate casein micelle organization. Int. Dairy J. 9: 163171.[CrossRef]
MARTIN, P., M. SZYMANOWSKA, L. ZWIERZCHOWSKI and C. LEROUX, 2002 The impact of genetic polymorphism on the protein composition of ruminant milks. Reprod. Nutr. Dev. 42: 433459.[CrossRef][Medline]
MYERS, S., L. BOTTOLO, C. FREEMAN, G. MCVEAN and P. DONNELLY, 2005 A fine-scale map of recombination rates and hotspots across the human genome. Science 14(310): 321324.[CrossRef]
NEVEU, C., A. RIAUBLANC, G. MIRANDA, J. F. CHICH and P. MARTIN, 2002 Is the apocrine milk secretion process observed in the goat species rooted in the perturbation of the intracellular transport mechanism induced by defective alleles at the alpha(s1)-Cn locus? Reprod. Nutr. Dev. 42: 163172.[CrossRef][Medline]
OLSEN, H. G., S. LIEN, M. GAUTIER, H. NILSEN, A. ROSETH et al., 2005 Mapping of a milk production quantitative trait locus to a 420-kb region on bovine chromosome 6. Genetics 169: 275283.
PRINZENBERG, E. M., K. GUTSCHER, S. CHESSA, A. CAROLI and G. ERHARDT, 2005 Caprine kappa-casein (CSN3) polymorphism: new developments in molecular knowledge. J. Dairy Sci. 88: 14901498.
RECIO, I., M. L. PEREZRODRIGUEZ, L. AMIGO and M. RAMOS, 1997 Study of the polymorphism of caprine milk caseins by capillary electrophoresis. J. Dairy Res. 64: 515523.[CrossRef][Medline]
REMEUF, F., 1993 Influence of genetic-polymorphism of caprine alpha(S1)-casein on physicochemical and technological properties of goats milk. Lait 73: 549557.[CrossRef]
SOBEL, E., and K. LANGE, 1996 Descent graphs in pedigree analysis: applications to haplotyping, location scores, and marker-sharing statistics. Am. J. Hum. Genet. 58: 13231337.[Medline]
STEPHENS, M., N. J. SMITH and P. DONNELLY, 2001 A new statistical method for haplotype reconstruction from population data. Am. J. Hum. Genet. 68: 978989.[CrossRef][Medline]
SVED, J. A., 1971 Linkage disequilibrium and homozygosity of chromosome segments in finite populations. Theor. Popul. Biol. 2: 125141.[CrossRef][Medline]
VEGARUD, G. E., T. S. MOLLAND, M. J. BROVOLD, T. G. DEVOLD, P. ALESTROM et al., 1989 Rapid separation of genetic-variants of caseins and whey proteins using urea-modified gels and fast electrophoresis. Milchwissenschaft 44: 689691.
Communicating editor: R. W. DOERGE
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |