help button home button Genetics AJP: Endocrinology
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS

This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Bijma, P.
Right arrow Articles by Woolliams, J. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Bijma, P.
Right arrow Articles by Woolliams, J. A.
Genetics, Vol. 154, 1865-1877, April 2000, Copyright © 2000

A General Procedure for Predicting Rates of Inbreeding in Populations Undergoing Mass Selection

Piter Bijmaa, Johan A. M. Van Arendonka, and John A. Woolliamsb
a Animal Breeding and Genetics Group, Wageningen Institute of Animal Sciences, Wageningen Agricultural University, 6700 AH Wageningen, The Netherlands
b Roslin Institute (Edinburgh), Roslin, Midlothian EH25 9PS, United Kingdom

Corresponding author: Piter Bijma, Animal Breeding and Genetics Group, Department of Animal Sciences, Wageningen Agricultural University, P.O. Box 338, Marijkeweg 40, 6700 AH Wageningen, The Netherlands., piter.bijma{at}alg.vf.wau.nl (E-mail)

Communicating editor: R. G. SHAW


*  ABSTRACT
*TOP
*ABSTRACT
*DERIVATION OF EXPRESSIONS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*LITERATURE CITED

Predictions of rates of inbreeding ({Delta}F), based on the concept of long-term genetic contributions assuming the infinitesimal model, are developed for populations with discrete or overlapping generations undergoing mass selection. Phenotypes of individuals are assumed to be recorded prior to reproductive age and to remain constant over time. The prediction method accounts for inheritance of selective advantage both within and between age classes and for changing selection intensities with age. Terms corresponding to previous methods that assume constant selection intensity with age are identified. Predictions are accurate (relative errors <=8%), except for cases with extreme selection intensities in females in combination with high heritability. With overlapping generations {Delta}F reaches a maximum when parents are equally distributed over age classes, which is mainly due to selection of the same individuals in consecutive years. {Delta}F/year decreases much more slowly compared to {Delta}F/generation as the number of younger individuals increases, whereas the decrease is more similar as the number of older individuals increases. The minimum {Delta}F (per year or per generation) is obtained when most parents were in the later age classes, which is mainly due to an increased number of parents per generation. With overlapping generations, the relationship between heritability and {Delta}F is dependent on the age structure of the population.


IN the absence of selection and with a Poisson distribution of family size, expected rates of inbreeding are related directly to the number of parents: E({Delta}F) {approx} 1/8Nm + 1/8Nf (WRIGHT 1969 Down, p. 212). In selected populations, however, superior families contribute more offspring to the next generation than average families. This increases the rate of inbreeding of a selected population compared to an unselected population. Prediction of rates of inbreeding in selected populations is difficult, because selection decisions are correlated over generations due to the inheritance of selective advantage. Methods accounting for only one or two generations of selection (e.g., BURROWS 1984A Down, BURROWS 1984B Down) therefore generally underestimate the rate of inbreeding (WRAY et al. 1990 Down; see CABALLERO 1994 Down for a review).

Two approaches to prediction of rates of inbreeding for selected populations can be distinguished. First, rates of inbreeding can be predicted on the basis of the variance of allele frequency, using the idea of accumulation of selective advantages over generations (ROBERTSON 1961 Down). Using this approach and equilibrium genetic variances, SANTIAGO and CABALLERO 1995 Down obtained accurate predictions for populations with discrete generations under mass selection. NOMURA 1996 Down extended that method to populations with overlapping generations and equal numbers of parents per sex in every age class. Second, rates of inbreeding can be predicted using the concept of long-term genetic contributions. Rates of inbreeding are proportional to the sum of squared long-term genetic contributions of ancestors (WRAY and THOMPSON 1990 Down). WRAY and THOMPSON 1990 Down obtained accurate predictions of rates of inbreeding for populations with discrete generations under mass selection, using iterative regression methods. For discrete generations and mass selection a closed form expression was obtained by WOOLLIAMS et al. 1993 Down. For more complicated selection schemes, however, predictions became unmanageable due to the recursive nature of the procedure and the need for predicting the variance of long-term genetic contributions (WRAY et al. 1994 Down).

Recently, WOOLLIAMS and BIJMA 2000 Down showed that the variance of long-term genetic contributions is related to their squared expectation, making a separate prediction of the variance redundant. Furthermore, WOOLLIAMS et al. 1999 Down obtained general predictions of expected genetic contributions using equilibrium genetic variances instead of second generation genetic variances (WOOLLIAMS et al. 1993 Down). Using the approach of WOOLLIAMS et al. 1999 Down, BIJMA and WOOLLIAMS 1999 Down obtained accurate predictions of genetic contributions for populations with overlapping generations under mass or sib-index selection. However, they did not develop predictions for rates of inbreeding for those schemes.

The aim of this article is twofold. First, explicit prediction equations for rates of inbreeding in populations with discrete or overlapping generations under mass selection are developed, on the basis of the theory of WOOLLIAMS et al. 1999 Down and WOOLLIAMS and BIJMA 2000 Down. These predictions are valid for any distribution of parents across age classes, overcoming the restriction of NOMURA 1996 Down, to give a general and practical method for mass selection with overlapping generations. These methods are compared to methods of SANTIAGO and CABALLERO 1995 Down for discrete generations, and to methods of NOMURA 1996 Down for the special case of equal numbers of parents per age class with overlapping generations. The accuracy of predictions is examined using simulation. Second, relationships between rates of inbreeding and genetic or population parameters are examined, and differences between populations with discrete and overlapping generations are presented and discussed.


*  DERIVATION OF EXPRESSIONS
*TOP
*ABSTRACT
*DERIVATION OF EXPRESSIONS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*LITERATURE CITED

Population model:
This section describes the population and the selection procedures for which rates of inbreeding are predicted. This model is also used in the simulation. The trait considered is assumed to be determined by an infinite number of additive loci, each having an infinitesimal effect (infinitesimal model; FISHER 1918 Down). Phenotypic values are the sum of additive genetic values (breeding values) and environmental values, P = A + E. A population with either discrete or overlapping generations under mass selection is modeled. With parents up to a maximum age of cmax there are 2cmax categories, one for each sex and age of parent. Categories are indexed by k or by l, so k = 1 ... cmax are males, and k = cmax+1 ... 2cmax are females. With discrete generations, there are only two categories: males and females that are indexed by s = m or f. Before reproductive age, phenotypes of individuals are recorded and remain unchanged over time, so that ranking of individuals within categories is constant over time. Within categories, individuals are ranked on their phenotype and each year the highest-ranking nk individuals are selected from the kth category, to produce the next cohort. The total number of male and female parents of each cohort is Nm = {Sigma}cmaxk=1nk and Nf = {Sigma}2cmaxk=cmax+1nk, respectively. Each sire is mated at random to d dams (d = ), and each dam produces a fixed number, no, of offspring (1/2no of each sex), so that for each sex the total number of offspring born in a cohort is T = noNf. The unit of age, i.e., the interval between consecutive age classes, was 1 year. Genetic contributions and rates of inbreeding per year therefore will be equal to genetic contributions and rates of inbreeding per cohort.

General:
The prediction of {Delta}F is based on the concept of long-term genetic contributions (JAMES and MCBRIDE 1958 Down). The long-term genetic contribution (ri) of ancestor i in cohort t1 is defined as the proportion of genes present in individuals in cohort t2 deriving by descent from i, where (t2 - t1) -> {infty} (WOOLLIAMS et al. 1993 Down). In the remainder of the current article, long-term genetic contributions of ancestors are referred to as "genetic contributions," or simply as "contributions."

Rates of inbreeding are predicted from WOOLLIAMS and BIJMA 2000 Down,

(1)

where 1T = (1 1 1 ... 1), N is a 2cmax x 2cmax diagonal matrix containing the numbers of parents selected from every category, u is a 2cmax vector of expected lifetime long-term genetic contributions of parents, i.e., u2 = (u2i,1 u2i,2 ... u2i,2cmax), where ui,s is the expected lifetime long-term contribution of individual i in category s conditional on its selective advantage (which in mass selection is the breeding value), and {delta} is a 2cmax vector of correction factors for deviations of the variance of family size (Vn) from independent Poisson variances. Throughout the article, family size refers to the number of selected offspring of a parent, not to the number of candidates. With mass selection and fixed no, {delta} takes negative values, showing that {Delta}F for fixed no is less than for no ~ Poisson. In Equation 1, categories are exclusive, i.e., individuals are in only one category, and categories are therefore indexed by s instead of k. The scalar equivalent of Equation 1 is E({Delta}F) = {Sigma}snsE(u2i,s) + {Sigma}sns{delta}s, where {Sigma}s denotes summation over all exclusive categories.

To calculate E(u2i,s), the selective advantage of the mate has to be included since the mate affects the contribution of an ancestor. With random mating and mass selection, however, the selective advantages of mates are independent and it is therefore possible to ignore the mate when calculating ui,s and add the mate term when calculating E(u2i,s). The advantage of this is that the selective advantage contains only one term (the breeding value of the individual), which simplifies the prediction of ui,k.

Rates of inbreeding are predicted in three steps. First, expected genetic contributions are predicted using the method of WOOLLIAMS et al. 1999 Down. Second, E(u2i,s) is derived and third, {delta}s is derived. Discrete and overlapping generations are treated separately.

The difference between the current prediction and the method of WOOLLIAMS et al. 1993 Down is (1) the current prediction is based on equilibrium genetic variances, which simplifies the prediction of ui,s (WOOLLIAMS et al. 1999 Down); (2) the variance of genetic contributions is not predicted separately, since it is related to the mean (WOOLLIAMS and BIJMA 2000 Down).

Discrete generations
Step 1: prediction of expected long-term genetic contributions: Expected genetic contributions of ancestors are obtained from the linear model (BIJMA and WOOLLIAMS 1999 Down),

(2)

where s denotes males or females, {alpha}s is the expected contribution for an average ancestor of sex s, and ßs is the regression coefficient of the contribution on the breeding value (Ai,s) of the ancestor as a deviation from the average of the selected group (s) for sex s. In discrete generations, {alpha}s = and ßs = , where {lambda} = i{sigma}-1P is the average regression coefficient of the number of selected male and female offspring on the breeding value of the parent, and {pi} = (1 - {kappa}h2) is the average regression coefficient of the breeding value of selected male and female offspring on the breeding value of the parent (BIJMA and WOOLLIAMS 1999 Down). Here, i = (im + if) is selection intensity, {kappa} = ({kappa}m + {kappa}f) is PEARSON 1903 Down variance reduction coefficient, and h2 = , where {sigma}2A and {sigma}2P are BULMER 1971 Down equilibrium genetic and phenotypic variance.

Step 2: derivation of E(u2i,s): Substituting Equation 2 and with terms added for the mate,

(3)


(4)

where j denotes the mate and

(5)

From Equation 1, ignoring the second term, E({Delta}F) = [NmE(u2i,m) + NfE(u2i,f)]. From Equation 3 and Equation 4 and the equations for ßs, {lambda}, and {pi}, predicted {Delta}F (see Appendix A) is

(6)

For Nm = Nf = N, the result simplifies to

(7)

The assumption for Equation 6 and Equation 7 is that, conditional on the selective advantage [i.e., conditional on (Ai,s - s) in mass selection] family size follows a Poisson distribution (WOOLLIAMS and BIJMA 2000 Down), which is approximately the case with mass selection when no ~ Poisson. A numerical example is in APPENDIX A.

Step 3: Correction of E({Delta}F) for deviations of Vn from Poisson variances: With fixed no, family size follows a hypergeometric distribution (BURROWS 1984B Down) and a correction is required according to the second term of Equation 1. In this article, the hypergeometric variance is approximated by a binomial variance, which simplifies the prediction. For more complicated selection strategies, e.g., index selection, a hypergeometric variance may be required (WOOLLIAMS and BIJMA 2000 Down).

With discrete generations, the second term of Equation 1 reduces to 1/8[Nm{delta}m + Nf{delta}f], where {delta}s = {alpha}TVn(s),dev{alpha}, {alpha}T = ({alpha}m {alpha}f) and Vn(s),dev is the 2 x 2 matrix of deviations of the (co)variance of family size from Poisson variances for sex s (WOOLLIAMS and BIJMA 2000 Down). Diagonal elements of Vn(s),dev are obtained as Vn(s),dev = Vn(s) - Vn(s),Poisson, which are of the form np(1 - p) - np = -np2, where n is the number of candidates and p is the selected proportion. Off-diagonal elements of Vn(s),dev are zero. For discrete generations the total correction (Appendix A) equals

(8)

Relation to SANTIAGO and CABALLERO (1995): The prediction equation of SANTIAGO and CABALLERO 1995 Down can be related directly to the current prediction. With random mating and assuming {alpha}I,s = {alpha}O = 0 (see SANTIAGO and CABALLERO 1995 Down for notation), Equations 21 and 36 of SANTIAGO and CABALLERO 1995 Down reduce to Nm[{alpha}2m + {alpha}2mQ2C2m] + Nf[{alpha}2f + {alpha}2fQ2C2f] (BIJMA et al. 1999 Down). This can be equated directly to the first term of Equation 1, which shows that E(u2i,s) corresponds to [{alpha}2s + {alpha}2sQ2C2s], and also that {alpha}2sQ2C2s corresponds to ß2sE[(Ai,s - s)2]. SANTIAGO and CABALLERO 1995 Down use Q = , which is identical to our 1/(1 - {pi}). Furthermore, they use C2s = i2h2(1 - {kappa}h2), which is identical to our 2{lambda}2E[(Ai,s - s)2], where the 2 accounts for the mate.

The correction for deviations of Vn from Poisson variances can also be related to Equation 36 of SANTIAGO and CABALLERO 1995 Down. They use Vn(s)(s',s') = (see SANTIAGO and CABALLERO 1995 Down, Equation 30 and ignore the term C2sm), where ñs is the number of selection candidates per sex of a parent of sex sm = nod, ñf = no) and s' denotes the sex of the offspring. This is a binomial variance. The deviation from a Poisson variance (i.e., Ns'/Ns) equals Vn(s),dev(s', s') = . From Equation 36 of SANTIAGO and CABALLERO 1995 Down, the total correction of {Delta}F equals -1/8T-1, which is identical to Equation 8 (BIJMA et al. 1999 Down). Therefore, Equations 21, 30, and 36 of SANTIAGO and CABALLERO 1995 Down are identical to the current prediction for mass selection. A numerical difference between both methods exists because SANTIAGO and CABALLERO 1995 Down omit the correction for a finite number of parents when calculating their C2ss', which would be equivalent to omitting the (1 - 1/Ns) in Equation 5 of the current prediction.

Overlapping generations
Step 1: prediction of expected long-term genetic contributions: Genetic contributions are predicted using Equation 2 again, but now categories refer to sex-age class combinations, which are indexed by k instead of s, so that k = 1 ... 2cmax and ui,k is the expected genetic contribution of individual i originating from its selection in category k. Solutions for {alpha}k and ßk are obtained from WOOLLIAMS et al. 1999 Down,

(9)


(10)

where * denotes element-by-element multiplication, T denotes the transpose of matrices, I is the 2cmax x 2cmax identity matrix, N is a 2cmax x 2cmax diagonal matrix containing the numbers of parents selected from every category (nk), {Pi} is a 2cmax x 2cmax matrix with each element, {pi}kl, being the regression coefficient of the breeding value of a selected offspring in category k on the breeding value of the parent in category l, {Lambda} is a 2cmax x 2cmax matrix with each element, {lambda}kl, being the regression coefficient of the number of selected offspring in category k on the breeding value of the parent in category l, G is a 2cmax x 2cmax modified gene flow matrix connecting selected offspring to parental categories, D is a 2cmax x 2cmax matrix of deviations of breeding values from the mean of the selected group, {alpha} is a 2cmax vector of elements {alpha}l, and ß is a 2cmax vector of elements ßl. Generation interval (L) was calculated as the time interval in which genetic contributions sum to 1: L = (WOOLLIAMS et al. 1999 Down). More details and a numerical example are in BIJMA and WOOLLIAMS 1999 Down.

Contributions predicted from Equation 9 and Equation 10 are per year; i.e., they are the long-term contribution of a single cohort, not of a total generation. Rates of inbreeding predicted from these contributions therefore are also per year.

Step 2: derivation of E(u2i,s): For the calculation of E(u2i,s) one needs to find the lifetime expected genetic contribution; i.e., one has to account for the fact that individuals may be selected in multiple categories. With cmax age classes per sex and the ranking of individuals within age classes remaining constant, there are 2cmax exclusive categories, which will be indexed by s, i.e., individuals selected once, twice, up to cmax times for each sex. Therefore, s = 1 ... cmax denotes males selected 1 through cmax times and s = cmax+1 ... 2cmax denotes females selected 1 through cmax times. The expected lifetime contribution for these categories is ui,s = {Sigma}kui,k, where the sum is taken over the age-sex categories k from which i is selected. Thus individuals are indexed in two different ways, i.e., by whether or not they were selected at a specific age, denoted by k, and by how many times they were selected throughout their lifetime, denoted by s.

The scalar equivalent of the first term of Equation 1 is

with the first term denoting males and the second females. The summation over exclusive categories s can be written in terms of the categories k, for males,

(11)

and for females,

(12)

where min(nk,nl) denotes the minimum of nk and nl (see also example in Appendix B). These summations can be written in matrix form, so that for Poisson family size, the rate of inbreeding per year is

(13)

where 1 = (1 1 ... 1)T, No is similar to N but has a reordering of age classes within sexes so that they go from large to small according to the number of parents, and Uo is a 2cmax x 2cmax matrix containing a lower triangular submatrix for each sex (with categories ordered as in No), with E(u2i,k) on the diagonal and 2E(ui,kui,l) as off-diagonals in the lower triangular submatrices (see example in Appendix B). Note that, although Equation 1 uses exclusive categories s, we have expressed {Delta}FY in terms of the age-sex categories k in Equation 13. Thus, the expected genetic contributions for the categories k can be used directly in Equation 13. Rates of inbreeding per generation were calculated as E[{Delta}FL] = LE[{Delta}FY].

As with discrete generations, E(u2i,k) has to include terms for the mates. With overlapping generations, the mate term consists of two elements. The first element is due to the category of the mate as a deviation of the average category for the sex of the mate, {alpha}l - sex(l). The second term is due to the selective advantage of the mate within its category, ßl(Ai,l - l). Therefore, for males, ui,k = {alpha}k + ßk(Ai,k - k) + {Sigma}dj=1[({alpha}l - sex(l)) + ßl(Aj,l - l)]; and for females, ui,k = {alpha}k + ßk(Ai,k - k) + , where j denotes the mate, l the category of the mate, and sex the sex of the mate. For Equation 11 and Equation 12, expectations of squared contributions are obtained for males as

(14)

where k = 1 ... cmax, and for females as

(15)

where k = cmax+1 ... 2cmax and bars with subscripts m or f denote weighted averages over mate categories.

Cross-products in Equation 11 and Equation 12 arise only from the individuals selected in both categories, which are all the individuals selected from the smallest category [i.e., min(nk,nl)]. Cross-products are therefore

(16)

where subscript min denotes the category with the lower number of parents and subscript max denotes the category with the higher number of parents. (With random mating there is no covariance between different mates of i; therefore, there is no mate term in the cross-product.) A numerical example is in Appendix B.

Step 3: correction of E({Delta}Fy) for deviations of Vn from Poisson variances: The second term of Equation 1 is 1/81TN{delta}, where {delta} is a 2cmax vector of elements {delta}k = {alpha}TVn(k),dev{alpha}, and Vn(k),dev is a 2cmax x 2cmax matrix with deviations from Poisson variances (WOOLLIAMS and BIJMA 2000 Down). Similar to the discrete generation case, Vn(k) is approximated by a binomial variance. Elements of Vn(k),dev and a numerical example are in Appendix B.

Relation to NOMURA (1996): NOMURA 1996 Down developed predictions for the special case of equal numbers of parents per sex selected from every age class (denoted nm and nf), i.e., for constant selection intensity with age. With those schemes every parent is selected in every category (except for categories with zero parents) and there are only two exclusive categories; i.e., males selected always and females selected always. In this respect, schemes with equal numbers of parents selected from every age class are like discrete generations, i.e., only two categories that do not compete for being selected. BIJMA et al. 1999 Down show that Equations 30 and 31 of NOMURA 1996 Down reduce to {Delta}FY = nm[{alpha}2m + {alpha}2mQ2C2m] + nf [{alpha}2f + {alpha}2fQ2C2f], which is equivalent to the first term of Equation 1. This result is a rescaling of discrete generations, i.e., with discrete generation {alpha}s = , with overlapping generations and two exclusive categories, each contributing half, {alpha}s = , where L is the generation interval. Summation of contributions over the number of parents per generation shows that they sum to unity:

Furthermore, NOMURA 1996 Down calculated Q using (I - P)-1 [P is a gene flow matrix identifying the contribution of parental age groups to selected offspring multiplied by the proportion of genetic variance remaining after selection; NOMURA 1996 Down(Appendix)], which, for his special case, is equivalent to our (I - GT*{Pi}T)-1 (see WOOLLIAMS et al. 1999 Down, Equation 10). Analogous to SANTIAGO and CABALLERO 1995 Down, NOMURA 1996 Down calculated C2somitting the (1 - 1/ns). Contrary to SANTIAGO and CABALLERO 1995 Down and to the present study, NOMURA 1996 Down included a term C2ss' in the calculation of Vn(s) [first term in Equation 22 of NOMURA 1996 Down]. Finally, NOMURA 1996 Down considered only one generation inheritance of selective advantage when he calculated the total contribution of age classes. [See Equation 8 of NOMURA 1996 Down, which is equivalent to solving {alpha} from N{alpha} = GTN{alpha} instead of using Equation 9 (WOOLLIAMS et al. 1999 Down).]

Stochastic simulation: To examine the accuracy of the prediction equations, the breeding scheme described in the "population model" section was simulated and rates of inbreeding were calculated from simulated data. The simulation procedure is described in BIJMA and WOOLLIAMS 1999 Down. In the simulated data, an ancestor cohort t1 and a descendent cohort t2 were chosen (BIJMA and WOOLLIAMS 1999 Down). Inbreeding coefficients of individuals in cohorts t1 and t2 were calculated from the simulated pedigree, using the algorithm of MEUWISSEN and LUO 1992 Down. Rates of inbreeding per year were calculated as {Delta}Fy = 1 - [](t2-t1)-1, where t1 and t2 are the average inbreeding coefficients in cohorts t1 and t2, respectively. Rates of inbreeding per generation were calculated as {Delta}FL = L{Delta}Fy. Results were averaged over 500 replicates.


*  RESULTS
*TOP
*ABSTRACT
*DERIVATION OF EXPRESSIONS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*LITERATURE CITED

Discrete generations:
For examination of the accuracy of predictions and to identify the origin of prediction errors, Table 1 shows simulated and predicted {Delta}F. Two types of predictions are in Table 1: {Delta}Fpred* is the prediction using {alpha} and ß estimated from simulation, and {Delta}Fpred is the full deterministic prediction using {alpha} and ß from Equation 9 and Equation 10. Differences between {Delta}Fpred and {Delta}Fpred* reflect prediction errors originating from the prediction of ß [in discrete generations, {alpha}s = is known]. Differences between {Delta}Fsim and {Delta}Fpred* reflect errors in Equation 1.


 
View this table:
[in this window]
[in a new window]

 
Table 1. Rates of inbreeding from simulation ({Delta}Fsim) and from prediction ({Delta}Fpred*, {Delta}Fpred) for populations with discrete generations

Generally, errors of the full prediction in Table 1 are small, most errors are below 5%, maximum errors are up to 8.1%, and trends agree well between simulations and predictions. Though errors are small, some trends can be observed. Most errors are positive and errors tend to be highest for Nm = 10, but errors tend to be negative for no = 8 and Nm = 100. Prediction errors are partly due to errors in the prediction of ß; i.e., {Delta}Fpred* is generally more accurate than {Delta}Fpred. Because we have approximated the hypergeometric variance of family size by a binomial variance, positive errors for small numbers of parents were expected. The correction for hypergeometric variances becomes larger with fewer parents (BURROWS 1984B Down), whereas a binomial correction is unaffected by the number of parents. Because the correction is a negative value, a binomial correction results in an overprediction for small numbers of parents. The current prediction was compared to the prediction of SANTIAGO and CABALLERO 1995 Down. As expected from the close agreement between equations of both methods, both methods gave very similar results (BIJMA et al. 1999 Down).

Fig 1 shows the relationship between {Delta}F and heritability (h20), for Nm = Nf = 20 and for three selection intensities (no = 2, 8, or 32 -> i = 0, 1.271, or 1.967). Though relationships of {Delta}F with heritability and selection intensity can be inferred from other studies (e.g., WRAY and THOMPSON 1990 Down), they have never been explored in detail.



View larger version (16K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 1. Relation of predicted (lines) and simulated (symbols) rates of inbreeding ({Delta}F) with heritability (h20) for populations with discrete generations, with 20 sires and 20 dams and varying number of offspring per dam (no). no = 2: {square}, {Delta}Fsim; · · ·, {Delta}Fpred. no = 8: {triangleup}, {Delta}Fsim; - - -, {Delta}Fpred. no = 32: {circ}, {Delta}Fsim; —, {Delta}Fpred.

Fig 1 shows that {Delta}F has a maximum for intermediate heritabilities (except for no = 2), and changes in {Delta}F are more pronounced with greater selection intensity. The maximum of {Delta}F for intermediate h2 is due to the Bulmer effect. When the Bulmer effect is ignored in Equation 7 (i.e., {kappa} = 0) the rate of inbreeding increases with h2 over the whole range. The logic behind this is that with increasing h2 the reduction of between-family variance increases, reducing the importance of the family component in the phenotype. [Note also that the intraclass correlation between full sibs [{rho} = h2(1 - {kappa}h2)] has a maximum for h2max = , which for a common value of {kappa} = 0.8 equals h2max = 0.625]. For h20 = 0 and with Poisson family size, Equation 1 reduces to E[{Delta}F] = + = 0.0125 (WRIGHT 1969 Down, p. 212).

With no = 2, one male and one female offspring are selected from every pair of parents, which gives zero variance of family size, ß = 0, and minimal inbreeding. Expected long-term genetic contributions are equal for all parents and the variance of the contributions is zero; i.e., expected and realized contributions are equal. The absence of variance of family size with no = 2 is taken into account by the correction of {Delta}F for deviations of Vn from Poisson variances. Without this correction, {Delta}Fpred is equal to a situation with h20 = 0 and Poisson family size, resulting in {Delta}Fpred = 0.0125. The correction halves the prediction to 0.00625. This is an established result (FALCONER and MACKAY 1996 Down, p. 69). In the absence of variance of family size (which can only be achieved for d = 1), effective population size equals twice the actual population size: Ne = 2(Nm + Nf) and E({Delta}F) = = 0.00625.

With higher selection intensities (no = 8 or 32), {Delta}F increases considerably with heritability. For example, for h2 = 0.6, {Delta}F increases by 54% compared to random selection (i.e., h2 = 0) for no = 8, and by 105% for no = 32. The large increase of {Delta}F with selection intensity originates from the regression of the number of selected offspring on the breeding value of the parent, which is linear in i ({lambda} = i{sigma}-1P), giving a quadratic term in {Delta}F (Equation 7). Large values of {lambda} indicate that the population descends for a large proportion from only a few ancestors.

For practical selection intensities (no = 2, 8), there is close agreement between {Delta}Fpred and {Delta}Fsim. For large selection intensities errors are larger (e.g., for no = 200, Nm = Nf = 40 and h2 = 0.4, an error of -18% was found). Large errors for extreme selection intensities do not undermine the general theory, i.e., Equation 1 is still valid, but the linear model (Equation 2) may be insufficient to predict expected genetic contributions (WOOLLIAMS and BIJMA 2000 Down).

Overlapping generations:
Table 2 shows simulated and predicted rates of inbreeding per generation and generation intervals. Predictions of {Delta}F using {alpha} and ß from simulation (such as {Delta}Fpred* in Table 1) are not included, because standard errors on ß were too large to draw conclusive inferences. Because the potential number of alternative schemes is very large with overlapping generations, a wide range of schemes was evaluated. Only schemes 1, 3, 5, 6, and 7 are within the scope of NOMURA 1996 Down. Schemes 1–5 represent a situation with two age classes with gradually increasing ages of females. Scheme 6 is similar to scheme 5 but with a mating ratio of two. Scheme 7 has equal numbers of parents in all categories. With schemes 8 and 9, parents were ranked on estimated breeding value [EBVi,k = h2 (Pi,k - k)] across age classes and the highest ranking Nm males and Nf females were selected across age classes, which gives the highest genetic level of the offspring in the next cohort (JAMES 1987 Down). This strategy resulted in N = diag{19, 1, 38, 2} for no = 8 and N = diag{18, 2, 33, 7} for no = 4 (for both h20 = 0.2 and 0.5). Furthermore, some arbitrary schemes with three and four age classes were evaluated to show that predictions are also accurate for more than two age classes. Prediction errors of {Delta}FL were small, with most <5%. The maximum error was 6.6% and most errors were positive. Similar to the case with discrete generations, positive errors for small numbers of parents were expected due to the binomial approximation for the variance of family size.


 
View this table:
[in this window]
[in a new window]

 
Table 2. Rates of inbreeding per generation from simulation ({Delta}Fsim) and from prediction ({Delta}Fpred) and generation intervals from simulation (Lsim) and prediction (Lpred) for populations with overlapping generations

Generation intervals are systematically underpredicted in Table 2 (except for schemes with only one reproductive category per sex in which case L is fixed; schemes 1, 5, and 6). The underprediction is entirely explained by the way Lsim is calculated, i.e., Lsim = , where, Lk = ; i.e., the generation interval is calculated per replicate as the time in which genetic contributions sum to unity and subsequently averaged over replicates (BIJMA and WOOLLIAMS 1999 Down). However, if {alpha} was averaged over replicates and Lsim was calculated from the average, i.e., Lsim = , then Lpred and Lsim were in very close agreement (results not shown). This result was expected from the nonlinear relationship between L and {alpha}, so that E[L] differs from .

Results from the current prediction were compared to results from the prediction of NOMURA 1996 Down for the special case of equal numbers of parents selected from every age class. [A comparison was made for all schemes presented by NOMURA 1996 Down.] As expected from theory, results from both methods were similar (BIJMA et al. 1999 Down).

Relationship between {Delta}F and distribution of parents over age classes: Fig 2 shows the relationship between the rate of inbreeding (per year and per generation) and the proportion of parents selected from the second age class (p2), for a population with two age classes, Nm = Nf = 20, h20 = 0.4, and no = 10. With the exception of p2 = 0, 0.5, and 1.0, these schemes are beyond the scope of NOMURA 1996 Down. Generation interval was also included in Fig 2. On the horizontal axis, parents are shifted from all parents in the first age class (p2 = 0, N = diag{20, 0, 20, 0}) to all parents in the second age class (p2 = 1, N = diag{0, 20, 0, 20}). For p2 = 1, no predictions are presented (i.e., no lines, only symbols), because in this scheme there are two distinct subpopulations that do not mix; i.e., individuals born in odd-numbered cohorts are one population and individuals born in even-numbered cohorts are the other population. This scheme violates the assumption of one random mating population in the derivation of {Delta}Fpred. Therefore, the populations should be treated separately, which resulted in accurate predictions. Despite the complex relationship between {Delta}F and p2 in Fig 2, where, for example, {Delta}FY is nearly constant before declining sharply, accurate predictions were obtained throughout. The rate of inbreeding per year has a flat curve with a maximum for p2 = 0.5, because the increase of {Delta}FL with p2 is counteracted by an increase in the generation interval, and as a result, {Delta}FY = shows only slight increase before p2 = 0.5 and steep decrease after p2 = 0.5.



View larger version (14K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 2. Relation of the proportion of parents from the second age class (p2) with predicted (lines) and simulated (symbols) generation intervals (L) and with rates of inbreeding per year ({Delta}Fy), and per generation ({Delta}FL), for a population with two age classes, Nm = 20, Nf = 20, h20 = 0.4, and no = 10. •, {Delta}Fy,sim; · · ·, {Delta}Fy,pred; {blacktriangleup}, {Delta}FL,sim; - - -, {Delta}FLpred; *, Lsim; —, Lpred.

For random selection, HILL 1979 Down showed that the rate of inbreeding in overlapping generations is related to the lifetime variance of family size and the number of parents entering the population per generation. The same pattern can be observed in Fig 2, which shows that {Delta}FL has a maximum when parents are equally distributed over age classes, i.e., for N = diag{10, 10, 10, 10}, where the 10 parents selected in age class 1 the first year are the same as the 10 parents selected in age class 2 the next year. Thus only 10 distinct parents are selected from every cohort for this scheme, and with L = 1.41 the number of parents entering the population per generation equals only 14.1. For N = diag{20, 0, 20, 0}, 20 distinct parents are selected from every cohort and with L = 1, 20 parents enter the population per generation. The rate of inbreeding per generation reaches a minimum for p2 = 0.95 (N = diag{1, 19, 1, 19}). At first glance, this result is counterintuitive; i.e., one might expect approximately equal rates of inbreeding per generation for N = diag{19, 1, 19, 1} and for N = diag{1, 19, 1, 19}. However, for N = diag{1, 19, 1, 19}, 19 distinct individuals are selected from every cohort and, with L = 1.90, the number of parents per generation equals 36.1.

Line subdivision and migration: As mentioned earlier, the scheme with N = diag{0, 20, 0, 20} has two nonmixing lines. Changing this scheme to N = diag{1, 19, 1, 19} is equivalent to allowing some migration between both lines. Fig 3 shows a comparison between full line subdivision, line subdivision with migration, and one single line for schemes with 2 or 3 age classes. Note that the total number of parents per year is equal per comparison. The comparison shows that allowing some migration between lines substantially reduces {Delta}FL (i.e., 0.0104 vs. 0.0141 and 0.0075 vs. 0.0141). The smallest {Delta}F is obtained when lines are joined together ({40, 40} with a cohort interval of 2 years and {60, 60} with a cohort interval of 3 years). When comparing these rates of inbreeding, it must be realized, however, that the schemes with full line subdivision accumulate a between-line genetic variance equal to 2(1 - )F{sigma}2A0, where the (1 - 1/nlines) accounts for the fact that the mean is estimated from the sample; i.e., the variance is the observed variance in the sample (FALCONER and MACKAY 1996 Down, p. 265). The total genetic variance at time t, i.e., {sigma}2A,t = {sigma}2A,between + {sigma}2A,within, equals {sigma}2A0 for N = diag{0, 20, 0, 20} and {sigma}2A0(1 + Ft) for N = diag{0, 0, 20, 0, 0, 20} and therefore the genetic variance is larger with full line subdivision.



View larger version (23K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 3. Comparison between full line subdivision, partial migration, and one line, with an equal total number of parents per year. Numbers at lines represent the number of parents per sex, h20 = 0.4, no = 6.

Relationship between {Delta}F and heritability: Fig 4 shows the relationship between h20 and {Delta}FL, for two breeding schemes. The first scheme (S1) has most parents in the first age class, N = diag{16, 4, 16, 4}, whereas the second scheme (S2) has most parents in the second age class, N = diag{4, 16, 4, 16}. With S1, {Delta}FL has a maximum for h20 = 0.5–0.6, similar to the discrete generation case (see Fig 1). With S2, however, {Delta}FL increases with heritability over the whole range. The increase of {Delta}FL with h20 for S2 is mainly due to an increased contribution of parents in age class 1 at high heritabilities. With high heritability, genetic gain is large, which gives offspring of 1-year-old parents an increased selective advantage. This increases the contribution of parents in age class 1 relative to the contribution of parents in age class 2. For example, with S2 and h20 = 0.5, expected genetic contributions of average parents are {alpha}T = [0.027 0.012 0.027 0.012], whereas for h20 = 0.9, expected genetic contributions of average parents are {alpha}T = [0.040 0.011 0.040 0.011]. This result shows that with increasing h20 the genetic contributions become distributed more unequally over parents, resulting in a higher sum of squared contributions and therefore in an increased {Delta}F. Furthermore, with S2, ß increases with heritability, resulting in increased differences between genetic contributions of different parents selected from the same category, which further increases {Delta}F.



View larger version (16K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 4. Relation of heritability with simulated (symbols) and predicted (lines) generation interval (L) and with the rate of inbreeding per generation ({Delta}FL), with no = 8, for two different breeding schemes, S1, N = diag{16, 4, 16, 4} and S2, N = diag{4, 16, 4, 16}. S1: {circ}, {Delta}FL,sim; - - -, {Delta}FL,pred; {triangleup}, Lsim; · · ·, Lpred. S2: •, {Delta}FL,sim; —, {Delta}FL,pred; {blacktriangleup}, Lsim; – - –, Lpred.

Rates of inbreeding per year can be obtained from Fig 4 as {Delta}FY = , which shows the same trends with h20 as {Delta}FL. In conclusion, results from Fig 4 show that in contrast to the case of discrete generations, no general pattern can be observed in the relationship between {Delta}F and h20 with overlapping generations.


*  DISCUSSION
*TOP
*ABSTRACT
*DERIVATION OF EXPRESSIONS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*LITERATURE CITED

Explicit prediction equations for rates of inbreeding in populations with either discrete or overlapping generations under mass selection were developed, on the basis of the approach of WOOLLIAMS and BIJMA 2000 Down and WOOLLIAMS et al. 1999 Down. Except for extreme selection intensities in females, predictions were accurate for discrete as well as for overlapping generations. Though based on a different approach, the current method extends the method of NOMURA 1996 Down to populations with overlapping generations and an arbitrary distribution of parents across age classes, removing the stringent restriction of NOMURA 1996 Down. Relationships between rates of inbreeding and genetic and population parameters were also presented. General relationships apparent in discrete generations could not be extended to overlapping generations. For the prediction of rates of inbreeding in overlapping generations it is crucial to account for the inheritance of selective advantage both between and within categories. For discrete generations with only two categories (males and females), which do not compete for selection, only competition between selection candidates within categories is relevant.

The current method was compared to methods based upon the proportion of genetic variance transmitted to the offspring, which showed that with random mating, the equations of both SANTIAGO and CABALLERO 1995 Down and NOMURA 1996 Down can be reduced to simple expressions in terms of expected genetic contributions. SANTIAGO and CABALLERO 1995 Down suggested that the differences between their results using the reduced genetic variance and those of WOOLLIAMS et al. 1993 Down using long-term contributions were due to the difference in approach. The present results show that the differences obtained previously were most likely due to errors in the derivations involving complex pathways over multiple generations that were needed by WOOLLIAMS et al. 1993 Down. These complexities were avoided by SANTIAGO and CABALLERO 1995 Down. However, WOOLLIAMS and BIJMA 2000 Down were able to derive the present results using long-term contributions by modeling the transfer of selective advantages in a single generation by assuming an equilibrium. The idea of basing the prediction on Bulmer's equilibrium variances was introduced by SANTIAGO and CABALLERO 1995 Down. However, their approach to modeling the inheritance of selected advantage by the proportion of genetic variance retained is correct only for mass selection [see WOOLLIAMS et al. 1999 Down for a general approach].

Prediction errors became large when the number of selection candidates per dam became extremely large (Fig 2), but these situations are out of the range of most artificial selection programs. Certain species (e.g., fish or chicken) are able to produce many offspring per dam, but the number of selection candidates per dam is generally lower. High selection intensities in males can easily be obtained with a limited number of selection candidates per dam when the mating ratio is large. For these situations predictions were accurate (see Table 1, schemes with d = 5, no = 8 -> i = 2.063). The errors with large no were not present for low h2 (results not shown), which indicates that the current method is also applicable to species with a large number of offspring when natural directional selection acts on a trait with low heritability.

In this article, equations for predicting rates of inbreeding were developed assuming a model of truncation selection on a normally distributed trait controlled by an infinitesimal model of gene effects. The predicted rate of inbreeding relates to homozygosity (by descent) at a neutral locus, unlinked to genes affecting the trait under selection (WOOLLIAMS and BIJMA 2000 Down). When the infinitesimal model does not hold, and the number of genes affecting the trait is large, or when the number of chromosomes is small, it is questionable whether neutral and unlinked loci exist at all. When loci are nonneutral, or linked to nonneutral loci, predicted rates of inbreeding cannot be related directly to the homozygosity at the locus, because a covariance between the genetic contribution and the gene frequency will arise due to selection (WOOLLIAMS and BIJMA 1999). However, the rate of inbreeding can still be related to rates of inbreeding obtained by analyzing pedigrees using WRIGHT 1922 Down path coefficient method, or MALECOT 1948 Down coefficient of kinship, and also to estimates of inbreeding depression based on inbreeding levels calculated from the pedigree. Recently, SANTIAGO and CABALLERO 1998 Down extended prediction methods for effective population size to populations with linked loci undergoing mass selection but for discrete generations only.

In general, to obtain accurate predictions of {Delta}F one needs to account for more than one generation of inheritance of selective advantage between categories. It was sufficient for NOMURA 1996 Down to account for only a single generation because of the special case of equal numbers of parents per age class. In that case, shifting contributions between age classes has only a minor effect on {Delta}F because the contributions will remain with the same individuals with the same relative fitness, because every individual is selected in every category. Therefore the lifetime contribution will not be affected. For schemes where the number of parents differs between age classes, shifting of contributions between categories means shifting to other individuals (at least partly), which will affect the lifetime contribution. Consider, for example, scheme 10 in Table 2 with h20 = 0.5: accounting for only one generation of inheritance (i.e., calculating {alpha} from N{alpha} = GTN{alpha}; WOOLLIAMS et al. 1999 Down) gives {Delta}Fpred = 0.0128, an error of -21%; whereas using Equation 9 gives {Delta}Fpred = 0.0159, an error of only -2%.

The use of the concept of long-term genetic contributions to predict rates of inbreeding has several appealing properties. First, the derivation of the relationship between rates of inbreeding and genetic contributions is based directly on the probability of alleles being identical by descent, which enhances the intuitive understanding (WOOLLIAMS and BIJMA 2000 Down). Furthermore, rates of genetic gain can easily be obtained from the covariance between the genetic contribution and the Mendelian sampling component of the breeding value (WOOLLIAMS and THOMPSON 1994 Down; WOOLLIAMS et al. 1999 Down), which integrates methods for predicting genetic gain and rates of inbreeding. Finally, the prediction procedure for genetic contributions describes mechanisms determining the impact of current individuals on future populations and the turnover rate of genes and gives therefore an understanding of the mechanisms determining the development of the pedigree (WOOLLIAMS et al. 1999 Down; BIJMA and WOOLLIAMS 1999 Down). Because the approach is general, it is clear how prediction equations can be extended to other situations.

With a fixed total number of parents selected per year, populations showed maximum rates of inbreeding (per year and per generation) when the number of parents entering the populations per generation was least, which occurred with an equal number of parents in every age class. Rates of inbreeding were smallest when most parents were in the older age classes, because those schemes had the largest number of parents entering the population per generation. This result broadly resembles the results of HILL 1974 Down for random selection in overlapping generations, although selected populations have an additional component of inbreeding arising from the expected differential contributions within age classes, which will modify this relationship. Schemes with most parents in the later age classes resembled population subdivision with some migration between lines. Because the selective advantage of categories depends on heritability, genetic contributions of categories are strongly affected when heritability changes (BIJMA and WOOLLIAMS 1999 Down); i.e., contributions generally shifted to the younger age classes when heritability increased. Therefore, no general relationship between heritability and rate of inbreeding could be observed with overlapping generations.

In this article, equations were developed to predict rates of inbreeding for diploid populations with two sexes under controlled selection. The results are therefore primarily relevant for populations under artificial selection, for example, in animal breeding or in selection experiments. Though this article focuses on mass selection within age classes, results for mass selection across age classes can easily be accomplished by choosing the appropriate N, as in schemes 8 and 9 in Table 2. An extension to a situation where individuals in older age classes have more information, e.g., progeny information, only requires the calculation of probabilities of selecting the same individual on different ages, which can be done using standard index theory. The method can also be extended to other selection strategies and modes of inheritance (e.g., index selection and imprinting), using the key results of WOOLLIAMS and BIJMA 2000 Down and WOOLLIAMS et al. 1999 Down.

In animal breeding, optimization of breeding programs has focused for a long time on the maximization of genetic gain for the short term, partly because methods to predict long-term response were not available. When rates of inbreeding in selected populations can be predicted, predictions of long-term response under the infinitesimal model become available. This article enables methods for the optimization of breeding schemes on the long term (e.g., VILLANUEVA et al. 1996 Down; VILLANUEVA and WOOLLIAMS 1997 Down) to be extended to populations with overlapping generations and mass selection.


*  ACKNOWLEDGMENTS

J.A.W. gratefully acknowledges the Ministry of Agriculture, Fisheries and Food (United Kingdom) for financial support. Ab Groen is acknowledged for giving useful comments on the manuscript, and Tetsuro Nomura for generously sending us a copy of his programs. This research was financially supported by the Netherlands Technology Foundation (STW) and was coordinated by the Earth and Life Science Foundation (ALW).

Manuscript received March 30, 1999; Accepted for publication December 6, 1999.


*  APPENDIX A
*TOP
</