Genetics, Vol. 150, 921-930, October 1998, Copyright © 1998

Analysis of Population Structure in Autotetraploid Species

Joëlle Ronforta, Eric Jenczewskia, Thomas Bataillona, and François Roussetb
a Laboratoire de Génétique et d'Amélioration des Plantes, Institut National de la Recherche Agronomique, Domaine de Melgueil, 34130 Mauguio, France
b Laboratoire Génétique et Environnement, Institut des Sciences de l'Evolution, Université des Sciences et Techniques du Languedoc, 34095 Montpellier, France

Corresponding author: Joëlle Ronfort, Laboratoire de Génétique et d’Amélioration des Plantes, Institut National de la Recherche Agronomique, Domaine de Melgueil, 34130 Mauguio, France., ronfort{at}ensam.inra.fr (E-mail).

Communicating editor: M. SLATKIN


*  ABSTRACT
*TOP
*ABSTRACT
*HIERARCHICAL GENIC STRUCTURE AND...
*EQUILIBRIUM VALUES OF THE...
*POPULATION PARAMETERS ESTIMATION
*DISCUSSION
*APPENDIX 1
*LITERATURE CITED

Population structure parameters commonly used for diploid species are reexamined for the particular case of tetrasomic inheritance (autotetraploid species). Recurrence equations that describe the evolution of identity probabilities for neutral genes in an "island model" of population structure are derived assuming tetrasomic inheritance. The expected equilibrium value of FST is computed. In contrast to diploids, the correlation of genes between individuals within populations with respect to genes between populations (FST) may vary among loci due to the particular segregation patterns expected under tetrasomic inheritance and is consequently inappropriate for estimating demographic parameters in such populations. We thus define a new parameter ({rho}) and derive its relationship with Nm. This relationship is shown to be independent from both the selfing rate and the proportion of double reduction. Finally, the statistical procedure required to evaluate these parameters using data on gene frequencies distribution among autotetraploid populations is developed.


DUE to its frequent occurrence among angiosperm species (from 30 to 50%; STEBBINS 1971 Down; GRANT 1981 Down), polyploidy is now recognized as an important step in the evolutionary diversification of flowering plants (LEWIS 1980 Down; LEVIN 1983 Down; STEBBINS 1985 Down; THOMPSON and LUMARET 1992 Down; SOLTIS and SOLTIS 1993 Down; BRETAGNOLLE and THOMPSON 1995 Down, BRETAGNOLLE and THOMPSON 1996 Down; PETIT et al. 1996 Down, PETIT et al. 1997 Down). Polyploid species are commonly classified in two major types according to their presumed origin: allopolyploids are thought to result from hybridization between different taxa and subsequent chromosome doubling, while autopolyploids presumably stem from the chromosome doubling of the same genome, primarily by fusion of unreduced gametes (BEVER and FELBER 1992 Down; BRETAGNOLLE and THOMPSON 1995 Down). Autotetraploidy was originally thought to be rare and maladaptive as compared to allopolyploidy. However, a growing number of studies using genetic information in addition to cytological and morphological traits confirm that autopolyploids are more common and of greater evolutionary importance than originally appreciated (LEVIN 1983 Down; CRAWFORD 1985 Down; RIESEBERG and DOYLE 1989 Down; SOLTIS and SOLTIS 1989 Down).

Due to the addition of divergent genomes, inheritance in allopolyploids is disomic; i.e., pairing behavior during meiosis is similar to that of nonhomologous pairs of chromosomes in diploids. In contrast, segregation patterns in autopolyploids are much more complex because more than two homologous chromosomes can pair during meiosis. Multivalents leading to polysomic inheritance are formed. This does not necessarily lead to random assortments of homologous chromosomes into gametes; two sister chromatids may also segregate into the same gamete (Figure 1). This phenomenon, known as "double reduction," is specific to autopolyploids. It increases the production of homozygous gametes as compared to what is expected under random chromosome segregation and is thus likely to alter many basic expectations of population genetics (BEVER and FELBER 1992 Down). Because the frequency of double reduction depends on the occurrence of crossovers between the centromere and the locus under consideration (Figure 1), segregation patterns are expected to vary among loci, obscuring predictions regarding genetic aspects of autopolyploids.



View larger version (47K):
In this window
In a new window
Download PPT slide
 
Figure 1. Possible segregation patterns of a locus in an autotetraploid individual following the formation of a quadrivalent. Type I describes the segregation patterns expected when there is no crossover between the centromere and the locus. The first division is then reductional. When a crossover occurs between the centromere and the locus (Types II and III), the first division can be either equational (Type II) or reductional (Type III). Under Type III, the second division may then lead to double reduction. In the present case, gametes ii and jj have undergone double reduction.

Probably because of the agronomic significance of polyploid species, the consequences of polysomic inheritance and double reduction have been investigated, especially for self-fertilization and regular systems of inbreeding (HALDANE 1930 Down; DEMARLY 1963 Down; BENNETT 1968 Down; GALLAIS 1990 Down). In contrast, few investigations have dealt with the amount and patterns of genetic variation among naturally occurring autopolyploid populations, and theoretical models incorporating population structure and estimation procedures are still lacking for tetrasomic inheritance (GLENDINNING 1989 Down; BEVER and FELBER 1992 Down for review; MOODY et al. 1993 Down).

For diploids, the distribution of genetic diversity within and among natural populations is commonly analyzed using theoretical models of population structure, for instance, the island model or the stepping stone model. Functions of probabilities of gene identity within and between units (populations, subpopulations), such as FST (WRIGHT 1951 Down), can be estimated using isozyme or DNA-based marker diversity and can be compared to expectations under specific models such as Wright's island or isolation by distance models (SLATKIN and BARTON 1989 Down; ROUSSET 1997 Down). The relationships between estimates and expectations can then be used to quantify gene flow between the studied units or even to understand how ecological and life history traits may influence the distribution of genetic variation within and among populations (see, for example, LOVELESS and HAMRICK 1984 Down; HAMRICK and GODT 1990 Down). However, models of population structure as well as estimation procedures have been almost exclusively devoted to diploid populations (see, however, WRIGHT 1938 Down).

The aim of this article is to develop a theoretical framework for the analysis of population structure in autotetraploid species. Recurrence equations that describe probabilities of gene identity under the island or isolation by distance models may be generalized for the case of tetrasomic inheritance; the case of the island model is given here as an illustration. Equilibrium values for traditional F-statistics parameters are derived. Because the proportion of double reduction may vary over loci, we define an additional function of probabilities of gene identity. This parameter seems appropriate to analyze population structure in autotetraploids, because its relationship with the migration rate and the population size is shown to be independent from both the selfing rate and the proportion of double reduction. Finally, following WEIR and COCKERHAM 1984 Down, we define estimators for the different parameters using the analysis of variance framework.


*  HIERARCHICAL GENIC STRUCTURE AND DEFINITION OF PARAMETERS
*TOP
*ABSTRACT
*HIERARCHICAL GENIC STRUCTURE AND...
*EQUILIBRIUM VALUES OF THE...
*POPULATION PARAMETERS ESTIMATION
*DISCUSSION
*APPENDIX 1
*LITERATURE CITED

Let Q stand for the probability of identity, Q0 for pairs of genes within individuals, Q1 for pairs of gene between individuals within subpopulations, and Q2 for pairs of genes between subpopulations. Throughout this article, the notation Qj will refer to probabilities of identity in state (IIS) and the j indices (j = 0 to 2) to the same pairs of genes. The addition of a dot on the top of a parameter will denote probabilities of identity by descent (IBD) (i.e., j), and the standard notation {equiv} will be used to distinguish the definition of parameters from their values under particular models of population structure.

Under tetrasomic inheritance, four genes are available at a given locus. Then, a random pair of genes within individuals (Q0) can be issued either from the same gamete (probability 1/3) or from two different gametes (probability 2/3). If QA and QB denote the probability of IIS associated, respectively, with these two categories of pairs of genes, then Q0 = .

Following COCKERHAM and WEIR 1987 Down, COCKERHAM and WEIR 1993 Down; (see also ROUSSET 1996 Down), F-statistics parameters can be defined as

(1)

(2)

(3)

Another parameter we will consider is

(4)

This parameter is analogous to the "correlation between truly outcrossed mates" in diploids (WALLER and KNIGHT 1989 Down; TACHIDA and YOSHIMARU 1996 Down). For diploids, interest in this correlation has come from the fact that the relationship between this parameter, the migration rate, and the population size is independent from the selfing rate (see NAGYLAKI 1983 Down; TACHIDA and YOSHIMARU 1996 Down). We will show that, for in autotetraploids, this relationship is moreover independent of the proportion of double reduction and therefore identical for all loci independently of their distance to the centromere.

Let us define r as the IBD probability for two genes in different individuals located either in two different subpopulations (r = 2) or in the same subpopulation (r = 1) and use the relationship between coalescence of genes and identity probabilities (MALECOT 1975 Down; TACHIDA 1985 Down; SLATKIN and VOELM 1991 Down): the probability of IBD for a pair of genes is the probability that neither gene has mutated between the present time and the time of first common ancestry, that is, their coalescence time (MALECOT 1975 Down; SLATKIN 1991 Down). This yields the expression

(5)
where µ is the mutation rate per generation, P(t) the probability that two genes coalesce at generation t in the past, and T a random variable that describes the coalescence time for these two genes. As shown by SLATKIN and VOELM 1991 Down(see also TACHIDA and YOSHIMARU 1996 Down), if we think of the process as going backward in time, then T can be divided in two phases: T1, the waiting time for two genes to be found in the same individual, and T2, the time for the two genes in the same individual to coalesce (Figure 2). As a result, and since T1 and T2 are independent,

(6)



View larger version (20K):
In this window
In a new window
Download PPT slide
 
Figure 2. Waiting times for coalescence of two genes located in two different individuals that are at distance x at the present time (x = 1 for individuals located in the same population; x = 2 for individuals in two different populations) and their associated probabilities. Populations are represented by large ellipses; small circles represent genes (four such circles denote an individual) shaded in black or gray for the genes studied, in white for genes not considered.

Because T2 corresponds to a coalescence time, then using the relationships between the coalescence of genes and identity probabilities, E[(1 - µ)2T2] represents the IBD probability for a pair of genes when both are sampled in the same individual (Figure 2):

(7)

Unlike T2, T1 in this instance is not a coalescence time but rather the "waiting time" for two genes initially at distance r to migrate within the same individual (Figure 2). To define T1, we do not make any reference to identity between the two genes under consideration, and this waiting time will depend only on the initial distance between the two genes (r = 1 or 2) and on the way genes migrate within and between subpopulations. Hence, E[(1 - µ)2T1] , which we will denote r in what follows, is not an IBD probability but simply denotes the probability that neither gene has mutated during T1. Since double reduction affects only transition probabilities for genes within individuals, it does not affect T1 nor r. These two parameters are consequently independent from the proportion of double reduction.

Now, following (2), and using (3), the IBD probability for two genes in different individuals at distance r reduces to

(8)
and, putting this formula into (4), yields the following expression for {rho}·:

(9)

This parameter is of interest for two reasons: (1) Because the rs are independent of the coefficient of double reduction, this equation shows that this is also true for {rho}·; (2) as will be shown later, the expected value of {rho}· can be deduced with minimal effort from previous models of haploid populations.

Consider now,

(10)

Noting that

we can always write

which reduces to

(11)

This may be compared to the result of the diploid model with selfing (TACHIDA and YOSHIMARU 1996 Down), in which one can write

(12)


*  EQUILIBRIUM VALUES OF THE PARAMETERS IN AN ISLAND MODEL
*TOP
*ABSTRACT
*HIERARCHICAL GENIC STRUCTURE AND...
*EQUILIBRIUM VALUES OF THE...
*POPULATION PARAMETERS ESTIMATION
*DISCUSSION
*APPENDIX 1
*LITERATURE CITED

We consider a finite island model (WRIGHT 1951 Down) of population structure: a set of n subpopulations, each consisting of N individuals, with nonoverlapping generations. Individuals are monoecious and subpopulations exchange migrant gametes at a rate m. Each migrant has an equal chance of coming from each of the other n - 1 subpopulations. Genes are assumed to be neutral and the mutation rate µ is the same for all alleles. Following NAGYLAKI 1983 Down and CROW and AOKI 1984 Down, two notations will be used. After migration, the proportion of pairs of genes that originate from one subpopulation in the previous generation is a = (1 - m)2 + for genes within a subpopulation, and b = for genes from different subpopulations. In each subpopulation, a proportion S of offspring is produced through selfing and the proportion of double reduction for the studied locus is denoted {alpha}.

When individuals are autotetraploid, there are 4N genes in each subpopulation. Then, provided neither gene has mutated [with probability {gamma} = (1 - µ)2], genes originating from the same subpopulation are identical by descent with probability (1 + 30)/4N + (1 - 1/N)1, while genes from different subpopulations are identical by descent with probability 2. The recurrence relations for 1 and 2 are as follows (t denoting time in generation):

(13)

(14)

Combining these two relationships, we obtain

(15)

At equilibrium, the Qi's do not change, hence

(16)

Using d = a - b, this equation can be expressed as

(17)

Noting that - 1 = , then substituting this into (10) and using (17), yields

(18)
which is the same result as in the diploid (or haploid) model. Using {theta} = , Equation 18 becomes

(19)

i.e.,

(20)
and, using (11),

(21)

As one may note, we do not need to know identity probabilities within subpopulations (Q0 and Q1) to derive these results. For diploids, the expected equilibrium value of FIS depends on the selfing rate (S), and the population size (N). For autotetraploids, it also depends on the proportion of double reduction that increases the proportion of homozygous gametes produced [see, for example, BENNETT 1968 Down for the case of a (single) large autotetraploid population]. When neither selfing nor double reduction are occurring in the population (i.e., S = 0 and {alpha} = 0), FIS = 0. Equation 21 can then be further simplified into

(22)

Expected values of can be computed for other mutation models as previously described (e.g., CROW and AOKI 1984 Down; ROUSSET 1996 Down), as well as for other geographical models. Under isolation by distance models, it can be shown that {rho}r/(1 - {rho}r) {approx} r/(2D{sigma}2) + Constant, for a pair of populations at distance r in a one-dimensional model, and {rho}r/(1 - {rho}r) {approx} ln(r)/(2D{pi}{sigma}2) + Constant, in a two-dimensional model, where D is the population density and {sigma}2 is a measure of dispersal (ROUSSET 1997 Down).


*  POPULATION PARAMETERS ESTIMATION
*TOP
*ABSTRACT
*HIERARCHICAL GENIC STRUCTURE AND...
*EQUILIBRIUM VALUES OF THE...
*POPULATION PARAMETERS ESTIMATION
*DISCUSSION
*APPENDIX 1
*LITERATURE CITED

Consider a dataset describing the genotypic constitution of autotetraploid individuals sampled (at random) from a set of r subpopulations. Each subpopulation is represented by ni individuals (sample size), where i refers to the ith subpopulation. To build estimators for the level of population differentiation, we use the linear model with hierarchical effects (subpopulations, individuals within subpopulations, and genes within individuals) developed by COCKERHAM 1969 Down, COCKERHAM 1973 Down for the analysis of diploid population structure. Now xijk is an indicator variable describing the state of the kth gene (1 <= k <= 4, instead of 1 <= k <= 2 for diploids) in the jth sampled individual (1 <= j <= ni) of the ith subpopulation (1 <= i <= r). For a particular allele u, xijk:u = 1, if the gene is u, xijk:u = 0 otherwise, and the ANOVA setup is as follows:

Using the same developments as for diploids (WEIR 1996 Down), the following sum of squares expectations can be derived (details are given in the Appendix 1): for genes within individuals

(23a)
for genes between individuals within subpopulations

(23b)
and for genes between individuals from different subpopulations

(23c)
where S1 = {Sigma}ini, S2 = {Sigma}in2i, Wd {equiv} S1 - r, Wa {equiv} S1 - , and Ww {equiv} r - 1 .

From Equation 23aEquation 23bEquation 23c, we obtain

(24)

(25)
which yield an estimator of FST:

(26)

Now, noting that 1 + 3Q0 - 4Q1 = 1 - Q0 + 4(Q0 - Q1) = , we have

(27)

An estimator of IT {equiv} 1 - is

(28)
and

(29)

For all these parameters, multilocus estimates (i.e., combining the information from all alleles and all loci) are defined as the sum of locus-specific numerators divided by the sum of locus-specific denominators (see also REYNOLDS et al. 1983 Down; WEIR 1996 Down). For example,

(30)
where l refers to the lth loci and u to the uth allele (with nl, the number of locus and ul, the number of alleles at locus l). Given the dependency of F-statistics on the proportion of double reduction (see above), multilocus estimates of these parameters will be appropriate to make inferences about the balance between migration (and/or mutation) and drift only if {alpha} = 0 for all the studied loci. As soon as {alpha} != 0 for at least one locus, only the estimate of {rho} will have this property.


*  DISCUSSION
*TOP
*ABSTRACT
*HIERARCHICAL GENIC STRUCTURE AND...
*EQUILIBRIUM VALUES OF THE...
*POPULATION PARAMETERS ESTIMATION
*DISCUSSION
*APPENDIX 1
*LITERATURE CITED

The aim of this study was to adapt the use of Wright's FST to estimate population structure and gene flow in autotetraploid species. In contrast to diploids, FST estimates in autotetraploids are expected to vary across the loci as a consequence of different amounts of double reduction during meiosis (Figure 1 and Introduction). This problem is illustrated in Equation 11 because FIS will vary depending on both the selfing rate and the proportion of double reduction ({alpha}). Since the proportion of double reduction for a given locus is difficult to assess empirically and because population structure estimates should be based on several loci, we defined a new function of identity probabilities, {rho}, which is an analogue to the "correlation between truly outcrossed mates" previously defined for diploids (WALLER and KNIGHT 1989 Down; TACHIDA and YOSHIMARU 1996 Down). For both diploids and tetraploids, the relationship between this correlation and the product Nm is independent from the selfing rate (except when selfing affects migration). For autotetraploids, interest in {rho} comes mainly from the fact that this relationship is also independent of the proportion of double reduction and therefore identical for all loci independently of their distance to the centromere. The parameter {rho} can consequently be used to assess population structure over many loci, without any prior knowledge concerning the proportion of double reduction.

Inspection of the relationship between {rho} and FST (11) shows that FST is increased by a factor (1 + 3FIS)/4 when self-fertilization or double reduction occurs within subpopulations. This means that like self-fertilization, double reduction reduces the effective subpopulation size and hence promotes differentiation among subpopulations (for the studied locus). The complication due to partial selfing or double reduction can be absorbed in the single parameter FIS and by defining the effective population size as NZ = . Equation 21 can then be used with NZ replacing N, i.e., = , while {rho} is still equal to {rho}/(1 - {rho}) {approx} 1/(2Nm{theta} + 2), which depends only on the migration rate, mutation rate, and the demographic population size (i.e., N, not NZ). Comparison of Equation 11 with the results of the diploid model (12) further shows that self-fertilization has a greater influence on differentiation in autotetraploids as compared to diploids.

When ignoring selfing and double reduction, the expected effect of drift under the island model of population structure is halved at equilibrium as compared to expectations for diploids, i.e., FST {approx} 1/(1 + 4Nm{theta} + 4) (CROW and AOKI 1984 Down; COCKERHAM and WEIR 1987 Down). This can be interpreted as the decrease in the rate of coalescence of genes within subpopulations and is due to the fact that the probability of drawing the same gene within an individual is reduced to 1/4 in an autotetraploid species instead of 1/2 in diploids. In other words, this means that, for a same demographic population size, the effective population size is doubled in an autotetraploid population as compared to a diploid one. This result is in accordance with earlier work on autotetraploids (MOODY et al. 1993 Down) that assumed that mutation alone was opposed to genetic drift (i.e., m = 0, FIS = 0, {alpha} = 0): FST = .

Following the linear model derived in COCKERHAM 1969 Down, COCKERHAM 1973 Down for diploid data, estimators for F-statistics and {rho} can be computed through hierarchical analyses of variance of gene frequencies. Simulations were performed to assess possible bias in the estimation of {rho} due to small sample sizes. We simulated a finite island model composed of n monoecious subpopulations of size N. In each subpopulation, 10 neutral, independent loci (recombination rate = 0.5), each with K possible allelic states (K-allele model), and all segregating according to the same proportion of double reduction ({alpha}) were modeled. Initial frequencies of the different allelic states were made equal in all the subpopulations (initial frequency = 1/K). Each subpopulation had the same mating system: complete outcrossing (S = 0) or partial selfing (S != 0). We assumed discrete and nonoverlapping generations. Mutation occurs at a rate µ per locus per generation, each allele having an equal chance to mutate toward one of the K-1 other allelic states. Migration occurs through male gametes only: to produce the next generation in a given subpopulation, each pollen grain was sampled independently, and with probability m it was chosen among gametes from the remaining n - 1 subpopulations. As shown in Figure 3, the discrepancies between the average value of the estimator and the expected value of {rho} are very small even for small sample sizes, with either S = 0 or S != 0 and {alpha} != 0.



View larger version (15K):
In this window
In a new window
Download PPT slide
 
Figure 3. Comparison of average values of the estimator () with expected equilibrium values of {rho}. An island model was simulated assuming K = 9 allelic states for each of 10 loci, n = 12 subpopulations and µ = 10-5 (see text for a complete description of the simulation procedure). To determine the number of generations required for the population to reach its equilibrium, sampling was performed in generations 200, 400, 1000, 2000, and 5000. For N = 50, the estimates were stable after 2000 generations. Each symbol gives the value of computed after 2000 generations and averaged over 10 replicated simulations (standard errors were always less than 10-3 and are therefore not shown) for N = 50, no selfing, {alpha} = 0. {circ} is for sample size e = 10, {bullet} for e = 30. Lines were computed using the expected equilibrium value of {rho}, i.e., {rho} {approx} for small Nm. Simulations were performed for two other parameter sets: N = 50, S = 0.2, {alpha} = 0; N = 50, no selfing, {alpha} = . {rho} is identical for these different parameter sets and the average values of the estimates () are too close to be distinguished on the figure.

We wrote a computer program estimating F-statistics and the parameter {rho} according to the ANOVA setup developed above (details of the computations are given in the appendix). The program provides estimations for {rho}, FST, FIS, and FIT for each allele as well as estimates combining data over alleles and over loci. To test for a departure from FST = 0, the program allows for Fisher's exact test on (population x genotypes) contingency tables [for each locus separately, see RAYMOND and ROUSSET 1995 Down for the diploid model]. Exact tests on contingency tables in which cell counts are tetraploid genotypes are valid even if there is double reduction. As for diploid datasets (RAYMOND and ROUSSET 1995 Down), the software allows for analysis either over the whole set of populations or for pairs of populations. The program containing both estimations and exact tests procedures is available upon request.


 
View this table:
In this window
In a new window

 
Table 1. Nested analysis of variance layout for estimation of the variance components of population structure in autotetraploid organisms, and corresponding gene frequency-based equations.


*  ACKNOWLEDGMENTS

We thank D. Couvet and P. Jarne for discussions, M. Raymond for advice concerning the computer program, and J. M. Prosperi for comments on the manuscript. This work was supported by a grant from the French "Bureau des Ressources Génétiques" to E.J. and J.R. This is contribution number 98-085 of the Institut des Sciences de l'Evolution.

Manuscript received March 4, 1998; Accepted for publication June 23, 1998.


*  APPENDIX 1
*TOP
*ABSTRACT
*HIERARCHICAL GENIC STRUCTURE AND...
*EQUILIBRIUM VALUES OF THE...
*POPULATION PARAMETERS ESTIMATION
*DISCUSSION
*APPENDIX 1
*LITERATURE CITED

Computation of expected sum of squares of gene frequencies involved in estimating {rho} and F-statistics: Let Eu denote the expectation of xijk:u, and {pi}u, the expected frequency of the allele u. Then {epsilon}[(xijk:u- Eu)2] = {pi}u -{pi}2u , where {epsilon} denotes expectation. Then, summing over all alleles, we obtain

(A1)
where Q3 denotes the identity probability for genes from different independent replicate populations, then, in the following, we write [xijk - E] for the sum over alleles. Using the relationship (xk - xk')2 = (xk - E)2 + (xk' - E)2 - 2 · (xk - E) · (xk' - E), we obtain a useful equation for the covariance of two genes, i.e.,

(A2)

This, derived for different pairs of genes, yields the covariances

(A3)
for genes within individuals,

(A4)
for genes between individuals within subpopulations, and

(A5)
for genes between individuals in different subpopulations. These relationships can be used to derive the expectations

i.e.,

(A6)

i.e.,

(A7)

i.e.,

(A8)
where S1 = {Sigma}ini and S2 = {Sigma}in2i .

Now, the basic relationship {epsilon}[{Sigma}riwi(xi - x)2] = {epsilon}[{Sigma}riwi (xi - E)2] - {epsilon}[{Sigma}riwi(x - E)2] can be used to write sum of squares expectations, for genes within individuals,

and using (A6) and (A7), we obtain

(A9)

Following the same procedure and denoting Wd {equiv} S1 - r, Wa {equiv} S1 - S , and Ww {equiv} r - 1 , we find the following sum of squares expectations: for genes between individuals within subpopulations (using A6 and A7),

(A10)
for genes between individuals from different subpopulations (using A7 and A8),

(A11)

As for diploids (COCKERHAM and WEIR 1987 Down), the components of variance of the nested ANOVA model (xijk:u = µu + {alpha}i:u + ßj:u + {epsilon}ijk:u) can be expressed as linear functions of identity probabilities, i.e.,

(A12)

(A13)
and

(A14)

ANOVA framework for the estimation of {rho} and F-statistics:
To compute sum of squares, straight way gene frequencies were used instead of indicator variables (xijk). This method is based on the following relationships between gene frequency estimates and the indicator variable [see WEIR 1996 Down for the diploid case],

where p˜Ai = and P˜AAi = P˜0,i + + with P0,i, P˜1,i , and P˜2,i standing, respectively, for the proportion of monogenic (AAAA), trigenic (AAAa), and digenic (AAab) individuals in the ith population (MALECOT 1948 Down). These relationships yield more convenient expressions for the variance components of the analysis, as shown in Table 1.


*  LITERATURE CITED
*TOP
*ABSTRACT
*HIERARCHICAL GENIC STRUCTURE AND...
*EQUILIBRIUM VALUES OF THE...
*POPULATION PARAMETERS ESTIMATION
*DISCUSSION
*APPENDIX 1
*LITERATURE CITED

BENNETT, J. H., 1968  Mixed self- and cross-fertilization in a tetrasomic species. Biometrics 24:485-500[Medline].

BEVER, J. D. and F. FELBER, 1992  The theoretical population genetics of autopolyploidy. Oxford Surv. Evol. Biol. 8:185-217.

BRETAGNOLLE, F. and J. D. THOMPSON, 1995  Gametes with somatic chromosome number: mechanisms of their formation and role in the evolution of autopolyploid plants. New Phytol. 129:1-22.

BRETAGNOLLE, F. and J. D. THOMPSON, 1996  An experimental study of ecological differences in winter growth between sympatric diploid and autotetraploid Dactylis glomerata. J. Ecol. 84:343-351.

COCKERHAM, C. C., 1969  Variance of gene frequencies. Evolution 23:72-84.

COCKERHAM, C. C., 1973  Analysis of gene frequencies. Genetics 74:679-700[Abstract/Free Full Text].

COCKERHAM, C. C. and B. S. WEIR, 1987  Correlations, descent measures: drift with migration and mutation. Proc. Natl. Acad. Sci. USA 84:8512-8514[Abstract/Free Full Text].

COCKERHAM, C. C. and B. S. WEIR, 1993  Estimation of gene flow from F-Statistics. Evolution 47:855-863.

CRAWFORD, D. J., 1985  Electrophoretic data and plant speciation. Syst. Bot. 10:405-416.

CROW, J. F. and K. AOKI, 1984  Group selection for a polygenic behavioral trait: estimating the degree of population subdivision. Proc. Natl. Acad. Sci. USA 81:6073-6077[Abstract/Free Full Text].

DEMARLY, Y., 1963  Génétique des tétraploïdes et amélioration des plantes. Ann. Amelior. Plantes 13:307-400.

GALLAIS, A., 1990 Théorie de la Selection en Amélioration des Plantes. Masson, Paris.

GLENDINNING, D. R., 1989  Some aspects of autotetraploid population dynamics. Theor. Appl. Genet. 78:233-242.

GRANT, V., 1981 Plant Speciation, Ed. 2. Columbia University Press, New York.

HALDANE, J. B. S., 1930  Theoretical genetics of autotetraploids. J. Genet. 22:359-372.

HAMRICK, J. L., and J. W. GODT, 1990 Allozyme diversity in plant species, pp. 43–63 in Plant Population Genetics, Breeding and Genetic Resources, edited by A. H. D. BROWN, M. T. CLEGG, A. L. KAHLER and B. S. WEIR. Sinauer Associates Inc., Sunderland, MA.

LEVIN, D. A., 1983  Polyploidy and novelty in flowering plants. Am. Nat. 122:1-25.

LEWIS, W. H., 1980 Polyploidy in angiosperms: dicotyledons, pp. 241–268 in Polyploidy, Biological Relevance, edited by W. H. LEWIS. Plenum Press, New York.

LOVELESS, M. D. and J. L. HAMRICK, 1984  Ecological determinants of genetic structure in plant populations. Annu. Rev. Ecol. Syst. 15:65-95.

MALÉCOT, G., 1948 Les Mathématiques de l'Hérédité. Masson, Paris.

MALÉCOT, G., 1975  Heterozygosity and relationship in regularly subdivided populations. Theor. Popul. Biol. 8:212-241[Medline].

MOODY, M. E., L. D. MUELLER, and D. E. SOLTIS, 1993  Genetic variation and random drift in autotetraploid populations. Genetics 134:649-657[Abstract].

NAGYLAKI, T., 1983  The robustness of neutral modes of geographical variation. Theor. Popul. Biol. 24:268-294.

PETIT, C., J. D. THOMPSON, and F. BRETAGNOLLE, 1996  Phenotypic plasticity in relation to ploidy level and corm production in the perennial grass Arrhenatherum elatius. Can. J. Bot. 74:1964-1973.

PETIT, C., P. LESBROS, X. GE, and J. D. THOMPSON, 1997  Variation in flowering phenology and selfing rate across a contact zone between diploid and tetraploid Arrhenaterum elatius (Poaceae). Heredity 79:31-40.

RAYMOND, M. and F. ROUSSET, 1995  An exact test for population differentiation. Evolution 49:1280-1283.

REYNOLDS, J., B. S. WEIR, and C. C. COCKERHAM, 1983  Estimation of the coancestry coefficient: basis for a short-term genetic distance. Genetics 105:767-779[Abstract/Free Full Text].

RIESEBERG, L. H. and M. F. DOYLE, 1989  Tetrasomic segregation in the naturally occurring autotetraploid Allium nevii (Alliaceae). Hereditas 111:31-36.

ROUSSET, F., 1996  Equilibrium values of measures of population subdivision for stepwise mutation processes. Genetics 142:1357-1362[Abstract].

ROUSSET, F., 1997  Genetic differentiation and estimation of gene flow from F-statistics under isolation by distance. Genetics 145:1219-1228[Abstract].

SLATKIN, M., 1991  Inbreeding coefficients and coalescence times. Genet. Res. 58:167-175[Medline].

SLATKIN, M. and N. H. BARTON, 1989  A comparison of three indirect methods for estimating average levels of gene flow. Evolution 43:1349-1368.

SLATKIN, M. and L. VOELM, 1991  FST in a hierarchical island model. Genetics 127:627-629[Abstract].

SOKAL, R. R., and F. J. ROHLF, 1995 Biometry, Ed. 3. Freeman and Company, New York.

SOLTIS, D. E. and P. S. SOLTIS, 1989  Genetic consequences of autopolyploidy in Tolmiea (Saxifragaceae). Evolution 43:586-594.

SOLTIS, D. E. and P. S. SOLTIS, 1993  Molecular data and the dynamic nature of polyploidy. Crit. Rev. Plant Sci. 12:243-273.

STEBBINS, G. L., 1971 Chromosomal Evolution in Higher Plants. Addison-Wesley, Reading, MA.

STEBBINS, G. L., 1985  Polyploidy, hybridization and the invasion of new habitats. Ann. MO Bot. Gard. 72:824-832.

TACHIDA, H., 1985  Joint frequencies of alleles determined by separate formulation for the mating and mutation systems. Genetics 111:963-974[Abstract/Free Full Text].

TACHIDA, H. and H. YOSHIMARU, 1996  Genetic diversity in partially selfing populations with the stepping-stone structure. Heredity 77:469-475.

THOMPSON, J. D. and R. LUMARET, 1992  The evolutionary dynamics of polyploid plants: origins, establishment and persistence. Trends Ecol. Evol. 7:302-307.

WALLER, D. M. and S. E. KNIGHT, 1989  Genetic consequences of outcrossing in the cleistogamous annual, Impatiens capensis. II. Outcrossing rates and genotypic correlations. Evolution 43:860-869.

WEIR, B. S., 1996 Genetic Data Analysis II: Methods for Discrete Population Genetic Data. Sinauer Associates, Sunderland, MA.

WEIR, B. S. and C. C. COCKERHAM, 1984  Estimating F-Statistics for the analysis of population structure. Evolution 38:1358-1370.

WRIGHT, S., 1938  The distribution of gene frequencies in populations of polyploids. Proc. Natl. Acad. Sci. USA 24:372-377[Free Full Text].

WRIGHT, S., 1951  The genetical structure of populations. Ann. Eugen. 15:323-354.