help button home button Genetics Mol Pharm
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS

This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Asmussen, M. A.
Right arrow Articles by Orive, M. E.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Asmussen, M. A.
Right arrow Articles by Orive, M. E.
Genetics, Vol. 155, 813-831, June 2000, Copyright © 2000

The Effects of Pollen and Seed Migration on Nuclear-Dicytoplasmic Systems. I. Nonrandom Associations and Equilibrium Structure With Both Maternal and Paternal Cytoplasmic Inheritance

Marjorie A. Asmussena and Maria E. Orivea,b
a Department of Genetics, University of Georgia, Athens, Georgia 30602-7223
b Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, Kansas 66045-2106

Corresponding author: Maria E. Orive, Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, KS 66045-2106., orive{at}ukans.edu (E-mail)

Communicating editor: A. G. CLARK


*  ABSTRACT
*TOP
*ABSTRACT
*THE NUCLEAR DICYTOPLASMIC SYSTEM
*ADMIXTURE FORMULAS
*MODEL OF POLLEN AND...
*DICYTONUCLEAR EQUILIBRIUM...
*DICYTONUCLEAR EQUILIBRIUM...
*DISCUSSION
*APPENDIX A
*APPENDIX B
*APPENDIX C
*LITERATURE CITED

We determine the nuclear-dicytoplasmic effects of unidirectional gene flow via pollen and seeds upon a mixed-mating plant population, focusing on nuclear-mitochondrial-chloroplast systems where mitochondria are inherited maternally and chloroplasts paternally, as in many conifers. After first delineating the general effects of admixture (via seeds or individuals) on the nonrandom associations in such systems, we derive the full dicytonuclear equilibrium structure, including when disequilibria may be indicators of gene flow. Substantial levels of permanent two- and three-locus disequilibria can be generated in adults by (i) nonzero disequilibria in the migrant pools or (ii) intermigrant admixture effects via different chloroplast frequencies in migrant pollen and seeds. Additionally, three-locus disequilibria can be generated by higher-order intermigrant effects such as different chloroplast frequencies in migrant pollen and seeds coupled with nuclear-mitochondrial disequilibria in migrant seeds, or different nuclear frequencies in migrant pollen and seeds coupled with mitochondrial-chloroplast disequilibria in migrant seeds. Further insight is provided by considering special cases with seed or pollen migration alone, complete random mating or selfing, or migrant pollen and seeds lacking disequilibria or intermigrant admixture effects. The results complete the theoretical foundation for a new method for estimating pollen and seed migration using joint cytonuclear or dicytonuclear data.


THERE are two different avenues for gene flow in plant populations: pollen migration and seed migration. Joint nuclear-cytoplasmic data can be especially useful for decomposing and estimating these two types of gene flow due to the asymmetrical inheritance of cytonuclear systems, where nuclear alleles are inherited through both parents while cytoplasmic alleles are usually inherited uniparentally (ASMUSSEN and SCHNABEL 1991 Down; SCHNABEL and ASMUSSEN 1992 Down). This asymmetry and the differing effects of pollen and seed migration will be reflected in the cytonuclear structure of the plant population, through both the joint genotype frequencies and the pattern of statistical associations (disequilibria) generated between the cytoplasmic marker and the nuclear alleles and genotypes.

The existence of two uniparentally inherited organelles in plants greatly enhances the utility of cytonuclear data. Although cases of biparental inheritance of organelle genomes exist (MEDGYESY et al. 1986 Down; WAGNER et al. 1991 Down), the majority of plant species exhibit strict maternal inheritance of both mitochondria and chloroplasts (KIRK and TILNEY-BASSETT 1978 Down; BIRKY 1988 Down; HARRISON and DOYLE 1990 Down). However, in two families of coniferous gymnosperms (including the largest family, the Pinaceae), chloroplast DNA (cpDNA) appears to be paternally inherited, while mitochondrial DNA (mtDNA) is maternally inherited (NEALE et al. 1986 Down; WAGNER et al. 1987 Down; NEALE and SEDEROFF 1989 Down; MOGENSEN 1996 Down). Joint nuclear-mtDNA-cpDNA data should be uniquely valuable in estimating plant gene flow in such systems, since pollen will carry a haploid nuclear component and the chloroplast genome, while seeds will carry complete complements of all three genomes. Nuclear and chloroplast alleles will therefore have two opportunities for movement, while mitochondrial alleles can move only through seeds. Furthermore, although pollen migration can only carry nonrandom associations between nuclear and chloroplast alleles, seed migration can carry all possible pairwise associations between alleles and genotypes in the three genomes, as well as higher-order three-locus nuclear-mitochondrial-chloroplast associations.

Previous work (ASMUSSEN and SCHNABEL 1991 Down; SCHNABEL and ASMUSSEN 1992 Down) has determined the effects of unidirectional pollen and seed migration on the standard, two-locus cytonuclear system with a nuclear locus and a single, uniparentally inherited cytoplasmic marker. With maternal cytoplasmic inheritance, ASMUSSEN and SCHNABEL 1991 Down found that nonzero cytonuclear disequilibria are maintained only if migrant seeds carry nonrandom cytonuclear associations. These migrant associations would be found when selection or other nonrandomizing factors such as assortative mating act on the source population (ARNOLD et al. 1988 Down; BABCOCK and ASMUSSEN 1996 Down, BABCOCK and ASMUSSEN 1998 Down) as well as when seeds are contributed by multiple, genetically distinct sources, as might be expected in hybrid zones and other areas of admixture. Significant and long-lasting transient disequilibria can be generated, however, by migrant seeds lacking such associations. Pollen dispersal has only a small effect on the disequilibria caused by seed migration and cannot generate even transient disequilibria de novo.

In contrast, with paternal cytoplasmic inheritance (SCHNABEL and ASMUSSEN 1992 Down), pollen migration can significantly affect the cytonuclear structure of the recipient population through nonrandom cytonuclear associations in the migrant pollen, although pollen migration alone generally produces lower levels of disequilibria than does the equivalent amount of seed migration alone. Additionally, with paternal cytoplasmic inheritance, the presence of both types of gene flow can generate nonrandom cytonuclear associations via intermigrant admixture effects, such as differences in nuclear and chloroplast allele frequencies in migrant pollen and seeds. Such allele frequency differences in the two migrant pools could be caused by having separate sources for migrant pollen and seeds, common sources whose relative contributions depend on the type of gene flow, or selection or other evolutionary forces acting during the life cycle of the source population(s). Together, these results for cytonuclear disequilibria suggest that three-locus, nuclear-mtDNA-cpDNA data juxtaposing both forms of uniparental inheritance should be uniquely powerful for estimating pollen and seed migration rates, since they provide the greatest number of avenues for the accumulation of nonrandom associations.

As a first step toward formally testing this idea, we extend these previous pollen and seed dispersal models to the three-locus nuclear-dicytoplasmic (dicytonuclear) system parameterized by SCHNABEL and ASMUSSEN 1989 Down. We first review the dicytonuclear frequencies and associations for this system and then delimit the effects of population admixture on the three-locus disequilibria. We next determine the dynamics and full equilibrium structure when the two cytoplasmic markers are inherited through opposite parents, including when each type of two- and three-locus association is generated. Further insight into the distinctive dicytonuclear effects of pollen and seed flow is provided by numerical examples showing the magnitude and duration of the disequilibria they generate and by the equilibrium structure for a series of special cases subsumed within our general framework. Together, these new three-locus results complete the theoretical foundation for a new method for estimating pollen and seed flow in plant populations using joint cytonuclear or dicytonuclear data. This final critical step is presented in a companion article (ORIVE and ASMUSSEN 2000 Down).


*  THE NUCLEAR DICYTOPLASMIC SYSTEM
*TOP
*ABSTRACT
*THE NUCLEAR DICYTOPLASMIC SYSTEM
*ADMIXTURE FORMULAS
*MODEL OF POLLEN AND...
*DICYTONUCLEAR EQUILIBRIUM...
*DICYTONUCLEAR EQUILIBRIUM...
*DISCUSSION
*APPENDIX A
*APPENDIX B
*APPENDIX C
*LITERATURE CITED

We consider the nuclear-dicytoplasmic structure of a diploid plant population. The dicytonuclear system consists of an autosomal nuclear locus with two alleles, A and a, and two haploid cytoplasmic loci: a mitochondrial marker with two alleles (cytotypes) M and m and a chloroplast marker with two alleles (cytotypes) C and c. As is true in the Pinaceae (MOGENSEN 1996 Down), we assume that the mitochondria are inherited maternally and chloroplasts paternally, with no cross-parental leakage of either organelle. The definitions of frequency and nonrandom association variables follow those of SCHNABEL and ASMUSSEN 1989 Down, and we use the notation of that article throughout, except where noted below.

Frequency variables:
The frequencies of the 12joint, three-locus genotypes are given in Table 1 as Uij, Vij, and Wij, where the first index (i = 1, 2) indicates M or m alleles and the second index (j = 1, 2) indicates C or c alleles. The joint cytotype frequencies (Xij) are obtained by summing across each row and the nuclear genotype frequencies (U, V, and W) by summing down each column. The nuclear allele frequencies are

(1a)

and

(1b)

where freq denotes "frequency of." The cytotype frequencies for the two cytoplasmic markers are

(2)

where YM = 1 - XM and YC = 1 - XC. It is also useful to specify the frequencies of the three-locus triallelic combinations where, for each joint cytotype combination i, j = 1, 2,

(3a)

and

(3b)


 
View this table:
[in this window]
[in a new window]

 
Table 1. Joint nuclear-mitochondrial-chloroplast genotype frequencies

For instance, P11 = freq(A/M/C) is the probability that a randomly sampled individual has the joint cytotype MC and that a randomly selected nuclear allele from that individual is A. Note that although we normally use slashes (/) to separate different markers, we omit the slash between the two organellar loci when viewed as a joint cytotype.

The final frequency variables are those for the two-locus nuclear-cytoplasmic combinations within each of the two cytonuclear subsystems. The two sets of joint, two-locus genotypic frequencies are given in Table 2. The joint diallelic frequencies for the nuclear-mitochondrial subsystem represent the ovule frequencies, while those for the nuclear-chloroplast subsystem represent the pollen frequencies in the population (Table 3).


 
View this table:
[in this window]
[in a new window]

 
Table 2. Nuclear-mitochondrial and nuclear-chloroplast genotype frequencies


 
View this table:
[in this window]
[in a new window]

 
Table 3. Joint diallelic frequencies for the nuclear-mitochondrial and nuclear-chloroplast subsystems

Two-locus disequilibria:
We consider three sets of pairwise disequilibria among the three loci. These disequilibria are calculated in the same general way as are linkage disequilibria among nuclear loci and equal the difference between the frequency of each joint combination and that expected under random association between the two components involved. The first two involve the two-locus associations in each cytonuclear subsystem as defined in ASMUSSEN et al. 1987 Down. To facilitate the development of concise general formulas in the three-locus system, they are denoted here by a more informative notation that explicitly includes the nuclear and cytoplasmic components involved. The two allelic disequilibria

(4)

(DM and DC in SCHNABEL and ASMUSSEN 1989 Down) measure the nonrandom association between the nuclear alleles and each cytoplasmic marker. The six genotypic disequilibria

(5)

similarly measure the nonrandom associations between each nuclear genotype and each cytoplasmic marker, where here we have substituted AA, Aa, aa for the usual genotype placeholders 1, 2, and 3. For example, DAA/M = freq(AA/M) - freq(AA)freq(M) measures the nonrandom association between AA nuclear homozygotes and the mitochondrial marker M (D1M in SCHNABEL and ASMUSSEN 1989 Down). Within each of the two cytonuclear subsystems, each disequilibrium can be written as a linear combination of any two of the others via the relations

(6a)

and

(6b)

where each * stands for M for the nuclear-mitochondrial associations and each * stands for C for the nuclear-chloroplast associations.

The third and final type of two-locus disequilibria is the cytoplasmic disequilibrium,

(7)

(DMC in SCHNABEL and ASMUSSEN 1989 Down), which is new to the three-locus system and measures the nonrandom association between alleles at the two cytoplasmic loci. The two-locus genotypic frequencies (UiM, UiC, etc.), joint diallelic frequencies (PiM, PiC, etc.), and joint cytotype frequencies (Xij) can be written in terms of the marginal frequencies and the corresponding cytonuclear and cytoplasmic disequilibria, as shown in Table 4 Table 5 Table 6.


 
View this table:
[in this window]
[in a new window]

 
Table 4. Decomposition of two-locus cytonuclear genotype frequencies


 
View this table:
[in this window]
[in a new window]

 
Table 5. Decomposition of joint cytonuclear diallelic frequencies


 
View this table:
[in this window]
[in a new window]

 
Table 6. Decomposition of joint cytotype frequencies

Three-locus disequilibria:
The three-locus system can also result in higher-order associations that involve all three loci, both pairwise three-locus associations and full three-way associations. These are specified fully in SCHNABEL and ASMUSSEN 1989 Down and for convenience are briefly summarized here. Throughout, we focus on the associations involving the M and C cytotypes; the interrelations in the three-locus system allow us to derive all of the other associations from these (SCHNABEL and ASMUSSEN 1989 Down). The basic three-locus disequilibria are the three joint genotypic disequilibria

(8)

and the joint allelic disequilibrium

(9)

which, respectively, measure the nonrandom associations between the MC joint cytotype and the three genotypes and two alleles at the nuclear locus.

Several true three-way measures of nonrandom associations can also be defined that measure associations among the three markers (nuclear, mitochondrial, and chloroplast) after taking into account all of the possible two-way associations (nuclear-mitochondrial, nuclear-chloroplast, and mitochondrial-chloroplast). For the M/C cytotype, we have three three-way genotypic disequilibria

(10)

and the three-way allelic disequilibrium

(11)

which is analogous to the three-way gametic disequilibrium for three nuclear loci (BENNETT 1954 Down). The sets of four joint and four three-way associations each satisfy the relations given in (6a) and (6b), with * = MC for the joint disequilibria and * = M/C for the three-way disequilibria. We also note that each three-way disequilibrium can be written in terms of the corresponding joint dicytonuclear and two-locus disequilibria and the marginal cytotype frequencies, with

(12)

for each nuclear component, N = A, AA, Aa, or aa.

Finally, only 11 independent variables are necessary to completely describe the full 12 joint genotype system. One such parameterization (Table 7) includes five two-locus disequilibria (DA/M, DA/C, DAa/M, DAa/C, and DM/C), two three-locus disequilibria (DA/MC and DAa/MC), the nuclear, mitochondrial, and chloroplast allele frequencies (P, XM, and XC), and the nuclear heterozygote frequency (V). However, for completeness, our analysis below provides the results for an allelic, homozygote, and heterozygote association for each disequilibrium category: DA/MC, DAA/MC, and DAa/MC for the three-locus joint disequilibria and their counterparts for the three-way disequilibria.


 
View this table:
[in this window]
[in a new window]

 
Table 7. Decomposition of three-locus genotype frequencies


*  ADMIXTURE FORMULAS
*TOP
*ABSTRACT
*THE NUCLEAR DICYTOPLASMIC SYSTEM
*ADMIXTURE FORMULAS
*MODEL OF POLLEN AND...
*DICYTONUCLEAR EQUILIBRIUM...
*DICYTONUCLEAR EQUILIBRIUM...
*DISCUSSION
*APPENDIX A
*APPENDIX B
*APPENDIX C
*LITERATURE CITED

After migration by seeds (or adults), the dicytonuclear makeup of the new population will be a mixture of that of residents and migrants. We consider here the admixture effects for the general case where n genetically distinct sources contribute to a single population. Define the expected value of a variable Z across all n sources as

(13)

where mi is the fraction contributed by source i and Z(i) is the value of the variable Z in source i. Frequencies after admixture are simply their weighted average (expectation) across all sources,

(14)

where T indicates values in the total population after admixture.

Admixture has a more complicated effect on disequilibria since these can be generated in the total population by a two-locus Wahlund effect if the sources are genetically distinct (NEI and LI 1973 Down; ASMUSSEN and ARNOLD 1991 Down; GOODISMAN and ASMUSSEN 1997 Down). Following admixture, the two-way disequilibria will be the weighted average of the disequilibria across the n sources plus the covariance across the sources between the frequencies of the two genetic components being considered (ASMUSSEN and ARNOLD 1991 Down). For example, the allelic and genotypic cytonuclear disequilibria after admixture are

(15a)


(15b)

where N = A, AA, Aa, or aa, and FN is the frequency of N. The cytoplasmic disequilibrium after admixture has the analogous form

(16)

while the joint allelic and joint genotypic disequilibria become

(17)

where, again, N = A, AA, Aa, or aa.

The derivation of these general formulas depends on the fact that Cov(X,Y) = E(XY) - E(X) E(Y) for any two random variables X and Y. As an example, we derive the joint allelic disequilibrium after admixture, using (9) and (14), as

which is (17), where N = A and FN = P.

For all of the two-way disequilibria, the covariance terms have a simple interpretation for the special case where there are only two source populations contributing to the total population. In this case the admixture Equation 15aEquation 15bEquation 16 HREF="#FD17">Equation 17 become

where (i) indicates source i and N = A, AA, Aa, or aa. One can see that the covariance term will be nonzero, causing an admixture effect, if and only if the two components both differ across the two source populations. In general, to have an admixture effect with more than two source populations, it is necessary but not sufficient that the frequencies of both components vary across the sources.

Three-way disequilibria after admixture are more complex than are two-way disequilibria, involving the weighted averages of the disequilibria and cytoplasmic frequencies, covariances between nuclear and cytoplasmic frequencies, and covariances between cytoplasmic frequencies and two-locus cytonuclear disequilibria. The general formula is

(18)

for N = A, AA, Aa, or aa, where, once again, FN is the frequency of N. An admixture effect will be found for three-way associations only if at least one of the contributing covariances is nonzero. As an example of the derivation, we consider the three-way allelic disequilibrium, DA/M/C. From (11) and (14),

which, using (7) and (15), can be rewritten as

Once again using the definition of the covariance given above, we have

Finally, using the definition of the three-way allelic disequilibrium (DA/M/C) in (11) gives us the formula corresponding to (18), with N = A and FN = P,


*  MODEL OF POLLEN AND SEED MIGRATION
*TOP
*ABSTRACT
*THE NUCLEAR DICYTOPLASMIC SYSTEM
*ADMIXTURE FORMULAS
*MODEL OF POLLEN AND...
*DICYTONUCLEAR EQUILIBRIUM...
*DICYTONUCLEAR EQUILIBRIUM...
*DISCUSSION
*APPENDIX A
*APPENDIX B
*APPENDIX C
*LITERATURE CITED

The model of pollen and seed migration (summarized in Fig 1) represents a three-locus extension of the two-locus cytonuclear migration models considering only maternal or only paternal cytoplasmic inheritance in ASMUSSEN and SCHNABEL 1991 Down and SCHNABEL and ASMUSSEN 1992 Down. We consider a population with nonoverlapping generations and no seed dormancy, from which we census adults. Migration is modeled via a continent-island framework with unidirectional migration, from the source population(s) to the population under consideration (Fig 2). We assume that the migration rates and the genetic composition(s) of the source population(s) are constant over time, and we ignore the effects of selection, mutation, and genetic drift. Mating is a mixture of selfing, which occurs at rate s, and outcrossing, which occurs at rate 1 - s in accordance with the mixed-mating model (CLEGG 1980 Down). During mating, pollen migration occurs at rate M, such that the fraction of outcrossed pollen that comes from the migrant pollen pool is M, and the fraction of outcrossed pollen that comes from within the population is 1 - M. The total fraction of migrant pollen per generation is thus M(1 - s). After pollination and seed maturation, seed migration occurs at a rate m, which means that each generation the seed population is a mixture of migrant seeds (a fraction m) and resident seeds (a fraction 1 - m). Following germination and growth, a new adult population is formed, completing the generation cycle.



View larger version (13K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 1. Adult census model showing the two types of gene flow within a generation cycle. Pollen migration occurs first, at rate M, followed by seed migration at rate m. Mating is a mixture of selfing, which occurs at rate s, and outcrossing, which occurs at rate 1 - s.



View larger version (7K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 2. Continent-island migration model with unidirectional pollen (M) and seed (m) migration.

We distinguish variables in the various life stages by letting uppercase letters represent variables in adults (e.g., P, U, XM, XC, DM/C) and lowercase letters represent the corresponding variables in the interim seed population (e.g., p, u, xM, xC, dM/C). Variables in the two migrant pools are distinguished by overbars, with lowercase letters again indicating seeds and uppercase letters now indicating pollen. Since migrant pollen carries only a haploid nuclear component and the chloroplast genome, it is characterized by its nuclear-chloroplast diallelic frequencies (1C, 2C, 1C, 2C) together with its nuclear and chloroplast allele frequencies (, C) and the nuclear-chloroplast allelic disequilibrium (A/C). Migrant seeds, on the other hand, carry a full complement of all three genomes, and therefore have analogs of all the three-locus frequency and disequilibrium variables defined above (e.g., ij, , M, C, A/M, A/C, M/C, A/MC, A/M/C). Disequilibria may occur in migrant seed and pollen if, for example, the source population itself is a mixture of genetically distinct populations or experiences appropriate forms of nonrandom mating (ARNOLD et al. 1988 Down) or selection (BABCOCK and ASMUSSEN 1996 Down, BABCOCK and ASMUSSEN 1998 Down).

The interim seed variables, recursion equations, and equilibrium values for the nuclear-mitochondrial and nuclear-chloroplast subsystems are equivalent to those for the cases of strictly maternal and strictly paternal cytoplasmic inheritance, respectively (ASMUSSEN and SCHNABEL 1991 Down; SCHNABEL and ASMUSSEN 1992 Down). To convert those results to the notation used here, we replace each cytoplasmic and cytonuclear variable z for maternal cytoplasmic inheritance with zM (e.g., x becomes xM,) and those variables for paternal cytoplasmic inheritance with zC (e.g., x becomes xC), with the nuclear component (A, AA, Aa, or aa) explicitly indicated in the cytonuclear disequilibria as above.

We develop the recursion equations for the frequencies and representative disequilibria new to the dicytonuclear system for the complete model with mixed mating and both pollen and seed migration in two steps, first finding the interim values in the seeds following pollen migration and fertilization, and then calculating the new adult values following seed migration. We also consider eight important cases subsumed within this general framework.

Interim seed values:
The joint genotype frequencies at the interim seed stage can be calculated by considering the contribution to each joint genotype due to self-fertilization of resident ovules and that due to random outcrossing. The contribution due to self-fertilization depends only on the joint genotype frequencies of the resident adults (Table 1), while the contribution due to outcrossing is determined by the joint nuclear-mitochondrial allele frequencies in resident ovules and the joint nuclear-chloroplast allele frequencies in the pollen pool (Table 3), which contains both resident and migrant pollen. For example, consider the frequency of AA/M/C progeny. Such progeny result from the fertilization of a resident A/M ovule by A/C pollen. Under selfing (which occurs with probability s), the only individuals who can produce the right type of ovules and pollen are AA/M/C (frequency U11, who always do) and Aa/M/C (frequency V11, who produce each correct gamete type half of the time and the appropriate combination of gametes one-fourth of the time). The contribution due to outcrossing (probability 1 - s) is similarly straightforward to derive. In this case an A/M ovule (frequency P1M) is fertilized by either migrant A/C pollen (probability M 1C) or by resident A/C pollen [probability (1 - M) P1C]. The frequency of AA/M/C progeny is then

Repeating the same reasoning, we find that the frequencies for all of the interim seed genotypes are

(19a)


(19b)

where i = 1 or 2 indicates M or m at the mitochondrial locus and j = 1 or 2 indicates C or c at the chloroplast locus.

From these interim dicytonuclear genotype frequencies, the marginal frequencies at the three loci can be found along with the interim two-locus cytonuclear disequilibria using the definitions in (4) and (5). Interim values for each of the two-locus subsystems that will be used in further derivations are given in Appendix A, Equation A1Equation A2Equation A3Equation A4Equation A5Equation A7 HREF="#FDA8">Equation A88. The interim seed values for the other new three-locus variables can then be derived from the joint genotype frequencies using Table 1 and (7–11). The joint cytotype frequencies are

(20)

where X1M = XM, X2M = YM, X1C = XC, and X2C = YC, and the two-locus cytoplasmic disequilibrium is

(21)

The formulas for the interim three-locus joint and three-way disequilibria in seeds are more complex and are given in Appendix A, Equation A9Equation A10Equation A11Equation A1212.

Recursion equations:
Seed migration completes the life cycle. Using (13) and (14), each frequency variable in the new generation of adults is simply the weighted average of the corresponding value in migrant (m) and resident seeds (1 - m). The new frequencies of the joint genotypes and the joint cytotypes are then

(22)

and

(23)

where the corresponding interim seed frequencies are given by (19) and (20).

The disequilibria in the new adults are the result of admixture between resident and migrant seeds, which can be calculated using (15–18). The cytoplasmic and joint three-locus disequilibria in new adults, for example, will be the weighted average of the corresponding disequilibria in migrant and resident seeds plus the covariance across these two seed populations between the frequencies of the two genetic components being considered. From (16), the cytoplasmic disequilibrium after admixture is

(24)

while, using (17), the new joint three-locus disequilibria are found to be

(25)

for N = A, AA, Aa, or aa, where fN is the frequency of the nuclear component N in seeds. The corresponding three-way disequilibria are similarly obtained from (18) as

(26)

Multiple alleles:
Extension of this general framework to include multiple alleles and cytotypes is straightforward for the joint genotype and marginal frequencies. If we have n1 nuclear alleles (A1, ... , An1), n2 mitochondrial cytotypes (M1, ... , Mn2), and n3 chloroplast cytotypes (C1, ... , Cn3), for a total of n1(n1 + 1)n2n3/2 joint genotypes, we can let Fij,k,l indicate the frequency of adults with the AiAj nuclear genotype and the Mk mitochondrial and Cl chloroplast types. As an example of such a generalization, consider Equation 19aEquation 19b giving interim joint genotype frequencies in seeds (fij,k,l). These would be replaced by two general equations, one for homozygotes,

and one for heterozygotes,

where Pi,k is the frequency of resident Ai/Mk ovules, Pi,(l) is the frequency of resident Ai/Cl pollen, i,(l) is the frequency of migrant Ai/Cl pollen, and i, j = 1, 2, ... , n1, k = 1, 2, ... , n2, and l = 1, 2, ... , n3. Note that we do not distinguish between the nuclear genotypes AiAj and AjAi, so that, for j > i, fij,k,l = fji,k,l and Fij,k,l = Fji,k,l. Substituting these interim seed values into the analogs of (22) then gives the adult genotypic recursions for the case of multiple alleles.

The corresponding increase in the number of disequilibria, however, is not as straightforward to analyze. Disequilibria could be defined, in the manner of Equation 4Equation 5Equation 7Equation 8 HREF="#FD9">Equation 9Equation 10Equation 11Equation 12, for each possible two-way or three-way association between the various alleles, genotypes, and cytotypes following the multiallelic approach for cytonuclear systems in ASMUSSEN and BASTEN 1996 Down. Although there is no difficulty in setting up such definitions, interpretation of such a complex system is left for further work. For the remainder of this article, we focus on the basic case of diallelic loci.


*  DICYTONUCLEAR EQUILIBRIUM STRUCTURE
*TOP
*ABSTRACT
*THE NUCLEAR DICYTOPLASMIC SYSTEM
*ADMIXTURE FORMULAS
*MODEL OF POLLEN AND...
*DICYTONUCLEAR EQUILIBRIUM...
*DICYTONUCLEAR EQUILIBRIUM...
*DISCUSSION
*APPENDIX A
*APPENDIX B
*APPENDIX C
*LITERATURE CITED

The utility of cytonuclear and dicytonuclear data for decomposing and estimating gene flow in plant populations depends on the extent to which such data reflect the differential effects of pollen and seed migration. To address this issue, we now turn to the equilibrium state for the dicytonuclear system, which is determined by the equilibrium values of the frequency and disequilibrium variables within the parameterization shown in Table 6 and Table 7. These are calculated by setting each value after one generation of migration and mating equal to its previous value (for example, P' = P) and solving. Although not shown here, the stability of the three-locus equilibrium and the full dynamical behavior of the dicytonuclear system are determined by the explicit time-dependent solutions for the values of the independent populational variables in each generation t. The dynamical solutions for each of the two-locus cytonuclear systems are given in ASMUSSEN and SCHNABEL 1991 Down and SCHNABEL and ASMUSSEN 1992 Down. Similar methods yield the explicit time-dependent solutions for the disequilibria needed to fully parameterize the three-locus dicytonuclear system. The expected form of these dynamical solutions with their multiple geometric terms shows that the resident population always converges to the unique three-locus equilibrium specified by the equilibrium equations given below.

We focus first on the outcome in mixed-mating populations receiving both pollen and seed migration (0 < s, m, M < 1); the distinctive features of populations that are purely selfing or random mating or experience only one form of gene flow are treated in the subsequent section as special cases. Two practical points should be kept in mind when interpreting the equilibrium values under these different biological conditions. First, although this is a continent-island model, the resident population does not simply become an exact replica of the source population because of the two distinct forms of gene flow and the various levels of nonrandom associations these can generate among the three markers. Second, we are particularly interested in when permanent disequilibria are produced. Although not strictly necessary for gene flow estimation, permanent disequilibria should increase the conditions under which the equilibrium for a cytonuclear or dictyonuclear system reflects, and can be used to estimate, the rate of pollen (M) or seed (m) migration.

Final cytonuclear variables and marginal frequencies:
The equilibria for the two-locus subsystems are given in ASMUSSEN and SCHNABEL 1991 Down and SCHNABEL and ASMUSSEN 1992 Down and are provided in the current notation in Appendix B, Equations B1–B4. Because of their key roles in the new three-locus formulas, we reiterate the equilibrium formulas for the marginal frequencies and the nuclear genotype frequencies here. The final nuclear and chloroplast allele frequencies,

(27)

and

(28)

are weighted averages of the frequencies in migrant seeds and pollen, while the equilibrium mitochondrial frequency,

(29)

is simply the frequency in migrant seeds, since only migrant seeds, and not migrant pollen, carry the mitochondrial locus. Note that in each case a polymorphic equilibrium is required to generate permanent disequilibria involving that marker. The frequency of nuclear heterozygotes converges to a complicated convolution of the frequency of heterozygotes in migrant seeds and the nuclear allele frequencies in the two migrant pools,

(30)

with the final nuclear homozygote frequencies given by Û = - and W = - .

The final marginal frequencies are those of the joint cytotypes, which are new to the three-locus system. From (20) and (23), we find that the equilibrium frequency of each joint cytotype

(31)

(where X1M = XM, X2M = YM, X1C = XC, and X2C = YC) is the weighted average of the corresponding frequency in migrant seeds (mij) and the frequency in resident outcrossed seeds formed using either migrant pollen [(1 - m)M(1 - s)iMjC] or resident pollen [(1 - m) (1 - M)(1 - s)iMjC].

Final two-way disequilibria:
Turning to disequilibrium measures new to the three-locus system, we find that the equilibrium value for the cytoplasmic disequilibrium is

(32)

From (32), we see that pollen migration has no effect on the final association between alleles at the two cytoplasmic loci since pollen carry only the chloroplast and not the mitochondrial marker. In fact, there will be permanent disequilibrium between the two cytoplasmic loci if and only if the mitochondrial and chloroplast cytotypes are nonrandomly associated in migrant seeds. Such nonrandom associations among these or other markers could reflect disequilibria generated by selection or other nonrandomizing forces in the source population; migrant seed (or pollen) disequilibria would also be expected if seeds (or pollen) are derived from multiple, genetically distinct sources, as would occur in hybrid zones or other areas of admixture (see Equation 15aEquation 15bEquation 16Equation 17Equation 18).

We now consider the representative three-locus disequilibria. At equilibrium, the joint allelic disequilibrium is a linear combination of six factors,

(33)

where the ci are constants that depend on the migration and selfing rates and the cytotype frequencies in the migrant seeds and pollen; these are given in (C1) in Appendix C. We see immediately that joint allelic disequilibrium can be produced by the corresponding disequilibrium in migrant seeds (A/MC) or by allelic cytonuclear disequilibria in the migrant pools within either the nuclear-mitochondrial or nuclear-chloroplast subsystems (A/M, A/C, A/C). Joint allelic associations can also be generated by intermigrant interactions involving unequal nuclear allele frequencies in migrant pollen and seeds coupled either with cytoplasmic disequilibrium in migrant seeds ( != and M/C != 0) or with unequal chloroplast frequencies in the two migrant pools ( != and C != C). Different nuclear or chloroplast frequencies in the migrant pollen and seeds could be a result of differences in average distance for these two types of gene flow, resulting in different sources for pollen and seeds or differences in the relative contribution of multiple source populations to the two migrant pools (e.g., source 1 might contribute more pollen than seeds). Selection or other evolutionary forces within the life cycle of the source population(s) could also result in different nuclear or chloroplast frequencies in the two migrant pools. While joint nuclear-chloroplast allele frequency differences also generate nuclear-chloroplast allelic disequilibria [A/C, which is in SCHNABEL and ASMUSSEN 1992 Down], the intermigrant interaction of nuclear allele frequency differences with cytoplasmic disequilibria in migrant seeds is a new factor unique to the three-locus system.

The final three-locus joint genotypic disequilibria can be written as

(34a)

where the ci coefficients for each disequilibrium are given in (C2) and (C3), respectively. Similarly to the joint allelic disequilibrium, joint genotypic disequilibria can be produced by joint disequilibria in migrant seeds (AA/MC and Aa/MC for the homozygote disequilibrium, Aa/MC alone for the heterozygote disequilibrium) or by allelic cytonuclear disequilibria in the migrant pools (A/M, A/C, A/C and crossproducts of these). However, joint genotypic disequilibria can also be generated directly by cytoplasmic disequilibria in migrant seeds (M/C) without the need for unequal nuclear allele frequencies in the two migrant pools. Finally, joint genotypic disequilibria can be generated via intermigrant interactions involving unequal chloroplast frequencies in migrant pollen and seeds (C != C), either alone or in conjunction with other intermigrant factors ( != with or without A/M != 0).

Final three-way disequilibria:
The three-way associations are generated in fewer ways than the three-locus joint disequilibria. Allelic cytonuclear disequilibria in the migrant pools (A/M, A/C, or A/C != 0) or "simple" intermigrant admixture effects due to allele frequency differences in migrant pollen and seeds (C != C, or != and C != C) are insufficient to generate true three-way associations. For example, the three-way allelic disequilibrium is found at equilibrium to be

(35)

where the ci are given in (C4). This can be generated in only three ways: by the corresponding disequilibria in migrant seeds (A/M/C) or by either of two types of intermigrant interactions unique to the dicytonuclear system. The first couples unequal nuclear allele frequencies with cytoplasmic disequilibria in migrant seeds ( != and M/C != 0), which also enters into the joint allelic disequilibrium (33). The second is a new, three-locus interaction involving unequal chloroplast frequencies in migrant pollen and seeds together with nuclear-mitochondrial allelic disequilibria in migrant seeds (C != C and A/M != 0).

Finally, at equilibrium, the three-way genotypic disequilibria are

(36a)


(36b)

where the ci are given in (C5) and (C6), respectively. Paralleling the joint genotypic disequilibria in (34), these can be generated by three-way genotypic associations in migrant seeds (AA/M/C or Aa/M/C for the homozygote disequilibria, Aa/M/C alone for the heterozygote disequilibria) and by cytoplasmic disequilibria in migrant seeds (M/C). However, as for the three-way allelic association, the allelic cytonuclear disequilibria in the migrant pools contribute only in conjunction with other terms. For example, nuclear-chloroplast allelic disequilibrium in either migrant pool can generate three-way genotypic associations only in conjunction with nuclear-mitochondrial allelic disequilibria in migrant seeds (A/M != 0 plus A/C != 0 or A/C != 0). Additionally, nuclear-mitochondrial allelic disequilibria in migrant seeds can contribute when combined with unequal chloroplast frequencies in the migrant pools, with or without unequal nuclear allele frequencies (C != C and A/M != 0; or != , C != C, and A/M != 0). Finally, another form of intermigrant interaction, new to the three-locus system, can also produce three-way genotypic disequilibria: unequal chloroplast frequencies in the migrant pools coupled with the corresponding nuclear-mitochondrial genotypic disequilibrium in migrant seeds (e.g., C != C and AA/M != 0 for AA/M/C). The dependence of the final three-way homozygote disequilibria on the heterozygote value (Aa/M/C) means that the former can also be generated by unequal chloroplast frequencies in conjunction with nonrandom associations between heterozygotes and the mitochondrial marker in seeds (C != C and Aa/M != 0).

Equilibrium three-locus genotype frequencies:
To complete the specification of the equilibrium state for the dicytonuclear system, we must calculate the final three-locus genotype frequencies (Ûij, ij, Wij). These can be obtained by substituting the relevant equilibrium formulas [(27–30), (32–34), (B1–B4), (C1), and (C2)] into the decompositions given in Table 6 and Table 7.

Numerical examples:
Numerical examples allow us to compare the cytoplasmic and three-locus disequilibria generated by pollen and seed migration in the full nuclear-mtDNA-cpDNA system with the disequilibria in each of the two cytonuclear subsystems. Previous work has found that 0.1 is roughly the minimal detectable level of two-way disequilibria, given reasonable sample sizes and marginal frequencies (BASTEN and ASMUSSEN 1997 Down). Generally, both permanent and transient disequilibria generated via pollen migration alone are smaller in magnitude than disequilibria generated via a comparable amount of seed migration alone, as was found previously for the cytonuclear subsystems by SCHNABEL and ASMUSSEN 1992 Down. Higher selfing rates increase the magnitude of disequilibria and slow the decay of transient disequilibria. Both permanent and transient three-way disequilibria tend to be smaller than the corresponding two-locus or joint disequilibria, as would be expected of measures for higher-order effects.

Fig 3 shows an example where the resident population is initially monomorphic at each marker (U11 = 1.0) and receives genetically distinct migrant pollen and seeds ( = 0, = 0.7, C = 1.0, C = 0.7) with nonrandom cytonuclear associations in the seeds. Such a situation might arise, for example, if a genetically homogeneous population received migrant pollen from a source fixed for a different nuclear marker but received seeds from both this source and another source, differing in both nuclear and chloroplast frequencies. These differences in nuclear and chloroplast frequencies in the migrant pools and cytonuclear associations in migrant seeds generate permanent nonzero values for all the disequilibria. In the absence of actual values of seed and pollen migration rates and selfing rates, this example is used to illustrate cases with a larger pollen migration rate (M = 0.2) than seed migration rate (m = 0.05) as would be expected for most plant species with wind-dispersed pollen. The joint allelic and genotypic disequilibria for the three-genome system (Fig 3C) are similar in the shape of their trajectories to the disequilibria from the nuclear-mitochondrial subsystem (Fig 3A), although somewhat greater in magnitude. The nuclear-chloroplast and cytoplasmic disequilibria also have similar shapes but are smaller in magnitude (Fig 3B and Fig D). The three-way allelic and genotypic disequilibria for this example (Fig 3D) both have slightly smaller magnitudes and behave very differently than the two-way disequilibria. For instance, the three-way allelic and homozygote disequilibria have signs opposite those of the corresponding joint associations, while the three-way heterozygote disequilibrium differs from all other heterozygote disequilibria in this example by not changing sign along its trajectory. This example corresponds to a mainly outcrossing species (s = 0.1); increasing the selfing rate to s = 0.9 increases the magnitude of all the disequilibria except the heterozygote nuclear-mitochondrial (DAa/M) and joint (DAa/MC) disequilibria (all parameters other than s as in Fig 3, data not shown). Most of the other disequilibrium measures are increased by a factor of ~4 to 8, but some increase by a great deal more. For example, the homozygote nuclear-chloroplast disequilibrium (DAA/C) increases by a factor of 20, from ~0.004 (s = 0.1) to 0.08 (s = 0.9).



View larger version (25K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 3. Trajectories of permanent nonzero disequilibria. Initial conditions were U11 = 1.0; parameter values were s = 0.1, m = 0.05, M = 0.2, 11 = 0.7, 22 = 0.3, and 1C = 1.0. Disequilibria in the migrant pools were M/C = A/* = AA/* = -aa/* = 0.21, Aa/* = Aa/M/C = 0 (where * indicates M, C, or MC), A/M/C = AA/M/C = -aa/M/C = -0.084, and A/C = 0.

Fig 4 gives a case where the resident population receives only migrant seeds and no migrant pollen (M = 0) and is initially fixed at the nuclear and mitochondrial loci (P = XM = 1.0), but polymorphic at the chloroplast locus (XC = 0.8). In contrast to the population shown in Fig 3, here the migrant seeds have no cytonuclear disequilibria, and there can be no intermigrant admixture effects with only one type of migration; therefore, no permanent nonzero disequilibria are generated. However, transient disequilibria are generated by differences in the nuclear, mitochondrial, and chloroplast frequencies between the original population and the migrant seeds ( = M = 0.0, C = 0.4). In this example with high selfing (s = 0.9), the transient nuclear-mitochondrial disequilibria (Fig 4A) and joint dicytonuclear disequilibria (Fig 4C) that are generated can reach quite high values (>0.15 in magnitude) before dissipating; additionally, these can persist for relatively long periods of time (>0.05 in magnitude for almost 40 generations), increasing the probability of detection, particularly for long-lived organisms where generation times are long. However, if we consider lower selfing rates corresponding to predominantly outcrossing species, both the maximum level and the duration of the disequilibria are much reduced. For example, with a selfing rate of s = 0.1 (all other parameters as in Fig 4, data not shown), the transient nuclear-mitochondrial and joint dicytonuclear disequilibria do not reach 0.1 in magnitude before dissipating and remain above 0.05 in magnitude for <10 generations. Even with a high selfing rate (s = 0.9), neither the nuclear-chloroplast nor the cytoplasmic disequilibria (Fig 4B and Fig D) exceed 0.08 before dissipating. The three-way measures for this example are much smaller in magnitude than the other associations (Fig 4D) and are interesting in that all but the heterozygote association show a sign change in their trajectories.



View larger version (24K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 4. Trajectories of transient nonzero disequilibria. Initial conditions were U11 = 0.8, U12 = 0.2; parameter values were s = 0.9, m = 0.05, M = 0, 21 = 0.4, 22 = 0.6, 2C = 0.6, and 1C = 0.4. All disequilibria in the migrant pools are zero. Note differences in scale between left and right.


*  DICYTONUCLEAR EQUILIBRIUM STRUCTURE FOR SPECIAL CASES
*TOP
*ABSTRACT
*THE NUCLEAR DICYTOPLASMIC SYSTEM
*ADMIXTURE FORMULAS
*MODEL OF POLLEN AND...
*DICYTONUCLEAR EQUILIBRIUM...
*DICYTONUCLEAR EQUILIBRIUM...
*DISCUSSION
*APPENDIX A
*APPENDIX B
*APPENDIX C
*LITERATURE CITED

To further analyze the effects of the two different forms of gene flow and the mating system on the generation of permanent disequilibria, we consider the equilibrium structure under a series of important special cases subsumed within this general framework. These include (1) seed migration alone (0 < m < 1, M = 0), (2) pollen migration alone (0 < M < 1, m = 0), (3) complete random mating (s = 0), (4) complete self-fertilization with seed migration (s = 1, 0 < m < 1), (5) equal nuclear allele frequencies in the two migrant pools ( = ), (6) equal chloroplast frequencies in the two migrant pools (C = C), (7) equivalent migrant pools ( = , C = C, A/C = A/C), and (8) no migrant disequilibria ( = = 0 for all migrant pollen and seed disequilibria). Since in each case the behavior of the disequilibria for both of the cytonuclear subsystems has been considered earlier (ASMUSSEN and SCHNABEL 1991 Down; SCHNABEL and ASMUSSEN 1992 Down), details for these are not given unless necessary for clarification. In the following discussion, we focus on the distinctive features of each special case that differ from the general case presented above, with explicit equilibrium formulas given only when they represent significant simplifications.

Seed migration alone (0 < m < 1, M = 0):
Populations that receive gene flow solely through seeds have a very different equilibrium structure from those receiving both forms of gene flow. With no pollen migration, all of the allele frequencies approach those in migrant seeds ( = , M =