Abstract
We determine the nuclear-dicytoplasmic effects of unidirectional gene flow via pollen and seeds upon a mixed-mating plant population, focusing on nuclear-mitochondrial-chloroplast systems where mitochondria are inherited maternally and chloroplasts paternally, as in many conifers. After first delineating the general effects of admixture (via seeds or individuals) on the nonrandom associations in such systems, we derive the full dicytonuclear equilibrium structure, including when disequilibria may be indicators of gene flow. Substantial levels of permanent two- and three-locus disequilibria can be generated in adults by (i) nonzero disequilibria in the migrant pools or (ii) intermigrant admixture effects via different chloroplast frequencies in migrant pollen and seeds. Additionally, three-locus disequilibria can be generated by higher-order intermigrant effects such as different chloroplast frequencies in migrant pollen and seeds coupled with nuclear-mitochondrial disequilibria in migrant seeds, or different nuclear frequencies in migrant pollen and seeds coupled with mitochondrial-chloroplast disequilibria in migrant seeds. Further insight is provided by considering special cases with seed or pollen migration alone, complete random mating or selfing, or migrant pollen and seeds lacking disequilibria or intermigrant admixture effects. The results complete the theoretical foundation for a new method for estimating pollen and seed migration using joint cytonuclear or dicytonuclear data.
THERE are two different avenues for gene flow in plant populations: pollen migration and seed migration. Joint nuclear-cytoplasmic data can be especially useful for decomposing and estimating these two types of gene flow due to the asymmetrical inheritance of cytonuclear systems, where nuclear alleles are inherited through both parents while cytoplasmic alleles are usually inherited uniparentally (Asmussen and Schnabel 1991; Schnabel and Asmussen 1992). This asymmetry and the differing effects of pollen and seed migration will be reflected in the cytonuclear structure of the plant population, through both the joint genotype frequencies and the pattern of statistical associations (disequilibria) generated between the cytoplasmic marker and the nuclear alleles and genotypes.
The existence of two uniparentally inherited organelles in plants greatly enhances the utility of cytonuclear data. Although cases of biparental inheritance of organelle genomes exist (Medgyesyet al. 1986; Wagneret al. 1991), the majority of plant species exhibit strict maternal inheritance of both mitochondria and chloroplasts (Kirk and Tilney-Bassett 1978; Birky 1988; Harrison and Doyle 1990). However, in two families of coniferous gymnosperms (including the largest family, the Pinaceae), chloroplast DNA (cpDNA) appears to be paternally inherited, while mitochondrial DNA (mtDNA) is maternally inherited (Nealeet al. 1986; Wagneret al. 1987; Neale and Sederoff 1989; Mogensen 1996). Joint nuclear-mtDNA-cpDNA data should be uniquely valuable in estimating plant gene flow in such systems, since pollen will carry a haploid nuclear component and the chloroplast genome, while seeds will carry complete complements of all three genomes. Nuclear and chloroplast alleles will therefore have two opportunities for movement, while mitochondrial alleles can move only through seeds. Furthermore, although pollen migration can only carry nonrandom associations between nuclear and chloroplast alleles, seed migration can carry all possible pairwise associations between alleles and genotypes in the three genomes, as well as higher-order three-locus nuclear-mitochondrial-chloroplast associations.
Previous work (Asmussen and Schnabel 1991; Schnabel and Asmussen 1992) has determined the effects of unidirectional pollen and seed migration on the standard, two-locus cytonuclear system with a nuclear locus and a single, uniparentally inherited cytoplasmic marker. With maternal cytoplasmic inheritance, Asmussen and Schnabel (1991) found that nonzero cytonuclear disequilibria are maintained only if migrant seeds carry nonrandom cytonuclear associations. These migrant associations would be found when selection or other nonrandomizing factors such as assortative mating act on the source population (Arnoldet al. 1988; Babcock and Asmussen 1996, 1998) as well as when seeds are contributed by multiple, genetically distinct sources, as might be expected in hybrid zones and other areas of admixture. Significant and long-lasting transient disequilibria can be generated, however, by migrant seeds lacking such associations. Pollen dispersal has only a small effect on the disequilibria caused by seed migration and cannot generate even transient disequilibria de novo.
In contrast, with paternal cytoplasmic inheritance (Schnabel and Asmussen 1992), pollen migration can significantly affect the cytonuclear structure of the recipient population through nonrandom cytonuclear associations in the migrant pollen, although pollen migration alone generally produces lower levels of disequilibria than does the equivalent amount of seed migration alone. Additionally, with paternal cytoplasmic inheritance, the presence of both types of gene flow can generate nonrandom cytonuclear associations via intermigrant admixture effects, such as differences in nuclear and chloroplast allele frequencies in migrant pollen and seeds. Such allele frequency differences in the two migrant pools could be caused by having separate sources for migrant pollen and seeds, common sources whose relative contributions depend on the type of gene flow, or selection or other evolutionary forces acting during the life cycle of the source population(s). Together, these results for cytonuclear disequilibria suggest that three-locus, nuclear-mtDNA-cpDNA data juxta-posing both forms of uniparental inheritance should be uniquely powerful for estimating pollen and seed migration rates, since they provide the greatest number of avenues for the accumulation of nonrandom associations.
As a first step toward formally testing this idea, we extend these previous pollen and seed dispersal models to the three-locus nuclear-dicytoplasmic (dicytonuclear) system parameterized by Schnabel and Asmussen (1989). We first review the dicytonuclear frequencies and associations for this system and then delimit the effects of population admixture on the three-locus disequilibria. We next determine the dynamics and full equilibrium structure when the two cytoplasmic markers are inherited through opposite parents, including when each type of two- and three-locus association is generated. Further insight into the distinctive dicytonuclear effects of pollen and seed flow is provided by numerical examples showing the magnitude and duration of the disequilibria they generate and by the equilibrium structure for a series of special cases subsumed within our general framework. Together, these new three-locus results complete the theoretical foundation for a new method for estimating pollen and seed flow in plant populations using joint cytonuclear or dicytonuclear data. This final critical step is presented in a companion article (Orive and Asmussen 2000).
Joint nuclear-mitochondrial-chloroplast genotype frequencies
THE NUCLEAR DICYTOPLASMIC SYSTEM
We consider the nuclear-dicytoplasmic structure of a diploid plant population. The dicytonuclear system consists of an autosomal nuclear locus with two alleles, A and a, and two haploid cytoplasmic loci: a mitochondrial marker with two alleles (cytotypes) M and m and a chloroplast marker with two alleles (cytotypes) C and c. As is true in the Pinaceae (Mogensen 1996), we assume that the mitochondria are inherited maternally and chloroplasts paternally, with no cross-parental leakage of either organelle. The definitions of frequency and nonrandom association variables follow those of Schnabel and Asmussen (1989), and we use the notation of that article throughout, except where noted below.
Frequency variables: The frequencies of the 12 joint, three-locus genotypes are given in Table 1 as Uij, Vij, and Wij, where the first index (i = 1, 2) indicates M or m alleles and the second index (j = 1, 2) indicates C or c alleles. The joint cytotype frequencies (Xij) are obtained by summing across each row and the nuclear genotype frequencies (U, V, and W) by summing down each column. The nuclear allele frequencies are
Nuclear-mitochondrial and nuclear-chloroplast genotype frequencies
The final frequency variables are those for the two-locus nuclear-cytoplasmic combinations within each of the two cytonuclear subsystems. The two sets of joint, two-locus genotypic frequencies are given in Table 2. The joint diallelic frequencies for the nuclear-mitochondrial subsystem represent the ovule frequencies, while those for the nuclear-chloroplast subsystem represent the pollen frequencies in the population (Table 3).
Two-locus disequilibria: We consider three sets of pairwise disequilibria among the three loci. These disequilibria are calculated in the same general way as are linkage disequilibria among nuclear loci and equal the difference between the frequency of each joint combination and that expected under random association between the two components involved. The first two involve the two-locus associations in each cytonuclear subsystem as defined in Asmussen et al. (1987). To facilitate
the development of concise general formulas in the three-locus system, they are denoted here by a more informative notation that explicitly includes the nuclear and cytoplasmic components involved. The two allelic disequilibria
Joint diallelic frequencies for the nuclear-mitochondrial and nuclear-chloroplast subsystems
The third and final type of two-locus disequilibria is the cytoplasmic disequilibrium,
Decomposition of two-locus cytonuclear genotype frequencies
Three-locus disequilibria: The three-locus system can also result in higher-order associations that involve all three loci, both pairwise three-locus associations and full three-way associations. These are specified fully in Schnabel and Asmussen (1989) and for convenience are briefly summarized here. Throughout, we focus on the associations involving the M and C cytotypes; the interrelations in the three-locus system allow us to derive all of the other associations from these (Schnabel and Asmussen 1989). The basic three-locus disequilibria are the three joint genotypic disequilibria
Decomposition of joint cytonuclear dialletic frequencies
Several true three-way measures of nonrandom associations can also be defined that measure associations among the three markers (nuclear, mitochondrial, and chloroplast) after taking into account all of the possible two-way associations (nuclear-mitochondrial, nuclear-chloroplast, and mitochondrial-chloroplast). For the M/C cytotype, we have three three-way genotypic disequilibria
Finally, only 11 independent variables are necessary to completely describe the full 12 joint genotype system. One such parameterization (Table 7) includes five two-locus disequilibria (Da/m, Dmc, Da/M, Dia/c, and Dm/c), two three-locus disequilibria (DA/MC and DAa/MC), the nuclear, mitochondrial, and chloroplast allele frequencies (P, XM, and XC), and the nuclear heterozygote frequency (V). However, for completeness, our analysis below provides the results for an allelic, homozygote, and heterozygote association for each disequilibrium category: Da/mc, DM/mc, and Dia/MC for the three-locus joint disequilibria and their counterparts for the three-way disequilibria.
Decomposition of joint cytotype frequencies
ADMIXTURE FORMULAS
After migration by seeds (or adults), the dicytonuclear makeup of the new population will be a mixture of that of residents and migrants. We consider here the admixture effects for the general case where n genetically distinct sources contribute to a single population. Define the expected value of a variable Z across all n sources as
Admixture has a more complicated effect on disequilibria since these can be generated in the total population by a two-locus Wahlund effect if the sources are genetically distinct (Nei and Li 1973; Asmussen and Arnold 1991; Goodisman and Asmussen 1997). Following admixture, the two-way disequilibria will be the weighted average of the disequilibria across the n sources plus the covariance across the sources between the frequencies of the two genetic components being considered (Asmussen and Arnold 1991). For example, the allelic and genotypic cytonuclear disequilibria after admixture are
The derivation of these general formulas depends on the fact that Cov(X,Y) = E(XY) − E(X) E(Y) for any two random variables X and Y. As an example, we derive the joint allelic disequilibrium after admixture, using (9) and (14), as
For all of the two-way disequilibria, the covariance terms have a simple interpretation for the special case where there are only two source populations contributing to the total population. In this case the admixture formulas (15, 16 and 17) become
Decomposition of three-locus genotype frequencies
Three-way disequilibria after admixture are more complex than are two-way disequilibria, involving the weighted averages of the disequilibria and cytoplasmic frequencies, covariances between nuclear and cytoplasmic frequencies, and covariances between cytoplasmic frequencies and two-locus cytonuclear disequilibria. The general formula is
MODEL OF POLLEN AND SEED MIGRATION
The model of pollen and seed migration (summarized in Figure 1) represents a three-locus extension of the two-locus cytonuclear migration models considering only maternal or only paternal cytoplasmic inheritance in Asmussen and Schnabel (1991) and Schnabel and Asmussen (1992). We consider a population with nonoverlapping generations and no seed dormancy, from which we census adults. Migration is modeled via a continent-island framework with unidirectional migration, from the source population(s) to the population under consideration (Figure 2). We assume that the migration rates and the genetic composition(s) of the source population(s) are constant over time, and we ignore the effects of selection, mutation, and genetic drift. Mating is a mixture of selfing, which occurs at rate s, and outcrossing, which occurs at rate 1 − s in accordance with the mixed-mating model (Clegg 1980). During mating, pollen migration occurs at rate M, such that the fraction of outcrossed pollen that comes from the migrant pollen pool is M, and the fraction of outcrossed pollen that comes from within the population is 1 − M. The total fraction of migrant pollen per generation is thus M(1 − s). After pollination and seed maturation, seed migration occurs at a rate m, which means that each generation the seed population is a mixture of migrant seeds (a fraction m) and resident seeds (a fraction 1 − m). Following germination and growth, a new adult population is formed, completing the generation cycle.
We distinguish variables in the various life stages by letting uppercase letters represent variables in adults (e.g., P, U, XM, XC, DM/C) and lowercase letters represent the corresponding variables in the interim seed population (e.g., p, u, xM, xC, dM/C). Variables in the two migrant pools are distinguished by overbars, with lowercase letters again indicating seeds and uppercase letters now indicating pollen. Since migrant pollen carries only a haploid nuclear component and the chloroplast genome, it is characterized by its nuclear-chloroplast diallelic frequencies (
Adult census model showing the two types of gene flow within a generation cycle. Pollen migration occurs first, at rate M, followed by seed migration at rate m. Mating is a mixture of selfing, which occurs at rate s, and outcrossing, which occurs at rate 1 − s.
The interim seed variables, recursion equations, and equilibrium values for the nuclear-mitochondrial and nuclear-chloroplast subsystems are equivalent to those for the cases of strictly maternal and strictly paternal cytoplasmic inheritance, respectively (Asmussen and Schnabel 1991; Schnabel and Asmussen 1992). To convert those results to the notation used here, we replace each cytoplasmic and cytonuclear variable z for maternal cytoplasmic inheritance with zM (e.g., x becomes xM,) and those variables for paternal cytoplasmic inheritance with zC (e.g., x becomes xC), with the nuclear component (A, AA, Aa, or aa) explicitly indicated in the cytonuclear disequilibria as above.
We develop the recursion equations for the frequencies and representative disequilibria new to the dicytonuclear system for the complete model with mixed mating and both pollen and seed migration in two steps, first finding the interim values in the seeds following pollen migration and fertilization, and then calculating the new adult values following seed migration. We also consider eight important cases subsumed within this general framework.
Interim seed values: The joint genotype frequencies at the interim seed stage can be calculated by considering the contribution to each joint genotype due to self-fertilization of resident ovules and that due to random outcrossing. The contribution due to self-fertilization depends only on the joint genotype frequencies of the resident adults (Table 1), while the contribution due to outcrossing is determined by the joint nuclear-mitochondrial allele frequencies in resident ovules and the joint nuclear-chloroplast allele frequencies in the pollen pool (Table 3), which contains both resident and migrant pollen. For example, consider the frequency of AA/M/C progeny. Such progeny result from the fertilization
of a resident A/M ovule by A/C pollen. Under selfing (which occurs with probability s), the only individuals who can produce the right type of ovules and pollen are AA/M/C (frequency U11, who always do) and Aa/M/C (frequency V11, who produce each correct gamete type half of the time and the appropriate combination of gametes one-fourth of the time). The contribution due to outcrossing (probability 1 − s) is similarly straightforward to derive. In this case an A/M ovule (frequency P1M) is fertilized by either migrant A/C pollen (probability M P1C) or by resident A/C pollen [probability (1 − M) P1C]. The frequency of AA/M/C progeny is then
Continent-island migration model with unidirectional pollen (M) and seed (m) migration.
From these interim dicytonuclear genotype frequencies, the marginal frequencies at the three loci can be found along with the interim two-locus cytonuclear disequilibria using the definitions in (4) and (5). Interim values for each of the two-locus subsystems that will be used in further derivations are given in appendix a, Equations A1, A2, A3, A4, A5, A6, A7 and A8. The interim seed values for the other new three-locus variables can then be derived from the joint genotype frequencies using Table 1 and (7–11). The joint cytotype frequencies are
Recursion equations: Seed migration completes the life cycle. Using (13) and (14), each frequency variable in the new generation of adults is simply the weighted average of the corresponding value in migrant (m) and resident seeds (1 − m). The new frequencies of the joint genotypes and the joint cytotypes are then
The disequilibria in the new adults are the result of admixture between resident and migrant seeds, which can be calculated using (15, 16, 17 and 18). The cytoplasmic and joint three-locus disequilibria in new adults, for example, will be the weighted average of the corresponding disequilibria in migrant and resident seeds plus the covariance across these two seed populations between the frequencies of the two genetic components being considered. From (16), the cytoplasmic disequilibrium after admixture is
Multiple alleles: Extension of this general framework to include multiple alleles and cytotypes is straightforward for the joint genotype and marginal frequencies. If we have n1 nuclear alleles (A1, …, An1), n2 mitochondrial cytotypes (M1, …, Mn2), and n3 chloroplast cytotypes (C1, …, Cn3), for a total of n1(n1 + 1)n2n3/2 joint genotypes, we can let Fij,k,l indicate the frequency of adults with the AiAj nuclear genotype and the Mk mitochondrial and Cl chloroplast types. As an example of such a generalization, consider Equations 19a, 19b and 19c giving interim joint genotype frequencies in seeds (fij,k,l). These would be replaced by two general equations, one for homozygotes,
The corresponding increase in the number of disequilibria, however, is not as straightforward to analyze. Disequilibria could be defined, in the manner of Equations 4, 5, 6, 7, 8, 9, 10, 11 and 12, for each possible two-way or three-way association between the various alleles, genotypes, and cytotypes following the multiallelic approach for cytonuclear systems in Asmussen and Basten (1996). Although there is no difficulty in setting up such definitions, interpretation of such a complex system is left for further work. For the remainder of this article, we focus on the basic case of diallelic loci.
DICYTONUCLEAR EQUILIBRIUM STRUCTURE
The utility of cytonuclear and dicytonuclear data for decomposing and estimating gene flow in plant populations depends on the extent to which such data reflect the differential effects of pollen and seed migration. To address this issue, we now turn to the equilibrium state for the dicytonuclear system, which is determined by the equilibrium values of the frequency and disequilibrium variables within the parameterization shown in Tables 6 and 7. These are calculated by setting each value after one generation of migration and mating equal to its previous value (for example, P′ = P) and solving. Although not shown here, the stability of the three-locus equilibrium and the full dynamical behavior of the dicytonuclear system are determined by the explicit time-dependent solutions for the values of the independent populational variables in each generation t. The dynamical solutions for each of the two-locus cytonuclear systems are given in Asmussen and Schnabel (1991) and Schnabel and Asmussen (1992). Similar methods yield the explicit time-dependent solutions for the disequilibria needed to fully parameterize the three-locus dicytonuclear system. The expected form of these dynamical solutions with their multiple geometric terms shows that the resident population always converges to the unique three-locus equilibrium specified by the equilibrium equations given below.
We focus first on the outcome in mixed-mating populations receiving both pollen and seed migration (0 < s, m, M < 1); the distinctive features of populations that are purely selfing or random mating or experience only one form of gene flow are treated in the subsequent section as special cases. Two practical points should be kept in mind when interpreting the equilibrium values under these different biological conditions. First, although this is a continent-island model, the resident population does not simply become an exact replica of the source population because of the two distinct forms of gene flow and the various levels of nonrandom associations these can generate among the three markers. Second, we are particularly interested in when permanent disequilibria are produced. Although not strictly necessary for gene flow estimation, permanent disequilibria should increase the conditions under which the equilibrium for a cytonuclear or dictyonuclear system reflects, and can be used to estimate, the rate of pollen (M) or seed (m) migration.
Final cytonuclear variables and marginal frequencies: The equilibria for the two-locus subsystems are given in Asmussen and Schnabel (1991) and Schnabel and Asmussen (1992) and are provided in the current notation in appendix b, Equations B1, B2, B3 and B4. Because of their key roles in the new three-locus formulas, we reiterate the equilibrium formulas for the marginal frequencies and the nuclear genotype frequencies here. The final nuclear and chloroplast allele frequencies,
The final marginal frequencies are those of the joint cytotypes, which are new to the three-locus system. From (20) and (23), we find that the equilibrium frequency of each joint cytotype
Final two-way disequilibria: Turning to disequilibrium measures new to the three-locus system, we find that the equilibrium value for the cytoplasmic disequilibrium is
We now consider the representative three-locus disequilibria. At equilibrium, the joint allelic disequilibrium is a linear combination of six factors,
The final three-locus joint genotypic disequilibria can be written as
Final three-way disequilibria: The three-way associations are generated in fewer ways than the three-locus joint disequilibria. Allelic cytonuclear disequilibria in the migrant pools (
Finally, at equilibrium, the three-way genotypic disequilibria are
Equilibrium three-locus genotype frequencies: To
complete the specification of the equilibrium state for the dicytonuclear system, we must calculate the final three-locus genotype frequencies (
Trajectories of permanent nonzero disequilibria. Initial conditions were U11 = 1.0; parameter values were s = 0.1, m = 0.05, M = 0.2, ,
, and
. Disequilibria in the migrant pools were
,
(where * indicates M, C, or MC),
, and
.
Numerical examples: Numerical examples allow us to compare the cytoplasmic and three-locus disequilibria generated by pollen and seed migration in the full nuclear-mtDNA-cpDNA system with the disequilibria in each of the two cytonuclear subsystems. Previous work has found that 0.1 is roughly the minimal detectable level of two-way disequilibria, given reasonable sample sizes and marginal frequencies (Basten and Asmussen 1997). Generally, both permanent and transient disequilibria generated via pollen migration alone are smaller in magnitude than disequilibria generated via a comparable amount of seed migration alone, as was found previously for the cytonuclear subsystems by Schnabel and Asmussen (1992). Higher selfing rates increase the magnitude of disequilibria and slow the decay of transient disequilibria. Both permanent and transient three-way disequilibria tend to be smaller than the corresponding two-locus or joint disequilibria, as would be expected of measures for higher-order effects.
Figure 3 shows an example where the resident population is initially monomorphic at each marker (U11 = 1.0) and receives genetically distinct migrant pollen and seeds (
Figure 4 gives a case where the resident population receives only migrant seeds and no migrant pollen (M = 0) and is initially fixed at the nuclear and mitochondrial loci (P = XM = 1.0), but polymorphic at the chloroplast locus (XC = 0.8). In contrast to the population shown in Figure 3, here the migrant seeds have no cytonuclear disequilibria, and there can be no intermigrant admixture effects with only one type of migration; therefore, no permanent nonzero disequilibria are generated. However, transient disequilibria are generated by differences in the nuclear, mitochondrial, and chloroplast frequencies between the original population and the migrant seeds (
DICYTONUCLEAR EQUILIBRIUM STRUCTURE FOR SPECIAL CASES
To further analyze the effects of the two different forms of gene flow and the mating system on the generation of permanent disequilibria, we consider the equilibrium structure under a series of important special cases subsumed within this general framework. These include (1) seed migration alone (0 < m < 1, M = 0), (2) pollen migration alone (0 < M < 1, m = 0), (3) complete random mating (s = 0), (4) complete self-fertilization with seed migration (s = 1, 0 < m < 1), (5) equal nuclear allele frequencies in the two migrant pools (
Seed migration alone (0 < m < 1, M = 0): Populations that receive gene flow solely through seeds have a very different equilibrium structure from those receiving both forms of gene flow. With no pollen migration, all of the allele frequencies approach those in migrant seeds (
Trajectories of transient nonzero disequilibria. Initial conditions were U11 = 0.8, U12 = 0.2; parameter values were s = 0.9, m = 0.05, M = 0, ,
,
, and
. All disequilibria in the migrant pools are zero. Note differences in scale between left and right.
The final three-way allelic disequilibrium,
Pollen migration alone (0 < M < 1, m = 0): An even more distinctive equilibrium structure results when pollen is the only form of gene flow, as might arise in wind-pollinated and other species where pollen travels much farther than seeds. When there is no seed migration, both the nuclear and chloroplast allele frequencies approach those in migrant pollen (
Complete random mating (s = 0): The overall effects of the mating system are illustrated by the extremes with either complete random mating or complete selfing. In a completely random mating population, all equilibrium values simplify somewhat but retain the same overall form as in the general case (e.g.,
Complete self-fertilization with seed migration (s = 1, 0 < m < 1): Completely selfing populations are distinct, as a result of having both a unique mating system and being closed to pollen flow. Those receiving seed flow are like mixed-mating populations with seed migration alone in that allele frequencies converge to those in migrant seeds (
Equal nuclear allele frequencies in migrant pollen and seeds (
Equal chloroplast frequencies in migrant pollen and seeds
Equivalent migrant pools (
No migrant disequilibria (
DISCUSSION
Unidirectional pollen migration and seed migration have distinctive effects on the dicytonuclear structure of plant populations in which mitochondria are inherited maternally and chloroplasts paternally, as appears true in many conifer species. We have provided an extensive analysis of such systems by a direct extension of previous pollen and seed dispersal models for two-locus cytonuclear systems with either maternal or paternal cytoplasmic inheritance (Asmussen and Schnabel 1991; Schnabel and Asmussen 1992). Explicit analytic formulas are given for the effects of population admixture (via seeds or individuals) upon cytoplasmic and cytonuclear disequilibria, as well as for the full dicytonuclear equilibrium structure for selectively neutral markers in random mating, mixed mating, and purely selfing populations receiving gene flow through pollen and seeds. We are particularly concerned with the extent to which nonrandom associations among the three loci can be used as indicators of pollen and seed flow.
Permanent two- and three-locus disequilibria are generated in three main ways in this system. First, nonrandom associations are generated in the resident population if there are nonzero disequilibria in the migrant pools, especially in migrant seeds. Because of this, seed migration is of special importance for the generation of permanent disequilibria. Pollen migration alone can only generate permanent nuclear-chloroplast and joint three-locus disequilibria, and only if there are nonrandom associations between nuclear and chloroplast alleles in migrant pollen (
The second general cause of permanent nonrandom associations among the three loci is through intermigrant admixture effects due to differing cytonuclear compositions in the two migrant pools (Schnabel and Asmussen 1992). For example, joint differences in nuclear and chloroplast allele frequencies between migrant pollen and seeds (
The third and final general source of permanent disequilibria comes from the complex intermigrant effects that generate joint and three-way three-locus disequilibria. These new interactions involve all three genomes and require either allelic disequilibria in both migrant pools or differences in allele frequencies in the two migrant pools plus nonrandom associations in migrant seeds. For example, three-way genotypic associations can be generated by nuclear-mitochondrial allelic disequilibrium in migrant seeds in conjunction with nuclear-chloroplast allelic disequilibrium in migrant pollen (
Even when no permanent disequilibria are generated, transient disequilibria can be produced by admixture between the migrants and the resident population. Such transient associations can reach quite high values and persist for relatively large numbers of generations. In long-lived organisms such as trees, generation times can be quite long, indicating that measurable nonrandom associations may be found even in situations where permanent disequilibria are not possible.
The magnitude of disequilibria formed by gene flow (and thus the ability to detect such nonrandom associations in natural populations) depends on whether populations are censused prior to or after mating and segregation can break up associations formed by admixture. We have focused here on censusing adults; in general, assaying three markers from adult tissues will be easier than doing so from seeds, especially for species whose seeds are small. However, the model presented here can be readily extended to populations of species such as conifers, which are easily censused at the seed stage (Conkle 1971), following the same approach used in the prior two-locus models (Asmussen and Schnabel 1991; Schnabel and Asmussen 1992). The equilibrium frequencies and disequilibria in seeds are readily obtained by substituting the adult equilibria into the right-hand sides of the interim seed values given in appendix a. For either adult or seed census data, the statistical significance of the joint disequilibria can be determined using the procedures for multiallelic markers developed by Basten and Asmussen (1997), treating the four joint cytotypes as four alleles at a single cytoplasmic marker. The three-way disequilibria require the development of new statistical theory, which will be presented elsewhere.
Three-locus nuclear-dicytoplasmic data from plants with both paternally and maternally inherited organelles represent a unique opportunity to more accurately estimate both the absolute and relative rates of pollen- and seed-mediated gene flow. Analysis of joint genotype frequencies and nonrandom associations among all three loci can allow a better understanding of the relative roles that the mating system and gene flow play in shaping the genetic structure of such populations than has previously been possible. The dicytonuclear migration model presented here, along with previous work on plant cytonuclear systems (Schnabel and Asmussen 1989, 1992; Asmussen and Schnabel 1991), completes the necessary background for understanding the effects of asymmetrical gene flow upon the cytonuclear and dicytonuclear structure of plant populations. Together, these results provide the theoretical foundation for a new method for estimating pollen and seed gene flow from such joint cytonuclear and dicytonuclear data, which is presented in a companion article (Orive and Asmussen 2000).
Acknowledgments
We thank Chris Babcock, Mike Goodisman, Deb Overath, María Sánchez, Janet Hatt, and two anonymous reviewers for their helpful comments and suggestions on this work, and Sally Otto for her notational advice. Part of this work was completed while M.E.O. was a fellow at the Institute of Cell, Animal and Population Biology, University of Edinburgh, and we thank Nick Barton and his lab for their assistance. This research was supported in part by National Institutes of Health grant GM-48528 and National Science Foundation grant DEB 92-10895 to M.A.A. and by a National Science Foundation NATO postdoctoral fellowship and a University of Kansas New Faculty General Research Fund grant to M.E.O.
APPENDIX A: INTERIM SEED VALUES
Marginal frequencies and disequilibria in cytonuclear subsystems: The interim nuclear and cytotype frequencies in seeds are
Three-locus disequilibria: The interim three-locus joint allelic disequilibrium in seeds is
Finally, we can write the interim three-way, three-locus allelic disequilibrium in seeds as
APPENDIX B: EQUILIBRIA FOR THE TWO-LOCUS CYTONUCLEAR DISEQUILIBRIA
The final nuclear-mitochondrial disequilibria are
APPENDIX C: EQUILIBRIA FOR THE THREE-LOCUS DISEQUILIBRIA
We give here the details of the full forms for the final three-locus disequilibria. These are given in terms of the six constant factors,
For the homozygote three-locus joint genotypic disequilibrium,
For the heterozygote three-locus joint genotypic disequilibrium,
The constants multiplying the various terms for the three-locus, three-way allelic disequilibrium,
The constants for the three-locus, three-way homozygote disequilibrium,
Finally, the constants for the three-locus, three-way heterozygote disequilibrium,
Footnotes
-
Communicating editor: A. G. Clark
- Received November 25, 1999.
- Accepted February 14, 2000.
- Copyright © 2000 by the Genetics Society of America