Abstract
Equilibrium probabilities of identity by descent (IBD), for pairs of genes within individuals, for genes between individuals within subpopulations, and for genes between subpopulations are calculated in metapopulation models with fixed or varying colony sizes. A continuous-time analog to the Moran model was used in either case. For fixed-colony size both propagule and migrant pool models were considered. The varying population size model is based on a birth-death-immigration (BDI) process, to which migration between colonies is added. Wright's F statistics are calculated and compared to previous results. Adding between-island migration to the BDI model can have an important effect on the equilibrium probabilities of IBD and on Wright's index.
WRIGHT'S island model considers “a population subdivided into random breeding islands with populations of size N of which the proportion m consists of immigrants that may be considered a random sample of the total species” (
Wright 1969). The model also assumes that (1) the populations have discrete nonoverlapping generations, (2) individual populations do not fluctuate in size, and (3) migration is uniform over all islands. Under those assumptions and at steady state, Wright showed that F
ST, the correlation between uniting gametes, could be written as a simple function of Nm, namely
Also implicitly assumed is that the rate of mutation is weak relative to the stronger rates of migration and random genetic drift. Despite the demanding assumptions of this model,
Equation 1 has been widely used to obtain indirect estimates of gene flow among populations.
Wright's F statistics, F
IS and F
ST, where the subindices refer to individuals, subpopulations, and total population, are expressed in terms of probability of identity by descent (IBD) as
and
where the
are the probabilities of IBD (
Malécot 1948), θ
1 for pairs of genes within individuals, θ
2 for genes between individuals within subpopulations, and θ
3 for genes between subpopulations (
Cockerham and Weir 1987;
Rousset 1996); precise definitions are given below. The index F
IS measures the extent to which genetic variation in a subpopulation is explained by variation within individuals, and F
ST similarly the degree to which the total genetic variation is accounted for by variation within subpopulations. Recently, attempts have been made to relax some of the assumptions of Wright's island model (see
McCauley 1993 for a review). Among these,
Maruyama and
Tachida (
1992) derived an equation analogous to
Equation 1 in the case of a metapopulation with a discrete time propagule model of extinction/ recolonization (each extinct colony is replaced by a copy of a randomly selected existing colony) and selfing. This was achieved by deriving recursion equations for θ
1, θ
2, and θ
3 and calculating the equilibrium values. On their part,
Rannala and
Hartigan (
1995) investigated the effect on IBD probabilities of fluctuating population sizes in a continuous time setting. They assumed a collection of disjunct patches inhabited by haploid individuals whose number changes over time due to mortality, birth, and immigration from a large external population and studied the corresponding demographic model based on a linear birth-death-immigration process. Because there is no migration between subpopulations in the model their study refers basically to a single colony population. It was found that given the size N of the system at some point t in time the conditional probability θ
2 =θ
2(t, N) only depends on the birth and immigration rates but not on the death rate and not on N or t. This led to some conclusions on the applicability of their results also to transient situations where the population size has either not yet reached a stationary distribution or does not settle into a steady state.
In the present article we extend both models. In the first part a continuous-time version of the Maruyama and Tachida model is given and extended to a migrant pool extinction model. Detailed arguments are provided for deriving the systems of ordinary differential equations satisfied by the IBD probabilities, starting out from the basic mutation/reproduction balance and successively adding migration and extinction mechanisms. In the second part of the article, migration between islands and selfing are added to a diploid version of Rannala and Hartigan's model. An approximation for Wright's index at population genetic equilibrium is found. It is shown in particular that when there is migration among islands as well as immigration from a large source with a fixed probability of IBD (the situation modeled by Rannala and Hartigan), the death rate may have an important effect on probabilities of IBD and FST.
GENERALITIES AND SCOPE
Our purpose is to model and study genetic diversity at individual, colony, and total population levels. The distribution of genetic diversity in the population is affected by five agencies: (1) mutation, (2) reproduction, (3) extinction and subsequent replacement of colonies, (4) migration between colonies and/or from an external source, and (5) population size.
We assume that mutation follows an infinite-allele model (Kimura and Crow 1964), which assumes that every mutation creates an allele that is not currently existing in the population. Under this model, two alleles that are identical in state are also identical by descent. Two types of reproduction systems are considered: asexual reproduction and sexual reproduction. Sexual reproduction includes both random and mixed mating (part of the population mating at random, the other part selfing). An extinct colony can be replaced by a given number of individuals originating from one of the existing colonies (propagule model; Slatkin 1977) or by a given number of individuals randomly sampled over the colonies (migrant pool model; Slatkin 1977). In either case, the initial size of the population is restored instantly. Alternatively, colonies can be gradually recolonized, allowing varying population sizes. Finally, colonies can exchange individuals (seeds) or single gametes (pollen).
We consider two models of basically different character, a fixed-size model where the colonies are equally large and of given size, and a variable-size model in which the sizes of the subpopulations are given by birth-and-death processes. We introduce next the various parameters and the precise dynamics in both models.
Model parameters: In both models there are n colonies. For the fixed-size model we let N denote the number of diploid individuals in each colony and k be the number of colonists. The distribution of genetic material evolves according to the nonnegative Poisson intensities
u, mutation of an allele into a new type per gene,
1/2, reproduction/compensatory death per individual,
λ, extinction/recolonization per colony,
ms, seed migration between colonies per individual.
In addition, two probabilities enter as parameters, namely,
The corresponding set of parameters for the variable-size model are the Poisson intensities
u, mutation of an allele into a new type per gene,
1/2, reproduction per individual,
δ, death per individual,
β, immigration of a new individual from the mainland per colony,
ms, seed migration between colonies per individual,
and the selfing probability s, 0 ≤ s ≤ 1. The reproduction intensity of 1/2 was retained to conform with the usual choice of normalization. In that case, with probability h/2 + o(h) one individual, and hence with probability h + o(h) one of two alleles in different individuals considered at time t + h, is the result of reproduction during (t, t + h].
Mutation acts in the same way in both models. It is the basic cause of reduction in identity by state of the genes in the population. The other mechanisms require more precise descriptions.
Reproduction: Obviously the main difference between fixed and variable size is that in the first case the Moran model mechanism of automatic compensating deaths following the formation of every new individual applies (Moran 1958), whereas in the latter a birth event more realistically increases the size of the population by one unit. In the variable-size case the assumption δ> 1/2 will guarantee that the population does not grow without bound and the assumption β > 0 will prevent permanent extinction.
To define reproduction in the fixed-size model we distinguish first of all whether reproduction occurs by random mating or asexually, and treat these cases separately. In both cases we assume the Moran model mechanism of automatic compensating deaths.
The asexual case is simpler. Here, in the event that the Poisson intensity signals that an individual is going to reproduce, a genetic copy of this diploid individual is produced followed by the death of one of the N original individuals. It is worth noting that the killed individual may thus be the one which itself triggered the production of the new.
The more complicated sexual reproduction case also involves the selfing parameter s as follows. Consider one of the N individuals and suppose its Poisson clock goes off. This we understand as an instruction to first make a copy of one of its own genes that is to form the first gene in the pair of genes representing the new individual. To find the second gene in the new pair there are two possibilities. Namely, with probability s the reproducing individual selects the second gene again to be a copy of one of its own, and with probability 1 – s it selects the second gene to be a copy of any one of the 2N existing genes sampled uniformly in the subpopulation. The actual selfing probability is therefore s + (1 – s)/ N rather than s. Again the formation in this manner of the new individual is instantly followed by a compensatory death.
In the variable-size model, for simplicity we restrict the analysis to sexual reproduction only.
Extinction/recolonization: As already stated we cover the two schemes introduced by Slatkin (1977) within the fixed-size model. The first mechanism is the so-called propagule recolonization, which means that upon extinction of a colony, k individuals are picked randomly in one of the n existing colonies to replace it. The second is the migrant pool. The difference is that now the replacement colony is put together by sampling k individuals uniformly over the whole population. Cases intermediate between those two extreme models are also taken care of by considering the probability of common origin of two colonists (Whitlock and McCauley 1990). In all cases, the k individuals then reproduce to restore the size of the population.
For the variable-size model one can say that deaths and immigration replace extinction/recolonization. Even if colonies may go extinct from time to time, influx from an external source is guaranteed. The effect of β is that external individuals enter into each colony independently, and an additional assumption will then be made regarding the genetic relationship between existing and immigrating gametes.
Migration: To each individual is associated the seed migration intensity ms, which, when its Poisson clock rings, forces this individual to select randomly and uniformly one of the existing individuals in the population and then exchange position with the particular individual that was chosen. For example, in the fixed-size model this yields a probability 1/nN that nothing at all happens. Pollen migration, on the other hand, is a refinement of the random mating mechanism. We let mp denote the probability that one of the genes composing a newly created individual due to pollen migration should be considered as being selected randomly in the full existing population and not only within its own colony.
Now we introduce more formally the IBD probabilities that we wish to study in this work. Pick two different alleles, A and B, randomly in the population. We consider the probabilities θk, k = 1, 2, 3, that A and B are identical by descent given their hierarchical “distance” k,
θ1, if A and B belong to the same individual,
θ2, if A and B belong to different individuals in the same colony,
θ3,if A and B are taken from different colonies in the population.
FIXED-POPULATION SIZE MODEL
Mutation and reproduction: We consider the IBD probabilities θk(t), t ≥ 0, k = 1, 2, 3, as functions of time, and investigate to what extent they change during a small interval of time (t, t + h], h > 0. Pick two alleles at hierarchical distance k from each other and observe them at time t + h.
First suppose that only mutation is in effect. Because the probability of a mutation in (t, t + h] of one of the selected genes is proportional to 2uh,
hence,
and letting h tend to zero,
The solution
ultimately tends to zero as a result of the assumed infinite-allele model.
Now we include reproduction, which stabilizes the genetic structure in the population and counteracts the diversifying effect of mutation through genetic drift toward fixation. The balance of mutation and genetic drift then leads to stationary probabilities θ1 and θ2. However, there is no genetic drift over colony borders, and hence for k = 3 the only solution is still θ3 = 0.
Asexual reproduction: Assume that all reproduction is asexual. Then the genetic relationship between two genes in a single individual does not change in case that individual was just formed as a copy of a previously existing individual. Mutation, on the other hand, will affect their interrelationship, forcing the IBD probability to vanish asymptotically as in (2). Hence the only stationary solution for θ1(t) is θ1 = 0.
For θ
2(t) note that with probability h + o(h) one of the chosen genes belongs to an individual that during (t, t + h] was produced by asexual reproduction. Then one of the two individuals involved, call it A, existed at time t and the other one, call it B, has been replaced by a copy of a randomly picked individual that existed at time t. With probability 1/N, B happens to be a copy of A in which case the two chosen genes must be considered belonging to the same individual. In this last case we have selected one gene in A and one gene in the copy of A called B. But the probability that two different genes in the same individual are IBD at t is θ
1(t), and thus our two genes are IBD with probability 1/2 + θ
1(t)/2. With probability 1 – 1/N their IBD probability remains at θ
2(t). Hence,
so that
and thus
The stationary solution for θ
2(t) is obtained by taking θ′
2(t) = θ
1(t) = 0 above; thus
which is half the value of the stationary solution of θ
2(t) in a haploid model.
Sexual reproduction: We start by deriving an equation for θ
2(t). Consider at time t + h two alleles, a and b, from two different individuals in the same colony and suppose one of them, a say, belongs to an individual that was created sexually in the same time interval elapsed since time t. With probability 1/N, allele a is copied from the individual to which b belongs, in which case they are identical by descent with probability 1/2 + θ
1(t)/2. With probability 1 – 1/N, the gene a was copied elsewhere; hence their IBD probability remains θ
2(t). Some reflection shows that these same relations hold whether or not the individual decided to self. Hence,
This is the same equation as the one obtained for the asexual case, thus again leading to
The corresponding equation for θ
1(t), however, is different. In fact, by splitting selfing from nonselfing, weighted by probabilities s and 1 – s, we find
leading to
Relation (
4) shows that at equilibrium
By inserting this into (
5) taking θ′
1(t) = 0 we obtain the stationary solutions for mutation/genetic drift balance. They can be written
Note that the case
which corresponds to an actual selfing rate s + (1 – s)/N = 2/N, leads to the traditional expressions
Turning to Wright's indices, we obtain for the asexual case described previously,
and for the sexual system (
7,
8),
Negative values of F
IS, which occur for s < 1/(N – 1) in the sexual case, indicate that two genes in different individuals are more likely to be IBD than two genes in the same individual.
Adding migration and extinction: The next step toward enriching the applicability of the model and taking into account the colony structure that until now did not enter the derivations, is to add migration and extinction. Seed migration: To include seed migration we must consider the equations for θ
2(t) and θ
3(t), noting that migration of individuals does not affect θ
1(t). Consider first a pair of genes selected in the same colony. Then the intensity of a migration event in (t, t + h], which corresponds to selecting the two genes, in fact, from different colonies is 2m
s(1 – 1/n). Hence
Similarly, if two genes are selected in different colonies at time t + h the intensity is 2m
sh/n such that they are considered to be stemming from the same colony at time t due to migration during (t, t + h]. Hence
Together with the previously derived
Equation 5 for θ
1(t), these relations yield in the limit h → 0 the coupled system of equations
By solving the system of Equations
11,
12, and
13 with vanishing left-hand terms for its stationary solution (θ
1, θ
2, θ
3) in steady state, we find the equilibrium relation
which replaces (
6), and then
where
Pollen migration: Introduce the total migration parameter
To include migration of pollen, we start again with θ
1(t). During the mating phase the probability is m
p that one gene ends up in the new individual from a migrating pollen; hence the probability is m
p(1 – 1/n) that it originated in another colony. Therefore the term m
p(1 – 1/n)θ
3(t) enters the differential equation for θ
1(t), and the corresponding reduction occurs for the other, random mating terms, more exactly
The differential equation for θ
2(t) modifies similarly. Exploiting the notation m introduced above we see that
Equation 12 changes into
Then we need to take into account that, because of pollen migration, reproductions in (t, t + h] now will affect θ
3. With probability m
p/n the selected genes belong to the same colony. This leads to
In steady state the relation between θ
2 and θ
3 is found from (
17) and (
18),
and the analog of system (
15) is obtained similarly. In particular, it is seen that
Equation 14 still holds in this case with α replaced by
To complete the analysis of the migration model, it remains to do the algebraic manipulations leading to the following expressions for Wright's indices:
and
where
In particular, for m
p = 0,
Extinction/recolonization: The recolonization event consists of two steps that are supposed to occur instantly in case of extinction. First k,1 ≤ k ≤ N, founder individuals, or colonists, are selected from a single colony (propagule model) or randomly over the n colonies (migrant pool model). The N – k additional individuals are produced by sexual reproduction from the k colonists. To avoid unnecessarily complicated expressions we restrict to the case mp = 0, ignoring pollen migration.
The simplest case is the propagule model with k = N colonizers, in which an existing colony is copied and reinstalled in full to account for recolonization in the event of extinction. It is clear that extinction and the consecutive reinstallment of a colony have no effect upon θ
1 or θ
2. Hence we derive the stationary solution for θ
3(t), taking into consideration the possibility that one of the two colonies representing the pair of genes selected for the determination of θ
3(t + h) goes extinct during (t, t + h], the probability of which is 2λh. The probability is 1/n that the particular replacement colony is chosen to which the other gene of the given pair belongs, and the probability is 1/nN that even the same individual is chosen to which the other gene of the given pair belongs. Thus
and so, replacing
Equation 13,
The modified equation for θ
3(t) results in the equilibrium probabilities
where
Moreover,
and
We observe that the case λ > n/2 is a nonbiological regime where the effect of recolonization is so strong that it is more likely for two genes to be identical by descent if they are picked from separate colonies rather than from the same. Restricting to λ ≤ n/2, we note that F
ST is decreasing as a function of λ, u, and m
s. The effect of increasing the number of colonies n is to reduce the effect of extinctions and enhance the effect of migration on F
ST.
Next we turn to the general propagule and migrant pool schemata with k colonizers. The Equations
11 for θ
1(t) and
21 for θ
3(t) with k = N remain unchanged, whereas
Equation 12 for θ
2(t) changes. To find the modified equation for θ
2(t), suppose we pick at time t + h two alleles from two different individuals in a given colony. With probability λh + o(h), the colony suffers extinction and is recolonized by k founders, all this within (t, t + h]. Let
noting a
1 = 1, a
N = 0. If the two alleles represent two different founders of their colony we make a further distinction by introducing
so that in the propagule model we have ψ = 1 and in the migrant pool model ψ = 1/n.
Thus, temporarily switching off mutation, reproduction, and migration and considering only extinction/recolonization,
and therefore
In combination with (
12) we find the new equation
The corresponding steady-state solutions (
22) change into
where
and
It remains to calculate the probabilities α
k, a task that can be rephrased as follows. Suppose we start with k individuals marked 1,..., k, and select from them by random sampling with replacement 2(N – k) allele copies, forming N – k new individuals. This scheme yields a colony of 2N genes classified in k categories that are represented by at least one gene each. Now select randomly with equal probabilities two individuals and in each of them a gene. We want to find the probability a
k that both genes belong to the same category (originate from the same founder). However, as long as the two chosen individuals do not both belong to the original group of size k, the probability is 1/k that they are marked with the same number. Hence,
A simpler scheme is to assume that a new colony consisting of N individuals is instantly sampled from the k founding individuals. Then
The exact expressions for Wright's indices are somewhat unwieldy in this case, so we defer the further discussion to the final section where we apply some approximations and state a result.
IBD PROBABILITY IN VARYING POPULATION SIZE
An obvious drawback with the previous model is the restriction to fixed colony sizes N throughout the population. In population genetics modeling it has been natural to apply simple birth-death-immigration (BDI) models to widen the scope toward varying population size. In particular, there is a rich probabilistic structure in haploid populations based on the demographic decomposition into the number of families of allele types with a given number of representatives in the population (Tavaré 1989). Rannala and Hartigan (1995) explore some of these ideas for computing IBD probabilities in a metapopulation run by birth, death, and immigration. Rannala (1996) considers the corresponding sampling distribution of alleles under the same model and derives an expression for FST, which then coincides with θ2 in our notation.
We take a different approach and extend the analysis of Rannala and Hartigan to include, under equilibrium conditions, selfing and migration between colonies. Our study may also be seen as a further alternative to the extinction/recolonization scheme in the fixed-size model.
A birth-death-immigration model: We start afresh from the situation that led to Equation 15, keeping seed migration but replacing the size-preserving Moran mechanism by death and immigration, as explained in generalities and scope.
Within this setting, δ introduces the potential for local population extinctions and β that for local recolonizations. Out of several possible schemes regarding the probability of IBD between a newly immigrated gene and an existent one, we assume that all immigrants are taken from a pool of individuals such that
We assume that only seed migration is in effect. Hence individuals are exchanged between colonies without changing the population size. Let
and
It is clear that N
popt is a continuous-time Markov birth-and-death process and that the components in the vector of colony sizes (N
1t,..., N
nt) are dependent random processes.
The conditional jump probabilities at a time t, such that N
popt = r, are given by
This is a standard birth-and-death process, known as a BDI process with birth intensity ½, death intensity δ, and immigration intensity nβ. For each δ > ½, it has a negative binomial distribution as its stationary steady-state distribution. We let N
pop denote a random variable having this distribution. From properties of the BDI process, for large t,
Moreover, expectation and variance are given by
Next we study the sizes of the single colonies. We refer to their stationary distributions by means of random variables (N
1,..., N
n). By symmetry we immediately have
Furthermore, in the special case m
s = 0 the n components N
j are obviously independent and thus each N
j possesses a negative binomial distribution with parameters 2β and 2δ. In fact, their sum is then also negative binomial with parameters 2nβ and 2δ, in agreement with (
27). However, we do want to consider the case m
s > 0 where the colony sizes are no more independent.
To this end, we note that in the present framework migration corresponds to the death of an individual in one colony followed instantly by immigration into a randomly chosen colony. Hence, focusing on a specific colony size, N
jt say, the total external immigration intensity at time t is given by
because a migrating gene from any other colony will end up in colony nr j with probability 1/n. The approximation should be valid at least for large values of n in view of the law of large numbers. Hence we introduce for each component N
jt a local immigration intensity.
and, similarly, a local death intensity by
Indeed, a migrating gene from colony nr j is going to enter a different colony with probability 1 – 1/n.
As an approximation for the colony size we have thus found a slightly different BDI process. Its conditional jump probabilities given that N
jt = r are
Consistent with our previous results the corresponding average colony size at stationarity is again
It is interesting to compare the variance for the components N
j with the variance of their sum N
pop in the dependent case m > 0. We obtain
It is a simple observation that V(N
j) is maximal for m
s = 0 [where of course V(N
pop) = nV(N
1)] and decreases toward its minimum value, coinciding with the mean E(N
j), as the migration rate m is allowed to increase. In this sense migration is seen to stabilize the system and counteract fluctuations in the colony sizes.
Probabilities of IBD in the varying-population-size model: Because in the present model with positive probability some colonies or even the whole population may be extinct from time to time, the definition of the IBD probabilities θ
k is not as straightforward as in the previous constant-size model. To define θ
2 there must be at least one colony with two individuals, and to define θ
3 at least two nonempty colonies. It is reasonable to argue that otherwise corresponding weights should be assigned to let θ
3 = 0 and θ
2 = 0, respectively. However, we have, for example,
which suggests that we simply ignore this effect. Hence to derive a balanced differential equation for the IBD probability within a colony we assume that at some time t a colony j has size
, where r ≥ 3, and we let θ
2(t) refer to the probability that two genes from different individuals in this colony are identical by descent.
Now we go over the various events that may occur in (t, t + h]. First of all, with intensity there was a death leading to r – 1 ≥ 2 individuals in the selected colony at time t + h. However, the removal of r – 1 such potential pairings does not change the probability of IBD for the remaining pairs (cf. Appendix in Rannala and Hartigan 1995).
Next, with intensity
an individual immigrated between t and t + h, increasing the colony size from r to r + 1. With probability
a gene from this newly immigrated individual is selected to the pairing needed to determine θ
2. If the colony immigrant is an external immigrant to the population this results in the probability f for the pair to be identical by descent. If the colony immigrant migrated from another colony the gene pair is related according to θ
3. Otherwise, if the immigrant was not chosen for the pair, the probability for IBD remains the same as before immigration.
Finally, with intensity r/2 a birth occurs in (t, t + h] such that again the size changes from r to r + 1. As a pair of genes A and B are selected at time t + h, the probability is 2/(r + 1) that one of them belongs to the newly reproduced individual. Let us suppose that A does. Then we have two possibilities. With probability 1/r gene A was produced by random sampling from the individual that gene B belongs to, in which case the probability of IBD is 1/2 + θ1(t)/2. With probability 1 – 1/r the relationship between A and B is given simply by θ2(t).
Summing up,
Hence,
By a similar argument
Turning to θ
3 suppose two colonies i and j have been chosen. We must determine the effect of immigration into any one of them during (t, t + h]. Suppose colony i of size r is subject to external immigration (or colony j of size q). Then with probability 1/(r + 1) the immigrant belongs to the select pair resulting in IBD probability f. Similarly, if the immigrant is a migrant from j, then again with probability 1/(r + 1) the IBD probability of the select pair changes, this time into θ
2. The intensities are for the first event β and for the second approximately mE(N
j)/n. Hence
At this point we observe that the steady-state probabilities corresponding to the balanced equations we have derived will depend on r and q unless u = 0. In the present model the external immigration parameter β has a similar effect and purpose as mutation, namely, to guarantee fresh import of genetic material and enhance genetic diversity. The difference is that mutation acts only on the individual level while immigration occurs in colonies regardless of their sizes. We therefore make the approximation in the sequel to ignore mutation and simply restrict the case to u = 0. Moreover, because θ′
1(t) = 0 implies
and thus
for large r, which is again the consistency relation (
34), we are led to study the balanced equations
The solution is given by
where now
In conclusion we have found, under the simplifying assumptions of mutation being negligible and the typical size of a colony being large, that the conditional IBD probabilities at equilibrium for given colony sizes are expressed by the relations (
30). Again relying on, e.g., (
29), we let expression (
30) represent the unconditional IBD probabilities for the variable-size model. Thus Wright's genetic indices for the BDI model turn out to be
and
Interestingly, the last quantity depends on neither f nor n.
DISCUSSION
Relation of fixed-size model to previous work: We compare the expressions derived so far with those previously obtained by
Maruyama and
Tachida (
1992) and
Whitlock and
McCauley (
1990). To recover the results of
Maruyama and
Tachida (
1992) from (
20) and (
23), or the combination of them, we observe that if u, λ, and m are small compared to N, then
with
hence
These are the same approximate results as those found by
Maruyama and
Tachida (
1992). However, it should be kept in mind that in the quoted article the parameters λ, u, and m
s are probabilities, whereas in our study they denote Poisson rates.
We take these findings as a basis for claiming that the continuous time approach has some advantages over the discrete analog for investigating the detailed influence on Wright's indices of the various random mechanisms that change population genetic diversity.
We continue with some additional remarks on the approximate analysis discussed above. It is interesting that the equilibrium relation (
14) is still valid for the approximative solutions, that is,
Further insight into the background of this approximation is gained from the following simple argument. Start from the reasonable assumption that θ
1 and θ
2 can only differ if individuals are subject to selfing. Suppose that a fraction s of the population is selfing, so that if an individual is picked randomly in the population it reproduces by selfing with probability s. Then we are led to the consistency relation
This gives an alternative simple relationship between θ
1 and θ
2, and it is easy to check that
and
are indeed related by
At this point we emphasize one of the refinements in our model relative to the approximation discussed in this subsection, that θ
1 and θ
2 may actually differ from reasons other than selfing. In fact, when we form a pair of genes the total mutation rate is 2u irrespective of whether they belong to the same individual or not, but the genetic drift differs. If we select the pair, within an individual the infinitesimal rate is only 1/2 for reproduction to possibly affect the IBD probability, whereas if two individuals are involved they both contribute, yielding a total rate for θ
2 twice as high as that of θ
1.
For the case of extinction/recolonization with k founding individuals, we apply the approximation (
34) and obtain from (
25)
One can check that F
ST is increasing in λ if and only if
which is trivially true if m
s + u ≥ (1 – ψ + 1/n)/ψ, i.e.,
In particular, for the propagule case, ψ = 1, and with k < N and large n, extinction always favors genetic fixation in the sense that F
ST increases with λ. This behavior has been observed previously (
McCauley 1993) and should be compared to the pure propagule model with k = N for which F
ST is a decreasing function of λ.
The remaining case
includes migrant pool and mixed versions of propagule and migrant pool recolonization. Specializing to the choice a
k = 1/k in (
26), condition (
36) takes the form
where the final approximation is for large N and negligible mutation. This is similar to the condition obtained by
Whitlock and
McCauley (
1990). For alleles moving as gametes (their Equation 10a), they obtained
where m denotes the proportion of migrating gametes in each generation, so that m
s = 2m, and φ is the probability of common origin of gametes. According to
Whitlock and
McCauley (
1990, p. 1720), “when alleles move only as diploid individuals, with equal representation of parental populations,”
(
Whitlock and McCauley 1990,
Equation 7). The last two relations together imply that
which is indeed similar to the condition at which we arrived.
Finally, it is instructive to look at the expressions for F
ST as N → ∞ and k is fixed. In the migrant pool case φ = 1/n,
and in the propagule case φ = 1
Contrary to the situation without extinction/recolonization or to the propagule model with k = N, for which F
ST → 0 as N → ∞, the limit value of F
ST is now strictly positive. This is, of course, due to the fact that the founder effect that follows recolonization restores a substantial amount of genetic drift.
Discussion of the variable size model: Writing N
col for the size of a typical colony we have equivalently to (
31)
which should be compared to the fixed-size model results, e.g., (
33). This expression stresses the fact that migration and population size cannot be easily separated in the variable population model because the “migration term” now comprises the quantity δ – ½, which together with the immigration intensity determines the average population size.
In the special case m
s = 0, for which
, (
37) simplifies further into
hence notably the death intensity parameter δ disappears entirely from Wright's index. This is basically the result derived by
Rannala and
Hartigan (
1995) in the case s = 0, taking into account that they assume θ
3 = 0. Taking in their notation birth intensity λ = ½ and immigration intensity φ = 2β, they found
in agreement with our result. The approach in
Rannala and
Hartigan (
1995) is based on properties of the conditional allele distribution given a colony size N and hence does not preclude the transient situation δ < λ for which there is no steady-state distribution of population size or colony size. Although their conclusion is justified for the simpler BDI model, with m
s = 0, we have found that the probability of IBD also depends on δ and m
s. Moreover, for small values of δ–½ the resulting equation (
31) can be highly sensitive to small changes in the parameter m
s (see also
Gaggiotti 1996). In our extension of the BDI model, m
s > 0, the assumption δ > λ = ½ is crucial for the mathematical derivations. It is natural therefore to ask whether it is possible to include the case m
s > 0 in the derivation that leads to (
38). To indicate heuristically what may happen in this situation, suppose that the dynamics of a large number of colonies are given by the BDI model with parameters β and δ, and suppose that migrants are exchanged at rate m
s between colonies. Focus on a single colony. At a fixed time t since the populations were initiated (N
0 = 0), we may argue as in (
28) that, due to migration between colonies, β should be replaced by the effective immigration parameter
If we apply this approximation and follow the approach of Rannala and Hartigan, then
Equation 38 (with f = 0) yields
In the subcritical case δ > ½ we recover
Equation 31 (s = 0) as t → ∞. For fixed t, however, the expression for F
ST is now continuous in δ > 0 and seems to exhibit less dramatic change when we compare m
s = 0 and m
s > 0.
Our model as well as that of Rannala and Hartigan assume (i) the presence of a continuous flow of immigrants from a mainland in which allele frequencies remain constant and (ii) the absence of spatial structure. Clearly, in many biological situations these are strong assumptions. Therefore a natural next step is to study probabilities of IBD in a metapopulation where colony sizes are regulated by a birth-death process, without external immigration, and in which the migration rate among colonies depends on their spatial location. The demography of similar systems has been studied (e.g., Pollett 1998) but, to our knowledge, their genetic properties remain unexplored.
Acknowledgement
We thank Dr. Steve Krone and two anonymous referees for useful comments on the manuscript.
LITERATURE CITED
Cockerham
C C
, Weir
B S
,
1987
Correlations, descent measures: drift with migration and mutation
.
Proc. Natl. Acad. Sci. USA
84
:
8512
–
8514
.
Gaggiotti
O E
,
1996
Population genetic models of source-sink metapopulations
.
Theor. Popul. Biol.
50
:
178
–
208
.
Kimura
M
, Crow
J F
,
1964
The number of alleles that can be maintained in a finite population
.
Genetics
49
:
725
–
738
.
Malécot
G
,
1948
Les Mathématiques de l' Hérédité
.
Masson et Cie, Paris
.
Maruyama
K
, Tachida
H
,
1992
Genetic variability and geographical structure in partially selfing populations
.
Jpn. J. Genet.
67
:
39
–
51
.
McCauley
D E
,
1993
Genetic consequences of extinction and recolonization in fragmented habitats
, pp.
217
–
233
in
Biotic Interactions and Global Change
, edited by
Kareiva
P M
, Kingsolver
J G
, Huey
R B
.
Sinauer Associates
,
Sunderland, MA
.
Moran
P A P
,
1958
Random processes in genetics
.
Proc. Camb. Philos. Soc.
54
:
60
–
71
.
Pollett
P K
,
1998
Modelling quasi-stationary behaviour in metapopulations
. .
Rannala
B
,
1996
The sampling theory of neutral alleles in an island population of fluctuating size
.
Theor. Popul. Biol.
50
:
91
–
104
.
Rannala
B
, Hartigan
J A
,
1995
Identity by descent in island-mainland populations
.
Genetics
139
:
429
–
437
.
Rousset
F
,
1996
Equilibrium values of measures of population subdivision for stepwise mutation processes
.
Genetics
142
:
1357
–
1362
.
Slatkin
M
,
1977
Gene flow and genetic drift in a species subject to frequent local extinctions
.
Theor. Popul. Biol.
12
:
253
–
262
.
Tavaré
S
,
1989
The genealogy of the birth, death, and immigration process
, pp.
41
–
56
in
Mathematical Evolutionary Theory
, edited by
Feldman
M W
.
Princeton University Press
,
Princeton, NJ
.
Whitlock
M C
, McCauley
D E
,
1990
Some population genetic consequences of colony formation and extinction: genetic correlation within founding groups
.
Evolution
44
:
1717
–
1724
.
Wright
S
,
1969
Evolution and the Genetics of Populations II, The Theory of Gene Frequencies
.
University of Chicago Press
,
Chicago
.
© Genetics 1999