Skip to main content
  • Facebook
  • Twitter
  • YouTube
  • LinkedIn
  • Google Plus
  • Other GSA Resources
    • Genetics Society of America
    • G3: Genes | Genomes | Genetics
    • Genes to Genomes: The GSA Blog
    • GSA Conferences
    • GeneticsCareers.org
  • Log in
Genetics

Main menu

  • HOME
  • ISSUES
    • Current Issue
    • Early Online
    • Archive
  • ABOUT
    • About the journal
    • Why publish with us?
    • Editorial board
    • Early Career Reviewers
    • Contact us
  • SERIES
    • All Series
    • Genomic Prediction
    • Multiparental Populations
    • FlyBook
    • WormBook
    • YeastBook
  • ARTICLE TYPES
    • About Article Types
    • Commentaries
    • Editorials
    • GSA Honors and Awards
    • Methods, Technology & Resources
    • Perspectives
    • Primers
    • Reviews
    • Toolbox Reviews
  • PUBLISH & REVIEW
    • Scope & publication policies
    • Submission & review process
    • Article types
    • Prepare your manuscript
    • Submit your manuscript
    • After acceptance
    • Guidelines for reviewers
  • SUBSCRIBE
    • Why subscribe?
    • For institutions
    • For individuals
    • Email alerts
    • RSS feeds
  • Other GSA Resources
    • Genetics Society of America
    • G3: Genes | Genomes | Genetics
    • Genes to Genomes: The GSA Blog
    • GSA Conferences
    • GeneticsCareers.org

User menu

  • Log out

Search

  • Advanced search
Genetics

Advanced Search

  • HOME
  • ISSUES
    • Current Issue
    • Early Online
    • Archive
  • ABOUT
    • About the journal
    • Why publish with us?
    • Editorial board
    • Early Career Reviewers
    • Contact us
  • SERIES
    • All Series
    • Genomic Prediction
    • Multiparental Populations
    • FlyBook
    • WormBook
    • YeastBook
  • ARTICLE TYPES
    • About Article Types
    • Commentaries
    • Editorials
    • GSA Honors and Awards
    • Methods, Technology & Resources
    • Perspectives
    • Primers
    • Reviews
    • Toolbox Reviews
  • PUBLISH & REVIEW
    • Scope & publication policies
    • Submission & review process
    • Article types
    • Prepare your manuscript
    • Submit your manuscript
    • After acceptance
    • Guidelines for reviewers
  • SUBSCRIBE
    • Why subscribe?
    • For institutions
    • For individuals
    • Email alerts
    • RSS feeds
Previous ArticleNext Article

Effective Population Size and Population Subdivision in Demographically Structured Populations

Valérie Laporte and Brian Charlesworth
Genetics September 1, 2002 vol. 162 no. 1 501-519
Valérie Laporte
Institute of Cell, Animal and Population Biology, University of Edinburgh, Edinburgh EH9 3JT, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: valerie.laporte@ed.ac.uk
Brian Charlesworth
Institute of Cell, Animal and Population Biology, University of Edinburgh, Edinburgh EH9 3JT, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Article
  • Figures & Data
  • Info & Metrics
Loading

Abstract

A fast-timescale approximation is applied to the coalescent process in a single population, which is demographically structured by sex and/or age. This provides a general expression for the probability that a pair of alleles sampled from the population coalesce in the previous time interval. The effective population size is defined as the reciprocal of twice the product of generation time and the coalescence probability. Biologically explicit formulas for effective population size with discrete generations and separate sexes are derived for a variety of different modes of inheritance. The method is also applied to a nuclear gene in a population of partially self-fertilizing hermaphrodites. The effects of population subdivision on a demographically structured population are analyzed, using a matrix of net rates of movement of genes between different local populations. This involves weighting the migration probabilities of individuals of a given age/sex class by the contribution of this class to the leading left eigenvector of the matrix describing the movements of genes between age/sex classes. The effects of sex-specific migration and nonrandom distributions of offspring number on levels of genetic variability and among-population differentiation are described for different modes of inheritance in an island model. Data on DNA sequence variability in human and plant populations are discussed in the light of the results.

IN an ideal (Wright-Fisher) population of N breeding adult diploid individuals, the rate of genetic drift is equal to 1/(2N) (Wright 1931). Natural populations, however, depart from many of the assumptions of the ideal population, notably panmixia and a binomial distribution of the number of successful offspring per parent. The effective population size, Ne, can be used to describe the rate of genetic drift in such populations and replaces the ideal population size N in the resulting formulas (Wright 1931; Wang and Caballero 1999). For example, under the infinite-sites model of neutral variability, widely used in molecular population genetics, we can write θ= 4 Neμ for the scaled mutation rate, where μ is the neutral mutation rate per nucleotide site (Kimura 1971).

Population subdivision is a major factor that can influence Ne and has attracted much attention. Wright (1943) derived an expression for the effective size of the whole set of populations in an island model, where each local population has an equal probability of contributing to the pool of migrants. He showed that, in this case, subdivision increases Ne for the whole population over that under panmixia. Since this pioneering work, additional factors (e.g., non-Poisson offspring number distributions, different modes of inheritance, sex-specific migration rates) have been included in the island model, and other migration models have been studied, such as stepping-stone models and metapopulations with local extinction and recolonization events (Whitlock and Barton 1997; Wang and Caballero 1999).

The total effective size of a subdivided population can be expressed as a function of the mean within-deme effective population size and of FST (Wright 1951), which describes the amount of between-deme differentiation relative to the variability in the set of populations as whole. In contrast to the results for the island model, population structure may reduce the total effective population size below that for panmixia when different demes make unequal contributions to the migrant pool (Whitlock and Barton 1997). A difficulty is that effective population size is defined somewhat loosely as the size of an ideal population with the same properties as the nonideal study population with respect to the effects of genetic drift (Wright 1931). Different effective population sizes can, however, be defined according to the variable of interest (Whitlock and Barton 1997) and do not necessarily give exactly equivalent results.

In particular, the inbreeding effective size of a subdivided population determines the asymptotic rate of the increase of the probability of identity of a pair of alleles sampled randomly from the population at large. The variance effective size determines the variance of the change in allele frequency across a generation for genes drawn from the population as whole. With constant population sizes across generations, these two parameters converge to the same value (Whitlock and Barton 1997). Similarly, the mutation effective size is defined as the population size that gives the average pairwise genetic diversity between alleles sampled randomly from the population as a whole, when substituted into the above equation for θ (Whitlock and Barton 1997). The migration effective size for a subdivided population determines a weighted mean pairwise nucleotide site diversity for alleles sampled from the same population (Nagylaki 1998a).

These measures of effective population size were originally derived from recursion equations for identity probabilities of pairs of alleles or variances in allele frequencies (reviewed by Wang and Caballero 1999). More recently, use has been made of the coalescent process, which is very useful for studying alleles sampled in a prescribed way from the set of local populations (Hudson 1990; Slatkin 1991; Wilkinson-Herbots 1998; Wakeley 1999, 2001). This approach has been applied mostly to models of autosomal loci, in which sex-specific migration, and departures from the assumptions of the ideal population for individual demes, have been ignored, although Rousset (1999) introduced a coalescence approach to the study of between-deme differentiation at autosomal loci with sex-specific migration rates.

There is, however, increasing interest in comparing the patterns of genetic variation and population differentiation for genes with different modes of inheritance (Ennos 1994; Hurles and Jobling 2001). This is particularly important in the context of studies of sex chromosome evolution, in which it is of interest to compare variability at homologues on the X and Y chromosomes (Bachtrog and Charlesworth 2000, 2002; Filatov et al. 2000, 2001). The expectation is that selective forces acting on a nonrecombining genome can greatly reduce the level of variability on an evolving Y or neo-Y chromosome (Charlesworth and Charlesworth 2000), but this effect may be confounded with the effects of demographic differences between males and females (Charlesworth 2001) or of sex-specific migration in a subdivided population (Filatovet al. 2001). These effects have primarily been investigated using the classical recursion equation approach, either for the effects of demography (Charlesworth 2001) or for modeling sex-specific migration under various modes of inheritance (Chesseret al. 1993; Chesser and Baker 1996; Wang 1997a,b, 1999).

Here we first present a general framework for studying the pairwise coalescent times and identity probabilities of genes with different modes of inheritance in a metapopulation consisting of demes connected by an arbitrary migration scheme. Individuals within each deme are divided into different age and sex classes, with different demographic properties and migration probabilities. By using the “fast-timescale” approximation of Nordborg (1997) for the flow of genes among different classes within demes, it is possible to reduce the complexities of this general model to a manageable size. This yields simple expressions for the effective sizes of single populations and the expected levels of genetic diversity between and within demes. We provide some explicit results for the case of an island model with equal deme sizes and review data on human and plant populations in their light.

A GENERAL MODEL OF MIGRATION AND DRIFT IN STRUCTURED POPULATIONS

We assume that individuals within a local population (deme) can be classified into different classes, e.g., with respect to age and/or sex. Let fijrs be the probability of identity of a pair of genes sampled from an individual of class r in deme i and an individual of class s in deme j. We can write mijru for the probability that a gene sampled from an individual of class r in deme i originated from an individual of class u in deme j.

When the probability per time interval of mutation of an allele to a new state, μ, is independent of allelic state, class, and deme, fijrs is equivalent to the momentgenerating function for the time to coalescence of a pair of genes of the specified type, with parameter -2μ (Hudson 1990; Wilkinson-Herbots 1998). (To allow for the possibility of age structure, the time interval does not necessarily correspond to one generation.) In particular, their mean time to coalescence, Tijrs, is equal to the derivative of fijrs with respect to -2μ at μ= 0. As usual, we assume that μ is sufficiently small that second-order terms can be neglected. We later consider the case of sex-specific mutation rates, using mean coalescence times derived in this way; this is legitimate, since coalescence times are independent of the mutation model.

Assuming that different individuals migrate independently of each other, and using the standard approach to modeling a geographically structured population (Nagylaki 1982, 1998a), we can write the equation for equilibrium, fijrs=(1−2μ){∑k≠l∑uvmikrumjlsvfkluv+∑k∑uvmikrumjksv[Prsuvijk+(1−Prsuvijk)fkkuv]}, (1) where Prsuvijk is the probability of coalescence over one time interval of two genes sampled from individuals of class r in deme i and s in deme j, derived from individuals from deme k who belong to classes u and v, respectively (note that Prsuvijk=0 unless u = v).

For i = j and r = s, and if migration involves individuals rather than gametes, we should strictly include a term on the right-hand side that covers the case when the two genes sampled come from the same migrant individual. The approximations leading to Equation 3 mean that this is unnecessary, since ignoring it introduces only a second-order term into the final result. (This is because the chance that a pair of genes from the same deme are sampled from the same individual is of the order of the reciprocal of the deme size, and we ignore products of this and the migration probabilities.)

Writing hijrs = 1 - fijrs for the probability of nonidentity of a pair of alleles, Equation 1 can conveniently be rewritten as hijrs=2μ+(1−2μ){∑uv[∑klmikrumjlsvhkluv−∑kmikrumjksvPrsuvijkhkkuv]}. (2)

To make further progress, we assume that migration rates between demes are of the same order as the probabilities that two genes sampled from the same deme coalesce over one time unit and that both are sufficiently small that second-order terms in these, as well as mutation rates, can be neglected. This assumption is commonly made in applications of coalescent theory (Hudson 1990; Wilkinson-Herbots 1998); if migration rates are much greater than these coalescent probabilities, little differentiation between demes is possible, so that this is not a particularly restrictive assumption.

To this order of approximation, the last term in braces on the right-hand side of (2) is nonzero only when i = j = k, corresponding to the case of two genes drawn from the same deme, hijrs≈2μ+(1−2μ)∑uv∑klmikrumjlsvhkluv−δij∑uPrsuihijuu, (3) where δij is the Kronecker delta (δij = 1 when i = j and is otherwise 0); Pirsu is the probability that two genes sampled from deme i from individuals belonging to classes r and s coalesce in an individual of class u in the previous time interval, treating the deme as a closed population.

To obtain manageable results, we simplify further by removing the dependence of hijrs on r and s. We can average hijrs values over classes within demes by using a weight of αir for the contribution from class r in the ith deme. αir is chosen to be equal to the rth element of the left leading eigenvector of the matrix that describes the flow of genes among classes within deme i, scaled such that the elements of this eigenvector sum to one. This element represents the stationary-state probability that a randomly sampled gene from deme i originates from class r (Nordborg 1997). If the convergence to this stationary state is much faster than the rate of coalescence and migration, αir gives an accurate approximation to the probability of origin of the sampled gene (Nordborg 1997); this is the case, for example, for a discrete generation population, when the classes refer to different sexes. This also implies that the hijrs values can be treated as approximately independent of r and s; i.e., they can be equated to a common term, hij. This is equivalent to the fast-migration limit of Nagylaki (1980) for a geographically structured population.

We can then conveniently define the net probability of origin from deme k of a gene sampled from deme i, mik, as mik=∑ruαirαkumijru. (4)

Substituting into Equation 3 and summing αirαjshijrs over r and s, we obtain hij=∑rsαirαishijrs≈2μ+(1−2μ)∑klmikmjlhkl−δijPihii, (5) where Pi is the probability of coalescence of a pair of genes sampled randomly from deme i, such that Pi=∑rsuαirαisPrsui. (6)

Using the argument of Charlesworth (2001), we can write Pi=12tiNie, (7) where ti is the generation time of individuals from deme i and Nie is the deme’s effective population size. In the case of discrete generations, ti = 1 for all demes.

By differentiating hij with respect to -2μ, we obtain the following expression for the mean coalescent time for a pair of alleles sampled from demes i and j (Hudson 1990): Tij≈1+∑klmikmjlTkl−δijPiTii. (8)

This provides a general set of linear relations that can be used to solve for the Tij, to the assumed order of approximation, for any specific model. Some examples are described below. Higher moments of the distribution of pairwise coalescence times can similarly be obtained by successive differentiation of Equation 5 (Wilkinson-Herbots 1998).

It is also useful to apply the method of Nagylaki (1998a) for determining a weighted mean coalescent time for a pair of genes sampled from the same deme. The migration rates defined by the right-hand side of Equation 4 are identical in properties to the components of the standard genic migration matrix. Under the usual assumptions about this matrix (Nagylaki 1980, 1982, 1998a), it has a leading left eigenvector, ν, whose elements can be assumed to sum to one without loss of generality. Using the argument of Nagylaki (1998a), this suggests the definition of a weighted mean coalescence time, T0, for a pair of alleles sampled from the same deme as T0=∑iνi2PiTii∑iνi2Pi=1∑iνi2Pi, (9) where the coalescent time for each deme is weighted by the product of the reciprocal of the coalescence probability and the square of the corresponding component of ν. In the discrete generation case, T0 corresponds to twice the “migration effective size” as defined by Nagylaki (1980, 1982, 1998a) and reduces to the standard expression for this quantity in the case of Wright-Fisher sampling within demes of size Ni, for which Pi = 1/(2Ni).

COALESCENT PROBABILITIES AND EFFECTIVE POPULATION SIZES

Separate sexes with discrete generations: The utility of the approach described above to determining the effective population size of a deme can be illustrated by its application to the discrete-generation case with separate sexes, previously treated by Caballero (1995). It enables us to provide biologically explicit expressions for Ne under different modes of inheritance, on the lines developed by Charlesworth (2001) for the age-structured case.

From the assumption that the products of coalescence probabilities and migration probabilities are negligible, used to derive (3) and its successors, it is legitimate to treat each deme as a closed population. We have to consider the origins of genes sampled from individuals of all permissible combinations of sexes and parents (which combination is permissible depends on the mode of genetic transmission). For a pair of maternally derived genes sampled from two different females, the probability that they both come from the same mother is approximately Qfffi=∑kdik(dik−1)(Nifd‒i)2 (10) (Caballero 1995), where Nif is the number of breeding females in deme i, dik is the number of daughters produced by the kth female in deme i, and d¯i is the mean number of daughters per female in deme i.

This can be simplified by noting that the Poisson value for the variance in numbers of daughters per female, Vffi , is d¯i, so that in general the deviation of this variance from the Poisson value can be written as ΔVffi=Vffi−d‒i (11) so that Qfffi=(ΔVffi∕d‒i2)+1Nif. (12) Similar expressions can be obtained for other pairs of individuals and their parents (see Table 1).

Let βirsu be the probability that a pair of genes sampled from individuals of sexes r and s in deme i both come from parents of sex u. Let γirsu be the probability that this pair of genes shared a common ancestral allele in the previous generation, given that they have a parent in common; i.e., they coalesce. From Equation 7, the net probability that a randomly chosen pair of genes coalesce in the previous generation is 12Nie=Pi=∑rsuαirαisβirsuγirsuQrsui, (13) where Qrsui is the probability that a pair of genes sampled from individuals of sexes r and s in deme i with a parent of sex u derives from the same parental individual.

View this table:
  • View inline
  • View popup
TABLE 1

Origins of genes sampled from a given deme

Table 2 gives the values of these transmission probabilities for different modes of inheritance; these can be combined with the Q values from Table 1 and substituted into Equation 15, to obtain the final expressions for effective population sizes. Simplified expressions can be obtained when there is a fixed sex ratio among breeding individuals, so that there is a binomial distribution of the proportions of sons and daughters of a given individual, counted at maturity (see the appendix).

For convenience, we henceforth drop the superscripts and subscripts indicating deme identities. Using Tables 1 and 2, substituting into Equation 15, and noting that the Poisson expectation for variance in total offspring number is 1/(1 - c) for females and 1/c for males, where c is the proportion of males among breeding adults, we obtain the expression for effective population size with autosomal inheritance, 1NeA≈(1+F)4{1Nf+1Nm+(1−c)2ΔVfNf+c2ΔVmNm}, (14) where ΔVf and ΔVm are the excesses over Poisson expectation of the total numbers of offspring per capita for female and male parents, respectively; F is the inbreeding coefficient associated with departure from random mating within demes.

For X-linked inheritance with male heterogamety, we have 1NeX≈19{4(1+F)Nf+2Nm+4(1+F)(1−c)2ΔVfNf+2c2ΔVmNm}. (15)

For Y-linked inheritance 1NeY≈2(1+c2ΔVm)Nm. (16) With female heterogamety, males and females are interchanged in these expressions.

For maternal inheritance 1NeC≈2(1+[1−c]2ΔVf)Nf. (17)

Partially self-fertilizing hermaphrodites: It is also of interest to consider the case of a nuclear gene in a population of partially self-fertilizing hermaphrodites; this is particularly relevant for plants. (Maternally transmitted genes are clearly unaffected by the selfing rate, except as far as changes in the breeding system alter the distribution of numbers of successful offspring produced through seed.) With a Poisson distribution of offspring number for both seed and pollen, the effective size is reduced below that for random mating by a factor of 1/(1 + F), where F is the equilibrium inbreeding coefficient produced by the level of self-fertilization (Pollak 1987; Nordborg and Donnelly 1997). We examine the effect on this result of deviations from a Poisson offspring number distribution.

Let there be N breeding individuals in the deme under consideration; the kth individual is assumed to produce dkS progeny through self-fertilized seed that survive to form part of the breeding population next generation, and dkO such progeny through outcrossed seed. In addition, it contributes pk progeny through pollen that fertilizes other individuals. The overall probability that a progeny individual is the product of self-fertilization is then S=∑idiS∑i(diS+diO)=d‒S(d‒S+d‒O), (18) where d¯S and d¯O are the mean numbers of successful progeny per capita produced by selfed and outcrossed seed, respectively.

Details of the derivation of probabilities of gene origins are given in the appendix. In the case of constant population size, we have d¯ = 1, d¯S = S, and d¯O = p¯ = (1 - S). The expression for the effective population size for a single deme, NeH, is 1NeH≈(1+F)N{1+(ΔVf+ΔVm+3ΔVfS)+2[CfOfS+Cfm+CfSm]4} (19a) (definitions of the terms in the numerator are given in the appendix).

If the number of successful offspring through selfed seed, conditioned on the total number of successful offspring produced through seed, can be regarded as a binomial variate with parameter S (i.e., there is the same expected selfing rate for each individual), this simplifies further to 1NeH≈(1+F)N{1+([1+S]2ΔVf+ΔVm+2[Cfm+CfSm])4}. (19b) In the limit of complete self-fertilization, the contributions from outcrossing pollen vanish, and F = 1, so that 1NeH≈2(1+ΔVf)N. (20)

A model is presented in the appendix for analyzing the case of intermediate selfing rates. The overall conclusion is that non-Poisson variation in reproductive capacity may not greatly alter the standard result that Ne for a selfing population is reduced by a factor of 1/(1 + F), relative to the value for an outcrossing population (Pollak 1987), although a very high sensitivity of female fitness to allocation to reproduction could lead to a greater reduction in Ne than this.

THE ISLAND MODEL WITH SEX-SPECIFIC MIGRATION PARAMETERS

In this section, we apply the above principles to the simple case of an island model (Wright 1943) with sex-specific migration rates, for dioecious populations under a variety of modes of inheritance. We assume that the total population is divided into n demes, each with the same effective population size, Ne, given by the equations derived above. There are three dispersal parameters, describing the probability that a male gamete (mm) in a zygote in a given deme has migrated from a different deme, the corresponding probability for a female gamete (mf), and the probability that a zygote is itself derived from a different deme (mz; Chesser and Baker 1996; Wang 1999). Under the assumptions used above, these migration events can be treated as mutually exclusive, since their probabilities are each low. Migration probabilities between specific demes, as considered above, are given by dividing the dispersal parameters by (n - 1).

In plant species, male gametes can disperse through pollen but female gametes cannot disperse, so that only mm and mz can be nonzero. For animals, migration of male and female gametes occurs via dispersal of males and unmated females; dispersal of zygotes occurs when females migrate after mating but before laying eggs or giving birth. All three migration parameters can therefore be nonzero.

Using the argument leading to Equation 4, we can then define the net migration rates for different modes of inheritance (autosomal, X-linked, Y-linked, and cytoplasmic) as mA, mX, mY, and mC, respectively. We have mA=mz+12(mm+mf) (21a) mX=mz+13(mm+2mf) (21b) mY=mz+mm (21c) mC=mz+12mf. (21d)

View this table:
  • View inline
  • View popup
TABLE 2

Transmission parameters for different modes of inheritance

Coalescence times and expected nucleotide site diversities: Results for mean pairwise coalescence times in the island model can be obtained by substituting these expressions into standard formulas (Slatkin 1991; Wilkinson-Herbots 1998), which correspond to approximate solutions to Equation 8. To the assumed order of approximation, with mode of inheritance G (where G is A, X, Y, or C) the expected coalescence time, T0G, for two genes sampled from the same deme is given by T0G=2nNeG, (22a) where NeG is the appropriate effective population size for a single deme.

Using Equation 8, or the argument of Slatkin (1991), the approximate expected coalescence time for a pair of genes sampled from different demes is T1G=T0G+(n−1)2mG, (22b) where mG is the migration rate defined by the relevant choice of Equations 21a, 21b, 21c, 21d for mode of inheritance G.

The expected coalescence time for a pair of genes sampled randomly from the set of populations is TG=1nT0G+(n−1)nT1G. (22c)

The expected pairwise nucleotide site diversity under the infinite-sites model is given by the product of coalescence time and the mutation rate μG for the given mode of inheritance (Hudson 1990). From Equations 22a, 22b, 22c, the expectations of nucleotide diversities for alleles sampled within demes, πSG, and from the population as a whole, πTG, are given, respectively, by πSG=4nNeGμG (23a) πTG=πSG(1+(n−1)2n214mGNeG). (23b)

Mutation rates may differ among genetic systems because of differences in mutation rates between males and females (Hurst and Ellegren 1998; McVean 2000) or because of chromosome-specific mutation rates (McVean and Hurst 1997). In the former case, the overall mutation rate is given by weighting the sex-specific or chromosome-specific mutation rates by the appropriate α values (Charlesworth 2001).

The absolute magnitude of between-population subdivision can be measured by the difference between the nucleotide diversity for the population as a whole and the mean for a pair of alleles sampled from the same deme (Charlesworth 1998). The expectation of this is given by the second term in brackets on the right of (23b). It is often more convenient to use a relative measure, such as FST. Using the definition of Hudson et al. (1992), the theoretical value of FST for a given mode of inheritance is given by FST,G=(πTG−πST)πTG. (24a) With a large number of demes, this yields the familiar result of Wright (1951): FST,G=14NeGmG+1. (24b)

We now consider the application of these formulas to some specific examples. We focus initially on the effects of population subdivision, assuming a 1:1 sex ratio, Poisson variances in fertility for both sexes, and equal mutation rates for males, females, and different chromosomes. Discrete generations and a deme size of N breeding adults for all demes are also assumed. In this case, the ratios of effective population sizes and within-deme diversities for different modes of inheritance G and G′ (rGG′) are simply equal to the ratios of the respective numbers of gene copies in the population: rXA = ¾, rXY = rXC = 3, rAY = rAC = 4, the same values as for panmixia, which are often quoted in the literature on molecular population genetics.

The effects of sex-specific migration and population subdivision in dioecious plants: In Figure 1, we plot the values of FST for nuclear and cytoplasmic genes and the ratios of the total diversities for different modes of inheritance, as functions of the total number of migrants per generation (give by the sum of pollen and seed migration rates times the deme size). Male heterogamety is assumed.

As expected, reduced migration increases FST for all modes of inheritance. However, the rate of increase of FST as the amount of migration decreases is different for different modes of inheritance and also depends on the mode of migration. For instance, with equal pollen and seed migration rates, FST,Y and FST,C increase faster than FST,A and FST,X (Figure 1A). These differences result from differences in the numbers of effective migrants, due to two factors: (i) different effective population sizes and (ii) different gene migration rates. The effective number of X chromosomal migrants is lower than the effective number of autosomal migrants, due to a lower effective population size of the X chromosomes (3N/4) as well as its lower migration rate (mz + 1/3mm vs. mz + ½mm). The effective number of Y chromosomal migrants is lower than both the effective number of chromosomal X or autosomal migrants, due to the lower effective population size of the Y chromosome (N/4), which is not compensated for by its higher migration rate (mz + mm) relative to the X chromosome. A similar argument applies to cytoplasmic genes.

Population subdivision also modifies the ratios of the total diversities for different modes of inheritance: the ratios πTX/πTY(RXY) and πTA/πTY(RAY) decrease sharply with reduced gene flow and πTX/πTA(RXA) increases slightly (Figure 1D). From Equations 23a, 23b, it is easily seen that, in the limit when the migration rates tend toward zero, the R values are equal to the reciprocals of the ratios of the respective migration rates given by Equations 21a, 21b, 21c, 21d, since the right-hand side is dominated by the term involving the reciprocal of the product of effective size and migration rate. In this case, the limiting total diversities for autosomal and X-linked genes are only 1.33- and 1.5-fold higher than for Y-linked loci, respectively, with the diversity for X-linked loci being 1.12 that for autosomal loci. Similarly, the limiting ratio of autosomal to cytoplasmic total diversities becomes 0.67.

Differences between the pollen and seed migration rates also influence the rate of increase in FST. For instance, if migration occurs primarily through pollen (e.g., with mm = 100mz, Figure 1B), FST,A and FST,X increase faster with decreasing migration than when migration occurs equally through pollen and seeds (Figure 1A), or primarily through seeds (mm = 0.01mz, Figure 1C), whereas FST,C increases more slowly, but is higher throughout the range of Nm displayed. This is because X-linked and autosomal genes are haploid in pollen grains but diploid in seeds, and cytoplasmic genes move only through seed. In contrast, the Y chromosome, which is haploid in both dispersal units, is not affected by the relative pollen and seed migration rates. In consequence, the curves relating the diversity ratios to total migration rate are also modified. In particular, when migration occurs primarily through pollen, the increased migration rate of the Y chromosome compared to the X chromosome (mm vs. mz/3) compensates nearly totally for its reduced effective population size, so that RXY is independent of population subdivision (Figure 1E). In contrast, RXA is now slightly more affected by population subdivision, because the difference between the X and autosomal migration rates increases with the pollen migration rate. Hence, in the limit when the migration rates tend toward zero, RXY barely declines below 3, but RAY declines to 2, and RXA increases to 1.5.

The effects of sex-specific migration and population subdivision in animals: We consider a model of animal migration with up to a sixfold difference between male and female migration rates and no zygotic migration (see above), assuming male heterogamety (for female heterogamety, male and female parameters are interchanged). As compared with the plant case, we can see that the case of predominantly male migration (Figure 2, B and E) is similar to the case of predominantly pollen migration in the plant model. The case of equal male and female migration rates is similar to the case of equal pollen and seed migration rates, except that limited migration has a much stronger effect on FSTY relative to FSTA and FSTX (Figure 2A). The effect is even stronger with predominantly female migration where FSTY increases much faster with reduced migration than FSTA or FSTX (Figure 2C). Consequently RAY and RXY show a greater reduction when migration is restricted (Figure 2, D and F) than in the plant model (Figure 1, D and F). These differences are easily understood since Y-linked genes now experience migration through only 1 unit (male individuals), while they had two in the plant model (pollen and seeds).

Figure 1.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 1.

—The FST parameters (A-C) and the ratios (R) of total diversities between genes with different modes of inheritance (D-F) in a subdivided plant population (mf = 0), as functions of the number of migrants per deme, N(mm + mz). (A and D) Equal pollen and seed migration. (B and E) Predominantly pollen migration (mm = 100mz). (C and F) Predominantly seed migration (mz = 100mm). We assume NeA = N, NeX = ¾.N, NeY = Nec = ¼.N.

Figure 2.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 2.

—The FST parameters (A-C) and the R values (D-F) in a subdivided animal population (mz = 0), as functions of the number of migrants per deme, N(mm + mf). (A and D) Equal male and female migration. (B and E) Predominantly male migration (mm = 6mf). (C and F) Predominantly female migration (mf = 6mm). We assume NeA = N, NeX = ¾.N, NeY = Nec = ¼.N.

The effects of sex- or chromosome-specific mutation rates: Under the infinite-sites model, the mutation rate is so low compared with any realistic migration rate that differences in mutation rates among different modes of inheritance will not affect the relative FST values. Absolute values of diversities will, however, be affected, since the ratios for different modes of inheritance will be multiplied by the ratios of the respective mutation rates. From Equations 23a, 23b, this does not affect the shapes of the R values as functions of migration rates, but does affect their heights. For example, if the male mutation rate is higher than the female mutation rate (Hurst and Ellegren 1998; McVean 2000), this will have the effect of reducing RXA, RXY, and RAY. This would enhance the effect of high relative female migration rates in the animal model on these ratios, which has just been noted.

The effects of nonrandom variation in fertility: It is well known that an increase in the variance of fertility reduces Ne. Sex differences in the distributions of fertility modify the relative Ne values of genes with different modes of inheritance (Caballero 1995; Charlesworth 2001). For example, an increased variance in male fertility reduces the effective deme size of the Y chromosome more drastically than that for the X or the autosomes. With an extremely high variance in male fertility relative to female fertility, the ratios NeX/NeY and NeA/NeY become 9 and 8, respectively, so that NeX/NeA is 9/8. In contrast, a high variance in female fertility reduces Ne for the X and the autosomes but does not affect the Y chromosome: the ratios NeX/NeY and NeA/NeY tend to zero and NeX/NeA to 9/16.

As far as total diversity measures are concerned, population subdivision always counteracts the effect of an increased variance in male fertility by reducing the ratios RXY and RAY, as illustrated in the animal model with large ΔVm (Figure 3). Since NeY is reduced by an increased male fertility variance, the effective number of Y migrants is much lower than the number of X or autosomal migrants. As the migration rate decreases, FST,Y now increases much faster than FST,X or FST,A, and the ratios RXY and RAY decrease toward one with equal male and female migration (Figure 3, B and E). With predominantly female migration, this effect is manifest even with relatively high migration (Figure 3F). For instance, RXY = 6.2 and RAY = 5.7 with very high migration, but are already halved with as many as 10 effective migrants. With an increased variance in female fertility, the effect of population subdivision depends on both the relative effective population sizes and the relative migration rates for different modes of inheritance. Population subdivision increases RXY (RAY) if NeX·mX < NeY·mY (NeA·mA < NeY·mY). However, this effect occurs only with very restricted migration, as illustrated in the plant model with ΔVf = 5 (Figure 4). Moreover, the maximum value of RXY (RAY) is always lower than the expectation for a panmictic population with Poisson distributions of offspring numbers.

Selection on a nonrecombining genome, such as organelle genomes and the Y chromosome, can further reduce Ne (Charlesworth and Charlesworth 2000). Such a reduction of Nec and Ney will increase the corresponding FST values. We have investigated the case with NeX/NeY = 30 for the plant model (Figure 5), since this is the ratio suggested by observed within-deme diversity ratios in Silene latifolia (see below). With such a low NeY, the decrease in RXY and RAY due to population subdivision now occurs even with relatively high migration rates. Moreover, with such a low NeY, differences between pollen and seed dispersal rates have little effect on the relative total diversities of nuclear genes with different modes of inheritance.

COMPARISONS WITH DATA ON NATURAL POPULATIONS

In this section, we compare some of the results derived above with data from surveys of DNA sequence variation in populations of humans and plants.

Human populations: There is a large literature on genetic diversity in humans, and many different types of markers have been employed (including protein polymorphisms, restriction fragment length polymorphisms, microsatellites, Alu insertions, and single-nucleotide polymorphisms). These data have, however, several biases, which limit their utility for our purposes. First, worldwide population structure has rarely been investigated using markers with different modes of inheritance in the same samples (but see Poloniet al. 1997; Jordeet al. 2000). In addition, the mode of sampling from different populations, or of grouping the populations, may influence FST estimates, as described by Stoneking (1998) and Hammer et al. (2001; see Table 3). Moreover, different estimators of FST, which depend in different ways on the sample sizes and numbers of populations sampled (Cockerham and Weir 1993; Charlesworth 1998), have been used in different studies, making it hard to compare different results. This is illustrated by the different estimates of FST,Y obtained from the same dataset by Seielstad et al. (1998) and by Hammer et al. (2001). Finally, because of the low level of sequence diversity in humans, investigators have favored the use of highly polymorphic markers, especially for the Y chromosome. This may bias FST estimates, as noted by Nagylaki (1998b) and Bertranpetit (2000), since diversity values for loci with high mutation rates are not proportional to coalescent times.

Figure 3.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 3.

—Same as Figure 2 but assuming an increased variance of male fertility compared to the Poisson expectation (ΔVm = 10).

Figure 4.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 4.

—Same as Figure 1 but assuming an increased variance of female fertility compared to the Poisson expectation (ΔVf = 5).

Figure 5.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 5.

—Same as Figure 1 but assuming NeY = NeX/30. The figures also give the FST and RXY estimates from Silene latifolia populations with the number of migrants expected under an island model (Triangle, FˆST,Y; square, FˆST,A; circle, FˆST,X; and cross, RˆXY).

View this table:
  • View inline
  • View popup
TABLE 3

Review of FST estimates at a worldwide spatial scale in human populations

In Table 3, we present a compilation of FST estimates from worldwide samples of different types of markers, with cytoplasmic, Y, X, and autosomal inheritance. These all provide evidence for population structure, with autosomal markers yielding the lowest mean FST estimates, as expected from the results shown in Figure 2. The estimates obtained from Y chromosome markers are, however, highly variable between studies (0.09-0.65). Microsatellites often show smaller FST values (0.09-0.23), as outlined in studies that have compared both kinds of markers (Hammeret al. 1997; Jordeet al. 2000), probably because of their high mutation rates. Excluding microsatellites, the average FST estimated from the Y chromosome markers (0.40) is higher than from the mitochondrial markers (0.27). While this has often been interpreted as evidence for higher female than male migration rates, it might also reflect differences in effective population sizes (Hurles and Jobling 2001). A recent comparison of local population differentiation in Thai communities with patrilocal or matrilocal dispersal patterns is, however, in agreement with the expectation that restricted male migration increases population differentiation for Y markers relative to mitochondrial markers (Ootaet al. 2001).

DNA sequence data have recently been obtained for 4 Y-linked and 15 autosomal loci, using similar worldwide samples (Shenet al. 2000; Thomsonet al. 2000; Thorstensonet al. 2001). The authors estimated RAY as five, from data on both coding and noncoding regions. Unfortunately, population structure was not investigated explicitly in these studies, although the data should allow partitioning of diversity between and within populations. Thomson et al. (2000) constructed a genealogy of their sample of Y chromosomes, and this can be used to obtain a value of 0.14 for FST,Y between continents. This is smaller than that obtained from other studies of biallelic markers and challenges the notion that there is restricted male vs. female migration in most human populations. Mitochondrial data for the same samples would help to resolve this question, and it would also be desirable to obtain estimates of FST,A and rAY.

At present, it is hard to reach firm conclusions about the influence of sex-specific migration vs. differences in effective population sizes on the relative levels of diversity and divergence between human populations for different inheritance modes; there does not, however, seem to be strong evidence for a greatly reduced effective size of the human Y chromosome, in contrast to what is observed in Silene (see below) or in Drosophila (Zurovcova and Eanes 1999; Bachtrog and Charlesworth 2000, 2002).

View this table:
  • View inline
  • View popup
TABLE 4

Silent site diversity in Silene latifolia

Plant populations: There are relatively few species of plants with sex chromosomes, and diversity data on nuclear genes with different modes of inheritance are available only for the close relatives S. latifolia and S. dioica (Table 4). In S. dioica, there is only a single polymorphic site on the Y, which is too little to provide any useful information. In S. latifolia, nine polymorphic sites were found on the Y. All nuclear genes display quite strong population structure, with the lowest FST for the autosomal gene and the highest for a Y-linked gene, as expected from the above analyses. The ratio RXY over all demes (RXY = 23) is smaller than the ratio rXY for within-deme diversity (rXY = 29), as expected from the effects of subdivision (Figure 5).

It is, however, impossible to reconcile the estimates of FST for all three modes of heredity under a simple island model of population subdivision: given the estimates of FST,A and FST,X, FST,Y is expected to be much higher (and RXY to be much lower) than is observed (Figure 5). Deviations from the model assumed here (e.g., the occurrence of a selective sweep on the Y) might account for these discrepancies, if they are not simply due to sampling error due to the small number of informative sites on the Y.

DISCUSSION

In the first part of this article, we have shown that the use of the fast-timescale approximation (Nordborg 1997) to the coalescent process in a single population, structured according to sex and/or age, provides a simple general expression for the probability that a pair of alleles sampled from the population coalesce in the previous time interval (Equation 6). This approximation assumes that the flow of genes among age and sex classes is much faster than the effects of drift or migration between populations, so that the state of an allele can be treated as independent of the class from which it was sampled: this is equivalent to the “strong-migration limit” of Nagylaki (1980) for a geographically structured population.

By use of a suitable definition of generation time, we can also define the effective size of a population as the reciprocal of twice the product of generation time and the probability of coalescence per time interval (Equation 7). This expression can be used to generate explicit formulas for effective population size with discrete generations and separate sexes, under a variety of different modes of inheritance, and with arbitrary distributions of offspring numbers (Equations 14, 15, 16, 17). The same approach can be used for the case of a nuclear gene in a population of partially self-fertilizing hermaphrodites (Equations 19a, 19b and 20). An important conclusion in this case is that the standard formula for Ne for selfing populations (Pollak 1987) is likely to be approximately valid even if there is a non-Poisson distribution of offspring number; i.e., Ne is reduced below the value for random mating by a factor of 1/(1 + F), where F is the inbreeding coefficient.

The approach can also be applied to the standard model of an age-structured population with discrete time intervals, recently revisited by Charlesworth (2001); his Equation A7 is equivalent to our Equations 6 and 7. Finally, we note that the effective population size defined in this way is equivalent to that which describes the asymptotic rate of increase in the probability of identity of a pair of randomly sampled genes in the absence of mutation, i.e., to the inbreeding effective population size. This equivalence is dependent on the validity of the fast-timescale approximation (Nagylaki 1980; Charlesworth 2001).

We also show how to simplify the analysis of the effects of population subdivision of a demographically structured population by defining a migration matrix that describes the net rates of movement of genes between different local populations. This involves weighting the migration probabilities of individuals of a given age-sex class by the contribution of this class to the leading left eigenvector of the matrix describing movements of genes between age and sex classes (Equation 4). This enables the determination of the moment-generating functions for the distributions of coalescent times for pairs of genes sampled from a given pair of populations, under any well-defined migration model (Equation 5), under the standard assumption that migration and drift are both weak evolutionary forces. From these, the expected coalescent times and higher moments can easily be found (Hudson 1990; Wilkinson-Herbots 1998).

Under the infinite-sites model, the expected number of nucleotide differences between a pair of alleles sampled from a prescribed pair of populations is equal to the product of the mutation rate and the corresponding expected coalescent time (Hudson 1990), so that the results can be used to generate expressions for equilibrium levels of nucleotide site diversities. In doing this, the mutation rates for different age-sex classes must be weighted in the same way as the migration rates. This approach can be extended to extinction-recolonization metapopulation models, if it is assumed that the extinction of a local population is immediately followed by its recolonization (Wakeley and Aliacar 2001).

An important general conclusion is that population subdvision makes it very hard to describe the expected level of genetic variability in a population by a single formula such as θ= 4Neu. While it is possible to define a simple expression for a weighted mean coalescence time for a pair of alleles sampled from the same deme for a general migration model (Equation 9), this involves both the effective population sizes of all the individual demes and their contributions to the leading left eigenvector of the migration matrix defined by Equation 4. In general, these are unknowable quantities, making it very hard to equate any empirical estimate of the mean within-population nucleotide site diversity to a simple scaled mutation rate parameter. The details of the demography and migration parameters of a species may greatly influence the estimated scaled mutation rate based on unweighted mean within-population nucleotide site diversities, making comparisons between different species difficult to interpret.

The situation is even worse for the nucleotide diversity for a pair of alleles sampled from the population at large, since this is related to the within-deme value by 1/(1 - FST). In general, FST depends in a complex way on migration rates and deme sizes, and simple formulas are available for only a few limiting cases, such as the island model. Only when there is negligible genetic differentiation between local populations can one confidently relate mean nucleotide site diversity to the coalescent time for a randomly sampled pair of alleles and hence to a scaled mutation rate parameter. Many empirical investigations of DNA sequence variation in natural populations do not state explicitly what population parameters are of primary interest and often make little distinction between measures of variation based on whole-population and within-population estimates. More attention to these issues in presenting analyses of data on DNA sequence variability is desirable.

Our investigation of the island model also shows that strong population subdivision, coupled with sex-specific migration rates, can greatly affect the relative values of the expected total genetic diversities for different modes of inheritance and may even reverse some of the patterns expected under panmixia. For example, Figure 3 displays the expected patterns of genetic variability for animal populations with a high variance in male reproductive success. From Figure 3C, it may be seen that predominantly female migration can cause the ratios of autosomal or X-linked variability to Y-linked variability to decline below one with extreme subdivision, in contrast to a value of over four with panmixia. Encouragingly, however, the relative values of expected autosomal and X-linked total population diversities are insensitive to all but the most extreme population subdivision, for both plant and animal models (Figures 1, 2, 3, 4). This suggests that the use of ratios of X-linked to autosomal diversities to make inferences about the strength of sexual selection (Charlesworth 2001) may be feasible even if there is a high level of population subdivision.

We have confined ourselves to deriving expressions for expected coalescence times and nucleotide site differences between alleles. However, expressions for the distribution of coalescent times for a set of n alleles sampled from a specified set of populations can easily be written down, using the migration probabilities defined by Equation 4 and coalescence probabilities defined by Equation 7 to generate the expectations of competing exponential distributions of waiting times to migration or coalescence events (Hudson 1990). Although in general it is difficult to obtain useful analytical results on the properties of these distributions (but see Wakeley 1999, 2001), they can readily be implemented in computer simulations and used to derive sampling properties of estimators of the type that we have discussed.

APPENDIX

Variances in offspring numbers with a binomial distribution of the proportion of sons vs. daughters: Let the proportion of males among breeding individuals be c. From the properties of the binomial distribution, it is easily seen that the variance of the number of sons per female is Vfm=c2Vf+n‒fc(1−c), (A1a) where n¯f and Vf are the mean and variance of total offspring number per female. If the population size is stationary, we have n¯f = 1/(1 - c), and so Vfm=c2Vf+c. (A1b) The Poisson expectation for Vfm is c/(1 - c), so this yields ΔVfm=c2{(1−c)Vf−1}(1−c). (A1c) Similarly, we have ΔVff=(1−c){(1−c)Vf−1} (A2) ΔCffm=c(1−c)Vf−c. (A3) Corresponding expressions can be derived for the progeny of male parents, interchanging c and (1 - c).

Effective population size with self-fertilization: Consider first two maternally derived genes, sampled from different individuals (probability one-quarter). Their probability of origin from a common parent in the previous generation is independent of whether the individuals are the products of selfing or outcrossing and is simply Qff=(ΔVf∕d‒2)+1N, (A4a) where d¯ = d¯S + d¯O is the mean per capita number of successful offspring produced through seed, and ΔVf is the excess over d¯ of the variance in the number of successful offspring produced through seed.

Next, consider a maternally derived and a paternally derived gene from different individuals (probability one-half). The latter has a probability of S of being derived by selfing, in which case the probability that it came from the same parent as the maternally derived gene is QfmS=∑kdkS(dkO+dkS−1)N2d‒Sd‒=(CfOfS+ΔVfS)∕(d‒Sd‒)+1N, (A4b) where Cf Of S is the covariance between the number of offspring produced through outcrossed and selfed seeds, respectively, and ΔVf S is the excess over d¯S of the variance in number of offspring produced through selfing.

If the paternal allele was derived by outcrossing (probability 1 - S), the probability that the pair came from a common parent is QfmO=(Cfm∕d‒p‒)+1N, (A4c) where Cfm is the covariance between the total number of offspring produced by seed and the number of offspring produced through outcrossed pollen, and p¯ is the mean number of offspring produced through outcrossed pollen.

Finally, consider a pair of paternally derived genes (probability one-quarter). There is a probability of approximately S 2 that they are both products of selfing, in which case the probability of sharing a common parent is QfSfS=(ΔVfS∕d‒S2)+1N. (A4d)

There is a probability 2S(1 - S) that one is derived from selfing and the other from outcrossing, in which case the probability of common parentage is QfOfS=(CfSm∕d‒sp‒)+1N, (A4e) where Cf Sm is the covariance between the number of offspring produced through selfed seed and outcrossed pollen.

There is a probability (1 - S)2 that both are products of outcrossing, in which case their probability of common parentage is QmOmO=(ΔVm∕p‒2)+1N, (A4f) where ΔVm is the excess over p¯ of the variance in number of offspring produced through outcrossed pollen.

Conditioned on common parentage, each of these possible origins of gene pairs has a probability of (1 + F)/2 of resulting in coalescence; the reciprocal of the effective population size, NeH, is thus given by multiplying (1 + F) by the sum of the products of the probabilities of origins and common parentage conditioned on origin, where at equilibrium under selfing and random mating F = S/(2 - S).

Assume that the kth individual has a total amount of resource Rk available for reproduction, of which a fraction ck is devoted to pollen production and 1 - ck to seed production. Let the net expected contribution of offspring through seed be a function f(Rk[1 - ck]) of the individual’s allocation to seed production. If the population size is stationary, this has a mean of one.

If fitnesses are normalized so that mean fitnesses through male and female contributions are equal (Lloyd 1977), the expected contribution to the next generation through outcrossed pollen is p‒i=g(Rci)(1−S)g‒, (A5) where the function g describes the dependence of relative male reproductive success on allocation to pollen (the expectation of g is 1).

The actual numbers of offspring produced are Poisson variates with f¯ and p¯ as parameters. The excess of the variance over Poisson expectation in the number of offspring produced through seed is ΔVf≈(∂f∂R)2VR+(∂f∂c)2Vc (A6a) (assuming that Rk and ck are independent), where the derivatives are evaluated at the population means. ΔVm≈(1−S)2g‒2{(∂g∂R)2VR+(∂g∂c)2Vc}. (A6b)

Similarly Cfm≈(1−S)g‒{(∂f∂R)(∂g∂R)VR+(∂f∂c)(∂g∂c)Vc} (A6c) and Cfsm≈S(1−S)g‒{(∂f∂R)(∂g∂R)VR+(∂f∂c)(∂g∂c)Vc}. (A6d)

Substituting these equations into Equation 19b, we obtain 1NeH≈(1+F)N{1+14[(1+S)∂f∂R+(1−S)∂g∂R]2VR+14[(1+S)∂f∂c+(1−S)g‒∂g∂c]2Vc}. (A7) The second term in brackets is equal to the square of the derivative of total fitness with respect to sex allocation (Charlesworth and Charlesworth 1981) and is therefore expected to be close to zero in populations that have had time to adjust their sexual allocation to its evolutionary optimum. The effect of non-Poisson variation in fitness on Ne is therefore likely to be largely determined by effects on R. It can be seen that the derivative of the female component of fitness with respect to R has a higher weight, the higher the selfing rate, whereas the derivative of outcrossed male fitness has a weight that declines to zero for complete selfing. If the two derivatives are of similar magnitude, there will be very little change in the contribution from VR with changes in S.

More complex models, which allow for variation in the selfing rate and for covariances between R and c, can be written down, but the conclusions are not greatly changed. The covariance term can be seen to carry a weight that involves a factor equal to the derivative of total fitness with respect to sex allocation, which is expected to be close to zero (see above) from the above argument. The variance in selfing rate is necessarily close to zero for outcrossing and highly selfing populations and so will not influence their relative Ne values.

Acknowledgments

We thank N. Barton, D. Charlesworth, F. Depaulis, and two anonymous reviewers for their comments on the manuscript. B.C. acknowledges support by the Royal Society and the Engineering and Physics Research Council, and V.L. acknowledges support by the Biotechnology and Biological Sciences Research Council.

Footnotes

  • Communicating editor: G. B. Golding

  • Received April 2, 2002.
  • Accepted June 18, 2002.
  • Copyright © 2002 by the Genetics Society of America

LITERATURE CITED

  1. ↵
    1. Bachtrog D.,
    2. Charlesworth B.
    , 2000 Reduced levels of microsatellite variability on the neo-Y chromosome of Drosophila miranda. Curr. Biol. 10: 1025–1031.
    OpenUrlCrossRefPubMedWeb of Science
  2. ↵
    1. Bachtrog D.,
    2. Charlesworth B.
    , 2002 Reduced adaptation of an evolving neo-Y chromosome. Nature 416: 323–326.
    OpenUrlCrossRefPubMedWeb of Science
    1. Barbujani G.,
    2. Magagni A.,
    3. Minch E.,
    4. Cavalli-Sforza L. L.
    , 1997 An apportionment of human DNA diversity. Proc. Natl. Acad. Sci. USA 94: 4516–4519.
    OpenUrlAbstract/FREE Full Text
  3. ↵
    1. Bertranpetit J.
    , 2000 Genome, diversity, and origins: the Y chromosome as a storyteller. Proc. Natl. Acad. Sci. USA 97: 6927–6929.
    OpenUrlFREE Full Text
    1. Bowcock A. M.,
    2. Kidd J. R.,
    3. Mountain J. L.,
    4. Hebert J. M.,
    5. Carotenuto L.,
    6. et al.
    , 1991 Drift, admixture, and selection in human evolution—a study with DNA polymorphisms. Proc. Natl. Acad. Sci. USA 88: 839–843.
    OpenUrlAbstract/FREE Full Text
  4. ↵
    1. Caballero A.
    , 1995 On the effective size of populations with separate sexes, with particular reference to sex-linked genes. Genetics 139: 1007–1011.
    OpenUrlAbstract/FREE Full Text
  5. ↵
    1. Charlesworth B.
    , 1998 Measures of divergence between populations and the effect of forces that reduce variability. Mol. Biol. Evol. 15: 538–543.
    OpenUrlAbstract
  6. ↵
    1. Charlesworth B.
    , 2001 The effect of life-history and mode of inheritance on neutral genetic variability. Genet. Res. 77: 153–166.
    OpenUrlCrossRefPubMedWeb of Science
  7. ↵
    1. Charlesworth B.,
    2. Charlesworth D.
    , 2000 The degeneration of Y chromosomes. Philos. Trans. R. Soc. Lond. B 355: 1563–1572.
    OpenUrlAbstract/FREE Full Text
  8. ↵
    1. Charlesworth D.,
    2. Charlesworth B.
    , 1981 Allocation of resources to male and female functions in hermaphrodites. Biol. J. Linn. Soc. 15: 57–74.
    OpenUrlCrossRefWeb of Science
  9. ↵
    1. Chesser R. K.,
    2. Baker R. J.
    , 1996 Effective sizes and dynamics of uniparentally and diparentally inherited genes. Genetics 144: 1225–1235.
    OpenUrlAbstract/FREE Full Text
  10. ↵
    1. Chesser R. K.,
    2. Rhodes O. E. J.,
    3. Sugg D. W.,
    4. Schnabel A.
    , 1993 Effective sizes for subdivided populations. Genetics 135: 1221–1232.
    OpenUrlAbstract/FREE Full Text
  11. ↵
    1. Cockerham C. C.,
    2. Weir B. S.
    , 1993 Estimation of gene flow from F-statistics. Evolution 47: 855–863.
    OpenUrlCrossRefWeb of Science
  12. ↵
    1. Ennos R. A.
    , 1994 Estimating the relative rates of pollen and seed migration among plant populations. Heredity 72: 250–259.
    OpenUrlCrossRefWeb of Science
  13. ↵
    1. Excoffier L.,
    2. Smouse P. E.,
    3. Quattro J. M.
    , 1992 Analysis of molecular variance inferred from metric distances among DNA haplotypes—application to human mitochondrial-DNA restriction data. Genetics 131: 479–491.
    OpenUrlAbstract/FREE Full Text
  14. ↵
    1. Filatov D. A.,
    2. Monéger F.,
    3. Negrutiu I.,
    4. Charlesworth D.
    , 2000 Evolution of a plant Y-chromosome: variability in a Y-linked gene of Silene latifolia. Nature 404: 388–390.
    OpenUrlCrossRefPubMedWeb of Science
  15. ↵
    1. Filatov D. A.,
    2. Laporte V.,
    3. Vitte C.,
    4. Charlesworth D.
    , 2001 DNA diversity in sex linked and autosomal genes of the plant species Silene latifolia and S. dioica. Mol. Biol. Evol. 18: 1442–1454.
    OpenUrlAbstract/FREE Full Text
  16. ↵
    1. Hammer M. F.,
    2. Spurdle A. B.,
    3. Karafet T.,
    4. Bonner M. R.,
    5. Wood E. T.,
    6. et al.
    , 1997 The geographic distribution of human Y chromosome variation. Genetics 145: 787–805.
    OpenUrlAbstract/FREE Full Text
  17. ↵
    1. Hammer M. F.,
    2. Karafet T. M.,
    3. Redd A. J.,
    4. Jarjanazi H.,
    5. Santachiara-Benerecetti S.,
    6. et al.
    , 2001 Hierarchical patterns of global human Y-chromosome diversity. Mol. Biol. Evol. 18: 1189–1203.
    OpenUrlAbstract/FREE Full Text
  18. ↵
    1. Hudson R. R.
    , 1990 Gene genealogies and the coalescent process. Oxf. Surv. Evol. Biol. 7: 1–45.
    OpenUrl
  19. ↵
    1. Hudson R. R.,
    2. Boos D. D.,
    3. Kaplan N. L.
    , 1992 A statistical test for detecting geographic subdivision. Mol. Biol. Evol. 9: 138–151.
    OpenUrlAbstract
  20. ↵
    1. Hurles M. E.,
    2. Jobling M. A.
    , 2001 Haploid chromosomes in molecular ecology: lessons from the human Y. Mol. Ecol. 10: 1599–1613.
    OpenUrlCrossRefPubMedWeb of Science
  21. ↵
    1. Hurst L. D.,
    2. Ellegren H.
    , 1998 Sex biases in the mutation rate. Trends Genet. 14: 446–452.
    OpenUrlCrossRefPubMedWeb of Science
  22. ↵
    1. Jorde L. B.,
    2. Watkins W. S.,
    3. Bamshad M. J.,
    4. Dixon M. E.,
    5. Ricker C. E.,
    6. et al.
    , 2000 The distribution of human genetic diversity: a comparison of mitochondrial, autosomal, and Y-chromosome data. Am. J. Hum. Genet. 66: 979–988.
    OpenUrlCrossRefPubMedWeb of Science
    1. Kayser M.,
    2. Krawczak M.,
    3. Excoffier L.,
    4. Dieltjes P.,
    5. Corach D.,
    6. et al.
    , 2001 An extensive analysis of Y-chromosomal microsatellite haplotypes in globally dispersed human populations. Am. J. Hum. Genet. 68: 990–1018.
    OpenUrlCrossRefPubMedWeb of Science
  23. ↵
    1. Kimura M.
    , 1971 Theoretical foundations of population genetics at the molecular level. Theor. Popul. Biol. 2: 174–208.
    OpenUrlCrossRefPubMed
  24. ↵
    1. Lloyd D. G.
    , 1977 Genetic and phenotypic models of natural selection. J. Theor. Biol. 69: 543–560.
    OpenUrlCrossRefPubMedWeb of Science
  25. ↵
    1. McVean G.
    , 2000 Evolutionary genetics: what is driving male mutation? Curr. Biol. 10: R834–R835.
    OpenUrlCrossRefPubMed
  26. ↵
    1. McVean G. A. T.,
    2. Hurst L. D.
    , 1997 Evidence for a selectively favourable reduction in the mutation rate of the X chromosome. Nature 386: 388–392.
    OpenUrlCrossRefPubMedWeb of Science
  27. ↵
    1. Nagylaki T.
    , 1980 The strong migration limit in geographically structured populations. J. Math. Biol. 9: 101–114.
    OpenUrlCrossRefPubMedWeb of Science
  28. ↵
    1. Nagylaki T.
    , 1982 Geographical invariance in population genetics. J. Theor. Biol. 99: 159–172.
    OpenUrlCrossRefPubMed
  29. ↵
    1. Nagylaki T.
    , 1998a The expected number of heterozygous sites in a subdivided population. Genetics 149: 1599–1604.
    OpenUrlAbstract/FREE Full Text
  30. ↵
    1. Nagylaki T.
    , 1998b Fixation indices in subdivided populations. Genetics 148: 1325–1332.
    OpenUrlAbstract/FREE Full Text
  31. ↵
    1. Nordborg M.
    , 1997 Structured coalescent processes on different time scales. Genetics 146: 1501–1514.
    OpenUrlAbstract/FREE Full Text
  32. ↵
    1. Nordborg M.,
    2. Donnelly P.
    , 1997 The coalescent process with selfing. Genetics 146: 1185–1195.
    OpenUrlAbstract/FREE Full Text
  33. ↵
    1. Oota H.,
    2. Settheetham-Ishida W.,
    3. Tiwawech D.,
    4. Ishida T.,
    5. Stoneking M.
    , 2001 Human mtDNA and Y-chromosome variation is correlated with matrilocal versus patrilocal residence. Nat. Genet. 29: 20–21.
    OpenUrlCrossRefPubMedWeb of Science
  34. ↵
    1. Pollak E.
    , 1987 On the theory of partially inbreeding populations. I. Partial selfing. Genetics 117: 353–360.
    OpenUrlAbstract/FREE Full Text
  35. ↵
    1. Poloni E. S.,
    2. Semino O.,
    3. Passarino G.,
    4. Santachiara-Benerecetti A. S.,
    5. Dupanloup I.,
    6. et al.
    , 1997 Human genetic affinities for Y-chromosome P49a,f/TaqI haplotypes show strong correspondence with linguistics. Am. J. Hum. Genet. 61: 1015–1035.
    OpenUrlCrossRefPubMedWeb of Science
    1. Quintana-Murci L.,
    2. Semino O.,
    3. Poloni E. S.,
    4. Liu A.,
    5. Van Gijn M.,
    6. et al.
    , 1999 Y-chromosome specific YCAII, DYS19 and YAP polymorphisms in human populations: a comparative study. Ann. Hum. Genet. 63: 153–166.
    OpenUrlCrossRefPubMedWeb of Science
  36. ↵
    1. Rousset F.
    , 1999 Genetic differentiation in populations with different classes of individuals. Theor. Popul. Biol. 55: 297–308.
    OpenUrlCrossRefPubMed
  37. ↵
    1. Seielstad M. T.,
    2. Minch E.,
    3. Cavalli-Sforza L. L.
    , 1998 Genetic evidence for a higher female migration rate in humans. Nat. Genet. 20: 278–280.
    OpenUrlCrossRefPubMedWeb of Science
  38. ↵
    1. Shen P. D.,
    2. Wang F.,
    3. Underhill P. A.,
    4. Franco C.,
    5. Yang W. H.,
    6. et al.
    , 2000 Population genetic implications from sequence variation in four Y chromosome genes. Proc. Natl. Acad. Sci. USA 97: 7354–7359.
    OpenUrlAbstract/FREE Full Text
  39. ↵
    1. Slatkin M.
    , 1991 Inbreeding coefficients and coalescence times. Genet. Res. 58: 167–175.
    OpenUrlCrossRefPubMedWeb of Science
  40. ↵
    1. Stoneking M.
    , 1998 Women on the move. Nat. Genet. 20: 219–220.
    OpenUrlCrossRefPubMedWeb of Science
  41. ↵
    1. Thomson R.,
    2. Pritchard J. K.,
    3. Shen P. D.,
    4. Oefner P. J.,
    5. Feldman M. W.
    , 2000 Recent common ancestry of human Y chromosomes: evidence from DNA sequence data. Proc. Natl. Acad. Sci. USA 97: 7360–7365.
    OpenUrlAbstract/FREE Full Text
  42. ↵
    1. Thorstenson Y. R.,
    2. Shen P. D.,
    3. Tusher V. G.,
    4. Wayne T. L.,
    5. Davis R. W.,
    6. et al.
    , 2001 Global analysis of ATM polymorphism reveals significant functional constraint. Am. J. Hum. Genet. 69: 396–412.
    OpenUrlCrossRefPubMedWeb of Science
    1. Underhill P. A.,
    2. Jin L.,
    3. Lin A. A.,
    4. Mehdi S. Q.,
    5. Jenkins T.,
    6. et al.
    , 1997 Detection of numerous Y chromosome biallelic polymorphisms by denaturing high-performance liquid chromatography. Genome Res. 7: 996–1005.
    OpenUrlAbstract/FREE Full Text
  43. ↵
    1. Wakeley J.
    , 1999 Nonequilibrium migration in human history. Genetics 153: 1863–1871.
    OpenUrlAbstract/FREE Full Text
  44. ↵
    1. Wakeley J.
    , 2001 The coalescent in an island model of population subdivision with variation among demes. Theor. Popul. Biol. 59: 133–144.
    OpenUrlCrossRefPubMedWeb of Science
  45. ↵
    1. Wakeley J.,
    2. Aliacar N.
    , 2001 Gene genealogies in a metapopulation. Genetics 159: 893–905.
    OpenUrlAbstract/FREE Full Text
  46. ↵
    1. Wang J.
    , 1997a Effective size and F-statistics of subdivided populations: I. Monoecious species with partial selfing. Genetics 146: 1453–1463.
    OpenUrlAbstract/FREE Full Text
  47. ↵
    1. Wang J.
    , 1997b Effective size and F-statistics of subdivided populations: II. Dioecious species. Genetics 146: 1465–1474.
    OpenUrlAbstract/FREE Full Text
  48. ↵
    1. Wang J.
    , 1999 Effective size and F-statistics of subdivided populations for sex-linked loci. Theor. Popul. Biol. 55: 176–188.
    OpenUrlCrossRefPubMed
  49. ↵
    1. Wang J.,
    2. Caballero A.
    , 1999 Developments in predicting the effective size of subdivided populations. Heredity 82: 212–226.
    OpenUrlCrossRefWeb of Science
    1. Watkins W. S.,
    2. Ricker C. E.,
    3. Bamshad M. J.,
    4. Carroll M. L.,
    5. Nguyen S. V.,
    6. et al.
    , 2001 Patterns of ancestral human diversity: an analysis of Alu-insertion and restriction-site polymorphisms. Am. J. Hum. Genet. 68: 738–752.
    OpenUrlCrossRefPubMedWeb of Science
  50. ↵
    1. Whitlock M. C.,
    2. Barton N. H.
    , 1997 The effective size of a subdivided population. Genetics 146: 427–441.
    OpenUrlAbstract/FREE Full Text
  51. ↵
    1. Wilkinson-Herbots H. M.
    , 1998 Genealogy and subpopulation differentiation under various models of population structure. J. Math. Biol. 37: 535–585.
    OpenUrlCrossRefWeb of Science
  52. ↵
    1. Wright S.
    , 1931 Evolution in Mendelian populations. Genetics 16: 97–169.
    OpenUrlFREE Full Text
  53. ↵
    1. Wright S.
    , 1943 Isolation by distance. Genetics 28: 114–138.
    OpenUrlFREE Full Text
  54. ↵
    1. Wright S.
    , 1951 The genetical structure of populations. Ann. Eugen. 15: 323–354.
    OpenUrlPubMedWeb of Science
    1. Yu N.,
    2. Fu Y. X.,
    3. Sambuughin N.,
    4. Ramsay M.,
    5. Jenkins T.,
    6. et al.
    , 2001 Global patterns of human DNA sequence variation in a 10-kb region on chromosome 1. Mol. Biol. Evol. 18: 214–222.
    OpenUrlAbstract/FREE Full Text
    1. Zhao Z. M.,
    2. Jin L.,
    3. Fu Y. X.,
    4. Ramsay M.,
    5. Jenkins T.,
    6. et al.
    , 2000 Worldwide DNA sequence variation in a 10-kilobase non-coding region on human chromosome 22. Proc. Natl. Acad. Sci. USA 97: 11354–11358.
    OpenUrlAbstract/FREE Full Text
    1. Zietkiewicz E.,
    2. Yotova V.,
    3. Jarnik M.,
    4. Korah-Laskowska M.,
    5. Kidd K. K.,
    6. et al.
    , 1998 Genetic structure of the ancestral population of modern humans. J. Mol. Evol. 47: 146–155.
    OpenUrlCrossRefPubMedWeb of Science
  55. ↵
    1. Zurovcova M.,
    2. Eanes W. F.
    , 1999 Lack of nucleotide polymorphism in the Y-linked sperm flagellar dynein gene Dhc-Yh3 of Drosophila melanogaster and D. simulans. Genetics 153: 1709–1715.
    OpenUrlAbstract/FREE Full Text
View Abstract
Previous ArticleNext Article
Back to top

PUBLICATION INFORMATION

Volume 162 Issue 1, September 2002

Genetics: 162 (1)

ARTICLE CLASSIFICATION

INVESTIGATIONS
View this article with LENS
Email

Thank you for sharing this Genetics article.

NOTE: We request your email address only to inform the recipient that it was you who recommended this article, and that it is not junk mail. We do not retain these email addresses.

Enter multiple addresses on separate lines or separate them with commas.
Effective Population Size and Population Subdivision in Demographically Structured Populations
(Your Name) has forwarded a page to you from Genetics
(Your Name) thought you would be interested in this article in Genetics.
Print
Alerts
Enter your email below to set up alert notifications for new article, or to manage your existing alerts.
SIGN UP OR SIGN IN WITH YOUR EMAIL
View PDF
Share

Effective Population Size and Population Subdivision in Demographically Structured Populations

Valérie Laporte and Brian Charlesworth
Genetics September 1, 2002 vol. 162 no. 1 501-519
Valérie Laporte
Institute of Cell, Animal and Population Biology, University of Edinburgh, Edinburgh EH9 3JT, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: valerie.laporte@ed.ac.uk
Brian Charlesworth
Institute of Cell, Animal and Population Biology, University of Edinburgh, Edinburgh EH9 3JT, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
del.icio.us logo Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
Citation

Effective Population Size and Population Subdivision in Demographically Structured Populations

Valérie Laporte and Brian Charlesworth
Genetics September 1, 2002 vol. 162 no. 1 501-519
Valérie Laporte
Institute of Cell, Animal and Population Biology, University of Edinburgh, Edinburgh EH9 3JT, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: valerie.laporte@ed.ac.uk
Brian Charlesworth
Institute of Cell, Animal and Population Biology, University of Edinburgh, Edinburgh EH9 3JT, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero

Related Articles

Cited By

More in this TOC Section

  • Comparative Genomics and Transcriptomics To Analyze Fruiting Body Development in Filamentous Ascomycetes
  • The Fate of Deleterious Variants in a Barley Genomic Prediction Population
  • The Enigmatic Canal-Associated Neurons Regulate Caenorhabditis elegans Larval Development Through a cAMP Signaling Pathway
Show more Investigations
  • Top
  • Article
    • Abstract
    • A GENERAL MODEL OF MIGRATION AND DRIFT IN STRUCTURED POPULATIONS
    • COALESCENT PROBABILITIES AND EFFECTIVE POPULATION SIZES
    • THE ISLAND MODEL WITH SEX-SPECIFIC MIGRATION PARAMETERS
    • COMPARISONS WITH DATA ON NATURAL POPULATIONS
    • DISCUSSION
    • APPENDIX
    • Acknowledgments
    • Footnotes
    • LITERATURE CITED
  • Figures & Data
  • Info & Metrics

GSA

The Genetics Society of America (GSA), founded in 1931, is the professional membership organization for scientific researchers and educators in the field of genetics. Our members work to advance knowledge in the basic mechanisms of inheritance, from the molecular to the population level.

Online ISSN: 1943-2631

  • For Authors
  • For Reviewers
  • For Subscribers
  • Submit a Manuscript
  • Editorial Board
  • Press Releases

SPPA Logo

GET CONNECTED

RSS  Subscribe with RSS.

email  Subscribe via email. Sign up to receive alert notifications of new articles.

  • Facebook
  • Twitter
  • YouTube
  • LinkedIn
  • Google Plus

Copyright © 2019 by the Genetics Society of America

  • About GENETICS
  • Terms of use
  • Advertising
  • Permissions
  • Contact us
  • International access