# Genetic Load in Sexual and Asexual Diploids: Segregation, Dominance and Genetic Drift

- 1
*Corresponding author:*University of Edinburgh, Institute of Evolutionary Biology, Ashworth Lab. 2, W. Mains Rd., Edinburgh EH9 3JT, United Kingdom. E-mail: christoph.haag{at}ed.ac.uk

## Abstract

In diploid organisms, sexual reproduction rearranges allelic combinations between loci (recombination) as well as within loci (segregation). Several studies have analyzed the effect of segregation on the genetic load due to recurrent deleterious mutations, but considered infinite populations, thus neglecting the effects of genetic drift. Here, we use single-locus models to explore the combined effects of segregation, selection, and drift. We find that, for partly recessive deleterious alleles, segregation affects both the deterministic component of the change in allele frequencies and the stochastic component due to drift. As a result, we find that the mutation load may be far greater in asexuals than in sexuals in finite and/or subdivided populations. In finite populations, this effect arises primarily because, in the absence of segregation, heterozygotes may reach high frequencies due to drift, while homozygotes are still efficiently selected against; this is not possible with segregation, as matings between heterozygotes constantly produce new homozygotes. If deleterious alleles are partly, but not fully recessive, this causes an excess load in asexuals at intermediate population sizes. In subdivided populations without extinction, drift mostly occurs locally, which reduces the efficiency of selection in both sexuals and asexuals, but does not lead to global fixation. Yet, local drift is stronger in asexuals than in sexuals, leading to a higher mutation load in asexuals. In metapopulations with turnover, global drift becomes again important, leading to similar results as in finite, unstructured populations. Overall, the mutation load that arises through the absence of segregation in asexuals may greatly exceed previous predictions that ignored genetic drift.

MOST eukaryotes engage in sexual reproduction despite potentially high costs, such as the famous twofold cost of sex (Maynard Smith 1978; Barton and Charlesworth 1998). Genetically, the key components of sexual reproduction are recombination and, in diploid organisms, segregation. Both are absent under pure asexual reproduction. Recombination and segregation rearrange the genotypic composition of offspring from sexual matings, by bringing together novel allelic combinations at a locus (segregation) or at a set of different loci (recombination). Hence, these processes may affect the distribution of fitness values within populations and may therefore generate indirect selective pressure for sexual reproduction (Barton and Charlesworth 1998; Otto and Lenormand 2002; Otto 2003; Agrawal 2006; de Visser and Elena 2007).

One possible advantage of recombination and segregation is that they allow sexual populations to reduce their genetic load through an improved efficiency of selection against deleterious alleles (Kimura and Maruyama 1966; Crow 1970; Crow and Kimura 1970). This requires the existence of negative disequilibria such as when beneficial and deleterious alleles (within or between loci) occur more often in the same individual than expected by chance. Recombination and segregation bring together favorable alleles within the same individuals (and unfavorable alleles in others) and hence improve the efficiency of natural selection. Selection against recurrent deleterious mutation can create negative disequilibria between loci (“negative linkage disequilibrium”) if deleterious alleles at different loci interact synergistically (Kondrashov 1982; Charlesworth 1990). Equivalently, selection can create negative disequilibria within loci (“heterozygote excess”) if deleterious alleles are fully or partially recessive. This is because (with partially recessive deleterious alleles) the fitness of heterozygotes is higher than the average fitness of the homozygotes, and hence heterozygote excess develops during selection. Once a heterozygote excess is established, sexual reproduction leads to improved selection and therefore to reduced genetic load, because segregation eliminates the heterozygote excess, resulting in an increased variance in fitness (Chasnov 2000).

Arguments based on the genetic load are, however, not sufficient to predict how a modifier gene affecting the balance between sexual and asexual reproduction will evolve (*e.g.*, Barton 1995; Otto 2003). Indeed, there is always a cost (in terms of mean fitness of offspring) of breaking genetic associations that have been generated by selection. This cost is termed “recombination load” or “segregation load” (depending on whether negative linkage disequilibrium or heterozygote excess is broken). Analyses of modifier models have shown that, in infinite, randomly mating populations, sexual reproduction may be favored only when dominance and/or epistasis are sufficiently weak relative to the strength of selection, so that the recombination load and/or segregation load is not too high (Barton 1995; Otto 2003). These models have also shown that even a low rate of inbreeding may allow sex and recombination to be favored under less restrictive conditions than with random mating (Otto 2003; Roze and Lenormand 2005). Whereas there is little empirical support for widespread weak synergistic epistasis (Rice 2002), there is ample evidence that new deleterious mutations are, on average, partly recessive (Muller 1950; Simmons and Crow 1977; Lynch and Walsh 1998; Szafraniec *et al.* 2003). In diploid populations, genetic associations generated by dominance may thus play a greater role in the evolution of sex than genetic associations generated by epistasis (Otto 2003).

Another factor that may contribute to the creation of negative linkage disequilibria is genetic drift in conjunction with directional selection. This is because genetic drift randomly creates positive and negative associations, but positive associations are rapidly consumed by selection (because they represent the most extreme fitness values), while negative associations tend to last longer (Hill and Robertson 1966; Felsenstein 1974). Genetic drift together with directional selection can lead to an advantage of recombination without the requirement of synergistic epistasis (Otto and Barton 1997, 2001; Iles *et al.* 2003; Barton and Otto 2005; Keightley and Otto 2006; Roze and Barton 2006), especially in subdivided populations (Martin *et al.* 2006; Salathé *et al.* 2006).

Whether genetic drift can also lead to an advantage of segregation is less clear. Genetic drift has two important effects: first, in sexual populations, it may increase the average strength of selection against recessive deleterious alleles, an effect that has been termed “purging by drift” (Glémin 2003); it is unclear whether this effect can also occur in asexuals. Second, it leads to random changes in allele frequencies, which renders selection less efficient: if drift is too strong compared to selection, frequency changes of deleterious alleles may be similar to those of neutral alleles (Kimura *et al.* 1963). However, the strength of this effect may differ between sexual and asexual populations; indeed, due to the absence of segregation, asexuals inherit genotypes rather than alleles, which increases the sampling variance of genotype frequencies in asexual populations and thus reduces their variance effective size relative to sexual populations (Balloux *et al.* 2003).

Here, we analyze both of the effects of genetic drift explicitly by using equilibrium models to investigate the expected genetic load due to recurrent deleterious mutation in sexual and asexual populations subject to drift. These models do not directly study the evolution of sex, because we fix the rate of sexual reproduction to either zero or one. Rather, they aim, as a first step, at comparing the relative effects of drift and selection between sexual and asexual diploids subject to recurrent deleterious mutation. To concentrate only on effects that are due to segregation, we use simple one-locus two-allele models, starting with a single population of varying effective size. We then extend this to metapopulations with finite, but large numbers of demes. This extension is important because it is likely to represent the natural situation, as most populations are subdivided to some extent, and because single small populations are unlikely to persist over long periods of time. Agrawal and Chasnov (2001) derived the mutation load in diploid, infinite, and spatially structured sexual and asexual populations. In their model, population regulation occurs at the level of the whole population, and the only effect of population structure is to increase homozygosity in sexuals. However, it is likely that, in subdivided populations, most competition occurs locally, decreasing the efficiency of selection by increasing competition among related individuals (“local drift”). Population structure may also affect the load through effects on genetic drift at the total population level and on demography. We investigate these different effects using a finite-island model with extinction and recolonization. Overall, we show that in single undivided populations, as well as in metapopulations, the cumulative effects of genetic drift and segregation across a realistic number of loci may lead to an equilibrium fitness in sexuals that is many times higher than that in asexual populations.

## THE MODEL

Throughout, we calculate the genetic load in sexual and asexual populations due to a single locus that mutates with rate *u* from a wild-type allele, *A*, to a mutant allele, *a*. Back mutation from *a* to *A* occurs at rate *v*, . Relative genotypic fitness values for *AA*, *Aa*, and *aa* are 1, , and , respectively, where *h* is the dominance coefficient and *s* is the selection coefficient. The genetic load *L* is defined as , where *W* is the mean fitness of a population. Following Chasnov (2000) and Agrawal and Chasnov (2001), we extrapolate our results to many loci by assuming that each locus contributes independently (multiplicatively) to the genetic load; that is, , where *n* is the number of loci. As is discussed later, this single-locus load underestimates the total load of asexuals, as interference between loci may greatly reduce the efficiency of selection at each locus.

#### Single sexual population:

The mean fitness *W*_{sex} of a randomly mating population is determined by the frequency *p* of the deleterious allele: , and thus . The expected allele frequency (and the expected squared frequency, ) in a population of arbitrary size *N*, subject to mutation, selection, and genetic drift, can be obtained by numerical integration of Wright's distribution (Wright 1937; Kimura *et al.* 1963; see also Caballero and Hill 1992; Bataillon and Kirkpatrick 2000; Glémin 2003). All numerical calculations were done with Mathematica (Wolfram 2003), and we checked the approximations against simulation results, obtained by averaging the observed load over 10^{8} generations, after the mutation–selection–drift equilibrium had been reached (which can easily be checked by visual inspection of the results).

#### Single asexual population:

Due to the lack of segregation in obligate asexual diploids, their two haploid genomes will acquire mutations independently. Thus, a new mutation that arises in one of the two homologous chromosomes of an asexual will be restricted to that chromosome unless an independent mutation occurs at the same locus in the second chromosome (see also Charlesworth and Charlesworth 1997). Calculating the mutation–selection–drift balance for a diploid asexual population hence requires solving a two-dimensional stochastic model representing the change in frequency of genotypes *Aa* and *aa*. However, this can be simplified by noting that, when mutations are (partially) recessive, only two genotypes will usually segregate in the population. When *N*_{e} is large, the population is at mutation–selection balance and mutant homozygotes can be neglected (provided that ). As *N*_{e} decreases, selection against *Aa* individuals becomes inefficient (roughly when *N*_{e} < 1/*hs*), and *Aa* goes to fixation. However, selection against *aa* individuals remains efficient, and the frequency of these individuals remains small, until, when *N*_{e} decreases to ∼ <1/*s*, selection against *aa* also becomes inefficient, and *aa* will eventually fix. Each of these two processes can be analyzed separately by standard diffusion models for haploid populations with only two genotypes with different fitnesses (Crow and Kimura 1970).

The first process is represented by a diffusion in a population composed of *Aa* and *AA* individuals, corresponding to a standard haploid diffusion where *Aa* individuals have relative fitness . The mutation rate from *AA* to *Aa* is 2*u* (because mutation in either of the two homologous chromosomes will form an *Aa* individual; Charlesworth and Charlesworth 1997), and the back-mutation rate (from *Aa* to *AA*) is *v*. Integration of the haploid diffusion described by these parameters yields *Q*, the expected frequency of *Aa* individuals in a population of *Aa* and *AA* individuals. The second process is represented by a diffusion in a population composed only of *aa* and *Aa* individuals (assuming the back-mutation rate is sufficiently small), that is, a standard haploid diffusion, where *aa* individuals have relative fitness , which equals to the first order in *s*. The mutation rate from *Aa* to *aa* is *u*, and the back-mutation rate (from *aa* to *Aa*) is 2*v*. This yields *R*, the expected frequency of *aa* individuals in a population of *aa* and *Aa*. To combine these two processes, we approximate the expected frequencies of mutant homozygotes and heterozygotes, and , by *QR* and *Q*(1 − *R*), respectively. Although mathematically not strictly correct, this gives good results (compared to simulations) here because it is only when the genotype *Aa* is close to fixation in the first diffusion (*Q* close to 1) that the frequency of *aa* is not negligibly small in the second diffusion. The load is then given by .

#### Large population approximation:

The expected genetic load in sexual and asexual populations of infinite size is between *u* and 2*u*, but stays close to 2*u* for most biologically realistic parameter values. Only when *h* is quite small () is the load significantly reduced in sexuals compared to asexuals, because, as *h* decreases, *L* tends to *u* more quickly in sexuals than in asexuals (Chasnov 2000; for the sexual case, see also Kimura *et al.* 1963).

#### Small population approximation:

When is very small, the population is fixed for one genotype most of the time, and selection has little effect on the fixation probabilities. Neglecting the effect of selection, a simple calculation shows that asexual populations are fixed for *aa*, *Aa*, or *AA* with probabilities , , and , respectively. On average, the load is thus given by . Sexual populations are fixed for *aa* or *AA* with probabilities and , respectively, and the load is given by . When , and are both close to *s*; however, the sexual load will be slightly higher than the asexual load as long as : .

#### Subdivided population:

We next use the island model to consider the effects of population structure. The population is subdivided into *n* demes, each containing *N* diploid adults. These adults produce a large but, depending on their fitness, variable number of gametes (in the sexual case) or diploid juveniles (in the asexual case) and then die. In the sexual case, gamete fusion is random within each deme. Each juvenile then disperses with probability *m*. Each deme thus contributes to the pool of migrants in proportion to its average fecundity. Each migrant can reach any other deme with the same probability. Finally, *N* individuals are sampled randomly among all the juveniles present in each deme, to form the next adult generation. We also consider the effect of local extinctions of demes; for this, we use Slatkin's extinction–recolonization model (Slatkin 1977). At the beginning of a generation, each deme goes extinct with probability *e*. During the dispersal phase, juveniles reaching an extinct deme do not survive. Then (after the dispersal phase), each extinct deme is recolonized by *k* juveniles, either sampled randomly from the whole population of juveniles (migrant pool model) or derived from the same deme (propagule pool model). In both cases, each juvenile has an equal probability to become a recolonizer. It is assumed that recolonizers reproduce immediately, so that deme size goes back to *N* in all demes.

Our model corresponds, for instance, to a population subdivided into discrete patches in which the number of breeding sites is fixed (*N* adults per patch). This introduces local population regulation, but regulation is not completely local (as long as some migration occurs) because more fertile patches will produce more migrants. Therefore, our model may be seen as intermediate between complete local regulation (“soft selection”) and complete global regulation (sometimes called “hard selection”). Under complete global regulation, each deme contributes to the next generation (and not just to the migrant pool) in proportion to its mean fecundity. However, it is difficult to imagine a simple biological scenario that would correspond to complete global regulation in a spatially structured population. This can easily be seen by considering the limit when migration tends to zero, in which case one would have to assume that deme sizes can grow indefinitely, at rates depending on their mean fecundity. Also at intermediate levels of migration, it is difficult to imagine a life cycle that would make the contribution of each deme to the next generation exactly proportional to its mean fecundity. Therefore, rather than using a parameter that measures the “degree of local competition” as is sometimes done to scale between soft and hard selection, we preferred to investigate the effects of local competition in a simple life cycle where all parameters have immediate biological meanings (deme size, migration rate, extinction rate). Still, we can note that our model is equivalent to soft selection when and to hard selection when .

We use the method of Roze and Rousset (2003), which is sketched in appendix a, to derive expressions for the expectation and the variance of the change in frequency of the deleterious allele, over one generation. Importantly, this method uses a separation-of-timescales argument that works best when selection is weak relative to migration (). In sexuals, the expected change in frequency *p* of the mutant allele *a*, , is given by(1)with and(2)(3)In the expressions above, , , and α are probabilities of coalescence within demes under neutrality, which are functions of *N*, *m*, *e*, and *k*, and are given in appendix a; is the “backward” migration rate (the probability that, after dispersal, a juvenile comes from another deme), given by .

For asexuals, we calculate the expected genotype frequencies using two diffusions, as described previously for the single population. In the first case, where *aa* individuals are very rare, and genotypes *Aa* and *AA* segregate in the population with frequencies *p* and *q*, respectively, the expected change in the frequency of *Aa* individuals over one generation is given by(4)with(5)where is the probability that the ancestral lineages of two genes sampled with replacement from the same deme stay in the same deme and coalesce, in a haploid model under neutrality (see appendix a). In the second case, where *AA* is very rare, and *aa* and *Aa* segregate in the population (with *p* now being defined as the frequency of *aa* and *q* as the frequency of *Aa*), the expected change in frequency of *aa* individuals over one generation is given by(6)with(7)where is the same as above.

We then have for both the sexual and the asexual cases(8)(*e.g.*, Roze and Rousset 2003), where is again the neutral probability of coalescence for two genes sampled with replacement from a deme. The expression for differs between the sexuals and the asexuals (see appendix a), and, as above, *p* is the frequency of *a* in the sexual case and the frequency of *Aa* and *aa* in the first and second asexual models, respectively, and *q* = 1 − *p*.

The equations above take the same form as in a single finite population, the selection coefficients and the effective population size now depending on *N*, *m*, *e*, and *k*. As for a single population, these diffusion equations can be integrated numerically (see appendix a) to obtain the load at equilibrium, the only difference being that, in the sexual case, the frequency of homozygotes is affected by population structure, and the load is now given by(9)where and are averages over the probability distribution of *p*, the frequency of allele *a* (*e.g.*, Roze and Rousset 2003). For asexuals, we combine the two diffusions as described above for the single population and express the load as(10)

## RESULTS

#### Single population:

Figure 1 shows some results obtained by numerical integration for a partially recessive deleterious allele (*h* = 0.1, *s* = 0.05) in sexual and asexual populations. For sexual populations, Figure 1 illustrates findings already obtained by others (Kimura *et al.* 1963; Bataillon and Kirkpatrick 2000; Glémin 2003): in large populations, the mean mutant allele frequency and load are very close to mutation–selection balance. As population size decreases, genetic drift increases, leading to partial purging by drift (Glémin 2003), that is, a reduction in the mean frequency of the deleterious allele. The load also decreases slightly, although the effect is rather small for the parameter values used in Figure 1. This effect was first described by Kimura *et al*. (1963, p. 1306), who noted that “Here there is the paradox that a finite population has a smaller load than an infinite population, which would seem to imply that a random process produces a higher average fitness than a deterministic one.” As shown by Glémin (2003), this can be interpreted by considering the effect of averaging over a distribution of allele frequencies. When selection is weak, the change in frequency of the deleterious allele due to selection is given (to the first order in *s*) by(11)Assuming that *p* follows a frequency distribution, we have(12)where the overbar stands for the average of the distribution. Assuming that most of the distribution stands at low values of *p* (selection remains efficient relative to drift), we may neglect the third moment and obtain(13)where is the variance of the distribution of *p*. Equation 13 thus shows that when , the efficiency of selection (as measured by ) increases as the variance increases (as long as drift is not too strong).

Whereas this effect may lead to a reduced load at intermediate population sizes in sexuals, decreasing population size also increases the stochastic component of allele frequency change. Hence, below a certain population size, selection is overwhelmed by drift, causing the mean frequency of the deleterious allele to rise until close to its neutral expectation, that is, close to fixation for . This transition occurs over a narrow range of population sizes, leading also to a sharp increase in genetic load once population sizes decrease below a certain point. To quantify the population sizes at which this transition occurs, we arbitrarily define a limiting population size, *N*_{lim}, so that *L* = 10*u* at *N*_{lim} (**×**'s in Figure 1B).

In large asexual populations, mean allele frequency and load are very similar to values for large sexual populations (unless *h* is very small; Chasnov 2000). However, a decrease in population size does not initially lead to purging, which can be understood from a similar argument as above. As long as selection against *Aa* remains efficient, the population consists essentially of *Aa* and *AA* individuals. Calling now *p* the frequency of *Aa*, we have (to the first order in *s*) , and thus(14)showing that the variance in the distribution of *p* now decreases the efficiency of selection , for all (Equations 13 and 14 become equivalent for ). Again, decreasing population size also increases the stochastic component of allele frequency change, and below a certain size selection against heterozygotes becomes inefficient, causing their frequency to increase to almost 1 (Figure 2). As a result, the load increases and *N*_{lim} is higher than in sexuals (Figure 1B). At the same time, homozygous mutants appear more frequently (because mutations arise in heterozygotes), but selection against these homozygotes remains efficient and hence their frequency remains low (Figure 2). However, as the population size decreases again, drift eventually overwhelms selection against deleterious homozygotes, and hence the mean mutant allele frequency and the load increase sharply a second time (Figure 1B). For the range of intermediate population sizes, in which selection is efficient against homozygous mutants, but not heterozygotes, the mutation load is substantially higher in asexuals than in sexuals, and hence the fitness of sexuals relative to asexuals peaks at these population sizes (Figure 1C). This is a direct consequence of the absence of segregation, because, with segregation, matings between heterozygotes constantly produce new homozygotes. Hence, in sexuals, heterozygotes cannot reach high frequencies due to drift while mutant homozygotes are still efficiently selected against (see also Figure 2). Finally, Figure 1 also shows that our approximations, which are based on the assumption that only two genotypes segregate in asexual populations most of the time, work very well when compared with simulation results.

The stepwise increase in genetic load with decreasing population size in asexual populations is due to dominance. This can be seen in Figure 3, which shows the mean frequency of a deleterious mutation under multiplicative selection, that is, for . In this case, genotype frequencies in infinite populations at mutation–selection balance are in Hardy–Weinberg proportions in both sexuals and asexuals (Chasnov 2000; Otto 2003). Figure 3A shows that under multiplicative selection, selection is overwhelmed by drift at larger population sizes in asexuals than in sexuals. This may be explained by the fact that drift has stronger effects in asexuals; indeed, the variance effective population size of asexuals is only half the variance effective size of sexuals, because drift in asexuals occurs through random sampling of genotypes, whereas in sexuals it occurs through random sampling of alleles (Balloux *et al.* 2003). Indeed, Figure 3A shows that under multiplicative selection, the equilibrium frequency of the deleterious allele in an asexual population is the same as in a sexual population of half its size. As with the Hill–Robertson effect that occurs between selected loci (Hill and Robertson 1966), an alternative interpretation of this process is through its effect on genetic associations: in the same way as drift and selection combine to generate negative linkage disequilibria between selected loci, they also combine to generate a negative intralocus association (excess of heterozygotes), which may reach high values when segregation is reduced or absent. This is due to the fact that drift generates a variance in genotype frequencies, leading to situations where heterozygotes are in excess and situations where they are in deficit. Because the variance in fitness is lower in the first situation, negative associations tend to last longer and may accumulate over time if they are not broken down every generation by segregation (an extreme case being the situation where *Aa* is fixed in the population). This is illustrated in Figure 3B, which shows the mean intralocus association (the equivalent of the linkage disequilibrium between loci), defined as , for different values of *N* in asexuals. Note that an equivalent definition is , where *F* is the inbreeding coefficient (we use rather than *F* to emphasize the parallel to linkage disequilibrium between loci and because *F* is not defined for monomorphic populations). As shown by Figure 3B, is negative and peaks at intermediate values of *N* (where *Aa* often reaches high frequencies). However, it is important to note that this intralocus effect is not exactly equivalent to the Hill–Robertson effect, because asexual reproduction generates a negative , on average, even in the absence of selection (under mutation and drift alone), while the mean linkage disequilibrium between loci is zero under neutrality. In appendix b, we show that in an asexual population, and under neutrality, at equilibrium is given by(15)where (for small *u* and *v*). Thus, in the neutral case, is close to −0.005 for the parameter values used in Figure 3B (, ); this is confirmed by simulations (not shown). Finally, one can note that in the sexual case, genetic drift also generates a negative in the neutral case (*e.g.*, pp. 39–40 in Gale 1990) or under multiplicative selection, but this is much smaller in magnitude than in the asexual case. Under neutrality, it is given by(16)at equilibrium (appendix b), that is, for the parameter values in Figure 3B roughly four orders of magnitude lower than in the neutral case for asexuals.

Figure 4 shows the genetic load in sexual and asexual populations for deleterious alleles with a range of biologically realistic values of *h* and *s* (Simmons and Crow 1977; Lynch and Walsh 1998; Szafraniec *et al.* 2003). *N*_{lim}, the population sizes at which the load starts to increase above equilibrium values for infinite populations, is always higher for asexuals than for sexuals. There is thus always a range of intermediate population sizes where the load in asexual populations is considerably higher than in sexual ones. Figure 5 shows how *N*_{lim} depends on *s* for a range of different *h*. In the parameter range considered, *N*_{lim} in sexuals is almost independent of *h* and is ∼4/*s*. In contrast, in asexuals, *N*_{lim} depends strongly on both *h* and *s* and is ∼4/*hs* (*N*_{lim} = 4/*s* and *N*_{lim} = 4/*hs* are numerical fits used to illustrate that *N*_{lim} depends on *s* in sexuals and on *hs* in asexuals).

Figure 4 also shows that for large populations, the genetic load is very similar in sexual and asexual populations (∼2*u*), except when *h* and *s* are small (*e.g.*, *h* = 0.001, *s* = 0.01), when the load in sexuals is somewhat smaller (tending to *u* as *h* becomes 0). This is the effect studied by Chasnov (2000). Multiplied across loci, it can compensate for the twofold cost of sex, but only for very small values of *h*. In contrast, in small populations we find a somewhat larger load in sexuals than in asexuals (see also Figure 1C). This is in agreement with the *small population approximation* given above and is due to the fact that small sexual populations are fixed either for *AA* or for *aa* most of the time, while asexual populations can be fixed for the *Aa* genotype.

It is difficult to extrapolate single-locus equilibrium results to a multilocus setting, but we follow previous work (Chasnov 2000; Agrawal and Chasnov 2001) in assuming complete independence of different loci and multiplicative fitness. This focuses only on the effects of segregation, neglecting any potential effects of linkage among loci. Our results thus underestimate the load in asexuals, as interference among loci can greatly reduce the effective population size of asexuals (Hill and Robertson 1966; Felsenstein 1974; Charlesworth *et al.* 1993a; Keightley and Otto 2006; Paland and Lynch 2006). Figure 6 shows that the total fitness of sexuals relative to asexuals at equilibrium can greatly exceed 2, even for conservative estimates of the genomewide deleterious mutation rate *U* (Haag-Liautard *et al.* 2007). As already noted, the fitness of sexuals exceeds that of asexuals mainly for intermediate population sizes (and to a lower extent with large population sizes if *h* and *s* are small, see Chasnov 2000). Conversely, for low population sizes, the fitness of sexuals is lower than that of asexuals (Figure 6). Therefore, these results indicate that sexuals may have a stronger advantage over asexuals at intermediate (rather than small or high) values of . It would thus be interesting to study the effect of population size in a more dynamic model where an asexual mutant would spread within a sexual population, in which case the “population sizes” of sexuals and asexuals would vary over time.

#### Subdivided population:

One might expect that increasing the degree of spatial structure of a large metapopulation (for example, by decreasing the size of local demes) has the same qualitative effect as decreasing *N* has in a single finite population. Indeed, population structure generates a variance in allele frequency between demes, which may lead to a similar effect as purging by drift in sexuals, but may also lead to local fixation of deleterious alleles if local drift is too strong. However, population structure increases the effective size of the total population (in the absence of any local demographic effects, *i.e.*, assuming constant deme sizes) and thus decreases the effect of drift on the change in allele frequencies in the whole metapopulation (Wang and Caballero 1999; Rousset 2003); therefore, the expected effect of increasing spatial structure will not be strictly equivalent to the effect of decreasing in the single-population model. Figure 7 shows results for the genetic load, obtained using the island model of population structure without extinction. Here, we fixed the total population size at (where *n* is the number of demes and *N* is the number of adults per deme) and vary *n* and *N*. The *x*-axis in Figure 7 shows , so one moves from demes, each with a single individual, at the left, to a single population of individuals at the right (*i.e.*, decreasing population structure). In sexuals, an effect similar to the purging by drift described in the single-population case occurs: a moderate degree of population structure increases the efficiency of selection , either under soft selection or under the present life cycle (Whitlock 2002; Roze and Rousset 2004; Theodorou and Couvet 2006). However, spatial structure also increases the degree of competition among related individuals (which tend to carry the same alleles), thereby reducing the effect of selection. As a result, population structure has a nonmonotonic effect on the mutation load in sexuals: the load first decreases slightly (with increasing population structure) and then increases with stronger population structure. Previous results have shown that, in general, population structure involving local competition does not greatly reduce the mutation load; rather, its main effect is that strong structure increases the load (Glémin *et al.* 2003; Roze and Rousset 2004; Glémin 2005; Theodorou and Couvet 2006). In asexuals, population structure does not improve the efficiency of selection; its only effect is to increase local competition, which increases the load. Although population structure increases the effective size of the whole metapopulation, this has little effect in the absence of extinction, for the parameter values used in Figure 7: is always large, and the population is at mutation–selection balance over the whole range of deme sizes. Indeed, a deterministic solution (for an infinite population size ) gives a good approximation of our diffusion results. In the deterministic limit, and neglecting back mutation, the equilibrium frequency of allele *a* is ∼ in the sexual case, where is given by Equation 2, while the frequency of the genotype *Aa* is ∼ in the asexual case, where is given by Equation 5. From this, one can obtain simple approximations for the load, assuming small *m* and large *N*,(17)(18)and thus(19)Approximations (17) and (18) are shown in Figure 7 and exhibit a reasonable match with the diffusion and simulation results. Equation 19 indicates that, even when and when the effective size of the total population is infinite, sexuals may have a significantly lower load than asexuals. This contrasts with load in an infinite panmictic population, which, for both sexuals and asexuals, is very close to 2*u* when . When , simplifies to (while ), and simplifies to . The probabilities of identity by descent and can be shown to be equivalent to Wright's *F*-statistics and in a neutral infinite-island model (Hudson 1998; Rousset 2002). Although our model includes selection, it is sufficient to use expressions for these *F*-statistics in a neutral model; indeed, taking into account the effect of selection on *F*_{IT} and *F*_{ST} would generate terms of order in the expressions of and that would disappear in the diffusion limit. Using the relation , one arrives at the deterministic expressions (still for ):(20)The factor comes from the fact that population structure decreases the efficiency of selection by increasing kin competition, while the factor corresponds to the fact that, in sexuals, two deleterious alleles can be eliminated at the same time when they occur in the same individual. Note that although we assume random mating within demes, is positive because it is measured after migration, which occurs in the diploid stage (and thus the two homologous genes of an individual are more likely to share a common ancestor than two genes in two different individuals from the same deme). Here, simply equals : thus, this effect occurs mainly when deme size is small. More importantly, is not the same in the sexual and the asexual cases: in the asexual case, the that enters into Equation 20 corresponds to the of a *haploid* population (∼), which is higher than the of a diploid population with the same *N* and *m* (∼). For large *N*, and small *m*, we have:(21)This effect is equivalent to the result that, in a single population, asexuals have half the variance effective size of sexuals. Although here there is no drift at the whole-population level (as we are at the deterministic limit), local drift due to population structure (which reduces the efficiency of selection) is stronger in asexuals than in sexuals, generating a higher load in asexuals.

Importantly, including local competition leads to a different effect of population structure than using a model of global competition as in Agrawal and Chasnov (2001). The dashed-dotted curve in Figure 7 corresponds to Equation 5 in Agrawal and Chasnov (2001) (load in sexuals), replacing *f* by (note that this is not strictly correct, as it is difficult to imagine both global regulation and deme size staying constant over time, but it serves to illustrate their result). Here, the only effect of population structure is to increase homozygosity in sexuals (because individuals tend to mate with relatives), which leads to better purging and lowers the load. Conversely, population structure has no effect on the asexual load, which remains equal to 2*u* [that is, for the parameters in Figure 7]. With local competition, the overall effect of population structure is to increase the load of both sexuals and asexuals (although a moderate degree of structure can slightly reduce the sexual load, as shown in Figure 7), the asexual load increasing faster than the sexual load.

Although population structure increases the effective size of metapopulations under the hypothesis of a constant deme size over time, adding local demography may reverse this relation (*e.g.*, Whitlock and Barton 1997). In particular, local extinctions may greatly reduce the effective size of a metapopulation. We investigated the effect of extinctions on the mutation load in sexual and asexual metapopulations using Slatkin's (1977) extinction–recolonization model (described above). Increasing the rate of deme extinctions (*e*) has similar effects to decreasing in the case of a single population: as *e* increases, the importance of drift in the whole metapopulation increases, which may lead to the fixation (or quasi-fixation) of the deleterious allele. For intermediate values of *e*, selection against *Aa* individuals becomes ineffective in the case of asexuals, while selection remains efficient in the case of sexuals. For higher values of *e*, drift becomes stronger than selection in both sexuals and asexuals. This leads to stepwise increases of the genetic load, as shown in Figure 8, A (for the migrant pool model) and B (propagule pool model). In particular, Figure 8A shows that the asexual load may be far greater than the sexual load over a wide range of extinction rates, due to the lack of segregation (and a lower effective size) in asexuals.

## DISCUSSION

In this article, we investigated the effects of population size and spatial structure on the mutation load in sexuals and asexuals. It is important to note that, in finite or structured populations, one cannot simply predict the outcome of competition between sexuals and asexuals by comparing their mutation loads at equilibrium: in a finite population, a mutation causing a transition to asexuality may reach fixation (if it occurs in a good genetic background) before mutation–selection balance is reached. In addition, in subdivided populations, one would have to account for the fact that asexuals compete more often with other asexuals (due to limited dispersal and local competition) than if the population was well mixed. Bearing this in mind, our results show that drift and population structure have different effects on the equilibrium mean fitness of sexuals and asexuals.

#### Single population:

We found that, at intermediate population sizes, mutation load may be far greater in asexual populations than in sexual populations. Which population sizes are “intermediate” depends on the values of the parameters *h* and *s*, but, as a rule of thumb, we found that a 10-fold increase in mutation load compared to *u* was reached when *N* < 4/*s* in sexuals and when *N* < 4/*hs* in asexuals. In Saccharomyces and Drosophila, estimates indicate that *h* > 0.01 (Simmons and Crow 1977; Lynch and Walsh 1998; Szafraniec *et al.* 2003), so the parameter range at which drift overwhelms selection against heterozygotes in asexuals but is still effective in sexuals should span less than two orders of magnitude in *N*. However, as we discuss below, our estimate of mutation load in asexuals may be a gross underestimate when deleterious mutations occur at many loci, because it neglects the effects of clonal interference on the effective population size. Hence, it is possible that selection against heterozygotes in asexuals is ineffective even in populations ≫*N* = 4/*hs*.

Calculations of genetic loads are useful to assess the effects of segregation on the efficiency of selection in equilibrium situations. In contrast, to study the gradual evolution of sexual reproduction, the immediate costs (in terms of mean offspring fitness) of breaking genetic associations that have been generated by selection need to be taken into account (see review by Agrawal 2006). This can be done, for instance, by studying the evolution of a modifier locus with different alleles that increase or decrease the proportion of offspring produced sexually. Using such a modifier model, Otto (2003) found that, in infinite populations, conditions under which an advantage of segregation leads to an increase in frequency of a modifier promoting sexual reproduction are quite restricted, unless there is some inbreeding, which causes an excess of homozygotes. It may be argued that genetic drift has similar effects to inbreeding because both processes may lead to purging in sexuals. However, drift also leads to random changes in allele frequencies, which counteract the effectiveness of selection and may therefore generate a greater advantage for sex at intermediate population sizes, when deleterious alleles are fixed in the heterozygous state in asexuals, but not in sexuals. It would be interesting to see how this effect translates into an advantage for sex in a modifier model representing a finite diploid population.

A potential shortcoming of our model is that it neglects mitotic recombination in asexuals (Barbera and Petes 2006; Omilian *et al.* 2006). Mitotic recombination leads to formation of homozygotes from heterozygous asexuals and is equivalent to increasing mutation rates from heterozygotes to homozygotes. High rates of mitotic recombination (of the order of magnitude of mutation rates or higher) may decrease the effects observed here because fixation in the heterozygous state becomes less common, if heterozygotes often “back mutate” to homozygotes. However, the significance of mitotic recombination in nature is unknown. For instance, if it involves long stretches of DNA containing many loci (Omilian *et al.* 2006), most cases of mitotic recombination may be deleterious because mutant homozygotes will be created at some loci.

#### Subdivided population:

In large metapopulations without extinction and recolonization, drift at the global level can be neglected. In such metapopulations, the effect of increasing population structure (by decreasing deme size or migration rate) on the mutation load depends critically on the assumptions concerning population regulation. Here we found that the realistic assumption of local population regulation (*i.e.*, competition within local populations) has a strong effect on mutation load in both sexuals and asexuals. Local competition reduces the efficiency of selection, as individuals who tend to have similar genotypes compete with each other. Hence, in our model, the overall effect of population structure is to increase the load in both sexuals and asexuals (although a moderate degree of structure can lead to a small reduction of the load in sexuals). This effect is stronger in asexuals than in sexuals, because local drift has more effect in asexuals. Indeed, within a deme, the variance effective size of asexuals is half the effective size of sexuals. In the deterministic limit (very large total population size), we found that . As for the case of a single population, it would be interesting to model the evolution of a modifier locus affecting the proportion of sexually produced offspring in a metapopulation. This could be done using the method developed in Roze and Rousset (2005).

Local demography can have important effects on the effective size of metapopulations (Whitlock and Barton 1997). Here we used a very simple demographic model, where local extinctions occur at a rate *e* and where deme size immediately goes back to *N* after recolonization (Slatkin 1977). Increasing the extinction rate reduces the effective size of the total population. Hence we found that, for a wide range of extinction rates, deleterious mutations may be fixed in the heterozygous state in asexuals, while still being efficiently eliminated in sexuals. More realistic demographic models of subdivided populations could be explored, for example, by using the method of Rousset and Ronce (2004).

#### Mutation load in small populations:

In contrast to populations of intermediate size, sexuals have a similar or even a higher genetic load than asexuals in small populations (single-population model) and also in metapopulations with very high turnover rates (metapopulation model, large *e*). This occurs for parameter ranges at which selection is ineffective against all genotypes in both sexuals and asexuals and so that populations are fixed for the mutant genotype *aa* most of the time (when ).

The result that the improved effectiveness of selection in sexuals disappears, or is even reversed compared to that in asexuals at low effective population size, may be consistent with the well-documented pattern of geographic parthenogenesis (Vandel 1928): asexuals often occur in more marginal (as opposed to core) habitats, such as at high latitudes or altitudes, than their closely related sexual counterparts (*e.g.*, Bell 1982; Bierzychudek 1985). Because marginal habitats may be environmentally less predictable and more patchily distributed than core habitats, populations in these habitats may tend to be smaller, subject to wider stochastic fluctuations in density, and hence subject to stronger drift (Haag and Ebert 2004). If drift in marginal populations is indeed so strong that it overwhelms selection in asexuals and in sexuals, our model predicts that, in marginal habitats, asexuals have a slightly lower load than sexuals.

It is, however, unclear whether equilibrium results (such as those described here) apply to situations in which genetic drift is so strong that a majority of deleterious mutations are effectively neutral. If this were the case at many loci, deleterious mutations may accumulate through fixation and Muller's ratchet in both sexual and asexual populations (although the effects differ between sexuals and asexuals, *e.g.*, Pamilo *et al.* 1987; Charlesworth *et al.* 1993b; Charlesworth and Charlesworth 1997). This may eventually drive the populations to extinction (Lynch *et al.* 1995), and, thus, equilibrium conditions would not be met. Nonetheless, if sexual populations fix mutations in a homozygous state, whereas asexual ones fix them first as heterozygotes, the fitness decline may be faster in sexuals than in asexuals.

Multilocus simulation studies suggest that the advantage of recombination may also increase with population size (even without leveling off, at least for the studied parameter range; Iles *et al.* 2003; Keightley and Otto 2006; Salathé *et al.* 2006), due to the fact that a greater number of selected mutations segregate in larger populations. Hence these studies, as well as ours, are consistent with the idea that the fitness of sexuals relative to asexuals increases with population size (in our study only up to a certain point). Again, this may be a possible explanation for geographical parthenogenesis, if marginal populations have a lower effective size than core populations.

#### Extrapolation to multilocus situations:

The models presented here are single-locus models with recurrent deleterious mutations, but in real organisms mutations occur at many loci throughout the genome. It is difficult to extrapolate from single-locus models to multilocus situations. Nonetheless, our model suggests that, if deleterious alleles are at least partly recessive, asexuals suffer an increased load compared to sexuals at intermediate population sizes. Multiplied across loci, this could substantially reduce genetic load in sexual populations even for conservative estimates of *U* and in the absence of any effect of recombination. Neglecting potential effects of recombination (or its absence in asexuals) in a multilocus setting is, of course, unrealistic, but was done here to isolate the effects of segregation from those of recombination. In asexuals, the effectiveness of selection can be greatly reduced below levels predicted by single-locus models because of strong selective interference among nonrecombining loci (the “Hill–Robertson effect”; Hill and Robertson 1966). Hence single-locus models underestimate the force of drift, and hence load, in asexuals. Selective interference also occurs in sexuals, but is much weaker, due to recombination (Hill and Robertson 1966; Felsenstein 1974; Charlesworth *et al.* 1993a; Keightley and Otto 2006). Thus, it seems likely that combining the effects of segregation and recombination would lead to an increased parameter range in which asexuals have an increased load compared to sexuals.

## APPENDIX A

We derive here expressions for the expected change in frequency of the deleterious allele (*a*) over one generation, in the island model of population structure, with extinctions and recolonizations. Note that to use the diffusion method, it is sufficient to express this expected change in the limit as the number of demes *n* tends to infinity (*e.g*., Roze and Rousset 2003). Our life cycle assumes that at each generation, each deme may go extinct with probability *e*. In nonextinct demes, individuals produce a large but, depending on their fitness, variable number of gametes, which fuse immediately and at random within the deme (in the sexual case), or juveniles (in the asexual case). Each juvenile then has a probability *m* of entering the migrant pool. Therefore, demes contribute to the migrant pool on proportion of their mean fecundity. Migrants reach any other deme with the same probability, but following Slatkin's (1977) extinction–recolonization model we assume that migrants arriving in an extinct deme do not survive, each extinct deme being recolonized later by *k* juveniles. In the absence of extinction, the expected contribution of adult *j* in deme *i* to the next adult generation is given by(A1)where is the fecundity of individual *ij*, is the average fecundity in deme *i*, and *w* is the average fecundity in the whole metapopulation; is given by(A2)where , , and are the frequencies of *Aa* individuals, of *aa* individuals, and of the *a* allele in deme *i*, before selection. When extinctions occur, the backward migration rate (probability that, after dispersal, a juvenile comes from another deme) is different from *m* and is given in the neutral case by . Although selection affects , this effect generates a term of order in the expression of the change in frequency of the deleterious allele, which disappears in the diffusion limit. In the sexual case, the frequency of *a* in deme *i* (given that deme *i* does not go extinct) after selection (and before dispersal) is given by(A3)where is given by equation (A2) above. The expected frequency of *a* in deme *i*, after dispersal and recolonization of extinct demes is then given by(A4)where *w* is again the average fecundity in the whole metapopulation (the average over *i* of ), and where is the frequency of *a* in the whole population, after selection, given by(A5)where the overbar means the average over all demes *i*. The first and second terms of Equation A4 represent the case where deme *i* went extinct and the case where it did not go extinct (respectively). In this second case, juveniles come from deme *i*, while juveniles are migrants coming from other nonextinct demes. After expressing Equation A4 to the first order in *s* and averaging over all *i*, one obtains an expression for (the change in frequency of *a* over one generation) as a function of the moments and (where again the overbar means the average over all *i*). When selection is weaker than migration, we can use a separation-of-timescales argument and replace these moments by their equilibrium expressions under neutrality, for the current allele frequencies (*e.g.*, Whitlock 2002; Cherry and Wakeley 2003; Roze and Rousset 2003). This gives(A6)(A7)where is the probability that the ancestral lineages of two genes sampled with replacement from the same deme stay in the same deme and coalesce (in the neutral case and for an infinite number of demes), while , , and are the probabilities that the ancestral lineages of the two homologous genes of an individual, plus a third gene sampled with replacement from the same deme, all stay in the same deme and coalesce (), that only two of them coalesce (), or that no coalescence occurs (). These can be expressed in terms of probabilities of coalescence of genes sampled *without* replacement,(A8)where and are the probabilities that the two homologous genes of an individual, and two genes sampled from two different individuals from the same deme (respectively), coalesce within the deme. Similarly, one has(A9)(A10)where α and γ are the probabilities that the two homologous genes of an individual, plus a gene from a different individual from the same deme, all coalesce (α) or that no coalescence occurs (γ). After some simplifications, and using the fact that , one arrives at Equation 1 in the text.

The coalescence probabilities , , and α are obtained by solving recurrence equations. For this, one defines (and ) as the probability that two (three) individuals colonizing the same extinct deme come from the same deme (before dispersal). In the migrant pool model , while in the propagule pool model and . Recursions for and are given by(A11)(A12)Defining δ as the probability that the ancestral lineages of three genes sampled from three different individuals from the same deme stay in the same deme and coalesce, one then has(A13)Finally, is given by the same equation, replacing by . Solving these equations gives expressions for , , and α in terms of *N*, *m*, *e*, and *k*.

Using the same method for the asexual case yields an expression that depends on , the probability that the ancestral lineages of two genes sampled with replacement from the same deme stay in the same deme and coalesce, this time in a *haploid* model. This is given by(A14)where is obtained by solving the recurrence equation(A15)

This method leads to Equations 1–8 in the text, describing the expectation and variance of the change in allele frequency in sexual and asexual metapopulations. These expressions take the same form as in a single, nonsubdivided population, and standard diffusion methods (*e.g*., Crow and Kimura 1970) yield(A16)where is the probability that allele *a* is present in frequency *p* in the metapopulation at equilibrium, in the sexual case, is a constant, and are given by Equations 2 and 3, and , where is given by Equation 8 (see also Roze and Rousset 2004). In the first asexual model (describing a population consisting of *Aa* and *AA* individuals), the equilibrium probability distribution of the frequency of *Aa* individuals is given by(A17)where is given by Equation 5, and . In the second asexual model (describing a population consisting of *aa* and *Aa* individuals), the equilibrium probability distribution of the frequency of *aa* individuals is given by(A18)where is given by Equation 7, and again . We then used Mathematica to integrate numerically over these distributions, to obtain and in the sexual case and , in the asexual case.

## APPENDIX B

We calculate here the equilibrium value of the intralocus association at the mutation–drift equilibrium (no selection), in the sexual and asexual cases. We have(B1) can thus be expressed by calculating , , , and at equilibrium. In the asexual case, recursions for and are given by (to the first order in *u* and *v*)(B2)(B3)giving at equilibrium , . A recursion for is obtained as follows: is the probability that, at the next generation, two individuals sampled with replacement from the whole population are both *aa*. With probability 1/*N*, the same individual is sampled twice, and this individual is *aa* with probability . With probability , two different individuals are sampled, in which case they are both *aa* with probability , where is the frequency of *aa* after mutation, and before random sampling (). This gives(B4)Similarly, one obtains the following recursions for and :(B5)(B6)Solving these recursions gives, at equilibrium,(B7)where . In the sexual case, one can write recursions for and ,(B8)where is the frequency of *a* after mutation, given by . Therefore,(B9)and(B10)Given that , one obtains at equilibrium(B11)

## Acknowledgments

We thank S. Glémin for discussion and for sharing his algorithm and B. Charlesworth, D. Charlesworth, P. Keightley, S. Otto, and an anonymous reviewer for discussion and comments on earlier versions of the manuscript. We are also grateful to S. Otto for pointing out the full extent of the analogy to the Hill–Robertson effect. This work was supported by the Swiss National Science Foundation, a Marie Curie Fellowship (C.R.H.), and a European Molecular Biology Organization Fellowship (D.R.).

## Footnotes

2

*Present address:*Station Biologique de Roscoff, CNRS UMR 7144, 29682 Roscoff Cedex, France.Communicating editor: H. G. Spencer

- Received March 8, 2007.
- Accepted April 19, 2007.

- Copyright © 2007 by the Genetics Society of America