Fine-Scale Spatial Genetic Structure with Nonuniform Distribution of Individuals
Agnès Doligez<sup>a</sup>, Claire Baril<sup>1,a</sup>, and Hélène I. Joly<sup>a</sup>
<sup>a</sup> CIRAD-Forêt, B. P. 5035, 34032 Montpellier Cedex 01, France
Corresponding author: Hélène I. Joly, CIRAD-Forêt, U. R. Diversité et Amélioration Génétique, Baillarguet, B. P. 5035, 34032 Montpellier Cedex 01, France, joly.h{at}cirad.fr (E-mail).
Communicating editor: B. S. WEIR
| ABSTRACT |
|---|
This paper presents the first theoretical study of spatial genetic structure within nonuniformly distributed continuous plant populations. A novel individual-based model of isolation by distance was constructed to simulate genetic evolution within such populations. We found larger values of spatial genetic autocorrelations in highly clumped populations than in uniformly distributed populations. Most of this difference was caused by differences in mean dispersal distances, but aggregation probably also produced a slight increase in spatial genetic structure. Using an appropriate level of approximation of the continuous distribution of individuals in space, we assessed the potential effects of density, seed and pollen dispersal, generation overlapping, and overdominance selection at an independent locus, on fine-scale genetic structure, by varying them separately in a few particular cases with extreme clumping. When selfing was allowed, all these input variables influenced both aggregation and spatial genetic structure. Most variations in spatial genetic structure were closely linked to variations in clumping and/or local density. When selfing was not allowed, spatial genetic structure was lower in most cases.
More recently, a few authors have used individual-based stochastic simulation models to describe spatial patterns of genotypic variation within continuous uniformly distributed populations, under various conditions of gene dispersal, mutation, migration, allele frequency, turnover of individuals, and selection.
There is a complete lack of theoretical studies describing spatial genetic structure within plant populations with nonuniform and thus more realistic spatial distributions of individuals. Most distributions are either random or clumped in nature.
The study of these relationships presents an opportunity to understand better the determinants of spatial genetic structure in real populations. In addition, it may have several fields of application, for example, inference of the factors responsible for spatial patterns observed in natural populations, or prediction of the effects of various management practices on the evolution of diversity level and structure to provide guidelines for in situ conservation programs.
Therefore we propose a novel individual-based stochastic model of isolation by distance allowing for different kinds of spatial distributions, from uniform to highly clumped. Our objective in this paper is to determine which of several input variables (density, dispersal, generation overlapping, and selection) may affect spatial genetic differentiation within the population under extreme clumping conditions. We have used Monte Carlo simulations to examine the consequences of separate variations of these variables in a few particular cases. Spatial distribution of individuals was characterized by means of a statistic based on nearest neighbor distances, and spatial genetic structure by means of autocorrelation statistics. Whenever some effect on spatial genetic structure was found, we have proposed hypotheses to explain the mechanisms involved and have described relationships with the spatial distribution of individuals.
| METHODS |
|---|
The model:
Each simulated population consisted of 10,000 mature, hermaphroditic, diploid individuals. Individuals were located at the intersection points of a square grid of size L x L, with L measured in grid units. There was at most one individual at each intersection point. Thus the grid unit was the minimum distance allowed between two individuals. The grid size could be larger than the population size. There was no density-dependent regulation, apart from the minimum spacing of individuals due to their location on a grid.
Each individual of the population was characterized by its location on the grid (x,y coordinates), its genotype at a neutral marker locus, and its genotype at each of a few loci undergoing symmetrical overdominance selection with multiplicative effects between selected loci. Individual fitness was defined as $(1 - S)^k$, where S was the selection coefficient (identical for all selected loci) and k was the number of homozygous selected loci. All loci were bi-allelic and segregated independently from each other. There was no mutation and no migration.
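The fitness rule can be illustrated with a minimal Python sketch. The function name and the representation of a genotype as a list of allele pairs are assumptions made for illustration; they are not taken from the original C program.

```python
def fitness(selected_loci, s):
    """Multiplicative overdominance fitness (1 - s)**k.

    selected_loci: list of (allele, allele) pairs, one per selected locus
                   (hypothetical representation).
    s: selection coefficient against homozygotes, identical for all loci.
    k is the number of homozygous selected loci.
    """
    k = sum(1 for a, b in selected_loci if a == b)
    return (1.0 - s) ** k


# One homozygous locus out of two, with S = 0.5
print(fitness([(0, 0), (0, 1)], 0.5))
```

With this rule, an individual heterozygous at every selected locus always has fitness 1, whatever the value of S.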
Time was discrete and reproduction occurred synchronously throughout the population. In each time cycle, a number of new individuals was created. In case of nonoverlapping generations, 10,000 new individuals were created in each cycle, and all individuals alive in the preceding cycle died. In case of overlapping generations, the population was divided into C age classes of identical size, and 10,000/C new individuals were created in each cycle. Only those 10,000/C individuals alive in the preceding cycle that belonged to the oldest age class died, and all others stayed alive and moved up one age class. Thus in all cases, the population size was kept constant through cycles, it took C cycles for a complete turnover of individuals, and the lifespan of each individual was C cycles. In what follows, we will refer to generation overlapping as C, that is, the maximum number of successive reproduction cycles that might occur between the birth of two mates, including the cycles during which these mates were created.
To create each new individual in a given cycle, the following procedure was used (all distances in grid units): (1) One mother individual was drawn at random among all individuals alive in the preceding cycle. (2) One point of the grid was then drawn at random among all grid points located within a circle of radius D centered on the mother. (3) If no individual (be it a new individual or an individual of the preceding cycle in case of generation overlapping) was already present at that point, a father individual was then drawn at random among all individuals of the preceding cycle located within a circle of radius P centered on the mother. Otherwise, the procedure was started again from step (1). (4) At each selected locus, Mendelian segregation of genes was assumed to obtain the genotype of the new individual. Lastly, a number was drawn at random from a uniform [0; 1] distribution and the new individual was actually created if this number was smaller than its fitness value (see definition above). Otherwise no new individual was created. The whole procedure [steps (1) to (4)] was carried out as many times as necessary to create the appropriate number of new individuals in each cycle.
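The procedure above can be sketched in Python for the case of nonoverlapping generations. This is an illustrative reading of the text, not the authors' C code. All names are hypothetical, locus 0 is assumed to be the neutral locus, the mother's own grid point is assumed to be an eligible seed site, and D and P are assumed to be smaller than L/2 so that wrapped neighbourhoods contain no duplicate points.

```python
import random


def disc_offsets(radius):
    """Grid offsets (dx, dy) lying within a circle of the given radius."""
    r = int(radius)
    return [(dx, dy)
            for dx in range(-r, r + 1)
            for dy in range(-r, r + 1)
            if dx * dx + dy * dy <= radius * radius]


def try_create(parents, parent_sites, occupied, L, D, P, S, self_compatible, rng):
    """One attempt at steps (1) to (4); returns (site, genotype) or None."""
    mother_site = rng.choice(parent_sites)                  # step (1)
    mx, my = mother_site
    dx, dy = rng.choice(disc_offsets(D))                    # step (2)
    site = ((mx + dx) % L, (my + dy) % L)                   # toroidal wrap
    if site in occupied:                                    # step (3): restart
        return None
    fathers = [((mx + ex) % L, (my + ey) % L) for ex, ey in disc_offsets(P)]
    fathers = [f for f in fathers
               if f in parents and (self_compatible or f != mother_site)]
    if not fathers:                                         # isolated SI mother
        return None
    mother, father = parents[mother_site], parents[rng.choice(fathers)]
    # step (4): one allele from each parent, independently at each locus
    child = [(rng.choice(m), rng.choice(f)) for m, f in zip(mother, father)]
    k = sum(1 for a, b in child[1:] if a == b)              # homozygous selected loci
    if rng.random() < (1.0 - S) ** k:                       # viability selection
        return site, child
    return None


def next_generation(parents, L, D, P, S, self_compatible, rng):
    """Replace all parents by the same number of offspring."""
    parent_sites = list(parents)
    new = {}
    while len(new) < len(parents):
        result = try_create(parents, parent_sites, new, L, D, P, S,
                            self_compatible, rng)
        if result is not None:
            new[result[0]] = result[1]
    return new
```

With overlapping generations, `occupied` would also have to contain the sites of surviving individuals. With self-incompatibility, the loop does not terminate if no individual has a potential father within radius P, so a real implementation would need a guard for that case.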
For a given simulation, all individuals were either self-compatible (could self) or self-incompatible (could not self). When all individuals were self-compatible, mothers were one of the potential fathers of their own progeny, so that outcrossing rate was determined by local density. When all individuals were self-incompatible, isolated individuals could not reproduce and outcrossing rate was 1. It should be noted that no gametophytic or sporophytic self-incompatibility genetic system was used here, but only the possibility or impossibility for individuals to self.
The grid was assumed toroidal (i.e., the top and bottom of the grid were joined together, and so were the two sides), to avoid edge effects. Such connections inevitably lead to some spatial overlapping, which is expected to prevent the development of spatial genetic structure for small grid sizes. We checked that a grid size of 100 x 100 was sufficient for this overlapping effect to become negligible, by comparing spatial genetic autocorrelations obtained for values of L increasing from 25 to 125 by 25, for D = P = C = 1 and S = 0 (data not shown). Moreover, as grid sizes were large, this assumption was not expected to greatly affect comparisons with previous simulation studies such as EPPERSON's (1990, 1995a,b), which rely on different assumptions concerning edge effects.
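On a toroidal grid, the distance between two points is measured along the shorter way around each axis. A minimal Python sketch follows; the function name is hypothetical and coordinates are assumed to lie in 0..L-1.

```python
import math


def torus_distance(p, q, L):
    """Euclidean distance between grid points p and q on an L x L torus."""
    dx = abs(p[0] - q[0])
    dy = abs(p[1] - q[1])
    dx = min(dx, L - dx)  # wrap around the sides
    dy = min(dy, L - dy)  # wrap around top and bottom
    return math.hypot(dx, dy)


# Points on opposite edges of a 100 x 100 grid are direct neighbours
print(torus_distance((0, 0), (99, 0), 100))
```

The largest possible separation on such a grid is between points half a grid apart on both axes, which is why small grids leave little room for spatial structure to develop.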
Simulation program:
The simulation program was written in C, and run on Hewlett-Packard work stations under a Unix operating system. The random number generator was chosen following a published recommendation.
Simulation runs:
To create initial populations for each simulation, the coordinates and genes of individuals were drawn at random, with initial allele frequency 0.5 at each locus. All simulations were run over 200 x C cycles. Previous simulation studies of the isolation by distance model with nonoverlapping generations have shown that 200 generations were enough for spatial genetic structure to stabilize in most cases.
The objective of simulations in set 1 was to determine the effects, on spatial genetic structure, of approximating a continuous distribution of individuals more and more closely by decreasing grid spacing at a fixed global density. Note that subset 1a (uniform distribution and minimal dispersal) is very close to the models used in previous simulation studies of uniformly distributed populations.
In sets 1 and 2, the total number of grid points (L x L) varied. But in set 1 grid spacing was varied in order to keep the total grid area constant, whereas in set 2 the total grid area was varied in order to keep grid spacing constant. In all other sets, both grid spacing and total grid area were kept constant, and one other input variable varied (set 3: D ; set 4: P ; set 5: C ; set 6: S). In set 6, there was only one selected locus. All simulations were carried out once assuming self-compatibility (SC) and once assuming self-incompatibility (SI). As a complement to set 6, a few additional simulations with SC were run, to assess the effect of 10 selected loci with much lower selection coefficients (0.01, 0.02, 0.05, 0.10, 0.15, and 0.20).
Statistics measured:
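The following variables were recorded every 5 x C cycles: outcrossing rate after selection (effective outcrossing rate, Te); allele frequency (PA); genotypic frequencies (PAA, Paa, and PAa). To characterize spatial genetic structure, we used two complementary spatial autocorrelation statistics: Moran's I-statistic for individual allele frequencies and join-count statistics. Spatial distribution of individuals was characterized by Diggle's d w statistic, which is based on nearest neighbor distances.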
These spatial statistics were computed after the last cycle for all simulations. To get an idea of their patterns of evolution in time, we also computed these statistics every 10 x C cycles for only a few simulation subsets with contrasted values of input variables, because of the very large amount of computation time required (Table 1). Subsets with the highest dispersal distances were not included because spatial genetic structure was very weak after the last cycle. All genetic statistics (fixation indices, I- and join-count statistics) were computed both at the neutral locus and at the selected loci.
Moran's I-statistic for individual allele frequencies is defined as:

$$I = \frac{n \sum_{i} \sum_{j \neq i} w_{ij} (p_i - \bar{p})(p_j - \bar{p})}{W \sum_{i} (p_i - \bar{p})^2}$$

where n is the number of individuals, $p_i$ is the frequency of the allele in individual i (0, 0.5, or 1), $w_{ij}$ is the weight given to the pair of individuals i and j, W is the sum of all weights $w_{ij}$, and $\bar{p}$ is the mean allele frequency in the population. This statistic was computed for different distance classes by setting wij to one when the distance between individuals i and j belonged to the distance class under consideration, and to zero otherwise. For simulation set 1 (Table 1), 30 distance classes were used. The first class was ]0; d + 0.5], and subsequent classes were ]k; k + d], where k varied from d + 0.5 to 29d + 0.5 by d, and d was the distance class width (see Table 1). For all other simulation sets, wider distance classes were used for large distances in order to save computing time, with a total of 37 distance classes. The first class was ]0; 1.5], the 29 following classes were ]k; k + 1], where k varied from 1.5 to 29.5 by 1, and the last seven classes were ]k, k + 10], where k varied from 29.5 to 99.5 by 10. We performed no test of the null hypothesis of random distribution of genotypes with respect to individual locations, because there is no evidence that a normal approximation can be used for the sampling distribution of I-statistics for individual allele frequencies, and a Monte Carlo approach would have been far too time-consuming.
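As an illustration of the binary weighting by distance class, the following Python sketch computes Moran's I for one class ]lower; upper]. It assumes the usual form of Moran's I with weights of one or zero, uses plain Euclidean distances (whether the original program wrapped distances around the torus is not stated), and all names are hypothetical.

```python
import math


def morans_i(points, freqs, lower, upper):
    """Moran's I for pairs whose distance d satisfies lower < d <= upper.

    points: list of (x, y) coordinates.
    freqs:  individual allele frequencies (0, 0.5, or 1), same order as points.
    Returns None if the class holds no pair or if there is no variation.
    """
    n = len(points)
    mean = sum(freqs) / n
    dev = [f - mean for f in freqs]
    denom = sum(d * d for d in dev)
    num = 0.0
    total_weight = 0
    for i in range(n):
        for j in range(i + 1, n):
            if lower < math.dist(points[i], points[j]) <= upper:
                num += 2 * dev[i] * dev[j]  # counts both (i, j) and (j, i)
                total_weight += 2
    if total_weight == 0 or denom == 0:
        return None
    return n * num / (total_weight * denom)


# Four individuals on a line, two AA followed by two aa, first class ]0; 1.5]
line = [(0, 0), (1, 0), (2, 0), (3, 0)]
print(morans_i(line, [1, 1, 0, 0], 0, 1.5))
```

The double loop visits every pair once, so the cost grows with the square of the population size, which is consistent with the heavy computation times mentioned above for 10,000 individuals.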
Join-count statistics were computed as follows. Using the same distance classes as for I-correlograms, all (10,000 x 9,999)/2 pairs of individual genotypes were classified according to both distance and type of join (AA-AA, aa-aa, Aa-Aa, AA-Aa, Aa-aa, or AA-aa), which allowed computation of all $n_{ij}(k)$ [where $n_{ij}(k)$ is the number of joins between genotypes i and j in distance class k]. The expected values $\mu_{ij}(k)$ and standard deviations $SD_{ij}(k)$ of each $n_{ij}(k)$ under the null hypothesis, $H_0$, that genotypes are distributed at random with respect to individual location, are given in the literature. Under $H_0$, the statistics $SND_{ij}(k) = [n_{ij}(k) - \mu_{ij}(k)]/SD_{ij}(k)$ are distributed asymptotically as standard normal deviates.
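The counting step can be sketched in Python as follows. The sketch only classifies joins by genotype pair within one distance class and standardizes a count once its expectation and standard deviation are known; the analytical expressions for those two quantities are not reproduced here. Function names and the string coding of genotypes are assumptions.

```python
import math
from collections import Counter


def join_counts(points, genotypes, lower, upper):
    """Number of joins of each type for pairs with lower < distance <= upper.

    genotypes: list of "AA", "Aa", or "aa" (hypothetical coding).
    Keys of the result are sorted genotype pairs, e.g. ("AA", "aa").
    """
    counts = Counter()
    n = len(points)
    for i in range(n):
        for j in range(i + 1, n):
            if lower < math.dist(points[i], points[j]) <= upper:
                counts[tuple(sorted((genotypes[i], genotypes[j])))] += 1
    return counts


def snd(observed, expected, sd):
    """Standard normal deviate of an observed join count."""
    return (observed - expected) / sd


line = [(0, 0), (1, 0), (2, 0), (3, 0)]
print(join_counts(line, ["AA", "AA", "Aa", "aa"], 0, 1.5))
```

A positive SND for like joins (for example AA-AA) indicates an excess of such pairs in the distance class, and a negative SND a deficit.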
Diggle's $d_w$ statistic is defined as follows. Let the random variable W be the nearest neighbor distance. Under the assumption of random spatial distribution of individuals (not only on grid intersections, but continuously over the whole area), the expected cumulative distribution function of W is $G(w) = 1 - \exp(-\lambda \pi w^2)$, where $\lambda$ is the mean intensity of individuals per unit area. Then $d_w = \sup_w |G(w) - \hat{G}(w)|$, where $\hat{G}(w)$ is the observed distribution function for the sample individuals on grid intersections. $d_w$ takes values between 0 and 1. If $G(w) > \hat{G}(w)$, there is excess uniformity; if $\hat{G}(w) > G(w)$, there is excess clumping. For each simulation, we tested the null hypothesis of random distribution of individuals on the grid using a Monte Carlo approach since no approximation of the sampling distribution of $d_w$ is known. For each combination of grid spacing and total grid area, we generated 1000 separate random configurations of 10,000 points on the intersections of the grid, and computed $d_w$ for each configuration. These 1000 simulated values were used as an estimate of the sampling distribution of $d_w$ under the assumption of random distribution of individuals.
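A small Python sketch of the statistic and of the Monte Carlo null distribution is given below. It assumes that the expected distribution is the standard one for a random (Poisson) pattern, G(w) = 1 - exp(-λπw²), that nearest neighbor distances are wrapped around the torus, and that the area is L x L in squared grid units. These choices and all names are assumptions, and the brute-force neighbor search is only practical for small examples.

```python
import math
import random


def nn_distances(points, L):
    """Nearest neighbor distance of each point on an L x L torus (brute force)."""
    out = []
    for i, (x1, y1) in enumerate(points):
        best = math.inf
        for j, (x2, y2) in enumerate(points):
            if i != j:
                dx = min(abs(x1 - x2), L - abs(x1 - x2))
                dy = min(abs(y1 - y2), L - abs(y1 - y2))
                best = min(best, math.hypot(dx, dy))
        out.append(best)
    return out


def diggle_dw(points, L):
    """Largest gap between expected and observed nearest neighbor distributions."""
    n = len(points)
    lam = n / (L * L)                       # mean intensity per unit area
    d = 0.0
    for rank, w in enumerate(sorted(nn_distances(points, L))):
        g = 1.0 - math.exp(-lam * math.pi * w * w)
        # the observed step function jumps from rank/n to (rank + 1)/n at w
        d = max(d, abs(g - rank / n), abs(g - (rank + 1) / n))
    return d


def null_dw_values(n, L, n_sim, rng):
    """d_w for n_sim random placements of n points on distinct grid points."""
    values = []
    for _ in range(n_sim):
        cells = rng.sample(range(L * L), n)
        values.append(diggle_dw([(c % L, c // L) for c in cells], L))
    return values


# Observed value compared with a small simulated null distribution
rng = random.Random(0)
observed = diggle_dw([(x, y) for x in range(5) for y in range(5)], 20)
null = null_dw_values(25, 20, 20, rng)
print(observed, sum(v >= observed for v in null) / len(null))
```

The printed fraction is the share of random configurations with a d_w at least as large as the observed one, which plays the role of a Monte Carlo significance level. The paper used 1000 configurations of 10,000 points rather than the toy sizes shown here.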
| RESULTS |
|---|
Effect of grid spacing under constant global density and dispersal distances:
For all simulations of set 1, patterns of mean I-correlograms (Figure 1) and SND-correlograms were typical of a patchy distribution of genotypes, with large homozygote patches separated by smaller and less homogenous heterozygote patches, as already described in previous simulation studies. Moran's I-values were highly positive in short and intermediate distance classes, which means that two individuals separated by such a given distance had more similar genotypes than expected under the assumption of random distribution of genotypes in space.
With self-compatibility (SC), Moran's I-values in small and intermediate distance classes increased when L was increased. They stabilized for L ≥ 1000 (Figure 2). Values of X-intercepts varied little with L. This means that two individuals separated by a given small or intermediate distance had more similar genotypes when the minimum distance allowed between individuals (grid spacing) was smaller.
At short to intermediate distances, when L was increased, SNDs for joins between like homozygotes decreased, SNDs for Aa-Aa joins increased, there was no important change in SNDs for joins between one heterozygote and one homozygote, and SNDs for AA-aa joins decreased. Such variations reflected larger distances between unlike homozygote patches, higher concentration of heterozygotes, and reduced homogeneity of homozygote patches. These conflicting effects resulted in increased autocorrelations for individual allele frequencies.
The aforementioned changes were closely related to changes in the amount of aggregation, as shown by both maps of individuals (Figure 3) and values of Diggle's d w statistic (Figure 4), but also to changes in effective dispersal distances (Table 2). The higher values of Moran's I-statistics for cases with nonuniform distribution (subsets 1b to 1h), compared to the case with uniform distribution (subset 1a), are consistent with the lower dispersal distances and variances in dispersal distances. However, it is not excluded that the higher aggregation also had some effect on its own, in the following way: unlike homozygote patches became more and more distant, which could partly explain both the decrease in SNDs for like homozygote joins and the decrease in SNDs for AA-aa joins. Heterozygotes grouped at the fewer areas of contact between homozygote patches, instead of almost continuously bordering homozygote patches as was the case when individuals were uniformly distributed (i.e., for L = 100). This led to a reduced number of heterozygote patches in the population, which could contribute to the increase in SNDs for Aa-Aa joins. Lastly, in these contact areas, there was more mixing of genotypes within distance classes, which could also enhance the decrease in SNDs for like homozygotes. However, the simulations of set 1 do not allow one to distinguish between the direct effect of clumping, if any, and the effect of mean dispersal distances.
With self-incompatibility (SI), autocorrelations at short and intermediate distances were always larger for nonuniform (L ≥ 200) than for uniform (L = 100) spatial distributions of individuals, except in the first distance class where autocorrelations for L = 100 were similar to those for large L values (Figure 5). Autocorrelation values first increased with increasing L, and then stabilized for L ≥ 1000-1500. This general pattern could be due, as above, to the decrease in mean dispersal distances, as well as to the large increase in clumping of individuals. Heterozygote patches were fewer, and thus values of SNDs for Aa-Aa joins were larger. This effect on Moran's I-values was opposed to the effects of reduced homogeneity of homozygote patches and lower distance between unlike homozygote patches. The particular pattern observed in the first distance classes (decrease with large L values), seemed to be primarily due to very high local densities leading to more mixing of heterozygotes with homozygotes at short distances, as shown by a decrease in SNDs for Aa-Aa joins. X-intercepts of SND-correlograms for AA-AA joins decreased with increasing L values, reflecting a reduction in the size of homozygote patches.
Introduction of SI had no effect on I-correlograms (Figure 1 and Figure 5) and SND-correlograms, for L = 100. However for L = 100, patches were slightly larger with SI than with SC, as shown by values of X-intercepts. For larger L values, Moran's I-statistics at short and intermediate distances were lower with SI than with SC. This difference was greater for L ≥ 1000 than for L < 1000. Differences in outcrossing rates (Figure 4) could not explain these differences in spatial autocorrelations. Conversely, differences in the level of clumping (as measured by d w , Figure 4) and mean pollen dispersal distances (Table 2) were linked to these differences in spatial autocorrelations. With SI, clumps were denser, larger, and more isolated from each other than with SC, because isolated individuals could not reproduce. This increase in local density yielded more mixing of homozygotes with heterozygotes at short and intermediate distances, which could partly explain the decrease in autocorrelations obtained with the introduction of SI, in combination with the larger effective pollen dispersal distances.
Evolution in time from random initial state:
In all cases for which we recorded the evolution of spatial structure in time (Table 1), Moran's I-statistics (Figure 6) and d w initially increased rapidly, and then considerably more slowly. Although in some cases they were still increasing after 100 x C cycles (e.g., with selection), their mean value always varied little between cycles 100 x C and 200 x C. Therefore the time required for spatial genetic structure to stabilize was similar in nonuniformly and uniformly distributed populations. Fixation indices and effective outcrossing rates showed a similar evolution in time (i.e., rapid increase or decrease in earlier cycles, and little change in later cycles). For genetic variables, variability between replicates tended to increase with time (Figure 6). In some cases, it was relatively high after the last cycle (Table 3).
Effect of global density (simulation set 2):
Moran's I-statistics always increased with decreasing global density (Figure 7). This resulted from an increase in SNDs for like homozygote joins, an increase in SNDs for Aa-Aa joins, and a decrease in SNDs for AA-aa joins. This increase in spatial genetic structure was attributable primarily to the decrease in mean relative dispersal distances (measured in units of average interplant absolute distances). To test whether the growing isolation of clumps, which resulted in increased d w values, had any additional effect, we computed relative dispersal distances for the first nine replicates of subsets 1a (L = 100, D = P = 1) and 1f (L = 1000, D = P = 10) with SC. Simulations of these two subsets produced very different spatial distributions of individuals (uniform and clumped, respectively), whereas we expected them to yield very similar relative dispersal distances. However, mean seed and pollen relative dispersal distances were slightly higher for subset 1a (0.021 and 0.021, respectively) than for subset 1f (0.018 and 0.014, respectively), and it was not possible to know whether this difference was sufficient to explain the large discrepancy observed in Moran's I-values (Table 3). So we could not make any conclusion about the existence of any additional effect of clumping in this case.
Note that in Figures 7, 8, and 9, each correlogram consists of two different parts, corresponding to the two widths that were used for distance classes (one grid unit for small distances and 10 grid units for larger distances). These two parts cannot be compared within each correlogram because Moran's I-value is dependent on the width of distance class. But each part can be compared among different correlograms.
The increase in Moran's I-statistics with decreasing global density was less pronounced with SI than with SC for large L values (Figure 7), which resulted from lower SNDs for Aa-Aa and AA-Aa joins, and higher SNDs for AA-aa joins. This could be due to differences in relative dispersal distances, clumping, and/or outcrossing. With SC, most clumps were small and consisted of one homozygote patch, and there was substantial selfing. With SI, owing to larger, denser, and more isolated clumps, together with the absence of selfing, heterozygosity and d w were higher, and considerable local mixing of genotypes was maintained even at very low global densities.
Effects of seed and pollen dispersal (simulation sets 3 and 4):
Moran's I-values always decreased with increasing D (maximum seed dispersal distance) (Figure 8). They were always lower with SI than with SC, due to a smaller excess of Aa-Aa joins and a smaller deficit of AA-aa joins. Both higher local densities and the absence of selfing seemed to be responsible for these differences. With SC, individuals were significantly clumped for D ≤ 20, and their distribution became random for D ≥ 50 (Table 3). With SI, d w values also decreased with increasing D, but spatial distribution of individuals remained highly clumped even for large D values; all individuals of the population were then grouped within a single large clump spread over approximately half the whole grid area, because isolated individuals did not reproduce.
Moran's I-values decreased with increasing P (maximum pollination distance) for P ≥ 20 (Figure 9). In these cases, I-values, spatial distribution of individuals, and outcrossing rates were similar with SI and with SC, which was due to the fact that pollination distances were large enough for spatially isolated individuals not to be reproductively isolated. For P = 10 (reference subset), I-values were lower with SI than with SC and d w values were higher, because spatially isolated individuals were reproductively isolated. In this case, clumping developed with SI and there was substantial selfing with SC (Te ≈ 0.8).
Comparing the effects of D and P (Figure 8 and Figure 9) shows that a given increase in D had a stronger decreasing effect on Moran's I-statistics than the same increase in P. This difference presumably resulted from a combination of two causes: (1) the difference in ploidy level between seeds and pollen (and thus the difference in the number of genes dispersed during each dispersal event), and (2) the difference in relative dispersal distances (when D was increased, effective pollen dispersal distances were expected to be larger and interplant distances lower, whereas when P was increased, both effective seed dispersal distances and interplant distances were expected not to change). The opposing effect of differences in local density on Moran's I was weaker: local density decreased when D was increased, whereas it essentially did not change when P was increased. The effect of outcrossing also opposed these ploidy and relative dispersal effects, leading to more difference between the effects of D and P with SI than with SC, since outcrossing rate decreased with increasing D, whereas it increased with increasing P.
Effect of generation overlapping (simulation set 5):
Moran's I-statistics always increased when C (generation overlapping) was increased (Table 3). This resulted from an increase in SNDs for like homozygote joins and Aa-Aa joins, and a decrease in SNDs for AA-aa joins. However, the largest difference was found between the reference subset (no overlap) and 5b or 5c (some overlap), and the difference between subsets 5b (C = 5) and 5c (C = 10) was much smaller. This effect of overlapping was presumably due to the increased kinship between mates, which is expected to result from the combination of restricted gene flow and mating of individuals with their ancestors. Specific computations are required to test for this hypothesis. However, mean values of the probability that, at the neutral locus, two genes randomly drawn without replacement among all the genes of the population were identical by descent, were higher with overlap than without (0.013, 0.032, and 0.037 for the reference subset and subsets 5b, 5c, respectively), which strongly supports this interpretation. I-correlograms were always lower with SI than with SC, which was probably due to the absence of selfing, higher local density, and lower relative dispersal distances.
Effect of selection (simulation set 6):
In this section, all genetic values reported are for the unlinked neutral locus, unless otherwise stated. With SC, Moran's I-statistics decreased for S (selection coefficient) from 0 to 0.6, and then increased again for S up to 1; I-values were similar for S = 1 and S = 0 (Figure 10). These changes were related to changes in d w values (Table 3) and in local density (as illustrated by the differences between Figure 3 and Figure 11), which were both highest for intermediate S values. These variations in clumping resulted in more spatial mixing of genotypes and larger outcrossing rates for intermediate than for extreme S values, which could explain the increase in heterozygosity and the lower spatial genetic autocorrelations. In additional simulations with uniform distribution of individuals (with SC, L = 100, D = P = C = 1, S = 0.5, 10 replicates), the average Moran's I value in the first distance class was 0.36 (SD 0.028). This result confirms that the effect of selection on spatial genetic structure was due to variations in clumping. In turn, these changes in clumping could result from the fact that the mean effective number of offspring produced by homozygotes at the selected locus was lower than the mean number of offspring produced by heterozygotes (data not shown). But this hypothesis has not been tested and there might be alternative explanations. Since gene flow was restricted enough to allow for the development of homozygote patches, mating between like genotypes was enhanced. Therefore, at the selected locus, heterozygotes on average produced a larger proportion of heterozygote offspring (i.e., offspring more likely to survive in the presence of overdominance selection) than did homozygotes. This relative reproductive advantage of heterozygotes over homozygotes was further increased by a significant part of selfing among matings. 
When the proportion of heterozygotes in the population was intermediate (i.e., for S ~0.5), heterozygotes at the selected locus would have the highest relative reproductive advantage, which would lead to clumping of surviving offspring around the locations of mothers heterozygous at the selected locus. However, the global variance in the number of surviving offspring was only slightly, and not significantly, larger for intermediate than for extreme S values (results not shown). When S = 0 (no selection) or S = 1 (100% heterozygotes at the selected locus), heterozygotes at the selected locus had no particular advantage, which could explain why there was less clumping of individuals.
With SI, differences in I-correlograms between subsets of set 6 were much lower than with SC (Figure 10). This is consistent with the above interpretation, since with SI, heterozygotes at the selected locus have less reproductive advantage, because of both lower spatial autocorrelations and the absence of selfing. This was confirmed by the absence of systematic difference between the mean number of offspring produced by homozygotes and heterozygotes at the selected locus (data not shown).
The case study with only one selected locus and the widest possible range for S values (from 0 to 1) was useful to help understand the mechanisms through which spatial genetic structure at neutral loci may be modified by the presence of selection at other, independent loci. However, the complementary simulations carried out with 10 selected loci and selection coefficients from 0.01 to 0.20, represented situations far more likely in real populations. For selection coefficients ≥ 0.02, d w statistics had mean values similar to or greater than the highest values obtained with only one selected locus (i.e., 0.43 for S = 0.6), and I-correlograms were lower.
Relationships between clumping, local density, and spatial genetic structure:
For simulation sets 2 to 6, the relationship between spatial genetic structure and spatial distribution of individuals (as measured by Diggle's d w) was different among sets (Figure 12, A and B). In some cases (for variations in S with SC, or introduction of SI), increased clumping was related to lower spatial autocorrelations. This pattern presumably resulted from the increase in both outcrossing rate and local density, which led to an increase in heterozygosity and spatial mixing of genotypes. Conversely, in other cases (for variations in C with SC, variations in D, or variations in global density), increased clumping was related to higher spatial autocorrelations. This common pattern, however, had three different explanations. First, the changes in kinship directly due to variations in generation overlapping (C) were strong enough to counter and overcome the aforementioned changes in clumping. Second, reduced maximum seed dispersal distances (D) resulted in less spatial mixing of genotypes and therefore more local genetic differentiation, in spite of higher local densities and clumping. Lastly, when global density was decreased, d w increased, but this reflected variations in clumping different from those described above (smaller and more numerous clumps developed with lower local density, and thus a lower outcrossing rate, instead of fewer and larger clumps with higher local density). Meanwhile, relative dispersal distances decreased, leading to less mixing of genotypes and more local fixation of alleles.
| DISCUSSION |
|---|
Nonuniform spatial distribution of individuals:
The present study is the first one to give a quantitative description of fine-scale spatial organization of genotypes in theoretical plant populations with nonuniform spatial distribution of individuals. Studying cases of extreme aggregation has allowed us to bring to light some potential effects of clumping on spatial genetic structure. Our main findings are the following: (1) the presence of strong clumping leads to a slight increase in spatial autocorrelations compared with uniformly distributed populations, independently of the effect of dispersal distances; (2) global density, seed and pollen dispersal, overlap of generations, overdominance selection at unlinked loci, and self-incompatibility may all influence aggregation; and (3) the effect of each of these variables on spatial genetic structure, except maybe global density, may be affected by clumping.
Before discussing these results in more detail, it is worth examining the appropriateness and expected consequences of three assumptions that might be considered as major limits to our model: the absence of microenvironmental heterogeneity, the absence of density-dependent survival, and the uniform distribution of dispersal distances.
Our simulations with restricted dispersal produced very clumped spatial distributions of individuals, sometimes with large clumps separated by large empty areas. Such extreme configurations can be found in some species, in particular in tropical forest tree species (e.g., d w values > 0.20 were observed for several tropical forest tree species in French Guiana; CIRAD-Forêt, unpublished data). But then, spatial aggregation may be due primarily to microenvironmental heterogeneity of survival rates rather than to restricted dispersal. In such cases, metapopulation models would certainly be more appropriate. Our model is very different in that clumps can change sizes, shapes, and locations over time. Nevertheless, in many cases, there is as yet no evidence that habitat characteristics have any influence on spatial distribution of individuals. In a neotropical forest, ![]()
Evidence of density-dependent mortality in natural plant populations is scarce.
We assumed uniform distributions of pollen and seed dispersal distances, both to save computer time and for consistency with previous simulation studies. For highly clumped distributions such as those obtained in our simulations, one important consequence of this assumption, together with the assumption that pollinator movement is independent of local plant density, is that unoccupied habitat between patches is an absolute barrier to gene flow when gaps are larger than maximum dispersal distances. This is not supported by the empirical evidence available to date. The consequence of introducing a low proportion of long-distance dispersal, in addition to high amounts of short-distance dispersal would certainly be comparable to the effect of random long-distance immigration as modelled by ![]()
Therefore, even though our model may lack realism because it is based on very simple assumptions about dispersal and the spatial distribution of individuals, the study of its behavior in a few cases of extreme clumping has provided results very useful for a better understanding of the distributions of genotypes observed in real plant populations.
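The barrier effect implied by a bounded dispersal distribution can be sketched as follows. This is only an illustration: it assumes that the dispersal distance is drawn uniformly on [0, d_max] in a uniformly random direction, which is one possible reading of "uniform distribution of dispersal distances"; the function names and the numerical values are hypothetical.

```python
import math
import random


def disperse(origin, d_max, rng):
    """Return a landing point at a distance uniform on [0, d_max].

    The direction is uniform on [0, 2*pi). Both choices are assumptions
    of this sketch.
    """
    distance = rng.uniform(0.0, d_max)
    angle = rng.uniform(0.0, 2.0 * math.pi)
    return (origin[0] + distance * math.cos(angle),
            origin[1] + distance * math.sin(angle))


def can_cross_gap(gap_width, d_max):
    """A gap wider than the maximum dispersal distance can never be crossed."""
    return gap_width <= d_max


rng = random.Random(0)  # fixed seed so the sketch is reproducible
d_max = 5.0
landings = [disperse((0.0, 0.0), d_max, rng) for _ in range(10_000)]
distances = [math.hypot(x, y) for x, y in landings]
max_reach = max(distances)
mean_reach = sum(distances) / len(distances)
```

With a bounded distribution of this kind, no propagule lands beyond `d_max`, so `can_cross_gap(6.0, d_max)` is false however many dispersal events are simulated. Adding even a small proportion of long-distance events would remove this absolute barrier.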
First, we found that allowing for strong clumping leads to higher values of spatial genetic autocorrelations than in uniformly distributed populations. Simulations in set 1 do not allow us to distinguish between the direct effect of clumping, if any, and the effect of mean dispersal distances, since the only way to obtain clumped distributions is to change dispersal distances. However, comparison with the results of ![]()
σ²axial) is not proportional to Wright's neighborhood area, because axial dispersal distances are not normally distributed. But in our simulations, for subsets 1b to 1h with SC, σ²axial values were between 0.34 and 0.37, and I-values in the first distance class were all higher than 0.5, whereas in EPPERSON's (1995b) simulation set 1, Ic was lower than 0.5 for σ²axial = Ne/4π = 0.33. Moreover, since local density is higher in our simulations, the neighborhood size is certainly higher too. In our simulation set 1 with SI, Moran's I-values were not a monotonic function of mean dispersal distances, which is additional evidence that these distances are not the only factor influencing spatial genetic structure in nonuniformly distributed populations. In addition to the higher local densities, the effect of clumping seems to be mainly due to the fact that under nonuniform distributions, homozygote patches can be separated either by empty grid areas or by heterozygote patches, whereas with a uniform distribution they can be separated only by heterozygote patches.
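For reference, the conversion between axial dispersal variance and Wright's neighborhood size can be written out as a short sketch. It uses the standard relation Ne = 4π σ² d for normally distributed axial distances, with d the density of individuals (taken as 1 per unit area by default, an assumption of this sketch). As noted above, this proportionality does not hold when dispersal distances are not normally distributed, so the conversion is shown only for the comparison value. The function names are hypothetical.

```python
import math


def neighborhood_size(sigma2_axial, density=1.0):
    """Wright's neighborhood size, Ne = 4 * pi * sigma^2 * density.

    Valid for normally distributed axial dispersal distances; density
    defaults to one individual per unit area (assumption of this sketch).
    """
    return 4.0 * math.pi * sigma2_axial * density


def axial_variance(ne, density=1.0):
    """Inverse relation: sigma^2 = Ne / (4 * pi * density)."""
    return ne / (4.0 * math.pi * density)


# Axial variance of 0.33 quoted for the comparison simulations.
ne_reference = neighborhood_size(0.33)   # about 4.15 individuals
sigma2_back = axial_variance(ne_reference)
```

At equal axial variance, a higher local density gives a proportionally larger neighborhood size under this relation, which is the direction of the argument made in the text.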
For given values of maximal dispersal distances and global density, both mean dispersal distances and Moran's I-values stabilized when a continuous distribution of individuals was approached (for L around 1000 in simulations of set 1). We chose this particular case as the reference case to study the separate effects of a few demographic (global density, generation overlapping), reproductive (gene dispersal, self-incompatibility), and genetic (selection) factors, on the spatial distribution of individuals and genotypes. Note that we mainly used small seed dispersal distances relative to global density. Thus we obtained highly clumped distributions, the study of which is useful to understand better the potential determinants of aggregation and spatial genetic structure.
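As a reminder of how a single point of an I-correlogram is obtained, the sketch below computes Moran's I for one distance class. It assumes binary weights (1 for pairs of individuals whose separation falls within the class, 0 otherwise) and genotypes coded as individual allele frequencies (0, 0.5, or 1); both the coding and the function name `morans_i` are assumptions made for illustration.

```python
import math


def morans_i(points, values, d_min, d_max):
    """Moran's I for pairs separated by a distance in (d_min, d_max].

    points: list of (x, y) coordinates of individuals.
    values: one numeric genotype score per individual.
    Returns NaN when the class contains no pair or the scores do not vary.
    """
    n = len(values)
    mean = sum(values) / n
    dev = [v - mean for v in values]
    denom = sum(d * d for d in dev)
    num = 0.0
    n_pairs = 0  # number of ordered pairs in the distance class
    for i in range(n):
        for j in range(n):
            if i == j:
                continue
            if d_min < math.dist(points[i], points[j]) <= d_max:
                num += dev[i] * dev[j]
                n_pairs += 1
    if n_pairs == 0 or denom == 0.0:
        return float("nan")
    return (n / n_pairs) * (num / denom)


# Two homozygote patches of opposite type separated by an empty area:
# neighbors always share the same genotype, so I is close to +1.
patch_points = [(0, 0), (0, 1), (1, 0), (1, 1),
                (10, 0), (10, 1), (11, 0), (11, 1)]
patch_values = [1.0] * 4 + [0.0] * 4
i_patchy = morans_i(patch_points, patch_values, 0.0, 1.5)

# Alternating genotypes along a line: neighbors always differ,
# so I is close to -1.
line_points = [(0, 0), (1, 0), (2, 0), (3, 0)]
line_values = [1.0, 0.0, 1.0, 0.0]
i_alternating = morans_i(line_points, line_values, 0.0, 1.0)
```

The first example corresponds to the situation described above in which homozygote patches are separated by empty areas rather than by heterozygote patches.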
Second, our simulations have shown that aggregation can be influenced by self-compatibility, pollination distances, generation overlapping, and selection, in addition to the more straightforward effects of seed dispersal distances and global density. These previously undescribed mechanisms are potentially active in all plant species. Thus further studies are needed to assess their effective importance in real populations.
Third, we found that these changes in spatial distributions of individuals in turn have major consequences on the way dispersal, generation overlapping, selection, and self-compatibility influence spatial genetic structure, as will now be discussed. In all cases, part of the effect of clumping is likely to be due to changes in relative dispersal distances, since clumping is expected to reduce mean interplant distances. However, our simulations did not allow us to dissociate these effects of relative dispersal from hypothetical specific effects of clumping, since we did not systematically record relative dispersal distances.
Relationships between clumping, local density, and spatial genetic structure:
We have found no systematic relationship between variations in clumping as measured by d w, and spatial genetic structure, in the few particular cases studied here. There are two reasons for this. First, two input variables (the maximum seed dispersal distance and the amount of generation overlapping) had direct effects on spatial genetic structure that were opposed to and stronger than their indirect effect through variations in clumping. Second, d w statistics measure the extent to which distributions deviate from randomness, but they are not sufficient to entirely characterize clumping. The same d w value can be obtained in situations showing very different local densities. In the present study, local density (of which outcrossing rate before selection is a direct measure here) was positively related with d w within all simulation sets except the one involving variations in global density. Therefore, clumping is better described with both the d w statistic and outcrossing before selection than with the d w statistic alone. These results imply that the level of clumping as measured by the d w statistic only cannot directly be used to predict spatial genetic structure.
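The distinction between few large clumps and many small clumps can be illustrated with a simple local density measure. The sketch below counts neighbors within a fixed radius; it does not compute Diggle's d w. The mapping from neighbor count to outcrossing rate (a pollen donor drawn at random among all individuals within the radius, the mother included, so that an individual with k neighbors outcrosses with probability k/(k + 1)) is a hypothetical rule introduced only for this illustration.

```python
import math


def neighbor_counts(points, radius):
    """Number of other individuals within `radius` of each individual."""
    return [
        sum(1 for j, q in enumerate(points)
            if j != i and math.dist(p, q) <= radius)
        for i, p in enumerate(points)
    ]


def mean_local_density(points, radius):
    """Mean number of neighbors per individual within `radius`."""
    counts = neighbor_counts(points, radius)
    return sum(counts) / len(counts)


def hypothetical_outcrossing_rate(points, radius):
    """Mean of k / (k + 1) over individuals (illustrative rule, see lead-in)."""
    counts = neighbor_counts(points, radius)
    return sum(k / (k + 1) for k in counts) / len(counts)


# Same number of individuals in both patterns.
two_large_clumps = [(0, 0), (1, 0), (0, 1), (1, 1),
                    (20, 0), (21, 0), (20, 1), (21, 1)]
four_small_clumps = [(0, 0), (1, 0), (20, 0), (21, 0),
                     (0, 20), (1, 20), (20, 20), (21, 20)]

density_large = mean_local_density(two_large_clumps, 1.5)
density_small = mean_local_density(four_small_clumps, 1.5)
outcrossing_large = hypothetical_outcrossing_rate(two_large_clumps, 1.5)
outcrossing_small = hypothetical_outcrossing_rate(four_small_clumps, 1.5)
```

Both patterns are strongly aggregated, yet they differ in local density and, under the illustrative rule, in outcrossing rate. This is the sense in which a single aggregation statistic may not suffice to characterize clumping.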
Comparative effects of both gene dispersal modes:
Previous simulation studies showed that increasing pollination distances with little or no seed dispersal, or simultaneously increasing seed and pollen dispersal distances (with the same neighborhood sizes), both led to reduced fine-scale genetic structure (e.g., ![]()
Effects of generation overlapping and selection:
Our results suggest that with nonuniform distributions of individuals, the effect of generation overlapping on spatial genetic structure was reduced by variations in clumping. Therefore, we would expect this effect to be even stronger with uniform distributions of individuals. But the effect of generation overlapping has scarcely been studied in such cases. The simulations of ![]()
Spatial genetic structure at a neutral locus when selection occurs at independent loci has never been described before. Both ![]()
= 5%) for 49, 47, 37, 32, 21, and 4 replicates out of 50, for the reference subset and subsets 6b to 6f with SC, respectively. Therefore it is not an effect at the individual level, but rather at the population level. As a consequence, this effect is expected to be found only in nonuniformly distributed populations with spatial genetic structure and mixed mating, as supported by the few complementary simulations we carried out with selection in a uniformly distributed population. Moreover, only very low selection coefficients (e.g., 0.02) at a few (e.g., 10) selected loci were sufficient to obtain large changes in the spatial features of the population. Even though very few data are available to date, overdominance selection events might not be scarce in natural populations, especially for long-lived perennials [e.g., ![]()
Consequences of self-incompatibility and outcrossing determinism:
Our results stress the importance of knowing whether a species is self-compatible or not in order to understand its fine-scale spatial genetic structure. The introduction of self-incompatibility (when defined as the inability to self) can induce very large decreases in spatial genetic structure within populations with nonuniform distributions, whereas it has no effect in uniformly distributed populations. This difference was not directly due to the absence of selfing itself, but to the higher clumping and local density that indirectly resulted from the absence of selfing. Even though the high levels of clumping reached in some of our simulations are not very realistic, as discussed above, these simulations help understand the extent to which the ability of isolated individuals to reproduce through selfing may influence spatial genetic structure in nonuniformly distributed populations. Introducing a gametophytic self-incompatibility genetic system (SI locus with several SI alleles) instead of merely prohibiting selfing is expected to further reinforce the effect of self-incompatibility on spatial genetic structure through a reduction in biparental inbreeding. In the simulations of ![]()
The effect of overdominance selection on the spatial distribution of individuals and genotypes which was found here is dependent upon both some selfing among matings and a large influence of local density on outcrossing rate. Therefore, it will be particularly important to assess the extent to which the amount of outcrossing is determined by local density in self-compatible species. Relationships between outcrossing rate and density of mature individuals have been found in natural populations for a few species [e.g., ![]()
Consequences for interpretation of I-correlograms observed in real populations:
In conclusion, our results show the potential benefits of enlarging the framework for simulation studies of fine-scale spatial genetic structure, compared to what has been done so far, by taking into account a nonuniform rather than a uniform spatial distribution of individuals. The model presented here is based on very simple assumptions and thus it has many limits. Nevertheless, the study of its behavior in a few particular cases of extreme aggregation has brought to light some mechanisms, never described before, through which variations in clumping might influence spatial genetic structure within plant populations (e.g., the effects of self-incompatibility or overdominance selection on spatial genetic structure). In particular, our results stress the need to assess the real importance of self-incompatibility, generation overlapping, and overdominance selection in natural populations, as well as the effective influence of local density on outcrossing rate, and the extent of differences between seed and pollen dispersal distances. Further study of the present model is required (1) to determine the conditions under which each potential mechanism described here will effectively play a significant role in shaping fine-scale spatial genetic structure, and (2) to assess the exact contribution of relative dispersal distances to the effect of clumping on genetic structure.
| FOOTNOTES |
|---|
1 Present address: GEVES, La Minière, 78285 Guyancourt Cedex, France.
| ACKNOWLEDGMENTS |
|---|
The authors thank B. K. EPPERSON, J. S. HEYWOOD, and an anonymous reviewer, for constructive comments on the manuscript, as well as D. DOLIGEZ for helpful advice on the implementation of simulations, and L. HOUDE for his help with the use of spatial statistics. This investigation was supported by a grant from the French Ministère de la Recherche to A.D. and funds from CIRAD-Forêt.
Manuscript received March 3, 1997; Accepted for publication October 10, 1997.
| LITERATURE CITED |
|---|
ANTONOVICS, J. and D. A. LEVIN, 1980 The ecological and genetic consequences of density-dependent regulation in plants. Annu. Rev. Ecol. Syst. 11:411-452.
ARMESTO, J. J., J. D. MITCHELL, and C. VILLAGRAN, 1986 A comparison of spatial patterns of trees in some tropical and temperate forests. Biotropica 18:1-11.
BERG, E. E. and J. L. HAMRICK, 1995 Fine-scale genetic structure of a turkey oak forest. Evolution 49:110-120.
BARBUJANI, G., 1987 Autocorrelation of gene frequencies under isolation by distance. Genetics 117:777-782.
BOS, M. and E. VAN DER HARING, 1988 Gene flow in Plantago. II. Gene flow pattern and population structure. A simulation study. Heredity 61:1-11.
BUSH, R. M. and P. E. SMOUSE, 1991 The impact of electrophoretic genotype on life history traits in Pinus taeda. Evolution 45:481-498.
CHARLESWORTH, D., 1991 The apparent selection on neutral marker loci in partially inbreeding populations. Genet. Res. 57:159-175.
CLIFF, A. D., and J. K. ORD, 1981 Spatial processes. Pion, London.
CONDIT, R., S. P. HUBBELL, and R. B. FOSTER, 1992 Recruitment near conspecific adults and the maintenance of tree and shrub diversity in a neotropical forest. Am. Nat. 140:261-286.
CRAWFORD, T. J., 1984 The estimation of neighbourhood parameters for plant populations. Heredity 52:273-283.
EPPERSON, B. K., 1990 Spatial autocorrelation of genotypes under directional selection. Genetics 124:757-771.
EPPERSON, B. K., 1993 Recent advances in correlation studies of spatial patterns of genetic variation. Evol. Biol. 27:95-155.
EPPERSON, B. K., 1995a Spatial distributions of genotypes under isolation by distance. Genetics 140:1431-1440.
EPPERSON, B. K., 1995b Fine-scale spatial structure: correlations for individual genotypes differ from those for local gene frequencies. Evolution 49:1022-1026.
EPPERSON, B. K. and T. LI, 1996 Measurement of genetic structure within populations using Moran's spatial autocorrelation statistics. Proc. Natl. Acad. Sci. USA 93:10528-10532.
EPPERSON, B. K. and T. LI, 1997 Gene dispersal and spatial genetic structure. Evolution in press.
HEYWOOD, J. S., 1991 Spatial analysis of genetic variation in plant populations. Annu. Rev. Ecol. Syst. 22:335-355.

, reference subset (marked by an arrow); for all other subsets, the size of the letter used in the plot is proportional to the value of the input variable, which differs from the reference (see