Abstract
Common principal components (CPC) analysis is a technique for assessing whether variancecovariance matrices from different populations have similar structure. One potential application is to compare additive genetic variancecovariance matrices, G. In this article, the conditions under which G matrices are expected to have common PCs are derived for a twolocus, twoallele model and the model of constrained pleiotropy. The theory demonstrates that whether G matrices are expected to have common PCs is largely determined by whether pleiotropic effects have a modular organization. If two (or more) populations have modules and these modules have the same direction, the G matrices have a common PC, regardless of allele frequencies. In the absence of modules, common PCs exist only for very restricted combinations of allele frequencies. Together, these two results imply that, when populations are evolving, common PCs are expected only when the populations have modules in common. These results have two implications: (1) In general, G matrices will not have common PCs, and (2) when they do, these PCs indicate common modular organization. The interpretation of common PCs identified for estimates of G matrices is discussed in light of these results.
COMPARISON of additive genetic variancecovariance matrices (the G matrices) of different populations is an important goal in evolutionary quantitative genetics (Steppanet al. 2002). Such comparisons identify commonalities (e.g., Kohn and Atchley 1988; Phillips and Arnold 1999; Roff 2000) or summarize differences (e.g., Shawet al. 1995; Paulsen 1996; Steppan 1997) in the structure of G. The broad motivation behind such analysis is clear: G is a key component for predicting trait evolution under directional selection and genetic drift as well as for retroactively estimating the selection gradient (Lande 1979; Lande and Arnold 1983). Comparison of G therefore reveals whether differences in genetic variation may have played a role in divergent evolutionary trajectories (Priceet al. 1993). Furthermore, as the structure of G depends on pleiotropic effects of segregating alleles, commonalities may also contain information on genetic architecture shared by populations (Phillips and Arnold 1999). Comparison of G matrices has been the primary goal of many studies (Arnold 1981; Lofsvold 1986; Kohn and Atchley 1988; Wilkinsonet al. 1990; Atchleyet al. 1992; Brodie 1993; Shawet al. 1995; Paulsen 1996; Podolskyet al. 1997; Steppan 1997; Arnold and Phillips 1999; Camara and Pigliucci 1999; Badayaev and Hill 2000; Pfrender and Lynch 2000; Roff, 2000, 2002; Service 2000; Waldmann and Anderson 2000; Phillipset al. 2001). The results have been used to address issues in areas as diverse as evolution of predation patterns (Arnold 1981), covariance patterns resulting from mutation (Camara and Pigliucci 1999), and change in the G matrix itself (see, e.g., Wilkinsonet al. 1990; Pfrender and Lynch 2000).
Although the utility of comparing G matrices seems clear, which methods will furnish informative conclusions is not (Steppanet al. 2002). Difficulty arises in any case where trait number is greater than one. For n traits, the G matrix includes n(n + 1)/2 variance and covariance elements, and each of these may be larger or smaller than its corresponding element in other G matrices. Moreover, the number of factors that potentially influence the values of the n(n + 1)/2 variances and covariances is considerable, including the specific pleiotropic effects of segregating alleles, the frequency of these alleles, gameticphase disequilibrium, nonadditive effects, and the effects of mutation. Therefore, it is not easy to define a statistic that both summarizes structure shared by G matrices and provides a clear interpretation of the implications of shared structure.
Of the many multivariate statistical techniques proposed for comparison of G matrices (reviewed in Steppanet al. 2002), common principal components (CPC) analysis (Flury 1987, 1988) is fast becoming the method of choice (Arnold and Phillips 1999; Camara and Pigliucci 1999; Phillips and Arnold 1999; Pfrender and Lynch 2000; Roff 2000). The goal of CPC analysis is to summarize the structure of two (or more) matrices in terms of common principal components, those principal components (PCs) that have the same direction, and to test for differences from this common model (Flury 1987, 1988). The statistical models of CPC arranged in order of increasing amount of common structure are the following: no similarity among matrices; PCPC(1), where matrices have a single common PC; PCPC(2);...; PCPC(n – 2); CPC(All); matrix proportionality; and matrix equality. The m common PCs referred to in PCPC(m) may refer to any m PCs shared among matrices, and common PCs referred to in the CPC models need not be associated with eigenvalues of the same rank. A number of approaches are possible for determining which CPC model correctly describes matrix structure (Flury 1988; Phillips and Arnold 1999). The software implementations used by almost all CPC analyses to date (Phillips 1998a,b,c) employ a maximumlikelihood approach and a hierarchy of hypothesis tests (the Flury hierarchy) to determine whether matrices have common principal components (see Phillips and Arnold 1999 for a discussion of the alternatives when implementing the Flury hierarchy).
The hierarchy of CPC models provides a valuable descriptive summary of matrix structure. However, the biological meaning of the results is unclear (Houleet al. 2002). In this article, we address the problem of interpreting common PCs by deriving the conditions under which we expect common PCs among G matrices. We perform the analysis for both a twolocus, twoallele model and the model of constrained pleiotropy (Wagner 1989). The analyses demonstrate that common PCs are expected only when pleiotropic effects are constrained to a modular organization (Wagner and Altenberg 1996). When populations being compared have a modular organization in common, they have a common PC. As is discussed, this latter result provides a biological interpretation of common PCs when power is sufficient to reveal differences in the direction of PCs among G matrices.
PLEIOTROPIC MODULARITY
Wagner and Altenberg (1996) defined a modular organization as a case in which “pleiotropic effects of the genes fall mainly among members of the same character complex and less frequently between members of different complexes” (p. 971). A modular organization is therefore defined in terms of pleiotropic effects when considering a specific set of traits. In this article, we address an extreme case of modular organization, in which x > 2 nonoverlapping sets of pleiotropic effects can be defined in which all pleiotropic effects in a set are orthogonal (oriented at 90°) to all other pleiotropic effects. Each of these x sets is a “perfect” module in the sense that, for the n measured traits, an orthogonal rotation can cause the pleiotropic effects of each module to fall entirely on a subset of the new axes and not at all on new axes affected by the other modules. Distinct populations have a perfect module in common if the same orthogonal rotation also results in perfect modules. The hypothetical case of perfect modules is used to illustrate the point that considerable constraints on pleiotropic effects are required for common PCs to be expected among G matrices.
Modularity can be defined both in terms of pleiotropic effects of alleles segregating in a population and in terms of the pleiotropic effects that may be introduced into the population by mutation (Wagner and Altenberg 1996). Modularities at these two levels are clearly related. If the pleiotropic distribution of possible mutations is modular, then the pleiotropic effects of segregating alleles will also be modular. Although cases are possible where segregating variation is modular while the distribution of mutations is not, such cases are expected to be transient because mutations introduce nonmodular variation (Wagner and Altenberg 1996; Mezeyet al. 2000). Modularity of segregating variation is therefore expected to be a strong indicator of modularity in the distribution of mutations. Our treatment concerns cases in which the distribution of mutations is modular.
TWOLOCUS, TWOALLELE MODEL
The goal of discussing this simple model is to provide an intuitive illustration of the relationship between modules and common PCs that also applies to the more general model of constrained pleiotropy (Wagner 1989). In a population, all genetic variation in n = 2 traits is assumed to be determined entirely by alleles segregating at the N = 2 loci. We assume complete additivity of allelic effects (no dominance or epistasis), no disequilibrium (gameticphase or otherwise), no maternal effects, no sex linkage, and no genotypeenvironment covariance or genotypeenvironment interactions. We also assume random mating among diploid individuals. In this case, the additive effect on the traits associated with allele k at locus j can be expressed as a vector,
The relationship between these two vectors is
In this twolocus, twoallele model, existence of a module depends on the orientation of the allelic vectors associated with each locus. If the allelic vectors at one locus are orthogonal to the allelic vectors at the other locus, two perfect modules are present, because the pleiotropic effects can be divided into two groups that do not have overlapping effects. As an example, consider the case diagrammed in Figure 1a.1, where the allelic vectors of the loci are orthogonal to one another. Rotating the trait axes to the direction of the allelic vectors associated with each locus produces two new traits, f_{1} and f_{2}, where the effects of allelic substitutions at each of the two loci are limited entirely to one of the two traits. Both of these new traits, f_{1} and f_{2}, therefore define perfect modules. Figure 1b.1 diagrams a case without perfect modules. Because the allelic vectors are not orthogonal, two modules cannot be defined by a rotation of the axes. However the axes are rotated, allelic substitutions at both loci have effects on both new traits. Note that a modular organization is possible in Figure 1b.1 if a nonorthogonal rotation is used, but such transformations do not result in modules in an evolutionary sense; i.e., directional selection cannot be applied to such a “module” without resulting in a correlated response. Such nonorthogonal modules will be the subject of another article (J. G. Mezey and D. Houle, unpublished results). We confine the discussion here to modules that can be defined by rotations of the trait axes.
In this twolocus, twoallele model, the existence of modules places a major constraint on the possible orientations of G matrix PCs. When modules exist, the PCs of the G matrix have the direction of the modules, regardless of allele frequencies and changes in allele frequencies (appendix a). To visualize this relationship between PCs and modules, again consider Figure 1. Figure 1a.2 diagrams the G matrix and PCs associated with two populations, both of which have the modules diagrammed in Figure 1a.1. The populations have different allele frequencies at the two loci, and as a result, the G matrices of the populations differ. Although the PCs of G are associated with different eigenvalues, the PCs have the same direction as the modules in both populations. Further, all variation attributable to the allelic substitutions defining an individual module is described by a single PC and its associated eigenvalue. Contrast this case with that diagrammed in Figure 1b.2, which diagrams G and the PCs for two populations where allelic vectors are described by Figure 1b.1. In this case, the different allele frequencies correspond to different structures of G and PCs that have different directions. In such cases, there is no simple relationship between PCs and the allelic effects associated with each locus.
For an individual population, the directions of G matrix PCs are always the same regardless of allele frequencies only if perfect modules exist (appendix a). Therefore, if distinct populations have such modules in common (the modules have the same direction), the G matrices of the populations will always have common PCs, regardless of allele frequencies. Note that this relationship depends entirely on the direction of the modules and not the specific allelic effects defining the modules. Populations with different allelic effects at the two loci always have common PCs as long as both populations have modules in the same direction. In contrast, if the populations being compared have no modules, only a restricted subset of allele frequencies results in common PCs (appendix a), even if the allelic vectors are the same in the populations being compared.
The cases diagrammed in Figure 2 illustrate these concepts. Figure 2a diagrams two populations (A and B) that have modules in common. For these populations, Figure 2a.1 diagrams in gray the allele (heterozygote) frequencies in population A that result in common PCs, given fixed allele frequencies in population B. Figure 2a.2 provides the equivalent diagram for population B given fixed allele frequencies in population A. Note that every possible allele frequency results in common PCs, regardless of the allele frequency in the other population. Contrast this situation with the case diagrammed in Figure 2b.1, where the populations have the same allelic effect vectors, but no modules are present. For these populations, the only allele frequencies for which common PCs occur are described by the dashed and dotted lines in Figure 2, b.1 and b.2, respectively. The allele frequencies that do not fall on these lines result in no common PCs. Therefore, only a very constrained set of allele frequencies results in common PCs when no modules are present.
The implication of these results is that, when comparing evolving populations, we should not expect common PCs unless there are common modules. Without modules, the allele frequencies required for common PCs are so constrained that they are unlikely to occur given the stochastic effects of mutation and genetic drift. As an example, consider a case in which populations A and B have the same allelic vectors but no modules exist (as in Figure 2a.1). As demonstrated in appendix a, common PCs occur in these two populations when the following constraint is satisfied,
Figure 3 provides a summary of the four possible cases that can arise when two populations are compared for the twolocus, twoallele model: (1) The populations have modules in common (Figure 3a), (2) both populations have modules but the directions are different (Figure 3b), (3) one population has a module and the other does not (Figure 3c), and (4) neither population has modules (Figure 3d). Only the case in Figure 3a will always have common PCs. For the cases in Figure 3, b–d, the vast majority of allele frequencies will result in no common PCs, and we should not expect to find common PCs when the populations are evolving.
Note that, if the allelic vectors in the two populations approximate a perfectly modular case (they are almost but not quite 90°), only very constrained allele frequencies result in common PCs as in Figure 3d. This result may seem strange. The reason for it is that the PCs in each G matrix must have exactly the same direction for common PCs to exist. In the absence of perfect modules, the vast majority of allele frequencies result in slight differences in the directions of the PCs in the populations and therefore in no common PCs. This is not to say that we would be able to determine that the PCs are different in such a case when analyzing estimates of the G matrices. The effects of sample size will tend to obscure such subtle differences, so cases that approximate perfect modules will be indistinguishable from perfect modules in practice. We return to this issue of how sample size affects the expectation of finding common PCs in the discussion.
MODEL OF CONSTRAINED PLEIOTROPY
Constraints on pleiotropic effects are the key to whether common PCs are expected. The model of constrained pleiotropy (Wagner 1989) formalizes a type of constraint that can result in common PCs. The conceptual underpinning of the model is the assumption that allelic variation at a given locus additively affects variation in a physiological property associated with a gene product of the locus. The relationship between variation in the property and the genetic variation in n phenotypic traits is assumed to be linear and is expressed as a matrix transformation (hence the model is sometimes referred to as the “Bmatrix” model). These assumptions constrain mutations at a given locus to have the same pleiotropic effects on the n traits, although the magnitudes of the pleiotropic effects associated with particular allelic substitutions at the locus may differ. In this way, the model differs from the more general additive pleiotropic model presented by Lande (1980), where the alleles at a locus may have different pleiotropic effects.
In a quantitative genetic formulation, the model of constrained pleiotropy makes the assumption that the absolute values of the additiveeffect vectors of any alleles k and l at a locus j are proportional:
In the model of constrained pleiotropy, modules exist if, for the N loci that may result in genetic variation in n traits, a subset of M loci (M < N) can be defined where the allelic vectors at each of these M loci are orthogonal to the allelic vectors at each of the other N – M loci. In this case, for each α_{jk}_{(}_{M}_{)} that may occur at the M loci and each α_{jk}_{(}_{N}_{–}_{M}_{)} that may occur at the remaining N – M loci,
Just as in the twolocus, twoallele model, when a onedimensional module exists in a population, a PC with the same direction as the module will exist regardless of allele frequencies (appendix b). Therefore, when populations have a module in common, they will always have a common PC with the same direction as the modules, regardless of allele frequencies in the populations. Also as in the twolocus twoallele model, if the populations do not have a onedimensional module in common, very restricted allele frequencies are required for common PCs to exist (appendix b). In the model of constrained pleiotropy, x common modules can exist, 0 < x ≤ n, when n traits are considered. The same reasoning applies to such cases: If populations have x modules in common, 0 < x ≤ n, at least x (excluding n – 1) common PCs will exist, and very restricted allele frequencies in the two populations will result in more than x common PCs (appendix b).
Figure 4 illustrates these concepts. It diagrams three different possibilities that may arise when two populations (A and B) are compared when n = 3. In Figure 4a, the two populations have three modules in common. The G matrices of these populations will always have three common PCs; i.e., all PCs will be common PCs. Note that even if both A and B had three modules but the modules had different directions in the two populations, only very restricted allele frequencies would result in common PCs (appendix b). In Figure 4b, the two populations have a single onedimensional module in common. In this case, the G matrices will always have one PC in common, although the PC may be associated with different eigenvalues in the two populations. For there to be more than a single common PC in case 4b, very restricted allele frequencies are required in the two populations. In Figure 4c, neither population has any modules. Again, only very restricted combinations of allele frequencies would yield common PCs.
In summary, when comparing evolving populations with x common onedimensional modules, we expect to find exactly x common PCs. The stochastic effects of mutation and genetic drift are very likely to result in allele frequencies where the other PCs differ in their orientiations (appendix b).
DISCUSSION
The goal of the theory developed in this article is to assess whether the CPC model that is the basis of the CPC analysis can be informative for comparing G matrices beyond a descriptive summary of matrix similarity. When assessed solely from this perspective, the results are quite positive. Because of the close relationship between common PCs and modular structure, when common PCs do exist they have a biological interpretation: Common PCs indicate the existence of common modules. The intuition that common PCs have a biologically meaningful interpretation is therefore well founded (Phillips and Arnold 1999).
The modular structure that is sufficient to create common PCs is quite restrictive. It requires that the genetic effects of some set of loci be orthogonal to those of all other segregating loci. This requirement is equivalent to the requirement that some rotation of the axes in phenotype space that produces traits that are independent of all other traits exists. Given the general assumption that pleiotropy is ubiquitous, which we share, the existence of such extreme modules seems somewhat unlikely. Thus, we expect that the form of modular structure and therefore common PCs is unusual. This is not to say that cases approximating modular organizations are expected to be so rare that the possibility of their existence should be discounted. As discussed by a number of authors (Wagner and Altenberg 1996; Cheverudet al. 1997; Rice 2000), pleiotropic distributions that approximate the perfect case (i.e., where pleiotropic effects are “mainly” limited to a particular subset of traits) are not necessarily unexpected, particularly when appropriate sets of traits are considered.
How are we to reconcile these results with those of studies that have applied CPC analysis to G matrices and reported many common PCs? For example, Arnold and Phillips (1999) compared G matrices for six morphological traits for two populations (inland and coastal) of the garter snake Thamnophis elegans. CPC analyses were performed for all possible pairwise comparisons of G estimated for both males and females in both populations. For almost all comparisons, the CPC model CPC(All) could not be rejected. Pfrender and Lynch (2000) estimated G for lifehistory traits for a population of Daphnia pulex at four different times. CPC analyses were performed for pairwise comparisons among three of these matrices. CPC models including at least one common PC could not be rejected for each of these comparisons.
If the intuition that common modules should be rare is correct, the most likely explanation is that the power to detect differences in the direction of matrix PCs is low for the sample sizes commonly used in estimates of G. This explanation seems particularly likely given the results of Houle et al. (2002). For example, in the Houle et al. (2002) study, matrices were simulated using an additive factor model in which the angle between the directions of the second PCs (corresponding to the second eigenvalue) in two matrices was altered. CPC analysis using the software of Phillips (1998c) was performed on 100 pairs of estimates of the matrices, with a sample size of 300. For differences in direction of up to 6°, both the jumpup and Akaike information criterion approaches indicated the CPC model equality (all PCs are in common) for as many as 50% of the comparisons. Because the sample sizes for most estimates of G are not large (Steppanet al. 2002), this result indicates that CPC analysis may be indicating more common PCs than actually exist as a result of low sample sizes.
One reason for the inability of CPC analysis to distinguish distinct PCs when sample sizes are low may be the way that position in the Flury hierarchy is assessed (Phillips and Arnold 1999). For example, the decision to move up in the Flury hierarchy in the jumpup approach advocated by Phillips and Arnold (1999) is based on being unable to reject a hypothesis of common PCs vs. a hypothesis that no common PCs exist. One moves up in the hierarchy until a hypothesis of x common PCs can be rejected. As Phillips and Arnold (1999) stated, such a result should not be interpreted as demonstrating that the matrices have x – 1 common PCs, only that the presence of x – 1 common PCs cannot be rejected. It is tempting, however, to interpret stopping position as a reflection of matrix similarity. In fact, the results of a CPC analysis reflect both matrix similarity and statistical power.
The sensitivity of CPC results to sample size means that, in practice, we cannot necessarily interpret common PCs as a demonstration of common modules. However, CPC analysis could be a useful tool for indicating which sets of traits are likely to have a modular organization, particularly if methods for assessing confidence in the existence of common PCs could be developed. We would not expect to have high confidence in a common PC among G matrices unless the populations have approximately modular organizations in common. The existence of modules would always have to be confirmed by independent means, because even without common modular organization, the allele frequencies required to produce a true common PC among G matrices could have occurred by chance.
In the context of identifying which sets of traits may have a modular organization, the reordering option available in the CPC analysis software of Phillips (1998a,b,c) is valuable. The default is that the program estimates the model CPC(All) for the combined data and builds the common PC models of the Flury hierarchy [PCPC(1), PCPC(2), etc.], using the common PCs of CPC(All) in rank order according to the size of their eigenvalues. The reordering option allows the user to designate a different ordering scheme. This flexibility is useful in a case where matrices have a common PC but the common PC is not associated with eigenvalues first in the default rank ordering. In such a case, the default generally causes CPC analysis to indicate no similarity among the matrices (Houleet al. 2002), but if the reordering option is used to place the true common PC first in the ordering scheme, CPC analysis should indicate that a common PC exists.
The possibility that CPC analysis could be developed for the detection of modules is a particularly exciting prospect because modules have clearly defined genetic and evolutionary properties. For example, from a genetics perspective, modules represent a specific constraint on how variation at the gene level is related to variation in the phenotype (Bonner 1988; Cheverudet al. 1997; Mezeyet al. 2000). When there is modular organization, the effects associated with groups of genes are entirely limited to distinct aspects of the phenotype. From an evolutionary perspective, modules are units in the sense that variation in the traits defining a module will be uncorrelated with variation in other traits, given appropriate assumptions about random mating and no gameticphase disequilibrium (Magwene 2001). Because selection can act on traits in a module without causing a correlated response in traits outside the module, identification of modules provides a foundation for constructing hypotheses about the evolutionary properties of a population that could be tested experimentally. Modular organization also plays an important conceptual role in relation to the evolvability of a genetic system (Wagner and Altenberg 1996; Wagner and Mezey 2003). It is therefore of interest to identify whether and to what degree cases of approximate modularity exist in nature.
In conclusion, our results suggest (1) that common PCs are unlikely without modular organization and (2) that there is a biological interpretation of common PCs and a possible role for common PCs in the identification of modular organization. In both cases, interpretation of common PCs will be stymied until a systematic study of the sensitivity of CPC analysis to sample size is performed. If this problem could be addressed, CPC analysis of G matrices could provide biologically useful insight beyond a summary of matrix structure. In this role, CPC analysis could be particularly useful for addressing questions that require a relatively complete picture of genetic architecture: Do modules correspond to functional architectures (Houleet al. 2002; Steppanet al. 2002)? To what extent is the structure of the G matrix constrained (Turelli 1988)? How modular is the GP map (Wagner 1996)?
Acknowledgments
We thank Kyle Galivan, Thomas F. Hansen, Frances C. James, Eric Klassen, Joseph Travis, ZhaoBang Zeng, and two anonymous reviewers for their comments on this manuscript. This work was supported by National Science Foundation grant no. 0129219.
APPENDIX A: TWOLOCUS, TWOALLELE MODEL
It is assumed that the entirety of the genetic variation in n = 2 traits is determined by alleles segregating at N = 2 loci where only two alleles are possible at each locus. Forward and backward mutations occur at locus j at the same rate, μ_{j}. We assume no dominance, epistasis, disequilibrium (linkage or otherwise), maternal effects, sex linkage, genotypeenvironment covariance, or genotypeenvironment interactions. We assume random mating among diploid individuals. α_{jk}_{.}_{i} is the additive effect of allele k at locus j associated with trait i, p_{jk} is the frequency of allele k, and
Result A1: If populations have modules with the same direction, the G matrices have common PCs for all allele frequencies.
The matrix G is a real, 2 × 2, symmetric matrix. An orthonormal matrix Q and a diagonal matrix Λ therefore exist, such that
If two modules exist (
This relation holds with the same orthonormal matrixÃ no matter what the allele frequencies in the population. Because this expression is a diagonalization of G, the uniqueness of the spectral decomposition implies that Q = Ã (up to column permutation and multiplication of columns by –1). Therefore, if
Note that in the special case where an allele at one locus goes to fixation in one of the populations, the same argument can be used to demonstrate that there will still be two common PCs if the
Result A2: If populations A and B have no modules, given heterozygote frequencies in population A, a line intersecting the region bounded by the square of possible heterozygote frequencies in population B (0 ≤ H_{j.B} ≤ 0.5) describes the frequencies that result in common PCs in G_{A} and G_{B}.
An intuitive interpretation of this result is that the number of heterozygote (allele) frequencies for which G_{A} and G_{B} have common PCs is far smaller than the number of heterozygote (allele) frequencies for which the PCs are different. For example, given heterozygote frequencies in population A (H_{1.}_{A} and H_{2.}_{A}) for every heterozygote frequency H_{1.}_{B} at the first locus in population B, a single frequency H_{2.}_{B} at the second locus produces common PCs. All other frequencies at the second locus will result in different PCs.
Assume that there are no modules in population B, such that
The constraint of (A10) makes common PCs unexpected among the G matrices of populations A and B if there are no modules. The reason is that, even if this constraint is satisfied at some point, any change in allele frequencies at one locus must be exactly balanced by a change at the other locus that preserves the ratios in (A10). The stochastic changes in allele frequencies due to mutation and genetic drift are therefore not expected to preserve the necessary ratios.
Note that, although two populations are considered in this section, the reasoning can also be extended to multiple populations. Also, the same reasoning can be used to demonstrate that, in the case where an allele at one locus goes to fixation in one of the populations or where both
APPENDIX B: THE MODEL OF CONSTRAINED PLEIOTROPY
appendix b extends the framework outlined in Result A1 and Result A2 to the model of constrained pleiotropy of Wagner (1989). The model of constrained pleiotropy assumes that all segregating alleles and all possible mutant alleles at an individual locus j have effects that fall along a single vector. In a quantitative genetic formulation, the absolute values of the additive effect vectors of any alleles k and l at a locus j are proportional: α_{jk} ∝ α_{jl}. Mutations are assumed to occur at each locus j at a rate μ_{j}. Here, n traits are being considered in all populations being compared, although the populations may have different numbers of loci. We assume random mating among diploid individuals in a population. We also assume no dominance, epistasis, disequilibrium (linkage or otherwise), maternal effects, sex linkage, genotypeenvironment covariance, or genotypeenvironment interactions.
The additive effect of allele k at some locus j for n traits is
Modules exist in a population if a subset of M loci (M < N) exists in which the allelic vectors at each of these M loci are orthogonal to the allelic vectors at each of the other N – M loci. This means that, for each α_{jk}_{(}_{M}_{)} that may occur at the M loci and each α_{jk}_{(}_{N}_{–}_{M}_{)} that may occur at the remaining N – M loci,
Result B1: For each pair of modules that populations have in common, the G matrices have a common PC with the same direction as the module, regardless of allele frequencies or effects of mutations in the populations.
In a population in which M < N loci define a module, the G matrix can be written as
Result B2: For populations A and B with no common modules, given allele frequencies in population A, the allele frequencies that result in common PCs in G_{A} and G_{B} are described by n overlapping quadratic (N_{B}J̄_{B} – n + 1)dimension planes intersecting the N_{B}J̄_{B} dimension region describing the possible allele frequencies at each of the N_{B} loci in population B.
Ψ_{(}_{x}_{)} indicates a matrix with elements ψ_{ij}, where element ψ_{xx} is a positive value, all other elements in column ψ_{x}_{–} and row ψ_{–}_{x} are zero, and all other elements may or may not be equal to zero. For example, Ψ_{(1)} is an instance of a matrix with the following form,
The constraint of (B16) makes common PCs unexpected among the G matrices of populations A and B if no modules exist. The reason is the same as for the twolocus, twoallele model. Even if the constraint is momentarily satisfied, any change in allele frequencies must be exactly balanced by changes in other allele frequencies to satisfy the constraint, and these frequencies represent a small fraction of possible allele frequencies. The stochastic changes in allele frequencies due to mutation and genetic drift are therefore not expected to preserve the constraint in (B16).
If relation (B16) is satisfied, the populations have a single PC x in common, but for n traits, n PCs may be in common. For each of these, a system of n – 1 equations of the form of Equation B16 define a quadratic (N_{B}J̄_{B} – n + 1)dimension plane (for n = 2, the systems are the same). Where these planes intersect the figure, at least one PC is in common, and where these planes overlap, there is more than one common PC. Although more traits define more planes, each plane is of correspondingly lower dimension. Therefore, as n gets larger, the ratio of the number of allele frequencies for which at least one PC is in common to the number of possible allele frequencies gets smaller.
Note that, although two populations are considered in this section, the reasoning can also be extended to multiple populations. Also, for completeness, three special cases must be considered. In the case where populations A and B have x pairs of common modules, at least x common PCs will exist, as discussed above. In this case, the frequencies in population B for which x + 1 (where x < n – 2) common PCs exist are described by the portion of an (N_{B}J̄_{B} – C_{B}) – (n + x) + 1dimension plane that intersects the region bounded by a (N_{B}J̄_{B} – C_{B})dimension figure, where C_{B} is the number of allelic vectors defining the modules in population B. Two other special cases are those in which each population has a module but the modules have different directions. If the modules are orthogonal in the space of n traits, common PCs are again described by the portion of an (N_{B}J̄_{B} – C_{B}) – (n + x) + 1dimension plane that intersects the region bounded by the (N_{B}J̄_{B} – C_{B})dimension figure. If the modules in the two populations are not orthogonal in the space of n traits, at least two PCs will exist that are not in common unless allele frequencies are such that a multiplicity of eigenvalues occurs.
Footnotes

Communicating editor: ZB. Zeng
 Received November 4, 2002.
 Accepted April 28, 2003.
 Copyright © 2003 by the Genetics Society of America