Genetics, Vol. 165, 411-425, September 2003, Copyright © 2003

Comparing G Matrices: Are Common Principal Components Informative?

Jason G. Mezeya and David Houlea
a Department of Biological Science, Florida State University, Tallahassee, Florida 32306-1100

Corresponding author: Jason G. Mezey, Florida State University, Tallahassee, Florida 32306-1100., mezey{at}bio.fsu.edu (E-mail)

Communicating editor: Z-B. ZENG


*  ABSTRACT
*TOP
*ABSTRACT
*PLEIOTROPIC MODULARITY
*TWO-LOCUS, TWO-ALLELE MODEL
*MODEL OF CONSTRAINED PLEIOTROPY
*DISCUSSION
*APPENDIX A
*APPENDIX B
*LITERATURE CITED

Common principal components (CPC) analysis is a technique for assessing whether variance-covariance matrices from different populations have similar structure. One potential application is to compare additive genetic variance-covariance matrices, G. In this article, the conditions under which G matrices are expected to have common PCs are derived for a two-locus, two-allele model and the model of constrained pleiotropy. The theory demonstrates that whether G matrices are expected to have common PCs is largely determined by whether pleiotropic effects have a modular organization. If two (or more) populations have modules and these modules have the same direction, the G matrices have a common PC, regardless of allele frequencies. In the absence of modules, common PCs exist only for very restricted combinations of allele frequencies. Together, these two results imply that, when populations are evolving, common PCs are expected only when the populations have modules in common. These results have two implications: (1) In general, G matrices will not have common PCs, and (2) when they do, these PCs indicate common modular organization. The interpretation of common PCs identified for estimates of G matrices is discussed in light of these results.


COMPARISON of additive genetic variance-covariance matrices (the G matrices) of different populations is an important goal in evolutionary quantitative genetics (STEPPAN et al. 2002 Down). Such comparisons identify commonalities (e.g., KOHN and ATCHLEY 1988 Down; PHILLIPS and ARNOLD 1999 Down; ROFF 2000 Down) or summarize differences (e.g., SHAW et al. 1995 Down; PAULSEN 1996 Down; STEPPAN 1997 Down) in the structure of G. The broad motivation behind such analysis is clear: G is a key component for predicting trait evolution under directional selection and genetic drift as well as for retroactively estimating the selection gradient (LANDE 1979 Down; LANDE and ARNOLD 1983 Down). Comparison of G therefore reveals whether differences in genetic variation may have played a role in divergent evolutionary trajectories (PRICE et al. 1993 Down). Furthermore, as the structure of G depends on pleiotropic effects of segregating alleles, commonalities may also contain information on genetic architecture shared by populations (PHILLIPS and ARNOLD 1999 Down). Comparison of G matrices has been the primary goal of many studies (ARNOLD 1981 Down; LOFSVOLD 1986 Down; KOHN and ATCHLEY 1988 Down; WILKINSON et al. 1990 Down; ATCHLEY et al. 1992 Down; BRODIE 1993 Down; SHAW et al. 1995 Down; PAULSEN 1996 Down; PODOLSKY et al. 1997 Down; STEPPAN 1997 Down; ARNOLD and PHILLIPS 1999 Down; CAMARA and PIGLIUCCI 1999 Down; BADAYAEV and HILL 2000 Down; PFRENDER and LYNCH 2000 Down; ROFF 2000 Down, ROFF 2002 Down; SERVICE 2000 Down; WALDMANN and ANDERSON 2000 Down; PHILLIPS et al. 2001 Down). The results have been used to address issues in areas as diverse as evolution of predation patterns (ARNOLD 1981 Down), covariance patterns resulting from mutation (CAMARA and PIGLIUCCI 1999 Down), and change in the G matrix itself (see, e.g., WILKINSON et al. 1990 Down; PFRENDER and LYNCH 2000 Down).

Although the utility of comparing G matrices seems clear, which methods will furnish informative conclusions is not (STEPPAN et al. 2002 Down). Difficulty arises in any case where trait number is greater than one. For n traits, the G matrix includes n(n + 1)/2 variance and covariance elements, and each of these may be larger or smaller than its corresponding element in other G matrices. Moreover, the number of factors that potentially influence the values of the n(n + 1)/2 variances and covariances is considerable, including the specific pleiotropic effects of segregating alleles, the frequency of these alleles, gametic-phase disequilibrium, nonadditive effects, and the effects of mutation. Therefore, it is not easy to define a statistic that both summarizes structure shared by G matrices and provides a clear interpretation of the implications of shared structure.

Of the many multivariate statistical techniques proposed for comparison of G matrices (reviewed in STEPPAN et al. 2002 Down), common principal components (CPC) analysis (FLURY 1987 Down, FLURY 1988 Down) is fast becoming the method of choice (ARNOLD and PHILLIPS 1999 Down; CAMARA and PIGLIUCCI 1999 Down; PHILLIPS and ARNOLD 1999 Down; PFRENDER and LYNCH 2000 Down; ROFF 2000 Down). The goal of CPC analysis is to summarize the structure of two (or more) matrices in terms of common principal components, those principal components (PCs) that have the same direction, and to test for differences from this common model (FLURY 1987 Down, FLURY 1988 Down). The statistical models of CPC arranged in order of increasing amount of common structure are the following: no similarity among matrices; PCPC(1), where matrices have a single common PC; PCPC(2); ... ; PCPC(n - 2); CPC(All); matrix proportionality; and matrix equality. The m common PCs referred to in PCPC(m) may refer to any m PCs shared among matrices, and common PCs referred to in the CPC models need not be associated with eigenvalues of the same rank. A number of approaches are possible for determining which CPC model correctly describes matrix structure (FLURY 1988 Down; PHILLIPS and ARNOLD 1999 Down). The software implementations used by almost all CPC analyses to date (PHILLIPS 1998A Down, PHILLIPS 1998B Down, PHILLIPS 1998C Down) employ a maximum-likelihood approach and a hierarchy of hypothesis tests (the Flury hierarchy) to determine whether matrices have common principal components (see PHILLIPS and ARNOLD 1999 Down for a discussion of the alternatives when implementing the Flury hierarchy).

The hierarchy of CPC models provides a valuable descriptive summary of matrix structure. However, the biological meaning of the results is unclear (HOULE et al. 2002 Down). In this article, we address the problem of interpreting common PCs by deriving the conditions under which we expect common PCs among G matrices. We perform the analysis for both a two-locus, two-allele model and the model of constrained pleiotropy (WAGNER 1989 Down). The analyses demonstrate that common PCs are expected only when pleiotropic effects are constrained to a modular organization (WAGNER and ALTENBERG 1996 Down). When populations being compared have a modular organization in common, they have a common PC. As is discussed, this latter result provides a biological interpretation of common PCs when power is sufficient to reveal differences in the direction of PCs among G matrices.


*  PLEIOTROPIC MODULARITY
*TOP
*ABSTRACT
*PLEIOTROPIC MODULARITY
*TWO-LOCUS, TWO-ALLELE MODEL
*MODEL OF CONSTRAINED PLEIOTROPY
*DISCUSSION
*APPENDIX A
*APPENDIX B
*LITERATURE CITED

WAGNER and ALTENBERG 1996 Down defined a modular organization as a case in which "pleiotropic effects of the genes fall mainly among members of the same character complex and less frequently between members of different complexes" (p. 971). A modular organization is therefore defined in terms of pleiotropic effects when considering a specific set of traits. In this article, we address an extreme case of modular organization, in which x > 2 nonoverlapping sets of pleiotropic effects can be defined in which all pleiotropic effects in a set are orthogonal (oriented at 90°) to all other pleiotropic effects. Each of these x sets is a "perfect" module in the sense that, for the n measured traits, an orthogonal rotation can cause the pleiotropic effects of each module to fall entirely on a subset of the new axes and not at all on new axes affected by the other modules. Distinct populations have a perfect module in common if the same orthogonal rotation also results in perfect modules. The hypothetical case of perfect modules is used to illustrate the point that considerable constraints on pleiotropic effects are required for common PCs to be expected among G matrices.

Modularity can be defined both in terms of pleiotropic effects of alleles segregating in a population and in terms of the pleiotropic effects that may be introduced into the population by mutation (WAGNER and ALTENBERG 1996 Down). Modularities at these two levels are clearly related. If the pleiotropic distribution of possible mutations is modular, then the pleiotropic effects of segregating alleles will also be modular. Although cases are possible where segregating variation is modular while the distribution of mutations is not, such cases are expected to be transient because mutations introduce nonmodular variation (WAGNER and ALTENBERG 1996 Down; MEZEY et al. 2000 Down). Modularity of segregating variation is therefore expected to be a strong indicator of modularity in the distribution of mutations. Our treatment concerns cases in which the distribution of mutations is modular.


*  TWO-LOCUS, TWO-ALLELE MODEL
*TOP
*ABSTRACT
*PLEIOTROPIC MODULARITY
*TWO-LOCUS, TWO-ALLELE MODEL
*MODEL OF CONSTRAINED PLEIOTROPY
*DISCUSSION
*APPENDIX A
*APPENDIX B
*LITERATURE CITED

The goal of discussing this simple model is to provide an intuitive illustration of the relationship between modules and common PCs that also applies to the more general model of constrained pleiotropy (WAGNER 1989 Down). In a population, all genetic variation in n = 2 traits is assumed to be determined entirely by alleles segregating at the N = 2 loci. We assume complete additivity of allelic effects (no dominance or epistasis), no disequilibrium (gametic-phase or otherwise), no maternal effects, no sex linkage, and no genotype-environment covariance or genotype-environment interactions. We also assume random mating among diploid individuals. In this case, the additive effect on the traits associated with allele k at locus j can be expressed as a vector,

(1)

where the ith element of the vector {alpha}jk.i is the additive effect of allele k associated with trait i (LYNCH and WALSH 1998 Down). This model can also be expressed in terms of the average effect of an allelic substitution at locus j:

(2)

The relationship between these two vectors is pjkj.i = {alpha}jk.i, where pjk is the frequency of allele k at locus j. Because there are two alleles at each locus and no nonadditive effects, each j is constant, and the direction of each {alpha}jk is constant where {alpha}j1 {propto} -{alpha}j2. An allelic substitution at locus j has a pleiotropic effect in this model if both of the elements of the allelic vectors are nonzero: j.i != 0, {alpha}jk.i != 0. In this model, forward and backward mutations occur at each locus j at the same rate µj, where a mutation changes an allele's identity to the other possible allelic state. The structure of the G matrix depends on the j (the {alpha}jk) vectors and on the allele frequencies at the two loci (Appendix A). Note that, in the following, we assume that G can be estimated without error. We return to sampling issues in the DISCUSSION.

In this two-locus, two-allele model, existence of a module depends on the orientation of the allelic vectors associated with each locus. If the allelic vectors at one locus are orthogonal to the allelic vectors at the other locus, two perfect modules are present, because the pleiotropic effects can be divided into two groups that do not have overlapping effects. As an example, consider the case diagrammed in Fig 1A.1, where the allelic vectors of the loci are orthogonal to one another. Rotating the trait axes to the direction of the allelic vectors associated with each locus produces two new traits, f1 and f2, where the effects of allelic substitutions at each of the two loci are limited entirely to one of the two traits. Both of these new traits, f1 and f2, therefore define perfect modules. Fig 1B.1 diagrams a case without perfect modules. Because the allelic vectors are not orthogonal, two modules cannot be defined by a rotation of the axes. However the axes are rotated, allelic substitutions at both loci have effects on both new traits. Note that a modular organization is possible in Fig 1B.1 if a nonorthogonal rotation is used, but such transformations do not result in modules in an evolutionary sense; i.e., directional selection cannot be applied to such a "module" without resulting in a correlated response. Such nonorthogonal modules will be the subject of another article (J. G. MEZEY and D. HOULE, unpublished results). We confine the discussion here to modules that can be defined by rotations of the trait axes.



View larger version (19K):
In this window
In a new window
Download PPT slide
 
Figure 1. Cases of (a.1) two modules and (b.1) no modules for the two-locus, two-allele model. The j are vectors of the average effect of an allelic substitution at locus j. The orientation of the j reflects the relative effects on the first and second traits (x-axis and y-axis, respectively). Two modules exist in a.1 because 1 is orthogonal to 2, and no modules exist in b.1 because the vectors are not orthogonal. The diagrams of a.2 and b.2 are the G matrices and PCs for different allele frequencies if we assume the vectors of a.1 and b.1, respectively. PC1 and PC2 indicate the PCs corresponding to the larger and smaller eigenvalues, respectively, where length of the PC reflects the relative size of the eigenvalue. Note that in a.2, different allele frequencies lead to different forms of G, but the direction of the PCs is the same. In contrast, in b.2, the different allele frequencies lead to different forms of G and different directions of PCs.

In this two-locus, two-allele model, the existence of modules places a major constraint on the possible orientations of G matrix PCs. When modules exist, the PCs of the G matrix have the direction of the modules, regardless of allele frequencies and changes in allele frequencies (Appendix A). To visualize this relationship between PCs and modules, again consider Fig 1. Fig 1A.2 diagrams the G matrix and PCs associated with two populations, both of which have the modules diagrammed in Fig 1A.1. The populations have different allele frequencies at the two loci, and as a result, the G matrices of the populations differ. Although the PCs of G are associated with different eigenvalues, the PCs have the same direction as the modules in both populations. Further, all variation attributable to the allelic substitutions defining an individual module is described by a single PC and its associated eigenvalue. Contrast this case with that diagrammed in Fig 1B.2, which diagrams G and the PCs for two populations where allelic vectors are described by Fig 1B.1. In this case, the different allele frequencies correspond to different structures of G and PCs that have different directions. In such cases, there is no simple relationship between PCs and the allelic effects associated with each locus.

For an individual population, the directions of G matrix PCs are always the same regardless of allele frequencies only if perfect modules exist (Appendix A). Therefore, if distinct populations have such modules in common (the modules have the same direction), the G matrices of the populations will always have common PCs, regardless of allele frequencies. Note that this relationship depends entirely on the direction of the modules and not the specific allelic effects defining the modules. Populations with different allelic effects at the two loci always have common PCs as long as both populations have modules in the same direction. In contrast, if the populations being compared have no modules, only a restricted subset of allele frequencies results in common PCs (Appendix A), even if the allelic vectors are the same in the populations being compared.

The cases diagrammed in Fig 2 illustrate these concepts. Fig 2A diagrams two populations (A and B) that have modules in common. For these populations, Fig 2A.1 diagrams in gray the allele (heterozygote) frequencies in population A that result in common PCs, given fixed allele frequencies in population B. Fig 2A.2 provides the equivalent diagram for population B given fixed allele frequencies in population A. Note that every possible allele frequency results in common PCs, regardless of the allele frequency in the other population. Contrast this situation with the case diagrammed in Fig 2B.1, where the populations have the same allelic effect vectors, but no modules are present. For these populations, the only allele frequencies for which common PCs occur are described by the dashed and dotted lines in Fig 2B.1 and b.2, respectively. The allele frequencies that do not fall on these lines result in no common PCs. Therefore, only a very constrained set of allele frequencies results in common PCs when no modules are present.



View larger version (26K):
In this window
In a new window
Download PPT slide
 
Figure 2. Allele (expressed as heterozygote) frequencies for which populations A and B have the same PCs. (a) Populations A and B have common modules. a.1 depicts in gray the heterozygote frequencies for which population A has the same PCs as population B, fixing the heterozygote frequencies in population B at H1.B = 0.3 for locus 1 and H2.B = 0.2 for locus 2 (the black diamond indicates the frequencies in population B). a.2 is the corresponding graph for population B, fixing the heterozygote frequency in population A at H1.B = 0.1 and H2.B = 0.4. Note that in both cases any heterozygote frequencies result in the same PCs. (b) Populations A and B have no modules. b.1 and b.2 depict the heterozygote frequencies for which PCs will be the same when heterozygote frequencies are fixed in the other population. In this case, the heterozygote frequency must fall on the lines in b.1 and b.2 for the two populations to have the same PCs.

The implication of these results is that, when comparing evolving populations, we should not expect common PCs unless there are common modules. Without modules, the allele frequencies required for common PCs are so constrained that they are unlikely to occur given the stochastic effects of mutation and genetic drift. As an example, consider a case in which populations A and B have the same allelic vectors but no modules exist (as in Fig 2A.1). As demonstrated in Appendix A, common PCs occur in these two populations when the following constraint is satisfied,

(3)

where Hj.P = 2pjkpjl is the heterozygote frequency at locus j in population P and pjk and pjl are the frequencies of the alleles k and l. Note that, even if this constraint is satisfied at some point, any change in allele frequencies at the loci in one of the populations must be exactly balanced by a specific change in allele frequencies in the other population to preserve the ratios in (3). Any other changes in allele frequencies in the other population result in no common PCs. Stochastic changes are therefore not expected to preserve the necessary ratios. Of course, situations can be constructed in which the probability of common PCs is high even in the absence of modules. For example, infinite populations with the same allelic-effect vectors that have reached the same mutation-selection equilibrium would be such a case, but barring such extreme conditions, we should generally expect common PCs only when populations have common modules.

Fig 3 provides a summary of the four possible cases that can arise when two populations are compared for the two-locus, two-allele model: (1) The populations have modules in common (Fig 3A), (2) both populations have modules but the directions are different (Fig 3B), (3) one population has a module and the other does not (Fig 3C), and (4) neither population has modules (Fig 3D). Only the case in Fig 3A will always have common PCs. For the cases in Fig 3B–D, the vast majority of allele frequencies will result in no common PCs, and we should not expect to find common PCs when the populations are evolving.



View larger version (13K):
In this window
In a new window
Download PPT slide
 
Figure 3. Four cases that may arise in a comparison of two populations described by the two-locus, two-allele model: (a) The populations have common modules; (b) the populations each have modules but have no common modules; (c) population A has modules and population B has no modules; (d) neither population has modules.

Note that, if the allelic vectors in the two populations approximate a perfectly modular case (they are almost but not quite 90°), only very constrained allele frequencies result in common PCs as in Fig 3D. This result may seem strange. The reason for it is that the PCs in each G matrix must have exactly the same direction for common PCs to exist. In the absence of perfect modules, the vast majority of allele frequencies result in slight differences in the directions of the PCs in the populations and therefore in no common PCs. This is not to say that we would be able to determine that the PCs are different in such a case when analyzing estimates of the G matrices. The effects of sample size will tend to obscure such subtle differences, so cases that approximate perfect modules will be indistinguishable from perfect modules in practice. We return to this issue of how sample size affects the expectation of finding common PCs in the DISCUSSION.


*  MODEL OF CONSTRAINED PLEIOTROPY
*TOP
*ABSTRACT
*PLEIOTROPIC MODULARITY
*TWO-LOCUS, TWO-ALLELE MODEL
*MODEL OF CONSTRAINED PLEIOTROPY
*DISCUSSION
*APPENDIX A
*APPENDIX B
*LITERATURE CITED

Constraints on pleiotropic effects are the key to whether common PCs are expected. The model of constrained pleiotropy (WAGNER 1989 Down) formalizes a type of constraint that can result in common PCs. The conceptual underpinning of the model is the assumption that allelic variation at a given locus additively affects variation in a physiological property associated with a gene product of the locus. The relationship between variation in the property and the genetic variation in n phenotypic traits is assumed to be linear and is expressed as a matrix transformation (hence the model is sometimes referred to as the "B-matrix" model). These assumptions constrain mutations at a given locus to have the same pleiotropic effects on the n traits, although the magnitudes of the pleiotropic effects associated with particular allelic substitutions at the locus may differ. In this way, the model differs from the more general additive pleiotropic model presented by LANDE 1980 Down, where the alleles at a locus may have different pleiotropic effects.

In a quantitative genetic formulation, the model of constrained pleiotropy makes the assumption that the absolute values of the additive-effect vectors of any alleles k and l at a locus j are proportional:

(4)

Any number of alleles may be segregating at each locus, and mutations may introduce new alleles at a locus, although the effects of all alleles conform to the constraint of Equation 4. The structure of the G matrix depends on the effects and frequencies of the alleles segregating in a population. As in the two-locus, two-allele model, we consider the exact structure of G and assume no disequilibrium (gametic-phase or otherwise), no maternal effects, no sex linkage, no genotype-environment covariance or genotype-environment interactions, and random mating among diploid individuals.

In the model of constrained pleiotropy, modules exist if, for the N loci that may result in genetic variation in n traits, a subset of M loci (M < N) can be defined where the allelic vectors at each of these M loci are orthogonal to the allelic vectors at each of the other N - M loci. In this case, for each {alpha}jk(M) that may occur at the M loci and each {alpha}jk(N-M) that may occur at the remaining N - M loci,

(5)

Each subset M defines a module because new traits can be defined by a rotation of the n trait axes where genetic variation in the new traits affected by the M loci is independent of genetic variation in the rest of the new traits. Note that variation associated with a module M need not fall along a single vector. Modules may therefore be multidimensional, but the relationship of such higher-dimensional modules to the PCs of the G matrix is more complicated. In this article, we restrict the discussion to modules in which the allelic vectors at all of the M loci defining a module have the same direction; i.e., |{alpha}jk(M)| {propto} |{alpha}mk(M)| for all loci j, m in M. We consider higher-dimensional modules in another article (J. G. MEZEY and D. HOULE, unpublished results).

Just as in the two-locus, two-allele model, when a one-dimensional module exists in a population, a PC with the same direction as the module will exist regardless of allele frequencies (Appendix B). Therefore, when populations have a module in common, they will always have a common PC with the same direction as the modules, regardless of allele frequencies in the populations. Also as in the two-locus two-allele model, if the populations do not have a one-dimensional module in common, very restricted allele frequencies are required for common PCs to exist (Appendix B). In the model of constrained pleiotropy, x common modules can exist, 0 < x <= n, when n traits are considered. The same reasoning applies to such cases: If populations have x modules in common, 0 < x <= n, at least x (excluding n - 1) common PCs will exist, and very restricted allele frequencies in the two populations will result in more than x common PCs (Appendix B).

Fig 4 illustrates these concepts. It diagrams three different possibilities that may arise when two populations (A and B) are compared when n = 3. In Fig 4A, the two populations have three modules in common. The G matrices of these populations will always have three common PCs; i.e., all PCs will be common PCs. Note that even if both A and B had three modules but the modules had different directions in the two populations, only very restricted allele frequencies would result in common PCs (Appendix B). In Fig 4B, the two populations have a single one-dimensional module in common. In this case, the G matrices will always have one PC in common, although the PC may be associated with different eigenvalues in the two populations. For there to be more than a single common PC in case 4b, very restricted allele frequencies are required in the two populations. In Fig 4C, neither population has any modules. Again, only very restricted combinations of allele frequencies would yield common PCs.



View larger version (20K):
In this window
In a new window
Download PPT slide
 
Figure 4. Three cases comparing populations (A and B) where variation in three traits (T.1, T.2, and T.3) is described by the model of constrained pleiotropy. Each {gamma}j is a vector in the direction of additive allelic-effect vectors associated with a locus j. Populations A and B have (a) three common modules, (b) a single common module, and (c) no modules.

In summary, when comparing evolving populations with x common one-dimensional modules, we expect to find exactly x common PCs. The stochastic effects of mutation and genetic drift are very likely to result in allele frequencies where the other PCs differ in their orientiations (Appendix B).


*  DISCUSSION
*TOP
*ABSTRACT
*PLEIOTROPIC MODULARITY
*TWO-LOCUS, TWO-ALLELE MODEL
*MODEL OF CONSTRAINED PLEIOTROPY
*DISCUSSION
*APPENDIX A
*APPENDIX B
*LITERATURE CITED

The goal of the theory developed in this article is to assess whether the CPC model that is the basis of the CPC analysis can be informative for comparing G matrices beyond a descriptive summary of matrix similarity. When assessed solely from this perspective, the results are quite positive. Because of the close relationship between common PCs and modular structure, when common PCs do exist they have a biological interpretation: Common PCs indicate the existence of common modules. The intuition that common PCs have a biologically meaningful interpretation is therefore well founded (PHILLIPS and ARNOLD 1999 Down).

The modular structure that is sufficient to create common PCs is quite restrictive. It requires that the genetic effects of some set of loci be orthogonal to those of all other segregating loci. This requirement is equivalent to the requirement that some rotation of the axes in phenotype space that produces traits that are independent of all other traits exists. Given the general assumption that pleiotropy is ubiquitous, which we share, the existence of such extreme modules seems somewhat unlikely. Thus, we expect that the form of modular structure and therefore common PCs is unusual. This is not to say that cases approximating modular organizations are expected to be so rare that the possibility of their existence should be discounted. As discussed by a number of authors (WAGNER and ALTENBERG 1996 Down; CHEVERUD et al. 1997 Down; RICE 2000 Down), pleiotropic distributions that approximate the perfect case (i.e., where pleiotropic effects are "mainly" limited to a particular subset of traits) are not necessarily unexpected, particularly when appropriate sets of traits are considered.

How are we to reconcile these results with those of studies that have applied CPC analysis to G matrices and reported many common PCs? For example, ARNOLD and PHILLIPS 1999 Down compared G matrices for six morphological traits for two populations (inland and coastal) of the garter snake Thamnophis elegans. CPC analyses were performed for all possible pairwise comparisons of G estimated for both males and females in both populations. For almost all comparisons, the CPC model CPC(All) could not be rejected. PFRENDER and LYNCH 2000 Down estimated G for life-history traits for a population of Daphnia pulex at four different times. CPC analyses were performed for pairwise comparisons among three of these matrices. CPC models including at least one common PC could not be rejected for each of these comparisons.

If the intuition that common modules should be rare is correct, the most likely explanation is that the power to detect differences in the direction of matrix PCs is low for the sample sizes commonly used in estimates of G. This explanation seems particularly likely given the results of HOULE et al. 2002 Down. For example, in the HOULE et al. 2002 Down study, matrices were simulated using an additive factor model in which the angle between the directions of the second PCs (corresponding to the second eigenvalue) in two matrices was altered. CPC analysis using the software of PHILLIPS 1998C Down was performed on 100 pairs of estimates of the matrices, with a sample size of 300. For differences in direction of up to 6°, both the jump-up and Akaike information criterion approaches indicated the CPC model equality (all PCs are in common) for as many as 50% of the comparisons. Because the sample sizes for most estimates of G are not large (STEPPAN et al. 2002 Down), this result indicates that CPC analysis may be indicating more common PCs than actually exist as a result of low sample sizes.

One reason for the inability of CPC analysis to distinguish distinct PCs when sample sizes are low may be the way that position in the Flury hierarchy is assessed (PHILLIPS and ARNOLD 1999 Down). For example, the decision to move up in the Flury hierarchy in the jump-up approach advocated by PHILLIPS and ARNOLD 1999 Down is based on being unable to reject a hypothesis of common PCs vs. a hypothesis that no common PCs exist. One moves up in the hierarchy until a hypothesis of x common PCs can be rejected. As PHILLIPS and ARNOLD 1999 Down stated, such a result should not be interpreted as demonstrating that the matrices have x - 1 common PCs, only that the presence of x - 1 common PCs cannot be rejected. It is tempting, however, to interpret stopping position as a reflection of matrix similarity. In fact, the results of a CPC analysis reflect both matrix similarity and statistical power.

The sensitivity of CPC results to sample size means that, in practice, we cannot necessarily interpret common PCs as a demonstration of common modules. However, CPC analysis could be a useful tool for indicating which sets of traits are likely to have a modular organization, particularly if methods for assessing confidence in the existence of common PCs could be developed. We would not expect to have high confidence in a common PC among G matrices unless the populations have approximately modular organizations in common. The existence of modules would always have to be confirmed by independent means, because even without common modular organization, the allele frequencies required to produce a true common PC among G matrices could have occurred by chance.

In the context of identifying which sets of traits may have a modular organization, the reordering option available in the CPC analysis software of PHILLIPS 1998A Down, PHILLIPS 1998B Down, PHILLIPS 1998C Down is valuable. The default is that the program estimates the model CPC(All) for the combined data and builds the common PC models of the Flury hierarchy [PCPC(1), PCPC(2), etc.], using the common PCs of CPC(All) in rank order according to the size of their eigenvalues. The reordering option allows the user to designate a different ordering scheme. This flexibility is useful in a case where matrices have a common PC but the common PC is not associated with eigenvalues first in the default rank ordering. In such a case, the default generally causes CPC analysis to indicate no similarity among the matrices (HOULE et al. 2002 Down), but if the reordering option is used to place the true common PC first in the ordering scheme, CPC analysis should indicate that a common PC exists.

The possibility that CPC analysis could be developed for the detection of modules is a particularly exciting prospect because modules have clearly defined genetic and evolutionary properties. For example, from a genetics perspective, modules represent a specific constraint on how variation at the gene level is related to variation in the phenotype (BONNER 1988 Down; CHEVERUD et al. 1997 Down; MEZEY et al. 2000 Down). When there is modular organization, the effects associated with groups of genes are entirely limited to distinct aspects of the phenotype. From an evolutionary perspective, modules are units in the sense that variation in the traits defining a module will be uncorrelated with variation in other traits, given appropriate assumptions about random mating and no gametic-phase disequilibrium (MAGWENE 2001 Down). Because selection can act on traits in a module without causing a correlated response in traits outside the module, identification of modules provides a foundation for constructing hypotheses about the evolutionary properties of a population that could be tested experimentally. Modular organization also plays an important conceptual role in relation to the evolvability of a genetic system (WAGNER and ALTENBERG 1996 Down; WAGNER and MEZEY 2003 Down). It is therefore of interest to identify whether and to what degree cases of approximate modularity exist in nature.

In conclusion, our results suggest (1) that common PCs are unlikely without modular organization and (2) that there is a biological interpretation of common PCs and a possible role for common PCs in the identification of modular organization. In both cases, interpretation of common PCs will be stymied until a systematic study of the sensitivity of CPC analysis to sample size is performed. If this problem could be addressed, CPC analysis of G matrices could provide biologically useful insight beyond a summary of matrix structure. In this role, CPC analysis could be particularly useful for addressing questions that require a relatively complete picture of genetic architecture: Do modules correspond to functional architectures (HOULE et al. 2002 Down; STEPPAN et al. 2002 Down)? To what extent is the structure of the G matrix constrained (TURELLI 1988 Down)? How modular is the G-P map (WAGNER 1996 Down)?


*  ACKNOWLEDGMENTS

We thank Kyle Galivan, Thomas F. Hansen, Frances C. James, Eric Klassen, Joseph Travis, Zhao-Bang Zeng, and two anonymous reviewers for their comments on this manuscript. This work was supported by National Science Foundation grant no. 0129219.

Manuscript received November 4, 2002; Accepted for publication April 28, 2003.


*  APPENDIX A
*TOP
*ABSTRACT
*PLEIOTROPIC MODULARITY
*TWO-LOCUS, TWO-ALLELE MODEL
*MODEL OF CONSTRAINED PLEIOTROPY
*DISCUSSION
*APPENDIX A
*APPENDIX B
*LITERATURE CITED

TWO-LOCUS, TWO-ALLELE MODEL
It is assumed that the entirety of the genetic variation in n = 2 traits is determined by alleles segregating at N = 2 loci where only two alleles are possible at each locus. Forward and backward mutations occur at locus j at the same rate, µj. We assume no dominance, epistasis, disequilibrium (linkage or otherwise), maternal effects, sex linkage, genotype-environment covariance, or genotype-environment interactions. We assume random mating among diploid individuals. {alpha}jk.i is the additive effect of allele k at locus j associated with trait i, pjk is the frequency of allele k, and j.i is the average effect of an allelic substitution at locus j on trait i such that pjkj.i = {alpha}jk.i (LYNCH and WALSH 1998 Down). Designating Hj = 2pjkpjl as the heterozygote frequency at locus j (0 <= Hj <= 0.5), the G matrix can be written as

(A1)

Note that, under the assumption of no nonadditive effects, the j.i are constant, so the structure of G is a function of the allele frequencies, which may change as a result of mutation, selection, or genetic drift. A population is defined as having two modules if the vectors describing the average effect of an allelic substitution are orthogonal: , where j = [j.1, j.2]. In such a case, each j defines a module (see text). The existence of modules can also be written as for all alleles k and l at the two loci where {alpha}jk = [{alpha}jk.1, {alpha}jk.2]. If these conditions do not apply, no modules exist. Below, we assume that G has no multiplicity of eigenvalues and is of full rank unless noted. This latter assumption requires that all pjk > 0 and that 1 and 2 have different directions.

Result A1:
If populations have modules with the same direction, the G matrices have common PCs for all allele frequencies.

The matrix G is a real, 2 x 2, symmetric matrix. An orthonormal matrix Q and a diagonal matrix {Lambda} therefore exist, such that

(A2)

where each column vector q of Q is a PC, an eigenvector, of G (q1 {perp} q2 = 0 and ) and each diagonal element of {Lambda} ({lambda}1 and {lambda}2) is an eigenvalue. In the absence of a multiplicity of eigenvalues, the spectral decomposition of G exists and is unique. In this case, no other matrix Q, defined up to the multiplication of columns by -1 and column permutation, produces a diagonalization of G (FLURY 1988 Down).

If two modules exist (1 {perp} 2), the matrix à can be defined as

(A3)

where . Note that à is an orthonormal matrix with column vectors that have the same direction as 1 and 2. Also define the diagonal matrix D with diagonal elements dj = Hj||j||2. The matrix G can be written as

(A4)

This relation holds with the same orthonormal matrix à no matter what the allele frequencies in the population. Because this expression is a diagonalization of G, the uniqueness of the spectral decomposition implies that Q = à (up to column permutation and multiplication of columns by -1). Therefore, if 1 {perp} 2, the matrix G has PCs with the same direction as the modules (i.e., the same direction as 1 and 2) regardless of allele frequencies. Also, each eigenvalue {lambda}j is a function of the allele frequency at a single locus: {lambda}j = Hj||j||2. Therefore, in a population with modules, each PC accounts for the entirety of the variation attributable to a single module, which in this case is defined by allelic variation at a single locus. Because the G of a population with modules has the same PCs regardless of allele frequencies, if other populations have modules with the same direction, the G matrices of the populations will have two common PCs (i.e., both PCs will have the same direction) although the eigenvalues associated with the PCs in the two populations may differ.

Note that in the special case where an allele at one locus goes to fixation in one of the populations, the same argument can be used to demonstrate that there will still be two common PCs if the j at the other locus has the same direction as a module in the other population. In such a case, a zero eigenvalue will be associated with one of the PCs in the population with the fixed allele. Similarly, if both j in one population have the same direction, if the direction is the same as that of a module in the other populations, common PCs will still exist. These results also hold in the case where one or several of the G matrices have a multiplicity of eigenvalues. The reason is that the common PC model framework handles such cases where the eigenvector matrix is not unique by choosing the direction of the PCs to correspond to the PCs of other matrices (if possible). Also note that, if populations have modules with different directions, they will never have common PCs, unless allele frequencies are such that a multiplicity of eigenvalues exists. Because the approach used in Result A2 can be used to demonstrate that a multiplicity of eigenvalues in the G matrix occurs only for a very restricted set of the possible allele frequencies, common PCs are not expected when modules are not in common.

Result A2:
If populations A and B have no modules, given heterozygote frequencies in population A, a line intersecting the region bounded by the square of possible heterozygote frequencies in population B (0 <= Hj.B <= 0.5) describes the frequencies that result in common PCs in GA and GB.

An intuitive interpretation of this result is that the number of heterozygote (allele) frequencies for which GA and GB have common PCs is far smaller than the number of heterozygote (allele) frequencies for which the PCs are different. For example, given heterozygote frequencies in population A (H1.A and H2.A) for every heterozygote frequency H1.B at the first locus in population B, a single frequency H2.B at the second locus produces common PCs. All other frequencies at the second locus will result in different PCs.

Assume that there are no modules in population B, such that T1.B2.B != 0. Fix H1.A and H2.A between 0 and 0.5 and assume that alleles are segregating at both loci in population B. Define the PC matrices of GA and GB as QA and QB. By the spectral theorem, if population B has the same PCs as population A, then QB = QA (up to column permutation and multiplication of columns by -1), and we can write

(A5)

From (A1), we can rewrite GB as

(A6)

Therefore, for population B to have the same PCs as population A, the following relation must be satisfied:

(A7)

The off-diagonal elements of the matrix on the left side of (A7) are the same and are equal to zero elements in the matrix on the right side:

(A8)

where j.i.B is the average effect of an allelic substitution at locus j on trait i in population B and qjk.A is element k of column j of matrix QA. Note that if we assume population B has no modules, T1.B2.B != 0, and because , if the term in (A8) associated with either H1.B is zero, the other term associated with H2.B is positive. In such a case (A8) can be satisfied only if H2.B = 0 (and vice versa). Because we are currently concerned with common PCs and assume alleles are segregating at all loci, it is the case that

(A9)

Given (A8) and (A9), for population B to have the same PCs as population A, the heterozygote frequencies at the two loci in population B must satisfy the following relationship:

(A10)

Note that a single set of PCs is associated with each pair of heterozygote frequencies, so if (A10) holds for H1.B and H2.B, the equations defined by the diagonal elements of (A7) are satisfied by the eigenvalues of GB corresponding to H1.B and H2.B:

(A11)

Therefore, only when the heterozygote frequencies in population B satisfy (A10) are the PCs of GB in the same direction as the PCs of GA (i.e., two PCs are in common). These heterozygote frequencies can be visualized as falling on a one-dimensional "plane" that cuts through the region bounded by the square of possible heterozygote frequencies in population B where 0 <= Hj.B <= 0.5. The ratio of the number of heterozygote frequencies for which common PCs occur to all possible heterozygote frequencies is small. Therefore, the vast majority of the possible allele frequencies in population B result in different PCs in the two populations and similarly for population A when heterozygote frequencies in population B are held constant. Under the special case in which the same alleles are segregating in both populations (1.A = 1.B and 2.A = 2.B), common PCs occur only when

(A12)

which happens when GA {propto} GB.

The constraint of (A10) makes common PCs unexpected among the G matrices of populations A and B if there are no modules. The reason is that, even if this constraint is satisfied at some point, any change in allele frequencies at one locus must be exactly balanced by a change at the other locus that preserves the ratios in (A10). The stochastic changes in allele frequencies due to mutation and genetic drift are therefore not expected to preserve the necessary ratios.

Note that, although two populations are considered in this section, the reasoning can also be extended to multiple populations. Also, the same reasoning can be used to demonstrate that, in the case where an allele at one locus goes to fixation in one of the populations or where both j in one population have the same direction, the allele frequencies required for common PCs are highly constrained in the same fashion. The case of multiplicity of eigenvalues does not occur in the special case of the two-locus, two-allele model when the j in a population are not orthogonal.


*  APPENDIX B
*TOP
*ABSTRACT
*PLEIOTROPIC MODULARITY
*TWO-LOCUS, TWO-ALLELE MODEL
*MODEL OF CONSTRAINED PLEIOTROPY
*DISCUSSION
*APPENDIX A
*APPENDIX B
*LITERATURE CITED

THE MODEL OF CONSTRAINED PLEIOTROPY
Appendix B extends the framework outlined in Result A1 and Result A2 to the model of constrained pleiotropy of WAGNER 1989 Down. The model of constrained pleiotropy assumes that all segregating alleles and all possible mutant alleles at an individual locus j have effects that fall along a single vector. In a quantitative genetic formulation, the absolute values of the additive effect vectors of any alleles k and l at a locus j are proportional: |{alpha}jk| {propto} |{alpha}jl|. Mutations are assumed to occur at each locus j at a rate µj. Here, n traits are being considered in all populations being compared, although the populations may have different numbers of loci. We assume random mating among diploid individuals in a population. We also assume no dominance, epistasis, disequilibrium (linkage or otherwise), maternal effects, sex linkage, genotype-environment covariance, or genotype-environment interactions.

The additive effect of allele k at some locus j for n traits is

(B1)

where pjl is the frequency of allele l, Jj is the number of alleles at locus j, each entry of gkl = [gkl.1, ... , gkl.n] is the mean phenotype of trait i given alleles k, l at locus j, and each entry of µg = [µg.1, ... , µg.n] is the mean genotypic value of trait i (LYNCH and WALSH 1998 Down). In the absence of dominance gkl = (gkk + gll)/2, and making this substitution into (B1), we can write the additive effect of an allele as follows:

(B2)


(B3)

Under the assumption of constrained pleiotropy, the genotypic values associated with a locus are proportional and this condition requires that gkl {propto} gqr for all alleles k, l, q, r at locus j. We can therefore write

(B4)

where {gamma}j = [{gamma}j.1, ... , {gamma}j.n] is the unit scaled vector in the direction of the pleiotropic effect associated with locus j, and the {Gamma} are scalars. In the model of constrained pleiotropy, each of the {gamma}j are constant. Note that under the assumed conditions the G matrix can be written

(B5)

Setting and we can write G as

(B6)

Below, we assume that G has no multiplicity of eigenvalues and is of full rank unless noted.

Modules exist in a population if a subset of M loci (M < N) exists in which the allelic vectors at each of these M loci are orthogonal to the allelic vectors at each of the other N - M loci. This means that, for each {alpha}jk(M) that may occur at the M loci and each {alpha}jk(N-M) that may occur at the remaining N - M loci,

(B7)

If no subsets of M loci satisfy this relationship, a module does not exist. Note that in the following, we are concerned only with modules that are one-dimensional where |{alpha}jk(M)| {propto} |{alpha}mk(M)| for all loci j, m in a subset of M loci that define a module.

Result B1:
For each pair of modules that populations have in common, the G matrices have a common PC with the same direction as the module, regardless of allele frequencies or effects of mutations in the populations.

In a population in which M < N loci define a module, the G matrix can be written as

(B8)

where the first summation is over the M loci defining the module and the second is over the remaining N - M loci. Each summation term is itself a matrix:

(B9)

Because we have assumed that G is of full rank, at least two alleles are segregating at at least one locus in M. If so, regardless of the specific set of alleles and the frequency of alleles in the population, the allelic vectors defining the module span a one-dimensional space in n, and the rest of the allelic vectors span an n - 1-dimensional space that is orthogonal. Correspondingly, the matrix GM is of rank 1 and GN-M is of rank n - 1. The spectral decomposition of GN-M produces a zero eigenvalue:

(B10)

Because the allelic vectors of the N - M loci span the n - 1 space, the eigenvector corresponding to the zero eigenvalue is orthogonal to the n - 1-dimensional space and has the same direction as the module regardless of allele frequencies or the effects of mutation in the population. Applying QN-M to the matrix GM produces

(B11)

where {lambda}M corresponds to the eigenvector with the same direction as the module. The orthonormal matrix QN-M therefore diagonalizes G,

(B12)

and is unique up to column permutation and multiplication of columns by -1, by the spectral theorem. Therefore a PC of G that has the same direction as the module always exists, and from (B11), this PC also accounts for the entirety of the variation segregating at the loci defining the module. Because the population has a PC that always corresponds to the module regardless of allele frequencies, if several populations have a module with the same direction, these populations will always have a common PC with the same direction as the module. The argument also holds for any number of modules (up to n), so a common PC occurs in the G matrices corresponding to each pair of modules that the populations have in common. Note that the same argument can be used to demonstrate that populations with modules in common will have common PCs corresponding to the modules even when the G matrix is not of full rank. Similarly, as explained for the two-locus, two-allele model, common PCs will occur when common modules do, even if G has a multiplicity of eigenvalues.

Result B2:
For populations A and B with no common modules, given allele frequencies in population A, the allele frequencies that result in common PCs in GA and GB are described by n overlapping quadratic (NB B - n + 1)-dimension planes intersecting the NB B-dimension region describing the possible allele frequencies at each of the NB loci in population B.

{Psi}(x) indicates a matrix with elements {psi}ij, where element {psi}xx is a positive value, all other elements in column {psi}x- and row {psi}-x are zero, and all other elements may or may not be equal to zero. For example, {Psi}(1) is an instance of a matrix with the following form,

(B13)

where each {psi}ij except {psi}11 may or may not be equal to zero. In this notation, if populations A and B have PC x in common, the following relation holds:

(B14)

We consider the GA and GB associated with populations A and B at a given point in time. The populations segregate for NA and NB loci, respectively. By the spectral theorem and (B6), for population B to have PC x in common with population A, the following relation must be satisfied:

(B15)

Under the definition of Lj above, the off-diagonal elements of column (or row) x of the matrix on the left side of this relation define a system of n - 1 equations of the form

(B16)

for 0 < i <= n, i != x. Note that each {kappa}k is a linear function of the allele frequencies at locus j, excluding allele k. The highest-order terms of (B16) are therefore quadratic. The allele frequencies in population B that satisfy the