Genetics, Vol. 153, 2001-2012, December 1999, Copyright © 1999

Fluxes and Metabolic Pools as Model Traits for Quantitative Genetics. I. The L-Shaped Distribution of Gene Effects

Bruno Bosta, Christine Dillmanna,b, and Dominique de Viennea
a Station de Génétique Végétale, INRA/UPS/INAPG, Ferme du Moulon, 91190 Gif-sur-Yvette, France
b Institut National Agronomique Paris-Grignon, 75005 Paris, France

Corresponding author: Bruno Bost, Station de Génétique Végétale (INRA-UPS-INAPG), Ferme du Moulon, 91190 Gif-sur-Yvette, France., bost{at}moulon.inra.fr (E-mail)

Communicating editor: P. D. KEIGHTLEY


*  ABSTRACT
*TOP
*ABSTRACT
*THEORETICAL BACKGROUND
*METHODS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*LITERATURE CITED

The fluxes through metabolic pathways can be considered as model quantitative traits, whose QTL are the polymorphic loci controlling the activity or quantity of the enzymes. Relying on metabolic control theory, we investigated the relationships between the variations of enzyme activity along metabolic pathways and the variations of the flux in a population with biallelic QTL. Two kinds of variations were taken into account, the variation of the average enzyme activity across the loci, and the variation of the activity of each enzyme of the pathway among the individuals of the population. We proposed analytical approximations for the flux mean and variance in the population as well as for the additive and dominance variances of the individual QTL. Monte Carlo simulations based on these approximations showed that an L-shaped distribution of the contributions of individual QTL to the flux variance (R2) is consistently expected in an F2 progeny. This result could partly account for the classically observed L-shaped distribution of QTL effects for quantitative traits. The high correlation we found between R2 value and flux control coefficients variance suggests that such a distribution is an intrinsic property of metabolic pathways due to the summation property of control coefficients.


THE pioneering work of KACSER and BURNS 1981 Down illustrated the power of the metabolic control theory (MCT) in accounting for fundamental genetic phenomena such as recessivity of deleterious alleles, epistasis, or selection of selective neutrality. The MCT describes how the properties of individual enzymes of a pathway influence the flux through the pathway, and thus provides a biochemical link between the genetically determined enzyme activities/concentrations and the flux, which is a global property of the pathway. This mechanistic model of a quantitative phenotype has been successfully used in quantitative and population genetics. The variability of the flux was theoretically analyzed as a function of the effect and frequency of mutations in populations (KEIGHTLEY 1989 Down), or within sibship when parental genotypic values are known (WARD 1990 Down). Developments of this model shed light on the variability of enzyme activities in populations under mutation-selection balance (CLARK 1991 Down; HASTINGS 1992 Down). The relationship between metabolic flux and fitness was explored in Escherichia coli (DYKHUIZEN et al. 1987 Down), leading to the concept of natural selection of selective neutrality (HARTL et al. 1985 Down). BEAUMONT 1988 Down pointed out that stabilizing selection arises as a consequence of the structure of metabolic pathways; and KEIGHTLEY 1996 Down showed how dominance and directional epistasis, which are automatically generated in metabolic pathways, lead to an asymmetrical pattern of response to directional selection. Finally SZATHMARY 1993 Down showed that epistasis between deleterious mutations for enzyme activity is synergistic in most kinds of selection, except for selection for maximizing the flux, where epistasis is antagonistic.

In the terminology of modern quantitative genetics, the enzymatic loci can be regarded as putative quantitative trait loci (QTL) of the flux, characterized by their contribution to the flux variance in a population. Assuming that macroscopic and quantitative traits are proportional to metabolic fluxes in the cell, we considered the fluxes as model traits to analyze the quantitative genetic variation. In this work, the MCT was used to predict the shape of the distribution of flux QTL effects in a segregating population derived from the cross between two individuals drawn at random in a species. We considered both the variation of the average enzyme activities across the metabolic pathway and the variation of activity of single enzymes between individuals of the population. Using analytical developments and simulations, we showed that an L-shaped distribution of flux QTL effects is consistently observed. This distribution is related to the L-shaped distribution of flux control coefficients, which is a consequence of the summation theorem (KACSER and BURNS 1973 Down), and is also observed experimentally.


*  THEORETICAL BACKGROUND
*TOP
*ABSTRACT
*THEORETICAL BACKGROUND
*METHODS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*LITERATURE CITED

Metabolic flux as a function of enzyme activity:
The flux through a linear metabolic pathway is described as a hyperbolic function of the activity of each enzyme involved in the pathway (KACSER and BURNS 1973 Down). Consider a linear metabolic pathway, with n enzymes (E1, E2... , Ei, ... , En), converting a substrate (S1) into a final product (Sn+1),

and define Ei, which for simplicity will be called the activity of enzyme i, as

where Vi is the maximum velocity of enzyme Ei, Mi is its Michaelis constant, and K1,i = {Pi}i-1j=1Kj,j+1 is the product of equilibrium constants of reactions 1, 2, ... , i. At the steady state, and assuming that all enzymes are far from saturation, the flux through the pathway is

(1)

where [S1] and [Sn+1] are the concentrations of the substrate S1 and product Sn+1, respectively. [S1] and [Sn+1] are fixed parameters of the system, while the intermediate metabolite concentrations ([Si] for i = 2 to n) are variables.

Control coefficient of the flux:
To quantify how the flux reacts when an infinitesimal change occurs in the activity of a given enzyme, KACSER and BURNS 1973 Down defined the control coefficient C J i of the flux J, with respect to activity Ei of enzyme Ei, as the ratio of an infinitesimal relative variation of the flux to an infinitesimal relative variation of an enzyme:

Under the assumptions mentioned above, we have

and hence {sum}ni=1C J i = 1.

This summation theorem (KACSER and BURNS 1973 Down) applies more generally than to linear pathways, for example in branched pathways, pathways with feedback regulation (KACSER and BURNS 1973 Down), pathways where some metabolites are involved in a moiety-conserved cycle (HOFMEYR et al. 1986 Down), or pathways with two steps catalyzed by the same protein molecule (CASCANTE et al. 1990 Down). The most important consequence of the theorem is that the control of the metabolic system may be shared among all the enzymes, a view quite different from that of "rate-limiting" or "bottleneck" concepts. In the context of the MCT, the rate-limiting steps are steps with control coefficients close to one, and they are highly dependent on the genetic background. Experimental data are rather consistent with a distribution of the control across the metabolic pathway (see RESULTS).


*  METHODS
*TOP
*ABSTRACT
*THEORETICAL BACKGROUND
*METHODS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*LITERATURE CITED

To study the flux QTL distribution, we considered the populations resulting from a cross between two diploid parents drawn at random. In those populations, we varied both the average activities among loci and the extent of the genetic variability of activity of the enzymes that control the metabolic pathway. The resulting variations observed at the flux level were analytically studied, and a set of relevant variables to describe QTL effects and metabolic control was defined. Then, we used Monte Carlo simulations to analyze the distribution of these variables and their relationships.

Variation at the enzyme level:
We defined a given individual k of the species by the vector Ek = {Ek1, Ek2, ... , Eki, ...} of enzyme activities of the biallelic loci governing the metabolic pathway. Without any knowledge of the distribution of enzyme activities among the loci of actual metabolic pathways, we supposed that the Eki are random variables, independently and identically distributed according to a given law L({theta}k), where {theta}k is the vector of the parameters of L for individual k. We considered the population resulting from the cross between two individuals k and h and supposed that the loci governing enzyme activities are independent, and without linkage disequilibrium in the population. In this case, the distribution of the flux is determined by the enzyme activities Ek and Eh for each parent, and by the matrix of allelic frequencies {pij}, where pij is the frequency of allele j for enzyme i in the resulting population.

In particular, we considered the F1 hybrid resulting from the cross between two inbred lines and the F2 population obtained by selfing the F1 hybrid. In case of independent loci and with no dominance at the enzyme level, enzyme activity at locus i is defined by the average activity mi = and the additive allelic effect ai = . Note that mi and ai are not independent, because for all i, |ai| <= mi. In an F2 population, it is easily shown (Appendix A) that the coefficient of variation (cvi) of the activity of enzyme i is

(2)

Hence, the F2 population can alternatively be described by the distribution law L({theta}m) of the mi and the distribution law L({theta}cv) of the cvi. The former describes the distribution of the average enzyme activity across the loci, while the latter describes the distribution of the differences between the parents k and h, because for a given mi value, the ai value can be deduced from the cvi value.

Variation at the flux level:
In a segregating population, each enzyme whose activity is genetically variable explains a part of the genetic variance of the flux. In other words, the polymorphic loci responsible for enzyme variations are QTL of the flux and of any trait proportional to the flux. However, there is no simple relationship between the variation of enzyme activity at a locus i and its additive and dominance effect ({alpha}i and ßi, respectively) upon the flux.

The simplest case of an F2 population where all enzymes but one have the same activity in both parents is represented in Figure 1. The hyperbolic relationship between flux and enzyme activity leads to a saturation in the flux when the activity increases (Figure 1A; Equation 1). The additive effect {alpha}i of QTL i upon the flux is equal to half of the difference between the parents, and the degree of dominance ßi is one-half the difference between the flux of the F1 hybrid and the mean flux of the parents. Figure 1B shows that the control coefficient increases as the enzyme activity decreases, and is close to zero for values of activity corresponding to the plateau in the flux curve. Due to this hyperbolic relationship between flux and activity, a relative change in activity leads to a smaller relative change of the flux, and the low-effect alleles are recessive (KACSER and BURNS 1981 Down). If both parents exhibit low control coefficients (i.e., high values of activity), the flux values will be close to each other, while if both parents have high control values, the heterozygote will exhibit additivity, a prediction consistent with known cases of heterozygotes between sets of "lower" alleles in a series, as for pigmentation in guinea pig (WRIGHT 1960 Down) or mouse (GRUNENBERG 1952 Down).



View larger version (11K):
In this window
In a new window
Download PPT slide
 
Figure 1. Relationship between the parameters that describe the variability at the flux level, and the corresponding parameters at the enzyme level, in a simple situation where all enzymes of the pathway, except one (enzyme i), have the same activity for the cross between two inbred lines k and h. (A) Eki, Ehi, and EF1, are the enzyme activities of parent k, parent h and F1 hybrid, respectively. mi is the average activity and ai is the additive allelic effect of the locus i. Jki, Jhi, and JF1 are the flux values of parent k, parent h, and F1 hybrid, respectively. µJ is the average flux in an F2 population resulting from the selfing of the F1 hybrid. {alpha}i and ßi are, respectively, the additive and dominance effect of the locus i: 2{alpha}i = Jhi - Jki and 2ßi = JF1 - . (B) CkiJ, ChiJ, and CJF1, are the flux control coefficients of the enzyme i, in parent k, parent h, and the F1 hybrid, respectively. Parent k has a control coefficient higher than parent h, which is nearly on the plateau (CkiJ is five times higher than ChiJ). This situation leads to a high additive effect with an intermediary degree of positive dominance.

In the general case, all enzyme activities may differ between the parents. Hence, the flux of each individual depends on its genotype at the different enzymatic loci. The average flux µJ in a population can be approximated by developing the function (1) expressing the flux into a second-order Taylor series. We chose the second order as a good compromise between precision and heaviness of the calculations. Provided there is no linkage disequilibrium, and taking the derivatives of µJ with respect to allelic frequencies, the additive and dominance effects ({alpha}i and ßi) of QTL i, and the epistatic (additive x additive) effect ({alpha}{alpha}ij) of a pair (i, j) of QTL, could be calculated (Appendix B; KOJIMA 1959 Down), as well as the contributions of the QTL to the components of the flux variance: additive ({sigma}2Ai) and dominance ({sigma}2Di)variances at QTL i, and epistatic variance ({sigma}2AAij) for the pair of QTL (i, j). The QTL i contributes for a fraction R2i of the total variance of the flux,

(3)

where {sigma}2A, {sigma}2D, and {sigma}2AA, are the additive, dominance, and epistatic (additive x additive) variances of the flux in the population, respectively. The total genetic variance of the flux also comprises other components, like the (additive x dominance) and (dominance x dominance) epistatic variances, as well as higher-order variance components, which were neglected here. In this article, the "additive allelic effect" of enzyme locus i refers to ai, while the "flux QTL effect" of QTL i refers to R2i.

The sharing out of the control between the enzymes of the metabolic pathway is different for each individual of the population. To characterize each F2 population, we computed, for each enzyme i, the average flux control coefficient, and its variance, Var[CJi], and we defined the concept of "populational control coefficient" for enzyme i as

(4)

where i (respectively j) is the average activity of enzyme i (respectively j) in the population.

The populational control coefficient of enzyme Ei is not equal to its average control coefficient, but corresponds to the control coefficient of an "average" individual, i.e., an individual displaying the average activities for all enzymes. It does not depend on the additive allelic effect ai of the QTL, unlike the R2i. In F2 populations without dominance at the enzyme level, i = mi so that the populational flux control coefficient is also the control coefficient of the F1 hybrid for enzyme i.

Simulation of flux QTL effects and control coefficient distributions:
To simulate the distributions of flux QTL effects or control coefficients we considered 50 independent enzymatic loci in F2 populations. A four-step procedure was used:

  1. Draw of the mi values. We considered several distribution laws for mi, corresponding to different degrees of dispersion of the average enzyme activity across the metabolic pathway. Those distributions were constant (mi = 10, {forall}i - reference case), uniform (in the range [0, 30]), normal (µ = 10, {sigma} = 2.5), or exponential ({theta} = 16.2, {sigma} = 1.2). The value of {theta} was chosen so that all the distributions have roughly the same range of variation, and the probability density function of the exponential law is f(x) = ()exp(-).

  2. Draw of the cvi values. We have chosen to consider the distribution of cvi rather than the distribution of ai for two reasons. First, it made it easier to take the constraint |ai| <= mi, {forall}i into account. Second, we observed that our approximations for the average flux and its variance were better for cvi values <=0.3. Three contrasted distributions were considered: (i) cvi = 0.2, {forall}i, i.e., there is a strict positive relationship between mean and additive effect of enzyme activity; (ii) normal, with an average of 0.35 (middle of the range for the possible values of cvi, given mi; see Appendix A) and standard deviation fitted to get all values within the range of possible cv values; (iii) gamma, fitted to get 95% of the values between 0 and 0.3.

  3. Computation of the flux QTL effects. For each pair of {mi} and {cvi} vectors, we used our approximations to compute the flux of the F1 hybrid and the parameters of the F2 population: populational flux control coefficient Ji, average flux µJ, total genetic variance, and the flux QTL effects R2i (see Appendix B).

  4. Distribution of the flux control coefficients. For an F2 population, 10,000 individuals were randomly generated, according to the parental genotypes at each locus. For each individual and each locus, we computed the flux control coefficient and inferred the corresponding variance Var[CJi] for each enzyme.

Those steps were iterated 100 times to simulate 100 different F2 populations. Hence, we computed 100 µJ values and a total of 100 x 50 = 5000 different values for mi, cvi, Ji, R2i, and Var[CJi]. The expected distributions of those parameters were obtained by pooling the 5000 resulting values. The populational control coefficient and R2i distributions were characterized by the following parameters: mean, skewness (SOKAL and ROHLF 1995 Down), {sum}ni=1R2i averaged over populations, and percentage of values <0.02, which is the value expected for 50 equivalent QTL (1/50).


*  RESULTS
*TOP
*ABSTRACT
*THEORETICAL BACKGROUND
*METHODS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*LITERATURE CITED

Relationships between enzymatic allelic effects and flux QTL effects:
As the relationship between flux and enzyme activities is nonlinear, the average flux µJ in a population depends not only on the average enzyme activities, i, but also on the variances, {sigma}2Ei, of enzyme activities, with a negative relationship between average flux and activity variances. As shown in Appendix B,

(5)

For the same reason, the flux variance is related not only to the variances of enzyme activity at each QTL but also to their average activities. These features clearly differ from the classical additive models used in quantitative genetics. Moreover, the formulas show that it is not the average activity of the enzymatic locus that directly influences the flux variance, but the relative weight of the enzyme in the pathway, expressed as the "populational control coefficient" (Ji), or control coefficient of the "average" individual (Equations B13–B15). The additive contribution of QTL i to the flux variance ({sigma}2Ai) is also affected by the other QTL through their variability (contribution of a QTL is reduced by an increase of the variability of the other enzymes) and through their populational control coefficients, which are related by the summation property,

(6)

where cvi = , with mi the average activity of enzyme i and ai its additive allelic effect.

It is worth noting that the relationship between those factors is not tight: an enzymatic locus with a large additive allelic effect may have a small effect upon the flux variance if its control on the pathway is weak in both parents (Figure 1).

The flux QTL effects are L-shaped distributed:
Figure 2 compares the distributions of the average enzyme activity, mi, across the loci, to the corresponding distributions of flux QTL effects, Ri2, for a gamma distribution of cvi. When all enzymes have identical mi and ai, the QTL have the same Ri2 (Figure 2A). Our simulations show that, as soon as there is any difference in their average activity or variability, the distribution of Ri2 exhibits an L shape: few steps have high Ri2 values, more have moderate values, and a large number have small or very small values. This shape is more pronounced with the uniform distribution of mi, which leads to more contrasted mi values, but remains with the exponential, which still displays a J-shaped distribution of mi, or with the normal distribution (Figure 2). Numerical characteristics of the Ri2 distributions confirm these observations: all skewness values are significantly positive (P = 0.001). As shown Table 1, 61.4–91.5% of the QTL have an Ri2 value below 1/50, depending on the mi and cvi distributions, and do not really contribute to the variance of the flux in the population. However, this skewness is more affected by unequal mi values across loci rather than by unequal cvi values: for a given distribution of the mi values, it is about the same whatever the distribution of the cvi.



View larger version (27K):
In this window
In a new window
Download PPT slide
 
Figure 2. Distributions of flux QTL R2 (percentages) for a 50-enzyme pathway in 100 F2 populations, computed for four distributions of average activity (mi): (A) Reference (same mi for all the QTL); (B) normal; (C) exponential; and (D) uniform. The coefficients of genetic variation of the QTL (cvi) are randomly drawn in a gamma distribution.


 
View this table:
In this window
In a new window

 
Table 1. Distributions of flux QTL R2 for a 50-enzyme pathway in an F2 population

Another feature of flux QTL is that they behave as if they are nearly additive. Without epistasis, the Ri2 should sum up to 100% in a given population (Equation 3). With the parameters chosen in our simulations, the average Ri2 value is just below 1/50, and their average sum over the flux QTL ranges from 95.1 to 99.9%. Of course we did not consider all the epistatic terms in the denominator of Equation 3. However, we checked, by calculating the difference between the total genetic variance and the denominator, that the epistatic terms of higher order are negligible so this approximation does not significantly modify the results. The more pronounced the L-shaped distribution across loci, the higher the variance that results from additive x additive interactions: it is equal to 0.6% with constant mi and cvi, and rises up to 4.9% with uniform mi and normal cvi. Moreover, when the number of QTL decreases, the additive x additive epistasis increases—for example, from 0.6 to 2.8% for 50 and 10 QTL, respectively (constant mi and cvi, not shown).

As seen in Figure 1 and METHODS, the additive and dominance effects of an enzymatic locus upon the flux are related to the difference in the control coefficients between the parental genotypes. On the other hand the control coefficients are linked through the summation property. Thus, looking for a possible intrinsic relationship between flux QTL R2 and summation property, we analyzed the sharing out of the control in the parents, and we studied the relationship between the R2i and the variance of the control coefficients in the population.

The parental flux control coefficients are L-shaped distributed:
In linear pathways of unsaturated Michaelian enzymes, the flux control coefficients are all positive. With an n-enzymes pathway, the summation theorem implies that, for a given individual, when one or a few steps have control coefficients greater than 1/n, the other steps will necessarily have coefficients below 1/n. Hence, the average value of the control coefficients is expected to be 1/n. If a mutation decreases one enzyme activity close to 0, its control coefficient will rise up to a value close to 1, and the other coefficients will become negligible. Thus, as soon as there is some variation for enzyme activity across loci in large metabolic systems, there would be many steps exhibiting small or very small control coefficients and a few steps with a large control; i.e., the distribution of control coefficients across loci is expected to exhibit an L shape.

Experimental data: We analyzed three experimental or modeling studies by pooling for each one all the control coefficients estimated under various conditions (GROEN et al. 1986 Down; ALBE and WRIGHT 1992 Down; HILL et al. 1993 Down). GROEN et al. 1986 Down have estimated the control coefficients of eight steps of gluconeogenesis in isolated rat liver cells under various experimental conditions (each "step" actually included several reactions, the whole pathway being considered). About 70% of the control coefficients were under the average value (0.125), with 50% under 0.05. The shape of the distribution of the coefficients was skewed to the right, with a skewness value of 20.42 (significant with P = 0.001; Figure 3A). Similar results were obtained for five steps of succinate oxidation in cucumber cotyledon mitochondria under various experimental conditions (HILL et al. 1993 Down; skewness 27.00, P = 0.001, Figure 3B). Finally, a steady-state model for the tricarboxylic acid cycle has been established from experimental data in Dictyostelium discoideum (ALBE and WRIGHT 1992 Down). The control coefficients for the CO2 flux produced by the cycle and the total CO2 production were estimated for each of the 26 steps, with six different ranges of variation of enzyme activities. The distribution of the coefficients was skewed toward weak control (skewness 12.29, P = 0.001, Figure 3C), and 66% of the values were below the expected average of 1/26. These data are consistent with numerous partial characterizations of other metabolic pathways, which show that the control is not equally shared between the different steps of metabolic chains.



View larger version (18K):
In this window
In a new window
Download PPT slide
 
Figure 3. (A) Distribution of flux control coefficients for 8 steps of the gluconeogenesis pathway, from lactate to glucose, in isolated rat liver cells in various experimental conditions. Data from GROEN et al. 1986 Down. (B) Distribution of flux control coefficients for 5 steps of the succinate oxidation pathway in isolated cucumber cotyledon mitochondria. The flux is the O2 consumption. Data from HILL et al. 1993 Down. (C) Distribution of CO2 flux control coefficients for 26 steps of a computer model of the tricarboxylic acid cycle in Dictyostelium discoideum. Data from ALBE and WRIGHT 1992 Down.

Simulations: The same kind of L-shaped distribution was found in our simulations, considering the 2 x 100 parents of the F2 populations and a 50-step metabolic pathway (results not shown). Whatever the distribution of enzyme activity, skewness values are positive and highly significant (P = 0.001) and there are much more control values below 1/50 (51–81% of the values) than over.

Relationship between control coefficients and flux QTL effects:
From Equation 5 and B20, it appears that the contributions of QTL i to the flux additive and dominance variances are related to the populational control coefficient and to the variance of the activity of the enzyme i. However, the relationship between the Ri2 and the populational control coefficient in a given population is complex and appears to be very loose (Figure 4A). QTL with similar populational control coefficients may have quite different Ri2, depending on ai values, while QTL may exhibit low Ri2 even though the control coefficient is high, if ai is low (compare QTL 1, 2, and 3 in Figure 4B).



View larger version (12K):
In this window
In a new window
Download PPT slide
 
Figure 4. Relationship between flux QTL R2 and populational control coefficients (A and B) or variance of the control coefficient (C) for a 50-enzyme pathway in 100 F2 populations. Average activity (mi) is normally distributed (see Figure 2B). (A) The coefficient of genetic variation of each QTL (cvi) is randomly drawn in a gamma distribution. (B) As for A, but only 1 of the 100 populations is represented. For the three marked QTL, see RESULTS. (C) Relationship between R2 and control coefficient variances in the same population as for B.

On one hand, a given allelic additive effect (ai) is expected to affect the flux all the more if the difference between parental flux control coefficients of enzyme i is high (Figure 1). On the other hand, a large difference between parental flux control coefficients of enzyme i would result in a high variance (Var[CJi]) of the flux control coefficient in the resulting population. We therefore expect a positive relationship between Ri2 and Var[CJi]. Such a high positive correlation was actually found in our simulations for each F2 population as illustrated in Figure 4C. This striking result implies that the major flux QTL are those for which the parents are the most contrasted for the flux control coefficients. We can relate this result to the L-shaped distribution of flux control coefficients in the parents. As shown previously, the summation theorem implies that any variation of enzyme activity across loci would result in a few enzymes with a high control on the metabolic pathway. If the parents of the cross are nonrelated, those enzymes with a high control on the flux are not expected to be the same in both parents. They will therefore appear as major QTL for the flux. Hence, the L-shaped distribution of flux QTL effects is simply a consequence of intrinsic properties of metabolic pathways, through the summation property of flux control coefficients.


*  DISCUSSION
*TOP
*ABSTRACT
*THEORETICAL BACKGROUND
*METHODS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*LITERATURE CITED

Understanding the relationship between gene polymorphism and quantitative trait variability is one of the main goals of quantitative genetics. The MCT provides a theoretical framework to analyze the consequences of the polymorphism of the genes controlling the enzymes concentration/activity in a linear pathway on the steady-state flux through this pathway, or on any trait proportional to this flux. We developed approximations for the flux variance components in any population without linkage disequilibrium and for any number of biallelic enzymatic QTL. These approximations lose precision for high coefficients of variation of enzymatic activities (roughly >0.3). Other methods (e.g., KEIGHTLEY 1989 Down) allow us to take into account deleterious variants but are analytically restricted to models with one or two QTL.

Simulations based on these formulas have shown that the L-shaped distribution of flux QTL R2 in a segregating population is inevitable for a flux through a linear pathway at the steady state: L-shaped distributions are generated as soon as there is any difference between the activities of the enzymes across the pathway. We have shown that such distributions arise as an indirect consequence of the summation theorem for the flux control coefficients, through the sharing out of the control in the parents. Flux QTL with major effect should correspond to enzymes that exhibit a great difference between parental flux control coefficients, namely enzymes that have a high control, i.e., a low activity, in one parent only. Thus the measurement of parental control coefficients, through metabolic control analysis methods (see FELL 1992 Down for a review), allows us to identify which enzymes could provide good candidate loci—regulatory or structural loci of these enzymes—to explain the quantitative trait variation in the segregating population studied.

In the framework of the metabolic control theory, the low-activity alleles appear to be recessive at the flux level. Unrelated parents with different evolutionary histories are not expected to exhibit the same deficient enzymes. As a consequence, their hybrid will exhibit heterosis for the flux due to positive dominance at different loci (B. BOST, C. DILLMANN and D. DE VIENNE, unpublished results). This result generalizes the classical result from KACSER and BURNS 1981 Down on the biochemical basis of dominance to a multilocus situation.

For simplicity, we took into account a 50-enzyme pathway with only one structural or regulatory polymorphic locus per enzyme. It is now well documented that the amounts/activities of enzymes are themselves polygenic traits (LAURIE-AHLBERG et al. 1982 Down; CLARK and KEITH 1988 Down; DAMERVAL et al. 1994 Down; CAUSSE et al. 1995 Down; MITCHELL-OLDS and PEDERSEN 1998 Down). However, this does not modify our conclusions. In fact, considering polygenic enzyme activities will probably only result in partitioning the control of a given enzyme between its different loci. In the simulations, we also considered that the enzymatic loci have additive effects. Even though there are some exceptions (CLARK and WANG 1997 Down), this is consistent with most experimental studies on enzyme activity (KACSER and BURNS 1981 Down). In maize, most of the proteins revealed by 2D-PAGE displayed additive inheritance for their quantity (LEONARDI et al. 1988 Down). The majority were found to be enzymes (TOUZET et al. 1996 Down).

In maize, tomato, rice, or Drosophila, where numerous QTL have been mapped for various complex traits, compilations consistently revealed extremely skewed distributions of QTL effects, with few QTL having large effects, more QTL having moderate effects, and likely a lot having small effects (depending on the power of detection methods), resulting in a typical L-shaped distribution. For example in Drosophila, many loci have small effects on abdominal and sternopleural bristle number, but few loci cause most of the genetic variation (MACKAY 1996 Down). EDWARDS et al. 1987 Down searched for associations between ~20 marker loci and 82 traits in two F2 populations of maize, each of ~1900 individuals. With a type I error of 0.05 they found 2460 significant associations, with a typically L-shaped distribution of the R2, and 94.5% of the associations exhibiting R2 values smaller than 5%. Other examples can be found in the literature (e.g., SING and BOERWINKLE 1987 Down; SHRIMPTON and ROBERTSON 1988 Down; PATERSON et al. 1991 Down; ZEHR et al. 1992 Down; SCHON et al. 1994 Down; GRANDILLO and TANKSLEY 1996 Down; LEE et al. 1996 Down).

Statistical artifacts can contribute to that distribution (CARBONELL et al. 1992 Down, CARBONELL et al. 1993 Down; BEAVIS 1994 Down). Moreover, factors such as linkage between QTL, phase (coupling/repulsion), or heritability, may produce L-shaped distributions for traits following the classical additive model of quantitative genetics, even though the skewness is consistently higher with the metabolic model (B. BOST, C. DILLMANN and D. DE VIENNE, unpublished results). However, it is highly likely that the L-shaped distribution also has biological bases. First, there is no genetic reason for a discontinuity between all-or-null (wild-type/mutant) variation and quantitative variation. Thus in pea, a major gene for Ascochyta blight resistance was mapped on chromosome 4 using both a QTL detection approach and a Mendelian analysis after partitioning the distribution of the resistance in the progeny into two classes (DIRLEWANGER et al. 1994 Down). Second, a survey of the literature shows that for various traits, the same major QTL may be found in different populations or environments, which seems quite unlikely in case of erroneous estimates of the QTL effects. For example, in two different F2 populations derived from crosses between maize and teosinte, DOEBLEY and STEC 1993 Down found similar suites of QTL for architectural traits, with the same order of QTL effects for several of them. In three tomato populations derived from the same parents (one F2 and two F2/3) and grown in different environments, PATERSON et al. 1991 Down found QTL for fruit traits common to two or even three conditions. Also significant are the QTL apparently commonly found in different species of Poaceae (PATERSON et al. 1995 Down) or Fabaceae (FATOKUN et al. 1992 Down; MAUGHAN et al. 1996 Down; TIMMERMAN-VAUGHAN et al. 1996 Down). Finally, in a noninfinitesimal model, equal QTL effects would imply that for any given trait, there would be a given, limited number of detectable QTL, which seems to be biologically irrelevant.

Our biochemical modeling is relevant to account for quantitative traits that are themselves components of metabolic pathways (starch concentration in grain, angiotensinogen concentration in blood, maysin content in maize silks, anthocyanin content, etc.) or that are proportional to such components. For example the weight of maize kernel is correlated to starch amount, so that the study of the variability of kernel weight could come within the framework of the MCT, relying on the well-known steps of the starch biosynthesic pathway. On the other hand, the analysis is valid for any gene, structural or regulatory, controlling the activity of an enzyme. Some identified QTL proved to be genes coding for enzymes, such as angiotensin-converting enzyme (ACE; SCHACHTER et al. 1994 Down), inducible nitric oxide synthase (DENG and RAPP 1995 Down), or renin (RAPP et al. 1989 Down) in rat. Transcription factors controlling enzyme amounts also obey the same theoretical framework, such as locus p1 in maize, which is a transcription activator of a portion of the biosynthesis pathway of flavonoids (BYRNE et al. 1996 Down). But there are examples of QTL that do not control, at least in a direct way, enzyme activities such as angiotensinogen-coding gene (JEUNEMAITRE et al. 1992 Down), apolipoprotein gene (CORDER et al. 1993 Down), or HLA loci (THOMSON 1988 Down; TOMLINSON and BODMER 1995 Down) in humans. Actually, the proportion of polymorphic genes encoding or controlling enzymes is not known, and the extent to which our analysis applies to gene products involved in the timing or tissue specificity of gene expression, or to hormone/receptor systems, is difficult to appreciate.

In conclusion, the hyperbolic relation between enzyme activity and metabolic flux accounts for the L-shaped distribution of control coefficients and hence could be one of the factors explaining the L-shaped distribution of gene effects in quantitative genetics.


*  ACKNOWLEDGMENTS

Thanks go to F. Hospital for useful discussions and reading the manuscript, and also to C. Damerval and A. Leonardi for reading the manuscript. We thank G. de Jong, who brought to our attention the Kojima approach on gene effect decomposition. B. Bost was supported by a Ph.D. grant from the French Ministry of National Education, Research and Technology (MENRT).

Manuscript received June 7, 1999; Accepted for publication August 25, 1999.


*  APPENDIX A
*TOP
*ABSTRACT
*THEORETICAL BACKGROUND
*METHODS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*LITERATURE CITED

Coefficient of genetic variation for an enzymatic locus in an F2 population:
The coefficient of variation (cvi) for the enzymatic locus i is

(A1)

where i and {sigma}2Ei are, respectively, the average activity and the variance of activity of the enzyme i in the population. In an F2 population derived from the cross between two homozygous lines, we have, with our notations, i = mi and {sigma}2Ei = a2i. Thus

(A2)

As there is a constraint on the possible ai values—{forall}i, |ai| <= mi—, the values of cvi are in the range

(A3)


*  APPENDIX B
*TOP
*ABSTRACT
*THEORETICAL BACKGROUND
*METHODS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*LITERATURE CITED

QTL contributions to the flux variance components:
Shown are calculation of approximations of the additive and dominance effects ({alpha}i and ßi) and additive and dominance variance ({sigma}2Ai and {sigma}2Di) at QTL i, and approximations of epistasis (additive x additive) effect and variance ({alpha}{alpha}ij and {sigma}2AAij) for a pair of QTL, i and j.

The QTL are controlling the flux through a linear pathway of n enzymes, in a segregating population, assuming that the QTL are not linked. Each enzyme of the pathway is controlled by one biallelic QTL i, with a frequency pi for the upwardly acting allele.

The flux in the pathway for an individual j in the population is

(B1)

where K = [S1] - (see Equation 1).

Expanding B1 into a second-order Taylor series, we have

(B2)

where i is the population average activity of the enzyme i.

The population average flux is

(B3)

Some simplification occurs:

And as there is no linkage disequilibrium in the population,

Combining (B2), (B3), and these simplifications, an approximation of the population average flux is

(B4)

The second partial derivative of the function f with respect to Ei is

(B5)

Thus, introducing (B5) into (B4), we have an approximation of the population average flux,

(B6)

where {sigma}2Ei and cvi are, respectively, the variance and the genetic coefficient of variation of activity of enzyme i in the population, and Ji is the "populational" flux control coefficient of enzyme i:

KOJIMA 1959 Down showed that the additive and dominance effects of a gene i upon a trait are related, respectively, to first and second partial derivatives of the population mean of the trait with respect to allelic frequency (pi). The (additive x additive) epistasis effect of a pair of QTL (i and j) is related to the mixed partial derivative of the mean with respect to both allelic frequencies (pi and pj). Applying these formulas to the flux, we get

(B7)


(B8)


(B9)


(B10)


(B11)


(B12)

So following Equation B7, we have, for the flux additive effect of the locus i,

(B13)

From Equation B9 we get the flux dominance effect of the locus i,

(B14)

And from Equation B11 we get the epistatic effect of the pair of loci i, j,

(B15)

In an F2 population with additive enzymatic loci, there are some simplifications in Equations B13 to B15:

Hence an approximation of the average flux in an F2 population is

(B16)

An approximation of the additive effect of the QTL i upon the flux is

(B17)

and the contribution of the QTL i to the flux additive variance is

(B18)

The dominance effect of the QTL i is

(B19)

and the contribution of the QTL i to the flux dominance variance is

(B20)

The epistatic effect of the pair of loci i, j (i != j) is

(B21)

and the contribution of the QTL i and j (i != j) to the flux epistatic variance is

(B22)

with {forall}i, {sigma}2AAii = 0.

The R2 of the QTL i is calculated as described in Equation 3 (METHODS), with


*  LITERATURE CITED
*TOP
*ABSTRACT
*THEORETICAL BACKGROUND
*METHODS
*RESULTS
*DISCUSSION
*APPENDIX A
*APPENDIX B
*LITERATURE CITED

ALBE, K. R. and B. E. WRIGHT, 1992  Systems analysis of the tricarboxylic acid cycle in Dictyostelium discoideum. II. Control analysis. J. Biol. Chem. 267:3106-3114[Abstract/Free Full Text].

BEAUMONT, M. A., 1988  Stabilizing selection and metabolism. Heredity 61:433-438.

BEAVIS, W. D., 1994 The power and deceit of QTL experiments: lessons from comparative QTL studies, pp. 250–266 in 49th Annual Corn and Sorghum Research Conference. ASTA, Washington, DC.

BYRNE, P. F., M. D. MCMULLEN, M. E. SNOOK, T. A. MUSKET, and J. M. THEURI et al., 1996  Quantitative trait loci and metabolic pathways: genetic control of the concentration of maysin, a corn earworm resistance factor, in maize silks. Proc. Natl. Acad. Sci. USA 93:8820-8825[Abstract/Free Full Text].

CARBONELL, E. A., T. M. GERIG, E. BALANSARD, and M. J. ASINS, 1992  Interval mapping in the analysis of nonadditive quantitative trait loci. Biometrics 48:305-315.

CARBONELL, E. A., M. J. ASINS, M. BALSEGA, E. BALANSARD, and T. M. GERIG, 1993  Power studies in the estimation of genetic parameters and the localization of quantitative trait loci for backcross and doubled haploid populations. Theor. Appl. Genet. 86:411-416.

CASCANTE, M., E. I. CANELA, and R. FRANCO, 1990  Control analysis of systems having two steps catalyzed by the same protein molecule in unbranched chains. Eur. J. Biochem. 192:369-371[Medline].

CAUSSE, M. A., J.-P. ROCHER, A.-M. HENRY, A. CHARCOSSET, and J.-L. PRIOUL et al., 1995  QTLs for carbon metabolism and early growth in maize, with emphasis on key-enzyme loci. Mol. Breed. 1:259-272.

CLARK, A. G., 1991  Mutation-selection balance and metabolic control theory. Genetics 129:909-923[Abstract].

CLARK, A. G. and L. E. KEITH, 1988  Variation among extracted lines of Drosophila melanogaster in triacylglycerol and carbohydrate storage. Genetics 119:595-607[Abstract/Free Full Text].

CLARK, A. G. and L. WANG, 1997  Epistasis in measured genotypes: Drosophila P-element insertions. Genetics 147:157-163[Abstract].

CORDER, E. H., A. M. SAUNDERS, W. J. STRITTMATTER, D. E. SCHMECHEL, and P. C. GASKELL et al., 1993  Gene dose of apolipoprotein E type 4 allele and the risk of Alzheimer's disease in late onset families. Science 261:921-923[Abstract/Free Full Text].

DAMERVAL, C., A. MAURICE, J. M. JOSSE, and D. DE VIENNE, 1994  Quantitative trait loci underlying gene product variation: a novel perspective for analyzing regulation of genome expression. Genetics 137:289-301[Abstract].

DENG, A. Y. and J. P. RAPP, 1995  Locus for the inducible, but not a constitutive, nitric oxide synthase cosegregates with blood pressure in the Dahl salt-sensitive rat. J. Clin. Invest. 95:2170-2177.

DIRLEWANGER, E., P. G. ISAAC, S. RANADE, M. BELAJOUZA, and R. COUSIN et al., 1994  Restriction fragment length polymorphism analysis of loci associated with disease resistance genes and developmental traits in Pisum sativum L. Theor. Appl. Genet. 88:17-27.

DOEBLEY, J. and A. STEC, 1993  Inheritance on the morphological differences between maize and teosinte: comparison of results for two F2 populations. Genetics 134:559-570[Abstract].

DYKHUIZEN, D. E., A. M. DEAN, and D. L. HARTL, 1987  Metabolic flux and fitness. Genetics 115:25-31[Abstract/Free Full Text].

EDWARDS, M. D., C. W. STUBER, and J. F. WENDEL, 1987  Molecular-marker-facilitated investigations of quantitative-trait loci in maize. I. Numbers, genomic distribution and types of gene action. Genetics 116:113-125[Abstract/Free Full Text].

FATOKUN, C. A., D. I. MENANCIO-HAUTEA, D. DANESH, and N. D. YOUNG, 1992  Evidence for orthologous seed weight genes in cowpea and mung bean based on RFLP mapping. Genetics 132:841-846[Abstract].

FELL, D. A., 1992  Metabolic control analysis: a survey of its theoretical and experimental background. Biochem. J. 286:313-330.

GRANDILLO, S. and S. D. TANKSLEY, 1996  QTL analysis of horticultural traits differentiating the cultivated tomato from the closely related species Lycopersicon pimpinellifolium.. Theor. Appl. Genet. 92:935-951.

GROEN, A. K., C. W. T. VAN ROERMUND, R. C. VERVOORN, and J. M. TAGER, 1986  Control of gluconeogenesis in rat liver cells. Flux control coefficients of the enzymes in the gluconeogenic pathway in the absence and presence of glucagon. Biochem. J. 237:379-389[Medline].

GRÜNENBERG, H., 1952 The Genetics of the Mouse. Martinus Nijhoff, The Hague.

HARTL, D. L., D. E. DYKHUIZEN, and A. M. DEAN, 1985  Limits of adaptation: the evolution of selective neutrality. Genetics 111:655-674[Abstract/Free Full Text].

HASTINGS, I. M., 1992  The population genetics of alleles affecting enzyme activity. J. Theor. Biol. 157:305-316[Medline].

HILL, S. A., J. H. BRYCE, and C. J. LEAVER, 1993  Control of succinate oxidation by cucumber (Cucumis sativus L.) cotyledon mitochondria. The role of the adenine-nucleotide translocator and extra-mitochondrial reactions. Planta 190:51-57.

HOFMEYR, J.-H., H. KACSER, and K. J. VAN DER MERWE, 1986  Metabolic control analysis of moiety-conserved cycles. Eur. J. Biochem. 155:631-641[Medline].

JEUNEMAÎTRE, X., F. SOUBRIER, Y. V. KOTELEVTSEV, R. P. LIFTON, and R. R. WILLIAMS et al., 1992  Molecular basis of human hypertension: role of angiotensinogen. Cell 71:169-180[Medline].

KACSER, H. and J. A. BURNS, 1973  The control of flux. Symp. Soc. Exp. Biol. 27:65-104[Medline].

KACSER, H. and J. A. BURNS, 1981  The molecular basis of dominance. Genetics 97:639-666[Abstract/Free Full Text].

KEIGHTLEY, P. D., 1989  Models of quantitative variation of flux in metabolic pathways. Genetics 121:869-876[Abstract/Free Full Text].

KEIGHTLEY, P. D., 1996  Metabolic models of selection response. J. Theor. Biol. 182:311-316[Medline].

KOJIMA, K.-I., 1959  Role of epistasis and overdominance in stability of equilibria with selection. Proc. Natl. Acad. Sci. USA 45:984-989[Free Full Text].

LAURIE-AHLBERG, C. C., A. N. WILTON, J. W. CURTSINGER, and T. H. EMIGH, 1982  Naturally occurring enzyme activity variation in Drosophila melanogaster. I. Sources of variation for 23 enzymes. Genetics 102:191-206[Abstract/Free Full Text].

LEE, S. H., M. A. BAILEY, M. A. R. MIAN, T. E. CARTER, JR., and E. R. SHIPE et al., 1996  RFLP loci associated with soybean seed protein and oil content across populations and locations. Theor. Appl. Genet. 93:649-657.

LEONARDI, A., C. DAMERVAL, and D. DE VIENNE, 1988  Organ-specific variability and inheritance of maize proteins revealed by two-dimensional electrophoresis. Genet. Res. Camb. 52:97-103.

MACKAY, T. F., 1996  The nature of quantitative genetic variation revisited: lessons from Drosophila bristles. Bioessays 18:113-121[Medline].

MAUGHAN, P. J., M. A. S. MAROOF, and G. R. BUSS, 1996  Molecular-marker analysis of seed-weight: genomic locations, gene action, and evidence for orthologous evolution among three legum