Genetics, Vol. 149, 1677-1691, August 1998, Copyright © 1998

Demand Theory of Gene Regulation. II. Quantitative Application to the Lactose and Maltose Operons of Escherichia coli

Michael A. Savageaua
a Department of Microbiology and Immunology, The University of Michigan, Ann Arbor, Michigan 48109-0620

Corresponding author: Michael A. Savageau, 5641 Medical Science Bldg. II, Department of Microbiology and Immunology, The University of Michigan Medical School, Ann Arbor, MI 48109-0620., savageau{at}umich.edu (E-mail).

Communicating editor: R. H. DAVIS


*  ABSTRACT
*TOP
*ABSTRACT
*Life cycle of E....
*Ecology and gene expression
*ESTIMATION OF PARAMETER VALUES
*SELECTION OF WILD-TYPE...
*DISCUSSION
*LITERATURE CITED

Induction of gene expression can be accomplished either by removing a restraining element (negative mode of control) or by providing a stimulatory element (positive mode of control). According to the demand theory of gene regulation, which was first presented in qualitative form in the 1970s, the negative mode will be selected for the control of a gene whose function is in low demand in the organism's natural environment, whereas the positive mode will be selected for the control of a gene whose function is in high demand. This theory has now been further developed in a quantitative form that reveals the importance of two key parameters: cycle time C, which is the average time for a gene to complete an ON/OFF cycle, and demand D, which is the fraction of the cycle time that the gene is ON. Here we estimate nominal values for the relevant mutation rates and growth rates and apply the quantitative demand theory to the lactose and maltose operons of Escherichia coli. The results define regions of the C vs. D plot within which selection for the wild-type regulatory mechanisms is realizable, and these in turn provide the first estimates for the minimum and maximum values of demand that are required for selection of the positive and negative modes of gene control found in these systems. The ratio of mutation rate to selection coefficient is the most relevant determinant of the realizable region for selection, and the most influential parameter is the selection coefficient that reflects the reduction in growth rate when there is superfluous expression of a gene. The quantitative theory predicts the rate and extent of selection for each mode of control. It also predicts three critical values for the cycle time. The predicted maximum value for the cycle time C is consistent with the lifetime of the host. The predicted minimum value for C is consistent with the time for transit through the intestinal tract without colonization. Finally, the theory predicts an optimum value of C that is in agreement with the observed frequency for E. coli colonizing the human intestinal tract.


THE life cycle of a microbe, in the simplest case, consists of alternative phases. The demand for expression of some effector genes will be high in one phase and low in the other, and adapting the level of expression to this varying demand requires a functional regulatory mechanism. It has long been known that the same regulatory function, for example, induction of gene expression, can be accomplished in one of two different modes: the negative mode involves the removal of a restraining element, which permits expression from a high-level promoter, whereas the positive mode involves the provision of a stimulatory element, which facilitates expression from a low-level promoter. The demand theory of gene regulation provides a selectionist explanation for this fundamental duality (e.g., see SAVAGEAU 1977 Down, SAVAGEAU 1989 Down).

The components of a minimal regulatory mechanism consist of a promoter site, a modulator site, and a regulator gene encoding the protein that binds the modulator site in response to environmental cues. Each of these components is subject to a rate of mutation that is determined by the number of critical bases in its nucleotide sequence and the mutation rate per base per round of DNA replication. A mutant altered in one of these components may exhibit two different phenotypes depending upon the phase of the life cycle in which it is expressed. The growth rate of the organism serves as the relevant phenotype, and selection is based upon differences in growth rate among wild-type and mutant organisms.

The quantitative development of demand theory (SAVAGEAU 1998 Down) combines these elements of life cycle, ecology, physiology, and molecular genetics to predict regions of the C vs. D plot within which selection for the wild-type regulatory mechanisms is realizable. These regions define minimum and maximum values for demand. This theory ties together a number of important variables, including growth rates, mutation rates, and minimum and maximum demands for gene expression. We apply demand theory here to the lactose (lac) and maltose (mal) catabolic systems of Escherichia coli, and we show that this theory also yields predictions for the rate and extent of selection and for minimum, maximum, and optimal cycle times of E. coli that are in reasonable agreement with independent experimental data.


*  Life cycle of E. coli
*TOP
*ABSTRACT
*Life cycle of E....
*Ecology and gene expression
*ESTIMATION OF PARAMETER VALUES
*SELECTION OF WILD-TYPE...
*DISCUSSION
*LITERATURE CITED

The normal life cycle of an organism defines the demand for expression of its effector genes. The life cycle of E. coli as it passes from one host to another will be considered here in terms of two different environments (Figure 1). The first will be identified with the proximal portions of the digestive tract for a lactose-tolerant host that ingests both lactose and starch (which consists largely of maltose). This is the environment in which rapid growth occurs during the transition between stable association with one host and then another. The second environment will be identified with the distal end of the small intestine and the colon of the host in which colonization and slow growth take place. Also included in the second environment will be the host's surroundings through which the bacteria pass to enter a subsequent host. This is admittedly a simplification of a more complex ecology (COOKE 1974 Down; FRETER 1976 Down; SAVAGEAU 1983 Down), but nevertheless it captures the essential features for our purposes here. Additional environments and more complex linkages among them in principle can be handled by the same methods.



View larger version (12K):
In this window
In a new window
Download PPT slide
 
Figure 1. The life cycle of Escherichia coli alternates between two different environments, the proximal portions of the digestive tract (A), where lactose levels are relatively high and maltose levels relatively low, and the distal portions (B), where lactose levels are relatively low and maltose levels relatively high. In the environment labeled A, there is a high demand for expression of the A-specific lac genes and a low demand for expression of the B-specific mal genes, whereas in the alternative environment labeled B the demand for these same genes is reversed. As described in the text, the average time required for the organism to complete its life cycle is denoted by C, which also represents the average time for the A-specific (or B-specific) genes to complete their ON/OFF cycle. The fraction of this cycle time spent in environment A (or B), which also represents the demand for expression of the A-specific (or B-specific) genes, is denoted by D.


*  Ecology and gene expression
*TOP
*ABSTRACT
*Life cycle of E....
*Ecology and gene expression
*ESTIMATION OF PARAMETER VALUES
*SELECTION OF WILD-TYPE...
*DISCUSSION
*LITERATURE CITED

We shall consider the lac operon as representative of a low-demand function governed by the negative mode of control (MILLER and REZNIKOFF 1980 Down) and the mal operon as representative of a high-demand function governed by the positive mode of control (SCHWARTZ 1987 Down). These systems are well studied at the molecular level, and the evidence regarding their mode of control is clear.

Evidence regarding demand for expression of the lac andmal operons of E. coli comes from studies of intestinal ecology. Lactose is a relatively rare sugar in nature (SHALLENBERGER 1974 Down). The host's lactase enzymes, which hydrolyze this disaccharide and thereby permit its utilization by the host, are located in the proximal small intestine and are subject to developmental regulation (DAHLQVIST 1961 Down; KOLDOVSKY and CHYTIL 1965 Down; WALKER 1968 Down). In contrast, maltose, the breakdown product of all dietary starch, is among the most abundant sugars (WIDDAS 1971 Down). The host's maltase enzymes, which hydrolyze this disaccharide and thereby permit its utilization by the host, are located at the distal end of the small intestine and in the colon (DAHLQVIST 1961 Down; ROSENSWEIG and HERMAN 1968 Down). This information suggests that the lac operon of E. coli is likely to be expressed at high levels in the first environment and at low levels in the second, whereas the mal operon is likely to be expressed at high levels in the second environment and at low levels in the first.

The time required for E. coli to pass through the high-demand environment for lactose utilization is about 3 hr. This is one-half the average time required to reach the colon (MADSEN 1992 Down); the 3-hr figure is also based on measured patterns of lactose utilization (BOND and LEVITT 1976 Down; MALAGELADA et al. 1984 Down). Much of the ingested lactose is hydrolyzed to constituent sugars that are absorbed by the host in the proximal small intestine; the remainder is catabolized by the bacteria so that very little lactose normally reaches the colon (BOND and LEVITT 1976 Down). From this 3-hr figure for time in the high-demand environment, one can predict that the cycle time of E. coli will be inversely related to the demand for expression of its lactose operon C = .

We estimate the time for passage through the low-demand environment for maltose utilization to be ~6 hr. This is the average time required for a bolus of ingested food to reach the distal portions of the small intestine and colon (MADSEN 1992 Down). We assume that free maltose is sparse in the proximal portions of the small intestine and that it becomes abundant only in the distal portion of the intestinal tract where the host's maltase enzymes are localized (DAHLQVIST 1961 Down; ROSENSWEIG and HERMAN 1968 Down). From this 6-hr figure for time in the low-demand environment, one can predict that the cycle time of E. coli will be inversely related to 1 minus the demand for expression of its maltose operon C = .


*  ESTIMATION OF PARAMETER VALUES
*TOP
*ABSTRACT
*Life cycle of E....
*Ecology and gene expression
*ESTIMATION OF PARAMETER VALUES
*SELECTION OF WILD-TYPE...
*DISCUSSION
*LITERATURE CITED

The demand theory of gene regulation involves three levels of parameters (SAVAGEAU 1998 Down): constituent parameters, individual growth rates and mutation rates, and macroscopic rate constants. Estimates for the values of the constituent parameters in our model are given below. In a subsequent section we will determine the consequences of these choices by examining other values for each of the parameters. As we shall see, the exact values are not critical for about half of the parameters, whereas the values for two of them are extremely influential.

Reference mutation rate, µ:
For E. coli, the spontaneous mutation rate is estimated to have a nominal value of µ0 = 6E–10 per base per DNA replication (DRAKE 1991 Down). The spontaneous mutation rate for loss of function in the modulator or promoter sites of our model can be determined from estimates of the spontaneous mutation rate per base and the number of critical bases that define these sites.

Relative mutation rate for loss of a high-level promoter site, {pi}:
Promoters encompass a region of ~75 nucleotides upstream of the RNA start site (HARLEY and REYNOLDS 1987 Down; LISSER and MARGALIT 1993 Down). Although most of the information in promoter sequences is localized within two 6-base blocks with variable spacing between them (the "consensus" elements at positions -10 and -35 relative to the start site), there is only limited base conservation at most positions. If we assume that each of 10 nucleotides is critical for the definition of a high-level promoter (SCHNEIDER et al. 1986 Down), then the relative mutation rate is {pi}0 = 10.

Relative mutation rate for gain of a high-level promoter site, {upsilon}:
Spontaneous up-promoter mutations with the positive mode occur at about one-tenth the frequency of spontaneous down-promoter mutations with the negative mode (G. GUSSIN, personal communication). If we assume that a low-level promoter can be converted to a high-level promoter by a single mutation in a critical base, then the relative mutation rate for this gain of a high-level promoter site is {upsilon}0 = 1.

Relative mutation rate for loss of a regulator's functional target site, {tau}:
Targets for the binding of regulator proteins are the modulator sites—operator sites in the case of the negative mode and initiator sites in the case of the positive mode. Operator sites span a region of ~100 nucleotides upstream of the RNA start site (REALLA and COLLADO-VIDES 1996 Down). If we assume that each of 20 nucleotides is critical for the definition of the operator (SCHNEIDER et al. 1986 Down), then the relative mutation rate in this case is {tau}0 = 20. Although the sizes of initiator sites are about one-half those of operator sites, 50 nucleotides upstream of the RNA start site (REALLA and COLLADO-VIDES 1996 Down), the information needed to locate these sites within the genome is similar to that for operators (SCHNEIDER et al. 1986 Down). If we assume that each of 20 nucleotides also is critical for the definition of the initiator, then the relative mutation rate in this case is {tau}0 = 20.

Relative mutation rate for loss of a functional regulator protein, {rho}:
We shall assume that a typical regulator protein has 30 amino acid residues that are critical for binding to its modulator site (operator in the case of the negative mode or initiator in the case of the positive mode) and for properly affecting transcription initiation. This implies that the regulator gene has ~60 bases that are critical because the identity of the base in the third codon position is largely irrelevant. Thus, we obtain a relative mutation rate of {rho}0 = 60 for loss of a functional regulator protein.

Relative mutation rate as a function of gene expression, {epsilon}:
There is evidence to suggest that the rate of spontaneous mutation increases with the rate at which the DNA is being transcribed (DATTA and JINKS-ROBERTSON 1995 Down). There also is evidence to suggest that the rate of spontaneous mutation decreases because of transcription-coupled repair mechanisms (FRANCINO et al. 1996 Down). This is one of the questions we wish to examine further in our quantitative analysis. We initially assume that there is no significant effect on the mutation rate either way; therefore, we assign a value of {epsilon}0 = 1 for this relative mutation rate. If one were to assume a change in mutation rate that is proportional (or inversely proportional) to the rate of transcription, then the mutation rate relative to the reference would be given by {epsilon} = k x 100 (or ), where k is the proportionality constant. (Recall that the capacity for regulation is assumed to be 100 and that expression is assumed to be fully ON or fully OFF.)

Reference growth rate, {gamma}:
We shall assume that E. coli grows with a doubling time of 1 hr in the nutritionally richer of the two environments; thus, the nominal value for the reference growth rate is {gamma}0 = 1.0. This is not an unreasonable value because it is known that bacteria like E. coli can double in a period as short as 20 min (MAALOE and KJELDGAARD 1965 Down). In any case, the simple value of unity provides a convenient reference; should the actual value be different, one can simply rescale the time accordingly, and none of our results would change.

Relative growth rate with loss of normal expression, {lambda}:
Because expression is either fully ON or fully OFF and the capacity for regulation is 100, which supports the nominal growth rate, a failure of expression is assumed to result in a basal level of expression, which would support only a 100-fold reduction in growth rate if there were no other carbon source in the environment. However, in the complex environment of the intestinal tract there are multiple carbon sources, and the reduction in growth rate will therefore be less. We shall assume a 3% reduction in growth rate. Thus, the nominal value for this parameter is set at {lambda}0 = 0.97.

Relative growth rate with superfluous expression, {sigma}:
When the demand is such that a function is normally turned OFF and a regulatory mutation causes the function to be fully expressed under inappropriate circumstances, the cell unnecessarily expends resources for material and energy. Experimental evidence in the case of ß-galactosidase expression in E. coli (NOVICK and WEINER 1957 Down; KOCH 1983 Down) suggests that such inappropriate expression decreases the growth rate by <1%; we shall assume a 0.1% reduction. The growth rate, relative to the reference growth rate, is thus assigned a nominal value of {sigma}0 = 0.999.

Relative growth rate in the more nutritionally deficient of the two environments, {delta}:
From measurements of the mean transit time through the human intestinal tract (CUMMINGS and WIGGINS 1976 Down; GEAR et al. 1980 Down), and the assumption that it is a well-stirred chemostat, one can calculate that the average doubling time for net growth of E. coli in the intestinal tract is about 40 hr (SAVAGEAU 1983 Down). Because the intestinal tract is not a well-stirred chemostat, but rather a very heterogeneous environment in which the growth is undoubtedly faster in the proximal regions and slower in the distal, the doubling time of E. coli in the more deficient distal environment will be longer than the average. There are no good measurements to go by, so we will arbitrarily set the doubling time for growth in the more deficient environment to be two times the average value given above. Thus, the growth rate in the more deficient environment, relative to the reference growth rate in the richer environment, is given by a nominal value of {delta}0 = 0.0125.

Criterion for selection, {theta}:
Our criterion for selection is that each mutant population shall be reduced to no more than 0.05% of the wild-type population or, alternatively, that the sum of the two mutant populations shall be reduced to no more than 0.1% of the wild-type population. This is similar to values that are found in the literature (LECLERC et al. 1996 Down). Thus, the criterion for selection is assigned a nominal value of {theta}0 = 0.0005.

Estimation of macroscopic parameters:
The values of the macroscopic parameters in each environment and for each mode of control are determined as follows. First, the constituent parameters given above are combined to represent the relevant growth rates (gw, gp, gm, gd). The growth rate of the wild-type organism gw in the first environment is {gamma} (the reference), and in the second it is {gamma} multiplied by {delta}, the relative growth rate in the nutritionally deficient environment. The growth rates of the promoter mutants gp, modulator mutants gm, and promoter/modulator (double) mutants gd in the two environments are the same as those of the wild type, but multiplied when appropriate by relative growth rates that reflect either loss of expression that is normally ON ({lambda}) or superfluous expression that is normally OFF ({sigma}). For example, the growth rate of a lac modulator mutant (gm) is {gamma} in the first environment, where its pattern of gene expression mimics that of the wild type, and {gamma}{delta}{sigma} in the second, where expression of the lac operon is superfluous. The growth rate of a mal modulator mutant is {gamma} in the first environment, where its pattern of gene expression mimics the wild type, and {gamma}{delta}{lambda} in the second, where there is a failure to express the mal operon.

Second, the constituent parameters are combined to represent the mutation rates between populations (mpw, mmw, mdp, mdm). Each mutation rate is given by the product of the number of critical bases that define the structure in question ({pi}, {upsilon}, {tau}, or {rho}), the spontaneous mutation rate per base per DNA replication (µ), and a factor reflecting transcription-related mutation or repair ({epsilon}) when appropriate. For example, the rate of production of lac promoter mutants from wild-type organisms (mpw) is {pi}µ{epsilon} in the first environment, where the lac operon is being actively transcribed, and {pi}µ in the second, where it is not. The rate of production of mal promoter mutants from wild-type organisms is {upsilon}µ in the first environment, where the mal operon is not being transcribed, and {upsilon}µ{epsilon} in the second, where it is.

Finally, the growth rates and mutation rates are combined to represent the macroscopic rate-constant parameters that characterize the population dynamics ({alpha}ww, {alpha}pp, {alpha}pw, {alpha}mm, {alpha}mw, {alpha}dd, {alpha}dp, {alpha}dm). For example, the rate constant for net growth of the promoter mutant {alpha}pp is given by its intrinsic growth rate gp minus the rate of loss due to the production of double mutants, which is given by the mutation rate per DNA replication mdp times the intrinsic growth rate of the promoter mutant gp. The rate constant for production of promoter mutants from the wild-type population {alpha}pw is given by the mutation rate per DNA replication mpw times the intrinsic growth rate of the wild-type organism gw. The other rate-constant parameters are determined in a similar fashion. Thus, {alpha}ww = [1 - (mpw + mmw)]gw, {alpha}pp = (1 - mdp)gp, {alpha}pw =mpwgw, {alpha}mm = (1 -mdm)gm, {alpha}mw = mmwgw, {alpha}dd = gd, {alpha}dp = mdpgp, {alpha}dm = mdmgm.


*  SELECTION OF WILD-TYPE REGULATORY MECHANISMS
*TOP
*ABSTRACT
*Life cycle of E....
*Ecology and gene expression
*ESTIMATION OF PARAMETER VALUES
*SELECTION OF WILD-TYPE...
*DISCUSSION
*LITERATURE CITED

Determination of the thresholds for selection:
The threshold for selection of a wild-type promoter or modulator is determined as follows. As described above, the constituent parameters are combined to represent the relevant growth rates and mutation rates that enter into the macroscopic parameters that characterize the population dynamics of mutant and wild-type organisms. The population dynamic equations are solved in the two environments to yield an equation for the ratio of mutant to wild-type population numbers. This ratio is set equal to {theta}, the criterion for selection, and the resulting equation expresses the relationship between cycle time C and the demand for gene expression D that constitutes the threshold for selection.

The threshold for selection of a wild-type promoter or modulator (regulator) can be obtained by solving the threshold equation for C as a function D using the method of bisection (PRESS et al. 1988 Down). These numerically calculated thresholds and the analytically determined asymptotes (SAVAGEAU 1998 Down) are nearly indistinguishable, except in the region of transition between low- and high-C asymptotes. Only those values of C and D that lie in the region of overlap below both the promoter and modulator thresholds will allow selection of the wild-type regulatory mechanism.

Regions in which selection for negative and positive modes is realizable:
When nominal values are assumed for the parameters of the model (see Table 1), one finds the thresholds plotted in Figure 2 and Figure 3. The thresholds shown in Figure 2 for the negative mode of control (3% selection coefficient against the promoter mutant and 0.1% against the modulator mutant) exhibit a narrow region of overlap. In contrast, the thresholds shown in Figure 3 for the positive mode of control (0.1% selection coefficient against the promoter mutant and 3% against the modulator mutant) exhibit a wide region of overlap. As predicted (SAVAGEAU 1998 Down), the positive and negative modes are associated with asymmetric regions of the C vs. D plot in which selection is realizable. Note that the horizontal axis in the case of the positive mode is plotted as values of 1 - D, instead of D; this allows us to distinguish more clearly the threshold values near unity when plotting demand in logarithmic coordinates. The extent of the asymmetry is perhaps more evident when the thresholds for selection against the modulator mutants are plotted on the same scale for both the negative and positive modes (Figure 4). The regions for which selection is realizable are nonoverlapping, which indicates discriminate selection for the positive and negative modes.



View larger version (25K):
In this window
In a new window
Download PPT slide
 
Figure 2. Thresholds for selection of a wild-type regulatory mechanism with a negative mode. (A) The demand is represented with a logarithmic scale, and one sees that Dmin = 4.8E–6 and Dmax = 0.1. (B) The demand is represented with a linear scale. Results are shown for the nominal values of the parameters in Table 1. See text for discussion.



View larger version (32K):
In this window
In a new window
Download PPT slide
 
Figure 3. Thresholds for selection of a wild-type regulatory mechanism with a positive mode. (A) The demand is represented with a logarithmic scale, and one sees that Dmin = 1–0.8 and Dmax = 1–1.5E–5. (B) The demand is represented with a linear scale. Results are shown for the nominal values of the parameters in Table 1. See text for discussion.



View larger version (42K):
In this window
In a new window
Download PPT slide
 
Figure 4. Thresholds for discriminate selection of wild-type regulatory mechanisms with negative or positive modes. The results in Figure 2B and Figure 3B are shown here on the same axes, where it is clear that the Dmax for selection of the negative mode is less than the Dmin for selection of the positive mode.


 
View this table:
In this window
In a new window

 
Table 1. Definitions and nominal values for the constituent parameters that determine the growth rates and mutation rates for organisms with positive and negative modes of control in high- and low-demand environments

Influence of parameters on minimum and maximum values for demand:
Selection requires the demand for gene expression to be greater than the asymptote for the minimum threshold and less than the asymptote for the maximum threshold, that is, Dmin < D < Dmax. The parameters in our model influence these asymptotic values to various degrees. It is important to examine a range of values for each of the parameters because there is some uncertainty in the nominal values for many of them. We have systematically varied each parameter about its nominal value given in Table 1 and observed the resulting changes in the minimum and maximum values for demand. The results are shown in Figure 5.






View larger version (134K):
In this window
In a new window
Download PPT slide
 
Figure 5. Influence of the constituent parameters on the values for Dmin and Dmax. Each parameter is varied about its nominal value, and the resulting values forDmin and Dmax are calculated. Results for the parameter {gamma} are not shown because it exhibits no influence in all cases. (A) Dmax for the negative mode of control. (B) Dmin for the negative mode of control. (C) Dmin for the positive mode of control. (D) Dmax for the positive mode of control. The local parameter sensitivities are summarized in Table 2.

Five classes of influence can be discerned in Figure 5A Figure 5B Figure 5C Figure 5D. First, in many cases there is no discernible influence [ Figure 5A ({pi} and {lambda}), Figure 5B ({rho}, {tau}, {epsilon}, and {sigma}), Figure 5C ({upsilon}, {tau}, {epsilon}, and {sigma}), and Figure 5D ({rho}, {tau}, and {lambda})]. Second, in several cases there is a nearly linear variation with the change in parameter value [ Figure 5A (µ, {rho}, {epsilon}, {delta}, and {theta}), Figure 5B (µ, {pi}, {delta}, and {theta}), and Figure 5D (µ, {upsilon}, {epsilon}, {delta}, and {theta})]. Third, in five cases there is a nearly cube-root influence [ Figure 5A ({tau}) and Figure 5C (µ, {rho}, {delta}, and {theta})]. Fourth, in two cases there is a moderate (order of magnitude) amplification of the response to a change in parameter value [ Figure 5B ({lambda}) and Figure 5C ({lambda})]. Finally, in two cases there is an extreme (1000-fold) amplification of the response to a change [ Figure 5A ({sigma}) and Figure 5D ({sigma})]. The results obtained for the negative and positive modes exhibit different patterns. The influences in the local region about the nominal values can be summarized numerically by the parameter sensitivities (SHIRAISHI and SAVAGEAU 1992 Down), as shown in Table 2.


 
View this table:
In this window
In a new window

 
Table 2. Influence of the constituent parameters on the values of Dmin and Dmax

From these results one can see that the most influential parameter is the selection coefficient that reflects the diminished growth rate of the organism when there is superfluous gene expression. For the negative mode of gene control, this corresponds to the diminished growth rate of the modulator (regulator) mutants that express the effector function constitutively when it should be OFF. For the positive mode, this corresponds to the diminished growth rate of the promoter mutants that express the effector function at a high level when it should be OFF.

Time course of selection:
The numbers of wild-type, modulator-mutant, and promoter-mutant organisms are represented by the variables Xw, Xm, and Xp. The ratio is equal to the reciprocal of the mutant fraction, which we define as fm. If we start with equal numbers for the two types of mutants and a ratio that is one-tenth of its steady-state value, then the enrichment of the wild-type regulatory mechanism with time is obtained from the solution of the population dynamic equations (Equation 9 in SAVAGEAU 1998 Down). Given the nominal values for the parameters in Table 1, the time course of selection for various values of demand D is as shown in Figure 6. The temporal behavior of the populations is independent of the cycle time C. The time scale is actually discrete, given by values of nC, where n is the number of cycles. Thus, within a fixed time period, the same degree of enrichment can be achieved with either a large value for C and a small number n, or a small value for C and a larger number n. Note that the negative mode of gene control emerges more rapidly as demand for gene expression increases, whereas the positive mode of gene control emerges more rapidly as demand for gene expression decreases (Figure 6). The extent and rate of selection are examined in greater detail below.



View larger version (20K):
In this window
In a new window
Download PPT slide
 
Figure 6. Enrichment of the wild-type regulatory mechanism with time. The numbers of wild-type, modulator-mutant, and promoter-mutant organisms are represented by the variables Xw,Xm, and Xp. Initially, the ratio , which is the reciprocal of the mutant fraction fm, is one-tenth of its steady-state (ss) value, and the two types of mutants are equally abundant. The ratio is normalized and plotted as a function of time in units of nC (see text for discussion). The normalized ratio is given by 10 x or 10 so that its values vary between 0 and 1 on a logarithmic scale. Time courses are shown for various values of demand D. (A) The negative mode of gene control with a cycle time of C = 3000 hr. (B) The positive mode of gene control with a cycle time of C = 300 hr.

Extent of selection:
We define the extent of selection as the steady-state value of the ratio , which is the inverse of the mutant fraction in the population (). Although there is selection for the wild-type regulatory mechanism throughout the region of overlap beneath the thresholds (e.g., Figure 2A and Figure 3A), the extent of the selection varies as a function of cycle time C and demand D. For a given value of C, the extent of selection reaches its maximum at a value of D that is roughly the geometric mean of its threshold values. With the nominal values for the parameters (Table 1), the results for the negative mode of gene control are as shown in Figure 7A; the results for the positive mode are similar to those for the negative mode, except that the allowable values for demand now occur in the high-demand region of the plot (Figure 7B). The maximum extent of selection for the positive mode of gene control is ~10-fold greater than that for the negative mode.



View larger version (17K):
In this window
In a new window
Download PPT slide
 
Figure 7. The extent of selection as a function of demand for gene expression D and cycle time C. The extent of selection is given by the steady-state value of the ratio or . The parameters have the nominal values given in Table 1. (A) Negative mode of gene control. (B) Positive mode of gene control. See text for discussion.

Rate of selection:
The rate at which selection occurs is independent of cycle time. We define response time as the time required for the ratio to reach 99% of its steady-state value, starting from an initial state in which the numbers of the two types of mutants are equal and the ratio is equal to one-tenth of its steady-state value. Recall that the time points are given in units of nC, where C is the cycle time and n is the number of cycles. The same temporal behavior is obtained regardless of whether C is large (n small) or small (n large). However, the resolution is poorer for large values of C because the minimum value of n is one.

Like the extent of selection, the rate of selection is strongly dependent on the demand for gene expression. Although selection in the case of the negative mode can occur near the lower limit of allowable values for D, the response time is very long. Response time decreases in an inverse fashion as D increases, until a lower plateau is reached (Figure 8A). The break in the curve occurs at approximately the value of D that yields the maximum extent of selection (see Figure 7A). The minimum response time with the nominal values for the parameters is ~294,000 hr (~36 yr).



View larger version (14K):
In this window
In a new window
Download PPT slide
 
Figure 8. The rate of selection as a function of demand for gene expression D. Response time is measured as the time required to achieve 99% of the steady-state value for the ratio starting from an initial state in which this ratio is initially one-tenth of the steady-state value and the mutants are equally abundant. The parameters have the nominal values given in Table 1. (A) Negative mode of gene control. (B) Positive mode of gene control. See text for discussion.

Similar results are found with the positive mode of gene control (Figure 8B), except that the long response times occur near the upper limit of allowable values for D. The response time decreases as D decreases until a minimum is reached, and then it increases. For the same extent of selection as the negative mode (18,400), the positive mode exhibits a faster response time (~17,000 vs. 294,000 hr); alternatively, for the same response time as the negative mode (294,000 hr), the positive mode exhibits a greater extent of selection (~214,000 vs. 18,400). Thus, it appears that the positive mode of gene control is capable of achieving greater extents of selection with faster response times than is the case for the negative mode of control.

Minimum cycle time:
Estimates for the minimum cycle time of E. coli passing from one host to another can be obtained by combining the information in Figure 2A with the inverse relationship between C and D for the lactose operon of E. coli. (Recall from the ECOLOGY AND GENE EXPRESSION section that C = .) The intersection of this inverse relationship with the threshold for selection of the wild-type modulator (regulator) gives a value of Cmin = 26 hr (Figure 9A). Another estimate of minimum cycle time can be obtained by combining the information in Figure 3A with the inverse relationship between C and (1 - D) for the maltose operon of E. coli. [Recall from the ECOLOGY AND GENE EXPRESSION section that C = .] The intersection of this inverse relationship with the threshold for selection of the wild-type modulator (regulator) gives a value of Cmin = 10 hr (Figure 9B).



View larger version (20K):
In this window
In a new window
Download PPT slide
 
Figure 9. Predicted values for cycle times Cmin,Cmax, and Cop based on the transit time through the proximal portions of the intestinal tract in humans. (A) Locus of C values for the negative mode of control is given by the 1/D relationship based on the period available for bacterial utilization of lactose (~3 hr). (B) Locus of C values for the positive mode of control is given by the 1/D relationship based on the period not available for utilization of maltose (~6 hr). The locus in each case intersects the threshold for selection of the wild-type modulator (regulator) at the value for Cmin and the threshold for selection of the wild-type promoter at the value for Cmax. The value of Cop is found on the locus at a value of Dop, which corresponds to the value of D that yields the optimum extent (Figure 7) and rate (Figure 8) of selection. See text for discussion.

Maximum cycle time:
Estimates for the maximum cycle time of E. coli passing from one host to another also can be obtained by combining the information in Figure 2A with the inverse relationship between C and D for the lactose operon of E. coli. The intersection of the inverse relationship C = with the threshold for selection of the wild-type promoter (Figure 9A) gives a value of Cmax = 580,000 hr (~66 yr). Again, the data for the maltose operon in Figure 3A provide an alternative estimate. The intersection of the inverse relationship C = with the threshold for selection of the wild-type promoter (Figure 9B) gives a value of Cmax = 502,000 hr (~57 yr).

Optimal cycle time:
Although the estimates for minimum and maximum cycle time in the preceding sections are of some interest, perhaps the more relevant issue is the nominal value of the cycle time for E. coli in its natural environment. We argue that the most probable values for the cycle time will be those corresponding to the values for demand that lead to the optimal extent and rate of selection. For the negative mode, the optimum extent and rate of selection occur with a value of demand Dop {approx} 0.001 (Figure 7A and Figure 8A). Combining this optimum value for D with the inverse relationship C = in Figure 9A for the lactose operon yields an estimate for the nominal value of the cycle time, namely, Cop = 3000 hr (~4 mon). For the positive mode, the optimum extent and rate of selection occur with a value of demand 1 - Dop {approx} 0.01 (Figure 7B and Figure 8B). Combining this optimum value for 1 - D with the inverse relationship C = in Figure 9B for the maltose operon yields an estimate of Cop = 800 hr (~33 days).


*  DISCUSSION
*TOP
*ABSTRACT
*Life cycle of E....
*Ecology and gene expression
*ESTIMATION OF PARAMETER VALUES
*SELECTION OF WILD-TYPE...
*DISCUSSION
*LITERATURE CITED

The application of demand theory presented in this article provides an opportunity to test a number of the theory's quantitative implications. The results in SAVAGEAU 1998 Down led to the prediction of well-defined regions within the C vs. D plot where selection for the positive and negative modes of gene control is realizable. These regions allow one for the first time to specify precisely what is meant by high and low demand. With the nominal values for the parameters of the lactose and maltose operons in E. coli, selection of the negative mode of control requires a demand between 0.000005 and 0.1 (Figure 2A), whereas selection of the positive mode requires a demand between 0.2 and 0.999985 (Figure 3A). Furthermore, these regions were predicted to exhibit an inherent asymmetry with the positive mode having the larger region within which selection is realizable. This is clearly seen in the case of the lactose and maltose examples analyzed here (Figure 4).

Although the minimum and maximum values of demand are influenced by a number of parameters, by far the most influential parameter is {sigma}, which reflects the reduction in growth rate when there is superfluous expression of a gene (Figure 5A and Figure 5D). The nominal value for this parameter was set at 0.1%, on the basis of data for the lactose operon that suggest a value <1% for the reduction in growth rate of operator-constitutive mutants in a low-demand environment. In the case of the positive mode, the same value was used to characterize the reduction in growth rate of an up-promoter mutant in a low-demand environment. A 0.1% variation in {sigma} yields a twofold change in the value of Dmax for both the negative and positive mode (Table 2). The remaining parameters have much less influence on the limits of D; approximately one-half exhibit a nearly linear influence, whereas the other half have a negligible influence.

The ratio of mutation rate to selection coefficient is the most relevant determinant of the realizable region for selection. Indeed, if the target sizes for the various types of mutations are increased by an order of magnitude (e.g., to match the footprint for binding a regulator protein to its modulator site on the DNA) at the same time the selection coefficients are increased by an order of magnitude, then the results are essentially unchanged (data not shown).

The results in Figure 5 suggest that the effect of transcription on mutation rate may be significant only if it reduces the mutation rate. The parameter {epsilon}, which represents this effect, has no influence on the selection of the wild-type promoter when there is a negative mode of control (Figure 5B). This is counter to the intuitive expectation that suggests a lower mutation rate would aid the selection of the wild-type promoter when it is not in use. The results in Figure 5A show that the parameter {epsilon} can represent an increased selection for the wild-type repressor-modulator interaction (increased Dmax) if there is an increase in transcription-coupled repair (decrease in {epsilon}). In the case of the positive mode, {epsilon} has negligible influence on the selection of the wild-type activator-modulator interaction (Figure 5C), and its influence on the selection of the wild-type promoter (Figure 5D) would appear to be of little consequence because the threshold value of Dmax in this case is already so high. Given the nominal values we have used for the parameters, {epsilon} does not seem to be highly significant, and similar effects can be achieved by varying other parameters; nevertheless, {epsilon} might still be important for selection under other conditions.

The equations that characterize the population dynamics of mutant and wild-type organisms (Equations 7–11 in SAVAGEAU 1998 Down) led to the prediction that the extent of selection is a function of cycle time C and maximal at intermediate values of demand D, whereas the rate of selection is independent of cycle time C. Indeed, with the parameter values in Table 1, the extent of selection increases, reaches a maximum, and then declines as demand increases (Figure 7). As seen in Figure 8, the time required to reach full selection decreases until a minimum is reached with increasing demand (negative mode) or decreasing demand (positive mode). The combination of these results suggests that the optimum extent and rate of selection occurs at around D = 0.001 for the negative mode and 1 - D = 0.01 for the positive mode. In the case of the positive mode, this represents a choice of 1 - D that yields a rate of selection that is nearly equivalent to the optimum for the negative mode.

The quantitative theory reveals a number of new relationships involving cycle time that can be tested against experimental data in the case of the lactose and maltose operons of E. coli. The first such relationship provides an estimate for the minimum value of the cycle time Cmin. We obtained values of 26 hr (Figure 9A) and 10 hr (Figure 9B), which is on the same order of magnitude as the 40 hr required on average for transit through the entire intestinal tract (CUMMINGS and WIGGINS 1976 Down; GEAR et al. 1980 Down; SAVAGEAU 1983 Down). Under these circumstances, E. coli is simply passing through the intestinal tract without colonizing the colon. Clearly, the cycle time can be no shorter than this period.

The second relationship provides an estimate for the maximum value of the cycle time Cmax. We have estimated this value to be ~580,000 hr (~66 yr) in the case of the lactose operon (Figure 9A) and 502,000 hr (~57 yr) in the case of the maltose operon (Figure 9B). These values for Cmax are on the same order of magnitude as the 120-yr maximum for the life span of humans (HAYFLICK 1977 Down). Clearly, the cycle time for E. coli can be no longer than the life time of the host because the bacteria will die with the host if they do not colonize a new host.

The final relationship provides an estimate for the optimum value of the cycle time Cop. The optimum extent and rate of selection determined for the lactose operon suggest a demand in the neighborhood of Dop = 0.001. This value of D, taken together with the relationship D = , predicts an optimum cycle time of Cop = 3000 hr (~4 mon). The corresponding estimate based on the maltose operon is Cop = 800 hr (~33 days). These predicted values for the cycle time of E. coli are comparable with the cycle times (recolonization rates) of months to years that have been observed in humans for resident strains of E. coli (SEARS et al. 1950 Down; SEARS and BROWNLEE 1952 Down; CAUGANT et al. 1981 Down).

In summary, the quantitative development of demand theory presented in SAVAGEAU 1998 Down and applied here provides the first estimates for the minimum and maximum values of demand that are required for selection of the positive and negative modes of gene control. The specific application to the maltose and lactose operons of E. coli suggests that the positive and negative modes of control for these genes are subject to selection throughout the full range of cycle times that are possible for this microbe. Moreover, the cycle times predicted on the basis of the optimal extent and rate of selection are in agreement with the typical cycle times that have been observed experimentally.


*  ACKNOWLEDGMENTS

I thank Drs. S. COOPER, R. G. FRETER, D. E. KIRSCHNER, J. V. NEEL, and M. S. SWANSON for critically reading the manuscript and two anonymous reviewers who made valuable suggestions for improving the parameter estimates. This work was supported in part by U.S. Public Health Service grant RO1-GM30054 from the National Institutes of Health and U.S. Department of Defense grant N00014-97-1-0364 from the Office of Naval Research.

Manuscript received December 15, 1997; Accepted for publication May 6, 1998.


*  LITERATURE CITED
*TOP
*ABSTRACT
*Life cycle of E....
*Ecology and gene expression
*ESTIMATION OF PARAMETER VALUES
*SELECTION OF WILD-TYPE...
*DISCUSSION
*LITERATURE CITED

BOND, J. H. and M. D. LEVITT, 1976  Quantitative measurement of lactose absorption. Gastroenterology 70:1058-1062[Medline].

CAUGANT, D. A., B. R. LEVIN, and R. K. SELANDER, 1981  Genetic diversity and temporal variation in the E. coli population of a human host. Genetics 98:467-490[Abstract/Free Full Text].

COOKE, E. M., 1974 Escherichia coli and Man. Churchill Livingstone, London.

CUMMINGS, J. H. and H. S. WIGGINS, 1976  Transit through the gut measured by analysis of a single stool. Gut 17:219-223[Abstract/Free Full Text].

DAHLQVIST, A., 1961  The location of carbohydrases in the digestive tract of the pig. Biochem. J. 78:282-288[Medline].

DATTA, A. and S. JINKS-ROBERTSON, 1995  Association of increased spontaneous mutation rates with high levels of transcription in yeast. Science 268:1616-1619[Abstract/Free Full Text].

DRAKE, J. W., 1991  A constant rate of spontaneous mutation in DNA-based microbes. Proc. Natl. Acad. Sci. USA 88:7160-7164[Abstract/Free Full Text].

FRANCINO, M. P., L. CHAO, and M. A. RILEY, 1996  Asymmetries generated by transcription-coupled repair in enterobacterial genes. Science 272:107-109[Abstract].

FRETER, R., 1976 Factors controlling the composition of the intestinal micro-flora, pp. 109–120 in Proceedings of the Microbial Aspects of Dental Caries, special supplement, Microbiology Abstracts, Vol. 1, edited by H. M. STILES, W. J. LOESCHE and T. C. O'BRIEN. American Society for Microbiology, Washington, DC.

GEAR, J. S. S., A. J. M. BRODRIBB, A. WARE, and J. T. MANN, 1980  Fiber and bowel transit times. Br. J. Nutr. 45:77-82.

HARLEY, C. B. and R. P. REYNOLDS, 1987  Analysis of E. coli promoter sequences. Nucleic Acids Res. 15:2343-2361[Abstract/Free Full Text].

HAYFLICK, L., 1977 The cellular basis for biological aging, pp. 159–186 in Handbook of the Biology of Aging, edited by C. E. FINCH and L. HAYFLICK. Van Nostrand Reinhold, New York.

KOCH, A. L., 1983  The protein burden of lac operon products. J. Mol. Evol. 19:455-462[Medline].

KOLDOVSKY, O. and F. CHYTIL, 1965  Postnatal development of ß-galactosidase activity in the small intestine of the rat: effect of adrenalectomy and diet. Biochem. J. 94:266-270[Medline].

LECLERC, J. E., B. LI, W. L. PAYNE, and T. A. CEBULA, 1996  High mutation frequencies among Escherichia coli andSalmonella pathogens. Science 274:1208-1211[Abstract/Free Full Text].

LISSER, S. and H. MARGALIT, 1993  Compilation of E. coli mRNA promoter sequences. Nucleic Acids Res. 21:1507-1516[Abstract/Free Full Text].

MAALØE, O., and N. O. KJELDGAARD, 1965 Control of Macromolecular Synthesis. Benjamin, New York.

MADSEN, J. L., 1992  Effects of gender, age, and body mass index on gastrointestinal transit times. Digest. Dis. Sci. 37:1548-1553.

MALAGELADA, J.-R., J. S. ROBERTSON, M. L. BROWN, M. REMINGTON, and J. A. DUENES et al., 1984  Intestinal transit of solid and liquid components of a meal in health. Gastroenterology 87:1255-1263[Medline].

MILLER, J. H., and W. S. REZNIKOFF, 1980 The Operon. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY.

NOVICK, A. and M. WEINER, 1957  Enzyme induction as an all-or-none phenomenon. Proc. Natl. Acad. Sci. USA 43:553-566[Free Full Text].

PRESS, W. H., B. P. FLANNERY, S. A. TEUKOLSKY and W. T. VETTERLING, 1988 Numerical Recipes in C. Cambridge University Press, New York.

REALLA, J. D., and J. COLLADO VIDES, 1996 Organization and function of transcription regulatory elements, pp. 1232–1245 in Escherichia coli and Salmonella, Vol. I, Ed. 2, edited by F. C. NEIDHARDT. ASM Press, Washington, DC.

ROSENSWEIG, N. S. and R. H. HERMAN, 1968  Control of jejunal sucrase and maltase activity by dietary sucrose or fructose in man: a model for the study of enzyme regulation in man. J. Clin. Invest. 47:2253-2262.

SAVAGEAU, M. A., 1977  Design of molecular control mechanisms and the demand for gene expression. Proc. Natl. Acad. Sci. USA 74:5647-5651[Abstract/Free Full Text].

SAVAGEAU, M. A., 1983  Escherichia coli habitats, cell types, and molecular mechanisms of gene control. Am. Naturalist 122:732-744.

SAVAGEAU, M. A., 1989 Are there rules governing patterns of gene regulation? pp. 42–66 in Theoretical Biology—Epigenetic and Evolutionary Order, edited by B. C. GOODWIN and P. T. SAUNDERS. Edinburgh University Press, Edinburgh.

SAVAGEAU, M. A., 1998  Demand theory of gene regulation. I. quantitative development of the theory. Genetics 149:1665-1676[Abstract/Free Full Text].

SCHNEIDER, T. D., G. D. STORMO, L. GOLD, and A. EHRENFEUCHT, 1986  Information content of binding sites on nucleotide sequences. J. Mol. Biol. 188:415-431[Medline].

SCHWARTZ, M., 1987 The maltose regulon, pp. 1482–1502 in Escherichia coli and Salmonella typhimurium: Cellular and Molecular Biology, edited by F. C. NEIDHARDT. American Society for Microbiology, Washington, DC.

SEARS, H. I. and I. BROWNLEE, 1952  Further observations on the persistence of individual strains of Escherichia coli in the intestinal tract of man. J. Bacteriol. 63:47-57[Free Full Text].

SEARS, H. I., I. BROWNLEE, and J. K. UCHIYAMA, 1950  Persistence of individual strains of E. coli in the intestinal tract of man. J. Bacteriol. 59:293-301[Free Full Text].

SHALLENBERGER, R. S., 1974 Occurrence of various sugars in foods, pp. 67–80 in Sugars in Nutrition, edited by H. L. SIPPLE and K. W. MCNUTT. Academic Press, New York.

SHIRAISHI, F. and M. A. SAVAGEAU, 1992  The tricarboxylic acid cycle in Dictyostelium discoideum II. Evaluation of model consistency and robustness. J. Biol. Chem. 267:22919-22925[Abstract/Free Full Text].

WALKER, D. G., 1968 Developmental aspects of carbohydrate metabolism, pp. 465–496 in Carbohydrate Metabolism, Vol. 1, edited by F. DICKENS, P. J. RANDLE and W. J. WHELAN. Academic Press, New York.

WIDDAS, W. F., 1971 The role of the intestine in sucrose absorption, pp. 155–171 in Sugar, edited by J. YUDKIN, J. EDELMAN and L. HOUGH. Butterworths, Woburn, MA.




This article has been cited by other articles:


Home page
Proc. Natl. Acad. Sci. USAHome page
E. Libby, T. J. Perkins, and P. S. Swain
Noisy information processing through transcriptional regulation
PNAS, April 24, 2007; 104(17): 7151 - 7156.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
G. Shinar, E. Dekel, T. Tlusty, and U. Alon
Rules for biological regulation based on error minimization.
PNAS, March 14, 2006; 103(11): 3999 - 4004.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
M. Thattai and A. van Oudenaarden
Stochastic Gene Expression in Fluctuating Environments
Genetics, May 1, 2004; 167(1): 523 - 530.
[Abstract] [Full Text] [PDF]


Home page