| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Genetics, Vol. 173, 1851-1869, August 2006, Copyright © 2006
doi:10.1534/genetics.105.049619
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||



* Laboratoire Adaptation et Pathogénie des Microorganismes, Université Joseph Fourier, CNRS UMR 5163, 38041 Grenoble, France,
Laboratoire Chimie des Proteines DRDC-CP, ERM 0201 CEA/INSERM/UJF, 38054 Grenoble, France and
Department of Microbiology and Molecular Genetics, Michigan State University, East Lansing, Michigan 48824
1 Corresponding author: Laboratoire Adaptation et Pathogénie des Microorganismes, Université Joseph Fourier, CNRS UMR 5163, Institut Jean Roget, Domaine de la Merci, La Tronche 38700, France.
E-mail: dominique.schneider{at}ujf-grenoble.fr
| ABSTRACT |
|---|
|
|
|---|
Parallel evolution is defined as the independent evolution of the same trait in different lineages (FUTUYMA 1986). Independent populations of various types of organismsfrom bacteriophages and bacteria to insects and vertebrateshave in many studies been shown to evolve parallel phenotypic traits under similar environmental conditions (VASI et al. 1994; LOSOS et al. 1998; RAINEY and TRAVISANO 1998; FEREA et al. 1999; WICHMAN et al. 1999; HUEY et al. 2000; NOSIL et al. 2002; NACHMAN et al. 2003; ZHANG 2003; SCHLUTER 2004; COLOSIMO et al. 2005). The statistically predictable relationship between environmental features and the evolution of organismal traits is unlikely to arise by genetic drift and instead strongly implicates natural selection. Thus, parallel and convergent changes in phenotypic traits are widely regarded as hallmarks of adaptive evolution.
Molecular studies focusing on specific phenotypic traits have also sometimes been successful in identifying the genetic basis of the parallel evolution of these phenotypic characters. For example, parallel evolution of resistance to heavy metals, insecticides, and drugs led to the discovery of the molecular changes underlying these adaptations in organisms as diverse as plants, insects, and viruses (SCHAT et al. 1996; CRANDALL et al. 1999; DABORN et al. 2002). Most such studies of parallel molecular evolution have focused on the genes underlying a few traits, such as resistance, that reflect adaptation to a specific selective agent or other environmental factor. However, fitness itself is a complex phenotypic trait that emerges from all the interactions among the molecular components of an entire organism. Therefore, it is of considerable interest, as well as a great challenge, to assess the extent of parallel evolution for as many traits and genes as possible in a given system.
The development of tools to measure global gene expression profiles offers the new opportunity to measure simultaneously hundreds or thousands of expression-level phenotypic traits. These measurements are possible either at the level of messenger RNAs by using DNA macroarrays (DERISI et al. 1997) or at the protein level with two-dimensional proteomic electrophoresis (LEE and LEE 2003). A model system in which both sets of data could be obtained and compared would, on the one hand, advance our understanding of the overall extent and importance of parallelism during the evolutionary process and, on the other hand, provide new insight into the relative contribution of different levels of gene regulation to parallel changes. Both methodologies could also be used, alone or in combination, to help reveal the type and identity of the underlying mutations. The strength of such an analysis depends on the assumption that any parallel changes observed are independent and not based on common ancestry. This assumption can be fulfilled with certainty in experiments where replicate microbial populations independently evolve from the same founding ancestral genotype (ELENA and LENSKI 2003). The availability of genomic resources for the study organism also facilitates further research in identifying the molecular mechanisms underlying adaptation to particular environments on the basis of the pattern of parallel phenotypic changes.
A clone of Escherichia coli B was used as the common ancestor to found the longest-running evolution experiment (LENSKI 2004). Twelve independent populations were initiated from this ancestor and propagated by daily serial transfer in the same defined glucose-limited environment for >20,000 generations (LENSKI et al. 1991; LENSKI and TRAVISANO 1994; COOPER and LENSKI 2000). Over time, the populations diverged genetically from their ancestor and genetic diversity accumulated within the evolving populations (PAPADOPOULOS et al. 1999). All the replicate populations achieved more or less parallel gains in competitive fitness, indicative of substantial adaptation, on the basis of competition experiments with marked variants of the ancestor in the same glucose-limited environment (LENSKI and TRAVISANO 1994; COOPER and LENSKI 2000). Several other phenotypic aspects also evolved in parallel, including cell size (LENSKI et al. 1998), growth parameters (VASI et al. 1994), catabolic functions (COOPER and LENSKI 2000), and DNA topology (CROZAT et al. 2005). Moreover, analyses of gene expression profiles using DNA macroarrays for 2 of the populations revealed parallel changes in the transcription of 59 genes after 20,000 generations of experimental evolution (COOPER et al. 2003). The genetic bases of some of these parallel phenotypic changes were investigated and mutations discovered in genes involved in ribose utilization (COOPER et al. 2001), DNA topology (CROZAT et al. 2005), and the stringent response (COOPER et al. 2003). In each of these cases, relevant mutations were found in many or all of the independently evolved populations, implying parallel molecular evolution. By constructing isogenic strains that differed by single mutations and performing competition experiments in the same glucose-limited environment of the evolution experiment, 4 of these genes were demonstrated to be targets of natural selection (COOPER et al. 2001, 2003; CROZAT et al. 2005): spoT, involved in the metabolism of the signaling molecule ppGpp (CASHEL et al. 1996); topA, encoding topoisomerase I (WANG 1971); fis, encoding a histone-like protein (DORMAN and DEIGHAN 2003); and the rbs ribose-utilization operon (ABOU-SABÉ et al. 1982). However, summing the fitness gains caused by the mutations in these 4 genes explains only about one-third of the average fitness increase after 20,000 generations of evolution, indicating that more beneficial mutations await discovery, consistent with an estimate of
1020 beneficial mutations substituted in each population (LENSKI 2004). By contrast to these cases of parallel genetic changes, sequencing 36 gene regions chosen at random found no mutations at all except in 4 populations that evolved mutator phenotypes (SNIEGOWSKI et al. 1997; COOPER et al. 2003; LENSKI et al. 2003).
In this study, we analyzed the extent of parallel change at the protein level in two evolved clones, each from an independently evolved population, by performing two-dimensional protein gel electrophoresis. Their proteomic profiles were compared with one another and with the profile obtained for their common ancestor. The two evolved clones that we analyzed were the same ones that had been investigated previously using DNA macroarrays (COOPER et al. 2003), which allowed us to compare the inferences on the basis of transcriptional and proteomic profiles. Although we observed impressive correspondence between these two approaches in terms of an important suite of parallel changes involving a global regulatory pathway, our study also identified an additional target of natural selection involving a lower level of regulation that affected 8 of the 12 populations.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Strain BW3216, isolated after 280 generations of glucose-limited growth in a chemostat (NOTLEY-MCROBB and FERENCI 1999), was used to isolate a malT mutant allele that shows constitutive upregulation of the maltose regulon. Strain BW2951 is an MC4100 derivative that carries a lamBlacZ transcriptional fusion marked with kanamycin resistance (NOTLEY and FERENCI 1995). Strain pop7164 carries a malT deletion allele (SCHREIBER et al. 2000) and was used as a control in immunoblotting experiments.
For cloning experiments, we used E. coli JM109 (YANISCH-PERRON et al. 1985) with standard procedures (SAMBROOK et al. 1989) or TOP10 (Invitrogen, San Diego) according to the supplier's recommendations. Plasmids pCRII-Topo (Invitrogen) and pKO3 (LINK et al. 1997) were used for cloning experiments and allelic replacements, respectively.
Strains were grown in Davis minimal (DM) medium supplemented with 25 µg/ml glucose (DM25) as in the evolution experiment (LENSKI et al. 1991) or with 250 µg/ml glucose (DM250) or were grown in rich LB medium (SAMBROOK et al. 1989). The same minimal medium except with maltose substituted for glucose was used to assess maltose utilization capacity. Unless otherwise stated, DM25 or DM250 means that glucose was the carbon source. Chloramphenicol (30 µg/ml) or kanamycin (50 µg/ml) were added as needed.
Genetic techniques and ß-galactosidase assay:
The lamBlacZ transcriptional fusion was moved into the ancestral or derived backgrounds by P1 transduction, using P1vir (MILLER 1992), strain BW2951 as donor, and selecting for kanamycin resistance. The ß-galactosidase activities were assayed by use of o-nitrophenyl ß-D-galactopyranoside as a substrate and were calculated as micromolars of o-nitrophenol per minute per milligram of cellular protein (MILLER 1992). All activity values reported are the average of three independent experiments.
Protein preparation and two-dimensional polyacrylamide gel electrophoresis:
Cultures were incubated at 37° for 24 hr in DM250 medium. Cell pellets were harvested by centrifugation at 5000 x g, washed once with 50 mM Tris, pH 8, resuspended in 200 µl of 50 mM Tris, 0.2 M DTT, 0.3% SDS (w/v), and 1 mM EDTA, and boiled for 5 min. Twenty microliters of 50 mM Tris, pH 7.5, 50 mM MgCl2, containing 1 mg/ml DNase I and 0.25 mg/ml RNase A (Roche), were added to the cell extracts and the resulting suspensions were incubated for 10 min at 4°. Finally, proteins were solubilized by adding 800 µl of buffer containing 10 M urea, 4% NP40 (v/v), 0.1 M DTT, and 2.2% ampholines with a pH range from 3 to 10 (v/v) (Amersham Pharmacia). Protein samples were stored at 20°.
Two-dimensional polyacrylamide gel electrophoresis (PAGE) was performed using a Multiphor II system (Amersham Pharmacia) for isoelectrofocalization and the Protean II xi cell (Bio-Rad, Hercules, CA) for SDSPAGE. Dry strips (laboratory made, 18 cm, pH range from 4 to 8) were hydrated in the protein samples (500 µg) prepared in 2 M thiourea, 5 M urea, 1.6% 3-[(3-cholamidopropyl)-dimethylammonio]-1-propanesulfonate (w/v), 100 µM DTT, and 0.8% ampholines (v/v). Isoelectrofocalizations were run until pH equilibrium was reached (LELONG and RABILLOUD 2003). These strips were equilibrated for 10 min in Tris, pH 6.8, containing 2.5% SDS (w/v), 30% glycerol (v/v), 6 M urea, and 50 mM DTT. This step was repeated in the same buffer, with DTT replaced by 100 mM iodoacetamide. For the second-dimension electrophoresis, SDSPAGE was performed as described (LAEMMLI 1970), using 12% acrylamide gels, with the following modifications: cathode and anode migration buffers consisted of 50 mM Tris, 0.1% SDS (w/v) with 200 mM taurine, and 380 mM glycine, respectively. Proteins were stained with either Coomassie brilliant blue R-250 or silver.
Analysis of two-dimensional protein gels:
Quantitative analysis of the protein gels was performed using Melanie II software (Genebio, Geneva, Switzerland). The global protein profile of the three strains (two evolved, one ancestral) was performed in triplicate from three independent cultures of each strain. Each three-part set of comparisons was performed in parallel on the same day and with the same media and solutions to produce gels with similar background and signal levels for quantitative analysis. This approach allowed us to achieve excellent reproducibility by making gel parameters maximally consistent for each set of comparisons. Our quantitative analysis proceeded in three steps as follows. First, for every protein spot on each gel, we calculated its relative abundance as the ratio between its volume and the total volume of all spots on the same gel. Volume is defined as a function of optical density integrated over the spot area. Second, changes in protein expression of each evolved clone were standardized using the ancestor as a reference. Thus, a value of 2.0 means that the relative abundance of a particular protein was twice as high in the evolved clone as in its ancestor, whereas a value of 0.5 indicates that the relative abundance in the evolved clone was only half that in the ancestor. Third, the mean and standard deviations of the standardized values were computed for each spot from the three replicate gels, and the data were analyzed statistically using the R-package 0.5.13 software. In particular, two-tailed t-tests were used to determine whether the expression of a particular protein in an evolved clone was greater or less than 1 (i.e., the null expectation in the absence of any evolutionary change) using P < 0.05 as the criterion for statistical significance. Although our statistical analyses used data that were standardized relative to the total abundance integrated over all protein spots, for purposes of illustration we identify two proteins, AhpC and GapA, as "internal standards" because their relative abundance was nearly constant over all three genetic backgrounds and all three sets of gels.
Mass spectrometry peptide sequencing:
Protein spots were manually excised from Coomassie-blue-stained 2-D gels, oxidized with 7% H2O2, and subjected to in-gel tryptic digestion. For matrix-assisted laser desorption ionizationtime of flightmass spectrometry (MALDITOFMS), peptides were mixed with matrix solution [
-cyano-4-hydroxycinnamic acid at half saturation in 60% acetonitrile/0.1% trifluoroacetic acid (v/v)] and analyzed with a MALDITOF mass spectrometer (Autoflex, Bruker Daltonik, Bremen, Germany) in reflector/delayed extraction mode over a mass range of 04200 Da. Consecutive automatic searches against the Swissprot Trembl database were performed for each sample using version 1.9 of Mascot software. For nano-liquid chromatographytandem mass spectrometry (nano-LCTMS/MS), peptides were extracted with formic acid and acetonitrile and injected into a CapLC (Waters, Milford, MA) nanoLC system that was directly coupled to a QTOF Ultima mass spectrometer (Waters). MS and MS/MS data were acquired and processed automatically using MassLynx 4.0 software (Waters). Consecutive searches against the Swissprot Trembl database were performed for each sample using Mascot 1.9.
Electrophoresis and immunoblot analyses of proteins:
Cells were grown in triplicate in DM250 medium for the time indicated. Cell pellets were resuspended in lysis buffer (70 mM Tris, pH 7.4, 1 mM EDTA, 10% glycerol, 1 mM DTT) before sonication. The lysates were centrifuged at 10,000 x g for 30 min at 4°, and total protein concentrations were determined using the Bradford protein assay kit (Bio-Rad) and bovine serum albumin as a standard. Equal amounts of protein samples were analyzed by SDSpolyacrylamide gel electrophoresis. Prestained protein standards (Amersham, Buckinghamshire, UK) were used for molecular weight estimation. Immunoblot analyses of proteins electrotransferred (Bio-Rad) onto polyvinylidene difluoride membranes (Amersham Pharmacia) were performed with polyclonal antibodies raised against LamB (courtesy of A. Meinke, Intercell AG), and monoclonal antibodies raised against RpoA as an internal standard (courtesy of M. Cashel). The blots were developed using nitro blue tetrazolium/5-bromo-4-chloro-3-indolyl-phosphate (Sigma, St. Louis) systems.
Strain construction:
Each evolved malT allele was recombined into the chromosome of the ancestral clone using suicide plasmid pKO3, which has a temperature-sensitive replication origin (LINK et al. 1997), as described previously (CROZAT et al. 2005). Briefly, the evolved malT alleles were cloned into pKO3, and the resulting plasmids were electrotransformed into the ancestor clone. Integration of the evolved allele into the ancestral malT gene was selected by plating transformed cells on chloramphenicolLB agar and incubating at high temperature. Chloramphenicol-resistant cells were subsequently plated on sucrose-containing LB agar to select for plasmid loss, as the plasmid carries sacB, which renders cells sensitive to killing by sucrose. Sucrose-resistant and chloramphenicol-sensitive plasmid-free clones were then screened for the presence of the evolved malT alleles either by a PCRrestriction fragment length polymorphism (PCRRFLP) approach using PauI (Euromedex) to distinguish between ancestral and evolved alleles, in the case of the Ara 1 population, or by characterizing the length of the PCR product containing the malT ancestral or evolved allele, in the case of the Ara + 1 population. All constructed strains were also "deconstructed" such that the evolved alleles were replaced by their ancestral counterparts. Reversion of the relevant phenotypes (fitness, Western blots) was always confirmed in these deconstructed strains, indicating the absence of secondary mutations during the strain construction process. The same strategy was used to move the upregulated constitutive malT allele from BW3216 into the ancestor.
Fitness assays:
Isogenic ancestral strains with the introduced evolved malT alleles competed against the ancestor carrying the original malT allele and the opposite neutral Ara marker (LENSKI et al. 1991). Each pairwise competition was replicated sixfold, with competitions performed in the same medium as the evolution experiment for 6 days with 1:100 daily transfers to allow detection of small fitness effects. Fitness assays thus encompass all the same phases of population growth that occurred in the evolution experiment, including lag phase, exponential growth, and stationary phase (VASI et al. 1994). Immediately prior to each assay, the competitors were separately acclimated to the same regime and then transferred together from stationary-phase cultures into fresh medium (with 1:200 dilution each and thus 1:100 combined). Samples were taken immediately after mixing at day 0 and again after 6 days of competition to measure the abundance of the competitors; at both times, all the cells are in stationary phase (24 hr after the previous transfer). From the initial and final cell counts and the known dilution factors, we calculated the realized (net) population growth of each competitor. Fitness was then calculated as the ratio of the realized growth rates of the two strains during their direct competition, and t-tests were performed to evaluate whether the measured fitness values differed significantly from the null hypothetical value of 1.
Measurement of the mutation rate from Mal+ to Mal:
A LuriaDelbrück fluctuation test (LURIA and DELBRÜCK 1943) was performed to estimate the mutation rate from Mal+ to Mal using 60 replicate cultures. Independent cultures were each founded from a small number (
100) of cells of the Mal+ ancestor in flasks containing 10 ml of DM25. After 24 hr, each culture was diluted, several hundred cells were spread on tetrazoliummaltose indicator plates, and Mal+ (pink) and Mal (red) colonies were counted. These data yielded the total number of cells and the frequency of Mal mutants in each culture, from which the rate of mutation from Mal+ to Mal was estimated (LEA and COULSON 1949; MA et al. 1992).
Evolutionary dynamics of malT substitutions:
The approximate time of origin and the subsequent dynamics of the malT mutations that were substituted in each focal population were investigated using a PCRRFLP strategy with clones isolated from samples frozen at various generations during the long-term evolution experiment. For population Ara 1, numerous clones from generations 2500, 3000, 4000, and 5000 were subjected to PCR using primers ODS273 (5'-GTCGCTGCTGGAAGAGTCGC-3') and ODS304 (5'-ACCACGGGCCAGCGAGCATT-3'). The presence or absence of the evolved malT mutation was determined by PauI digestion of the PCR products. For population Ara + 1, numerous clones from generations 4000, 5000, 7000, and 10,000 were subjected to PCR using primers ODS291 (5'-CAGCCCCTGACGCTCAATCT-3') and ODS302 (5'-GCAGAATACCCACTCAGCCC-3'). The size of the PCR product was used to score the malT allele as either ancestral or evolved.
Evolutionary dynamics of the growth capacity on maltose:
Numerous clones were isolated from population Ara + 1 at generations 2000, 3000, 4000, 5000, and 7000 (for the last three time points, the same clones as those analyzed to ascertain the dynamics of the malT substitutions were used) and from population Ara 6 at generations 2000, 3000, 4000, 5000, 6000, 7000, and 9000. All clones were grown at 37° in DM medium supplemented with 250 µg/ml maltose to score their growth ability using the following criteria. Growth within 8 hr of incubation was interpreted as a Mal+ phenotype, whereas no growth even after 24 hr indicated the inability to use maltose as the sole carbon source (Mal). Also, growth within 24 hr but without any detectable growth after 8 hr was interpreted as the occurrence of suppressor mutations in those clones restoring growth (MalS; see RESULTS). In all cases, growth was characterized by a sustained increase in viable cell counts and optical density.
| RESULTS |
|---|
|
|
|---|
300400 protein spots, representing 610% of the entire coding capacity of the E. coli genome, and we observed similar numbers in our gels. Figure 1 shows a pair of representative gels comparing the ancestor and one evolved clone. Quantitative analysis of the three replicate sets of gels for all three clones was performed using the Melanie II software (Genebio), as described in MATERIALS AND METHODS. We first standardized the abundance of each protein spot relative to the total abundance of all spots on the same gel. We then expressed any evolved change in the expression of a given protein by calculating the ratio of its standardized abundance in an evolved clone to that in the ancestor, using the values obtained in the same gel set. The three replicate sets of gels thus gave three estimates of the extent of change in expression level for a given protein in each evolved clone, and these estimates were used to test whether a change was statistically significant. Table 1 provides the identity of all the protein spots whose relative abundance changed significantly in one or both evolved clones. It also includes two reference proteins whose abundances were quite constant among the ancestral and evolved clones and over all three replicate sets of gels. A total of 60 spots showed significant changes in one or both of the evolved clones. However, 12 proteins were each identified in two different spots, where the duplicate forms represent post-translational modifications or degradation products; in all 12 cases, the two different forms showed very similar evolutionary changes. Thus, 48 proteins showed significant changes in expression in one or both populations. Thirty-eight of these proteins changed in both independently evolved clones and, strikingly, all 38 had evolved in the same direction relative to the ancestral expression level, indicating strong evolutionary parallelism. Parallel changes across lineages are often indicative of adaptive evolution, and thus these 38 proteins are of particular interest. Seven parallel changes reflect increased expression of proteins in the evolved clones, including 5 proteins without any detectable expression in the ancestor. The other 31 parallel changes indicate reduced expression in the evolved clones, including 7 proteins with no measurable expression in either evolved clone. Two protein spots that differ between the ancestor and the evolved clones confirm previous mutations identified using insertion sequence elements as molecular markers in these long-term populations (PAPADOPOULOS et al. 1999). In particular, IS-associated deletions of the rbs operon, including its promoter region, were found in all 12 populations (COOPER et al. 2001), and the pykF gene was inactivated by an IS transposition in population Ara 1 (SCHNEIDER et al. 2000). Here, we observed the parallel loss of RbsB, the D-ribose-binding periplasmic protein, in both focal populations, consistent with these known deletions. By contrast, PykF (pyruvate kinase I) disappeared only in population Ara 1, with no significant change in expression in population Ara + 1 (Table 1). These same mutations also explain why parallel changes at the transcription level were seen in these same clones for the rbs operon, but not for pykF (COOPER et al. 2003).
|
|
10 and 18% of the E. coli genome, respectively; hence, these two categories appear to be substantially overrepresented among those found to have undergone parallel evolutionary changes in protein expression.
Comparison of protein and transcription profiles:
The same set of clones used in this study were previously analyzed using macroarrays to measure changes in their global transcription profiles (COOPER et al. 2003), allowing us to compare the changes found by the two methodologies. To the best of our knowledge, this is the first such comparison between mRNA and protein levels in an evolutionary context, where the magnitude of expression changes is often quite subtle. We should also emphasize, however, that the culture conditions and physiological state of cells were somewhat different in the two studies: the global mRNA levels were measured in the exact same medium as in the evolution experiment (DM25) and during exponential growth, whereas we prepared protein extracts from stationary-phase cells grown in DM250. This methodological difference reflected the need for larger quantities of protein to perform two-dimensional protein electrophoresis and to quantify a reasonable number of proteins. Nevertheless, the proteomic data support the transcription profiling in four important respects.
Moreover, when we examine in greater detail the lists of genes with parallel changes, important commonalities emerge between mRNA and protein expression at two regulatory levels. First, although the genes that were identified by parallel expression changes using the two methodologies were different, most in fact belong to the same global regulatory network. In particular, approximately half (17/35) of the parallel expression changes for functionally characterized proteins detected by protein profiling (Table 1) correspond to genes regulated by guanosine tetraphosphate (ppGpp) and cAMPcAMP receptor protein (cAMPCRP), which is itself under the control of ppGpp (JOHANSSON et al. 2000). In the macroarray study, also about half of the functionally characterized genes with parallel expression changes were regulated by ppGpp and cAMPCRP (COOPER et al. 2003). This pattern led to the discovery of beneficial mutations in the spoT gene, which encodes a protein involved in synthesis and degradation of ppGpp, the molecular effector of the stringent response, which is the global regulatory network involved in adaptation to nutritional stresses (CASHEL et al. 1996). Thus, despite little strict identity in the gene targets identified using macroarrays and proteomics, strong correspondence was revealed by the shared regulatory network overlying many of the parallel expression changes seen after 20,000 generations of evolution. Therefore, both parallel changes observed after 20,000 generations of experimental evolution using mRNA and protein profiling methodologies, although measured under slightly different environmental conditions, reflect the same underlying mutational basis of adaptation. However, this evolved change in the regulation of the stringent response evidently caused different subsets of genes in this network to be affected during exponential growth in DM25 medium vs. stationary phase in DM250.
Second, at the lower level of genes and operons, two parallel expression changes were detected using both methodologies. Specifically, expression of both the ribose operon and the maltose regulon decreased after 20,000 generations of experimental evolution (Table 1; see COOPER et al. 2003 for mRNA data). These changes may therefore reflect beneficial mutations in these regulons. Indeed, deletions of the rbs operon were found previously in all 12 of the evolved populations, thus explaining the loss of expression of those genes, and these mutations were also shown to be beneficial by competitions between otherwise isogenic strains (COOPER et al. 2001). Given the existence of this information for the ribose operon, we therefore further investigated the maltose regulon in both focal populations (see Parallel substitutions in malT, the transcriptional activator of the maltose regulon).
Also, six parallel changes observed in the protein expression profiles corresponded to significant transcriptional changes in only one of the two focal populations. In all cases, the changes in protein and transcriptional expression levels were in the same direction compared to the ancestral values. These differences might, in principle, indicate mutational changes in the coding sequence or post-transcriptional regulation of the relevant genes in the population where no transcriptional differences were observed. Alternatively, occasional discrepancies might reflect the vagaries of statistical analyses. In support of the latter explanation, we note that the transcription level differed significantly between the two evolved populations in only one of these six instances (nfnB: t = 3.0483, d.f. = 6, two-tailed P = 0.0226 using data from http://myxo.css.msu.edu/ecoli/arrays).
Parallel substitutions in malT, the transcriptional activator of the maltose regulon:
Sequencing the malT gene in both evolved clones from 20,000 generations revealed a 25-bp internal deletion in population Ara + 1, leading to a truncated protein and a point mutation in population Ara 1 (Figure 2). To investigate further the extent of genetic parallelism, we sequenced the malT gene in one evolved clone isolated at 20,000 generations from each of the other 10 long-term populations. Nine more mutations were found in this gene in 6 of the other populations. Therefore, including the 2 focal populations, a total of 11 mutations were found in malT spread among 8 of the 12 populations (Figure 2). Such strong parallelism suggests that these mutations have a beneficial effect on fitness in the glucose-limited environment of the evolution experiment, and this hypothesis will be tested in a later section. Two mutations were found in population Ara 4, 3 in population Ara + 6, and 1 in each of six other populations. Four of the 12 populations became mutators during the evolution experiment (SNIEGOWSKI et al. 1997; COOPER and LENSKI 2000), including both populations that substituted multiple mutations in malT. Populations Ara + 1 and Ara + 2 acquired small internal deletions, while all other substitutions in malT were nonsynonymous point mutations. The MalT protein contains four domains, and the various point mutations affect all four domains in different evolved populations: 2 in DT1, 2 in DT2, 4 in DT3, and 1 in DT4 (Figure 2). The diversity of affected domains implies that the hypothesized beneficial affect does not depend on the precise location of the mutation, and it is also consistent with mutations that reduce MalT activity. Because MalT is a transcriptional activator, we also predict that the populations that evolved mutations in malT have reduced growth on maltose, which we test in the next section.
|
|
|
However, there were three exceptions to the overall pattern summarized above. First, the evolved clone from population Ara 1, as well as the ancestral strain with the corresponding evolved malT allele, showed the ancestral phenotype when grown in DM250 maltose (Figure 3A and Table 2). However, a strong decrease of both the LamB protein level of the evolved clone (Figure 3B) and the lamB promoter activity (Table 2) was detected in glucose minimal medium. Therefore, this particular evolved mutation appears to affect the basal activity of the MalT transcriptional activator during growth on glucose, leaving unchanged the induction level during growth on maltose. Second, the evolved clone sampled from population Ara 4 grew perfectly well in maltose minimal medium and had ancestral levels of LamB in both glucose and maltose minimal media, despite having two nonsynonymous point mutations in malT. These mutations evidently do not affect the activity of MalT. Third, the evolved clone from population Ara + 3 was unable to grow on maltose and revealed a complete loss of the LamB protein (Figure 3B). However, this clone has no mutation anywhere in malT, and we also found no mutations in lamB, suggesting the presence of some (as yet unidentified) mutation in the promoter region of the malKlamBmalM operon or in another regulator of the maltose regulon.
These few exceptions notwithstanding, the data demonstrate that the parallel genetic changes in malT were linked to parallel phenotypic losses of expression of the maltose regulon during the 20,000 generations of evolution in a glucose-limiting medium.
Beneficial fitness effects of the malT mutations:
The isogenic ancestral strains with the Ara + 1 or Ara 1 evolved malT allele were each competed against the ancestor carrying the wild-type malT allele under the same culture conditions that prevailed during the long-term evolution experiment (Figure 4). In each case, the genotype of interest competed against a variant of the ancestor bearing a neutral marker (arabinose utilization) that allowed the strains to be distinguished. As shown previously (LENSKI et al. 1991), this marker had no discernible effect on fitness (H0 = 1, n = 6, ts = 0.2470, P = 0.8147). However, each evolved allele was beneficial, with fitness relative to the ancestral malT allele of 1.0146 for the Ara + 1 mutation (H0 = 1, n = 6, ts = 5.5603, P = 0.0026) and 1.0044 for the Ara 1 mutation (H0 = 1, n = 6, ts = 5.8080, P = 0.0021). Although these fitness increments are small, both are highly significant when measured over several days in replicated experiments. Moreover, these gains were eliminated when each evolved allele was replaced again by the ancestral allele (data not shown), confirming that the gains were caused by the malT mutations rather than by some hypothetical secondary mutations that might have accidentally occurred during the strain constructions. Also, when assayed in an environment where glucose was replaced by maltose, the Ara 1 malT evolved allele was neutral (data not shown). The Ara + 1 evolved allele could not be assayed in this second environment because it confers a Mal phenotype and, therefore, its fitness is effectively zero.
|
Like the malT mutations, the rbs deletions also conferred only a small fitness benefit (COOPER et al. 2001). This small benefit acted in concert with an unusually high mutation rate from Rbs+ to Rbs, leading to the rapid substitution of the rbs mutations in all 12 populations. To investigate whether a high mutation rate might also have influenced the parallel substitutions in malT, we performed a fluctuation test with 60 independent cultures to measure the ancestor's rate of mutation from Mal+ to Mal. To analyze the data, we employed a program written by P. J. Gerrish (Los Alamos National Laboratory) that generates expected LuriaDelbrück distributions according to the method of MA et al. (1992) and then uses maximum likelihood to refine the mutation rate estimate. This procedure gave a mutation rate of 4.2 x 106/cell/generation. The entire maltose regulon is
30,000 bp. Excluding third positions (synonymous) and other sites where nonsynonymous mutations would not cause a Mal phenotype leaves
10,000 bp that could be mutated to produce a Mal phenotype. The resulting mutation rate from Mal+ to Mal is thus
4 x 1010/bp, which is close to typical mutation rates estimated previously for E. coli (DRAKE 1991; LENSKI et al. 2003). Therefore, locus-specific hypermutability does not contribute to the parallel substitutions observed in malT during the long-term evolution experiment. Also, given the small fitness effects of the evolved malT alleles relative to some other beneficial mutations observed in the long-term experiment, some of which are
10% (LENSKI and TRAVISANO 1994; COOPER et al. 2003; CROZAT et al. 2005), we expect that the malT mutations were substituted either fairly late in the evolution experiment or by hitchhiking with other beneficial mutations of stronger effect. We investigate these dynamics in the next section.
Genetic and phenotypic evolutionary dynamics:
To examine in more detail the temporal dynamics of the appearance and substitution of the evolved malT alleles in populations Ara 1 and Ara + 1, we performed PCRRFLP experiments (see MATERIALS AND METHODS) using 63 clones sampled from population Ara +1 and 70 clones isolated from population Ara 1. In population Ara + 1, the malT mutation was present in 0/13 clones at 4000 generations, in 14/18 clones at 5000 generations, and in 32/32 clones from 7000 and 10,000 generations. In population Ara 1, the evolved malT allele was present in 0/36 clones at 2500 and 3000 generations, in 15/16 clones at 4000 generations, and in 18/18 clones at 5000 generations. Therefore, the evolved malT alleles contributed to the somewhat slower fitness increase that followed the very rapid fitness gains seen over the first 2000 generations of the experiment (LENSKI and TRAVISANO 1994).
In population Ara + 1, clones isolated at 20,000 generations and carrying the identified malT mutation were unable to grow on maltose minimal medium (Figure 3A). We would therefore expect to observe two different phenotypic subpopulations during the evolution experiment: one able to grow on maltose and retaining the ancestral malT allele and the other unable to grow on maltose and carrying the evolved malT allele. In fact, on the basis of the ability to grow on maltose minimal medium, we identified three different subpopulations in population Ara + 1 that coexisted between 4000 and 7000 generations (Table 3). One type grew on maltose within 8 hr at 37° and was therefore phenotypically Mal+. A second type was unable to grow on maltose even after 24 hr, and it was therefore phenotypically Mal. The third type showed no growth on maltose after 8 hr but strong growth after 24 hr, which we designated as MalS. We then sought to understand the molecular mechanism underlying the MalS phenotype of the third subpopulation. The malT gene of the MalS clone sampled at 5000 generations (Table 3) was sequenced. We found a nonsynonymous point mutation leading to an I138F change in domain DT1 of the MalT protein. This mutation may explain the absence of growth in the maltose medium after 8 hr. When left for 24 hr, however, this clone grew perfectly well. This culture was subsequently plated on LB agar and four isolated colonies were chosen at random. All four now showed growth on maltose minimal medium after 8 hr of growth, strongly suggesting that they had acquired suppressor mutations of the original I138F mutant allele. Sequencing malT revealed no further mutation, however, and therefore implies extragenic suppression. Therefore, this third category of evolved clones seems to have a distinct class of malT mutations, leading to a suppressible deficiency of maltose utilization. In any case, this third MalS category, together with the ancestral Mal+ category, eventually disappeared and was replaced by the nonsuppressible Mal clones.
|
| DISCUSSION |
|---|
|
|
|---|
As noted, slightly different growth conditions were used for the transcription (COOPER et al. 2003) and proteomic analyses of the evolved clones. In the case of the transcript study, cells were collected in exponential phase during growth in DM25 (25 µg glucose/ml), whereas the protein profiles were performed with cells in stationary phase from cultures in DM250 (250 µg glucose/ml) to obtain a sufficient yield of proteins for the two-dimensional protein electrophoresis. One might therefore be concerned that the two sets of data are not comparable. However, our comparisons emphasize the occurrence of beneficial mutations that affected two levels of gene regulation, rather than the precise identity of the affected genes. Thus, differences in the physiological state of the bacteria probably explain why only two identical changes were observed when we compared expression at the mRNA and protein levels in the same evolved clones. However, these changes allowed identification of two beneficial mutations in the focal populations: one already known that affects the rbs operon for ribose catabolism (COOPER et al. 2001) and one discovered in this study affecting the malT transcriptional regulator of the maltose operons (see below). The fact that the same changes were identified under the different culture conditions may reflect the specific effects of mutations that govern expression at the operon level.
More generally, although the two methodologies detected parallel expression changes in different genes, about half of them belong to the stringent response pathway. Therefore, both methodologies revealed the same overarching changes in a global regulatory network. These high-level changes reflect mutational events that occurred during the long-term evolution experiment in genes encoding global regulators, which control the architecture of the cell's regulatory network. In particular, changes in gene expression detected by both transcription and proteomic analyses reflect the presence of the beneficial mutations that were found in spoT (COOPER et al. 2003). Thus, parallel changes in transcription and proteomic profiles that occurred during experimental evolution reveal bacterial adaptation through modifications that occurred at multiple levels of genetic organization, ranging from the level of operons up to global regulators of genetic networks. These parallel changes in expression allowed the discovery of beneficial mutations that show a high degree of molecular parallelism, as mutations in the same three genes (rbs, malT, spoT) were found in at least 8 of the 12 independently evolved populations (Figure 2; COOPER et al. 2001, 2003). By contrast, no parallel mutations were found when randomly chosen genes were sequenced (LENSKI et al. 2003). These parallel genetic changes, discovered by parallel changes in gene expression, thus highlight important targets of selection and adaptation in this experiment. Genetic parallelism has also been reported in other evolution experiments using bacteriophages (WICHMAN et al. 1999) and bacteria (TREVES et al. 1998), although these other studies did not examine resulting changes in global regulatory networks.
Despite the similarities observed in the transcription and protein profiles, the maltose regulon shows some interesting exceptions. Transcription profiling found significant and parallel changes in three genes in this regulon (malK, malP, and malQ). However, the same transcription profiles reveal no significant changes in either lamB or malT in either focal population (four two-tailed t-tests in all, each with 6 d.f., all P > 0.1, using data from COOPER et al. 2003 available at http://myxo.css.msu.edu/ecoli/arrays). The transcriptional profiles also show no effect of an evolved spoT allele on expression of either lamB or malT (two t-tests, each with 6 d.f., both P > 0.1, using the same data source as above). Thus, the proteomic data resolved certain changes, such as the parallel reductions in LamB, that were not found by the transcriptional study; these changes in LamB expression were unequivocally confirmed by finding the responsible mutations in malT. Another difference between the two studies is that the transcriptional analysis found significant parallel changes in
1.5% of all genes (COOPER et al. 2003), whereas in our proteomic study
10% of the 300400 proteins detected on the gels showed significant parallel changes. This difference may reflect post-transcriptional levels of gene regulation or, alternatively, it could reflect differences in the statistical power of the two studies and their methodologies.
A new set of beneficial mutations in this long-term evolution experiment was discovered that affected the malT gene. A total of 11 mutations were found in eight different populations. Measurement of the mutation rate from Mal+ to Mal in the ancestor indicated a mutation rate typical of other estimates for E. coli (DRAKE 1991). Five mutations occurred in populations that had retained the low ancestral mutation rate, with 1 mutation in each of five different populations. Two of these mutations were small internal deletions with no obvious direct sequence repeats at the extremities. The 6 other mutations occurred in populations that had evolved defects in their DNA repair pathways, including 1 in population Ara 2, 2 in Ara 4, and 3 in Ara + 6. Populations Ara 2 and Ara 4 evolved defects in methyl-directed mismatch repair (SNIEGOWSKI et al. 1997; SHAVER et al. 2002); all three mutations in these populations were C·G to T·A transitions, which are characteristic of this class of mutators. Population Ara + 6 evolved a mutator phenotype that was previously suggested to involve a defect in mutT on the basis of the finding that six mutations found in randomly screened genes were all A·T to C·G transversions (LENSKI et al. 2003). All three malT mutations found here in Ara + 6 were of the same class, strongly supporting the previous suggestion of a mutT defect.
The mutations found in malT in the various populations were located throughout the gene, affecting the several domains of the MalT activator protein. The MalT protein belongs to a family of bacterial transactivators characterized by high molecular mass, usually >90 kDa (DANOT 2001). The MalT protein has four domains (Figure 2): DT1 (residues 1241), DT2 (residues 250436), DT3 (residues 437806), and DT4 (residues 807901) (DANOT 2001; RICHET et al. 2005). Numerous signals control the activity of the MalT transcriptional activator. Activation of MalT and its subsequent binding to target promoters requires the formation of a high-order MalT oligomer, induced by the maltotriose inducer and ATP. By contrast, the monomeric form of MalT, which is inactive, is stabilized by three different proteins: MalK, the ABC subunit of the maltose transporter; MalY, a cytoplasmic protein of unknown function that has a ßC-S lyase activity; and Aes, a protein with sequence similarities to lipases that has esterase activity. The DT1 domain binds ATP (with a Walker A motif from residues 42 to 47) and the repressor proteins Aes and MalK, while DT1 and DT2 are involved in recognition of the repressor protein MalY; DT2 and DT3 are required together for strong maltotriose binding, while DT3 alone binds it with low affinity; and DT4 carries the DNA-binding site (DANOT 2001; JOLY et al. 2002; SCHLEGEL et al. 2002).
The malT mutations in the eight evolved populations produced a Mal phenotype in six cases (Ara + 1, Ara + 2, Ara + 6, Ara 2, Ara 3, and Ara 6). In four cases, a likely mechanism can be found: populations Ara + 1 and Ara + 2 have malT deletions leading to truncated proteins without the DT2DT4 domains and without domain DT4, respectively; the mutation in population Ara 3 eliminates the threonine residue 46, which lies in the Walker A motif and is involved in ATP binding; and the mutation in population Ara 2 affects the DT4 domain and may interfere with DNA-binding activity, the interaction with RNA polymerase, or both. In the two other cases, possible explanations are more speculative. The mutation in population Ara 6 affects the DT2 domain and might interfere with the transition from the monomeric to the oligomeric forms or, alternatively, with the overall conformation of MalT. A similar effect could also apply to population Ara + 6, which has three mutations, one in DT2 and two in DT3. In two populations (Ara 1 and Ara 4), the malT mutations did not lead to a Mal phenotype. Despite having a Mal+ phenotype that is indistinguishable from the ancestor, population Ara 4 has two mutations in malT, one in DT1 and the other in DT3. Population Ara 1 has one mutation in the DT3 domain leading to reduced activity of MalT in minimal glucose medium, although the induction level is fully conserved in minimal maltose medium. The modified residue (R455) may therefore be involved in the affinity of maltotriose for the MalT protein. The crystal structure of DT3 has been solved and reveals eight copies of a two-helix bundle motif arranged in a right-hand superhelical fold encompassing
80% of DT3 (STEEGBORN et al. 2001). The structure also indicates interaction between the DT3 domains, which may explain the MalT oligomerization. Two residues are characteristic for the DT3 motif, G541 and A547, and they are well conserved in the different helices of the structure. One of the mutations found in population Ara 4 affects the A547 residue, although the MalT activity is not measurably affected. More specific studies of these mutations of the complex DT3 domain might give more information about its functioning.
Most of the observed mutations in malT caused loss of the ability to grow on maltose as a sole carbon source, a trend consistent with a previously reported tendency toward ecological specialization in these populations (COOPER and LENSKI 2000). Such specialization can occur by two population-genetic processes: mutation accumulation, whereby neutral mutations are substituted in genes that are not maintained by selection, and antagonistic pleiotropy, where mutations substituted because they are beneficial in the selective environment have detrimental side effects in other environments (FUTUYMA and MORENO 1988; ROSE 1991; COOPER and LENSKI 2000). We constructed isogenic strains with the ancestral and evolved malT alleles, and we performed competitions that show that the evolved malT alleles are beneficial in the glucose medium in which they evolved. This result provides a direct demonstration of the role of antagonistic pleiotropy in the ecological specialization of these populations, as did previous experiments on the parallel losses of the ribosecatabolic function encoded by the rbs operon (COOPER et al. 2001). The benefit associated with the loss of MalT activity may reflect, at least in part, the elimination of unneeded functions costly to fitness in the glucose medium (GRAÑA and ACERENZA 2001; see also DEKEL and ALON 2005). Alternatively, the benefit caused by the loss of MalT activity might be more complex, involving, for example, regulatory feedbacks or metabolic intermediates (e.g., DYKHUIZEN 1978). Using the Melanie II software, we estimate that the LamB spot represents
0.13% of the total protein on the ancestral gels, while the two MalE spots together represent
0.48%. However, these are only 2 of the 14 proteins in the maltose operons; the others were not detected on our gels, nor were hundreds of other proteins. Therefore, we are not in a position to say whether the 1.5% fitness gain that resulted from the complete loss of expression of the maltose operons in population Ara + 1 is fully explained by energetic savings. Moreover, it has been shown that MalT is a pivotal component of an elaborate regulatory network that connects several metabolic pathways (BOOS and BÖHM 2000). As noted before, the DT1 and DT3 domains of MalT interact directly with proteins including MalK, MalY, and Aes, whose functions are not fully understood (RICHET et al. 2005). In any case, the benefit of the malT mutations seems to be related to reduced expression of the maltose regulon; and the deliberate introduction of a malT constitutive mutation, which caused high expression of the maltose regulon even in glucose medium, was detrimental in the glucose-limited environment.
Mutations in malT were also found in another evolution experiment performed in a very different environment, with E. coli propagated for several hundred generations in glucose-limited chemostats (NOTLEY-MCROBB and FERENCI 1999). In contrast to the serial transfer, or seasonal, environment in our experiments (with lag, exponential, and stationary phases each day), bacteria in chemostats experience constant physiological conditions with very low glucose concentration and high cell density. Under the chemostat conditions, malT mutations evolved that caused constitutive expression of the maltose regulon, which enhanced fitness in the severely glucose-limited environment (NOTLEY-MCROBB and FERENCI 1999). These mutations upregulated the lamB gene, leading to increased expression of this high-affinity glucose transport pathway through the outer membrane maltoporin. Increased expression of LamB was also found by proteomic profiling of the ancestor and evolved clones in another chemostat evolution experiment using E. coli, although the responsible mutation was not identified (KURLANDZKA et al. 1991). By contrast, the loss of maltose regulon expression was shown to be beneficial under the seasonal conditions in our evolution experiment. As a further confirmation of this difference, introduction of a chemostat-evolved malT constitutive mutation into our ancestral strain strongly reduced fitness in the seasonal environment used in our experiments. This difference also indicates that, under these conditions, glucose uptake involves porins other than LamB. The best candidate is OmpF because the ancestral E. coli B strain lacks OmpC (SCHNEIDER et al. 2002), although other porins like OmpN might also be involved (PRILIPOV et al. 1998). In the chemostat environment, increased expression of the OmpC and OmpF porins also evolved along with upregulation of LamB (KURLANDZKA et al. 1991); by contrast, we found no significant changes in OmpF expression after 20,000 generations in the seasonal environment (data not shown). The following factors might explain the opposing selection on the malT gene and the LamB expression in these two environments. In chemostats, cells can draw down the concentration of limiting glucose to very low levels, at which point porins may limit glucose uptake. Although the LamB porin is specifically identified as a maltose porin, it also allows glucose molecules to diffuse across the outer membrane because glucose is smaller than maltose (LUCKEY and NIKAIDO 1980). By contrast, the serial-transfer regime produces boombust cycles of glucose availability rather than perpetual scarcity, and so precise regulation may be more important than maximizing expression of total porins. The malT gene regulates lamB, and different mutations prevailed in malT that led to increased or reduced expression of LamB in the chemostat and the serial-transfer regimes, respectively.
The dynamics of variation in the Mal phenotype were examined more closely in three of the long-term populations in our study. In two of them, extensive phenotypic polymorphism was detected with at least three subpopulations coexisting for several thousand generations: one Mal+, one Mal, and one suppressible MalS. In one population, we showed that at least five different malT alleles were present. This finding implies clonal interference, in which subpopulations that carry different beneficial