Dispensability of Escherichia coli’s latent pathways

Edited by José N. Onuchic, University of California San Diego, La Jolla, CA, and approved December 27, 2010 (received for review August 13, 2010)
February 7, 2011
108 (8) 3124-3129

Abstract

Gene-knockout experiments on single-cell organisms have established that expression of a substantial fraction of genes is not needed for optimal growth. This problem acquired a new dimension with the recent discovery that environmental and genetic perturbations of the bacterium Escherichia coli are followed by the temporary activation of a large number of latent metabolic pathways, which suggests the hypothesis that temporarily activated reactions impact growth and hence facilitate adaptation in the presence of perturbations. Here, we test this hypothesis computationally and find, surprisingly, that the availability of latent pathways consistently offers no growth advantage and tends, in fact, to inhibit growth after genetic perturbations. This is shown to be true even for latent pathways with a known function in alternate conditions, thus extending the significance of this adverse effect beyond apparently nonessential genes. These findings raise the possibility that latent pathway activation is in fact derivative of another, potentially suboptimal, adaptive response.
Living cells are surprisingly robust against mutations and, in particular, against gene knockouts (15). The origin of mutational robustness—whether it is a directly evolved trait or a byproduct of evolutionary history—remains debatable (6). In either case, metabolic network analysis shows that the nonessentiality of enzymes and associated genes is largely due to the inactivity of the corresponding metabolic reactions under laboratory conditions (79). This leaves environmental robustness as the natural candidate to explain gene nonessentiality. Yet, apart from chemical stress-based assays (10), studies designed to test whether nonessential genes become essential under different conditions have failed to identify a phenotype for more than a small fraction of additional genes (11). A recent groundbreaking study has shown, however, that a large fraction of reactions not active under standard laboratory conditions become transiently active after a genetic or environmental perturbation (12, 13). Why? The prevailing interpretation has been that the transient activation of such latent pathways facilitates adaptation to new conditions, thereby attributing function to genes that have been classified as dispensable for the lack of phenotype in steady-state experiments. This is naturally formulated as the hypothesis that latent pathways have a positive impact on postperturbation growth (cellular reproduction), which is a measure of competitive advantage with a strong empirical basis (13, 10, 11, 13). Even for genes with known functions under different conditions, this hypothesis is appealing as it suggests the possibility of an alternate phenotype that would not be detected in traditional high-throughput screens of knockout mutants (14).
Here, we test this hypothesis using the most complete in silico reconstruction of the metabolic network of Escherichia coli K-12 MG1655 (15, 16) and perturbations caused by single-gene knockouts. The response of the metabolic network to knockout perturbations is modeled using both model-independent analysis and the two most accepted phenomenological approaches, minimization of metabolic adjustment (MOMA) (17) and regulatory on/off minimization (ROOM) (18) (Materials and Methods). Starting from a growth-maximizing state determined by flux balance analysis (FBA) (19), we compare the early postperturbation growth rate (Materials and Methods) of the original organism with that of a modified organism in which the latent reactions have been disabled. We consider glucose minimal medium and gene knockouts that necessarily change the original metabolic flux distribution but that nonetheless are compatible with nonzero growth according to FBA. There are 52 enzyme-coding genes associated with 97 metabolic reactions in the reconstructed network that satisfy this condition. We systematically predict the impact of latent pathway activation on growth rate following perturbations caused by the knockouts of each of these genes.

Results

Phenomenological Analysis.

Fig. 1 illustrates the essence of our approach. In an optimal growth state, as observed for E. coli after adaptive evolution in fixed environmental conditions (20), many metabolic reactions are inactive (8, 12). Shortly after a perturbation, however, both the original and modified strain operate in a suboptimal growth state (17, 18), which we model using MOMA or ROOM (Fig. 1A). In silico (8) and laboratory (12) experiments show that this change is accompanied by a burst of reaction activity (Fig. 1B), reflecting regulatory changes that locally reroute fluxes in the short-term metabolic response to the perturbation (21, 22). If the perturbation is nonlethal, the perturbed organisms will undergo adaptive evolution—adopting beneficial mutations over longer timescales (23, 24) to achieve a new optimal growth state, which can be predicted by FBA (18, 25).
Fig. 1.
Hypothetical growth impact of latent pathways under a knockout perturbation. (A and B) The solid lines indicate the drop in growth rate (A) and the burst in latent reaction activation (B) that follow a gene knockout. The dashed lines indicate the possible behavior for a modified strain in which the original latent pathways have been removed: The postperturbation growth rate may decrease (red), remain the same (green), or increase (blue), and a smaller number of new latent pathways may be created. If the postperturbation growth rate is nonzero, the mutant is viable, and after a period of adaptive evolution it will converge to a new optimal growth state. This postperturbation optimal growth state is identical for both the modified and unmodified strain, and it is characterized by a reduced number of active reactions relative to the suboptimal states.
For the perturbations considered in this study, the average and standard deviation of the number of transiently active reactions is 291 ± 83 and 120 ± 59 for MOMA and ROOM, respectively. This difference is expected because ROOM, by design, favors a small number of significant flux changes, which reflects the fact that ROOM may model a later stage in the adaptive response pictured in Fig. 1B than MOMA (18). These numbers should be compared with the number of active reactions in the corresponding growth-maximizing states, which is 385 in the wild type and remains 385 on average in the knockout mutants, with an average of ≈98% overlap between the two sets for the simplex solutions we consider; these numbers are representative for other choices of optima (26, 27) within our models (Sensitivity to Alternate Optima section in SI Text). We emphasize that because the modified organism lacks only transiently active (latent) metabolic reactions, the optimal steady states are identical to those of the unmodified strain both before and after the perturbation. Our question is then whether the early postperturbation growth rate (before adaptive evolution) will be lower (red), remain equal (green), or become higher (blue) when these latent pathways are not present (Fig. 1A).
Our principal result is that the strains lacking latent pathways systematically show equal or better adaptation as determined by growth within our in silico model, regardless of the approach used to model the organisms’ response to perturbations. We assume that the organisms are in an optimal growth state both before and long after the perturbation, which accounts for cases that have received much attention in the literature (8, 12, 13), but we note that our conclusions remain equally valid when this assumption is relaxed (Effects of Nonoptimal Reference States section in SI Text). Table 1 summarizes the results for all 52 single-gene knockouts considered in our study. Relative to the unmodified strain in the suboptimal state following a gene knockout, the strain lacking the latent pathways exhibits equal or improved growth in 100% of the cases according to MOMA (Fig. 2A) and in 98% of the cases according to ROOM (Fig. 2B). Across all knockout perturbations, this corresponds to an average change of +8.5 and +1.2% of the optimal wild-type growth rate, respectively. With either approach, a large fraction of mutants (50% for MOMA, 77% for ROOM) show a negligible difference in growth rate (within ± 1% of the wild-type growth rate) when the latent pathways are disabled. If only cases exhibiting significant changes are considered, the removal of latent pathways consistently increases the early postperturbation growth rate for all mutants, by an average of +16.9% for MOMA (Fig. 2A) and +4.6% for ROOM (Fig. 2B). Thus, for almost all knockouts, the strain lacking latent pathways is predicted to suffer no competitive disadvantage compared to the latent pathway-enabled strain. On the contrary, we predict that it more often shows improved growth in the suboptimal regime shortly after the perturbation.
Fig. 2.
Predicted adaptive impact of latent pathways in E. coli under single-gene knockout perturbations. Each row indicates the difference in early postperturbation growth rate between the strains with and without latent pathways for the knockout perturbation indicated along the vertical axis, when: (A and B) the modified organism lacks all reactions that are transiently active under the corresponding knockout perturbation; (C and D) the modified organism lacks the set of simultaneously nonessential latent reactions, which is the subset of such latent reactions that are not needed for optimal growth in any of the other 51 single-gene knockout mutants. For both MOMA (A and C) and ROOM (B and D), only cases showing a significant change in growth rate (> 1% of the wild-type optimal growth rate) are shown. Arrows indicate the extent of the increase in growth rate when the latent pathways are removed. The shaded background indicates the theoretical maximum growth rate for the mutant strain predicted by FBA. All growth rates are normalized by the optimal wild-type growth rate. The statistics are summarized in the left and center columns of Table 1 for MOMA and ROOM, respectively.
Table 1.
Summary of the predicted impact of latent pathways in E. coli K-12 MG1655 under single-gene knockout perturbations
 MOMAROOMRandom
Latent reactions for individual perturbations:
 All knockout perturbations+8.5 (12.5)%+1.2 (2.8)%+70.0 (10.8)%
 Significant differences*+16.9 (13.2)%+4.6 (4.5)%+70.0 (10.8)%
 Number of reactions removed291 (83)120 (59)1,019 (5)
Simultaneously nonessential latent reactions:
 All knockout perturbations+7.4 (10.9)%+1.2 (2.8)% 
 Significant differences+14.8 (11.4)%+5.2 (4.6)% 
 Number of reactions removed258 (79)109 (53) 

Each column corresponds to the average (and standard deviation) of the difference in growth rate between the latent pathway-disabled and wild-type organisms for 52 different single-gene knockouts. The differences are expressed as percentages of the optimal wild-type growth rate. For all cases, the average postperturbation growth rate is higher for the strain without latent pathways.

*By more than 1% of the wild-type growth rate: 50% (MOMA), 23% (ROOM), and 100% (random) of the perturbations.
By more than 1% of the wild-type growth rate: 50% (MOMA) and 19% (ROOM) of the perturbations.
The set of transiently active (latent) reactions depends on the perturbation. Even though we predict that, in general, the removal of one of these 52 sets increases growth under the corresponding knockout, the same removal may in principle have an adverse effect under a different knockout. To address this possibility, we first note that the sets of simultaneously nonessential latent reactions remain sizeable: an average of 258 ± 79 for MOMA and 109 ± 53 for ROOM. For a given knockout perturbation, this set is defined as the subset of the original latent reactions that are inactive in the optimal growth state we consider for each of the other 51 knockout mutants. These reactions are therefore dispensable for optimal growth, both in the wild type and all 52 single gene knockout strains we consider, but are nonetheless transiently activated in response to the given perturbation. We have tested the impact of disabling these reduced sets of latent reactions under the corresponding knockouts (Materials and Methods). As shown in Fig. 2 C and D, the presence of these simultaneously nonessential latent reactions has the same trend of inhibiting growth adaptation as found for the full sets of transiently activated reactions.
The possibility that latent pathway activation enhances cells’ viability following a perturbation is a compelling hypothesis, as it would reveal functions for genes that have thus far eluded high-throughput phenotype screens. We note, however, that our analysis also predicts the transient activation of pathways that do, in fact, have known phenotypes under different conditions. For example, activation of the glyoxylate shunt is known to mitigate growth defects of E. coli on glucose following phosphofructokinase mutations (28). Because we focus on single knockouts, the genes affected by such mutations, pfkA and pfkB, are not among the perturbations we consider. Nonetheless, out of the 52 unrelated knockout perturbations in our study, our models show the transient activation of the glyoxylate shunt in response to 25 of them according to MOMA and 7 according to ROOM. The same phenomenon can be observed for reactions that are essential under different environmental conditions but inactive in the aerobic glucose medium employed in our simulations. Pyruvate formate lyase is required for anaerobic growth in xylose medium according to experiments (29) and our models, but is transiently active for 2 (MOMA) and 18 (ROOM) of the 52 genetic perturbations in this study. This interesting effect—the nonspecific use of pathways under an array of perturbations quite different from the conditions under which they have an observed phenotype—indicates that the phenomenon of latent pathway activation extends beyond the set of apparently nonessential genes.

Model-Independent Analysis.

The analysis above shows that the availability of latent pathways inhibits growth in the short term after a genetic perturbation. But how sensitive are these conclusions to the models we used to simulate the response of the network? To provide model-independent evidence, we have determined how the volume of the space of feasible metabolic states (Materials and Methods) depends on the growth rate. As shown in Fig. 3 A and B (green lines) for the cyoA and lpd knockout mutants, the volume systematically decreases as a function of growth rate for the single-gene knockout mutants considered in our study, indicating that the number of metabolic states available to the unevolved mutant is much larger at lower growth rates. When the latent reactions are disabled, however, the relative volume, and hence the relative number of available metabolic states, increases for large growth rates (Fig. 3 A and B, blue lines). Therefore, the principal effect of removing latent pathways appears to be an increase in the relative frequency of high-growth states due to the preferential elimination of low-growth states. It should be noted, however, that a large number of high-growth states are also disabled in this process because of the “entanglement” between latent pathways and biomass-producing pathways that exist under the metabolic steady-state conditions of our models (Elementary Mode Analysis section in SI Text).
Fig. 3.
Properties of the space of feasible metabolic states. (A and B) Volume of the metabolic solution space as a function of the growth rate for the cyoA- (A) and the lpd- (B) knockout mutant. All values are normalized so that the area under each curve is 1. Thus, each curve can be interpreted as a probability distribution of growth rate within the corresponding metabolic space, either with or without latent pathways. For increasing growth rate, the normalized volume decreases systematically when the latent pathways are available (green lines), but the curve becomes less steep and the volume may even peak at nonzero growth when latent pathways are disabled (blue lines). These results were obtained for the MOMA-predicted response of the network, and similar behavior is observed for the other single-gene knockout mutants in our model. Disabling latent pathways therefore increases the relative frequency of high-growth metabolic states at the expense of low-growth states. For numerical feasibility, this calculation was implemented using the central metabolism of E. coli (Materials and Methods). (C) Postperturbation growth rate if the network were to respond by moving to a random position in the space of feasible states. The dots and error bars correspond to the average and standard deviation for each of the knockout mutants indicated at the bottom, when latent reactions are available (green) and when latent pathways associated with the particular perturbation and random response are disabled (blue). Interpreting a random response as a lower bound for the likely response of the organisms, the systematically higher growth of the latent pathway-disabled strains corroborates the conclusion that latent pathways do not facilitate adaptation. These statistics are summarized in the right column of Table 1. All growth rates are normalized by the FBA-predicted rate of the knockout mutants.
To appreciate the constraints imposed by this structure, imagine that the organism responds to perturbations by moving to a random metabolic state in the feasible space of fluxes. We simulated this hypothetical response using an implementation of the hit-and-run sampling algorithm (Materials and Methods). As shown in Fig. 3C, the postperturbation growth rate is nearly zero for all mutants with latent pathways and close to the theoretical maximum for all mutants without them. This random response is arguably a lower bound for the actual response of organisms that have evolved to cope with perturbations, but the conclusion is clear: unless we assume that organisms have evolved to respond to perturbations in a highly specific manner, which appears to be inconsistent with experiments (30), the availability of latent pathways does not facilitate growth, and this prediction is largely independent of the network response to perturbation. This holds true in particular for MOMA and ROOM, which incorporate (in different ways) the main flux rerouting features observed in the activation of latent pathways in E. coli (12).
Further mechanistic insight comes from the recently identified synthetic rescue interactions (31), in which the knockout of a gene inhibits growth but, counterintuitively, the targeted concurrent knockout of additional genes recovers the ability of the organisms to grow. The reactions catalyzed by such rescue genes are predicted to be active in typical suboptimal states and inactive in growth-maximizing states of the knockout mutant (8, 31). Now, given the observation above that the set of active reactions predicted by FBA is only slightly modified by a gene knockout, it follows that most rescue genes are in fact associated with latent pathways. This, in turn, explains why the latent pathway-disabled strains show improved growth. Note that this argument cannot be anticipated from intuition, because an enormous number of low-growth states (up to several orders of magnitude larger than for near-optimal growth) may exist even when latent pathways are disabled (Fig. 3 A and B). Furthermore, even in the extreme case when one disables all reactions that are inactive in the optimal state of the knockout mutant, metabolic states with a very low growth rate (≈10% or less of the wild-type optimum) exist in 47 out of the 52 mutants (Elementary Mode Analysis section in SI Text). This threshold is significant because all but five unmodified knockout mutants exceed this growth rate according to MOMA, and all but one according to ROOM. Thus, although our model-independent analysis suggests that latent pathway activation inhibits growth under a general choice of metabolic state after a perturbation, this is not directly imposed by the geometry of the solution space. Rather, the predicted growth benefit associated with latent pathway removal and synthetic rescues is partly a reflection of the cells’ adaptive response.
An extreme example of this rescue effect is provided by the cyoA-deficient mutant, which is predicted by MOMA to drop to < 10% of the optimal wild-type growth rate following the perturbation, but recovers to ≈60% if the latent pathways are also disabled. In addition, cases in which the single-gene knockout mutant operates near the theoretical optimum but growth nonetheless improves upon the removal of latent pathways, such as for the folD mutant, can be related to weaker forms of synthetic rescues (31).
This surprising relation to synthetic rescues is particularly interesting when we note that latent reactions define several pathways whose participation in E. coli’s metabolism has been controversial or elusive. The Entner–Doudoroff (ED) pathway, an alternative to glycolysis for glucose catabolism, is inactive in wild-type E. coli according to in vivo experiments in glucose (32) but becomes transiently active in mutants lacking phosphoglucose isomerase (12). This activation may serve to reduce NADPH accumulation accompanying increased flux through the pentose phosphate pathway (33). Both MOMA and ROOM predict a small, transient flux through the ED pathway in response to the knockout of pgi, the gene coding phosphoglucose isomerase. In triphosphate isomerase-deficient strains, our model predicts the activation of the normally inactive methylglyoxal bypass (34). Experimentally, this pathway is observed to channel excess dihydroxyacetone phosphate (DHAP) into pyruvate following glycolytic flux splitting into glyceraldehyde 3-phosphate and DHAP after the knockout of the associated gene, tpi (12). These findings emphasize the importance of probing multiple gene knockouts or perturbations—previously suggested in the context of synthetic lethality (35), synthetic rescues (31, 36), multitarget drug discovery (36, 37), and neutral mutations (38)—as a means to determine the puzzling role of transients.

Discussion

Latent pathways and their associated genes are, by definition, dispensable for optimal growth under the given conditions both before and after a perturbation. Several explanations have been offered to justify the persistence of apparently nonessential metabolic genes within the genome. Environmental robustness is the most natural hypothesis, as it acknowledges the unpredictable, time-varying conditions that confront single-cell organisms in natural environments. Yet, in silico (7, 39) and experimental (11) studies of model organisms under various environmental conditions likely to occur in nature suggest that condition-dependent robustness is inadequate to fully explain the dispensability of metabolic genes. Part of the remaining redundancy may be attributed to the varying efficiencies of redundant pathways under different environments (40), or selective pressure for increased metabolic capacity across all conditions (39). An alternative to environmental robustness is genetic robustness, in which redundant pathways buffer against null mutations (41, 42). But regardless of which explanation might apply in the context of latent pathways, the question that follows from our analysis is not why these dispensable pathways are present in the genome, but rather what causes their transient activation in response to genetic perturbations. Our results indicate that this activation confers no advantage in fitness as measured by growth, and more often hinders growth in the short term following a perturbation.
The dispensability of latent pathway activation predicted in this study can be interpreted in three different ways, which are not necessarily mutually exclusive. First, it is possible that temporary reaction activation does not provide necessary intermediate states but is instead a byproduct of the network’s suboptimal response to perturbations. The fact that E. coli undergoes a period of suboptimal growth following genetic perturbations is well-established in both in silico and in vivo experiments (17, 18). This interpretation is thus supported by the recent observation that typical suboptimal metabolic states have a much larger number of active reactions (more than 2.5 times larger for the conditions considered here) than typical states that optimize growth (8).
A second interpretation is that cells activate a large number of reactions as a global nonspecific response to perturbations that nevertheless creates a library of possible metabolic states that can be subsequently fine-tuned by adaptive evolution. This scenario cannot be rejected by existing experimental results (30) and is appealing as it allows for an indirect, long-term benefit of temporary reaction activation even if, as predicted here, this activation inhibits growth in the short term. Although whole-cell regulation remains largely unexplained, there are known mechanisms that could subsequently lead to optimal growth (43, 44). This interpretation suggests that fine-tuning of whole-cell reaction activity is best achieved by down-regulating specific overexpressed reactions rather than by the coordinated up-regulation of entire underexpressed pathways, a principle that is widely appreciated in metabolic engineering but that is yet to be demonstrated for natural systems. The recent observation that E. coli’s response to perturbations is more stable from the viewpoint of metabolite concentration and protein or mRNA abundance (45) than reaction flux (12) may be relevant for the validation of this interpretation. The objection to the second interpretation is that it cannot explain (not even in the long term) the availability of pathways that are simultaneously dispensable for all known perturbations that have shaped the evolution of the organisms. This “paradox of latency” reinforces the need to study the mechanisms that govern latent pathway activation.
In experimental studies of gene function in microbial organisms, growth is the most often used indicator of fitness, owing to both its accessibility for measurement and its intrinsic importance in determining viability (13, 46). Although the results of this study suggest that the availability of latent pathways does not promote faster growth following a perturbation, the conclusion that this negatively impacts adaptation is reached by studying these pathways in the context of metabolism alone. Thus, a third interpretation is that, in general, the cellular objective invoked in the adaptive response is not growth. Instead, the activation of otherwise latent metabolic pathways may accompany other cellular processes, either inside or outside metabolism, that are initiated to ensure survival. For example, yeast adopts changes in cell shape and internal pressure in response to osmotic shock, a process that recruits the metabolism to accumulate specific metabolites (47). Moreover, enzymes and metabolic cofactors involved in transiently upregulated pathways may have regulatory or signaling roles in addition to their metabolic function (48). This third possibility allows for an advantageous, external function for pathways whose activation appears disadvantageous when only the metabolism is considered. In this way, latent pathway activation is incorporated into a larger, more sophisticated adaptive response. This explanation, however, seems inconsistent with our observation of the apparently highly nonspecific activation of pathways in response to different perturbations.
Regardless of which interpretation proves to be correct, this study leads to a clear, experimentally testable prediction; namely, that latent pathway activation does not enhance, and in fact often inhibits, early postperturbation growth. Although we expect this behavior to be observed experimentally under a wide range of conditions, regardless of the specific suboptimal response of the cell, deviations from this behavior would also be highly informative because they would uncover growth phenotypes not detected in previous steady-state experiments. Varying environments can also generate strong (45) and sometimes counterintuitive responses, such as the possibility of accelerating the evolution of unevolved strains (49). Therefore, it may well be the case that in time-varying environments, which are beyond current modeling capability, latent pathway activation will exhibit a different fitness effect.

Materials and Methods

Network and Perturbations.

We used the iAF1260 E. coli model (15), which is the most up-to-date reconstruction of the metabolic network of E. coli MG1655. The network consists of 2,082 unique reactions catalyzed by 1,260 genes and involves 1,369 metabolites, as well as 299 exchange fluxes and the biomass flux. We focused on the 52 single-gene knockout strains that are compatible with growth but for which the original growth-maximizing flux state becomes infeasible after the gene knockout. The volume calculation, which remains challenging for the full network, was performed using a reduced network consisting of 62 reactions, 101 genes, 49 metabolites, and 14 exchange fluxes representing the central metabolism of E. coli (50).

Medium and Constraints.

The simulated medium consisted of limited amounts of glucose (8 mmol/g DW-h) and oxygen (18.5 mmol/g DW-h), and unlimited amount of carbon dioxide, iron (II), protons, water, potassium, sodium, ammonia, phosphate, and sulfate. Irreversible reactions are constrained to have nonnegative fluxes. The flux through the ATP maintenance reaction was set to 8.39 mmol/g DW-h. A total of 1,432 reactions in the iAF1260 model, including 61 reactions in the subnetwork of the central metabolism, can be active under these medium conditions. An analysis of the effect of regulatory limitations (51) that may constrain this reaction activity in vivo is considered in the Regulatory Constraints section in SI Text. There, it is shown that our results remain valid under the additional constraints imposed by these limitations.

Feasible Metabolic States.

The state of the metabolic network is represented by a vector of all reaction fluxes ν = (νj). Because the time scale at which the network responds to perturbations is much shorter than the characteristic time for adaptive evolution, we focused on steady-state flux distributions both before and after perturbation. Steady-state fluxes are solutions of the mass-balance equation S·ν = 0, where S = (Sij) is the matrix of stoichiometric coefficients, under the constraint imposed by the medium, reaction irreversibility, ATP maintenance requirements, and possibly gene knockouts. A knockout of the enzyme-coding genes associated with reaction j is implemented through the additional constraint νj = 0. The solution space is the set of all such steady-state flux vectors and it has the form of a convex polytope. We refer to the individual solutions in this space as feasible metabolic states.

Objective Functions.

FBA (19) identifies a growth-maximizing state within the space of feasible metabolic states by maximizing the flux νb through a biomass reaction that drains biomass precursors. With respect to this original state, MOMA (17) and ROOM (18) identify feasible states that minimize the distance in the space of fluxes and the number of significant flux changes, respectively (the MOMA and ROOM sections in SI Text). Our implementations of FBA, MOMA, ROOM, and the hit-and-run algorithm used a commercial optimization package (ILOG CPLEX, version 11.0, www.ilog.com). For all FBA results, we have used the growth-maximizing states provided by the simplex algorithm. More information about the computational methods is provided in SI Text, where it is also shown that our results do not depend sensitively on the assumption of optimal growth in the reference states either before or after the perturbation (Effects of Nonoptimal Reference States), nor on particular choices for the growth-maximizing states used throughout the paper (Sensitivity to Alternate Optima).

Biomass Flux and Growth Rates.

The in silico model predicts the biomass flux, but for exponential growth the result can be expressed in terms of a normalized growth rate . This follows from the observation that biomass production is governed by , where κ is the growth rate, N measures the population size, and is proportional to the biomass flux νb. Therefore, when normalized with respect to the wild-type or theoretical maximum, the normalized biomass flux νb/νb,0 equals the corresponding normalized growth rate κ/κ0 used throughout the paper.

Identification of Latent Pathways.

We define the latent reactions (pathways) associated with the knockout of gene A to be the set LA of all reactions predicted to be transiently active in the unevolved mutant shortly after the perturbation (according to MOMA, ROOM, or the hit-and-run algorithm), but inactive in both the optimal wild-type and mutant strains (according to FBA). This set is therefore nonessential for optimal growth under the knockout of A, although in principle some of these reactions may be necessary for growth under the knockout of a different gene, B. To account for this, we tested the impact of a set of latent reactions that are simultaneously nonessential in the optimal growth states of the other 51 single-gene knockout mutants that we consider.

Volume Calculation.

The exact calculation of the volume of the (high-dimensional) solution space is computationally intractable and, because this set is very skewed (52), even approximate calculations are computationally demanding. To determine the volume of the solution space as a function of growth rate, we used an approximate inference algorithm based on belief propagation (53). In this approach, the convex polytope representing the constrained flux space is tiled with hypercubes of size ε. We then use a message-passing algorithm to approximate the probability distribution P(ν) over the discretized space. Using this, we define the associated entropy , which counts the log of the number of ε-hypercubes that overlap the space of feasible states. The total volume covered by these cubes is then V = 10Sεn, where n is the dimension of the space. If m is the number of linearly independent mass balance constraints and f is the number of available fluxes after a given knockout, then the dimension of the unmodified metabolic space is n = f - m, and the solution space for the modified organism has dimension n = f - m - l, where l is the number of independent latent reactions removed. We used a granularity of for all calculations presented in the paper.

Hit-and-Run Algorithm.

To randomly sample the metabolic solution space, we implemented a hit-and-run algorithm (54) with artificial centering (55), which is a quickly converging sampler for high-dimensional convex sets. The algorithm is based on selecting a randomly oriented line l passing through the current sample point and then selecting the point for the next iteration from a uniform distribution along l. For efficient sampling, we employed artificial centering (55), where the orientation of the line l is obtained from the direction defined by the current sample point and the center (mean) of a subset C of already-sampled points. The subset C was initially created by taking 10,000 warm-up points on the boundary of the space and was updated recursively by replacing a randomly selected point of C with the currently sampled point. In all calculations presented in the paper, we generated a set of 5 × 106 points to sample the solution space. All calculations were performed with the COBRA Toolbox (56).

Acknowledgments.

This work was supported by the National Science Foundation under Grant DMS-0709212, the National Cancer Institute under Grant 1U54CA143869-01, and a Sloan Research Fellowship to A.E.M.

Supporting Information

Supporting Information (PDF)
Supporting Information

References

1
T Baba, et al., Construction of Escherichia coli K-12 in-frame, single-gene knockout mutants: The Keio collection. Mol Syst Biol 2, 2006.0008 (2006).
2
SY Gerdes, et al., Experimental determination and system level analysis of essential genes in Escherichia coli MG1655. J Bacteriol 185, 5673–5684 (2003).
3
G Giaever, et al., Functional profiling of the Saccharomyces cerevisiae genome. Nature 418, 387–391 (2002).
4
A DeLuna, et al., Exposing the fitness contribution of duplicated genes. Nat Genet 40, 676–681 (2008).
5
MA D’Elia, MP Pereira, ED Brown, Are essential genes really essential? Trends Microbiol 10, 433–438 (2009).
6
M Kupiec, R Sharan, E Ruppin, Genetic interactions in yeast: Is robustness going bust? Mol Syst Biol 3, 97 (2007).
7
B Papp, C Pál, LD Hurst, Metabolic network analysis of the causes and evolution of enzyme dispensability in yeast. Nature 429, 661–664 (2004).
8
T Nishikawa, N Gulbahce, AE Motter, Spontaneous reaction silencing in metabolic optimization. PLoS Comput Biol 4, e1000236 (2008).
9
LM Blank, L Kuepfer, U Sauer, Large-scale 13C-flux analysis reveals mechanistic principles of metabolic network robustness to null mutations in yeast. Genome Biol 6, R49 (2005).
10
ME Hillenmeyer, et al., The chemical genomic portrait of yeast: Uncovering a phenotype for all genes. Science 320, 362–365 (2008).
11
R Harrison, B Papp, C Pál, SG Oliver, D Delneri, Plasticity of genetic interactions in metabolic networks of yeast. Proc Natl Acad Sci USA 104, 2307–2312 (2007).
12
SS Fong, A Nanchen, BØ Palsson, U Sauer, Latent pathway activation and increased pathway capacity enable Escherichia coli adaptation to loss of key metabolic enzymes. J Biol Chem 281, 8024–8033 (2006).
13
SS Fong, AR Joyce, BØ Palsson, Parallel adaptive evolution cultures of Escherichia coli lead to convergent growth phenotypes with different gene expression states. Genome Res 15, 1365–1372 (2005).
14
M Ito, T Baba, H Mori, Functional analysis of 1440 Escherichia coli genes using the combination of knock-out library and phenotype microarrays. Metab Eng 7, 318–327 (2005).
15
AM Feist, et al., A genome-scale metabolic reconstruction for Escherichia coli K-12 MG1655 that accounts for 1260 ORFs and thermodynamic information. Mol Syst Biol 3, 121 (2007).
16
AM Feist, BØ Palsson, The growing scope of applications of genome-scale metabolic reconstructions using Escherichia coli. Nat Biotechnol 26, 659–667 (2008).
17
D Segre, D Vitkup, GM Church, Analysis of optimality in natural and perturbed metabolic networks. Proc Natl Acad Sci USA 99, 15112–15117 (2002).
18
T Shlomi, O Berkman, E Ruppin, Regulatory on/off minimization of metabolic flux changes after genetic perturbations. Proc Natl Acad Sci USA 102, 7695–7700 (2005).
19
JS Edwards, BØ Palsson, The Escherichia coli MG1655 in silico metabolic genotype: Its definition, characteristics, and capabilities. Proc Natl Acad Sci USA 97, 5528–5533 (2000).
20
RU Ibarra, JS Edwards, BØ Palsson, Escherichia coli K-12 undergoes adaptive evolution to achieve in silico predicted optimal growth. Nature 420, 186–189 (2002).
21
M Emmerling, et al., Metabolic flux responses to pyruvate kinase knockout in Escherichia coli. J Bacteriol 184, 152–164 (2002).
22
Q Huang, C Yang, T Baba, H Mori, K Shimizu, Responses of the central metabolism in Escherichia coli to phosphoglucose isomerase and glucose-6-phosphate dehydrogenase knockouts. J Bacteriol 185, 7053–7067 (2003).
23
CD Herring, et al., Comparative genome sequencing of Escherichia coli allows observation of bacterial evolution on a laboratory timescale. Nat Genet 38, 1406–1412 (2006).
24
DH Lee, BØ Palsson, Adaptive evolution of Escherichia coli K-12 MG1655 during growth on a nonnative carbon source, L-1,2-propanediol. Appl Environ Microbiol 76, 4158–4168 (2010).
25
SS Fong, BØ Palsson, Metabolic gene-deletion strains of Escherichia coli evolve to computationally predicted growth phenotypes. Nat Genet 36, 1056–1058 (2004).
26
R Mahadevan, CH Schilling, The effects of alternate optimal solutions in constraint-based genome-scale metabolic models. Metab Eng 5, 264–276 (2003).
27
JL Reed, BØ Palsson, Genome-scale in silico models of E coli have multiple equivalent phenotypic states: Assessment of correlated reaction subsets that comprise network states. Genome Res 14, 1797–1805 (2004).
28
RT Vinopal, DG Fraenkel, Phenotypic suppression of phosphofructokinase mutations in Escherichia coli by constitutive expression of the glyoxylate shunt. J Bacteriol 118, 1090–1100 (1974).
29
A Hasona, Y Kim, FG Healy, LO Ingram, KT Shangmugam, Pyruvate formate lyase and acetate kinase are essential for anaerobic growth of Escherichia coli on xylose. J Bacteriol 186, 7593–7600 (2004).
30
S Stern, T Dror, E Stolovicki, N Brenner, E Braun, Genome-wide transcriptional plasticity underlies cellular adaptation to novel challenge. Mol Syst Biol 3, 106 (2007).
31
AE Motter, N Gulbahce, E Almaas, AL Barabási, Predicting synthetic rescues in metabolic networks. Mol Syst Biol 4, 168 (2008).
32
S Flores, G Gosset, N Flores, AA de Graaf, F Bolívar, Analysis of carbon metabolism in Escherichia coli strains with an inactive phosphotransferase system by 13C labeling and NMR spectroscopy. Metab Eng 4, 124–137 (2002).
33
Q Hua, C Yang, T Baba, H Mori, K Shimizu, Responses of the central metabolism in Escherichia coli to phosphoglucose isomerase and Glucose-6-Phosphate dehydrogenase knockouts. J Bacteriol 185, 7053–7067 (2003).
34
GP Ferguson, S Tötemeyer, MJ MacLean, IR Booth, Methylglyoxal production in bacteria: Suicide or survival? Arch Microbiol 170, 209–218 (1998).
35
D Deutscher, I Meilijson, M Kupiec, E Ruppin, Multiple knockout analysis of genetic robustness in the yeast metabolic network. Nat Genet 38, 993–998 (2006).
36
AE Motter, Improved network performance via antagonism: From synthetic rescues to multi-drug combinations. Bioessays 32, 236–245 (2010).
37
AL Hopkins, Network pharmacology: The next paradigm in drug discovery. Nat Chem Biol 4, 682–690 (2008).
38
A Wagner, Neutralism and selectionism: A network-based reconciliation. Nat Rev Genet 9, 965–974 (2008).
39
S Freilich, et al., Decoupling environment-dependent and independent genetic robustness across bacterial species. PLoS Comput Biol 6, e1000690 (2010).
40
Z Wang, J Zhang, Abundant indispensable redundancies in cellular metabolic networks. Genome Biol Evol 1, 23–33 (2009).
41
AL Barabási, Z Oltvai, Network biology: Understanding the cell’s functional organization. Nat Rev Genet 5, 101–113 (2004).
42
Z Gu, et al., Role of duplicate genes in genetic robustness against null mutations. Nature 421, 63–66 (2003).
43
S Goyal, J Yuan, T Chen, JD Rabinowitz, NS Wingreen, Achieving optimal growth through product feedback inhibition in metabolism. PLoS Comput Biol 6, e1000802 (2010).
44
L Pelosi, et al., Parallel changes in global protein profiles during long-term experimental evolution in Escherichia coli. Genetics 173, 1851–1869 (2006).
45
N Ishii, et al., Multiple high-throughput analyses monitor the response of E coli to perturbations. Science 316, 593–597 (2007).
46
A Typas, et al., High-throughput, quantitative analyses of genetic interactions in E. coli. Nat Methods 5, 781–787 (2006).
47
E Klipp, B Nordlander, R Krüger, P Gennemark, S Hohmann, Integrative model of the response of yeast to osmotic shock. Nat Biotechnol 23, 975–982 (2005).
48
Y Shi, Y Shi, Metabolic enzymes and coenzymes in transcription—A direct link between metabolism and transcription? Trends Genet 20, 445–452 (2004).
49
N Kashtan, E Noor, U Alon, Varying environments can speed up evolution. Proc Natl Acad Sci USA 104, 13711–13716 (2007).
50
JL Reed, TD Vo, CH Schilling, BØ Palsson, An expanded genome-scale model of Escherichia coli K-12 (iJR904 GSM/GPR). Genome Biol 4, R54 (2003).
51
MW Covert, EM Knight, JL Reed, MJ Herrgard, BØ Palsson, Integrating high-throughput and computational data elucidates bacterial networks. Nature 429, 92–96 (2004).
52
G Bianconi, Flux distribution of metabolic networks close to optimal biomass production. Phys Rev E 78, 035101 (2008).
53
A Braunstein, R Mulet, A Pagnani, Estimating the size of the solution space of metabolic networks. BMC Bioinformatics 9, 240 (2008).
54
RL Smith, Efficient Monte Carlo procedures for generating points uniformly distributed over bounded regions. Oper Res 32, 1296–1308 (1984).
55
DE Kaufman, RL Smith, Direction choice for accelerated convergence in hit-and-run sampling. Oper Res 46, 84–95 (1998).
56
SD Becker, et al., Quantitative prediction of cellular metabolism with constraint-based models: The COBRA Toolbox. Nat Protoc 2, 727–738 (2007).

Information & Authors

Information

Published in

The cover image for PNAS Vol.108; No.8
Proceedings of the National Academy of Sciences
Vol. 108 | No. 8
February 22, 2011
PubMed: 21300895

Classifications

Submission history

Published online: February 7, 2011
Published in issue: February 22, 2011

Keywords

  1. complex networks
  2. flux balance analysis
  3. metabolic networks
  4. gene dispensability
  5. synthetic rescues

Acknowledgments

This work was supported by the National Science Foundation under Grant DMS-0709212, the National Cancer Institute under Grant 1U54CA143869-01, and a Sloan Research Fellowship to A.E.M.

Notes

This article is a PNAS Direct Submission.

Authors

Affiliations

Sean P. Cornelius
Department of Physics and Astronomy, Northwestern University, Evanston, IL 60208;
Joo Sang Lee
Department of Physics and Astronomy, Northwestern University, Evanston, IL 60208;
Adilson E. Motter1 [email protected]
Department of Physics and Astronomy, Northwestern University, Evanston, IL 60208;
Northwestern Institute on Complex Systems, Northwestern University, Evanston, IL 60208; and
Department of Molecular Biology, Princeton University, Princeton, NJ 08544

Notes

1
To whom correspondence should be addressed. E-mail: [email protected].
Author contributions: S.P.C. and A.E.M. designed research; S.P.C. and J.S.L. performed research; S.P.C. and J.S.L. analyzed data; and S.P.C. and A.E.M. wrote the paper.

Competing Interests

The authors declare no conflict of interest.

Metrics & Citations

Metrics

Note: The article usage is presented with a three- to four-day delay and will update daily once available. Due to ths delay, usage data will not appear immediately following publication. Citation information is sourced from Crossref Cited-by service.


Citation statements




Altmetrics

Citations

Export the article citation data by selecting a format from the list below and clicking Export.

Cited by

    Loading...

    View Options

    View options

    PDF format

    Download this article as a PDF file

    DOWNLOAD PDF

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Personal login Institutional Login

    Recommend to a librarian

    Recommend PNAS to a Librarian

    Purchase options

    Purchase this article to access the full text.

    Single Article Purchase

    Dispensability of Escherichia coli’s latent pathways
    Proceedings of the National Academy of Sciences
    • Vol. 108
    • No. 8
    • pp. 3093-3453

    Media

    Figures

    Tables

    Other

    Share

    Share

    Share article link

    Share on social media