Previous Article |
Table of Contents
| Next Article
Applied Mathematics
Saccharomyces cerevisiae phenotypes can be predicted by using constraint-based analysis of a genome-scale reconstructed metabolic network



*Department of Bioengineering, University of California at San Diego, 9500 Gilman Drive, La Jolla, CA 92093-0412; and
Center for Process Biotechnology, BioCentrum-DTU, Technical University of Denmark, 2800 Lyngby, Denmark
Communicated by Yuan-Cheng B. Fung, University of California at San Diego, La Jolla, CA, September 10, 200310. (received for review June 19, 2003)
| Abstract |
|---|
|
|
|---|
7080% of the conditions considered.
Integrated functions of reconstructed metabolic networks can be determined in silico by using a number of analytical approaches (911). The relatively young constraint-based approach differs fundamentally from the more traditional kinetic theory-based approaches in that it is not aimed at finding the solution or behavior of the network under certain conditions but rather at eliminating solutions (behaviors) that the network cannot exhibit (Fig. 1). By using this approach, a network of interactions is successively constrained by defining the stoichiometry of the interacting components, the direction of network reactions, and the maximum allowable throughput. In this way, candidate solutions to the network equations are systematically eliminated by the successive application of the governing constraints, i.e., stoichiometry, thermodynamics, and maximal enzymatic rates (12). One thus can define the range of capabilities of the reconstructed network and then, through the use of optimization procedures, calculate the "best" solution within the allowable range (1315). If the network has evolved to produce this best or optimal function, then an agreement is obtained between experimentally determined behavior and the in silico computations (16).
|
The constraint-based approach provides an appropriate simulation platform for studying the overall phenotypic behavior for S. cerevisiae. A tremendous amount of experimental data has been and continues to be generated for S. cerevisiae (1719). A number of databases have been dedicated to store and update yeast experimental data sets, and several mathematical models are available that use kinetic information to capture cell behaviors (2023). Limited availability of kinetic information restricts such models to a subset of the whole cell. The constraint-based approach is not limited by the availability of the kinetic data and thus can capture the complex content of the whole cell by using the existing genomewide knowledge and maximally used high-throughput data on yeast.
The constraint-based approach (1215) has been productively used to study the properties of genome-scale reconstructions of bacterial metabolic networks including Haemophilus influenzae (24), Escherichia coli (25), and Helicobacter pylori (26). Here we apply this approach to the recently reconstructed genome-scale S. cerevisiae metabolic network (8) to compute the properties of a eukaryotic cell and compare them to experimental observations.
| Methods |
|---|
|
|
|---|
16% of the assigned ORFs in the S. cerevisiae genome (7). The network includes a total of 1,175 biochemical reactions, of which 124 are mitochondrial, 702 are cytosolic, and 349 are transport reactions across the plasma and mitochondrial membranes. The metabolic network contains a total of 584 unique metabolites, which are distributed among different compartments: 559 are in the cytoplasm, 164 are in the mitochondria, and 121 are extracellular. The complete model is available at www.cpb.dtu.dk/models/yeastmodel.html and http://systemsbiology.ucsd.edu/organisms/yeast.html. In Silico Computations. The metabolic capabilities of the S. cerevisiae network were calculated by using flux balance analysis and linear optimization (14, 15, 27). For growth simulations, biomass synthesis (i.e., production of biosynthetic components at the physiological level) was selected as the objective function to be maximized, and optimization was done subject to stoichiometric, limited thermodynamics, and reaction capacity constraints by using established procedures (1315). Optimization problems were solved by using the commercially available package LINDO (Lindo Systems, Chicago).
In Silico Number of ATP Molecules Produced per Pair of Electrons Donated to the Electron Transport System (ETS) (P/O) Calculation. The P/O ratio, or ATP yield of respiration, is a measure of the efficiency of oxidative phosphorylation in energy metabolism. The P/O ratio is calculated as the relative amount of ATP molecules produced per pair of electrons donated to the ETS. The theoretical value of the P/O ratio for S. cerevisiae is 1.5 (28), i.e., 18 ATP molecules per 12 pairs of electrons transferred via the ETS. The in silico P/O ratio was calculated by determining the maximum number of ATP molecules produced through the ETS per consumed molecule of glucose by using the reconstructed model (maximizing ATP production in the S. cerevisiae model grown on one molecule of glucose yields YATP,max = 12.5 ATP molecules that are generated via the ETS). The in silico P/O ratio thus was calculated as the ratio of the maximum in silico ATP yield over total electron pairs transferred to the ETS.
Energy Requirement for Biomass Formation. The amount of ATP required for biomass formation, i.e., growth-associated ATP requirement, was calculated by summing the ATP needed for the formation of precursor metabolites [9.89 mmol of ATP/grams dry weight (gDW)], for polymerization of macromolecules (23.92 mmol of ATP/gDW), and the model-based ATP requirement to meet the biomass yield of 0.51 gDW/g glucose (29) in an aerobic glucose-limited continuous culture (35.36 mmol of ATP/gDW).
Chemostat Growth Simulation. In a continuous culture, growth rate is equivalent to the dilution rate and kept at a constant value. Calculations of the continuous culture of S. cerevisiae were performed by fixing the in silico growth rate to the experimentally determined dilution rate and minimizing the glucose uptake rate. This formulation is equivalent to maximizing biomass production given a fixed glucose uptake value and was used to simulate a continuous-culture growth condition.
| Results |
|---|
|
|
|---|
0.95 (28). This difference is generally attributed to the use of the mitochondrial transmembrane proton gradient needed to drive metabolite exchange (such as the proton-coupled translocation of pyruvate) across the inner mitochondrial membrane. In the reconstructed network, which contains no proton leakage, 12.5 molecules of ATP are generated via the ETS. As complete oxidation of glucose leads to donation of 12 electron pairs (10 NADH and 2 FADH2) to the electron transport chain, the in silico P/O ratio is 1.04 for oxidation of NADH and FADH2 during growth on glucose, i.e., 12.5/12 = 1.04, agreeing well with the measured value without including any proton leakage. The network-based computation systemically accounts for all the steps required to import and export compounds from the mitochondria, computing a net overall P/O ratio. Cells require energy for both growth- and non-growth-associated activities (29). The energy requirement for the formation of biomass has been measured experimentally for S. cerevisiae, and reported values range from 62.5 to 71.4 mmol of ATP/gDW (29, 30). A network-based calculation procedure of the growth-associated energy requirement has been developed (31), and when applied to the reconstructed S. cerevisiae network, a value of 69.2 mmol of ATP/gDW was computed (see Methods), which falls in the range of experimentally determined values. Energy required for precursor metabolite formation and macromolecule polymerization can be calculated from the biosynthetic composition of the cell. The model-based ATP requirement is entirely network-dependent and was derived from the in silico calculations.
The reconstructed metabolic network of S. cerevisiae can be used to represent whole-cell functions by placing simultaneously all growth and maintenance demands on the network (14, 15). These demands include the production of all the biomass components in the appropriate physiological proportion (8) while meeting both growth-associated (69.2 mmol of ATP/gDW) and non-growth-associated (1 mmol of ATP/gDW per h) energy requirements (30, 32). The in silico representation of all metabolic demands on the reconstructed map can be used simultaneously for analyzing, interpreting, and predicting the phenotypic behavior of S. cerevisiae, such as in anaerobic and aerobic culture and during metabolic growth shifts.
Optimal growth properties of S. cerevisiae were computed under anaerobic glucose-limited continuous culture at dilution rates varying between 0.1 and 0.4 h-1 (see Methods). The computed by-product secretion rates then were compared with the experimental data (33). The calculated uptake rates of glucose and the production of ethanol, glycerol, succinate, and biomass were in good agreement with the independently obtained experimental data (Fig. 2, and Fig. 5 and Table 2, which are published as supporting information on the PNAS web site) and, as for E. coli (34), lie at the edge of the allowable solution range. The relatively low observed acetate secretion rates were not predicted by the in silico model. The in silico analysis of the reconstructed network indicates that the release of acetate does not improve the optimal solution of the network, most likely because of the absence of kinetic and regulatory constraints. It is possible to constrain the in silico model further to secrete acetate at the experimental level and recompute an optimal solution under these additional constraints. This calculation results in values that are closer to the measured glucose uptake rates (Fig. 2 A). This procedure is an example of an iterative data-driven constraint-based modeling approach, by which the successive incorporation of experimental data is used to improve the in silico model (12, 35) without the addition of rigorous analytical tools and complex information. Besides the ability to describe the overall growth yield, the model allows further insight into how the metabolism operates at a more detailed level. From analysis of the metabolic fluxes at anaerobic growth conditions, the flux through the glucose-6-phosphate dehydrogenase is found to be 5.32% of the glucose uptake rate at a dilution rate of 0.1 h-1, which is consistent with the experimentally determined value (6.34%) for this flux when cells are operating with fermentative metabolism (33).
|
Optimal growth properties of S. cerevisiae were computed under aerobic glucose-limited continuous culture, where the Crabtree effect plays an important role. The Crabtree effect refers to alcoholic fermentation under aerobic conditions and occurs when the dilution rate in sugar-limited chemostat cultures exceeds a critical value that depends on the strain (29). The molecular mechanisms underlying the Crabtree effect in S. cerevisiae are not known. The regulatory features of the Crabtree effect (36), however, can be included in the in silico model as an experimentally determined (37) growth-rate-dependent maximum oxygen uptake rate (38). With this additional constraint, the in silico model makes quantitative predictions about the respiratory quotient, glucose uptake, ethanol, CO2, and glycerol secretion rates under aerobic glucose-limited continuous conditions (Fig. 3, and Fig. 6 and Table 3, which are published as supporting information on the PNAS web site). In addition, the capabilities of the reconstructed network to quantitatively predict experimental observations of growth in S. cerevisiae extends to metabolic by-product yields, specific growth rate, and specific substrate uptake rates under both batch culture and chemostat culture by using various carbon sources (e.g., glucose, galactose, and ethanol) and oxygenation conditions (i.e., aerobic and anaerobic conditions) (Tables 4 and 5, which are published as supporting information on the PNAS web site), demonstrating the utility of the model in a wide range of conditions.
|
A whole-genome in silico representation of an organism has the capability to predict the phenotypic consequences of gene deletions from the genome (25, 26). The impact of single-gene deletions on growth was analyzed by simulating growth on a synthetic complete medium containing glucose, amino acids, and purines and pyrimidines, where the reaction corresponding to the deleted gene was removed. The in silico results were compared with experimental results as supplied by the Saccharomyces Genome Database (5). For central metabolism, growth was predicted under various experimental conditions (Table 1), and 81.5% (93 of 114 cases) of the in silico predictions were in agreement with in vivo phenotypes. In all cases the disagreements could be explained by known regulatory mechanisms, illustrating that deviations of the model predictions from experimental data will generally point to the existence of specific regulatory structures. In a more detailed study (39), in 87.8% of all considered cases (526 of 599), the in silico prediction was in qualitative agreement with experimental results.
|
The in silico model was used to simulate the outcome of expression-profiling experiments during metabolic shifts. The computed changes in the flux maps then were compared on a qualitative basis (i.e., up or down) with experimentally determined mRNA expression profiles. Two metabolic shifts were considered: (i) aerobic-to-anaerobic glucose-limited continuous cultures (40) and (ii) a shift from respiro-fermentative growth on glucose to aerobic growth on ethanol (41), referred to as diauxic shift. The qualitative changes in calculated flux levels and experimentally determined transcription levels of the corresponding genes correlated in
58% for the central metabolic reactions (26 of 45; Table 6, which is published as supporting information on the PNAS web site) and 49% of all cases (215 of 436; Table 7, which is published as supporting information on the PNAS web site), including all the internal and transport reactions in anaerobic/aerobic glucose-limited continuous study. For the diauxic shift, the qualitative comparison between transcription and flux levels agreed in 61% of the central metabolic reactions (27 of 44 cases) (Table 8, which is published as supporting information on the PNAS web site).
By using the mRNA expression changes as additional biological constraints, flux changes during metabolic shifts were reexamined. Constraining the internal fluxes in both metabolic shifts improved the correlation between expression levels and flux changes to 78% for the central metabolic reactions (35 of 45 cases; Fig. 4 and Table 9, which is published as supporting information on the PNAS web site) and 63% for all cases (275 of 436; Table 10, which is published as supporting information on the PNAS web site) in anaerobic/aerobic glucose-limited continuous study and 84% (37 of 44 cases; Fig. 7 and Table 11, which are published as supporting information on the PNAS web site) in the diauxic shift study.
|
| Discussion |
|---|
|
|
|---|
Unlike genome-sequencing projects that have a well defined endpoint, the in silico model-building procedure at the genome scale is an iterative and ongoing process during which the content of a model grows, and thus the scope of properties that can be computed widens. The 13-year history of building models for E. coli demonstrates this iterative model-building process (43). Importantly, the in silico model-building procedure in this way provides a framework for the integration of an expanding number of heterogeneous data types. The in silico computation of cellular functions from reconstructed networks allows for study of the relationship between environmental and genetic factors and how they come together to produce cellular phenotypes. These capabilities will be important in our quest to mathematically model and computer-simulate complex biological functions, which represents a fundamental goal of in silico biology.
| Acknowledgements |
|---|
| Footnotes |
|---|
I.F. and J.F. contributed equally to this work. ![]()
Present address: Fluxome Sciences A/S, Søltofts Plads, Building 223, Technical University of Denmark, 2800 Lyngby, Denmark. ![]()
¶ To whom correspondence should be addressed. E-mail: palsson{at}ucsd.edu.
| References |
|---|
|
|
|---|
This article has been cited by other articles in HighWire Press-hosted journals:
![]() |
T. Shlomi, M. Herrgard, V. Portnoy, E. Naim, B. O. Palsson, R. Sharan, and E. Ruppin Systematic condition-dependent annotation of metabolic genes Genome Res., November 1, 2007; 17(11): 1626 - 1633. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Rokhlenko, T. Shlomi, R. Sharan, E. Ruppin, and R. Y. Pinter Constraint-based functional similarity of metabolic genes: going beyond network topology Bioinformatics, August 15, 2007; 23(16): 2139 - 2146. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Steuer, A. N. Nesi, A. R. Fernie, T. Gross, B. Blasius, and J. Selbig From structure to dynamics of metabolic pathways: application to the plant mitochondrial TCA cycle Bioinformatics, June 1, 2007; 23(11): 1378 - 1385. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Zhu, M. Gerstein, and M. Snyder Getting connected: analysis and principles of biological networks Genes & Dev., May 1, 2007; 21(9): 1010 - 1024. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. G. Bundy, B. Papp, R. Harmston, R. A. Browne, E. M. Clayson, N. Burton, R. J. Reece, S. G. Oliver, and K. M. Brindle Evaluation of predicted network modules in yeast metabolism using NMR-based metabolite profiling Genome Res., April 1, 2007; 17(4): 510 - 519. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Harrison, B. Papp, C. Pal, S. G. Oliver, and D. Delneri Plasticity of genetic interactions in metabolic networks of yeast PNAS, February 13, 2007; 104(7): 2307 - 2312. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Teusink, A. Wiersma, D. Molenaar, C. Francke, W. M. de Vos, R. J. Siezen, and E. J. Smid Analysis of Growth of Lactobacillus plantarum WCFS1 on a Complex Medium Using a Genome-scale Metabolic Model J. Biol. Chem., December 29, 2006; 281(52): 40041 - 40048. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Steuer, T. Gross, J. Selbig, and B. Blasius Structural kinetic modeling of metabolic networks PNAS, August 8, 2006; 103(32): 11868 - 11873. [Abstract] [Full Text] [PDF] |
||||
![]() |
C.-P. Hsu, P.-H. Lee, C.-W. Chang, and C.-T. Lee Constructing quantitative models from qualitative mutant phenotypes: preferences in selecting sensory organ precursors Bioinformatics, June 1, 2006; 22(11): 1375 - 1382. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. J. Herrgard, B.-S. Lee, V. Portnoy, and B. O. Palsson Integrated analysis of regulatory and metabolic networks reveals novel regulatory mechanisms in Saccharomyces cerevisiae Genome Res., May 1, 2006; 16(5): 627 - 635. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Imielinski, C. Belta, H. Rubin, and A. Halasz Systematic Analysis of Conservation Relations in Escherichia coli Genome-Scale Metabolic Network Reveals Novel Growth Media Biophys. J., April 15, 2006; 90(8): 2659 - 2672. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Mahadevan, D. R. Bond, J. E. Butler, A. Esteve-Nunez, M. V. Coppi, B. O. Palsson, C. H. Schilling, and D. R. Lovley Characterization of Metabolism in the Fe(III)-Reducing Organism Geobacter sulfurreducens by Constraint-Based Modeling Appl. Envir. Microbiol., February 1, 2006; 72(2): 1558 - 1568. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Dolinski and D. Botstein Changing perspectives in yeast research nearly a decade after the genome sequence Genome Res., December 1, 2005; 15(12): 1611 - 1619. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Wagner and R. Urbanczik The Geometry of the Flux Cone of a Metabolic Network Biophys. J., December 1, 2005; 89(6): 3837 - 3845. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. N. Vemuri and A. A. Aristidou Metabolic Engineering in the -omics Era: Elucidating and Modulating Regulatory Networks Microbiol. Mol. Biol. Rev., June 1, 2005; 69(2): 197 - 216. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Borodina, P. Krabben, and J. Nielsen Genome-scale analysis of Streptomyces coelicolor A3(2) metabolism Genome Res., June 1, 2005; 15(6): 820 - 829. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Shlomi, O. Berkman, and E. Ruppin Regulatory on/off minimization of metabolic flux changes after genetic perturbations PNAS, May 24, 2005; 102(21): 7695 - 7700. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. D. King, S. M. Garrett, and G. M. Coghill On the use of qualitative reasoning to simulate and identify metabolic pathways Bioinformatics, May 1, 2005; 21(9): 2017 - 2026. [Abstract] [Full Text] [PDF] |
||||
![]() |