生物谷報道:今天出版的Nature報道采用生物信息學(xué)方法研究代謝水平上基因調(diào)控的網(wǎng)絡(luò)調(diào)控的機(jī)理 Nature 427, 839 - 843 (26 February 2004); doi:10.1038/nature02289
Global organization of metabolic fluxes in the bacterium Escherichia coli
E. ALMAAS1, B. KOVÁCS1,2, T. VICSEK2, Z. N. OLTVAI3 & A.-L. BARABÁSI1
1 Department of Physics, University of Notre Dame, Notre Dame, Indiana 46556, USA
2 Biological Physics Department and Research Group of HAS, Eötvös University, H-1117 Budapest, Hungary
3 Department of Pathology, Northwestern University, Chicago, Illinois 60611, USA
Correspondence and requests for materials should be addressed to A.-L.B. ([email protected]).
Cellular metabolism, the integrated interconversion of thousands of metabolic substrates through enzyme-catalysed biochemical reactions, is the most investigated complex intracellular web of molecular interactions. Although the topological organization of individual reactions into metabolic networks is well understood1-4, the principles that govern their global functional use under different growth conditions raise many unanswered questions5-7. By implementing a flux balance analysis8-12 of the metabolism of Escherichia coli strain MG1655, here we show that network use is highly uneven. Whereas most metabolic reactions have low fluxes, the overall activity of the metabolism is dominated by several reactions with very high fluxes. E. coli responds to changes in growth conditions by reorganizing the rates of selected fluxes predominantly within this high-flux backbone. This behaviour probably represents a universal feature of metabolic activity in all cells, with potential implications for metabolic engineering.
To identify the interplay between the underlying topology1-3 of the metabolic network of E. coli K12 MG1655 and its functional organization, we focus on the global features of potentially achievable flux states in this model organism with a fully sequenced and annotated genome13, 14. In accordance with flux balance analysis (FBA)8-12, we first identified the solution space (that is, all possible flux states under a given condition) by using constraints imposed by the conservation of mass and the stoichiometry of the reaction system for the reconstructed E. coli metabolic network8-12. Assuming that cellular metabolism is in a steady state and optimized for the maximal growth rate, FBA allows us to calculate the flux for each reaction using linear optimization8-11, which provides a measure of the relative activity of each reaction.
As previously shown8-10, the steady state and optimality approximations offer experimentally verifiable predictions on the flux states of the cell. Under any condition, however, there are also expected differences, some stemming from the fact that there are transient effects and that the growth rate is not always exactly optimal12. A marked feature of the obtained flux distribution is its overall inhomogeneity: reactions with fluxes spanning several orders of magnitude coexist under the same conditions. For example, under optimal growth conditions in a glutamate-rich culture, the dimensionless flux of the succinyl coenzyme A synthetase reaction is 0.185, whereas the flux of the aspartate oxidase reaction is four orders of magnitude smaller, with a value of 2.2 10-5 in dimensionless units (the flux vector is normalized to unity).
To characterize the coexistence of such widely different flux values, we plot the flux distribution for active (non-zero flux) reactions of E. coli grown in a glutamate- or succinate-rich substrate (Fig. 1a). The distribution is best-fitted with a power law with a small flux constant, indicating that the probability that a reaction has flux follows P()( + 0)-, where the constant is 0 = 0.0003 and the flux exponent has the value = 1.5. The observed power law is also consistent with published experimental data. Indeed, the flux distribution obtained from the measured fluxes of the central metabolism of E. coli15 is best-fitted with the power-law form (Fig. 1d). As the central metabolism is characterized by high fluxes, the small-flux saturation seen in Fig. 1a is absent from these data.
Figure 1 Characterizing the overall flux organization of the E. coli metabolic network. Full legend
High resolution image and legend (82k)
To examine whether the observed flux distribution is independent of the environmental conditions, we mimicked the influence of various growth conditions by randomly choosing 10%, 50% or 80% of the 96 potential substrates that E. coli can consume (the input substrates are listed in Supplementary Table S2). After optimizing the growth rate, we find that the power-law distribution of metabolic fluxes is independent of the external conditions (Fig. 1b). As the metabolic activity of E. coli frequently deviates from the optimal growth state under variable growth conditions10, 12, we inspected whether the wide flux distribution is also present in non-optimal states. For this, we implemented a "hit-and-run" method16, 17 that randomly samples the full solution space, enabling us to calculate the flux for each reaction in 50,000 distinct non-optimal states.
Although the obtained average flux distribution is consistent in shape and flux ranges with those obtained by the optimal FBA, the flux exponent is slightly larger ( = 2, Fig. 1c), and the quality of the scaling is slightly weaker. Notably, many individual non-optimal states (Fig. 1c, inset) are consistent with an exponent = 1, in accord with the experimental results (Fig. 1d) and supporting the prediction10, 12 that these organisms may not have achieved optimality. These findings imply that the observed flux distribution is a generic feature of flux conservation18 on a scale-free network19, as it is independent of the optimal or non-optimal nature of the growth rate or the growth conditions. The exponent, however, may depend on the organism's position in the solution space.
An exponentially decaying distribution (for example, a gaussian) would predict that under a given condition most reactions are characterized by comparable fluxes. By contrast, the identified power-law flux distribution suggests a highly uneven use: most reactions have small fluxes and coexist with a few reactions with very high flux values. Hence, the biochemical activity of the metabolism is dominated by several 'hot' reactions, which are embedded in a network of mostly small-flux reactions. The fact that the observed flux exponent is less or equal to 2 implies that the first and higher moments of the flux distribution are not defined from a mathematical perspective (they diverge for an infinite system). The divergence of the average flux (first moment) indicates that no flux value can be designated as characteristic for the system. Although can be calculated numerically by averaging over the flux of all reactions, with g = 0.0171 and s = 0.0173 for glutamate and succinate uptakes, respectively, these values are by no means indicative of the overall activity of the metabolic reactions. Indeed, most reactions (86% for glutamate and 89% for succinate uptake) have a smaller flux than this average, and a few reactions have an activity that is orders of magnitude higher.
The observed flux distribution is compatible with two different potential local flux structures. A homogeneous local organization would imply that all reactions producing (consuming) a given metabolite have comparable fluxes; however, a more delocalized 'hot backbone' is expected if the local flux organization is heterogeneous, such that each metabolite has a dominant source (consuming) reaction (Supplementary Fig. S7). To distinguish between these two schemes for each metabolite i produced (consumed) by k reactions, we define the measure20
where ^ij is the mass carried by reaction j which produces (consumes) metabolite i. If all reactions producing (consuming) metabolite i have comparable ^ij values, Y(k,i) scales as 1/k. If, however, the activity of a single reaction dominates equation (1), we expect Y(k,i)1, in other words, Y(k,i) is independent of k. For the E. coli metabolism optimized for succinate and glutamate uptake, we find that both the 'in' and 'out' degrees follow Y(k,i)k-0.27 (Fig. 2a), representing an intermediate behaviour between the two extreme cases. This indicates that the large-scale inhomogeneity observed in the overall flux distribution is also increasingly valid at the level of the individual metabolites: the more reactions that consume (produce) a given metabolite, the more likely it is that a single reaction carries most of the flux.
Figure 2 Characterizing the local inhomogeneity of the metabolic flux distribution. Full legend
High resolution image and legend (110k)
Such inhomogeneity is obvious, for example, for flavin adenine dinucleotide (FAD), whose production (consumption) is dominated by only one of the two (three) contributing reactions (Fig. 2a, inset), and the high-flux reactions are catalysed by succinate dehydrogenase complex, EC 1.3.5.1 (succinate dehydrogenase, EC 1.3.99.1). We find that Y(k,i) scales in a similar fashion when E. coli is grown in rich Luria-Bertani medium or non-optimized configurations, indicating that the local inhomogeneity is not a unique feature of the optimized state or a specific growth condition but represents a generic property of the local flux distribution (see Supplementary Information).
The local flux inhomogeneity indicates that for most metabolites we can identify a single reaction that dominates its production (consumption). This observation can be turned into a simple algorithm, which for each metabolite systematically removes all reactions but the one providing the largest incoming (outgoing) flux contribution. The algorithm uncovers the 'high-flux backbone' (HFB) of the metabolism, a distinct structure of linked reactions that forms a giant component21 with a star-like topology (Fig. 3 and Supplementary Fig. S12), including most metabolites produced under the given growth condition. Only a few pathways appear disconnected, indicating that although these pathways are part of the HFB, their end product is only the second-most important source for another HFB metabolite. Of note, groups of individual HFB reactions largely overlap with the traditional, biochemistry-based partitioning of cellular metabolism: for example, all metabolites of the citric acid cycle of E. coli are recovered, and so are a considerable fraction of other important pathways, such as those involved in histidine, murein and purine biosynthesis. The HFB captures the subset of reactions that dominate the activity of the metabolism. As such, it offers a complementary approach to elementary flux mode analyses22-24, which successfully capture the available modes of operation for smaller metabolic subnetworks.
Figure 3 High-flux backbone for FBA-optimized metabolic network of E. coli on a glutamate-rich substrate (see Supplementary Fig. S12b for succinate-rich substrate). Full legend
High resolution image and legend (223k)
As the flux of the individual metabolic reactions depends on the growth conditions, we need to inspect how sensitive the HFB is to changes in the environment. Unexpectedly, Fig. 2b, c, which records the relationship between the individual fluxes under succinate and glutamate uptake, indicates that only the reactions in the high-flux territory undergo noticeable flux changes, whereas the reactions in the intermediate- and low-flux regions remain virtually unaltered (a small shift can be observed, however, because there is a 41% increase in biomass production in glutamate-rich versus succinate-rich media). The observed flux changes correspond to two types of event. First, some pathways are turned off completely (type I reactions), having zero flux under one growth condition and high flux in the other. These reactions are shown as symbols along the horizontal and vertical axis in Fig. 2b. By contrast, other reactions remain active but show an orders-of-magnitude shift in flux under the two different growth conditions (type II reactions). Notably, with two exceptions these marked type II changes are limited to the HFB reactions. The same phenomenon is predicted when we inspect the transition from glucose to succinate uptake, or for transitions among various random uptake conditions (see Supplementary Information).
To test the generality of this finding, we mimicked the effect of various growth conditions by randomly choosing 50% of the potential input substrates, measuring in each input configuration the flux for each reaction. For each reaction, the average flux () and the standard deviation ( ) around this average were determined by averaging over 5,000 random input conditions. For small fluxes, all reactions closely follow a straight line (corresponding to ), supporting the above finding that the small fluxes remain essentially unaltered as the external conditions change (Fig. 2c). For the high-flux reactions, however, there are noticeable deviations from this line, indicating that there are considerable variations in flux from one external condition to the other (Fig. 2d, e). A closer inspection of the flux distribution shows that the reactions on the curve all have a clear unimodal flux distribution (Fig. 4a, c), indicating that shifts in growth conditions lead to only small changes (within ) of their flux values. By contrast, the reactions deviating from the curve have a bi- or trimodal distribution, indicating that under different growth conditions they show several discrete and distinct flux values (Fig. 4b, d).
Figure 4 Effect of growth conditions on individual fluxes. Full legend
High resolution image and legend (60k)
Therefore, Figs 2c–e, 3 and 4 offer valuable insights into how E. coli responds to changes in growth conditions: it activates or deactivates specific metabolic reactions among the HFB metabolites in new ways without altering the identity of the principal pathways that participate in the backbone. For example, switching from a glutamate to a succinate substrate turns off vitamin B6 biosynthesis (type I change), reconnects in other ways several metabolic substrates, such as those of the coenzyme A and NAD biosynthesis pathways as well as the respiration pathways, and modifies the use of the citric acid cycle (Supplementary Fig. S12). Apart from minor changes, the use of the other pathways remains unaltered. These reorganizations result in large, discrete changes in the fluxes of the HFB reactions.
The power-law distribution of metabolic fluxes in the E. coli metabolism indicates a highly uneven use of the underlying metabolic network topology. Although wide flux differences among various pathways are known from individual experiments15, 25-27, we find that they are part of a scale-invariant continuum, which follows a scaling law. The uneven flux use is present both at the global level (Fig. 1) and at the level of the individual metabolites (Fig. 2a). This observation allows us to uncover automatically the HFB of the metabolism and could provide insights into metabolic organization and regulation as well as offering valuable inputs for metabolic engineering.
The observation and theoretical prediction of a power-law load distribution in simple models (ref. 18 and Supplementary Information), coupled with the presence of a power law in both the optimal and non-optimal flux states, suggests that the metabolic flux organization is a direct consequence of the scale-free topology of the network19. As all organisms examined so far are characterized by a scale-free metabolic network topology1, the observed scaling in the flux distribution is probably not limited to E. coli, but characterizes all organisms from eukaryotes to archaea. As FBA is available for an increasing number of prokaryotic and eukaryotic organisms, this prediction could be verified both experimentally and theoretically in the near future. Hence, the observed uneven local and global flux distribution seems to be rooted in the subtle, yet generic, interplay of the network's directed topology and flux balance, channelling the numerous small fluxes into high-flux pathways. The dependence of the scaling exponents characterizing the flux distributions on the nature of the optimization process, as well as the experimentally observed exponent, may be a benchmark for future structural and evolutionary models aiming to explain the origin, the organization and the modular structure3, 4, 28, 29 of cellular metabolism.
Methods
Flux balance analysis Starting from a stoichiometric matrix of the MG1655 strain8, 9 of E. coli, containing 537 metabolites and 739 reactions, the steady-state concentrations of all the metabolites satisfy:
where Sij is the stoichiometric coefficient of metabolite Ai in reaction j and j is the flux of reaction j. We use the convention that if metabolite Ai is a substrate (product) in reaction j, then Sij < 0 (Sij > 0), and we constrain all fluxes to be positive by dividing each reversible reaction into two 'forward' reactions with positive fluxes. Any vector of positive fluxes {j} that satisfies equation (2) corresponds to a state of the metabolic network and, hence, a potential state of operation of the cell. We restrict our study to the subspace of solutions for which all components of satisfy the constraint j > 0 (ref. 8). We denote the mass carried by reaction j producing (consuming) metabolite i by:
Random uptake conditions We choose randomly X% (where X = 10, 50 or 80) of the 89 potential input substrates that E. coli consumes in addition to the minimal uptake basis. For each of the transport reactions, we set the uptake rate to 20 mmol per gram of dry weight per hour. As there are a very large number of possible combinations of the selected input substrates, we repeat this process 5,000 times and average over each realization.
The hit-and-run method We select a set of basis vectors spanning the solution space using singular-value decomposition. Because the reaction fluxes must be positive, the 'bouncer' is constrained to the part of the solution space that intersects the positive orthant. We constrain the bouncer within a hypersphere of radius Rmax and outside a hypersphere of radius Rmin < Rmax, where we find that the sampling results are independent of the choices of Rmin and Rmax. Starting from a random initial point inside the positive flux cone in a randomly chosen direction, the bouncer travels deterministically a distance d between sample points. Each sample point, corresponding to a solution vector where the components are the individual fluxes, is normalized by projection onto the unit sphere. After every bth bounce off the internal walls of the flux cone, the direction of the bouncer is randomized.
High-flux backbone For each metabolite we keep only the reactions with the largest flux that produces and consumes the metabolite. Metabolites that are not produced (consumed) are discounted. Subsequently, a directed link is introduced between two metabolites A and B if (1) A is a substrate of the most active reaction producing B, and (2) B is a product of the maximal reaction consuming A. We consider only metabolites that are connected to at least one other metabolite after steps (1) and (2). For clarity, we remove Pi, PPi and ADP. Further details and figures are provided in the Supplementary Information.
Supplementary information accompanies this paper.
Received 29 August 2003;accepted 12 December 2003