ABSTRACT
Understanding the dynamics of biological systems in evolving environments is a challenge due to their scale and complexity. Here, we present a computational framework for the timescale decomposition of biochemical reaction networks to distill essential patterns from their intricate dynamics. This approach identifies timescale hierarchies, concentration pools, and coherent structures from time-series data, providing a system-level description of reaction networks at physiologically important timescales. We apply this technique to kinetic models of hypothetical and biological pathways, validating it by reproducing analytically characterized or previously known concentration pools of these pathways. Moreover, by analyzing the timescale hierarchy of the glycolytic pathway, we elucidate the connections between the stoichiometric and dissipative structures of reaction networks and the temporal organization of coherent structures. Specifically, we show that glycolysis is a cofactor-driven pathway, the slowest dynamics of which are described by a balance between high-energy phosphate bond and redox trafficking. Overall, this approach provides more biologically interpretable characterizations of network dynamics than large-scale kinetic models, thus facilitating model reduction and personalized medicine applications.
IMPORTANCE
Complex interactions within interconnected biochemical reaction networks enable cellular responses to a wide range of unpredictable environmental perturbations. Understanding how biological functions arise from these intricate interactions has been a long-standing problem in biology. Here, we introduce a computational approach to dissect complex biological systems' dynamics in evolving environments. This approach characterizes the timescale hierarchies of complex reaction networks, offering a system-level understanding at physiologically relevant timescales. Analyzing various hypothetical and biological pathways, we show how stoichiometric properties shape the way energy is dissipated throughout reaction networks. Notably, we establish that glycolysis operates as a cofactor-driven pathway, where the slowest dynamics are governed by a balance between high-energy phosphate bonds and redox trafficking. This approach enhances our understanding of network dynamics and facilitates the development of reduced-order kinetic models with biologically interpretable components.
KEYWORDS: kinetic models, timescale decomposition, dynamic-mode decomposition, data-driven approach, coherent structures
INTRODUCTION
Growth and adaptation are hallmarks of all living systems. They are inherently dynamic processes that are orchestrated by interacting networks of biochemical reactions. Understanding the dynamics underpinning the intricate behavior of biological systems has been a major challenge of systems biology. Kinetic models based on detailed enzymatic rate laws can capture the complex interactions and dynamics of biological systems (1, 2). However, the application of these models to large-scale networks is hindered by numerical challenges and their limited interpretability (3).
Biochemical reaction networks have complex structures. Yet, they often exhibit multi-scale and comparatively simple dynamics due to the presence of timescale hierarchies (4). Separation of timescales is believed to be an essential feature of highly evolved reaction networks, conferring stability and robustness (4, 5). It can lead to the modularization of network dynamics and emergence of “independent” functional units, both of which underlie the robustness and evolvability of living organisms (5, 6). Although our knowledge of the links between biological complexity and functional modularization is limited, we can gain a deeper understanding of the structural evolution and organization of biochemical reaction networks through systematic studies of their timescale hierarchies.
Timescale decomposition is a common approach for analyzing the dynamics of complex systems. Several techniques have been developed for biochemical reaction networks, most of which fall into two main categories: top-down and bottom-up approaches. The first leverages statistical methods and clustering algorithms to identify collections of metabolites with correlated concentration trajectories—referred to as concentration pools—from experimentally measured time-series data (7, 8). These techniques do not rely on evolution equations or kinetic rate laws, although they can use numerically generated time-series data furnished by mass-balance equations. They can handle large-scale systems of varying complexity but do not mechanistically relate concentration pools to the stoichiometric and dissipative structures of reaction networks. The second approach determines timescale hierarchies from steady-state eigenvalues (9, 10). It hinges on precise formulations of all reaction rates, providing a mechanistic association between concentration pools and structural properties of reaction networks. Although computations are tractable for large-scale networks, inaccuracies may arise for networks with highly nonlinear rate laws where Jacobian spectra are time-dependent.
Dimensionality reduction is a key step of timescale decomposition. Several classes of model reduction techniques have been previously developed to reduce the dimensionality of complex biochemical reaction networks (11). Trajectory-based techniques (12) describe the dynamics of the system without relying on any assumptions about timescales, making them well-suited for systems with complex time-dependent patterns. However, these techniques can be computationally intensive and sensitive to noise in the data, requiring substantial amount of data to accurately capture system dynamics (13). Singular perturbation techniques (14, 15) effectively reduce the dimensionality by isolating fast and slow dynamics, making them suitable for systems with disparate timescales. However, they are challenging to automate, requiring the functional form of rate laws and kinetic parameters. Lumping techniques (16, 17) offer simplicity and efficiency in model reduction, but how well the reduced model preserves the dynamic characteristics of the original system depends on an ad hoc choice of the lumping function.
In this paper, we develop a computational framework for the timescale decomposition of biochemical reaction networks (Fig. 1), which leverages the strengths of both top-down and bottom-up techniques. This approach, termed dynamic mode analysis (DMA), determines the timescale hierarchy of a reaction network from experimentally measured or numerically generated time-series data. It can identify concentration pools for complex networks with uncharacterized rate laws as reliably as top-down techniques. It can also provide a mechanistic description of concentration pools and their organization with respect to the energetics and stoichiometry of reaction networks in the same way as bottom-up techniques. A key component of DMA is an extension of dynamic mode decomposition (DMD) (18 – 20)—a technique originally developed to characterize coherent structures arising in fluid flows—that we introduce to identify the dominant exponential decay modes associated with each timescale. We study the timescale hierarchies of hypothetical and biological pathways using DMA, showing that this approach can reproduce the previously characterized concentration pools of these pathways. We also establish a connection between the time-delayed autocorrelation matrix—a statistical descriptor used in top-down techniques (8)—and Jacobian spectra, demonstrating why this descriptor is a useful metric for concentration-pool classifiers.
Fig 1.
Overview of DMA. (A) Steady states of biochemical reaction networks are perturbed by introducing concentration or flux disturbances. (B) Concentration trajectories are constructed by integrating transient mass-balance equations. (C) For a general nonlinear system, dynamic modes are ascertained by linearizing mass-balance equations in a sliding time window, resulting in a local inhomogeneous linear system. (D) Time-series data are generated by evaluating local concentration deviations at equally spaced time points in the interval that spans the time window. (E) An ODMD developed in this work identifies dominant exponential decay modes in the time window from time-series data. (F) The pooling matrix , defined as the Moore-Penrose inverse (21) of the modal matrix , is determined from dominant decay modes. (G) Disparate timescales are identified from dominant eigenvalues. (H) Biologically interpretable pools are constructed from the pooling matrix to modularize network dynamics.
RESULTS
We illustrate the application of DMA by analyzing the timescale hierarchies of chemical reaction networks in three case studies. The first two are hypothetical pathways that possess key characteristics of typical biological pathways. The last is glycolysis—a well-studied biological pathway, the concentration pools of which were previously characterized (9, 22). We examine this case study to validate our computational framework by reproducing some of the known concentration pools of glycolysis and their respective timescales and to gain insight into its higher-level organization. Three important concepts we encounter in this section are concentration pools, coherent structures, and transitory regimes between two consecutive timescales, which are all defined in Materials and Methods (see “Concentration pools and coherent structures”).
Toy Model 1
The first case study is Toy Model 1, which is the same pathway examined in Materials and Methods (see “Concentration pools and coherent structures” and Fig. S1 and S2). It converts a substrate (metabolite 1) into a product (metabolite 2) without energy coupling. The substrate and product are energetically equivalent, and all the equilibrium constants are of the same order of magnitude ( for ); therefore, the flux is driven through the pathway by maintaining a gradient between the extracellular concentrations and (Fig. 2A). The boundary reactions (reactions 1 and 5) are the rate-limiting steps, and the rate constants of consecutive intracellular reactions are separated by two orders of magnitude (Fig. S1C).
Fig 2.
Dynamic mode analysis of Toy Model 1 described in Fig. S1. (A) Visualization of the steady-state solution in a network map. Red arrows indicate metabolites with perturbed concentrations. (B) Dynamic response to concentration perturbations. Deviations from the steady-state and dimensionality are plotted along the dynamic trajectory. Dashed lines indicate a transition between two time periods with distinct exponential decay modes, partitioning the overall relaxation time into seven characteristic time intervals. (C) Optimal amplitudes, dominant eigenvalues, and pooling matrix for each time interval identified in panel B. Black arrows in pool structures indicate parts of the network that are affected the most by the initial perturbation in the corresponding time interval. Gradient and uniformly colored envelopes enclose metabolites, the concentration trajectories of which are negatively and positively correlated in the corresponding time interval, respectively.
Toy Model 1 has a timescale hierarchy due to the separation of rate constants. We identified its timescales and their respective concentration pools using DMA (Fig. 2). We induced a dynamic response by perturbing and , which activated all three timescales associated with the rate constants of the intracellular reactions (Fig. 2A and B). The successive equilibration of the intracellular flux disturbances partitioned the total relaxation time into seven distinct time intervals, corresponding to the three main timescales and their transitory counterparts we highlighted previously (Fig. 2B and C). Of particular interest are the slowest transients near the steady state. Here, all the intracellular metabolites pool together into a single aggregate metabolite with coefficients , the concentration of which is controlled by the boundary reactions. Overall, the timescales and pooling matrices furnished by DMA for each time interval agreed well with our estimates based on the approximate analytical method (“Concentration pools and coherent structures”; Fig. 2C).
Furthermore, DMA could accurately characterize the transitory regime between two consecutive timescales (“Concentration pools and coherent structures”), identifying the disequilibrium and conservation stages associated with the relaxation of each reaction (e.g., see the coefficients of and in the first and second pooling maps of interval 2, characterizing the relaxation of reaction 2). These general characteristics can help understand the dynamic responses of more complex reaction networks.
Toy Model 2
The second case study, referred to as Toy Model 2, involves the same pathway as in Toy Model 1, but with energy coupling (Fig. S3). It converts a high-energy substrate (metabolite 1) into a low-energy product (metabolite 2). The released energy is then utilized to convert a low-energy cofactor (metabolite 6) into its high-energy counterpart (metabolite 5). The high-energy cofactor drives an uphill step at the beginning (reaction 2) and is recovered in a downhill step at the end (reaction 4). Thus, it serves a similar metabolic function to glycolysis. The cofactors are exchanged with the extracellular environment depending on their concentration gradient across the membrane, and so are the substrate and product. As in Toy Model 1, reactions 1 and 5 are the rate-limiting steps. However, the boundary reactions for the cofactors (reactions 6 and 7) have the largest rates. Unlike Toy Model 1, the mass-balance equations for this model are nonlinear due to the bilinearity of the mass-action rates for the cofactor-coupled reactions. In this case study, an interplay between the rate-limiting steps, cofactor exchange reactions, and cofactor-coupled intracellular reactions shapes the structure of concentration pools.
We analyzed the timescale hierarchy of Toy Model 2 by examining its dynamic response to concentration perturbations (Fig. 3A). During the relaxation of the ensuing concentration and flux disturbances, DMA identified eight time intervals associated with distinct timescales and concentration pools (Fig. 3B and C). In this model, besides reaction rates, energy coupling also influences the chronology of reactions by linking the equilibration of the cofactor-driven steps. Here, the disequilibrium stages of reactions 2 and 4 occur on the ~1- to 10-s timescale, and their conservation stages occur on the ~10- to 30-s timescale. Once the concentration and flux disturbances of these reactions have relaxed, and form two coherent structures with and , respectively, giving rise to two aggregate variables and . Next, the disequilibrium stage of reaction 3 occurs on the ~100–1,000 s timescale followed by a conservation stage that persists until the steady state has been reached. During this conservation stage, and pool together, forming a larger coherent structure. Accordingly, the slowest transients are characterized by pooling together into a single aggregate metabolite with coefficients . As the steady state is approached, the concentration of this aggregate metabolite is controlled by reactions 1 and 5.
Fig 3.
DMA of Toy Model 2 described in Fig. S3. (A) Visualization of the steady-state solution in a network map. Red arrows indicate metabolites with perturbed concentrations. (B) Dynamic response to concentration perturbations. Deviations from the steady-state and dimensionality are plotted along the dynamic trajectory. Dashed lines indicate a transition between two time periods with distinct exponential decay modes, partitioning the overall relaxation time into eight characteristic time intervals. (C) Optimal amplitudes, dominant eigenvalues, and pooling matrix for each time interval identified in panel B. Black arrows in pool structures indicate parts of the network that are affected the most by the initial perturbation in the corresponding time interval. Gradient and uniformly colored envelopes enclose metabolites, the concentration trajectories of which are negatively and positively correlated in the corresponding time interval, respectively.
Finally, we highlight the important role of cofactors in the dynamics of pathways with energy coupling. In this case study, because the cofactor exchange reactions are the fastest in the network, they control the short-term responses ( s), only influencing the disequilibrium and conservation stages of reactions 2 and 4. Upon concentration perturbation, and become negatively correlated, forming a coherent structure almost immediately ( s). As mentioned above, the long-term responses of Toy Model 2 are mostly controlled by reactions 1 and 5 near the steady state with minimal effect from the cofactors.
Glycolysis
Glycolysis is a central energy-conversion pathway in biology (23). It converts a high-energy substrate (glucose) into low-energy products (pyruvate and lactate). The energy released in this process is then used to produce high-energy cofactors (ATP and NADH) (see “Energetics of glycolysis” for details). To generate a sufficient amount of high-energy phosphate bonds, it also imports inorganic phosphate (pi). A key step of glycolysis is catalyzed by fructose-bisphosphate aldolase, splitting fructose 1,6-bisphosphate into the triose phosphates dihydroxyacetone phosphate and glyceraldehyde 3-phosphate. This step makes the energy stored in the chemical bonds of the pentose ring accessible for energy conversion in downstream reactions.
In the third case study, we analyzed the timescale hierarchy of the glycolytic pathway in human red blood cells (Fig. 4). We solved a mass-action kinetic model of glycolysis using the MASSpy package (model parameters are provided in Data S1) (24) and computed its dynamic response to a perturbation in the ATP load (Fig. 4A). Several reactions in upper and lower glycolysis are coupled to ATP hydrolysis, so their relaxations are tightly linked together. These energy-coupling mechanisms impart an autocatalytic structure to the glycolytic pathway, a characteristic of which is oscillatory dynamics (25). Interestingly, during the relaxation of the load perturbation, concentration trajectories exhibit oscillatory dynamics on timescales ranging from minutes to a day, coinciding with the average erythrocyte circulation time and circadian period, respectively (26). We analyzed the concentration trajectories using DMA and identified 11 time intervals with distinct timescales and concentration pools (seven of which are highlighted in Fig. 4B and C). In the following, we highlight a few coherent structures associated with these timescales that were characterized previously. A complete list of the dominant eigenvalues and their respective pooling matrices for all time intervals is provided in Data S2.
Fig 4.
DMA of the glycolytic pathway. (A) A network map, highlighting the energetics of substrate, products, and cofactors. The flux of the reaction indicated by the red arrow is perturbed. The retro-aldol cleavage of fructose 1,6-bisphosphate is a key step, making the energy stored in the chemical bonds of the pentose ring accessible for the production of high-energy cofactors in downstream reactions. (B) Dynamic response to flux perturbations. Deviations from the steady-state and dimensionality are plotted along the dynamic trajectory. Dashed and dotted lines indicate a transition between two time periods with distinct exponential decay modes, partitioning the overall relaxation time into seven characteristic time intervals. Dimensionality changes across dashed lines but remains the same across dotted lines. (C) Dominant eigenvalues and pooling matrix for each time interval identified in panel B. Gradient and uniformly colored envelopes in pool structures enclose metabolites, the concentration trajectories of which are negatively and positively correlated in the corresponding time interval, respectively.
The first coherent structure in this case study arises from the relaxation of phosphoglycerate mutase (PGM) on the ~0.1–100 s timescale, where the concentrations of 3-phosphoglycerate (3pg) and 2-phosphoglycerate (2pg) become positively correlated. Next, the flux and concentration disturbances of phosphoglucoisomerase and PGM fully relax on the ~1- to 30-min timescale. Here, the concentrations of glucose 6-phosphate and fructose 6-phosphate become positively correlated, forming the well-known hexose phosphate coherent structure (22). On this timescale, the concentration of 1,3-bisphosphoglycerate also becomes correlated with those of 3pg and 2pg, leading to a larger phosphoglycerate coherent structure. On the ~10-hr timescale, the dynamics of glyceraldehyde phosphate dehydrogenase relax, resulting in glyceraldehyde 3-phosphate merging with the phosphoglycerate coherent structure. All these coherent structures and their respective timescales are consistent with previous studies of the glycolytic dynamics using bottom-up approaches (22).
In general, coherent structures associated with slow timescales are more physiologically relevant than those forming on fast timescales (27). Thus, we studied the two slowest pools of the glycolytic pathway forming in the last two of the 11 time intervals that DMA identified (Fig. 4B, intervals 6 and 7), examining their respective time evolutions and (Fig. 5). Here, and are the pools associated the second slowest and slowest timescales, where denotes the representation of pool with respect to concentrations rather than concentration deviations. We found that a balance between high-energy phosphate bond and redox trafficking shapes the interconnected dynamics of upper and lower glycolysis (Fig. 5), highlighting the coupling role of the cofactors.
Fig 5.
Dynamic trajectories of the slowest pools of glycolysis. The characteristic time intervals are identical to those in Fig. 4B. Note that pools and are defined with respect to concentrations rather than concentration deviations . Both pools are dominant in interval 6, and is the dominant pool in interval 7. The coherent structures of form near the end of interval 6, while those of form in interval 7. The dynamics of these pools are driven by a balance between high-energy phosphate bond and redox trafficking. The circle size for each metabolite or aggregate metabolite on the right panel is proportional to its coefficient in the respective pooling map.
Finally, we note that the magnitude of the coefficients of the cofactors (AMP, ADP, ATP, and inorganic phosphate) in the pooling matrix is the largest across all timescales, implying that the glycolytic dynamics are mostly determined by the energetics of cofactor interconversions. Importantly, ATP and inorganic phosphate are the high-energy cofactors that control the dynamics on the circulation ( min) and circadian ( hr) timescales, respectively.
DISCUSSION
Living systems grow and evolve in constantly changing environments, adapting to external fluctuations through interconnected networks of chemical transformations. Understanding how these dynamics emerge from the underlying molecular processes is a major challenge of systems biology. While whole-cell kinetic models offer a thorough description of the intricate interactions within biological networks, their sheer scale often obscures the fundamental patterns that govern the system-level behavior. Using timescale decomposition techniques, we can overcome this limitation by simplifying the complex kinetics into a coarse-grained model with biologically interpretable components. This approach allows us to glean meaningful insights from the model and identify critical components that drive the dynamics at the system level.
In this paper, we introduced DMA—a data-driven approach for timescale decomposition of chemical reaction networks. This approach characterizes concentration pools and coherent structures emerging from network dynamics using time-series data. A key component of our computational framework is an extended version of optimal DMD (ODMD) (18 – 20), which we developed to identify characteristic timescales of reaction networks. We showed that the dominant eigenvalues and eigenmodes furnished by DMA in any time interval along with dynamic trajectories are determined by two important statistical descriptors, namely, the time-delayed autocorrelation and covariance. Interestingly, the former was a basis of previous top-down approaches for identification of coherent structures in reaction networks (7). Given that timescale hierarchies often underlie the formation of dynamical patterns that lead to dimensionality reduction, our analysis provides a theoretical basis for why time-delayed autocorrelation is an effective metric for statistical analyses of coherent structures.
DMA is a trajectory-based technique that utilizes standard matrix decomposition, eliminating the need to numerically solve nonlinear optimization problems. Thus, it is robust and applicable to large-scale kinetic models. It identifies coherent structures such that they align with a specific exponential decay modes and reflect correlations among concentration trajectories, linking them to the dissipative structure of the reaction system. Therefore, unlike other trajectory-based techniques, how energy is dissipated throughout reaction networks is inferable from the dynamics of coherent structures. Nevertheless, the results can still be affected by noise, especially in transitory regimes where DMA cannot fully decompose the dominant decay modes if timescales are not sufficiently separated. To alleviate the effect of noise, various preprocessing techniques can be applied to the data matrices, such as sparsity-promoting low-rank decomposition, low-rank projection (solving a generalized form of equation 32), and debiased DMD (13).
We validated the results of DMA for a hypothetical pathway with analytically characterized concentration pools. We also reproduced some of the concentration pools and coherent structures of the glycolytic pathway that were determined previously using bottom-up approaches (22), further confirming the validity of our approach.
Overall, the outcomes of timescale decomposition are more amenable to biological interpretation than the solutions of mass-balance equations. For example, our analysis of the glycolytic pathway in human red blood cells indicated two physiologically relevant characteristics: (i) glycolysis is mostly a cofactor-driven pathway, the dynamics of which are controlled by ATP and inorganic phosphate on the circulation and circadian timescales, respectively, and (ii) the slowest dynamics are dominated by a balance between high-energy phosphate bond and redox trafficking. By chronologically organizing major events along with time evolutions, timescale decomposition could provide physiological insights into the dynamics of more complex biological pathways.
More broadly, analyzing the dynamics of whole-cell networks, such as that of human red blood cells, using the techniques developed here can elucidate important cellular functions associated with coherent structures forming on separate timescales, allowing us to uncover the underlying mechanisms governing various physiological processes. At the fastest timescales, coherent structures reflect rapid chemical reactions that enable essential functions such as oxygen transport and gas exchange, while those forming on intermediate timescales reveal the coordination of major subsystems involved in energy and redox balance, such as glycolysis and the pentose phosphate pathway. At the slowest timescales, coherent structures can explain regulatory processes that maintain homeostasis and facilitate cellular adaptation. Therefore, understanding these hierarchical timescales in human red blood cells not only enhances our fundamental knowledge of cellular biology but also has implications for diseases such as anemia, malaria, and sickle cell disease, where disruptions in these timescales play a critical role.
Quantitative models have been a chief driver of progress in biology in recent years (28 – 30). Kinetic-based (1, 2, 31, 32) and constraint-based (33 – 38) approaches have been, and will likely continue to be, instrumental in these developments. Both approaches provide a mechanistic description of cellular functions, but each has its own limitations. Kinetic models are generally complex and require numerous parameters that are subject to large uncertainties, while constraint-based models are not suitable for predicting inherently dynamic phenotypes. The timescale decomposition technique presented in this study can bridge the gap between kinetic- and constraint-based modeling by furnishing a systematic framework for construction of coarse-grained kinetic models (39 – 41) based on intrinsic timescales of biochemical reaction networks. Such reduced-order models are more tractable than whole-cell models and can capture essential characteristics of biological systems at physiologically relevant timescales.
Personalized medicine is another area where our timescale decomposition technique can play a major role. For example, when individual variations of pathological features or risk for drug side effects manifest in cellular dynamics, studying the timescale hierarchies can help identify the underlying metabolic causes (26). Using kinetic constants as a proxy for individual genotypes in these cases, analyzing the patterns in timescales and coherent structures allows us to understand and classify disease phenotypes through the lens of timescale hierarchies.
MATERIALS AND METHODS
Concentration pools and coherent structures
We first introduce the concept of pools and coherent structures for a case study, providing a biological motivation for their definitions. We define these concepts formally in the general case in the next section.
Consider a reaction network involving a single linear pathway with four metabolites and five reactions. We refer to this reaction network as Toy Model 1 (Fig. S1A). The pathway imports metabolite 1 from and exports metabolite 4 into the extracellular environment. Here, and denote the extracellular concentration of metabolites 1 and 4, respectively. The rate constants of consecutive reactions in this pathway are separated by two orders of magnitude (Fig. S1C), so its dynamics are expected to unfold over separate timescales. Dynamic responses to perturbations of steady states are ascertained from transient mass-balance equations:
| (1) |
with the concentration vector, initial concentration perturbations, stoichiometric matrix, flux vector, and time. Upon perturbations, the dynamics of this system relax over four time intervals associated with three separate timescales (Fig. S1B, dashed lines). Because of the disparity of rate constants, reactions 1–4 reach their steady states consecutively on their respective timescales. As the dynamics of a given reaction relax, the concentrations of its substrates and products become correlated in the intervals between consecutive timescales. Consequently, trajectories move in a low-dimensional space, suggesting that the dynamics can be adequately described with respect to metabolite pools with correlated concentrations. This is a general behavior that most biochemical reaction networks exhibit, and it is the basis of the definition of concentration pools and coherent structures in this section.
To formalize the concept of concentration pools, it is more convenient to express concentrations and fluxes relative to their steady-state values. Concentration and flux deviations are defined as and , respectively. Accordingly, mass-balance equations can be expressed with respect to these deviation variables:
| (2) |
Assuming mass-action kinetics, reaction rates are expressed with respect to deviation variables as follows:
| (3a) |
| (3b) |
| (3c) |
| (3d) |
| (3e) |
The first pool of Toy Model 1 is associated with the first timescale with the order-of-magnitude operator. On this timescale, all flux deviations but are negligible. Therefore, , and the mass-balance equations describing the dynamics are as follows:
| (4a) |
| (4b) |
Eliminating and , equations 4a and b can be expressed with respect to as follows:
where , and is the initial condition for the flux deviation of reaction 2. From this analysis, we derive a more accurate approximation of the first timescale as . Substituting the solution of back in equations 4a and 4b, we obtain the trajectory of concentration deviations:
| (5) |
where is the eigenmode associated with the first pool in the primal space (see Fig. 1F) with and the integration constants of equations 4a and 4b. We observe from equations 4a and 4b that and are negatively correlated on this timescale. Once the dynamics of reaction 2 have relaxed, its flux deviation equilibrates so that
| (6) |
with the equilibrium constant of reaction 2 and the second timescale. Note that equilibration here refers to the relaxation of flux disturbances as , which is not the same as the equilibration of reaction 2 when .
If the relaxation time of reaction 2 is faster than the second timescale of this system, then there is an intermediate timescale that characterizes the transition between the first and second timescales during which reaction 2 equilibrates (see Fig. S1B). In this transition period, the flux deviations of reactions 2 and 3 are of the same order, so that . To quantify this transitory period, suppose that with a coefficient that characterizes the transition between the two timescales when . Accordingly, the mass-balance equations simplify to
| (7a) |
| (7b) |
| (7c) |
From the asymptotic relation between and stated in equation 6, we find that equations 7a–7c with approximate the dynamics in this transitory regime when the second timescale of the system is approached. Note that, in the transitory interval, the coefficient in these equations varies from a large value near the first timescale to its asymptotic value near the second timescale. However, we treat it as a constant to approximate local solutions of equations 7a–7c in the transitory regime. We can eliminate and in equations 7b and 7c in the same way as in equations 4a and 4b to express the mass-balance equations with respect to :
where and is the initial condition for the flux deviation of reaction 3. The timescale associated with this regime is estimated as , and the trajectory of concentration deviations is ascertained by integrating equations 7a–7c using the solution of :
| (8) |
where are integration constants. Note that (Fig. S1C) and in Toy Model 1, implying that and are positively correlated, and, together, they are negatively correlated with on this timescale. Finally, we highlight an important feature of dominant decay modes during the transition between two consecutive timescales in relation to the role of kinetic parameters. Consider the timescales and of Toy Model 1, for example. Here, the coefficients of and in the decay mode only depend on their stoichiometric coefficients in reaction 2. However, in the decay mode , they depend on both the stoichiometric coefficients and rate constants and .
Having qualitatively described how dynamic trajectories become correlated on the first timescale, we provide a quantitative representation of the first pool. In this work, we define a pool associated with a given timescale as a linear combination of concentrations, the time-dependent representation of which aligns with the corresponding exponential decay. If multiple dominant timescales coexist in a time interval, we require the corresponding pools to be dynamically independent. For example, in the transitory period outlined above where both exponential decay modes in equations 5 and 8 are dominant, the first pool and its transitory counterpart are defined as follows:
| (9a) |
| (9b) |
where
| (10a) |
| (10b) |
are normalized vector representations of the pools. On occasion, we also refer to these vector representations as pooling maps. Here, and are the residual terms approaching a steady-state value at a faster rate than the exponential decay term for each pool. Moreover, the timescale associated with each pool is defined as a time point beyond which the pool is sufficiently close to its steady-state value (Fig. S2A). For example, the timescales of the foregoing two pools are defined as follows:
| (11a) |
| (11b) |
Here, is a tolerance threshold controlling the closeness of pools to their steady-state values with and appropriate concentration scales. For example, for and , and are reasonable concentration scales.
As noted in previous works, a proper definition of pools should ensure that each pool is dynamically independent of other pools in a system when they form on the same timescale (22, 27). In top down approaches, such as principal component analysis or independent component analysis, orthogonality and independence are equivalent concepts (42). However, as discussed at the beginning of this section, a biological motivation for introducing the concept of concentration pools is identifying aggregate variables that naturally arise in systems with low-dimensional dynamics—a key characteristic of reaction networks with timescale hierarchies. Accordingly, alignment with exponential decay modes associated with separate timescales is considered a more appropriate notion of independence (18, 22). Since eigenmodes are not necessarily orthogonal for a general reaction network, dynamical independence does not imply the orthogonality of pools in the concentration space. In this work, we adopt the same notion of independence. However, in addition to alignment with exponential decay modes, we also require a reciprocal orthogonality between the vector representation of the pools and eigenmodes. Here, we present this reciprocal relationship for and in the transitory regime between the first two timescales of Toy Model 1, deferring the formulation for the general case to the next section. The reciprocal orthogonality conditions for the first pool and its transitory counterpart are as follows:
| (12a) |
| (12b) |
These conditions are motivated by their important geometric and physical interpretations, and they are linked to the concept of flux-concentration duality (43) in chemical reaction networks (see “Reciprocal orthogonality conditions for Toy Model 1” and Fig. 1F).
The transitory regime between two consecutive timescales generally have two stages. In the first, the concentrations evolve in a direction corresponding to a maximal energy dissipation of the reaction, where the substrate and product concentrations are negatively correlated. In the second, the concentrations evolve in a direction associated with the equilibration of the flux disturbance, where the substrate and product concentrations are positively correlated. We refer to the first as the “disequilibrium” and the second as the “conservation” stages of relaxation for each reaction.
Coherent structure is another concept that we examine in this paper. Its definition, which is derived from, but is more restrictive than that of concentration pools, centers around correlations among concentration trajectories. As we noted for Toy Model 1, the dominant eigenmodes on a given timescale span a reduced concentration space in which the dynamics unfold. Accordingly, metabolite coefficients in the pooling maps, which are in turn ascertained from the respective eigenmodes, determine metabolites the concentrations of which are affected on that timescale. The concentration trajectories of these metabolites remain correlated until the next timescale of the system has been reached. For a linear system, such as Toy Model 1, the eigenmodes and their respective pooling maps do not vary with time. Therefore, the correlation coefficients among metabolites that are present in a pool remain constant. However, in a general nonlinear system, the Jacobian matrix and its eigenmodes can vary along dynamic trajectories. Consequently, correlation coefficients also become time-dependent, but they can plateau in small intervals between the timescales of the system.
For a general nonlinear reaction system, we define a coherent structure as a subset of metabolites in a concentration pool, the correlation coefficients of which vary within a prescribed tolerance threshold in time intervals between the timescales of the system. To make this definition more precise, suppose that metabolites and in a reaction system are part of a pool that becomes active in the interval , where and are two consecutive timescales. Then, for these metabolites to form a coherent structure, their concentration trajectories must satisfy
| (13) |
where is a tolerance threshold controlling variations of correlation coefficients, and the correlation function is given by
| (14a) |
| (14b) |
| (14c) |
Here, and are the expectation and standard deviation of a time-dependent function . Moreover, the correlation function is evaluated in the interval in which the coherent structure forms, where . Note that superscripts of in equations 14a–14c are indices referring to time-interval bounds and should not be confused with exponents. For coherent structures with more than two metabolites, equation 13 must be satisfied for all pairwise correlations among the metabolites. For linear systems, such as Toy Model 1, concentration pools always form a coherent structure because the pooling maps and correlation coefficients are constant along dynamic trajectories. Also note that coherent structures can generally span several timescales.
Having discussed the concepts of concentration pools and coherent structures for the first timescale of Toy Model 1, we follow the same procedure to characterize concentration pools and coherent structures that form at slower timescales. The second pool arises when the dynamics of reaction 3 begin to relax. It is associated with the timescale with . On this timescale, we have , leading to the following expression for the second pool:
| (15) |
| (16) |
Here, denotes a residual term that approaches a steady-state value at a faster rate than the exponential decay term, so its leading-order term is . Another expression for the second pool can be derived by combining with pools that form on faster timescales than . For example, once the dynamics of reaction 2 have relaxed in the transitory interval , and become correlated through the relationship:
| (17) |
where is a residual, the leading-order term of which is . Substituting equation 17 in equation 15 results in the following:
| (18) |
| (19) |
A sum-total pool can also be constructed by adding the first two expressions in equations 15 and 18, resulting in
| (20) |
| (21) |
with the sum-total residual term. Note that the leading-order terms of , , and are . Thus, , , and can be regarded as three representations of the second pool since they all align with , exhibiting a similar asymptotic behavior for (Fig. S2B). Note that the pooling maps for all three representations are orthogonal to the second transitory mode that becomes active in the interval , so they satisfy the reciprocal orthogonality condition:
| (22) |
which is similar to the reciprocal orthogonality condition in equation 12a for the first timescale.
The third timescale is associated with the relaxation of reaction 4. The procedure for characterizing its concentration pool is similar to that outlined for the second timescale (see “Derivation of the third and fourth approximate eigenmodes of Toy Model 1”). We only highlight the representations of the third pooling map:
| (23a) |
| (23b) |
| (23c) |
| (23d) |
The corresponding pools , , , and exhibit a similar asymptotic behavior for (Fig. S2C). As with the second timescale, denotes the sum-total representation of the third pooling map.
The dynamics of reaction 4 relax fully once the initial perturbations have propagated throughout the network and reached the boundary reactions. Upon relaxation, all flux and concentration disturbances equilibrate toward a steady state on a transitory timescale . On this timescale, which we regard as the fourth timescale of Toy Model 1 (i.e., ), all the concentration deviations align with the slowest eigenmode, forming a concentration pool containing all the intracellular metabolites. Since this pool corresponds to the only dominant eigenmode for , there are no slower eigenmodes to evolve into; hence, it needs not satisfy any reciprocal orthogonality conditions. Accordingly, the concentration pool associated with the largest timescale is defined as a normalized form of the slowest eigenmode. This condition automatically arises from the general definition of the pooling matrix—a matrix containing all the dominant pooling maps on a given timescale—which will be formalized for a general nonlinear reaction system in the next section.
The fourth pool has the following representations (see “Derivation of the third and fourth approximate eigenmodes of Toy Model 1”):
| (24a) |
| (24b) |
| (24c) |
| (24d) |
| (24e) |
The asymptotic profiles of the respective pools , , , , and are similar for (Fig. S2D). As before, denotes a sum-total representation.
Lastly, we emphasize that the purpose of the analysis presented here and subsequent sections (see “Derivation of the third and fourth approximate eigenmodes of Toy Model 1”) is to elucidate the connections between concentration pools, timescales, and the equilibration of flux disturbances using approximate methods. This approach is justified since a fundamental characteristic of biochemical reaction networks underlying the separation of timescales is the presence of vastly disparate rate constants. The solutions provided here and in subsequent sections (see “Derivation of the third and fourth approximate eigenmodes of Toy Model 1”) are not exact, so the eigenvalues estimated here should be regarded as approximations of the exact eigenvalues . Since Toy Model 1 is a linear system, its eigenvalues remain constant along dynamic trajectories. Consequently, in any time interval, the exact solution of mass-balance equations can be expressed as a linear combination of exponential decay modes with varying amplitudes without needing transitory eigenvalues. However, the reason why we encountered the transitory eigenvalues here lies in our approximation method describing the kinetics on a given timescale by the flux of a single reaction, the rate constant of which gives rise to that timescale.
Dynamic mode analysis
DMA is a data-driven approach to identify timescale hierarchies of a chemical reaction network and characterize its dynamic responses to flux or concentration perturbations (Fig. 1). Specifically, this approach aims to compute concentration pools and coherent structures that emerge as a reaction network evolves in time from time-series data. Although data can be generated from experimental measurements or numerical simulations, we only focus on numerical solutions of kinetic models of biochemical reaction networks in this paper. We assume that the concentration trajectories of all the metabolites in the network of interest are given. The goal is then to identify the dominant eigenvalues, dominant eigenmodes, and the respective concentration pools algorithmically in any time interval along dynamic trajectories of the network.
The algorithm begins by solving the mass-balance equations (equation 1) for a reaction network with metabolites and reactions, the steady-state concentrations or fluxes of which is perturbed (Fig. 1A). The resulting concentration trajectory is then computed numerically (Fig. 1B). The goal in subsequent steps of DMA is to identify the dominant eigenvalues and eigenmodes in a sliding time window spanning the interval as it moves from to , where is the total relaxation time—a time by which all flux and concentration disturbances have relaxed to within a tolerable threshold. To analyze the dynamics in the sliding time window, the mass-balance equations are linearized locally around a reference time :
| (25) |
where
Here, is the local Jacobian matrix, the concentration vector at the reference time, and the local concentration deviations. Note that, for a general nonlinear reaction network, equation 25 is an inhomogeneous system of equations, the solutions of which cannot be expressed with respect to purely exponential decay terms (Fig. 1C). Thus, we recast it into the homogenous system:
| (26) |
by introducing a new variable , we term the local transformed concentration deviation, where captures the inhomogeneity of equation 25. The general solution of equation 26 is written (44) as follows:
| (27) |
where is the exponential map. Since we are only concerned with small perturbations of noncritical stable steady states, we assume that along all dynamic trajectories. Therefore, is always nonsingular, so is a well-defined quantity. Here, returns the real part of a complex argument, and is the eigenvalue vector of the Jacobian. The linearization of mass-balance equations introduced here allows the application of top-down timescale decomposition techniques (e.g., DMD) to identify timescale hierarchies from time-series data.
In the next step, time-series data are generated from the numerical solution of local concentration deviations in the current time window by evaluating at equally spaced time points in (Fig. 1D), compiling their values in two data matrices:
| (28a) |
| (28b) |
where is the vector of local concentration deviations evaluated at the th time point in with the gap between consecutive time points. The corresponding transformed data matrices and are similarly defined in terms of . From equation 27, we find the relationship between transformed concentration deviations at two consecutive time points:
| (29) |
Here, is an error vector associated with the linearization of mass-balance equations. It also accounts for the errors arising from numerical simulation (e.g., truncation errors) or experimental measurements, depending on how data are generated. Since is a constant matrix, we can write equation 29 in a matrix form:
| (30) |
where is a matrix of the same size as and , containing all the error vectors .
If mass-balance equations are known, or there are approximation methods to estimate from time-series data, then the dominant eigenvalues and eigenmodes of the reaction network in the current time window can be directly determined from and using top-down approaches, such as DMD (18) or ODMD (19, 20). In the following, we briefly outline the procedure for ODMD.
To determine the dynamic dimensionality of the system, the singular-value decomposition of is computed:
where is the number of singular values that are nonzero to within a tolerable threshold . The columns of are the proper orthogonal modes (20) of , and they span a reduced -dimensional subspace of the locally transformed concentration-deviation space where the dynamic trajectories lie in the current time window. We refer to this subspace as the space of transformed proper orthogonal modes. The projection of onto this reduced space is
| (31) |
The projected matrix in this equation can be proven to be an upper Hessenberg matrix (45). Because and are related through a similarity transformation, contains the dominant eigenvalues of in the current time window. Errors in equation 30 are then minimized by solving the optimization problem:
| (32) |
to identify a matrix that best describes the data in and , where denotes the Frobenius norm. Substituting from equation 31 in equation 32, the solution of the foregoing optimization problem is ascertained (20) as follows:
| (33) |
where the superscript denotes the Moore-Penrose generalized inverse (21). The eigenmodes of are the projections of the dominant eigenmodes of in the space of transformed proper orthogonal modes, and its eigenvalues are the dominant eigenvalues of .
In this paper, both and are treated as unknowns, so and are not provided at the outset. Therefore, in the following, we introduce an extension of ODMD (see Fig. S4) to determine the dominant eigenvalues and eigenmodes of the inhomogeneous system (equation 25) directly from the time-series data in and . We start by rewriting equations 29 and 30 in terms of local concentration deviations:
| (34) |
| (35) |
where is an all-one column vector with components. As previously stated, because of the inhomogeneous term in equation 34 or in equation 35, DMD or ODMD is not directly applicable, although we can show that in the limit (see “Inhomogeneity of evolution equations for local concentration deviations and temporal grid size”). Thus, the inhomogeneity can theoretically be eliminated from equation 34 by generating an arbitrarily fine temporal grid in the sliding time window. However, it is impractical to do so because of the computational costs and additional errors it introduces in subsequent steps of the algorithm. Following the same procedure as DMD, the singular-value decomposition of is computed:
As in the previous case, reflects the dynamic dimensionality of the system in the current time window, and the columns of are the proper orthogonal modes of , spanning the local concentration-deviation space. We refer to this subspace as the space of proper orthogonal modes. As before, is expressed with respect to the proper orthogonal modes through the similarity transformation:
| (36) |
The goal now is to find and such that the errors in equation 35 are minimized. The corresponding optimization problem is written as follows:
| (37) |
This is an unconstrained quadratic program, so its solution can be determined analytically (see “Relationship between time-delayed autocorrelation, covariance, and local Jacobian spectra” for the proof):
| (38a) |
| (38b) |
where
| (39a) |
| (39b) |
and
| (40) |
Note that the barred vectors and matrices denote the representation of their unbarred counterparts in the space of proper orthogonal modes. Accordingly,
| (41) |
Here, and are the average of concentration-deviation data in and , respectively. Moreover, and are the time-delayed autocorrelation and covariance matrices of the time-series data represented with respect to the proper orthogonal modes. Note that equation 38a may be regarded as a generalization of the DMD solution (equation 33) it approaches to as , which in turn occurs when the steady state has been attained. We also emphasize that metrics based on time-delayed autocorrelation were used in previous top-down approaches along with clustering algorithms to identify concentration pools irrespective of the underlying biochemical mechanisms (7, 8). However, as we highlighted in the previous section, what underlies the formation of concentration pools and coherent structures in biochemical reaction networks is the presence of timescale hierarchies. Furthermore, the optimal solution in equation 38a implies that the time-delayed autocorrelation matrix contains information about the local Jacobian spectra. Although the expressions used in previous studies are not identical to equation 38a, our analysis demonstrate why time-delayed autocorrelation is a useful metric for statistical classifications of concentration pools.
Once has been computed from equation 38a, the dominant eigenvalues and eigenmodes can readily be determined (Fig. 1E). Let be the eigenvalue decomposition of with a diagonal matrix, where the diagonal entries are the eigenvalues of (not to be confused with the approximate eigenvalues of Toy Model 1 in the previous section). Then, the columns of correspond to the dominant eigenmodes of in the current time window. Because and are related through a similarity transformation, is also the dominant eigenvalues of . From equation 29, it follows that
| (42) |
with the dominant eigenvalues of the local Jacobian matrix .
The next steps of DMA are similar to those of ODMD (20), but modifications are required due to the inhomogeneity of equation 34. Here, we note that the inhomogeneous term can be eliminated by introducing the differential time-series data matrix , so that
| (43) |
where is the th column of and . Since equation 43 is a homogenous system, we can follow the same procedure as ODMD to determine the optimal amplitudes associated with the dominant eigenvalues and eigenmodes ascertained in the previous step. Let
be the singular-value decomposition of and the Vandermonde matrix constructed from the eigenvalues . Then, the optimal amplitudes are given by (20)
| (44) |
where an overline denotes the complex conjugate of a matrix, the superscript denotes the complex-conjugate transpose, and indicates elementwise matrix multiplication. Accordingly, the modal matrix reads
| (45) |
the columns of which are the dominant eigenmodes of . Here, is a diagonal matrix with diagonal entries . Finally, we define the pooling matrix as
| (46) |
Note that the pooling maps defined in the previous section correspond to the rows of . Here, the definition equation 46 ensures that the modal and pooling matrices satisfy the reciprocal orthogonality conditions in all time intervals. It is also consistent with the relaxation of the slowest eigenmode discussed in the previous section. Recall, we defined the pooling map near the steady state corresponding to the slowest mode as a normalized form of the slowest eigenmode, which is compatible with equation 46 since the Moore-Penrose inverse of a vector yields the transpose of the same vector with a normalized length.
The analysis of concentration pools and coherent structures for a given reaction system using the approach introduced in this section depend on the initial conditions and how they are perturbed. Because the results may vary based on the specific perturbations applied, dependence on initial conditions could be considered a limitation. However, we note that these concentration pools and coherent structures emerge from the inherent timescales of the system’s dynamics. As a result, the characteristics of these structures should, in principle, remain consistent across different perturbations. Performing this analysis for a wide range of perturbations can furnish a more comprehensive and accurate description of concentration pools and coherent structures. Therefore, while the method may seem sensitive to initial conditions, it is not inherently limiting.
Reciprocal orthogonality conditions for Toy Model 1
The physical interpretation of the reciprocal orthogonality conditions equations 12a and 12b is connected to the dissipative structure of chemical reaction networks. Since the transitory timescale that we are concerned with bridges two dynamic regimes associated with a fast ( ) and slow ( ) timescale characterizing the relaxation of reactions 2 and 3, their respective local eigenmodes represent the equilibration direction of these reactions in the concentration space. In particular, the slow eigenmode , which is the mode that converges to when , aligns with equilibration directions of reaction 3:
| (47) |
where denotes an equilibration direction. Concentration vectors that are parallel to this direction leave reaction 3 at its steady state, while those perpendicular to this direction maximally change the flux of reaction 3. Therefore, by requiring to be orthogonal to in equation 12a, we ensure that, at any time along a dynamic trajectory, it points in a direction in the concentration space that maximizes the energy dissipation of reaction 3. In contrast, the fast eigenmode aligns with the disequilibrium direction of reaction 2 (see chapter 4 of Palsson and Abrams [27]) for the definition of disequilibrium pools], where and are negatively correlated. Thus, by requiring to be orthogonal to in equation 12b, we ensure that it points in a direction corresponding to the conservation of metabolites involved in reaction 2 (see chapter 4 of Palsson and Abrams [(27] for the definition of conservation pools), where and are positively correlated.
The reciprocal orthogonality conditions in equations 12a and 12b also have an important geometric interpretation. For a given time interval along a dynamic trajectory, dominant eigenmodes represent the principal directions in a part of the concentration space where the dynamics occur. Therefore, if the underlying reaction system in the time interval of interest is low-dimensional, the eigenmodes can be viewed as a natural basis spanning a reduced concentration space where dynamic trajectories lie. Regarding this reduced concentration space as primal space (see Fig. 1F), equations 12a and 12b ensure that the vector representation of pools are a natural basis of a corresponding dual space. From this standpoint, the pools and in Toy Model 1 are scalar quantities defined as the inner product of a vector in the primal with another vector in the dual space, the algebraic form of which remain invariant with respect to any linear coordinate transformation (46). This geometric interpretation is compatible with the physical interpretation of flux-concentration duality in dissipative systems (43). As mentioned above, the vector representations of pools are closely related to flux-disturbance relaxation or maximal energy dissipation of reactions, so they form a basis for a dual space in which to represent flux dynamics naturally. Similarly, eigenmodes constitute a basis for a primal space in which to represent concentration dynamics naturally.
Derivation of the third and fourth approximate eigenmodes of Toy Model 1
Here, we provide more details on the derivation of approximate expressions for the third and fourth eigenmodes of Toy Model 1 discussed in previous sections. Because the third eigenmode arises from the second transitory eigenmode in the interval , we may regard as the limit of when . The second transitory regime occurs in the interval when reaction 3 equilibrates. In this transition period, the flux deviations of reactions 3 and 4 are of the same order, so that and . As with the first transitory period, we consider a coefficient such that to quantify the transition between and with . Accordingly, the mass-balance equations simplify to
| (48a) |
| (48b) |
| (48c) |
| (48d) |
Eliminating and , equations 48a–48d is expressed with respect to as follows:
where , and is the initial condition for the flux deviation of reaction 4. Substituting the solution of back in equations 48a–48d, we arrive at
| (49) |
where are the integration constants. Note that the coefficient varies from a large value to its asymptotic value as . This asymptotic coefficient is in turn determined from the equilibrium of Reaction 3
| (50) |
with the equilibrium constant of reaction 3. It follows from equations 50, 48b, and 48c that
| (51) |
We now can express and as the limiting cases of and :
| (52) |
The third transitory regime occurs in the interval , where is a sufficiently large time—referred to as the total relaxation time—by which all flux and concentration disturbances have relaxed to within a tolerable threshold. This regime is characterized by , , and . We express the order-of-magnitude balance between the flux deviations of reactions 4 and 5 by a coefficient such that to describe the transitory regime between and with , simplifying the mass-balance equations to
| (53a) |
| (53b) |
| (53c) |
| (53d) |
From equation 53d, it follows that
where , and is the initial condition for the flux deviation of reaction 5. Substituting from this equation in equation 53a, we obtain the following solution for :
with the initial condition for the flux deviation of reaction 1. From this solution, it follows that the dynamics of in the third transitory regime are described by two exponential decay modes of the same order of magnitude (note that in Toy Model 1; see Fig. S1C). Our goal in the analysis of this transitory regime is to ascertain the slowest eigenmode of Toy Model 1 and describe the dynamics near its steady state. Therefore, we assume in the remainder of this section, so that the slowest timescale is characterized by . A similar analysis can be performed for the opposite case. Substituting the solutions of and in equations 53a–53d and neglecting the faster mode associated with , we obtain the dynamic trajectory of concentration deviations:
| (54) |
where . Since the flux deviations of reaction 1 becomes nonnegligible in this regime, the equilibration of reactions 2 and 3 is disturbed once again on the third transitory timescale . Therefore, the asymptotic coefficients and change during the secondary relaxation of reactions 2 and 3 as . We determine the asymptotic value of in the third transitory regime from equations 6, 53a, and 53b:
| (55) |
and of from equations 50, 53b, and 53c:
| (56) |
The coefficient varies from a large value to its asymptotic value as , which is ascertained from the equilibrium of reaction 4:
| (57) |
where is the equilibrium constant of reaction 4. From equations 57, 53c, and 53d, we find that
| (58) |
which is a quadratic polynomial to be solved with respect to . As with previous cases, and are the limiting cases of and , so that
| (59) |
Inhomogeneity of evolution equations for local concentration deviations and temporal grid size
We show that the discrete form of evolution equations for local concentration deviations (equation 34) becomes homogeneous asymptotically for vanishingly small temporal grid size, that is, as . We start from the definition of in equation 34:
Substituting the definitions of and from previous sections (see “Dynamic mode analysis”) in this equation, we have
| (60a) |
| (60b) |
| (60c) |
| (60d) |
In equation 60d, the expression enclosed in parentheses is a matrix with a nonzero norm. Moreover, the constant vector is generally nonzero in all sliding time windows, and it only approaches zero asymptotically as dynamic trajectories near a steady-state solution. It follows that the inhomogeneous term approaches zero asymptotically in time windows away from the steady state only if an arbitrarily small temporal grid size is chosen.
Relationship between time-delayed autocorrelation, covariance, and local Jacobian spectra
Here, we prove that and in equations 38a–38b are the solution of the optimization problem (equation 37). We begin by expanding the objective:
| (61) |
where , and the dots represent constant terms that do not affect the optimal solution of equation 37. As stated before, can be regarded as a transformation, mapping matrices and vectors from the space of proper orthogonal modes into the original concentration space, so we may write with . In the following, we further expand each term in equation 61 to derive expressions that can be readily recast into a standard-form quadratic program.
We start from the first term
| (62a) |
| (62b) |
| (62c) |
| (62d) |
| (62e) |
| (62f) |
where denotes a identity matrix, and vectorizes a matrix by stacking its columns on top of one another. Here, we used the cyclic property of the trace in equation 62a and applied the identity vec in equation 62c. Expanding the second term, we arrive at
| (63) |
The third term is written as follows:
| (64a) |
| (64b) |
| (64c) |
| (64d) |
| (64e) |
| (64f) |
where is the th column of . Again, we used the cyclic property of the trace in equation 64a and the identity vec in equation 64c. Expanding the fourth term, we have
| (65a) |
| (65b) |
| (65c) |
where
| (66) |
with denoting the th column of . The fifth term is expanded similarly:
| (67) |
where denotes the th column of , and .
We now can rewrite the optimization problem equation 37 as a quadratic program in standard form by substituting the five terms – from equations 62f, 63, 64f, 65c, 66, and 67 in equation 61:
| (68) |
Here,
| (69) |
with
| (70) |
, , and . The solution of the minimization problem (equation 68) is ascertained from the Karush-Kuhn-Tucker conditions (47):
| (71) |
To derive compact from expressions for and , the Hessian is to be inverted explicitly with respect to its elements. Since the Hessian in equation 69 is a block matrix, we derive its block inverse
| (72) |
using the Schur complement. With respect to the matrix blocks in equation 69, the Schur complement of is written as follows:
| (73) |
where is the covariance of time-series data defined in equation 39b. We can now express the individual blocks in equation 72 in terms of the Schur complement :
| (74a) |
| (74b) |
| (74c) |
| (74d) |
Accordingly, the two parts of the optimal solution are ascertained separately by plugging from equation 72 into equation 71:
| (75a) |
| (75b) |
Substituting the inverse blocks from equations 74a and 74b in equation 75a, we obtain the first part
| (76) |
with denoting the th component of . From equation 76, one can verify that the optimal solution in matrix form is
| (77) |
where
| (78) |
Since is symmetric and , the optimal solution given in equation 38a follows from equation 77. Similarly, the second part of the optimal solution is obtained by substituting the inverse blocks from equations 74c and 74d in equation 75b:
| (79) |
from which the optimal solution given in equation 38b is derived using the relationship .
Energetics of glycolysis
Overall, glycolysis breaks glucose down into pyruvate and lactate according to
Since glucose carries a higher chemical potential energy than pyruvate and lactate combined, as measured by their relative transformed Gibbs free energy of formation (48), the transformation is exergonic:
where is the standard transformed Gibbs free energy of reaction. This energy drives the synthesis of high-energy cofactors ( and ) from the low-energy counterparts ( , , and ) through interconversion reactions
Accordingly, and carry a higher chemical potential energy than and in an aqueous environment, respectively. In this sense, and are regarded as high-energy cofactors with , , and the corresponding low-energy counterparts.
Supplementary Material
ACKNOWLEDGMENTS
This work was funded by the Novo Nordisk Foundation (grant number NNF10CC1016517) and the National Institutes of Health (grant number GM057089).
Conceptualization: A.A. and B.O.P.; methodology: A.A.; validation: A.A.; formal analysis: A.A.; investigation: A.A. and Z.B.H.; writing – original draft: A.A.; writing – review & editing: A.A.: Z.B.H., and B.O.P.; funding acquisition: B.O.P.; resources: B.O.P.; and supervision: B.O.P.
Contributor Information
Bernhard O. Palsson, Email: bpalsson@ucsd.edu.
Lennart Schada von Borzyskowski, Universiteit Leiden, Leiden, Netherlands .
DATA AVAILABILITY
All data generated or analyzed during this study are included in this published article and its supplemental files. The source code used in this study is available online (https://github.com/akbari84/Dynamic-Mode-Analysis.git).
SUPPLEMENTAL MATERIAL
The following material is available online at https://doi.org/10.1128/msystems.01001-23.
Information about the kinetic model of the glycolytic pathway discussed in the text.
Complete pooling matrices of the glycolytic pathway associated with the sliding time window as it moves from the initial time to the steady state.
Fig. S1-S4.
An accounting of the reviewer comments and feedback.
ASM does not own the copyrights to Supplemental Material that may be linked to, or accessed through, an article. The authors have granted ASM a non-exclusive, world-wide license to publish the Supplemental Material files. Please contact the corresponding author directly for reuse.
REFERENCES
- 1. Karr JR, Sanghvi JC, Macklin DN, Gutschow MV, Jacobs JM, Bolival B, Assad-Garcia N, Glass JI, Covert MW. 2012. A whole-cell computational model predicts phenotype from genotype. Cell 150:389–401. doi: 10.1016/j.cell.2012.05.044 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Macklin DN, Ahn-Horst TA, Choi H, Ruggero NA, Carrera J, Mason JC, Sun G, Agmon E, DeFelice MM, Maayan I, Lane K, Spangler RK, Gillies TE, Paull ML, Akhter S, Bray SR, Weaver DS, Keseler IM, Karp PD, Morrison JH, Covert MW. 2020. Simultaneous cross-evaluation of heterogeneous E. coli datasets via mechanistic simulation. Science 369:eaav3751. doi: 10.1126/science.aav3751 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Palsson BO, Joshi A. 1987. On the dynamic order of structured Escherichia coli growth models. Biotechnol Bioeng 29:789–792. doi: 10.1002/bit.260290623 [DOI] [PubMed] [Google Scholar]
- 4. Heinrich R, Sonntag I. 1982. Dynamics of non-linear biochemical systems and the evolutionary significance of time hierarchy. Biosystems 15:301–316. doi: 10.1016/0303-2647(82)90045-4 [DOI] [PubMed] [Google Scholar]
- 5. Stelling J, Sauer U, Szallasi Z, Doyle FJ, Doyle J. 2004. Robustness of cellular functions. Cell 118:675–685. doi: 10.1016/j.cell.2004.09.008 [DOI] [PubMed] [Google Scholar]
- 6. Hintze A, Adami C. 2008. Evolution of complex modular biological networks. PLoS Comput Biol 4:e23. doi: 10.1371/journal.pcbi.0040023 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Arkin A, Ross J. 1995. Statistical construction of chemical reaction mechanisms from measured time-series. J Phys Chem 99:970–979. doi: 10.1021/j100003a020 [DOI] [Google Scholar]
- 8. Arkin A, Shen P, Ross J. 1997. A test case of correlation metric construction of a reaction pathway from measurements. Science 277:1275–1279. doi: 10.1126/science.277.5330.1275 [DOI] [Google Scholar]
- 9. Kauffman KJ, Pajerowski JD, Jamshidi N, Palsson BO, Edwards JS. 2002. Description and analysis of metabolic connectivity and dynamics in the human red blood cell. Biophys J 83:646–662. doi: 10.1016/S0006-3495(02)75198-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Jamshidi N, Palsson BØ. 2008. Top-down analysis of temporal hierarchy in biochemical reaction networks. PLoS Comput Biol 4:e1000177. doi: 10.1371/journal.pcbi.1000177 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Radulescu O, Gorban AN, Zinovyev A, Lilienbaum A. 2008. Robust simplifications of multiscale biochemical networks. BMC Syst Biol 2:86. doi: 10.1186/1752-0509-2-86 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Reinhardt V, Winckler M, Lebiedz D. 2008. Approximation of slow attracting manifolds in chemical kinetics by trajectory-based optimization approaches. J Phys Chem A 112:1712–1718. doi: 10.1021/jp0739925 [DOI] [PubMed] [Google Scholar]
- 13. Schmid PJ. 2022. Dynamic mode decomposition and its variants. Annu Rev Fluid Mech 54:225–254. doi: 10.1146/annurev-fluid-030121-015835 [DOI] [Google Scholar]
- 14. Lam SH, Goussis DA. 1994. The csp method for simplifying kinetics. Int J of Chemical Kinetics 26:461–486. doi: 10.1002/kin.550260408 [DOI] [Google Scholar]
- 15. Chiavazzo E, Gorban AN, Karlin IV. 2007. Comparison of invariant manifolds for model reduction in chemical kinetics. Commun Comput Phys 2:964–992. [Google Scholar]
- 16. Tomlin AS, Li G, Rabitz H, Tóth J. 1997. The effect of lumping and expanding on kinetic differential equations. SIAM J Appl Math 57:1531–1556. doi: 10.1137/S0036139995293294 [DOI] [Google Scholar]
- 17. Conzelmann H, Saez-Rodriguez J, Sauter T, Kholodenko BN, Gilles ED. 2006. A domain-oriented approach to the reduction of combinatorial complexity in signal transduction networks. BMC Bioinformatics 7:1–15. doi: 10.1186/1471-2105-7-34 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Schmid PJ. 2010. Dynamic mode decomposition of numerical and experimental data. J Fluid Mech 656:5–28. doi: 10.1017/S0022112010001217 [DOI] [Google Scholar]
- 19. Wynn A, Pearson DS, Ganapathisubramani B, Goulart PJ. 2013. Optimal mode decomposition for unsteady flows. J Fluid Mech 733:473–503. doi: 10.1017/jfm.2013.426 [DOI] [Google Scholar]
- 20. Jovanović MR, Schmid PJ, Nichols JW. 2014. Sparsity-promoting dynamic mode decomposition. Phys Fluids 26. doi: 10.1063/1.4863670 [DOI] [Google Scholar]
- 21. Ben-Israel A, Greville TNE. 2003. Generalized inverses: theory and applications. Springer Science & Business Media. [Google Scholar]
- 22. Jamshidi N, Palsson BØ. 2008. Formulating genome-scale kinetic models in the post-genome era. Mol Syst Biol 4:171. doi: 10.1038/msb.2008.8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Berg JM, Tymoczko JL, Stryer L. 2002. Biochemistry. 5th ed. Freeman, New York. [Google Scholar]
- 24. Haiman ZB, Zielinski DC, Koike Y, Yurkovich JT, Palsson BO. 2021. Masspy: building, simulating, and visualizing dynamic biological models in python using mass action kinetics. PLoS Comput Biol 17:e1008208. doi: 10.1371/journal.pcbi.1008208 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Chandra FA, Buzi G, Doyle JC. 2011. Glycolytic oscillations and limits on robust efficiency. Science 333:187–192. doi: 10.1126/science.1200705 [DOI] [PubMed] [Google Scholar]
- 26. Bordbar A, McCloskey D, Zielinski DC, Sonnenschein N, Jamshidi N, Palsson BO. 2015. Personalized whole-cell kinetic models of metabolism for discovery in genomics and pharmacodynamics. Cell Syst 1:283–292. doi: 10.1016/j.cels.2015.10.003 [DOI] [PubMed] [Google Scholar]
- 27. Palsson BO, Abrams M. 2011. Systems biology: simulation of dynamic network states. Cambridge University Press, Cambridge. [Google Scholar]
- 28. Mogilner A, Wollman R, Marshall WF. 2006. Quantitative modeling in cell biology: what is it good for? Dev Cell 11:279–287. doi: 10.1016/j.devcel.2006.08.004 [DOI] [PubMed] [Google Scholar]
- 29. McCloskey D, Palsson BØ, Feist AM. 2013. Basic and applied uses of genome-scale metabolic network reconstructions of Escherichia coli. Mol Syst Biol 9:661. doi: 10.1038/msb.2013.18 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Del Sol A, Jung S. 2021. The importance of computational modeling in stem cell research. Trends Biotechnol 39:126–136. doi: 10.1016/j.tibtech.2020.07.006 [DOI] [PubMed] [Google Scholar]
- 31. Joshi A, Palsson BO. 1989. Metabolic dynamics in the human red cell: part I—a comprehensive kinetic model. J Theor Biol 141:515–528. doi: 10.1016/s0022-5193(89)80233-4 [DOI] [PubMed] [Google Scholar]
- 32. Berndt N, Bulik S, Wallach I, Wünsch T, König M, Stockmann M, Meierhofer D, Holzhütter H-G. 2018. Hepatokin1 is a biochemistry-based model of liver metabolism for applications in medicine and pharmacology. Nat Commun 9:2386. doi: 10.1038/s41467-018-04720-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Segrè D, Vitkup D, Church GM. 2002. Analysis of optimality in natural and perturbed metabolic networks. Proc Natl Acad Sci U S A 99:15112–15117. doi: 10.1073/pnas.232349399 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Beg QK, Vazquez A, Ernst J, de Menezes MA, Bar-Joseph Z, Barabási A-L, Oltvai ZN. 2007. Intracellular crowding defines the mode and sequence of substrate uptake by Escherichia coli and constrains its metabolic activity. Proc Natl Acad Sci U S A 104:12663–12668. doi: 10.1073/pnas.0609845104 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Henry CS, Broadbelt LJ, Hatzimanikatis V. 2007. Thermodynamics-based metabolic flux analysis. Biophys J 92:1792–1805. doi: 10.1529/biophysj.106.093138 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Lerman JA, Hyduke DR, Latif H, Portnoy VA, Lewis NE, Orth JD, Schrimpe-Rutledge AC, Smith RD, Adkins JN, Zengler K, Palsson BO. 2012. In silico method for modelling metabolism and gene product expression at genome scale. Nat Commun 3:929. doi: 10.1038/ncomms1928 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Akbari A, Palsson BO. 2021. Scalable computation of intracellular metabolite concentrations. Comput Chem Eng 145:107164. doi: 10.1016/j.compchemeng.2020.107164 [DOI] [Google Scholar]
- 38. Akbari A, Yurkovich JT, Zielinski DC, Palsson BO. 2021. The quantitative metabolome is shaped by abiotic constraints. Nat Commun 12:3178. doi: 10.1038/s41467-021-23214-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Basan M, Hui S, Okano H, Zhang Z, Shen Y, Williamson JR, Hwa T. 2015. Overflow metabolism in Escherichia coli results from efficient proteome allocation. Nature 528:99–104. doi: 10.1038/nature15765 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Erickson DW, Schink SJ, Patsalo V, Williamson JR, Gerland U, Hwa T. 2017. A global resource allocation strategy governs growth transition kinetics of Escherichia coli. Nature 551:119–123. doi: 10.1038/nature24299 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Avanzini F, Freitas N, Esposito M. 2023. Circuit theory for chemical reaction networks. Phys Rev X 13. doi: 10.1103/PhysRevX.13.021041 [DOI] [Google Scholar]
- 42. Hyvärinen A, Oja E. 2000. Independent component analysis: algorithms and applications. Neural Netw 13:411–430. doi: 10.1016/s0893-6080(00)00026-5 [DOI] [PubMed] [Google Scholar]
- 43. Warren PB, Jones JL. 2007. Duality, thermodynamics, and the linear programming problem in constraint-based models of metabolism. Phys Rev Lett 99:108101. doi: 10.1103/PhysRevLett.99.108101 [DOI] [PubMed] [Google Scholar]
- 44. Ivancevic VG, Ivancevic TT. 2007. Applied differential geometry: a modern introduction. World Scientific. [Google Scholar]
- 45. Edwards WS, Tuckerman LS, Friesner RA, Sorensen DC. 1994. Krylov methods for the incompressible navier-stokes equations. J Comput Phys 110:82–102. doi: 10.1006/jcph.1994.1007 [DOI] [Google Scholar]
- 46. Wrede RC. 2013. Introduction to vector and tensor analysis. Courier Corporation. [Google Scholar]
- 47. Nocedal J, Wright S. 2006. Numerical optimization. Springer Science & Business Media. [Google Scholar]
- 48. Alberty RA. 2005. Thermodynamics of biochemical reactions. John Wiley & Sons. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Information about the kinetic model of the glycolytic pathway discussed in the text.
Complete pooling matrices of the glycolytic pathway associated with the sliding time window as it moves from the initial time to the steady state.
Fig. S1-S4.
An accounting of the reviewer comments and feedback.
Data Availability Statement
All data generated or analyzed during this study are included in this published article and its supplemental files. The source code used in this study is available online (https://github.com/akbari84/Dynamic-Mode-Analysis.git).





