Skip to main content
PLOS Computational Biology logoLink to PLOS Computational Biology
. 2021 Oct 13;17(10):e1009513. doi: 10.1371/journal.pcbi.1009513

Modular assembly of dynamic models in systems biology

Michael Pan 1,2,3,*, Peter J Gawthrop 1, Joseph Cursons 4, Edmund J Crampin 1,2,3,5
Editor: Anders Wallqvist6
PMCID: PMC8544865  PMID: 34644304

Abstract

It is widely acknowledged that the construction of large-scale dynamic models in systems biology requires complex modelling problems to be broken up into more manageable pieces. To this end, both modelling and software frameworks are required to enable modular modelling. While there has been consistent progress in the development of software tools to enhance model reusability, there has been a relative lack of consideration for how underlying biophysical principles can be applied to this space. Bond graphs combine the aspects of both modularity and physics-based modelling. In this paper, we argue that bond graphs are compatible with recent developments in modularity and abstraction in systems biology, and are thus a desirable framework for constructing large-scale models. We use two examples to illustrate the utility of bond graphs in this context: a model of a mitogen-activated protein kinase (MAPK) cascade to illustrate the reusability of modules and a model of glycolysis to illustrate the ability to modify the model granularity.

Author summary

The biochemistry within a cell is complex, being composed of numerous biomolecules and reactions. In order to develop fully detailed mathematical models of cells, smaller submodels need to be constructed and connected together. Software and standards can assist in this endeavour, but challenges remain in ensuring that submodels are both consistent with each other and consistent with the fundamental conservation laws of physics. In this paper, we propose a new approach using bond graphs from engineering. In this approach, connections between models are defined using physical conservation laws. We show that this approach is compatible with current software approaches in the field, and can therefore be readily used to incorporate physical consistency into existing model integration methodologies. We illustrate the utility of this approach in streamlining the development of models for a signalling network (the MAPK cascade) and a metabolic network (the glycolysis pathway). The advantage of this approach is that models can be developed in a scalable manner while also ensuring consistency with the laws of physics, enhancing the range of data available to train models. This approach can be used to quickly construct detailed and accurate models of cells, facilitating future advances in biotechnology and personalised medicine.

Introduction

Over the past few decades, advances in both data generation and computational resources have enabled the construction of large-scale kinetic models in systems biology, including whole-cell models that represent every known biomolecule in the cell [1]. An accurate and robust whole-cell model can provide several benefits to the community: data on specific organisms can be cross-evaluated and reconciled [2]; simulations could be used to rule out fruitless experiments and clinical trials; the models themselves could be used as a basis for designing novel circuits in synthetic biology; and fundamental questions about biology may be addressed in a holistic and systematic manner [3, 4].

The first comprehensive whole-cell model was developed for Mycoplasma genitalium [1] and there are ongoing efforts to develop whole-cell models of Escherichia coli [5] and human cells [6]. However, it has been acknowledged that the highly manual practices used in the development of the initial model of M. genitalium are unlikely to scale up to more complex organisms. The biomodelling community has identified several potential roadblocks to whole-cell modelling, including the lack of sufficient biological knowledge and data, model incompatibility, inadequate model development tools, inadequate model formats and parameter uncertainty [4, 6].

This paper addresses the issue of approaching model development in a modular manner. Typical requirements for such model development strategies involve reusing and integrating submodels together into more comprehensive models, and swapping between alternative models of the same system for benchmarking and comparison [79]. There have been several software-related developments in the systems biology community focussing on improving the reusability of models, some of which are beginning to be used in whole-cell modelling [10]. However, ensuring the reusability of fully integrated cell models remains a challenge [11]. While adequate software frameworks are essential to the modular development of large-scale kinetic models, an understanding of the physics of biological systems is also necessary to address issues in model compatibility and provenance. In particular, the laws of thermodynamics have been invoked in the context of metabolic modelling, allowing modellers to estimate the energetic favourability of reactions [12], to constrain fluxes in constraint-based models [13, 14] and to improve parameter estimation in large-scale kinetic models [1517] and whole-cell models [5]. This paper argues that the concepts of module interconnection from physics and thermodynamics are consistent with current model development practices in systems biology, and we suggest the use of bond graphs (from the discipline of engineering) as a framework for unifying developments from both software development and biological thermodynamics.

We begin by defining modularity in systems biology and arguing that an improved understanding of biophysics can contribute to this area. We then use biochemical examples to illustrate how bond graphs incorporate physical constraints into a modular framework for systems biology. In the Results section, we illustrate the benefits of this approach by applying the principles of modular development to bond graph models of a mitogen-activated protein kinase (MAPK) cascade and glycolysis. Finally, we summarise ongoing developments in unifying modularity and thermodynamics in systems biology and conclude with some suggestions to enable the development of fully detailed models of cells.

Methods

Modularity in systems biology

Due to their complexity, large-scale models in systems biology need to be constructed by dividing the problem into manageable submodels. Early notions of modularity in biomodelling were borrowed from principles in engineering and software development [18, 19]. In those disciplines, modules can be defined as parts of a system that (a) retain their own identity and are often developed and operated independently, but interact with other parts of the system and (b) hide the details of their implementation from the rest of the system, except through pre-defined interfaces [9]. Using this notion of modularity, the parts of a module that are available for connection and communication are said to be “exposed”.

However, the definition above—which is also known as “black-box” modularity—is not conducive to the incremental accumulation of knowledge that occurs in biology. Advancements in our understanding of biology may force modellers to interface with previously hidden components within existing models [7, 9]. It is becoming increasingly apparent that modules in biological modelling need to be more flexible than engineering modules. As a result, the notion of modularity in systems biology is far less clear than in the established disciplines of engineering and software development. In recent years, systems biology has favoured the use of a “white-box” approach to modularity in which modules do not completely hide the details of their implementation, but instead allow individual variables and components to be exposed as required [20].

Broadly speaking, notions of modularity used in systems biology can be categorised into computational modularity, the ability for models to communicate and interact with each other in a physically consistent manner; and functional (or behavioural) modularity, the ability of modules to be isolated from the effects of other modules. This paper will focus on computational modularity. The role of functional modularity is pivotal to systems and synthetic biology, particularly in reducing loading effects between engineered genetic circuits [21]. However, functional modularity can only be analysed and designed through the lens of computational modularity [22].

If handled correctly, modular model development can provide a number of benefits to modellers (see Fig 1), including:

Fig 1. The importance of modularity in models for systems biology.

Fig 1

Modularity can facilitate the construction of whole-cell models by (1) providing unambiguous and flexible interfaces for submodels to communicate; (2) allowing model development and unit testing to be done on individual submodels; (3) separating the description of the model from its implementation; (4) allowing models to be iteratively updated with a record of how the equations and parameters were derived; and (5) allowing repeated motifs to be abstracted into reusable structures.

  1. Enabling large-scale models to be built from smaller submodules that communicate through clear and unambiguous interfaces.

  2. Providing a framework for models to be developed, tested and validated in isolation before incorporating them into larger models.

  3. Separating the description of model equations from the software implementation of the model (including simulation).

  4. Allowing incremental changes to be made to existing models in light of new measurements or knowledge, and allowing the provenance of models to be tracked.

  5. Enabling the abstraction of important modules, providing the means to instantiate multiple copies of repeated motifs and swapping out a submodel for another model with a different level of granularity.

Thus, modularity can facilitate collaborative efforts to build whole-cell models, allows models to be updated where necessary and enhances the usefulness of models beyond their initial publication.

Current approaches to model reuse and integration in systems biology can be broadly categorised into three approaches, ordered by increasing flexibility:

  1. Standard model description formats. To enable the reuse of biomodels by different research groups, biomodellers have developed standards for describing models. Of these, the Systems Biology Markup Language (SBML) [23] and CellML [19] are two prominent examples. Once encoded within such standard model descriptions, analysis and simulation can be run on separately developed software such as OpenCOR (CellML) and COPASI (SBML) [24, 25]. The simulation protocols can themselves be specified using the Simulation Experiment Description Markup Language (SED-ML) [26].

  2. Biological modularity. Biological-level modularity introduces white-box modularity to systems biology by annotating model variables and parameters with standardised, machine-readable ontological terms [9]. This enables a strategy where software can automatically compose separately developed models together [9, 20]. SemGen and semanticSBML are two software tools that implement biological-level modularity [20, 27]. SBML also supports both white and black box modularity through its hierarchical package [28], which is implemented in the graphical tool iBioSim [29].

  3. Programmatic approaches. The “programmatic approach” to modelling was developed to allow models to integrate together in a flexible manner, but also to address the relative inflexibility of standard modelling languages. In the programmatic approach, models are treated as declarative programming objects rather than mathematical equations [7]. Using this approach, models can be embedded within programming languages such as Python. This approach automates model construction by allowing models to be defined at different hierarchical levels, for example by generating equations through the specification of macros for repeating motifs. Two implementations of the programmatic approach are little b [7] and PySB [30]. More recently, the BondGraphTools package has been developed to introduce thermodynamics into such an approach [31].

Note that these approaches are not independent of each other, and many modelling frameworks use several of these approaches [32, 33].

Despite recent computational advances in enabling the modular development of biomodels, there remain key limitations in current approaches:

  1. There is no guarantee that the integrated model will be consistent with basic physical principles such as conservation of mass, charge and energy.

  2. It remains difficult to resolve points of conflict between models, such as conflicts between parameters and assumptions.

  3. There is limited scope for dealing with multi-physics systems that arise in electrophysiology and mechanochemistry.

Resolving these issues requires the conservation laws of physics to be embedded within computational modules. Network Thermodynamics, using bond graphs, is a modelling framework that fits with the requirement of developing physically consistent models, while retaining compatibility with existing approaches.

Bond graphs

Bond graphs provide a modular framework for constructing physically and thermodynamically consistent models in systems biology. The framework was first applied to biology by Oster, Perelson and Katchalsky in the context of Network Thermodynamics, as a method for incorporating the laws of thermodynamics into theoretical models of living systems [34, 35]. This work followed in the tradition in physics and engineering that if you “get the physics right”, “the rest is mathematics” [36, 37]. Bond graph models are defined by combining constitutive relations with physical conservation laws, giving rise to a declarative model structure. This confers some advantages from a modelling perspective:

  1. Models can be specified in terms of physical connections between components, giving rise to a graphical representation of the model equations, which are consistent with the conservation laws of physics.

  2. Bond graphs inherently support modular modelling, as components can easily be swapped in and out without affecting the high-level model structure.

  3. Due to the fundamental nature of energy in all physical systems, a thermodynamic approach can be used to link together models of systems from different physical domains such as the electrical, mechanical, chemical and hydraulic domains. Therefore, bond graphs models can be constructed for a wide range of multi-physical biological systems, including electrophysiology and mechanobiology [3840].

There has been a long history of thermodynamic modelling for biochemical reaction networks [5, 13, 15]. In this section, we introduce bond graphs as an intuitive method for embedding such approaches within a modular framework.

An explicit graphical representation of biochemical systems

We first use a simple example to illustrate the how the structure of a bond graph encodes differential equations [35, 41, 42]. Consider the enzyme-catalysed reaction in Fig 2A, noting that all chemical reactions are thermodynamically reversible. Assuming that the reactions follow the law of mass action, the system can be described using the differential equations

dxEdt=-v1+v2 (1a)
dxCdt=v1-v2 (1b)
dxSdt=-v1 (1c)
dxPdt=v2 (1d)

where the fluxes through the reactions v1 and v2 are given by

v1=k1+xExS-k1-xC (2a)
v2=k2+xC-k2-xExP. (2b)
Fig 2. The energetics of an enzyme-catalysed reaction.

Fig 2

(A) Chemical reaction scheme. (B) A graph representation of the reaction scheme typically seen in systems biology. Species are represented as circles and reactions are represented as squares. The grey arrows indicate the flow of mass. (C) A bond graph representation of the network. Note that in contrast to the graph representation in (B), additional elements have been added to the representation to represent conservation of mass (closed circles ●) and conservation of energy (triangles ▼). The arrows here represent the molar flow rate (green) and the associated chemical potential (blue), thus the flow of both mass and energy is accounted for. The arrowheads indicate the direction of positive flux, but all reactions can proceed in the reverse (negative) direction as well.

In the above equations, the rate parameters k1+, k1-, k2+ and k2- are defined in Fig 2A, and xE, xC, xS, xP are the concentrations (or molar amounts) of E, C, S and P respectively.

Because it is often useful to consider rate laws independently from the stoichiometry of the system, systems biologists may favour an expanded representation of the network as shown in Fig 2B, where the reactions and species reside in their own components. This representation follows the Systems Biology Graphical Notation (SBGN) standard [43].

The bond graph representation in Fig 2C is a further expansion of the diagram in Fig 2B. This representation firstly adds two physical variables to the edges: a chemical potential μ [J/mol] (blue variables) and molar flux v [mol/s] (green variables). Since μ and v multiply to give power P [J/s], each connection transfers energy between components. In addition, separate nodes (● and ▼) are used to model mass and energy conservation laws inherent within these systems, discussed further below.

Every component (node) within the system contains its own independent set of equations and parameters. Each chemical species (open circles ◯ in Fig 2C) is associated with a chemical potential μ. In dilute systems at constant temperature and pressure, this quantity is related to abundance x by

μ=RTln(Kx) (3)

where x [mol] is the amount of the species, K [mol−1] is the thermodynamic parameter for that species, R = 8.314 JK−1mol−1 is the ideal gas constant and T [K] is the absolute temperature. The parameter K is related to the standard free energy of the species; one can also write Eq 3 as

μ=μ0+RTln(c/c0)=μ0+RTln(xc0V) (4)

where c [M] is the concentration of species, V [L] is the volume of the compartment and μ0 [J/mol] is the standard chemical potential taken at a concentration of c0 (for simplicity, c0 is often taken to be 1M). By equating Eqs 3 and 4, K is related to μ0 through the equation

K=1c0Vexp(μ0/RT). (5)

Similarly, the rate of each reaction (squares □ in Fig 2C) is given by a constitutive relationship between reaction rate v and the thermodynamic potentials. For example, the thermodynamic Marcelin-de Donder equation represents reversible mass action kinetics [42]:

v=κ[exp(AfRT)-exp(ArRT)] (6)

where Af (Ar) is the forward (reverse) affinity, or the sum of chemical potentials within the reactants (products). We note that while we have used K and κ as our parameters, these values can also be expressed in terms of energetic quantities such as the free energy of formation (see Appendix A in S1 Text).

Therefore, the species in Fig 2C encode the relationships

μE=RTln(KExE) (7a)
μC=RTln(KCxC) (7b)
μS=RTln(KSxS) (7c)
μP=RTln(KPxP) (7d)

and the reactions encode the relationships

v1=κ1[exp(A1fRT)-exp(A1rRT)] (8a)
v2=κ2[exp(A2fRT)-exp(A2rRT)]. (8b)

To obtain the correct fluxes for each reaction, the chemical potentials μ of the species need to be correctly mapped onto the reaction affinities Af and Ar. Because reactions 1 and 2 are connected directly to μC in Fig 2C, it is clear that

A1r=A2f=μC. (9)

However, conservation of chemical potential (energy per mole) needs to be considered when determining A1f and A2r. These affinities are related to the species potentials through the relationships

A1f=μS+μE (10a)
A2r=μP+μE. (10b)

The above energy conservation equations are encoded within triangular (▼) components (Fig 2C), which constrain the model such that the sum of potentials of the edges directed into the triangles is equal to those directed outwards. Note also that fluxes v of the connected edges are equal as each reaction consumes reactants and produces products at the same rate. These common flow junctions are analogous to Kirchhoff’s voltage law in electrical circuits.

Finally, the fluxes through the reactions are related back to the rates of change in species through the conservation of mass components represented by the closed circles (●) in Fig 2C. These components constrain the fluxes such that the sum of fluxes into the component is equal to the sum of fluxes out of the component. These encode the mass balance equations in Eq 1. The common potential junction is analogous to Kirchhoff’s current law in electrical circuits.

Once Eqs 1, 710 are combined, one can derive the differential equations

dxEdt=-κ1KEKSxExS+κ1KCxC+κ2KCxC-κ2KEKPxExP (11a)
dxCdt=κ1KEKSxExS-κ1KCxC-κ2KCxC+κ2KEKPxExP (11b)
dxSdt=-κ1KEKSxExS+κ1KCxC (11c)
dxPdt=κ2KCxC-κ2KEKPxExP. (11d)

This thermodynamic formulation has the same form as the kinetic formulation (Eqs 1 and 2) with the parameters redefined as

k1+=κ1KEKS (12a)
k1-=κ1KC (12b)
k2+=κ2KC (12c)
k2-=κ2KEKP. (12d)

While the thermodynamic formulation contains more parameters (6) than the kinetic formulation (4), they overcome a limitation of kinetic parameters. Whereas kinetic parameters are not free to be independently specified and require detailed balance constraints to be thermodynamically consistent, thermodynamic parameters can be chosen independently. Systems biologists have previously used thermodynamic parameters to avoid thermodynamically inconsistent model behaviour [15, 44]. More recently, the approach has been suggested for whole-cell modelling as a method for resolving points of conflict between data [5].

Thus, a strength of bond graph models in this context is that the differential equations of a biological network can be directly derived from the network structure, which paves the way for the modular construction of such models.

A remark on notation: Traditionally, the bond graph representation uses a textual notation for components rather than the graphical notation used in this paper. Specifically, species (◯) are represented as C components, reactions (□) as Re components, common potentials (●) as 0-junctions and common flows (▼) as 1-junctions. Furthermore, bonds are depicted using half-arrows rather than full arrows. However, in light of recent efforts to “modernise” the bond graph representation [45], we have depicted components in shapes rather than letters to make the representation closer to conventions seen in systems biology.

A modular representation for enzymes

In systems biology, there are often several plausible equations for modelling enzyme-catalysed reactions. A modular approach is desirable in allowing one enzyme model to be swapped out for another [46]. We illustrate this by considering a modular version of the enzyme-catalysed reaction of Fig 2. The system can be represented using the diagram in Fig 3A, where S and P are connected via a yet-to-be-defined module shown by the light blue box. This arbitrary module can then be substituted for any component describing a plausible reaction mechanism.

Fig 3. Representation of enzymes as interchangeable modules.

Fig 3

(A) A modular representation of an enzyme. The module representing the enzyme, shown as a blue module, can be swapped depending on model requirements. (B) and (C) describe possible contents of the blue module. (B) Rate law representations of the enzyme, including mass action (white box) and the Michaelis-Menten equation (orange box). (C) Detailed multi-state representations of the enzyme, including the full two-state representation of the Michaelis-Menten enzyme (left) and a three-state representation with an explicit step for the conversion of substrate to product.

Enzyme-catalysed reactions can be described by rate laws (Fig 3B). The simplest of these is the law of mass action in Eq 6 (Fig 3B; white box). This can be substituted for more complex kinetics, for example, the reversible Michaelis-Menten equation (orange box)

v=κ¯e0eμS/RT-eμP/RT1+eμS/RTRb0+eμP/RTRb1 (13)

with the parameters κ¯ (rate constant [s−1]), Rb0 (binding constant of the substrate [dimensionless]), Rb1 (binding constant of the product [dimensionless]) and e0 (total amount of enzyme [mol]) [42]. Thus, a bond graph approach allows alternative rate laws to be easily swapped for one another while retaining thermodynamic consistency. It is worth noting that the Michaelis-Menten rate law can be derived as a simplification of a more complex mass action model [47, 48].

In some cases, the full dynamics of the enzymatic reaction need to be considered. The advantage of a modular representation is that groups of reactions can be encapsulated into a model component. The diagram in Fig 3A can be converted into a simple two-state mechanism by defining the light blue box as the network in the left panel of Fig 3C; this is the same system as seen in Fig 2. Alternatively, to consider the conversion of substrate-bound enzyme to product-bound enzyme, the module defined in the right panel of Fig 3C could be used.

As seen in the above examples, parts of a module can be exposed by leaving open one end of a connection, which imposes a boundary condition on the model, allowing it to be connected to an external component. This is analogous to leaving ports open in electrical circuits. This kind of modularity provides tools for managing model complexity: generic modules are easily replicated and reused for different reactions that use the same mechanism, and the internal details of complex enzymatic mechanisms can be hidden. We now illustrate these ideas through the modular development of a MAPK signalling cascade model, and then by considering the glycolytic metabolic pathway modelled using different reaction rate laws.

Results

Modular development of a model of the MAPK cascade

The MAPK cascades are a family of biochemical signalling pathways that regulate important biological processes including growth, proliferation, migration and differentiation [49]. These systems are composed of a series of phosphorylation events in which the phosphorylated substrate at one level of the cascade catalyses reactions at the next level, leading to signal amplification. From a modelling perspective, MAPK cascades contain a number of repeated motifs and therefore serve as interesting case studies for modular model development.

There are multiple MAPK cascades that naturally occur in eukaryotic cells. In this paper, we deal with the Mos/MAPK pathway found in Xenopus oocytes, a key regulator of maturation in these cells [50, 51]. MAPK cascades in human cells include the ERK-MAPK, c-Jun N-terminal kinase (JNK) and p38 MAPK pathways, which are of clinical relevance as they are implicated in both inflammation and cancer [52, 53]. While the kinetic properties may differ between pathways, a strength of taking a modular approach is that one can use similar network structures to account for many MAPK cascades.

Core model

Here we construct a model of the Mos/MAPK cascade in a modular fashion using bond graphs. The core MAPK cascade model we considered was based on a study by Huang and Ferrell [50]. We chose this model in particular as it accounts for the elementary mass-action between enzyme states, which is essential when dealing with systems with coupled enzymatic reactions [54].

The presence of repeated network motifs consisting of kinases and phosphatases motivates an approach where generic modules corresponding to the kinases and phosphatases are first constructed, and then assembled into a model of the MAPK cascade. This gives rise to a model with a hierarchical structure. The modules for the kinase and phosphatase are defined in Fig 4A and 4B. These use mechanisms similar to the Michaelis-Menten model in the left panel of Fig 3C, but the free enzyme E, ATP, ADP and Pi need to be shared between modules and are therefore represented as external connections. X and XP are generic labels referring to the unphosphorylated and phosphorylated substrate respectively.

Fig 4. A hierarchical and thermodynamic model of the MAPK cascade.

Fig 4

Generic modules can be made for (A) kinases and (B) phosphatases. (C) These modules can then be assembled into larger modules defining phosphorylation loops. To ease biological interpretation, the specific biological names are given in green where known. (D) Multiple copies of phosphorylation loops can be reused and connected to form a model of the MAPK cascade. For clarity, ATP, ADP and Pi have been omitted in (D).

In the Methods section, it was clear how components were connected to the ports of each module. However, the increased number of ports in this example demands a more precise method of specifying external connections. For each module, each port is labelled in parentheses, i.e. (label). These port labels are then used to define the connections to external components when the module is reused in a larger system—as indicated by the labels in red parentheses in Fig 4C and 4D. Using this notation, the kinase and phosphatase modules are combined into a generic model of a phosphorylation cycle (Fig 4C). Note that X and XP have been linked through a conservation of mass relationship, but the ports are otherwise exposed because all quantities are shared between modules in the full MAPK cascade model. In cases where there are multiple kinases and phosphatases operating in parallel, modules for these additional enzymes are easily added and connected to the mass conservation junction.

An advantage of defining generic modules is that multiple copies of these modules can be constructed and connected together. Since the MAPK cascade consists of multiple phosphorylation cycles, copies of the phosphorylation cycle module can be coupled together into a model of a full MAPK cascade (Fig 4D). Here, specific biomolecules are now assigned to the previously generic X and XP ports of the phosphorylation cycles. The multiple levels of the cascade are coupled by connecting the phosphorylated substrate in one level to the kinase port of the next cascade. We note that while this representation is abstracted, it is still a fully functional bond graph satisfying thermodynamic consistency. One could in principle flatten the bond graph by iteratively replacing modules with their definitions in Fig 4A–4C. Indeed, this approach can be used to algorithmically derive the equations of a bond graph model.

We chose model parameters to match the Huang and Ferrell [50] model as closely as possible. Because in that model the energetics of ATP hydrolysis were ignored and irreversible reactions were used, we reformulated the model to reintroduce both the effects of ATP hydrolysis and reversibility (details in Appendix A of S1 Text, with parameters given in S1 Table). Because the Huang and Ferrell model used irreversible reactions, an exact fit was impossible. Nonetheless, the reformulated model behaves almost identically to the original model (Figure A in S1 Text) under comparable physiological conditions.

We construct and run simulations of the model using the Python package BondGraphTools [31]; a tutorial for using this package can be found in Appendix C of S1 Text. The results are shown in Fig 5A, which plots the percentage of the activated kinase at each level of the cascade. We found that the concentrations settled to steady-state concentrations. These concentrations were recorded for different concentrations of input, resulting in the signal-response curves in Fig 5B. Under different input concentrations, the response of each substrate was sigmoidal. As expected from existing modelling studies [50], there was an amplification effect as the steepness of the transition from inactivated to activated forms became more pronounced towards the end of the cycle. When the energy supplied by ATP hydrolysis was reduced, a greater concentration of input was required to activate each of the kinases, with the effects being amplified at the lower levels of the cascade (Fig 5C).

Fig 5. Simulations of the model of the MAPK cascade.

Fig 5

(A) The activation of kinases over time. Activation is defined as the percentage of each kinase in its active state, i.e. the phosphorylated form of MAP3K and the biphosphorylated forms of MAP2K and MAPK. (B) The effect of input concentration on the steady-state concentrations of each of the activated kinases. Each curve is normalised to the highest concentration achieved for that species. (C) The activation curve in (B), but with reduced (80%) energy from ATP hydrolysis. The model was simulated with the initial conditions xMAP3K = 3 nM, xMAP2K = 1.2 μM, xMAPK = 1.2 μM, xMAP3K-Pase = 0.3 nM, xMAP2K-Pase = 0.3 nM, xMAPK-Pase = 0.12 μM. In (A), we initially set xMAP4K = 0.03 nM, whereas this initial condition was varied for (B) and (C). All other species had an initial concentration of zero.

While we took a black-box approach to modularity in this example, recent developments in bond graph modelling have enabled a more flexible white-box approach to modularity [55]. In Appendix B in S1 Text, we outline how this model could be constructed using such an approach where each module is itself a simulatable model.

Incorporation of feedback

An advantage of using a graphical and modular representation is that it is relatively easy to make incremental changes to models. We demonstrate this by modifying the model in Fig 4D—which we will now refer to as the “core” model—to include the effects of feedback. Both positive and negative feedback loops exist in MAPK cascades, but these often operate on molecules upstream of the cascade represented here [56]. To keep the model simple, we incorporate the effects of feedback with a generic mechanism assuming that the feedback operates on the input molecule MAP4K, and that the feedback occurs due to a phosphorylation event that either activates or inactivates the kinase.

To incorporate feedback, the core model of the MAPK cascade is rewired so that the input MAP4K and output MAPKPP are connected through a feedback module (Fig 6A). Because feedback is through a phosphorylation event, we model feedback by reusing the phosphorylation cycle module in Fig 4C. The new model contains the MAP4K-I species, representing the inactive form of MAP4K. Note that in Fig 6A, the feedback ports that the input and output species connect to (referred to as X1 and X2) determine whether positive or negative feedback results. In the case of positive feedback, the active form of MAP4K is the phosphorylated form, i.e. X1 = XP and X2 = X (Fig 6B). For negative feedback, the active form of MAP4K is the unphosphorylated form, i.e. X1 = X and X2 = XP (Fig 6C).

Fig 6. Feedback within the MAPK cascade.

Fig 6

(A) Feedback can be added by modifying the core model of the MAPK cascade such that the input (MAP4K) and output (MAPKPP) are connected through a feedback module, which is implemented using the phosphorylation cycle defined in Fig 4C. The nature of the connections can give rise to both positive and negative feedback loops. ATP, ADP and Pi have been omitted for clarity. (B) For positive feedback, the active form of MAP4K is the phosphorylated species; (C) whereas for negative feedback, the active form of MAP4K is the dephosphorylated species. (D) The steady-state response (MAPKPP) of each system in response to changing input. (top) No feedback; (middle) positive feedback; (bottom) negative feedback. The initial conditions are the same as in Fig 5. The model with positive feedback is bistable, and the upper curve is obtained by setting the initial conditions for MAP3K, MAP2K and MAPK to zero, and instead using xMAP3KP = 3 nM, xMAP2KPP = 1.2 μM, xMAPKPP = 1.2 μM.

Simulations of the MAPK cascade with feedback to steady-state are shown in Fig 6D, along with the corresponding simulation of the model without feedback for comparison. As has been predicted previously, negative feedback reduced ultrasensitivity [57, 58]. When viewed as an amplifier, negative feedback can increase the range of usable input concentrations at the expense of reduced gain [22, 59, 60]. The model with positive feedback exhibited bistability, a property seen in previous models of the MAPK cascade [56]. When the model was initialised in an active state, the response curve was virtually identical to the system without feedback. However, when the model was initialised to an inactive state, the response curve remained inactive for a wide range of input concentrations and only activated at high input concentrations.

Benchmarking rate laws in a model of glycolysis

Kinetic models of metabolic systems make use of numerous rate laws, such as mass action, Michaelis-Menten and Hill equations. However, in many cases, these rate laws are not thermodynamically consistent. The bond graph approach builds on existing work by using thermodynamically independent parameters to ensure that rate laws are thermodynamically consistent [15, 44, 48]. In this section, we use glycolysis as an example to demonstrate the ability of bond graphs to swap out rate laws for one another and to benchmark the performance of alternative rate laws.

One can create a model of glycolysis by using the stoichiometry to define a high-level reaction structure with swappable modules for each enzyme, and then choose an appropriate rate law for each enzyme, depending on the fit to data (Fig 7). Indeed, this approach was taken by Gawthrop et al. [48].

Fig 7. Modelling of enzymes within the glycolysis pathway.

Fig 7

(A) A network-level representation of the system, where the blue modules are free to be swapped depending on the rate law. Species without circles are considered to be external to the system, and in cases where they occur more than once, they are connected by equal potential components (●, omitted for compactness) to ensure mass conservation. (B) For illustrative purposes, we show the rate laws for the pgk enzyme. (C) The enzyme can be modelled using the mass action (top), Michaelis-Menten (middle) or generalised kinetics (bottom) rate laws. The notation for the mass action and Michaelis-Menten components are defined in Fig 3B. Note that since generalised kinetics rate laws depend on the chemical potentials of all substrates (and not just their sums), they cannot be decomposed into smaller modules.

A detailed rate law, used by Mason and Covert [5], is given by the equation

v=κ¯e0eAf/RT-eAr/RT-1+sS(1+eμs/RTRb,s)+pP(1+eμp/RTRb,p) (14)

where κ¯ is a rate parameter, S is the set of all reactants, P is the set of all products and Rb,z is the binding parameter associated with the substrate z. In the case of multiple stoichiometries, each binding site has a separate parameter Rb,z; S and P include multiple instances of such species (distinguished by numerical indices) in this scenario. For reactions where all reactants and products have a stoichiometry of one, the rate law is identical to convenience kinetics [44]. However, when multiple stoichiometries are involved (for example, in the enzyme pps, Fig 7A), this contains additional parameters. For this reason, we will refer to this rate law as the “generalised kinetics” (GK) rate law.

In many cases, it can be helpful to substitute complex rate laws with simpler ones, for example, to ease parameter estimation or to make mathematical analyses more tractable [61]. Thus, we constructed simplified versions of the generalised kinetics model (with parameters taken from Mason and Covert [5]) using both Michaelis-Menten kinetics (Eq 13) and mass action kinetics (Eq 6). In brief, the parameters were chosen to match the steady state of the generalised kinetics model. In the case of Michaelis-Menten kinetics, the binding parameters were chosen to match the behaviour of the enzyme to internal species where possible. Details of how parameters were derived for the simplified models are given in Appendix A of S1 Text, with parameters in S2 Table.

In order to obtain nonzero steady-state flows through the system, we assume that certain species (G6P, PYR, NAD, NADH, ATP, ADP, AMP, Pi, H, H2O) have a zero rate of change, modelling their replenishment through the environment. These are the external species, or “chemostats” in bond graph terminology [47].

Transient perturbations

We firstly tested each of the models by perturbing the concentrations of each of the internal species, causing transient shifts away from the reference steady state (Fig 8). The responses of the simplified models were firstly compared against the original generalised kinetics model by calculating the response time, which was defined as the time required for the system to return to within 5% of its maximum deviation. Distance from the reference steady state was calculated using the Euclidean norm

d=sSi(xs-xs,ss)2 (15)

where Si={F6P,F16P,DHAP,GAP,13DPG,3PG,2PG,PEP} is the set of all internal species and xs,ss is the concentration of s at the reference steady state. The response times are plotted in the top row of Fig 8. We also compared the models by plotting the concentration of PEP, the most downstream internal species against time (Fig 8; bottom row).

Fig 8. Response of glycolysis models to perturbations to internal species.

Fig 8

Each of the species (column titles) had its concentration instantaneously increased by 30% from steady state. Top row: response times; bottom row: change in [PEP] over time. The colour key is blue: generalised kinetics (GK), green: Michaelis-Menten (MM), red: mass action (MA). In cases where the curve for the generalised kinetics model is not visible, it matches with the Michaelis-Menten model.

As seen in all perturbations, the Michaelis-Menten model matched with the original generalised kinetics model extremely well. This was expected as the Michaelis-Menten model was parameterised to match the behaviour of the generalised kinetics models with respect to the internal species. The minor differences between the models stem from the fba enzyme, which has two products and therefore could not be exactly matched to the generalised kinetics model (see Appendix A of S1 Text).

The mass action model behaved substantially differently from the generalised kinetics model with respect to the perturbations, notably having a faster response time. Additionally, the concentration of PEP had a greater maximum deviation from its steady-state value in response to perturbations of the more upstream species. Unlike the other rate laws, the law of mass action does not account for the rate-limiting step of enzyme complexes releasing product. Thus, these observations could potentially be attributed to the lack of saturation in the mass action rate law, causing increased reaction fluxes.

Steady-state perturbations

We also tested the response of the models to prolonged perturbations with external species, which caused the models to move to different steady states (Fig 9). Once again, the models were compared using the response time (top row) and concentration of PEP (bottom row). While all models have the same steady state without the perturbation, they settle to different steady states under prolonged perturbations due to differences in how the rate laws react to changes in the concentration of boundary species. Accordingly, we also quantified the magnitude of the shift in steady state using the Euclidean norm (middle row).

Fig 9. Response of glycolysis models to prolonged perturbations to external species.

Fig 9

Each of the species (column titles) had its concentration instantaneously increased by 30% from steady state. Top row: response times; middle row: steady state deviation; bottom row: change in [PEP] over time. The colour key is blue: generalised kinetics (GK), green: Michaelis-Menten (MM), red: mass action (MA). The response of the model to NADH was omitted as there was a negligible change in steady state.

While the Michaelis-Menten model still qualitatively resembled the generalised kinetics model, differences started to emerge when external species were perturbed, as there were insufficient parameters to match the behaviour in response to external species. These differences appeared to be most significant for perturbations to NAD, ATP and Pi. In general, the response time for the Michaelis-Menten model was slightly longer than the full model.

Following the trend for internal perturbations, the mass action model behaved significantly differently from the original model. The mass action model had a shorter response time and reached a different steady state in many cases.

These results would appear to suggest that saturation is an important property to consider when modelling the dynamic behaviour of metabolic networks, mirroring results from previous studies [62, 63]. Comparisons between the generalised kinetics and Michaelis-Menten models illustrate that while quantitative differences arise from simplifying out the complex binding properties of enzymes, simpler models may nonetheless be useful in studying the qualitative behaviour of metabolic networks, particularly under conditions where appropriate assumptions are satisfied.

Energetics of the glycolysis pathway

In addition to exploring fluxes and concentrations, bond graph models can be used to study the energetics of metabolic pathways, allowing modellers to incorporate thermodynamic measurements into methodologies for model parameterisation and validation. In some cases, analysing the transduction and dissipation of energy can result in novel insights and predictions [55].

The glycolysis pathway contains two points (fba/fbp and pyk/pps) at which carbon species are cycled by two enzymes while dissipating energy. These futile cycles (or “Cyclic Flow Modulation” [64]) are critical points of control, allowing the system to switch between glycolysis and gluconeogenesis [65, 66]. Much of this regulation is performed by allosteric regulation, which is not accounted for in this model. Nonetheless, the enzyme concentrations e0 can be changed to model the effects of allosteric regulation.

We analyse the energetics of the generalised kinetics model of glycolysis in this section. To simplify our analysis, we switch off the fbp and pps enzymes (generally associated with gluconeogenesis) by setting e0 to zero. The remaining reactions form a pathway, which we analyse at steady state. Using the methods of Gawthrop and Crampin [67], the glycolysis pathway can be defined as the sum of reactions

pgi+pfk+fba+tpi+2gap+2pgk+2gpm+2eno+2pyk. (16)

This pathway has the overall reaction

G6P+3ADP+2NAD+2Pi2PYR+3ATP+H+2NADH+2H2O. (17)

Thus we can calculate the overall affinity of the pathway to be

Aglycolysis=μG6P+3μADP+2μNAD+2μPi-2μPYR-3μATP-μH-2μNADH-2μH2O (18a)
=Apgi+Apfk+Afba+Atpi+2Agap+2Apgk+2Agpm+2Aeno+2Apyk. (18b)

The energy-based approach reveals a more detailed picture of how energy is dissipated throughout the pathway. Using the concentrations of each metabolite at steady state, one can calculate the affinity of each individual reaction. As expected, when scaled by the contribution of each reaction to the pathway, the affinities of the reactions add up to the total affinity (Table 1).

Table 1. Distribution of free energy changes in the glycolysis pathway.

The total free energy corresponds to the overall reaction G6P+3ADP+2NAD+2Pi2PYR+3ATP+H+2NADH+2H2O.

Reaction Affinity (kJ/mol)
pgi 43.4
pfk 81.0
fba 14.6
tpi 8.4
gap 51.5 (×2)
pgk 23.9 (×2)
gpm 13.5 (×2)
eno 45.1 (×2)
pyk 17.0 (×2)
Total 449.4

The total pathway affinity predicted by the model is higher than the experimentally measured values [68]. Furthermore, all reactions contribute significantly to the overall affinity, which differs from experimental measurements finding that the pfk and pyk are the predominant contributors to overall affinity, with the other reactions near equilibrium [68]. We note that for this particular model, the parameters were derived in the absence of standard free energies of formation [5]. Thus, a natural improvement to the model would be to use these values to parameterise models [69], which would likely improve the fit to experimental data. After incorporating data on thermodynamic constants, this model could be regarded as a validated module available for coupling with other reactions to form more comprehensive models. For example, recycling reactions such as the adenylate kinase and alcohol dehydrogenases could be added to study variations in the concentrations of ATP, ADP, AMP, NAD and NADH.

Discussion

It is widely accepted that a modular approach is essential to developing large-scale models in systems biology. While significant progress has been made in using computational resources to support modular modelling, it remains challenging to ensure the integrated models are consistent with the laws of physics. In this paper, we have illustrated that bond graphs are both modular and physically consistent, allowing them to unify developments from both software and thermodynamical modelling. These principles were demonstrated by constructing physically consistent models of two well-studied systems using the modular properties of bond graphs.

To construct the model of the MAPK cascade, we took advantage of the ability of bond graphs to embed modules into reusable templates. Due to the presence of repeating motifs, the development of the model was substantially simplified, illustrating how similar concepts may be usable in streamlining the development of models of more complex systems. As demonstrated in Appendix B of S1 Text, this approach can be extended to use white-box modules with flexible interfaces. Furthermore, the merging of models can be automated through the use of semantic annotations, and bond graphs have shown great potential in this space due to their biophysical detail [70].

In addition to embedding multiple components into a module, bond graphs also enable reactions to be modelled by a wide array of thermodynamically consistent rate laws. Using a network-level representation of glycolysis as a template, we showed how parts of a bond graph can be substituted for rate laws of varying complexity. This enabled a principled approach for benchmarking and comparing models of glycolysis with different levels of complexity. While not explored in this paper, a strength of this approach is that different parts of a model can be represented at different levels of granularity. This modular approach would allow one to study subsystems of interest using highly detailed models while using more manageable coarse-grained representations for the rest of the model.

Due to their modular nature, bond graphs are compatible with existing approaches to coupling reaction networks, as implemented in the SBML hierarchical package and PySB [28, 30]. These approaches are similar in that component definitions can be defined by either outer modules (black-box modularity) or be merged with other components (white-box modularity). This means that bond graphs could potentially be built on top of existing software infrastructure for model annotation, composition and simulation [24, 25, 71]. However, the bond graph approach expands on existing approaches in a few ways. Firstly, bond graphs explicitly account for the transfer of power, ensuring that models are thermodynamically consistent. Secondly, unlike in typical kinetic models, species components contain their own independent equations and parameters, allowing conflicts between models to be resolved. Finally, the approach is generalisable to multi-physics systems such as mechanobiology and electrophysiology.

An ongoing challenge to creating truly large-scale bond graph models is increasing the number of models available to modellers. Ideally, one would look to convert existing kinetic models in the literature into bond graphs. However, while bond graphs are readily converted into kinetic models, the conversion in the other direction is not always possible as existing kinetic models often use irreversible reactions or fail to satisfy detailed balance [47, 72, 73]. Thus, to maintain the advantages of using bond graphs, an avenue of future work would be to automatically generate bond graph versions of existing biomodels, such as those in the BioModels repository [74] or the Physiome Model Repository [75], and where necessary correcting thermodynamic inconsistencies while minimally changing model behaviour. In cases where the energetics of a process are considered negligible (for example gating in ion channels and neural control in locomotion), one may choose to embed the kinetics of such processes inside control elements of a bond graph [76]. This allows a kinetic model to be directly added to a bond graph as a “black box” in which the energetics underlying its dynamic behaviour are ignored.

Because bond graphs are inherently a modular and declarative representation, they are well suited for taking advantage of developments in programmatic modelling where models are constructed through a series of instructions from the software [7, 30]. Indeed, the models in this paper were constructed using the BondGraphTools Python package [31], a highly flexible and automatable approach for model construction that mirrors the approach of existing packages used within the systems biology community [30, 71]. Embedding bond graphs within a programmatic environment enables the construction of models using higher-level descriptions, allowing modellers define models using a relatively parsimonious set of commands. Previous work has shown that bond graphs can be constructed from a stoichiometric matrix [48] and a series of reactions [31]. Nonetheless, further work is needed to help automate model construction, in particular making use of rule-based approaches for constructing models of highly complex interactions between proteins, ligands and receptors [77, 78].

While we used the Python package BondGraphTools for the benchmarking and simulation in this paper, it would also be relatively straightforward to use typical systems biology simulators (for example COPASI [25], Tellurium [71] or PySB [30]) after a change of parameters. However, we note that these programs do not allow species equations to be specified. Thus, we advocate for retaining a bond graph representation of the model as a core representation that can be later modified should the need arise. We are working on exporters to convert bond graphs into more commonly used formats such as SBML and CellML to aid model reproducibility. A longer-term goal is to define bond graphs in SBML and CellML. CellML describes the mathematics of physical systems, thus bond graphs are generally straightforwardly defined in this standard. However, to implement the approach in SBML, one would need to extend the standard so that species have their own equations and parameters, potentially through a separate package.

In writing this paper, we hope to motivate the uptake of network thermodynamic models (and bond graph models in particular) by the systems biology community. One of the objectives of the Physiome Project is to embed bond graphs within the CellML modelling language [79]. This will facilitate the development of more user-friendly tools, allowing the conversion of existing models into bond graphs and subsequently coupling them. As the bond graph methodology gradually becomes more widely adopted by the community, we anticipate that existing mathematical and computational methods in the systems biology space will be used in conjunction with the bond graph approach, allowing biomodels to be integrated in a physically consistent manner.

In conjunction with the programmatic approach, bond graphs provide a useful framework for updating models and recording their provenance. In the development of the model of the MAPK cascade, we showed that incremental changes could be made to incorporate feedback. Through a modular approach, these changes were made by changing the components and connections within the network rather than deriving new equations entirely. Furthermore, these incremental changes can be recorded within the code used to construct such models, which could potentially enable an automated framework for profiling model provenance in the future.

In order to develop comprehensive models of cells, ongoing and future work will focus on expanding the range of cellular processes that bond graphs can represent. While significant progress has been made in metabolic modelling and transport processes, how to model gene regulation and signalling in an energy-based framework remains an open question. Modelling such processes will likely require theoretical groundwork to be established for:

  1. modelling discrete and stochastic systems

  2. choosing an appropriate level of model granularity to model each biological process

  3. dealing with the properties of macromolecules in a modular manner, so that the number of associated parameters remains manageable

While the Network Thermodynamics approach implemented using bond graphs requires modellers to take more care in developing their models in the short term, we believe that taking this approach will make large-scale models more robust, reusable and ultimately more useful to the systems biology community in the years to come.

Supporting information

S1 Text. Supplementary material.

Appendix A: Details of parameter identification for the MAPK cascade and glycolysis. Appendix B: Outline of a white-box approach to modelling the MAPK cascade. Appendix C: Tutorial on basic BondGraphTools functionality. Fig A: Comparison of the bond graph model of the MAPK cascade to the Huang and Ferrell model. Fig B: A white-box approach to defining a model of the MAPK cascade. Fig C: A bond graph model of the reaction A+BC. Fig D: BondGraphTools rendering of the reaction A+BC. Fig E: Simulation of the reaction A+BC.

(PDF)

S1 Table. Parameters for the MAPK cascade.

(XLSX)

S2 Table. Parameters for the glycolysis model.

(XLSX)

Data Availability

The code for this manuscript is available at https://github.com/mic-pan/Modularity-SysBio.

Funding Statement

This research was in part conducted and funded by the Australian Research Council Centre of Excellence in Convergent Bio-Nano Science and Technology (project number CE140100036, http://purl.org/au-research/grants/arc/CE140100036), awarded to EJC. MP was supported by a Postdoctoral Research Fellowship from the School of Mathematics and Statistics, University of Melbourne. PJG was supported by the Faculty of Engineering and Information Technology, University of Melbourne via a Professorial Fellowship. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1. Karr JR, Sanghvi JC, Macklin DN, Gutschow MV, Jacobs JM, Bolival B, et al. A Whole-Cell Computational Model Predicts Phenotype from Genotype. Cell. 2012;150(2):389–401. doi: 10.1016/j.cell.2012.05.044 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Macklin DN, Ahn-Horst TA, Choi H, Ruggero NA, Carrera J, Mason JC, et al. Simultaneous cross-evaluation of heterogeneous E. coli datasets via mechanistic simulation. Science. 2020;369(6502):eaav3751. doi: 10.1126/science.aav3751 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Carrera J, Covert MW. Why Build Whole-Cell Models? Trends in Cell Biology. 2015;25(12):719–722. doi: 10.1016/j.tcb.2015.09.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Babtie AC, Stumpf MPH. How to deal with parameters for whole-cell modelling. Journal of The Royal Society Interface. 2017;14(133):20170237. doi: 10.1098/rsif.2017.0237 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Mason JC, Covert MW. An energetic reformulation of kinetic rate laws enables scalable parameter estimation for biochemical networks. Journal of Theoretical Biology. 2019;461:145–156. doi: 10.1016/j.jtbi.2018.10.041 [DOI] [PubMed] [Google Scholar]
  • 6. Szigeti B, Roth YD, Sekar JAP, Goldberg AP, Pochiraju SC, Karr JR. A blueprint for human whole-cell modeling. Current Opinion in Systems Biology. 2018;7:8–15. doi: 10.1016/j.coisb.2017.10.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Mallavarapu A, Thomson M, Ullian B, Gunawardena J. Programming with models: modularity and abstraction provide powerful capabilities for systems biology. Journal of The Royal Society Interface. 2009;6(32):257–270. doi: 10.1098/rsif.2008.0205 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Cooling MT, Nickerson DP, Nielsen PMF, Hunter PJ. Modular modelling with Physiome standards. Journal of Physiology. 2016;594(23):6817–6831. doi: 10.1113/JP272633 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Neal ML, Cooling MT, Smith LP, Thompson CT, Sauro HM, Carlson BE, et al. A Reappraisal of How to Build Modular, Reusable Models of Biological Systems. PLOS Computational Biology. 2014;10(10):e1003849. doi: 10.1371/journal.pcbi.1003849 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Agmon E, Spangler RK, Skalnik CJ, Poole W, Peirce SM, Morrison JH, et al. Vivarium: an interface and engine for integrative multiscale modeling in computational biology. bioRxiv. 2021; p. 2021.04.27.441657. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Waltemath D, Karr JR, Bergmann FT, Chelliah V, Hucka M, Krantz M, et al. Toward Community Standards and Software for Whole-Cell Modeling. IEEE Transactions on Biomedical Engineering. 2016;63(10):2007–2014. doi: 10.1109/TBME.2016.2560762 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Henry CS, Jankowski MD, Broadbelt LJ, Hatzimanikatis V. Genome-Scale Thermodynamic Analysis of Escherichia coli Metabolism. Biophysical Journal. 2006;90(4):1453–1461. doi: 10.1529/biophysj.105.071720 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Soh KC, Hatzimanikatis V. Network thermodynamics in the post-genomic era. Current Opinion in Microbiology. 2010;13(3):350–357. doi: 10.1016/j.mib.2010.03.001 [DOI] [PubMed] [Google Scholar]
  • 14. Soh KC, Miskovic L, Hatzimanikatis V. From network models to network responses: integration of thermodynamic and kinetic properties of yeast genome-scale metabolic networks. FEMS Yeast Research. 2012;12(2):129–143. doi: 10.1111/j.1567-1364.2011.00771.x [DOI] [PubMed] [Google Scholar]
  • 15. Ederer M, Gilles ED. Thermodynamically Feasible Kinetic Models of Reaction Networks. Biophysical Journal. 2007;92(6):1846–1857. doi: 10.1529/biophysj.106.094094 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Lubitz T, Schulz M, Klipp E, Liebermeister W. Parameter Balancing in Kinetic Models of Cell Metabolism. Journal of Physical Chemistry B. 2010;114(49):16298–16303. doi: 10.1021/jp108764b [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Stanford NJ, Lubitz T, Smallbone K, Klipp E, Mendes P, Liebermeister W. Systematic Construction of Kinetic Models from Genome-Scale Metabolic Networks. PLoS ONE. 2013;8(11):e79195. doi: 10.1371/journal.pone.0079195 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Ginkel M, Kremling A, Nutsch T, Rehner R, Gilles ED. Modular modeling of cellular systems with ProMoT/Diva. Bioinformatics. 2003;19(9):1169–1176. doi: 10.1093/bioinformatics/btg128 [DOI] [PubMed] [Google Scholar]
  • 19. Lloyd CM, Halstead MDB, Nielsen PF. CellML: its future, present and past. Progress in Biophysics and Molecular Biology. 2004;85:433–450. doi: 10.1016/j.pbiomolbio.2004.01.004 [DOI] [PubMed] [Google Scholar]
  • 20. Neal ML, Carlson BE, Thompson CT, James RC, Kim KG, Tran K, et al. Semantics-Based Composition of Integrated Cardiomyocyte Models Motivated by Real-World Use Cases. PLOS ONE. 2015;10(12):e0145621. doi: 10.1371/journal.pone.0145621 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Del Vecchio D, Ninfa AJ, Sontag ED. Modular cell biology: retroactivity and insulation. Molecular Systems Biology. 2008;4(1). doi: 10.1038/msb4100204 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Gawthrop PJ, Crampin EJ. Modular bond-graph modelling and analysis of biomolecular systems. IET Systems Biology. 2016;10(5):187–201. doi: 10.1049/iet-syb.2015.0083 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Hucka M, Bergmann FT, Dräger A, Hoops S, Keating SM, Novère NL, et al. The Systems Biology Markup Language (SBML): Language Specification for Level 3 Version 2 Core. Journal of Integrative Bioinformatics. 2018;15(1). doi: 10.1515/jib-2017-0081 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Garny A, Hunter PJ. OpenCOR: a modular and interoperable approach to computational biology. Frontiers in Physiology. 2015;6. doi: 10.3389/fphys.2015.00026 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Hoops S, Sahle S, Gauges R, Lee C, Pahle J, Simus N, et al. COPASI — a COmplex PAthway SImulator. Bioinformatics. 2006;22(24):3067–3074. doi: 10.1093/bioinformatics/btl485 [DOI] [PubMed] [Google Scholar]
  • 26. Waltemath D, Adams R, Bergmann FT, Hucka M, Kolpakov F, Miller AK, et al. Reproducible computational biology experiments with SED-ML—The Simulation Experiment Description Markup Language. BMC Systems Biology. 2011;5:198. doi: 10.1186/1752-0509-5-198 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Krause F, Uhlendorf J, Lubitz T, Schulz M, Klipp E, Liebermeister W. Annotation and merging of SBML models with semanticSBML. Bioinformatics. 2010;26(3):421–422. doi: 10.1093/bioinformatics/btp642 [DOI] [PubMed] [Google Scholar]
  • 28. Smith LP, Hucka M, Hoops S, Finney A, Ginkel M, Myers CJ, et al. SBML Level 3 package: Hierarchical Model Composition, Version 1 Release 3. Journal of Integrative Bioinformatics. 2015;12(2):603–659. doi: 10.2390/biecoll-jib-2015-268 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Watanabe L, Nguyen T, Zhang M, Zundel Z, Zhang Z, Madsen C, et al. iBioSim 3: A Tool for Model-Based Genetic Circuit Design. ACS Synthetic Biology. 2019;8(7):1560–1563. doi: 10.1021/acssynbio.8b00078 [DOI] [PubMed] [Google Scholar]
  • 30. Lopez CF, Muhlich JL, Bachman JA, Sorger PK. Programming biological models in Python using PySB. Molecular Systems Biology. 2013;9(1):646. doi: 10.1038/msb.2013.1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Cudmore P, Pan M, Gawthrop PJ, Crampin EJ. Analysing and simulating energy-based models in biology using BondGraphTools. bioRxiv. 2021; p. 2021.03.24.436763. [DOI] [PubMed] [Google Scholar]
  • 32. Daly AC, Clerx M, Beattie KA, Cooper J, Gavaghan DJ, Mirams GR. Reproducible model development in the cardiac electrophysiology Web Lab. Progress in Biophysics and Molecular Biology. 2018;139:3–14. doi: 10.1016/j.pbiomolbio.2018.05.011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Cowan AE, Mendes P, Blinov ML. ModelBricks—modules for reproducible modeling improving model annotation and provenance. npj Systems Biology and Applications. 2019;5(37):1–6. doi: 10.1038/s41540-019-0114-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Oster G, Perelson A, Katchalsky A. Network Thermodynamics. Nature. 1971;234:393–399. doi: 10.1038/234393a0 [DOI] [Google Scholar]
  • 35. Oster GF, Perelson AS, Katchalsky A. Network thermodynamics: dynamic modelling of biophysical systems. Quarterly Reviews of Biophysics. 1973;6(1):1–134. doi: 10.1017/S0033583500000081 [DOI] [PubMed] [Google Scholar]
  • 36.Kalman R. The evolution of system theory: My memories and hopes. In: IFAC World Congress. Prague; 2005.
  • 37. Willems JC. The Behavioral Approach to Open and Interconnected Systems. IEEE Control Systems Magazine. 2007;27(6):46–99. doi: 10.1109/MCS.2007.906923 [DOI] [Google Scholar]
  • 38. Gawthrop PJ, Siekmann I, Kameneva T, Saha S, Ibbotson MR, Crampin EJ. Bond graph modelling of chemoelectrical energy transduction. IET Systems Biology. 2017;11(5):127–138. doi: 10.1049/iet-syb.2017.0006 [DOI] [Google Scholar]
  • 39. Pan M, Gawthrop PJ, Tran K, Cursons J, Crampin EJ. Bond graph modelling of the cardiac action potential: implications for drift and non-unique steady states. Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences. 2018;474(2214):20180106. doi: 10.1098/rspa.2018.0106 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Díaz-Zuccarini V, Pichardo-Almarza C. On the formalization of multi-scale and multi-science processes for integrative biology. Interface Focus. 2011;1(3):426–437. doi: 10.1098/rsfs.2010.0038 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Cellier FE. Continuous System Modeling. New York: Springer; 1991. [Google Scholar]
  • 42. Gawthrop PJ, Crampin EJ. Energy-based analysis of biochemical cycles using bond graphs. Proceedings of the Royal Society of London A: Mathematical, Physical and Engineering Sciences. 2014;470(2171):20140459. doi: 10.1098/rspa.2014.0459 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Moodie S, Le Novère N, Demir E, Mi H, Villéger A. Systems Biology Graphical Notation: Process Description language Level 1 Version 1.3. Journal of Integrative Bioinformatics. 2015;12(2):263. doi: 10.1515/jib-2015-263 [DOI] [PubMed] [Google Scholar]
  • 44. Liebermeister W, Klipp E. Bringing metabolic networks to life: convenience rate law and thermodynamic constraints. Theoretical Biology and Medical Modelling. 2006;3:41. doi: 10.1186/1742-4682-3-41 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. de Bono B, Safaei S, Grenon P, Hunter P. Meeting the multiscale challenge: representing physiology processes over ApiNATOMY circuits using bond graphs. Interface Focus. 2018;8(1):20170026. doi: 10.1098/rsfs.2017.0026 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Liebermeister W, Uhlendorf J, Klipp E. Modular rate laws for enzymatic reactions: thermodynamics, elasticities and implementation. Bioinformatics. 2010;26(12):1528–1534. doi: 10.1093/bioinformatics/btq141 [DOI] [PubMed] [Google Scholar]
  • 47. Gawthrop PJ, Cursons J, Crampin EJ. Hierarchical bond graph modelling of biochemical networks. Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences. 2015;471(2184):20150642. doi: 10.1098/rspa.2015.0642 [DOI] [Google Scholar]
  • 48. Gawthrop PJ, Pan M, Crampin EJ. Modular dynamic biomolecular modelling with bond graphs: the unification of stoichiometry, thermodynamics, kinetics and data. Journal of the Royal Society Interface. 2021;18(181):20210478. doi: 10.1098/rsif.2021.0478 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Kholodenko BN. Negative feedback and ultrasensitivity can bring about oscillations in the mitogen-activated protein kinase cascades. European Journal of Biochemistry. 2000;267(6):1583–1588. doi: 10.1046/j.1432-1327.2000.01197.x [DOI] [PubMed] [Google Scholar]
  • 50. Huang CY, Ferrell JE. Ultrasensitivity in the mitogen-activated protein kinase cascade. Proceedings of the National Academy of Sciences. 1996;93(19):10078–10083. doi: 10.1073/pnas.93.19.10078 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51. Zhang W, Liu HT. MAPK signal pathways in the regulation of cell proliferation in mammalian cells. Cell Research. 2002;12:9–18. doi: 10.1038/sj.cr.7290105 [DOI] [PubMed] [Google Scholar]
  • 52. Huang P, Han J, Hui L. MAPK signaling in inflammation-associated cancer development. Protein & Cell. 2010;1(3):218–226. doi: 10.1007/s13238-010-0019-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53. Plotnikov A, Zehorai E, Procaccia S, Seger R. The MAPK cascades: Signaling components, nuclear roles and mechanisms of nuclear translocation. Biochimica et Biophysica Acta (BBA)—Molecular Cell Research. 2011;1813(9):1619–1633. doi: 10.1016/j.bbamcr.2010.12.012 [DOI] [PubMed] [Google Scholar]
  • 54. Ciliberto A, Capuani F, Tyson JJ. Modeling Networks of Coupled Enzymatic Reactions Using the Total Quasi-Steady State Approximation. PLOS Computational Biology. 2007;3(3):e45. doi: 10.1371/journal.pcbi.0030045 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55. Gawthrop PJ. Bond Graph Modeling of Chemiosmotic Biomolecular Energy Transduction. IEEE Transactions on NanoBioscience. 2017;16(3):177–188. doi: 10.1109/TNB.2017.2674683 [DOI] [PubMed] [Google Scholar]
  • 56. Shin SY, Rath O, Choo SM, Fee F, McFerran B, Kolch W, et al. Positive- and negative-feedback regulations coordinate the dynamic behavior of the Ras-Raf-MEK-ERK signal transduction pathway. Journal of Cell Science. 2009;122:425–435. doi: 10.1242/jcs.036319 [DOI] [PubMed] [Google Scholar]
  • 57. Ferrell JE, Machleder EM. The Biochemical Basis of an All-or-None Cell Fate Switch in Xenopus Oocytes. Science. 1998;280(5365):895–898. doi: 10.1126/science.280.5365.895 [DOI] [PubMed] [Google Scholar]
  • 58. Sturm OE, Orton R, Grindlay J, Birtwistle M, Vyshemirsky V, Gilbert D, et al. The Mammalian MAPK/ERK Pathway Exhibits Properties of a Negative Feedback Amplifier. Science Signaling. 2010;3(153):ra90. doi: 10.1126/scisignal.2001212 [DOI] [PubMed] [Google Scholar]
  • 59.Sauro HM, Ingalls B. MAPK Cascades as Feedback Amplifiers. arXiv e-prints. 2007;0710.5195.
  • 60. Sturm OE, Orton R, Grindlay J, Birtwistle M, Vyshemirsky V, Gilbert D, et al. The Mammalian MAPK/ERK Pathway Exhibits Properties of a Negative Feedback Amplifier. Science Signaling. 2010;3(153):ra90. doi: 10.1126/scisignal.2001212 [DOI] [PubMed] [Google Scholar]
  • 61. Gawthrop PJ, Cudmore P, Crampin EJ. Physically-plausible modelling of biomolecular systems: A simplified, energy-based model of the mitochondrial electron transport chain. J Theor Biol. 2020;493:110223. doi: 10.1016/j.jtbi.2020.110223 [DOI] [PubMed] [Google Scholar]
  • 62. Chakrabarti A, Miskovic L, Soh KC, Hatzimanikatis V. Towards kinetic modeling of genome-scale metabolic networks without sacrificing stoichiometric, thermodynamic and physiological constraints. Biotechnology Journal. 2013;8(9):1043–1057. doi: 10.1002/biot.201300091 [DOI] [PubMed] [Google Scholar]
  • 63. Du B, Zielinski DC, Kavvas ES, Dräger A, Tan J, Zhang Z, et al. Evaluation of rate law approximations in bottom-up kinetic models of metabolism. BMC Syst Biol. 2016;10(1):1–15. doi: 10.1186/s12918-016-0283-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64. Gawthrop PJ. Energy-Based Modeling of the Feedback Control of Biomolecular Systems With Cyclic Flow Modulation. IEEE Transactions on NanoBioscience. 2021;20(2):183–192. doi: 10.1109/TNB.2021.3058440 [DOI] [PubMed] [Google Scholar]
  • 65. Basan M, Honda T, Christodoulou D, Hörl M, Chang YF, Leoncini E, et al. A universal trade-off between growth and lag in fluctuating environments. Nature. 2020;584(7821):470–474. doi: 10.1038/s41586-020-2505-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66. Schink S, Christodoulou D, Mukherjee A, Athaide E, Sauer U, Basan M. Trade-offs in adaptation to glycolysis and gluconeogenesis result in a preferential flux direction in central metabolism. bioRxiv. 2021; p. 2021.05.07.443112. [Google Scholar]
  • 67. Gawthrop PJ, Crampin EJ. Energy-based analysis of biomolecular pathways. Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences. 2017;473(2202):20160825. doi: 10.1098/rspa.2016.0825 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68. Park JO, Rubin SA, Xu YF, Amador-Noguez D, Fan J, Shlomi T, et al. Metabolite concentrations, fluxes and free energies imply efficient enzyme usage. Nature Chemical Biology. 2016;12:482–489. doi: 10.1038/nchembio.2077 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69. Flamholz A, Noor E, Bar-Even A, Milo R. eQuilibrator–the biochemical thermodynamics calculator. Nucleic Acids Research. 2012;40(D1):D770–D775. doi: 10.1093/nar/gkr874 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70. Shahidi N, Pan M, Safaei S, Tran K, Crampin EJ, Nickerson DP. Hierarchical semantic composition of biosimulation models using bond graphs. PLOS Computational Biology. 2021;17(5):e1008859. doi: 10.1371/journal.pcbi.1008859 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71. Choi K, Medley JK, König M, Stocking K, Smith L, Gu S, et al. Tellurium: An extensible python-based modeling environment for systems and synthetic biology. Biosystems. 2018;171:74–79. doi: 10.1016/j.biosystems.2018.07.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72. Pan M, Gawthrop PJ, Tran K, Cursons J, Crampin EJ. A thermodynamic framework for modelling membrane transporters. Journal of Theoretical Biology. 2019;481:10–23. doi: 10.1016/j.jtbi.2018.09.034 [DOI] [PubMed] [Google Scholar]
  • 73. Pan M, Gawthrop PJ, Cursons J, Tran K, Crampin EJ. The cardiac Na+/K+ ATPase: An updated, thermodynamically consistent model. Physiome. 2020;7. [Google Scholar]
  • 74. Li C, Donizelli M, Rodriguez N, Dharuri H, Endler L, Chelliah V, et al. BioModels Database: An enhanced, curated and annotated resource for published quantitative kinetic models. BMC Systems Biology. 2010;4:92. doi: 10.1186/1752-0509-4-92 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75. Yu T, Lloyd CM, Nickerson DP, Cooling MT, Miller AK, Garny A, et al. The Physiome Model Repository 2. Bioinformatics. 2011;27(5):743–744. doi: 10.1093/bioinformatics/btq723 [DOI] [PubMed] [Google Scholar]
  • 76. Borutzky W. Bond Graph Methodology. London: Springer; 2010. [Google Scholar]
  • 77.Sekar JAP, Hogg JS, Faeder JR. Energy-based modeling in BioNetGen. In: 2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM); 2016. p. 15–18.
  • 78. Ollivier JF, Shahrezaei V, Swain PS. Scalable Rule-Based Modelling of Allosteric Proteins and Biochemical Networks. PLOS Computational Biology. 2010;6(11):e1000975. doi: 10.1371/journal.pcbi.1000975 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79. Hunter P, Balzani D. Modeling Framework for Computational Physiology. In: Encyclopedia of Continuum Mechanics. Berlin, Germany: Springer; 2020. p. 1691–1702. doi: 10.1007/978-3-662-55771-6_29 [DOI] [Google Scholar]
PLoS Comput Biol. doi: 10.1371/journal.pcbi.1009513.r001

Decision Letter 0

Anders Wallqvist, Daniel A Beard

25 Aug 2021

Dear Dr. Pan,

Thank you very much for submitting your manuscript "Modular assembly of dynamic models in systems biology" for consideration at PLOS Computational Biology. As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. The reviewers appreciated the attention to an important topic. Based on the reviews, we are likely to accept this manuscript for publication, providing that you modify the manuscript according to the review recommendations.

Please prepare and submit your revised manuscript within 30 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email.

When you are ready to resubmit, please upload the following:

[1] A letter containing a detailed list of your responses to all review comments, and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out

[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).

Important additional instructions are given below your reviewer comments.

Thank you again for your submission to our journal. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Anders Wallqvist

Associate Editor

PLOS Computational Biology

Daniel Beard

Deputy Editor

PLOS Computational Biology

***********************

A link appears below if there are any accompanying review attachments. If you believe any reviews to be missing, please contact ploscompbiol@plos.org immediately:

[LINK]

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: This is a manuscript that essentially summarizes the bond graph approach of network thermodynamics and proposes it to be a suitable method to modularize the development of large-scale models (such as whole-cell models). Overall I find the manuscript to be very clear and accessible and overall I think this will interest the readership of the journal.

My only major concern regards the glycolysis example used, where they chose to keep AMP, ADP, ATP, NAD and NADH constant. These conserved species are recycled and are too important to ignore by keeping them constant. It would have been much more interesting if they had included their recycling reactions (like alcohol dehydrogenase for NAD/NADH and an ATPase + an adenilate kinase for ATP/ADP/AMP. This would be much more in line with what they did in the MAPK example. However, I do understand that this is not essential for the purpose of their argument (the utility of bond graph approach), but it would be a whole lot more interesting. Importantly, doing that would allow a better energetic analysis of glycolysis...

There are a number of minor points that nevertheless I think they could easily address and which would make the manuscript much more useful for readers:

1- Thermodynamics

To be fair to readers it is important to be accurate in citing prior work. In this regard I bring attention to your sentence in the Introduction:

"While the whole-cell modelling community has emphasised the need to use physically measurable parameters, it is only recently that the importance of thermodynamics in these models has been acknowledged [5]."

the importance of thermodynamics in this type of models has been invoked prior to this by several authors, such as your references [32] and [33], Henry et al (2006), Soh et al (2012), Lubitz et al (2010) and very much in this large-scale kinetic model context by Stanford et al (2013) (disclaimer: I am a co-author of the latter and I am not writing this seeking a citation; rather I feel that a reader should not be mislead that your ref. [5] was the first time thermodynamics had been thought of in this context).

2- Functional modularity: I thought it could be pertinent to mention the work of Del Vecchio et al (2008) in this context.

3- Figure 2 is very useful, but it would be better to make 2B actually use the SBGN standard. I note that it would require only one change to make it compliant: the arrows should not point into the reaction nodes, only into the species nodes. Then you couldsay that this representation follows the standard (rather than it is similar to).

4- The MAPK are not only significant clinically for being involved in many cancers, but also in inflammation (P38 pathway).

5- Fig. 4, I note that in panel B you represent the phosphatase in the direction which seems to be the inverse of the biological flow. Normally phosphatases go from XP -> X rather than X -> XP (thermodynamically this direction is very unfavorable unless in the presence of a massive and unphysiological concentration of Pi... It would be much better if you would represent 4B in the physiological direction.

References that I cited which are not in the manuscript:

- Henry CS, Jankowski MD, Broadbelt LJ, Hatzimanikatis V. (2006) Genome-scale thermodynamic analysis of Escherichia coli metabolism. Biophys J. 90(4):1453-61.

- Soh KC, Miskovic L, Hatzimanikatis V. (2012) From network models to network responses: integration of thermodynamic and kinetic properties of yeast genome-scale metabolic networks. FEMS Yeast Res. 12(2):129-43.

- Lubitz T, Schulz M, Klipp E, Liebermeister W (2010) Parameter balancing for kinetic models of cell metabolism. J Phys Chem B 114: 16298-16303.

- Stanford NJ, Lubitz T, Smallbone K, Klipp E, Mendes P, Liebermeister W (2013) Systematic construction of kinetic models from genome-scale metabolic networks PLoS ONE 8:e79195

- Del Vecchio D,Ninfa AJ, Sontag ED (2008) Modular cell biology: retroactivity and insulation. Mol Syst Biol 4:161.

Reviewer #2: The manuscript by Michael Pan et al describes a physics-based approach for modularity in biological modeling. They describe modular construction of models based on bond graphs.

The first part of the manuscript described bond graphs, defining reaction network (RN) through thermodynamic (TD) notations, expressing all kinetic rate constants through TD parameters. Mathematically it means introducing a larger number of parameters that are energy-based and independent, contrary to dependent parameters in a reaction network. Visually it corresponds to decomposing the RN (usually represented as a bipartite graph of species and reaction nodes) into 4-partite graph (separate nodes for species, reactions, mass-conservation check-points and energy conservation check-points). This part is clearly written and very useful.

The second part describes using bond graphs to construct and combine modules into larger biomedical models. This is a nice and comprehensive description of modular approach, but the authors have not compared their approach with SBML hierarchical model composition package. It is described at http://co.mbine.org/standards/sbml/level-3/version-1/comp. Although SBML hierarchical package is too technical and not written for a general audience, it introduces the same concepts of black-box versus white-box encapsulation and “ports” as interfaces between a module and its containing model. A modular SBML-based approach is implemented in iBioSim (https://async.ece.utah.edu/tools/ibiosim/). Thus, I would appreciate if authors could address the following comments:

1. It would be useful to have more input on what in Figures 4, 6 and 7 is different from a regular modular composition of RNs using ports.

2. When reading manuscript beyond page 13, energy conservation is not mentioned anymore and bond graphs look as RN defined with new TD parameters. Is it true?

3. Is all TD machinery internal for each module and not exposed? Can we “mix and match” by defining some modules in terms of TD and some in terms of RN?

4. Can benchmarking rate laws and effects of perturbations be computed in a regular RN simulator (COPASI, Tellurium, PySB, etc) after a simple change of parameters?

5. Compare their modular approach with SBML hierarchical composition.

6. Discuss whether bond graphs can be described as an extension to SBML standard.

Reviewer #3: This is a nicely written paper on the use of bond-graphs to build composable systems biology models. I only have a couple of minor points to make and for the authors to clarify in the paper:

1. I assume the authors know that SBML supports a white/black box approach to composition in the hierarchical package:

https://pubmed.ncbi.nlm.nih.gov/26528566/

This should be cited, perhaps on page 6

2. One important question is whether bond-graph representations (I think they are stored as json files?) could be converted to SBML given the ubiquity of simulators for this format. One could in fact imagine the bondgraph description being a higher level of description that can be converted to a simpler SBML file where the bond graph representation is the model that is edited and improved. The SBML this acts as an intermediary. Or do the authors envisage adding new annotations to SBML (or a package) to extend SBML to store bondgraph models?

3. One the major disadvantages of the bondgraph approach is its verbosity (plus the terminology is not as well-known as the more simple kinetic formalisms). The same problem arises with Vivarium but this tool has also made the fatal mistake of mixing specification with implementation. Although perhaps not applicable to this manuscript, software tooling will be a major impediment to the widespread use of bond graphs. I looked at bondgraphstools and there might be a case of adding a higher layer of abstraction because currently is seems quite laborious to build even small models, I also wonder whether in its current state it could scale to whole-cell models (the same applies to vivarium) although the composition feature would help. There might need to be additional abstraction levels.

3. Page 10, the authors mention K as the thermodynamic parameter, what is that exactly? I see that they cite the appendix for further info but a mention in the main text as to what K is would be useful.

4. Page 21, second line where they site ref 15 with respect to linearization due to feedback. I believe this was first mention by Sauro and Ingalls (https://arxiv.org/abs/0710.5195, MAPK Cascades as Feedback Amplifiers) and has since been brought up since by other authors.

5. Page 25 second paragraph it says that the mass-action model reached a different steady state. I don’t think that can be true because at steady state the mass-action and Michalis formalisms as exactly the same. It must be that the bondgraph model is someone different. Differences in time scales I can understand but not differences in the steady state.

**********

Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Pedro Mendes

Reviewer #2: No

Reviewer #3: No

Figure Files:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Data Requirements:

Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example in PLOS Biology see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5.

Reproducibility:

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

References:

Review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript.

If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1009513.r003

Decision Letter 1

Anders Wallqvist, Daniel A Beard

30 Sep 2021

Dear Dr. Pan,

We are pleased to inform you that your manuscript 'Modular assembly of dynamic models in systems biology' has been provisionally accepted for publication in PLOS Computational Biology.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology. 

Best regards,

Anders Wallqvist

Associate Editor

PLOS Computational Biology

Daniel Beard

Deputy Editor

PLOS Computational Biology

***********************************************************

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1009513.r004

Acceptance letter

Anders Wallqvist, Daniel A Beard

10 Oct 2021

PCOMPBIOL-D-21-01387R1

Modular assembly of dynamic models in systems biology

Dear Dr Pan,

I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Zsofia Freund

PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Text. Supplementary material.

    Appendix A: Details of parameter identification for the MAPK cascade and glycolysis. Appendix B: Outline of a white-box approach to modelling the MAPK cascade. Appendix C: Tutorial on basic BondGraphTools functionality. Fig A: Comparison of the bond graph model of the MAPK cascade to the Huang and Ferrell model. Fig B: A white-box approach to defining a model of the MAPK cascade. Fig C: A bond graph model of the reaction A+BC. Fig D: BondGraphTools rendering of the reaction A+BC. Fig E: Simulation of the reaction A+BC.

    (PDF)

    S1 Table. Parameters for the MAPK cascade.

    (XLSX)

    S2 Table. Parameters for the glycolysis model.

    (XLSX)

    Attachment

    Submitted filename: Response combined.pdf

    Data Availability Statement

    The code for this manuscript is available at https://github.com/mic-pan/Modularity-SysBio.


    Articles from PLoS Computational Biology are provided here courtesy of PLOS

    RESOURCES