Abstract
Systems Biology has established numerous approaches for mechanistic modeling of molecular networks in the cell and a legacy of models. The current frontier is the integration of models expressed in different formalisms to address the multi-scale biological system organization challenge. We present MUFINS (MUlti-Formalism Interaction Network Simulator) software, implementing a unique set of approaches for multi-formalism simulation of interaction networks. We extend the constraint-based modeling (CBM) framework by incorporation of linear inhibition constraints, enabling for the first time linear modeling of networks simultaneously describing gene regulation, signaling and whole-cell metabolism at steady state. We present a use case where a logical hypergraph model of a regulatory network is expressed by linear constraints and integrated with a Genome-Scale Metabolic Network (GSMN) of mouse macrophage. We experimentally validate predictions, demonstrating application of our software in an iterative cycle of hypothesis generation, validation and model refinement. MUFINS incorporates an extended version of our Quasi-Steady State Petri Net approach to integrate dynamic models with CBM, which we demonstrate through a dynamic model of cortisol signaling integrated with the human Recon2 GSMN and a model of nutrient dynamics in physiological compartments. Finally, we implement a number of methods for deriving metabolic states from ~omics data, including our new variant of the iMAT congruency approach. We compare our approach with iMAT through the analysis of 262 individual tumor transcriptomes, recovering features of metabolic reprogramming in cancer. The software provides graphics user interface with network visualization, which facilitates use by researchers who are not experienced in coding and mathematical modeling environments.
Introduction
During the last two decades, Systems Biology has established numerous approaches to represent molecular biology knowledge in the form of mechanistic molecular interaction network models. This is accompanied by a legacy of thousands of experimentally validated models. Stochastic kinetic simulations provide the most detailed quantitative description, where individual reactive collisions occurring at exact times in single cells are simulated by the Gillespie algorithm.1 The Ordinary Differential Equation (ODE) formalism applied to study the time evolution of average molecular concentration in cellular population is a workhorse of quantitative modeling.2 Although quantitative biology is developing rapidly, it is still not possible to parameterize quantitative dynamic models of whole-cell scale networks and simulate genotype–phenotype relationship. Rather, Constrained Based Modeling (CBM) has achieved spectacular success in modeling metabolism at the full genome scale.3,4 The metabolic network can be modeled at quasi-steady state owing to timescale separation from gene regulation. This enables exploration of metabolic flux distributions consistent with stoichiometric and thermodynamic constraints as well as flux measurements and constraints formulated according to ~omics data on enzymatic gene expression.5 Modeling of large-scale gene regulatory and signaling networks is much more challenging and a number of qualitative simulation approaches have been formulated, such as analysis of steady states in logical hypergraphs,6 enumeration of states in dynamic Boolean models,7 Monte Carlo exploration of the alternative molecular transition sequences constrained by network connectivity expressed in Petri Net formalism.8,9 Application of these methods has led to a legacy of models describing different levels of cellular organizations in different modeling frameworks. A large proportion of these models are already expressed in Systems Biology Markup Language (SBML)10 and over 1,000 literature-based models are available in the most recent version of BioModels.11 Notably, BioModels is attracting interest from the Physiologically Based Pharmacokinetic (PBPK) modeling field,12 where ODE models of substance concentrations in physiological compartments are routinely used to inform drug development in the pharmaceutical industry.
Given this state-of-the-art and the multi-scale nature of biological systems, the current challenge is integration of models expressed in different formalisms towards a multi-formalism simulation covering all scales of biological organization. The model of Mycoplasma is currently the most complete in silico cell,13 and demonstrates that mechanistic modeling of the genotype–phenotype relationship requires the integration of subsystem models describing different spatial and temporal scales constructed in different formalisms. The application of the multi-formalism approach toward modeling the relationship between genotype and human physiology is an emerging field and an important component of the personalized medicine challenge. For example, the integration of a PBPK model with human liver-specific GSMN has allowed robust prediction of therapeutic response in humans.14 In our recent work, we integrated a liver-specific GSMN with a qualitative model of a large-scale regulatory network,9 demonstrating how integration of gene regulation and metabolism in the context of physiological modeling can provide novel insights into toxicology, non-alcoholic liver disease and metabolic syndrome. This was achieved by the application of our novel Quasi-Steady State Petri Net (QSSPN)9 approach integrating CBM and qualitative, Monte Carlo simulation of a regulatory network represented as a Signaling Petri Net.15 Numerous alternate methods have been proposed to integrate GSMNs with ODE16 and logical17 dynamic models, as well as hybrid algorithms, bridging the gap between exact stochastic and ODE regimes in fully parameterized dynamic models.18 To fully realize the potential of computational modeling, it is now imperative to develop software packages that allow the development and simulation of multi-formalism models in a user interface that is approachable for experimental scientists.
Here, we present MUFINS (MUlti-Formalism Interaction Network Simulator) software and argue that it is the first general software with Graphics User Interface (GUI) capable of integrating models developed in all major modeling frameworks of Computational Systems Biology. This is demonstrated through Use Cases; first, a model of the mammalian macrophage where linear inhibitory constraints are for the first time used to integrate a logical model of cellular signaling with the GSMN for the mammalian cell; second, a model of the human hepatocyte, where an extended version of our QSSPN method is used to integrate a human GSMN, a detailed kinetic model of cortisol signaling and a PBPK model; third, the analysis of clinical transcriptome data in the context of human GSMN using a novel variant of ~omics data integration approach. The Use Cases involve laboratory experiments to demonstrate how experimental biologists can utilize the MUFINS GUI in the iterative cycle of model development, hypothesis generation, experimental validation and model refinement. Moreover, a comprehensive comparison with other software shows that MUFINS implements the largest number of CBM methods under a GUI with interactive network visualization. Thus, MUFINS is uniquely suited for the development and simulation of multi-formalism models by a wide user community including experimental scientists with no/limited experience with programmatic interfaces and mathematical modeling environments.
Software overview
Figure 1 provides an overview of the MUFINS architecture. All simulations are performed in sfba and qsspn, two computational engines written in C++, which are run either through a GUI or in command line mode. The sfba code originates from our SurreyFBA software19 and implements a comprehensive set of CBM approaches. The major new multi-formalism simulation feature added is linear inhibitor and activator constraints, which are described and validated in Use Case 1. In addition to the basic CBM methods available in SurreyFBA, the sfba engine of MUFINS implements a large number of ~omics data integration algorithms such as iMAT,20 GIMME,20 GIM3E,21 GNI22 and our DPA.23 The GNI and DPA features of sfba have been already used to study Mycobacterium tuberculosis metabolism.23–25 Furthermore, we include Fast iMAT a new variant of iMAT approach demonstrated in Use Case 3 below. sfba uses the GLPK library for Linear Programming (LP) and Mixed Linear Integer Programming (MILP) calculations. However, as MILP implementation in GLPK is inefficient, a version of sfba ready for use with Gurobi library is provided to facilitate the application of MILP-based protocols (e.g., iMAT) on GSMN models.
The qsspn is a computational engine for integration of dynamic and CBM models. It implements QSSPN approach,9 where a dynamic model constructed in Petri Net (PN) formalism23 is connected to a steady-state Flux Balance Analysis (FBA4) through PN places setting FBA bounds and requesting evaluation of objective functions. Previously, we integrated a qualitative PN model with a hepatocyte GSMN and explored its qualitative dynamic behaviors through Monte Carlo simulation.15 Here we present a new version of the engine, providing full support for continuous Petri Nets, implementing ODE models and stochastic Petri Nets representing exact stochastic simulations. Use Case 2 demonstrates the first integration of quantitative models of gene regulation with human Recon2 GSMN coupled to an ODE model of nutrients in physiological compartments. The qualitative simulation features are also further extended by implementation of QSSPNclient, a version of qsspn, which speeds up sampling of alternative qualitative trajectories by executing FBA on a server that tabulates repeated evaluations of GSMN objective functions for re-occurring sets of bounds. The new version of the QSSPN algorithm, qsspn solver and QSSPNclient are described in detail as Supplementary Information (page 32).
Connectivity of a Petri Net representing the dynamical part of the QSSPN model and its interactions with GSMN can be graphically defined using the Petri Net editor Snoopy:25 while Snoopy provides PN simulation, it does not implement QSSPN and thus is used here exclusively as an external editor of PN connectivity. Parameters specific to qsspn simulation can be provided either through the Comments section of place and transition objects, or added later through the MUFINS GUI. The Snoopy XML file is parsed by the spept2qsspn Python script, which generates the qsspn input file in qsspn’s native, human readable, text-file format. To facilitate future integration with other interfaces, a JSON-based input file format is also available.
JyMet is the GUI of MUFINS, and provides an interface for multi-formalism simulations to users who are not familiar with programming, mathematical modeling environments or working in command line. JyMet code is written in Jython (Java/Python), and originates from SurreyFBA.19 Here, it has been significantly extended through addition of QSSPN simulation and improved visualization of metabolic networks. JyMet integrates all elements of the MUFINS environment (Figure 1). It loads a Snoopy file defining PN connectivity and provides a table interface for definition of QSSPN specific parameters: place and transition types, lookup tables linking dynamic and GSMN variables, and arithmetic formulas describing complex rate laws. We note that users can build dynamic model within the QSSPN spreadsheet interface of JyMet without using Snoopy, but this is unlikely to be a preferable solution, due to the advantages of PN graphical modeling. Both sfba and qsspn can be run from JyMet and results are loaded to spreadsheets and plotting functions. Each table in JyMet can be exported in tab-separated format for further analysis with other software. Full features of JyMet are described in detail in software documentation and tutorials (MUFINS1.0_Doc/). Use Case 1 shows application of network visualization for exploration of alternative model solutions during the iterative cycle of network reconstruction, simulation, experimental validation and refinement.
The sfba, qsspn and spept2qsspn tools can be run as stand-alone command line tools without external dependencies. Thus MUFINS is ideal for integration with web and desktop interfaces as well as computational pipelines. Use Case 3 shows integration of sfba engine with computational pipeline for analysis of clinical transcriptome data. Previous version of sfba engine19 has been already used in web interfaces supporting development and publication of bacterial pathogen GSMN models26,27 and as one of computational engines in METEXPLORE web environment.28
MUFINS is an open source software, distributed under the GNU GPL license. It can be run on OS X, Windows and Linux sand the majority of calculations can be run without dependencies. The methods applying MILP to genome-scale models are likely to be very computationally expensive unless the Gurobi library is installed.
Use Cases
Table 1 summarizes Use Cases illustrating the multi-formalism simulation abilities within MUFINS, including previously published works.
Table 1. Summary of simulation formalisms applied in Use Cases.
Use Case | Formalisms | Reference |
---|---|---|
Use Case 1: linear inhibitory constraints | CBM, logical hypergraph, inhibitor and activator constraints | This work |
Use Case 2: integration of regulatory networks and GSMNs | CBM, ODE, Gillespie, physiological compartment models | This work and ref. 9 |
Use Case 3: prediction of metabolic landscapes | CBM, congruency approach to analysis of ~omics data in the context of GSMNs | This work and ref. 45 |
Abbreviations: CBM, constraint-based modeling; GSMN, Genome-Scale Metabolic Network; ODE, Ordinary Differential Equation.
Use Case 1: whole-cell metabolic reprogramming by signaling and gene regulatory networks in the mammalian macrophage
An important innovation in MUFINS is the ability to include stimulation and inhibition reactions within the genome-scale metabolic network. To demonstrate the utility of this approach to derive biological insights we present a use case entailing integration of gene and signaling regulatory networks with genome-scale metabolism for the mammalian macrophage.
We applied MUFINS to integrate a logical hypergraph6 model of the large-scale regulatory network responsible for the pathogen response of mammalian macrophage with the published GSMN of the mouse RAW264.7 macrophage cell line.29 A signal transduction network of 286 interactions and 205 species was reconstructed in CellNetAnalyzer using logical hypergraph formalism:6 A manually created graph image is shown in Figure 2, with full description of model construction described in Supplementary Information, and a detailed description of species, logical formulas and literature references in Supplementary Table 1. Briefly, species within the regulatory network represent protein kinases, transcription factors, genes, antigens, cytokines and cellular behaviors (e.g., apoptosis) involved in the response of macrophages to bacterial pathogens.
To integrate signaling and metabolic networks, we have translated the logical hypergraph to a stoichiometric model with inhibitory constraints. Representation of inhibition in the CBM framework has always been a challenge, with proposed solutions generally being computationally expensive methods based on MILP.30 We have extended the approach of Vardi and colleagues31 and represented inhibition by linear constraints, enforcing a reciprocal relation between inhibitor production and inhibited flux. Differences between the original formulation and our extended implementation are presented as Supplementary Information. An example of the logical hypergraph conversion to MUFINS reaction formulas is shown in Figure 2. At the software level this is achieved by exporting CellNetAnalyzer logical rules as a text format and then using the text-replace function in Excel to change the formula format and create a MUFINS reaction table that can be opened by the JyMet GUI. We note these steps do not require programming experience. Subsequently, the model was edited in JyMet to define input fluxes. Flux Variability Analysis was undertaken to identify spurious activations in the model with all input fluxes constrained to 0. The details of these steps are given in Supplementary Information (page 7), where we also compare our model pre-processing steps with the much more complex, MILP-based protocol used by Vardi et al.31
A unique feature of MUFINS is reconstruction of models that combine GSMNs and regulatory networks with linear inhibitory constraints. To demonstrate this capability we integrated the signal transduction network model described above with the published GSMN of the RAW264.7 macrophage cell line.29 We have focused on nitric oxide production, a major metabolic function of macrophages interacting with bacterial pathogens. Our regulatory network model describes the regulation of the inducible nitric oxide synthase (iNOS) gene, which we have added as an activator for the NO synthase reaction in the GSMN (reaction id: R_NOS2). We used a linear activator constraint, as described in Supplementary Information (page 31), to ensure that stoichiometry of the R_NOS2 reaction is not affected. Figure 3a shows a portion of the total regulatory network, specifically the immediate signaling pathways regulating iNOS gene expression. We used this integrated model to simulate nitric oxide production in response to lipopolysaccharide (LPS). Following Bordbar and colleagues,29 we constrained biomass reaction flux to 0.0281/h, which reproduces experimentally measured growth rates. We calculated the maximal extracellular nitric oxide production when LPS input flux to the regulatory network was opened or closed and when phosphorylation of ERK by MEK1 was inhibited or not. Results obtained from these four simulations are shown on Figure 3b, and demonstrate that nitric oxide is produced only when LPS activates the regulatory network, while the inhibitor does not influence results. The maximal flux through R_NOS agrees with the value of 0.0399 mmol/gDW/h reported in original publication, thus verifying SBML import of the GSMN model to JyMet. We note that while in this Use Case, the regulatory network regulates only one enzyme, this is an example of major global metabolic reprogramming. The production of large amounts of nitric oxide in response to pathogen requires both precursors and energy, and the GSMN model accounts for stoichiometry of all reactions linking medium nutrients to metabolic output. Moreover, the GSMN assures that the cell satisfies other metabolic demands, such as the demand for biomass production whereby the GSMN model accounts for global stoichiometry of providing cellular components and maintaining energy during induced nitric oxide production.
To compare the model predictions with the experimental data, we treated RAW264.7 macrophages with LPS and a MEK inhibitor and measured nitrate concentration in the medium. As nitrate can be produced only by the non-enzymatic conversion of nitric oxide from cells, and there is no nitrate consumption in the medium, concentrations are proportional to the nitric oxide production flux. Figure 3c shows that the model correctly predicts that LPS is obligate for the production of nitric oxide in RAW264.7 macrophages. However, the model did not predict the decrease in nitric oxide production caused by MEK inhibition. To explore this inconsistency, we used the interactive network visualization available in JyMet (Figure 3a) to examine example FBA solutions. Multiple pathways lead to iNOS activation, some of which are not dependent on MEK. In the model, the ‘iNOS’ substance representing activity of the iNOS gene is produced by three reactions representing the activity of ERK1/2-, HIF1- and JNK-dependent regulation of iNOS gene expression. Each of these regulators is, activated by different upstream signaling cascades. We used JyMet to interactively simulate and visualize different scenarios and concluded that the experimental results can be replicated if the following assumptions are made: (i) ERK induces a more potent activation of the iNOS gene than JNK and HIF1 (ii) MEK1 is a more potent ERK kinase than PKC. These assumptions were introduced to the model by setting flux bounds of (0, 0.005) for the transitions ‘JNK-> iNOS’, ‘HIF1->iNOS’ and ‘PKC_a_b=ERK1/2’ (Figure 3a, right panel). The upper bound is arbitrary, and selected to ensure that the flux towards ‘iNOS’ via ‘HIF1’ or ‘JNK’ reaches only a fraction of value required for maximal activation of R_NOS, with the remaining activation occurring via ERK1/2 regulation. This refined model is now able to reproduce the decreased, but not complete inhibition, of nitric oxide production by a MEK inhibitor (Figure 3d). A full description of this cycle of prediction, experimental testing and model refinement is presented as Supplementary Information (page 11), detailing how MUFINS and JyMet aid the iterative refinement cycle required during model development. These data support the assumptions above being one possible mechanistic solution to reproduce the observed biological phenotype. However, we note that further experimental confirmation is required to confirm the predicted biological insight. In a full iterative cycle of prediction, experiment and model refinement, multiple molecular targets would be subject to independent experiment verification before the model was validated. Here, we show one full cycle of simulation and experiment to demonstrate how the JyMet interface is used in this iterative model development process.
To summarize, we present the first linear model for steady-state simulation of networks integrating signaling, gene regulation and whole-cell metabolism in a mammalian cell. Moreover, we present the first simulation of perturbation of a global metabolic output by a signaling network inhibitor, and demonstrate this is consistent with experimental data. The ability to formulate hypotheses in terms of continuous ‘regulatory strength’ is demonstrated. This offers significant advantages over MILP-based approaches such as SR-FBA,32 where the regulatory network is used exclusively to formulate Boolean, on/off constraints. Finally, the graphics user interface JyMet allows an interactive exploration of combined signaling and metabolic flux distributions that is easily approachable by non-specialists. Together, these tools provide an ideal platform for non-specialists to generate mechanistic hypotheses based upon the interaction of gene and signal regulatory networks with genome-scale metabolism. These hypotheses can then drive experimental testing, enhancing our ability to identify novel biological insights.
Use Case 2: kinetic model of cortisol signaling integrated with dFBA simulation of human GSMN
An important challenge in computational biology is the generation of large-scale models that are able to reproduce diverse biological functions. One approach to achieve this aim is the integration of validated small models (or modules) to form larger networks. A major consideration in such integration is the ability to combine models across different modeling formalisms and biological scales. In this Use Case, we demonstrate the utility of MUFINS for the generation and simulation of such multi-formalism, multi-scale models.
Cortisol acts as an important signaling molecule within the body, with roles in circadian biology and the response to stress episodes. The level of cortisol in the body is interpreted at the cellular level through interaction with three nuclear receptors: the glucocorticoid receptor; the mineralocorticoid receptor; the pregnane-X receptor. These signals are integrated and produce a global metabolic shift corresponding to the current cortisol level. To reproduce such a complex biological phenomenon, it is necessary to combine multiple signaling cascades with genome-scale reconstructions of metabolism. Here, we demonstrate the application of MUFINS for such a multi-formalism simulation, integrating a detailed kinetic model of cortisol signaling in the liver, a genome-scale model of liver metabolism, and an ODE model of glucose and lactate dynamics in the blood (Figure 4). This is the first simulation integrating a human GSMN, physiological level ODE model and detailed kinetic model of an intracellular regulatory network. Thus, this Use Case demonstrates that MUFINS provides a unique tool for the integration of PBPK models with the mechanistic models of molecular networks operating in mammalian tissues.
The model is depicted in Figure 4a. We represent our previously published kinetic model of cortisol signaling in liver33,34 as a Petri net (PN) using Snoopy software.25 Size and color of place/transition symbols was used to mirror Systems Biology Graphical Notation35 molecule types as well as QSSPN specific place/transition types. The PN transition rates were defined using the ODE terms of the kinetic model. The PN places represent molecular concentrations. This dynamic model of cortisol signaling was linked to the human GSMN Recon2, with the CYP3A4 enzyme used as the QSSPN constraint place. The details of the cortisol signaling model and its coupling to GSMN are available in Supplementary Information (page 16).
To model whole-cell metabolism of hepatocytes, we used the community-based Recon2 GSMN.36 This model incorporates liver-specific reactions from the HepatoNet1,37 but is a much more extensive reconstruction of cellular metabolism. Exchange fluxes were constrained using the HepatoNet1 Physiological Import and Export set. The objective function was set to glucose regeneration from lactate, a major physiological function of the liver, where blood glucose and lactate concentrations are ODE variables. The dFBA simulation is implemented using place and transition within the QSSPN, rather than coded as a separate approach. We have further capitalized on the flexibility of the PN representation to create a timer administering a cortisol infusion after 500 min. This demonstrates how multi-formalism simulations in MUFINS can include complex, time-dependent perturbations to the model such as boluses or cell division events. For clarity, the timer is contained within a coarse transition, with the sub-level shown as an inset to Figure 4a. A detailed description of QSSPN place and transition types is given as Supplementary Information (page 27), along with a detailed description of model construction (page 12).
Simulations of the systems response to cortisol infusion, plus experimental confirmation are shown in Figures 4b–g. As shown in Figure 4b, the glucose and lactate concentrations converge to their physiological levels of 4.45 and 1.48 mM, respectively38 and are maintained during cortisol infusion from 500 min onwards. Cortisol infusion from 500 min produces a number of effects, which are mediated through activation of the two cognate nuclear receptors for cortisol within the regulatory network, PXR and GR.39,40 The cortisol-mediated activation of PXR results in a predicted increase in expression of the CYP3A4 enzyme (Figure 4c), which is experimentally confirmed at the transcript, protein and activity levels in vitro using primary human hepatocytes (Figure 4d). CYP3A4 is one of the main enzymes responsible for the metabolic clearance of cortisol; thus, the combination of a constant infusion of cortisol, followed by increased metabolism, results in an elevated blood cortisol level (Figure 4e). We note that the transition to this new blood cortisol level is not instant, demonstrating a concentration spike consistent with the time delay caused by the de novo production (i.e., transcription and translation) of CYP3A4 protein. Finally, in Figure 4f, we demonstrate that the cortisol infusion propagates through the signaling and metabolic networks, leading to predicted changes in blood concentrations for other chemicals. Estradiol is an endogenous hormone important in a range of biological functions, including development of the secondary sexual organs in both sexes and proliferation during the menstrual cycle and pregnancy in women.41,42 It has considerable clinical application, most notably as a contraceptive, either as the native compound or synthetic derivatives.43 As shown in Figure 4e, following the cortisol infusion, predicted levels of blood estradiol drop rapidly, decreasing approximately by one-half within 500 min. This lower level is maintained throughout the period where CYP3A4 protein levels are elevated. To confirm this effect, we have measured the clearance rate for estradiol in naive primary human hepatocytes and compared it with hepatocytes pre-exposed to 1μM cortisol, demonstrating enhanced clearance in cortisol-exposed hepatocytes (Figure 4g). In addition, we note that activation of PXR has previously been linked with a number of drug–drug interactions with estrogens, demonstrating the extrapolation of these predictions to the clinical setting.40,44 It is important to note that estradiol concentration was not a variable of the detailed kinetic model, and was rather identified as a variable of interest by examination of perturbed GSMN fluxes. This demonstrates how integration of detailed kinetic models with GSMNs can lead to the identification of interactions of biological interest. As such, this approach has much potential for the prediction of clinically relevant drug-induced disruption of homeostasis, and drug–drug interactions.40,44
In summary, this Use Case demonstrates the utility of MUFINS to combine legacy models developed in different formalisms and link molecular network knowledge to quantitative data on substance concentration at physiological level. As such, MUFINS represents the first software to allow such multi-scale, multi-formalism simulations through a GUI approachable to the non-specialist.
Use Case 3: analysis of a clinical transcriptome data to understand in vivo tumor metabolism
An important branch of CBM methodology5 is dedicated to using ~omics data to create tissue and/or condition specific GSMNs. MUFINS is equipped with state-of-the-art CBM methods in this area; in addition, we have developed Fast iMAT, a new variant of the iMAT approach that is applicable to large ~omics sample numbers, where iMAT becomes impractical. We recently reported a preliminary version of Fast iMAT,45 dedicated to the analysis of expression data discretized to two states (absent or present transcript). Personalized GSMNs for 2,000 breast tumors were generated, identifying a low prognosis cluster with active serotonin production—an important biological insight.45 The first distribution version of MUFINS provides a mature version of the Fast iMAT algorithm. To demonstrate its utility, we analyze 262 previously unexamined paired clinical transcriptome samples.46 We demonstrate the significant upregulation of kyneurenine synthesis in tumor compared with normal breast tissue, and important pro-survival phenotype.47,48
Comparison of MUFINS with existing tools
A comparison of MUFINS with existing tools is presented as Supplementary Information. We conclude that MUFINS is currently the only software supporting integration of (i) exact stochastic, (ii) ODE, (iii) qualitative dynamic, (iv) logical steady state and (v) CBM models in a general software platform with GUI. The only alternative to achieve integration of this range of formalisms is coding of the model in mathematical modeling environment. Although this strategy has achieved success,13 it is not a plausible proposition for non-specialists who lack programming skills. Moreover, multi-formalism modeling in a mathematical language involves the implementation of a simulation algorithm dedicated to each model. In MUFINS, each model is run in the same QSSPN simulation algorithm, with multi-formalism functionality emerging from the interactions between the different types of Petri Net places and transitions that can be graphically assembled, leading to a combinatorial diversity of types of models that can be simulated. Using one algorithm and a few well-defined place and transition types provides clearer control and description of model assumptions than coding a different main simulation loop for each model. Also, the algorithms available in MUFINS are validated and optimized against the legacy of previous applications, while formulation, validation and description of a simulation algorithm dedicated to a particular model will take additional time. It is our experience that even scientists who can program will find it easier to implement complex multi-formalism models by connecting QSSPN places and transitions to off-the-shelf GSMN models imported to JyMet, rather then by the development of dedicated mathematical modeling code. Moreover, MUFINS provides a wide range of CBM methods that can be used to model GSMNs before their integration with dynamic models. As Supplementary Information, we perform the largest review of CBM methods conducted to date (165 methods and 30 software packages), demonstrating that MUFINS is the second most general CBM software after COBRA toolbox49 in terms of the number of methods implemented. However, it provides the largest number of CBM methods under GUI with interactive network visualization. Finally, all CBM methods can now be applied to models formulated with inhibitor and activator constraints, which again enables execution of new CBM protocols without the need of coding (e.g., iMAT applied to models involving steady-state regulatory network).
Future directions
We will continue to develop MUFINS towards improved interoperability with other tools and model databases, a key for model integration. Although currently QSSPN can be simulated only in MUFINS, definition of this multi-formalism framework in SBML will motivate the development of alternative tools. As shown in Figure 1, QSSPN models can currently be exported into two separate SBML files representing the CBM and PN parts of the model. We intend to represent QSSPN lookup tables, reset transitions and flux monitors with existing SBML objects, or to develop a bespoke SBML package. Furthermore, we will work towards improving integration of SBML files imported from public repositories into multi-formalism models in the JyMet GUI. This will involve further work on network visualization in JyMet, providing a graph editor dedicated to connecting different mechanistic models by common variables. We also plan to develop interoperability between MUFINS and Garuda (http://www.garuda-alliance.org) to make full use of our multi-formalism simulation tool within this established alliance of systems biology software. This will be facilitated by design of our software (Figure 1) providing stand-alone simulation engines ideal for embedding in different interfaces.
Conclusions
The multi-scale nature of complex biological systems is currently the major challenge preventing their computational understanding. A number of theoretical frameworks have achieved spectacular successes in mechanistically modeling different levels of cellular organization such as metabolic, signaling and gene regulatory networks. However, in a real cell, all these processes proceed simultaneously, and without multi-scale simulation the insight and predictive power provided by models will be limited. We present MUFINS, the first general software addressing this multi-formalism simulation challenge. Novel algorithms available in MUFINS provide solutions for three major technological challenges: (i) integration of CBM and hybrid stochastic/deterministic dynamic simulation; (ii) CBM of integrated signaling/metabolic models; (iii) analysis of large clinical transcriptome studies in the context of GSMN. This is demonstrated through three Use Cases, where we simulate models of mammalian systems composed of: GSMNs, logical hypergraph models of signaling, kinetic models of gene regulation and PBPK models. We experimentally validate model predictions and show how our software can aid experimental scientists through an iterative cycle of hypothesis generation, experimentation and model refinement. Because the need for a multi-scale, multi-formalism approach is currently most recognized in the context of Personalized Medicine and Quantitative Systems Pharmacology, we focused our Use Cases on mammalian cells. However, mechanistic simulation is a major tool in Synthetic Biology, where MUFINS will be ideal to integrate detailed models of genetic circuits with GSMNs and further extend molecular cell factory models to include bioreactor mass transfer. Therefore, we believe that multi-formalism simulation with MUFINS will find broad application in mechanistic modeling of biological systems.
Software availability
MUFINS is free, open source software available under GNU GPL license from:
MUFINS home page: http://sysbio3.fhms.surrey.ac.uk/mufins/
GitHub repository: https://github.com/kierzek/MUFINS
Acknowledgments
Development of MUFINS was funded by BBSRC TRDF grant BB/K015974/1 to AMK. AvK, SK, GRS and AMK were supported by EraSysBio+/BBSRC TB-HOST-NET grant BB/I00453X/1. The research leading to results obtained by CL, NS and DL has received support from the Innovative Medicines Initiative Joint Undertaking under grant agreement n 115308, resources of which are composed of financial contribution from the European Union's Seventh Framework Programme (FP7/2007-2013) and EFPIA companies in kind contribution. MG and JS were funded by National Science Centre (Poland) grant DEC-2012/07/D/ST6/02492. VL was funded by the BBSRC grant (BB/K501694/1).
Footnotes
Supplementary Information accompanies the paper on the npj Systems Biology and Applications website (http://www.nature.com/npjsba)
The authors declare no conflict of interest.
References
- Gillespie, D. T. Exact stochastic simulation of coupled chemical-reactions. J. Phys. Chem. 81, 2340–2361 (1977). [Google Scholar]
- Tyson, J. J., Chen, K. & Novak, B. Network dynamics and cell physiology. Nat. Rev. Mol. Cell Biol. 2, 908–916 (2001). [DOI] [PubMed] [Google Scholar]
- Bordbar, A. et al. Constraint-based models predict metabolic and associated cellular functions. Nat. Rev. Genet. 15, 107–120 (2014). [DOI] [PubMed] [Google Scholar]
- Orth, J. D., Thiele, I. & Palsson, B. O. What is flux balance analysis? Nat. Biotechnol. 28, 245–248 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lewis, N. E., Nagarajan, H. & Palsson, B. O. Constraining the metabolic genotype-phenotype relationship using a phylogeny of in silico methods. Nat. Rev. Microbiol. 10, 291–305 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Klamt S. et al. A methodology for the structural and functional analysis of signaling and regulatory networks. BMC Bioinformatics 7, 56 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Remy, E. et al. A description of dynamical graphs associated to elementary regulatory circuits. Bioinformatics 19, II172–II178 (2003). [DOI] [PubMed] [Google Scholar]
- Ruths D. et al. The signaling petri net-based simulator: a non-parametric strategy for characterizing the dynamics of cell-specific signaling networks. PLoS Comput. Biol. 4, e1000005 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fisher, C. P. et al. QSSPN: dynamic simulation of molecular interaction networks describing gene regulation, signalling and whole-cell metabolism in human cells. Bioinformatics 29, 3181–3190 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hucka, M. et al. The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models. Bioinformatics 19, 524–531 (2003). [DOI] [PubMed] [Google Scholar]
- Chelliah, V. et al. BioModels: ten-year anniversary. Nucleic Acids Res. 43, D542–D548 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jones, H. M. & Rowland-Yeo, K. Basic concepts in physiologically based pharmacokinetic modeling in drug discovery and development. CPT Pharmacometrics Syst. Pharmacol. 2, e63–e63 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Karr, J. R. et al. A whole-cell computational model predicts phenotype from genotype. Cell 150, 389–401 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Krauss M. et al. Integrating cellular metabolism into a multiscale whole-body model. PLoS Computat. Biol. 8, e1002750 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ruths, D., Nakhleh, L. & Ram, P. T. Rapidly exploring structural and dynamic properties of signaling networks using PathwayOracle. BMC Syst. Biol. 2, 76 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Covert, M. W. et al. Integrating metabolic, transcriptional regulatory and signal transduction models in Escherichia coli. Bioinformatics 24, 2044–2050 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Covert, M. W., Schilling, C. H. & Palsson, B. Regulation of gene expression in flux balance models of metabolism. J. Theor. Biol. 213, 73–88 (2001). [DOI] [PubMed] [Google Scholar]
- Puchalka, J. & Kierzek, A. M. Bridging the gap between stochastic and deterministic regimes in the kinetic simulations of the biochemical reaction networks. Biophys. J. 86, 1357–1372 (2004). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gevorgyan, A. et al. SurreyFBA: a command line tool and graphics user interface for constraint-based modeling of genome-scale metabolic reaction networks. Bioinformatics 27, 433–434 (2011). [DOI] [PubMed] [Google Scholar]
- Becker S. A. & Palsson B. O. Context-specific metabolic networks are consistent with experiments. PLoS Computat. Biol. 4, e1000082 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schmidt, B. J. et al. GIM(3)E: condition-specific models of cellular metabolism developed from metabolomics and expression data. Bioinformatics 29, 2900–2908 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Diamant, I. et al. A network-based method for predicting gene-nutrient interactions and its application to yeast amino-acid metabolism. Mol. Biosyst. 5, 1732–1739 (2009). [DOI] [PubMed] [Google Scholar]
- Breitling, R. et al. A structured approach for the engineering of biochemical network models, illustrated for signalling pathways. Brief. Bioinform. 9, 404–421 (2008). [DOI] [PubMed] [Google Scholar]
- Mendum T. A. et al. Lipid metabolism and Type VII secretion systems dominate the genome scale virulence profile of Mycobacterium tuberculosis in human dendritic cells. BMC Genomics 16, 372 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rohr, C., Marwan, W. & Heiner, M. Snoopy-a unifying Petri net framework to investigate biomolecular networks. Bioinformatics 26, 974–975 (2010). [DOI] [PubMed] [Google Scholar]
- Mendum T. A. et al. Interrogation of global mutagenesis data with a genome scale model of Neisseria meningitidis to assess gene fitness in vitro and in sera. Genome Biol. 12, R127 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Beste D. J. V. et al. GSMN-TB: a web-based genome scale network model of Mycobacterium tuberculosis metabolism. Genome Biol. 8, R89 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cottret, L. et al. MetExplore: a web server to link metabolomic experiments and genome-scale metabolic networks. Nucleic Acids Res. 38, W132–W137 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bordbar A. et al. Model-driven multi-omic data analysis elucidates metabolic immunomodulators of macrophage activation. Mol. Syst. Biol. 8, 558 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dasika, M. S., Burgard, A. & Maranas, C. D. A computational framework for the topological analysis and targeted disruption of signal transduction networks. Biophys. J. 91, 382–398 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vardi, L., Ruppin, E. & Sharan, R. A linearized constraint-based approach for modeling signaling networks. J. Computat. Biol. 19, 232–240 (2012). [DOI] [PubMed] [Google Scholar]
- Shlomi T. et al. A genome-scale computational study of the interplay between transcriptional regulation and metabolism. Mol. Syst. Biol. 3, 101 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kolodkin, A. et al. Optimization of stress response through the nuclear receptor-mediated cortisol signalling network. Nat. Commun. 4, 1972 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kolodkin, A. N. et al. Design principles of nuclear receptor signaling: how complex networking improves signal transduction. Mol. Syst. Biol. 6, 446 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Le Novere, N. et al. The systems biology graphical notation. Nat. Biotechnol. 27, 735–741 (2009). [DOI] [PubMed] [Google Scholar]
- Thiele, I. et al. A community-driven global reconstruction of human metabolism. Nat. Biotechnol. 31, 419 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gille, C. et al. HepatoNet1: a comprehensive metabolic reconstruction of the human hepatocyte for the analysis of liver physiology. Mol. Syst. Biol. 6, 411 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kolker, E. et al. MOPED: Model Organism Protein Expression Database. Nucleic Acids Res. 40, D1093–D1099 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- El-Sankary, W., Plant, N. & Gibson, G. Regulation of the CYP3A4 gene by hydrocortisone and xenobiotics: role of the glucocorticoid and pregnane X receptors. Drug Metab. Dispos. 28, 493–496 (2000). [PubMed] [Google Scholar]
- Plant, N. The human cytochrome P450 3A sub-family: transcriptional regulation, inter-individual variation and interaction networks. Biochim. Biophys. Acta 1770, 478–488 (2007). [DOI] [PubMed] [Google Scholar]
- Losordo, D. W. & Isner, J. M. Estrogen and angiogenesis—A review. Arterioscler. Thromb. Vasc. Biol. 21, 6–12 (2001). [DOI] [PubMed] [Google Scholar]
- Zheng, J. et al. Estrogen and progesterone receptors, cell proliferation, and c-fos expression in the ovine uterus during early pregnancy. Endocrinology 137, 340–348 (1996). [DOI] [PubMed] [Google Scholar]
- Kuhl, H. Pharmacology of estrogens and progestogens: influence of different routes of administration. Climacteric 8, 3–63 (2005). [DOI] [PubMed] [Google Scholar]
- Plant, N. & Gibson, G. G. Evaluation of the toxicological relevance of CYP3A4 induction. Curr. Opin. Drug Discov. Dev. 6, 50–56 (2003). [PubMed] [Google Scholar]
- Leoncikas, V. et al. Generation of 2,000 breast cancer metabolic landscapes reveals a poor prognosis group with active serotonin production. Sci. Rep. 6, 19771 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Curtis, C. et al. The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups. Nature 486, 346–352 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- DiNatale, B. C. et al. Kynurenic acid is a potent endogenous aryl hydrocarbon receptor ligand that synergistically induces interleukin-6 in the presence of inflammatory signaling. Toxicol. Sci. 115, 89–97 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Opitz, C. A. et al. An endogenous tumour-promoting ligand of the human aryl hydrocarbon receptor. Nature 478, 197–203 (2011). [DOI] [PubMed] [Google Scholar]
- Hoops, S. et al. COPASI—a COmplex PAthway SImulator. Bioinformatics 22, 3067–3074 (2006). [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.