Abstract
Models that capture the chemical kinetics of cellular regulatory networks can be specified in terms of rules for biomolecular interactions. A rule defines a generalized reaction, meaning a reaction that permits multiple reactants, each capable of participating in a characteristic transformation and each possessing certain, specified properties, which may be local, such as the state of a particular site or domain of a protein. In other words, a rule defines a transformation and the properties that reactants must possess to participate in the transformation. A rule also provides a rate law. A rule-based approach to modeling enables consideration of mechanistic details at the level of functional sites of biomolecules and provides a facile and visual means for constructing computational models, which can be analyzed to study how system-level behaviors emerge from component interactions.
Keywords: rule-based modeling, systems biology, cell signaling
Introduction
A model is a representation or imitation of a system. A model is especially useful when the system of interest is both complex and difficult to probe experimentally. Models have long been used to study biological systems, which are among the most complex systems studied in science and engineering. Models take many different forms. Two types of models that are familiar to biologists are 1) a living system amenable to experimental study (e.g., an animal in which characteristic features of a human disease are recapitulated, a mammalian cell line, or a model organism, such as Escherichia coli, Saccharomyces cerevisiae or Caenorhabditis elegans) and 2) a diagram, which is often used to graphically summarize one’s understanding of how a system operates.
Aspects of these two types of models, living systems and diagrams, are combined in mathematical/computational models. (We will hereafter refer primarily to computational models, because the vast majority of mathematical models are now analyzed with the aid of computers, and models formulated as computer programs, meaning computational models, are becoming more common and almost always have an underlying mathematical basis.) Like a diagram, a computational model is based on what a modeler knows or hypothesizes about a system. A difference is the greater precision of a computational model. Whereas a diagram tends to be ambiguous and qualitative, a computational model requires a modeler to make concrete, quantitative statements about how a system is believed to behave. The precision and rigor required to build a computational model can help to identify gaps in knowledge and understanding. When such gaps are encountered, a modeler is required to generate hypotheses that fill the gaps. Knowledge and hypotheses are formalized and used to create a set of equations and/or a computer program, which can be used, for example, to simulate how the state of a system evolves over time in response to a stimulus or perturbation. In this way, a computational model shares an important feature of a living model system: both a computational model and a living model enable biological phenomena to be studied in a controlled manner, in a setting that allows questions about system behavior and mechanisms to be answered with relative ease. Both types of models suffer from the pitfall that the insights obtained may depend in an unknown way on the degree to which the model at hand represents the true object of study, the biological/physiological system of interest. Nevertheless, models are essential for progress in biological research, because many questions simply cannot be studied without them.
Biologists routinely use diagrams to reason about biological systems, especially cellular regulatory networks. In the context of discussions about cell signaling systems, these diagrams are sometimes called pathway maps, or simply maps. Pathway maps, which are commonly drawn ad hoc, despite attempts to establish standardized conventions [1], can be viewed as a type of conceptual model. A map can be used to summarize and convey one’s understanding of how a system works or to provide a broad overview of what is known about a system. A limitation of a map is that its interpretation ultimately relies on the intuition of its reader, which is a serious weakness given the complexity of cellular regulatory networks.
The complexity of cellular networks arises from several sources. One is the ubiquitous presence of feedback and feedforward loops, which can generate exotic nonlinear behaviors, such as hysteresis and oscillations [2, 3]. Another complicating feature is the tendency of the biomolecules that comprise cellular regulatory networks to interact with multiple binding partners [4, 5]. This molecular promiscuity, or polyspecificity, can generate crosstalk, meaning influences between networks with distinct functions. Another complication is that biomolecular interactions and their consequences are sensitive to quantitative factors, including the copy numbers of binding partners, binding cooperativity, and binding affinities and lifetimes [6, 7]. In addition to these quantitative properties of biomolecular interactions, which can be viewed as intrinsic properties, interactions are also likely to be affected by extrinsic properties, such as compartmentalization [8] and binding competition within a compartment [9], which is a function of network connectivity. The features mentioned above can either alone or in combination give rise to non-intuitive system behaviors. These complicating features limit our ability to understand and manipulate cellular regulatory networks, and they beckon us to consider models more sophisticated and powerful than diagrams, such as computational models.
Computational models go beyond diagrams, because the logical consequences of the information used to formulate a model can usually be elucidated through computer-aided calculations, or simulations. The information that serves as the basis for a model may include a synthesis of multiple mechanistic insights, an integration of different types of quantitative data (e.g., measurements of protein copy numbers and dissociation constants), and plausible assumptions, which are often necessary to fill knowledge gaps as noted earlier. The analysis of a computational model can identify a potential system behavior that one would not have expected and suggest experiments to test the non-obvious (i.e., interesting) predictions of the model. Validation of such predictions increases one’s confidence in the model. Falsified predictions can also be useful if they lead to revisions that increase a model’s reliability and generality.
Modeling is not a monolithic practice. Different techniques are used to address different problems. Even the same problems are commonly attacked using a variety of techniques. Some models are formulated to study specific systems, whereas others are formulated to study phenomena found in many systems. The physical and chemical principles captured in models vary, and models incorporate biological knowledge at differing resolutions, from abstract and phenomenological to detailed and mechanistic [10]. Some approaches, such as logical modeling [11], are unconnected or only loosely coupled to physicochemical principles. This diversity reflects not only a need for different approaches to address different questions but also a lack of consensus about best practices.
Arguably, the field of computational systems biology is at a stage where many different modeling approaches are being tried partly because traditional modeling approaches, meaning those long used, mostly in non-biological fields, have shortcomings when applied to biological systems. Models based on ordinary differential equations (ODEs) are arguably the most popular type of physicochemical model employed in biology [12]; these models represent a traditional modeling approach. ODEs have been used to model (bio)chemical systems since before the existence of molecules was widely accepted [13, 14]. Although ODE models are undoubtedly useful, the ODE modeling approach when applied to cellular regulatory networks entails complicating requirements that call for new ideas about how to model [15–17]. Namely, this approach requires one to enumerate the potentially populated chemical species in a system. Because biomolecules can usually be found in numerous states and complexes, it is desirable to avoid this requirement.
Rule-based modeling allows one to avoid enumerating the potentially populated chemical species in a system; it is an approach tailored for modeling a biomolecular interaction network, or indeed, any system where structured objects interact via component parts in a modular way. Here, we provide a brief introduction to rule-based modeling in systems biology, which is characterized by the use of local rules to represent biomolecular interactions [18–24]. In biology, rule-based modeling has most often been used to study cell signaling systems, but this modeling paradigm is more broadly applicable [23, 25–27]. Indeed, rule-based modeling has antecedents in physics, chemistry, and computer science [22, 28–32]. Rules formalize mechanistic understanding of biomolecular interactions according to conventions that allow a set of rules to serve as an executable model, meaning a model that can be used to obtain simulations of the behavior of the system represented by the model. Simulations may leverage different algorithms to capture system behavior at varying levels of abstraction of the underlying physiochemical laws. For example, simulations are possible that do and do not account for fluctuations in copy numbers, depending on how the simulation algorithm relates to the chemical master equation [33]. Once a rule-based model is specified, it is relatively easy to perform multiple types of simulations, which facilitates analysis. For example, one can readily switch between stochastic and deterministic simulation methods [34]. A rule-based approach to model specification is most justified when one is concerned with biomolecular interactions that depend on and impact sitespecific details, or biomolecular site dynamics [23], and moreover the interactions of interest are modular, meaning somewhat independent of each other.
Many rule-based models reported in the literature are simulated using an ODE solver operating on rule-derived ODEs, and so can be viewed as equivalent to ODE models that produce the same simulation results. However, rule-based models are specified in terms of rules instead of equations. The different approaches used to specify ODE and rule-based models are not superficial; the differences in the way these models are written (i.e., specified) encourage and usually entail different modeling assumptions.
This review, mainly through discussion of examples of rules and simple but complete executable models, is meant to help biologists, including those who do not specialize in modeling, create models for the systems they study. Models can serve as valuable reasoning aids, and with rule-based modeling, more widespread use of models is foreseeable. This review may also be helpful for experienced modelers new to rule-based modeling, or even practitioners of the approach, because the example models illustrate a variety of rule-based modeling capabilities and may therefore serve a valuable reference purpose. We will focus on one particular approach that can be used to specify and analyze rule-based models, the approach enabled by the BioNetGen language (BNGL) and BNGL-compatible software tools, such as BioNetGen [34–36]. Our focus will also be on models appropriate for well-mixed reaction compartments, because such models are generally easier to specify and analyze. With such models, there is no need to specify reaction compartment geometry or boundary conditions, for example. Later, we will mention other model-specification approaches and software tools (many with capabilities beyond what is offered by the BNGL framework) and point to sources of information about these methods. A tight focus is necessary to provide a practical introduction to rule-based modeling and concrete examples. After learning the principles taught here, we encourage readers to explore other approaches.
What makes models useful?
Models have the potential to aid biologists in several ways. First, a model can be used as a roadmap for experimental design. Experiments are used to test hypotheses, and the more complex a hypothesis, the more complicated and numerous the necessary experimental tests are likely to be. Models can potentially be used to carefully design experimental tests that would be optimal for supporting or disproving a hypothesis [37–39]. Second, models can also be used to reconcile surprising or conflicting data. In some cases, seemingly contradictory results may actually be compatible when quantitative details are taken into account [40]. Third, models can be used to consolidate knowledge about a system [41, 42]. Analogous to assembly of a jigsaw puzzle, models can be used to piece together available information to form a more complete picture of how a system works. Furthermore, by comparing model-based simulations to experimental data, discrepancies between understanding and reality can be identified, which may point to areas where additional pieces of information need to be discovered through further experiments.
What constitutes a useful model? The answer to this question depends partly, or even largely, on current modeling capabilities. For example, models that require computer-aided operations to obtain predictions of system behavior (e.g., generation of random numbers as in Monte Carlo methods [43]) only became useful after technological advances made analyses of such models practical. It has recently been argued that those pursuing useful models in biology should adhere to the following guidelines [44]: 1) “ask a question,” 2) “keep it simple,” and 3) “If the model cannot be falsified, it is not telling you anything.” The latter guideline represents valuable advice, because if a model cannot be falsified, it lies outside the realm of science. However, this guideline should perhaps be augmented with an admonition to seek models that make not only testable predictions but also unexpected predictions. The first two guidelines also represent valuable advice, although their interpretation should not be static. These guidelines must be tempered with an awareness of what current modeling methodology allows. For example, although “asking a question” is necessary as a starting point, it does not mean that a model need be designed/used for only a single purpose to the exclusion of model reuse [45], which can be valuable and time saving. Similarly, “keeping it simple” is partly defined by the tools and data available to a modeler; what appears complex using one modeling approach may actually be simple using another. For example, the development of rule-based modeling approaches, as we will see below, has made it relatively easy to specify models that would be cumbersome if not impossible to specify as ODEs.
Simplicity is viewed as a virtue, perhaps especially so in mathematical pursuits, such as modeling. In his famous lecture on mathematical problems, Hilbert stated [46], “what is clear and easily comprehended attracts, the complicated repels us.” A view among some, if not most, modelers is that many biochemical details elucidated by biologists are too complicated to contemplate including in models. Accordingly, many known biochemical details are omitted in models. Simple, abstract models, which may focus, for example, on capturing certain limited influences among molecular entities and processes, have certainly been useful [47, 48], and are likely to continue to be useful for a long time. However, there are many important questions that can now be feasibly addressed that depend on consideration of biochemical details to an extent beyond what is usually considered by modelers. Easing the consideration of mechanistic biochemical details in models for cellular regulatory systems was one of the driving motivations for the development of the rule-based modeling approach in systems biology [18]. With this approach, there is a new definition of “simple.” We should heed the saying attributed to Einstein [49], "everything should be as simple as possible, but not simpler."
The ability to capture mechanistic details in a rule-based model has opened new frontiers, such as the development of “standard models,” which do not currently exist in most if not all of biology. Standard models in other fields, such as the Standard Model of particle physics [50], drive the activities of whole communities and tend to be detailed, because they consolidate knowledge and are useful in large part because they identify the outstanding gaps in understanding. Would standard models benefit biologists? An affirmative answer is suggested by the fact that there are many intricate cellular regulatory systems that have attracted enduring interest, such as the epidermal growth factor receptor (EGFR) signaling network [51, 52], which has been studied for decades for diverse reasons. Efforts to model EGFR signaling have been hailed as paradigmatic of systems biology [53]. A comprehensive, extensively tested, and largely validated model for the EGFR signaling network or any other well-studied system, meaning a standard model, would aid modelers by providing a trusted reusable starting point for asking not one but many questions.
Uses and advantages of rule-based modeling
Rules describe the interactions and processes in a system. Various established conventions exist for writing rules, such as BNGL [34], but in each of these, a modeler specifies 1) the properties required of reactants, 2) the outcome of a reaction, or, equivalently, the transformation that is applied to reactants to obtain products, and 3) the rate law that governs all reactions implied by the rule. A rule is minimal if it does not depend on any properties of the reactants other than those of the components that are modified by the reaction. For example, a minimal rule for association might only require that each of the two sites participating in formation of a noncovalent bond be free. The set of sites or molecular components modified by a rule is called the reaction center. Any additional requirements on the reactants are considered contextual. When the interactions in a system are known to be modular, the rules describing the system will incorporate relatively little information about molecular context and the contextual requirements that are expressed in rules will be local, such as a requirement that a site neighboring a reaction center be bound or in a particular modification state. Rules for the interactions in a cellular regulatory network provide a high-level and relatively easy-to-obtain representation of the network. As we will discuss later, the rules of a model can often be translated automatically into equations, which can then be analyzed using conventional numerical methods. In cases where translation of rules into equations is not possible, the rules of a model can be used as event generators in a discrete-event (stochastic) simulation algorithm.
Because rules can include information about specific biomolecular sites, such as a constraint limiting an interaction between two proteins to cases where a particular tyrosine residue in one of the proteins is phosphorylated, rule-based modeling is ideal for representing biomolecular site dynamics, the changes in states occurring at the functional sites of biomolecules [23]. For proteins, these sites include conserved domains [54], such as Src homology 2 (SH2) and SH3 domains [55], as well as catalytic domains; short linear motifs [56]; and sites of post-translational modifications [57], such as an amino acid residue modified through covalent enzyme-catalyzed addition of a chemical group (e.g., a phosphoryl group) or a peptide bond that can be cleaved by a protease. The concept of a rule, which provides an abstract representation of an interaction (or process), should become clearer as we discuss examples.
Rules can be specified using many different conventions and means [35, 58–70]. Here, as indicated earlier, we will focus on use of the BioNetGen language (BNGL) [34] for specifying rules. Many of the available rule-based models, which are essentially collections of rules, have been formulated using this language [71] or can be readily recast in this language. Much of what is said here about BNGL and BNGL-compliant software also applies to other model-specification languages and the software compatible with these languages, which will be briefly discussed later.
Several thorough but informal descriptions of BNGL are available [34, 72, 73]. A formal description is also available [74]. BNGL allows for the specification of complete, executable models, which again can be viewed as collections of rules, as well as parameter values and other ancillary information (e.g., initial conditions). BNGL also allows for the specification of actions, which include simulation protocols, and simulation outputs. BNGL is a machine-readable (i.e., text-based) language designed for modeling cellular regulatory systems (especially cell signaling systems), which is akin to a domain-specific programming language. It is compatible with a number of general-purpose software tools, including BioNetGen [34, 35], RuleBuilder [75], RuleBender [76, 77], MOSBIE [78], DYNSTOC [79], RuleMonkey [80], and NFsim [81]. Once specified, a BNGL-encoded model can be processed to perform different operations, including model visualization; translation of a BNGL-encoded model specification into different formats, such as an equivalent Systems Biology Markup Language (SBML) encoding [12] or a MATLAB M-file or MEX-file [82]; and simulation [34]. A variety of simulation methods (and simulators) are available [23], as we will discuss later.
It is important to understand how rule-based models differ from traditionally formulated models to appreciate when a rule-based approach is warranted and most likely to be helpful. As indicated earlier, a key distinction between a rule-based model and a traditional equation-based model is the way that it is specified, but there tend to be important differences beyond the means of specification. We note that equation-based models that are comparable to rule-based models may take the form of ODEs for systems with well-mixed chemical kinetics or the form of partial differential equations (PDEs) for systems in which reaction and diffusion are coupled. Although one could in principle use either rules or equations to specify essentially the same model for a system of interest, the different model-specification approaches tend to engender different modeling assumptions. Indeed, when one compares models in the literature, such as the ODE-based models of Kholodenko et al. [83] and Chen et al. [84] for ErbB receptor signaling and the rule-based models of comparable scope of Blinov et al. [85] and Creamer et al. [86], one sees that the different types of models are based on very different assumptions. Equation-based models are based on assumptions about which chemical species are populated and therefore tracked. Thus, the size of a model corresponds to the number of chemical species taken to be populated in a system. Rule-based models, in contrast, are based on assumptions about the modularity of interactions, which affect the forms taken by rules. The size of a model corresponds to the number of interactions of interest. These differences arise from the distinct approaches to model specification, which have different requirements.
To specify an ODE or PDE model, for example, one must enumerate the chemical species that are potentially populated and write an equation for each. Thus, in modeling a cell signaling system with ODEs or PDEs, a modeler is obligated to specify which protein states and signaling complexes are populated. Unfortunately, there is usually no empirical data available to guide a modeler in deciding which chemical species and reactions are important, and a comprehensive enumeration of all possibly populated chemical species is often impracticable. When site dynamics are considered there may be more chemical species that could potentially be populated than there are molecules in a cell. This barrier to model specification has been called combinatorial complexity [15, 16]. We note that lack of information about which chemical species are populated in a system is an issue of concern even if one takes a rule-based modeling approach (which eliminates the requirement to enumerate chemical species), because this information, if available, would impose useful constraints on the rules of model. However, the rules and parameters of a rule-based model implicitly identify important species and reactions, which can be revealed through simulation [87]. Thus, a rule-based modeling approach does not eliminate the problem of absence of knowledge about the important chemical species and reactions in a system but it does provide a tool for discovering this information starting from the known biomolecular interactions of the system.
An example of combinatorial complexity is provided in Figure 1A, which illustrates the reaction network that arises in a model for a protein with two phosphorylation sites that can bind other proteins when phosphorylated. This model implies a reaction network with 9 distinct chemical species and 12 possible reactions, but it can be encoded with a set of four rules under the assumption that phosphorylation or binding at one site does not affect the rate of reactions taking place at the other site. The use of rules (Figure 1B) circumvents the requirement to identify the possible chemical species and reactions in advance and greatly reduces the size of the model specification in this case (from 12 reactions to 4 rules). Each rule specifies the necessary and sufficient conditions for occurrence of the interaction represented by the rule. In our example, each rule only specifies the state of the site being modified by phosphorylation or binding, but not that of the other site. Each rule generates a reaction for each allowable set of reactant species it encounters, and furthermore, each reaction generated by a rule is parameterized by the same rate constant. The latter represents a coarse graining of the chemical kinetics, because in principle, each species may be associated with a unique reactivity, as suggested by Figure 1A. The effect of this modularity assumption is shown in Figure 1C, where each of the three reactions generated by each rule is governed by the same rate constant. This rule-based version of the model is thus not quite as general as the full reaction scheme.
Figure 1. Reaction network-based and rule-based representations of a simple biochemical system.
a) Reaction network for a protein (rounded box) that can be phosphorylated and modified at two sites. The unmodified forms and phosphorylated forms of each site are represented by an open circle and filled circle, respectively. Binding of another protein to a phosphorylated site is indicated by addition of an edge. Because of combinatorial complexity, there are a total of nine possible chemical species (labeled s1–s9) corresponding to distinct states of phosphorylation and binding, with a total of 12 possible (bio)chemical reactions (labeled r1–r12) connecting these states. Although such transformations are reversible in general, we have made the reactions unidirectional here for simplicity. b) Rules with modularity assumptions that represent the network with simplifying rate assumptions. Each of the four rules describes a transformation, either phosphorylation or binding, that occurs at one of the two sites. Because each rule refers to only one site, the state of the other site affects neither the applicability of the rule nor the rate of the reactions that are generated. c) Reaction network generated by the modular rules in (b). Labels on the reaction arrows indicate the rate constants governing each reaction. Each rule generates three reactions with identical rate constants as indicated. Although the number of species and reactions is the same as the network shown in (a), the number of parameters governing the rate equations in the model has been reduced from 12 to 4 by neglecting possible cooperative interactions between the sites. d) Simplified reaction network models with assumptions to limit combinatorial complexity. Two types of common implicit assumptions are illustrated: sequential modification, where the modifications of the sites are assumed to occur in a particular order, and competitive binding, where only one binding partner is allowed to bind at a time. These assumptions limit both the reaction network size and the number of parameters. All but one of the reactions depicted has a corresponding transformation in the rule-based version of the model (c), as indicated by the index of the rate constants. A prime is used to denote approximate correspondence, because the underlying assumptions of the models differ. The competitive binding model also includes a lumped reaction in which both sites are assumed to be phosphorylated to enable binding of one or the other of the protein’s binding partners.
Although rules tend to involve assumptions of modularity, rules can be made as precise as necessary to represent knowledge or hypotheses about the underlying biochemistry. It would be possible, for example, to use rules to fully specify the reaction network shown in Figure 1A using a set of 12 rules having more contextual constraints than the four rules discussed above, making each of these rules as specific in regards to reactants as the 12 distinct reactions. Thus, these rules would be equivalent to or essentially the same as the reactions shown in Figure 1A. Such a specification would not be more concise than a traditional one, but it would have the advantage that the precise structural (i.e., site-specific) requirements for each reaction would be recorded in the rules.
Rule-based modeling can be viewed as a generalization of traditional approaches to modeling biochemical kinetics that lifts the burden of needing to make an explicit listing of the chemical species and reactions that are important in a system. The burden of enumerating species and reactions often leads to ad hoc assumptions to limit the number of species and reactions included in a model. Two such assumptions are illustrated in Figure 1D. Both reaction schemes include the same types of reactions (binding and phosphorylation) used in the full and rule-based versions, but the majority of species and reactions are eliminated. Although assumptions of sequential modification or competitive binding may be justified in some cases, expedience is often the (unstated) rationale. Because traditional modeling approaches also lack standard nomenclature for tracking bonds or post-translational modifications, the composition of species can be ambiguous and the nature of assumptions made in constructing a model difficult to assess.
Although cell signaling has been studied for decades, we have limited knowledge about how modifications and binding at different sites of a protein are coupled (e.g., [20, 24, 88]). We argue that rules expressing a high degree of modularity represent a more natural starting point for model development than making assumptions for the sake of limiting network size. At the same time we caution that assumptions of modularity may not always be valid, as exemplified by the model of Ullah et al. [89] for the behavior of a ligand-gated ion channel, the inositol 1,4,5-trisphosphate receptor. This model includes only a fraction of the possible states of the receptor, those identified as being critical for reproducing an impressive collection of data. Although the model is inconsistent with ligand binding sites in the receptor behaving independently, or modularly, the states included in the model are highly idiosyncratic, meaning that the importance of these particular states is not at all apparent. Thus, this model is representative of models having structures that can only be found through inference from data. Models, especially at early stages of investigation, are usually taken to have less labyrinthine structures, regardless of modeling approach. Successive refinement of modular rules represents one principled approach that may prove useful in the future for the inference of such complex cooperative interactions from data.
Examples of rules for interactions in a cell signaling system
To make the concept of rules more concrete, let us introduce and then discuss three sets of BNGL-encoded rules, which describe various processes involved in IgE receptor (FcεRI) signaling (Figure 2). (As we discuss these rules, we will also discuss the underlying biological details, hopefully in an intelligible manner for readers unfamiliar with FcεRI signaling.) These rules are drawn from a larger set of such rules [41], discussed below, and do not by themselves form a coherent model of the system. Before discussing the example rules and presenting them in BNGL format, it should be noted that rules generally encode molecular mechanisms, which may seem arcane and which can make the writing or interpretation of a rule somewhat challenging. Of course, the ability to naturally capture (i.e., formalize) molecular mechanisms of biomolecular interactions is an advantage of rules, but because of the intimate link between the form of a rule and the underlying biological details, the presentation of rules below is necessarily entwined with a discussion of molecular mechanisms (i.e., site-specific details of biomolecular interactions) involved in FcεRI signaling. If these details obscure our introduction of rules (which we hope is not the case), we encourage the reader to return to this section after examining the complete models discussed later; these models consist of rules that can be interpreted more easily without background knowledge about a particular biological system.
Figure 2. Early events in IgE receptor (FcεRI) signaling.
a) An activation cascade in the signaling network of FcεRI, a multichain antigen-recognition receptor. A ligand or crosslinking reagent, such as a multivalent antigen or, as shown, an IgG antibody specific for the receptor’s α chain, activates the receptor by inducing receptor clustering. Receptor clustering allows Lyn, a Src-family protein kinase that constitutively associates with the receptor’s β chain, to phosphorylate tyrosines in neighboring, co-clustered receptors, which serves to recruit more Lyn to receptors as part of a positive feedback loop. Lyn kinase activity also serves to recruit a second protein kinase, Syk, to receptors, which enhances Syk’s ability to phosphorylate tyrosines in the transmembrane adaptor protein Lat. b) Pictorial representation of the steps involved in receptor crosslinking by a bivalent ligand (anti-FcεRIα): capture of free ligand (top) and crosslinking of receptors by a receptor-tethered ligand (bottom). c) Pictorial representation of reversible Lyn association with FcεRIβ. d) Pictorial representation of phosphorylation of Lat by receptor-bound Syk via a Michaelis-Menten mechanism.
FcεRI is a multichain antigen recognition receptor that is expressed, for example, on mast cells [90]. The antigen specificity of a given FcεRI receptor is determined by the IgE antibody with which the receptor is associated. IgE-FcεRI complexes are long lived relative to the time scale of the signaling events initiated by the interactions of IgE-FcεRI complexes with a multivalent antigen. Signaling may be initiated in the laboratory by crosslinking of FcεRI through use of various reagents, such as bivalent haptens, multivalent haptenated proteins (or other carrier molecules), chemically crosslinked oligomers of IgE, anti-IgE antibodies, and anti-FcεRI antibodies [91].
Here, we will discuss rules for the following signaling processes, which can all be considered early steps in FcεRI signaling (Figure 2A): 1) binding and crosslinking of receptors by a bivalent anti-FcεRI IgG antibody (Figure 2B); 2) recruitment of a protein tyrosine kinase, Lyn, to a docking site at Y218 in the β chain of an activated receptor (UniProt [92] numbering for the rodent protein Ms4a2), which depends on phosphorylation of Y218 (Figure 2C); and 3) phosphorylation of a membrane-localized substrate (Y175) in Lat by a second receptor-associated protein tyrosine kinase, Syk (Figure 2D). Rules for these processes can be written in BNGL as follows:
| (1a) |
| (1b) |
| (1c) |
| (2) |
| (3a) |
| (3b) |
| (3c) |
These rules, which represent direct binding and catalytic interactions, are visualized in Figure 3 in accordance with the conventions of Faeder et al. [93]. Note that bonds between molecular components are indicated by sharing of bond indices, which are prefixed by the “!” symbol. The notation “!+” indicates a bond without indicating the binding partner. Similarly, internal states of molecular components are prefixed by the “~” symbol. The conventions of BNGL-encoded rules are discussed further below.
Figure 3. Visualization of rules.
The rules of Equations (1)–(3) are visualized in accordance with the graphical conventions of BNGL [93]. According to these conventions, functional components of molecules (e.g., binding sites and sites of phosphorylation) are represented as vertices of colored graphs. The color of a graph corresponds to the type of molecule represented by the graph. The components of a given type of molecule share the same color (or equivalently, they share the same molecule type name). Vertices have optional attributes (or equivalently, labels), which indicate internal states (i.e., local properties, such as phosphorylation status). Bonds between molecular components are represented by (undirected) edges. As a simplification, one normally only uses edges to represent those bonds that break and/or form under conditions of interest. a,b) Graphical depiction of Equations (1a) and (1b). These rules formalize the pictorial representations of Figure 2B. c) Graphical depiction of Equation (2). This rule formalizes the pictorial representation of Figure 2C. d,e,f) Graphical depiction of Equations (3a), (3b) and (3c). These rules formalize the pictorial representations of Figure 2D. It should be noted that rules, which are used to represent the molecular interactions in a system, are simply collections of possibly connected subgraphs of the graphs used to represent the molecules in a system. The subgraphs comprising a rule identify the molecular components that affect, or are affected by, the interaction represented by the rule.
The rule of Equation (1a) represents capture of a free ligand by a free receptor site, meaning capture of free anti-FcεRI IgG antibody by the epitope in FcεRIα recognized by the antibody. The rule of Equation (1b) represents crosslinking of two receptors by a receptor-tethered ligand. The rule of Equation (1c) represents dissociation of a ligand-receptor (non-covalent) bond. Note that in the graphical depictions of these interactions, which are shown in Figures 2B and 3A,B, the two binding rules are depicted as reversible, whereas in Equation (1c) we have combined the two types of dissociation events captured in the reversible rules into a single dissociation rule. The rule of Equation (2) represents reversible recruitment of the Src-family protein tyrosine kinase Lyn to a phosphotyrosine docking site in the receptor via Lyn’s SH2 domain. The docking site is located at Y218 in the receptor’s β chain, which is part of an immunoreceptor tyrosine-based activation motif (ITAM). The rules of Equations (3a) through (3c) represent Syk-mediated phosphorylation of a substrate in the membrane adaptor protein Lat (Y175, UniProt numbering for the rodent protein) via a Michaelis-Menten mechanism [94, 95]. The rule of Equation (3a) indicates that a component in Syk, which is named tSH2 to suggest “tandem SH2 domains,” is bound, which is indicated by the notation “!+.” (The tandem SH2 domains in Syk interact with a pair of ITAM phosphotyrosines in each of the receptor’s γ chains.) Thus, the rule of Equation (3c) represents an activity of Syk that is restricted to receptor-associated forms of this protein tyrosine kinase, because binding of the kinase domain of Syk, denoted kin, to its substrate, Y175 in Lat, depends on Syk association with the receptor, as indicated by the rule of Equation (3a).
As can be seen from the example rules presented above, formal elements of a rule are often in one-to-one correspondence with material entities. For example, Lyn in the rule of Equation (2) refers to Lyn and SH2 refers to the SH2 domain of Lyn.
Each of our example rules ends with a listing of either one or two names of parameters. These rule-associated parameters are, by convention, taken to be rate constants in mass-action (elementary) rate laws. (Other types of rate laws can also be associated with rules, but when only a parameter name is given, a mass-action rate law is implied by convention.) The numerical values (and the units) of the rate constants associated with rules are usually defined separately from the rules within a model-specification file. There is at least one rate law associated with each rule. Reversible rules, such as the rule of Equation (2), may be associated with two rate laws, one for the forward transformation defined by the rule and one for the reverse transformation defined by the rule.
The rule of Equation (2) is associated with two rate constants and has two directions. It can be viewed as a generalized reversible reaction, or a reaction generator that defines forward and reverse transformations arising from a reversible interaction between two molecules named Lyn and Rec. The rule indicates that the interaction between Lyn and the receptor is mediated by molecular components named SH2 (a constituent of Lyn) and b_Y218 (a constituent of FcεRI’s β-chain ITAM). The nomenclature of rules is similar to that of standard chemical reactions; however, rules differ from standard chemical reactions in that they do not uniquely identify reactants (or products), which allows for concise model specification. For example, the rule of Equation (2) is silent about functional components of Lyn and FcεRI other than the SH2 domain in Lyn and its phosphotyrosine docking site in FcεRI. Thus, under a “don’t care, don’t write” convention [23], the rule potentially applies to multiple forms and states of these proteins and consequently defines multiple reactions if multiple forms and states are encompassed within a model specification.
The rule of Equation (2) has left- and right-hand sides, which are separated by the symbol “<−>.” This symbol indicates that the interaction represented by the rule is reversible and that the rule defines transformations/reactions in forward and reverse directions. The other rules presented above are each unidirectional, which is indicated by the symbol “−>.” By convention, unidirectional rules are written (and read) from left to right. In Equations (1a), (1b), (2) and (3a), the plus sign on the left-hand side indicates that the rule defines bimolecular (association) reactions, reading from left to right. The absence of a plus sign on the right-hand side of Equation (2) indicates that the rule defines unimolecular (dissociation) reactions when it is read in the opposite direction. The rules of Equations (1c), (3b), and (3c) also define unidirectional reactions. When two rate constants are associated with a rule, as is the case for Equation (2), two elementary mass-action rate laws are implied. In the rule of Equation (2), one rate law, with rate constant kpL, is implied for all association reactions defined by the rule, and a second rate law, with rate constant kmL, is implied for all dissociation reactions defined by the rule. The use of a single rate law for all association (or dissociation) reactions is a simplification, a type of coarse graining, as noted earlier.
The left-hand side of a rule defines necessary and sufficient conditions. In the case of Equation (2), this rule defines necessary and sufficient conditions for Lyn association with FcεRI. Namely, association may occur if and only if the SH2 domain of Lyn is free (indicated by an absence of a bond index appended to the component name SH2, which when present in a rule is prefixed by a “!” symbol) and Y218 in FcεRI’s β-chain is both free (again indicated by the absence of a bond index) and in an internal state labeled “P.” Internal states are convenient abstractions, which can be used to represent local properties of molecular components. Here, the label P is used to represent a phosphorylated tyrosine; the label 0 (which is intended to suggest no modification) could be used to represent an unphosphorylated tyrosine. Internal state labels are prefixed by a tilde (~). Similarly, as already noted, bond indices are prefixed by an exclamation mark (!). The right-hand side of the rule defines the outcome of Lyn association with FcεRI: the formation of a bond between Lyn’s SH2 domain and the phosphotyrosine in FcεRI’s β-chain. The bond is indicated through name sharing. In BNGL, associating a common bond index, such as “1” in the rules presented above, with a pair of component names indicates that the components are connected. A dot (․), which appears on the right-hand side of Equation (1a), (1b), (2) and (3a), serves as a separator. A dot also indicates connection. For example, the dot in Equation (2) indicates that Lyn and FcεRI are connected (without specifying how). In the case of this particular rule, the dot is redundant, because sharing of the bond index 1 by SH2 and b_Y218 also indicates that Lyn and FcεRI are connected. When read from right to left, the rule of Equation (2) indicates that a bond between Lyn’s SH2 domain and pY218 in FcεRI’s β-chain can be broken and that breaking of the bond causes dissociation of the proteins.
A complete, executable model and its analysis
The rules discussed above are simplified versions of rules taken from a recently developed library of rules for interactions involved in FcεRI signaling [41]. Each rule in this library is typically associated with comments that provide annotation, including a rationale for the formalization of the interaction represented by the rule and supporting references. This library was developed with the intention of enabling agile development of models for studying the system-level behavior of the FcεRI signaling network. A limiting step in model development is often a literature search and review, conducted for the purpose of collecting, organizing and formalizing the available mechanistic knowledge that is relevant for addressing a specific research question or other purpose. With a reliable rule library, this aspect of model development is streamlined. In Supplementary file S1, which is a plain-text BioNetGen input file, we have combined the rules given in Eqs. (1)–(3) with additional rules (sourced from the FcεRI rule library), and other elements of an executable model specification, such as parameter values, to obtain an executable model encompassing the rules illustrated in Figure 3. This model can be used, for example, to predict how the binding properties of a bivalent ligand, which we take here to be an FcεRI-specific IgG antibody, affect FcεRI signaling events downstream of FcεRI crosslinking. An example of the type of behavior we can investigate using the model is provided in Figure 4, which shows how the extent of Lat phosphorylation depends on the lifetime of a ligand-receptor bond.
Figure 4. Sensitivity of phosphorylation to the lifetime of a ligand-receptor bond.
The model of Supplementary file S1 was used to predict how the lifetime of a ligand-receptor bond influences the (steady-state) level of phosphorylation of Y218 in the β chain of FcεRI (dotted line), which is a docking site of Lyn’s SH2 domain; the steady-state level of phosphorylation of Y65 and Y76 in the γ chain of FcεRI (broken line), which are docking sites of Syk’s tandem SH2 domains; and the steady-state level of phosphorylation of Y136 in Lat (solid line), which is a substrate of Syk. The simulation results underlying the plots shown here are reported in a .gdat file that is created by BioNetGen after processing the model specified in Supplementary file S1 and then executing the actions defined in Supplementary file S1. The lifetime of a ligand-receptor bond corresponds to the inverse of the rate constant associated with the rule of Equation (1c), i.e., 1/koff. The dependence of phosphorylation levels on ligand-receptor bond lifetime arises from the interplay between kinetic proofreading, which tends to increase phosphorylation levels as lifetime increases, and serial engagement, which tends to decrease phosphorylation levels as lifetime increases [17, 151].
As noted above, the model specification of Supplementary file S1 was constructed by reusing rules collected in the FcεRI rule library of Chylek et al. [41]. The ability to construct a new model by assembling existing rules in a new way, albeit with some modifications, illustrates the compositionality of the rule-based modeling approach. In Table 1, we compare the rules of Supplementary file S1 with their parent rules in the FcεRI rule library. The modifications of the parent rules were introduced as simplifications and to obtain a self-consistent set of rules. The molecule types and interactions considered in the model are illustrated in Figure 5, which shows a contact map automatically generated from Supplementary file S1 by RuleBender [76, 77] (Figure 5A) and an extended contact map manually constructed in accordance with recommended guidelines [96] (Figure 5B). Boxes in the maps illustrate molecules and the functional components of molecules considered in the model. Arrows in the maps represent interactions captured by rules in the model.
Table 1. Comparison of two parent rules in the library of [41] and the derived rules of Supplementary file S1.
Each derived rule was obtained from its parent rule through a simplification, removal of a constraint requiring the SH3 domain in Lyn to be free, because in the model of Supplementary file S1, the binding partners of Lyn’s SH3 domain are not considered.
| Parent rule | Derived rule |
|---|---|
| # Lyn unique domain binds receptor | |
| Rec(b_Y218~0) + Lyn(U,SH3,SH2) −> \ | Lyn(U,SH2) + Rec(b_Y218~0)−> \ |
| Rec(b_Y218~0!1).Lyn(U!1,SH3,SH2) kfRecLyn1 | Lyn(U!1,SH2).Rec(b_Y218~0!1) |
| # Lyn SH2 domain binds receptor | |
| Rec(b_Y218~P) + Lyn(U,SH3,SH2) −> \ | Lyn(U,SH2) + Rec(b_Y218~P)−> \ |
| Rec(b_Y218~P!1).Lyn(U,SH3,SH2!1) kfRecLyn2 | Lyn(U,SH2!1).Rec(b_Y218~P!1) |
Figure 5. Visualization of a rule-based model.
Rules can be precisely visualized, as illustrated in Figure 3, but a coarse description of the rules comprising a model is often useful for obtaining an overview of what is captured in a model. Two visualizations of the model of Supplementary file S1, a contact map and an extended contact map, are shown here. a) Contact map. This type of visualization can be generated automatically from the rules of a model. RuleBender provides this functionality. A contact map shows the types of molecules considered in a model, their functional components, and the bonds that can form and break between these components. b) Extended contact map. This type of visualization adds information beyond that encoded directly in rules and serves an annotation purpose. At present, extended contact maps must be manually constructed. In an extended contact map, one represents the hierarchical substructures of molecules; enzyme-substrate relationships, which are usually implicit in rules; and the dependence of interactions on the internal states of molecular components. Each arrow in an extended contact map represents a set of rules that share a common reaction center. (Multiple rules correspond to one arrow if a transformation is taken to occur in multiple molecular contexts in a context-dependent manner.) Arrows that begin and end with arrowheads represent direct binding interactions. Arrows that begin at a nested box and end at a small circle represent catalytic interactions. The box associated with such an arrow represents an enzyme; the circle points to a substrate of the enzyme. Components taken to have internal states are attached to flags. Here, flags are used to identify tyrosine residues taken to have phosphorylated and unphosphorylated internal states. If an arrow representing a direct binding interaction points to a solid dot on a flag, then the arrow should be understood to represent an interaction that depends on the internal state indicated by the flag. Note that phosphatase activities are not represented here because phosphatases are implicit in the model of Supplementary file S1.
It should be noted that a complete model specification invariably consists of several other parts in addition to rules. These parts of a model may include, for example, a declaration of the different types of molecules considered in the model, parameter values, and user-defined functions for rate laws. A model is also usually associated with specifications of model outputs, which are called observables, and one or more (simulation) commands, which are called actions. Tables 2 and 3 provide listings of some of the available BNGL actions and their arguments.
Table 2. Summary of selected BioNetGen actions and arguments.
For a complete listing of actions and arguments, see the documentation available at [73].
| Action/argument | Description | Default |
|---|---|---|
| generate_network | Process the rules of a model to generate the reaction network implied by the rules | |
| overwrite=>1/0 | Allow an existing .net file to be overwritten | 0 |
| max_iter=>int | Specify the maximum number of iterations for the network generation procedure | 100 |
| max_agg=>int | Specify the maximum number of molecules in an aggregate or complex | 1e9 |
| simulate | Perform a deterministic or stochastic simulation (the ode, ssa and pla methods require prior execution of generate_network and the nf method requires NFsim) | |
| prefix=>”string” | Specify the base filename for output | Base name of .bngl file |
| suffix=>”string” | Specify a suffix to be added to output filenames | Empty |
| verbose=>1/0 | Request verbose output | 0 |
| method=>”string” | Specify simulation method, string must be ode, ssa, pla or nf | Required |
| t_start=>float | Start time for simulation | 0 |
| t_end=>float | End time for simulation | Required |
| n_steps=>int | Number of report times between t_start and t_end | 1 |
| print_functions=>1/0 | Request that function evaluations be sent to .gdat file | 0 |
| atol=>float | Absolute tolerance (used with ode) | 1e-8 |
| rtol=>float | Relative tolerance (used with ode) | 1e-8 |
| steady_state=>1/0 | Check for steady state (used with ode) | 0 |
| seed=>int | Seed for generation of pseudo random numbers (used with ssa, pla, and nf) | Random |
| parameter_scan bifurcate | Perform simulations to a specified end time over a specified range of values for a parameter (these commands are useful for making steady-state dose-response curves and bifurcation diagrams); use bifurcate if bistability is suspected | |
| The arguments of simulate are available, plus those described below | ||
| parameter=>"string" | Name of the parameter to be scanned | Required |
| par_min=>float | Minimum value of parameter | Required |
| par_max=>float | Maximum value of parameter | Required |
| n_scan_pts=>int | Number of points between par_min and par_max to sample | Required |
| log_scale=>1/0 | Sample points logarithmically/linearly | 0 |
Table 3. Summary of additional BioNetGen actions and arguments.
For a complete listing of actions and arguments, see the documentation available at [73].
| Action/argument | Description |
|---|---|
| readFile | Import text from a designated file |
| file=>”string” | Name of file to be read and path (required) |
| blocks=>[“string”,…] | An optional list of blocks to be imported from a file (by default, all available blocks are imported) |
|
writeSBML writeMfile writeMexfile |
Report as output a generated reaction network in SBML, M-file, or MEX-file format |
| prefix=>”string” | Attach a prefix to the base filename used for output |
| suffix=>”string” | Attach a suffix to the base filename used for output |
| setConcentration(“species”,val) | Set the value of the concentration of a species |
| species | The BNGL name of the species |
| val | A numerical value |
| saveConcentrations(“label”) | Save in memory all current species concentrations |
| label | An optional label that can be used to reference saved information |
| resetConcentrations(“label”) | Set all species concentrations to the values stored in memory (at the last save, by default, or to those values referenced by the indicated label) |
| label | An optional label that can be used to reference saved information |
| setParameter(“param”,val) | Set the value of a parameter |
| param | The name of the parameter |
| val | A numerical value |
| saveParameters(“label”) | Save in memory all current parameter values |
| label | An optional label that can be used to reference saved information |
| resetParameters(“label”) | Set all parameters to the values stored in memory (at the last save, by default, or to those values referenced by the indicated label) |
| label | An optional label that can be used to reference saved information |
Although the model of Figure 5 and Supplementary file S1 was constructed quickly, this model is fairly elaborate. To introduce and discuss the various parts of a complete executable BNGL-encoded model specification, let us consider a simpler model, a model for interaction of a soluble monovalent ligand with a bivalent cell-surface receptor (Figure 6). This model corresponds to the experimental system considered in the study of Erickson et al. [97], in which the ligand was DCT (a monovalent hapten) and the receptor was hapten-specific cell-surface IgE (bound to FcεRI). In this study, Erickson et al. [97] studied the effect of diffusion on ligand capture by the cell-surface receptor and escape of the ligand from the receptor to the bulk extracellular fluid.
Figure 6. A model for ligand-receptor binding with diffusion effects.
An annotated, executable version of this model is provided as Supplementary file S2. The model specification consists of several parts, which define parameters, molecule types, seed species, observables, functions, and rules. The model specification is accompanied by several commands, or actions. If Supplementary file S2 is submitted to BioNetGen for processing, these actions will be performed automatically after the model specification is parsed. The actions shown here instruct BioNetGen to perform three simulations of ligand-receptor binding kinetics with different values of the ligand diffusion coefficient.
The model of Figure 6 (and Supplementary file S2), as is typical of BNGL-encoded models, consists of several named blocks of code, as well as comments, which are each preceded by the ‘#’ symbol. In Figure 6, comments are used to indicate the units of parameters, which must be specified in a self-consistent unit system. Each block of code in Figure 6 is identified and delineated by a block name associated with the ‘begin’ and ‘end’ keywords. The model specification, which consists of six blocks, is contained within a block of code that is delineated by ‘begin model’ and ‘end model’ and followed by two commands or actions, which together define a simulation protocol. This protocol consists of two parts: 1) network generation (as specified by the ‘generate_network’ command), which involves finding the chemical reaction network (i.e., the reactions and species) implied by the rules of the model; and 2) construction and numerical integration of the corresponding ODEs for the chemical kinetics of the network (as specified by the ‘simulate’ command).
The blocks of code comprising the model specification of Figure 6 (i.e., the six blocks within the model block) are the parameters, molecule types, seed species, observables, functions, and reaction rules blocks. These blocks of code provide the following information: (i) parameters – values for useful physical/mathematical constants (e.g., π) and parameters of the model (e.g., ligand and receptor abundances, an equilibrium association constant for ligand-receptor binding, and a dissociation rate constant) in a consistent unit system, one in which concentrations are expressed on a per cell basis, which has certain advantages discussed elsewhere [34]; (ii) molecule types – definitions of the types of molecules considered in the model (here, a monovalent ligand and a bivalent receptor), including their relevant component parts (one ligand binding site and two identical cognate receptor binding sites); (iii) seed species – the species initially present in the system (here, the free form of the ligand and the free form of the receptor) and their abundances, which for this model serve to define an initial condition for an initial value problem; (iv) observables – the model outputs of interest, which are defined as sums over the concentrations of species matching a specified pattern or set of patterns (e.g., the amount of bound ligand); (v) functions – mathematical functions defined using parameters and observables that are used to specify complex observables (i.e., observables that cannot be represented by a sum or weighted sum of concentrations) and non-elementary rate laws (here, functions are used to define forward and reverse ligand-receptor binding coefficients in accordance with the theory of Berg and Purcell [98]); (vi) reaction rules – local rules that model/represent interactions (here, a reversible rule is defined for ligand-receptor association and dissociation and the rule is associated with a pair of forward and reverse rate laws specified using functions).
The use of rules does not simplify the specification of this model, but the model-specification file shown in Figure 6, introduces the various parts of BNGL-encoded models and illustrates the use of functions, which is a fairly new capability [23, 81]. Below, after briefly discussing available simulation methods and software tools, we will consider additional examples of model specifications and actions, including a (simple) model that would be difficult to specify in a traditional manner (Supplementary file S3).
A brief survey of useful methods and software tools
Various methods and software tools are available to support rule-based modeling. The vast majority of the presently available methods and tools are designed to support model specification and/or simulation. These tasks are somewhat interdependent but less so than is typical for traditional modeling approaches.
Methods
To specify models in terms of rules, one needs conventions and compatible software tools for operating on and analyzing compliant model specifications. Because of the newness of rule-based modeling, conventions are still emerging. However, BNGL and BNGL-compliant tools are used commonly [22, 71]. Alternative methods for specifying models include languages/formats with syntactic differences that tend to offer similar functionality [58, 62, 99, 100]; domain-specific languages that offer higher-level abstractions that can ease the task of model specification [101]; and embedded languages, which allow a modeler to leverage the power of a general-purpose programming language [69]. There are also BNGL extensions and other languages that enable modeling capabilities beyond what is available within the BNGL framework [67, 68, 102–107]. We note that rules and rule-based models are readily visualized [76–78, 93, 96, 105, 108, 109] and software tools exist that enable a visual approach to model specification, such as RuleBuilder [75] and Simmune [63, 104, 105]. (RuleBuilder is not currently supported but the Java code is available.)
As we have discussed above, the rule-based modeling paradigm allows site-specific details about biomolecular interactions to be captured and represented using an understandable language, such as BNGL. The understandability of BNGL stems from its underlying graphical formalism [36, 74, 110], which makes individual rules and collections of rules (models) amenable to visualization [76–78, 93, 96, 105, 108, 109], as well as annotation [96]. Computer-aided analysis of BNGL-encoded models is enabled by general-purpose BNGL-compatible software tools that can process model specifications to obtain simulation results. These tools implement simulation algorithms that fall into one of two categories: direct methods or indirect methods, both of which are based on physicochemical principles [23]. (Some researchers refer to direct methods, or even rule-based modeling, as a form of agent-based modeling; we discourage this practice, because agent-based models are not typically based on physicochemical principles.)
Indirect methods involve the interpretation of the rules of a model to obtain (through network generation, i.e., through enumerating the reactions implied by rules) an equivalent model specification that has a traditional form, such as that of a coupled system of ODEs [34–36, 74, 111]. Once a traditional formulation has been obtained, then standard tools can be applied, such as the ODE solvers suitable for stiff problems available within SUNDIALS [112] or MATLAB (MathWorks, Natick, MA). Use of an indirect method allows one to leverage mature, sophisticated, and diverse simulation and analysis tools. However, indirect methods have limited applicability, because a set of rules sometimes cannot be practically translated into an equivalent, traditional model form, often because of memory requirements [74]. (Recall the problem of combinatorial complexity.) A not unrelated issue is runaway network generation. In general, one cannot determine beforehand whether the network-generation step of an indirect method will finish running (yielding a finite-size list of reactions and/or equations) or continue indefinitely. In other words, when one tries to apply an indirect method, network generation may or may not terminate. In practice, one can anticipate a failure to terminate if one can detect rules that introduce a polymerization-like process. Trying to determine if network generation will terminate is equivalent to trying to solve the famous halting problem. Rule-based modeling systems have been shown to be Turing complete [113], and for such systems, the halting problem is known to be undecidable. In cases where indirect methods are inapplicable, direct methods, which are specialized for simulation of rule-based models, are used.
Direct methods are fairly new, and methodology tailored for simulation and analysis of rule-based models is still being developed. Thus, compared to the toolbox of methods available for traditionally formulated models, direct method options are relatively limited. The available direct methods are all particle-based stochastic simulation algorithms [79, 80, 114, 115]. Particle-based stochastic simulation allows the state of a system to be tracked in terms of molecular site states instead of population levels of chemical species, which obviates the need to enumerate the chemical species (and reactions) implied by rules. In direct methods, rules are used as event generators. An event changes the state of at least one molecular site, and the state of the system is found as a function of time through the firing of probabilistically chosen rule-defined reaction events, one event at a time. Although this type of simulation algorithm can be less efficient than an indirect method (because of the drawbacks of discrete-event procedures [116]), direct methods are the most generally applicable type of simulation method available for rule-based models. Some models that have been developed to study cellular regulatory systems can only be simulated using a direct method [86, 117–119].
A hybrid particle/population (HPP) method has recently been implemented that lies intermediate between indirect and direct methods [74]. Like indirect methods, HPP expands the rules of a model to obtain an alternative, equivalent model specification. However, unlike with indirect methods, the expansion with HPP is only partial and the resulting model is still rule-based, although with a larger number of rules than the original. The advantage of this partial expansion is that it allows a subset of the species to be treated as population variables rather than particles. This approach can significantly reduce computational memory requirements [74]. Simulations of the partially expanded model can be performed using a recent version of NFsim (1.11 or later) that can handle population variables.
Software tools
Two useful simulation tools for rule-based compartmental models are BioNetGen [34] and NFsim [81]. BioNetGen is a BNGL-compatible simulation tool that implements a variety of indirect methods, including both deterministic and stochastic methods. The options available include methods suitable for stiff problems, such as the default ODE solver (invoked with the ‘ode’ method flag) and the partitioned-leaping algorithm for stochastic simulation (invoked with the ‘pla’ method flag) (Table 2). The latter method is a variant of Gillespie’s tau-leaping algorithm [116, 120]. NFsim is a BNGL-compatible simulation tool that implements a (stochastic) direct method [81, 121]. BioNetGen and NFsim, which offer the ability to specify rate laws that have arbitrary functional forms [23, 81], can each be accessed from the command line, through scripting, or through a graphical user interface (GUI), which is provided by RuleBender [76, 77]. The Python-based framework PySB (for systems biology modeling) provides an additional means for using these and other tools [69]. The RuleBender/BioNetGen/NFsim software stack can be freely downloaded as a bundle, along with documentation, and installed (typically without a need for compilation) on commonly used platforms, including Windows, Mac OS, and Linux [122]. We should emphasize that this software stack is designed and intended for the analysis of models in which reaction compartments are spatially homogeneous (i.e., well-mixed). Consequently, other tools, such as Simmune [104, 105], must be used for spatial modeling. There are a number of software tools for rule-based spatial modeling based on physicochemical principles, which account for coupling between reaction and diffusion in different ways (viz., the next subvolume method, PDEs, and Brownian dynamics) [66, 67, 102–105, 123]. Although the definition of a reaction network does not provide a complete specification of a spatial model, it is often a significant part of a spatial model specification. Thus, it is perhaps worth noting that BioNetGen currently supports export of rule-derived reaction networks in a number of formats that can be loaded directly into various spatial modeling tools, including VCell [66], SSC [67], Smoldyn [123], and MCell [124]. Most spatial modeling tools implement indirect simulation methods (i.e., most tools rely on network generation). However, there are exceptions [102, 103].
More example models
To further illustrate the parts of a BNGL-encoded model, various features of BNGL, and some of the available capabilities of BNGL-compatible software tools, a series of example models is presented in Figures 7–9. The model specifications shown in these figures are complemented by annotated BioNetGen input files, which are provided in the Supplementary material (Supplementary files S3–S5). Figure 10 shows simulation results obtained from the models of Figure 7–9, as well as the model of Figure 6 considered earlier (Figure 10A). These results were obtained by passing the model specifications and actions of Supplementary files S2–S5 to BioNetGen for processing. Processing can be initiated either at the command line via a command of the form “path/BNG2.pl filename.bngl” or via a point-and-click procedure within the GUI of the RuleBender environment [76, 77].
Figure 10. Simulation results obtained from the models of Figures 6–9.
a) Simulation results for the model of Figure 6 (deterministic time courses of ligand-receptor binding for different values of the ligand diffusion coefficient). The solid, broken and dotted lines correspond to large, medium and small values for the ligand diffusion coefficient. Time is indicated on the horizontal axis, and the amount of bound ligand is indicated on the vertical axis in dimensionless units. The vertical axis is scaled such that a value of 1 corresponds to 100,000 bound ligands per cell. b) Simulation results for the model of Figure 7 (deterministic time courses for equilibration to a basal steady state and for the response to phosphatase inhibition). An equilibration simulation was started at an arbitrary, easy-to-specify starting condition at time t = 0. The purpose of this simulation is to find the non-trivial, difficult-to-specify basal steady state. A production simulation was started at time t = 100 s, with the addition of a phosphatase inhibitor. The horizontal axis indicates time, and the vertical axis indicates the amount of receptor phosphorylated at site Y1 (solid curve) and site Y2 (dotted curve) relative to the basal levels of phosphorylation for these sites (i.e., phosphorylation levels are normalized). c) Simulation results for the model of Figure 8 (a bifurcation diagram summarizing the stable steady states found through simulation and parameter scanning). In the top and bottom panels, the horizontal axis indicates the value of the bifurcation parameter. In the top panel, the vertical axis indicates the level of repressor X. In the bottom panel, the vertical axis indicates the level of repressor Y. Units are the same for all axes (copies per cell). Each point corresponds to a stable steady state (i.e., a fixed point) at the indicated value of the bifurcation parameter (Kxy). Only stable steady states are represented in these plots. d) Simulation results for the model of Figure 9 (stochastic time courses). The black curve indicates the fluctuating level of expression from the unregulated gene. The colored curves indicate the fluctuating levels of expression from the three regulated genes (red, gene 1; blue, gene 2; green, gene 3). The average level of expression is the same for each regulated gene. The horizontal axis has been scaled such that a value of 1 corresponds to 1000 s.
Figure 7. A model for receptor signaling with a non-trivial basal steady state.
An annotated, executable version of this model is provided as Supplementary file S3. Because of space limitations, the parameters block of the model specification is not shown. Instead, a “readFile” command is shown where the parameters block would normally appear. This command directs BioNetGen to read a (plain-text) file named “SupplementaryfileS3.bngl” in the current working directory and to import the parameters block from that file. (If the block is not present in the designated file, or the file cannot be found, an error report is generated.) The actions associated with the model specification instruct BioNetGen to find the basal steady state of the system, in which the receptor is constitutively phosphorylated, and then to simulate the response to addition of a phosphatase inhibitor.
Figure 9. A model for stochastic gene expression.
An annotated, executable version of this model is provided as Supplementary file S5. This model considers a gene expressed at a low constitutive level, and three regulated genes, which are each expressed at an identical average level. Although the regulated genes each have the same expected level of expression at steady state, their distributions of expression are different. The actions associated with the model specification instruct BioNetGen to perform a stochastic simulation.
The model of Figure 7 (Supplementary file S3) illustrates the conciseness that is possible with a rule-based approach to model specification. This model, although it accounts for interactions among only a handful of molecules, is rather large when expressed as an equivalent reaction network. The network implied by the rules of the model, which can be found automatically by BioNetGen and reported in the various formats mentioned above, consists of 156 chemical species and 1,218 unidirectional reactions. The corresponding system of coupled ODEs for the mass-action kinetics of this network consists of 156 equations, one for each species, which collectively contain 1,218 distinct right-hand-side terms, one for each unidirectional reaction. As indicated by its molecule types block, the model of Figure 7 accounts explicitly for four molecules. These molecules are a receptor tyrosine kinase (RTK), which has two sites of autophosphorylation; an SH2 domain-containing adaptor protein, which interacts with the two sites in the RTK when they are phosphorylated; a phosphatase; and a phosphatase inhibitor. The rules of the model capture the following picture. The two sites in the RTK are phosphorylated constitutively in a non-specific manner by cytoplasmic protein tyrosine kinases, which are implicitly considered, and dephosphorylated by a constitutively active phosphatase. The two sites in the RTK are also phosphorylated in a specific manner when the RTK is dimeric. In the absence of a ligand, the RTK constitutively dimerizes through a (weak) self-interaction.
The model of Figure 7, because of constitutive phosphorylation and dephosphorylation of the RTK, has a non-trivial basal steady state, meaning the steady state of the system before a perturbation. This steady state is non-trivial in the sense that its specification requires the definition of the population levels of each of the 156 chemical species implied by the rules of the model and many of these species have concentrations that are significantly different from zero. The basal steady state can be found through an equilibration procedure, which is specified in the actions block of Figure 7. In this procedure, the population levels specified in the seed species block of the model are taken to define an initial condition and a simulation starting from this initial condition and continuing for a sufficiently long time is then performed to find the basal steady state. After setting the phosphatase inhibitor abundance to a non-zero level, a second production simulation is performed, starting from the basal steady state, to predict the effect of phosphatase inhibition. Results from the equilibration and production simulations are shown in Figure 10B.
The model of Figure 8 captures processes involved in a genetic toggle switch [125], such as regulated mRNA/protein synthesis. This model, as parameterized, exhibits bistability. The ‘bifurcate’ action in the actions block of Figure 8 invokes a series of simulations that serve to find stable steady states of the system as a function of a bifurcation parameter (Figure 10C). The ‘writeMfile’ command creates an M-file that can be used within MATLAB together with MatCont [126], a MATLAB toolbox for numerical continuation and bifurcation analysis, to find unstable steady states.
Figure 8. A model for a genetic toggle switch.
An annotated, executable version of this model is provided as Supplementary file S4. The actions associated with the model specification instruct BioNetGen to find (stable) steady states through simulation for a long time for a range of values of a bifurcation parameter. Simulations are performed as the bifurcation parameter is varied from the designated minimum value to the designated maximum value, and then in the reverse direction. Simulations are initiated from the previously found steady state. The first simulation is initiated from the starting condition specified via “setConcentration” commands.
The model of Figure 9 is taken from the review of Munsky et al. [127]. It is intended to capture and illustrate fluctuations in gene expression arising from the inherent stochasticity of (bio)chemical reactions and small population sizes. A constitutively expressed gene and three regulated genes, which each have ‘on’ and ‘off’ states, are considered. The state of each gene is controlled by a transcription factor, which is implicitly considered. Parameters of the model are such that the regulated genes all have the same average level of expression; however, distributions of transcript abundance are distinct. As illustrated in the actions block of Figure 9, an (indirect) stochastic simulation method is invoked by simply specifying the appropriate method flag, ‘ssa,’ as an argument of the ‘simulate’ command (Table 2). The results of a single stochastic simulation run are shown in Figure 10D. An alternative, accelerated-stochastic simulation method [120] can be invoked by setting the method flag to ‘pla’ (Table 2) [73]. In addition, NFsim [81], which implements a network-free (i.e., direct) stochastic simulation method, can be invoked using the ‘nf’ method (Table 2). For cases where there are large differences in the population sizes of chemical species, the HPP method is available [73, 74].
Three further example models, which illustrate powerful but uncommonly used (to date) modeling capabilities, are provided in Supplementary files S6–S8. The first of these files is simply a preprocessed version of the model of Figure 7. The preprocessing step performed to obtain Supplementary file S6 is called fragmentation [111], which can be likened to a model reduction technique. However, the technique is not based on typical model reduction strategies, such as timescale separation [128]. Fragmentation simplifies a model specification, such that when the rules of the model are processed to derive the implied reaction network, a minimal network, which can be viewed as a reduced form of the network that would be found through naïve rule specification and application, is found. Simplifications are obtained, for example, by representing independent sites in a molecule separately. Fragmentation is compatible with deterministic simulation methods. It is a fairly simple procedure that can be performed manually using the recipe provided by Feret et al. [111]. Related methods have also been described recently [129–133]. We note that fragmentation is connected to modeling approaches that have long been used in polymer chemistry (see [88] and references therein).
Supplementary file S7 provides an example of a model specified using compartmental BNGL (cBNGL) [134], which enables the definition of reaction compartments and transport between them. The topology of the compartments is assumed to be composed of a hierarchy of membrane-enclosed volumes. Each surface encloses a single volume, which in turn can contain an arbitrary number of surfaces. Each compartment has a specified reaction volume, which for surface compartments is given by the product of the surface area and an effective width that has typical values corresponding to the molecular dimensions of proteins (see Harris et al. [33, 134] for more details). The use of effective volumes for surface compartments allows all bimolecular rate constants to be specified in the same units (e.g., M−1s−1), with automatic conversion being performed to obtain the correct rates for surface-surface, surface-volume, and volume-volume reactions. In cBNGL both species and their molecular constituents have a location attribute that maps them to a specific compartment. Membrane-associated species may have molecular constituents that are located in volume compartments on either side of the membrane compartment, which facilitates correct localization of product species upon dissociation. Most cBNGL rules do not explicitly reference location attributes but rely instead on additional semantics that only allow reactions between species in the same or adjacent compartments and automatically determine the correct volume of reaction for determining reaction rates. Currently, simulations of cBNGL models can only be performed using indirect methods. In addition to ODE and stochastic simulations based on well-mixed compartments, it is possible to perform spatially resolved simulations of cBNGL models using MCell, a particle-based spatial stochastic simulator [135]. A tutorial describing how to perform spatial simulations of cBNGL models using realistic geometries is available online [136], which also describes how to obtain realistic geometries for cell simulation from statistical models built from experimental imaging data using a tool called CellOrganizer [137]. Supplementary file S7 defines a simplified model that encompasses many key events in signal transduction including ligand-receptor binding, receptor trafficking, transcriptional activation, and negative feedback through expression of an inhibitor of receptor activity. The model demonstrates that oscillations may arise in the system when there is a sharp threshold for transcriptional activation, which can occur, for example, if there is an inhibitor than binds and inactivates the transcription factor with high affinity [138].
Our last example uses a recent extension of BNGL called eBNGL [139], in which one can represent interactions using minimalist rules together with energy patterns that define cooperative interactions [139]. eBNGL was inspired by and builds upon an earlier tool called the Allosteric Network Compiler (ANC) [140] and the work of Danos et al. [141]. An eBNGL-encoded model is parameterized in terms of thermodynamic quantities, which ensures that constraints of detailed balance [142–145] are satisfied. Kinetics can still be predicted on the basis of assumptions about activation energy barriers and transition state theory. The model of Supplementary file S8 captures coupled folding and binding of a disordered protein to a second, folded protein in accordance with a reaction scheme discussed by Kiefhaber et al. [146]. This scheme encompasses a two-step folding-after-binding mechanism, as well as a two-step conformational selection mechanism, in which the disordered protein first folds and then binds.
Conclusions
Rule-based modeling is different from traditional equation-based modeling. Equations are jettisoned for the most part. Instead, one relies on sophisticated software packages, which are more accessible than equations (or software that requires equations as input). Use of these packages, by design, does not require specialized mathematical training or extensive programming skills. Models are specified using a specialized model-specification language, which can be understood by both a computer and a user/biologist. Using one of the available languages to specify a model is similar to writing a simple program, or a script. Thus, some aptitude for programming is required, but learning a model-specification language, such as BNGL, is considerably easier than learning, say, a general-purpose programming language, such as Python [147]. BNGL, like most available model-specification languages, by design, is streamlined and less complex than a general-purpose programming language.
BNGL, which is representative of other rule-based modeling languages, has been designed for use by (physical) biologists; it is intended to allow easy representation of knowledge about biomolecules and their interactions. Within the framework of BNGL, molecules are represented as structured objects, with the component parts of these objects corresponding to the functional sites in biomolecules that are responsible for binding and catalytic interactions of interest. BNGL-encoded rules define necessary and sufficient conditions for interactions and provide rate laws. Capturing the site-specific details of interactions is relatively straightforward. The problem of combinatorial complexity, which quickly stymies traditional modeling approaches and severely limits the biochemical details that can be considered in traditional models, is sidestepped, provided that assumptions of modularity can be plausibly justified or used as a first-order or initial approximation. Because BNGL is based on a graphical formalism for representing molecules and interactions, models and the formal elements of models can be readily visualized with precision. Indeed, automatic visualization capabilities are available for both individual rules and entire rule-based models [76–78, 108, 109]. It is even possible to use a simple drawing tool to specify a rule-based model [75, 105].
A strong advantage of rule-based modeling, which has yet to be fully leveraged, is the ease with which these models can be annotated [38, 41, 42, 86, 96, 119], as there is often a one-to-one correspondence between the formal elements of a model and entries in public databases that collect information about biomolecules and their interactions. Thus, rule-based modeling could provide a foundation for building standard models for well-studied cellular regulatory networks of outstanding interest, such as those implicated in diseases.
Long ago, mathematicians developed formalisms to model and analyze natural phenomena of interest at the time, through the methods available at the time, such as calculations possible through reasoning and the use of pen and paper. These methods were eventually applied to study chemical and biochemical systems [148, 149], and they remain popular to this day. However, biologists now have at hand more powerful modeling tools, which also happen to be easier to use. It is time to pick up these tools and use them as complements to intuition and reasoning and quantitative experimental approaches. For those interested in taking up this challenge, a number of resources are available beyond this review, including a primer [72], a wiki with extensive documentation and links to additional resources [73], and an annual short course where hands-on instruction in rule-based modeling is offered [150].
Supplementary Material
This plain-text file provides a BNGL-encoded model specification. The model specified in this file captures early events in IgE receptor signaling illustrated in Figure 2. It includes the rules of Equations (1)–(3), which are depicted graphically in Figure 3. This file can be processed by BioNetGen to obtain the simulation results shown in Figure 4. The molecules, molecular components, and binding and catalytic interactions considered in the model are visualized in Figure 5.
This file provides a machine-readable version of the model of Figure 7; it includes a parameters block and annotation not shown in Figure 7. This file can be processed by BioNetGen.
This file is a preprocessed/fragmented version of the model of Figure 7 and Supplementary file S3. Processing of this file by BioNetGen produces a smaller reaction network (and smaller corresponding system of ODEs) than when BioNetGen processes the unfragmented model specification.
This file provides an example of a model specified using cBNGL, which is an extension of BNGL that allows different reaction compartments in a system to be explicitly represented. A limitation of cBNGL is that compartments are static, i.e., they cannot be created, destroyed or merged.
This file provides an example of a model specified using eBNGL, which is an extension of BNGL that allows interactions to be represented by minimalist rules together with energy patterns, which capture cooperative effects. A model specified using eBNGL is parameterized in terms of thermodynamic quantities, which guarantees that constraints of detailed balance are satisfied.
Acknowledgments
This work was supported by grants R25GM105608, P50GM085273, and P41GM103712 from the National Institute of General Medical Sciences (NIGMS) of the National Institutes of Health (NIH) and by an Expeditions in Computing Grant (award 0926181) from the National Science Foundation (NSF).
References
- 1.Le Novère N, Hucka M, Mi H, Moodie S, Schreiber F, Sorokin A, Demir E, Wegner K, Aladjem MI, Wimalaratne SM, et al. The Systems Biology Graphical Notation. Nat Biotechnol. 2009;27:735–741. doi: 10.1038/nbt.1558. [DOI] [PubMed] [Google Scholar]
- 2.Tyson JJ, Chen KC, Novak B. Sniffers, buzzers, toggles and blinkers: dynamics of regulatory and signaling pathways in the cell. Curr Opin Cell Biol. 2003;15:221–231. doi: 10.1016/s0955-0674(03)00017-6. [DOI] [PubMed] [Google Scholar]
- 3.Tsai TY, Choi YS, Ma W, Pomerening JR, Tang C, Ferrell JE., Jr Robust, tunable biological oscillations from interlinked positive and negative feedback loops. Science. 2008;321:126–129. doi: 10.1126/science.1156951. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Nobeli I, Favia AD, Thornton JM. Protein promiscuity and its implications for biotechnology. Nat Biotechnol. 2009;27:157–167. doi: 10.1038/nbt1519. [DOI] [PubMed] [Google Scholar]
- 5.Schreiber G, Keating AE. Protein binding specificity versus promiscuity. Curr Opin Struct Biol. 2011;21:50–61. doi: 10.1016/j.sbi.2010.10.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Aldridge BB, Burke JM, Lauffenburger DA, Sorger PK. Physicochemical modelling of cell signalling pathways. Nat Cell Biol. 2006;8:1195–1203. doi: 10.1038/ncb1497. [DOI] [PubMed] [Google Scholar]
- 7.Kastritis PL, Bonvin AM. On the binding affinity of macromolecular interactions: daring to ask why proteins interact. J R Soc Interface. 2013;10:20120835. doi: 10.1098/rsif.2012.0835. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Cowan AE, Moraru II, Schaff JC, Slepchenko BM, Loew LM. Spatial modeling of cell signaling networks. Methods Cell Biol. 2012;110:195–221. doi: 10.1016/B978-0-12-388403-9.00008-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Rowland MA, Fontana W, Deeds EJ. Crosstalk and competition in signaling networks. Biophys J. 2012;103:2389–2398. doi: 10.1016/j.bpj.2012.10.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.de Jong H. Modeling and simulation of genetic regulatory systems: a literature review. J Comput Biol. 2002;9:67–103. doi: 10.1089/10665270252833208. [DOI] [PubMed] [Google Scholar]
- 11.Miskov-Zivanov N, Turner MS, Kane LP, Morel PA, Faeder JR. The duration of T cell stimulation is a critical determinant of cell fate and plasticity. Sci Signal. 2013;6:ra97. doi: 10.1126/scisignal.2004217. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Hucka M, Finney A, Sauro HM, Bolouri H, Doyle JC, Kitano H, Arkin AP, Bornstein BJ, Bray D, Cornish-Bowden A, et al. The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models. Bioinformatics. 2003;19:524–531. doi: 10.1093/bioinformatics/btg015. [DOI] [PubMed] [Google Scholar]
- 13.Laidler KJ. Chemical kinetics and the origins of physical chemistry. Arch Hist Exact Sci. 1985;32:43–75. [Google Scholar]
- 14.Gillespie DT, Hellander A, Petzold LR. Perspective: stochastic algorithms for chemical kinetics. J Chem Phys. 2013;138:170901. doi: 10.1063/1.4801941. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Hlavacek WS, Faeder JR, Blinov ML, Perelson AS, Goldstein B. The complexity of complexes in signal transduction. Biotechnol Bioeng. 2003;84:783–794. doi: 10.1002/bit.10842. [DOI] [PubMed] [Google Scholar]
- 16.Mayer BJ, Blinov ML, Loew LM. Molecular machines or pleiomorphic ensembles: signaling complexes revisited. J Biol. 2009;8:81. doi: 10.1186/jbiol185. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Goldstein B, Faeder JR, Hlavacek WS. Mathematical and computational models of immune-receptor signalling. Nat Rev Immunol. 2004;4:445–456. doi: 10.1038/nri1374. [DOI] [PubMed] [Google Scholar]
- 18.Hlavacek WS, Faeder JR, Blinov ML, Posner RG, Hucka M, Fontana W. Rules for modeling signal-transduction systems. Sci STKE. 2006;2006:re6. doi: 10.1126/stke.3442006re6. [DOI] [PubMed] [Google Scholar]
- 19.Danos V, Feret J, Fontana W, Harmer R, Krivine J. Rule-based modelling of cellular signalling. Concur 2007 - Concurrency Theory, Proceedings. 2007;4703:17–41. [Google Scholar]
- 20.Meier-Schellersheim M, Fraser ID, Klauschen F. Multiscale modeling for biologists. Wiley Interdiscip Rev Syst Biol Med. 2009;1:4–14. doi: 10.1002/wsbm.33. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Bachman JA, Sorger P. New approaches to modeling complex biochemistry. Nat Methods. 2011;8:130–131. doi: 10.1038/nmeth0211-130. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Chylek LA, Stites EC, Posner RG, Hlavacek WS. Innovations of the rule-based modeling approach. In: Prokop A, Csukás B, editors. Systems Biology: Integrative Biology and Simulation Tools. Vol. 1. Dordrecht: Springer; 2013. pp. 273–300. [Google Scholar]
- 23.Chylek LA, Harris LA, Tung CS, Faeder JR, Lopez CF, Hlavacek WS. Rule-based modeling: a computational approach for studying biomolecular site dynamics in cell signaling systems. Wiley Interdiscip Rev Syst Biol Med. 2014;6:13–36. doi: 10.1002/wsbm.1245. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Stefan MI, Bartol TM, Sejnowski TJ, Kennedy MB. Multi-state modeling of biomolecules. PLoS Comput Biol. 2014;10:e1003844. doi: 10.1371/journal.pcbi.1003844. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Marchisio MA, Colaiacovo M, Whitehead E, Stelling J. Modular, rule-based modeling for the design of eukaryotic synthetic gene circuits. BMC Syst Biol. 2013;7:42. doi: 10.1186/1752-0509-7-42. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Rangarajan S, Kaminski T, Van Wyk E, Bhan A, Daoutidis P. Language-oriented rule-based reaction network generation and analysis: Algorithms of RING. Comput Chem Eng. 2014;64:124–137. [Google Scholar]
- 27.Ligon TS, Leonhardt C, Radler JO. Multi-level kinetic model of mRNA delivery via transfection of lipoplexes. PLoS One. 2014;9:e107148. doi: 10.1371/journal.pone.0107148. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Broadbelt LJ, Stark SM, Klein MT. Computer-Generated Pyrolysis Modeling - on-the-Fly Generation of Species, Reactions, and Rates. Ind Eng Chem Res. 1994;33:790–799. [Google Scholar]
- 29.Klinke DJ, Broadbelt LJ. Mechanism reduction during computer generation of compact reaction models. Aiche J. 1997;43:1828–1837. [Google Scholar]
- 30.Faulon JL, Sault AG. Stochastic generator of chemical structure. 3. Reaction network generation. J Chem Inf Comput Sci. 2001;41:894–908. doi: 10.1021/ci000029m. [DOI] [PubMed] [Google Scholar]
- 31.Fontana W, Buss LW. The barrier of objects: from dynamical systems to bounded organizations. In: Casti J, Karlqvist A, editors. Boundaries and Barriers. Reading, MA: Addison-Wesley; 1996. pp. 55–116. [Google Scholar]
- 32.Regev A, Silverman W, Shapiro E. Representation and simulation of biochemical processes using the π-calculus process algebra. Pac Symp Biocomput. 2001:459–470. doi: 10.1142/9789814447362_0045. [DOI] [PubMed] [Google Scholar]
- 33.Gillespie DT. The chemical Langevin equation. Journal of Chemical Physics. 2000;113:297–306. [Google Scholar]
- 34.Faeder JR, Blinov ML, Hlavacek WS. Rule-based modeling of biochemical systems with BioNetGen. Methods Mol Biol. 2009;500:113–167. doi: 10.1007/978-1-59745-525-1_5. [DOI] [PubMed] [Google Scholar]
- 35.Blinov ML, Faeder JR, Goldstein B, Hlavacek WS. BioNetGen: software for rulebased modeling of signal transduction based on the interactions of molecular domains. Bioinformatics. 2004;20:3289–3291. doi: 10.1093/bioinformatics/bth378. [DOI] [PubMed] [Google Scholar]
- 36.Faeder JR, Blinov ML, Goldstein B, Hlavacek WS. Rule-based modeling of biochemical networks. Complexity. 2005;10:22–41. [Google Scholar]
- 37.Apgar JF, Witmer DK, White FM, Tidor B. Sloppy models, parameter uncertainty, and the role of experimental design. Mol Biosyst. 2010;6:1890–1900. doi: 10.1039/b918098b. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Barua D, Hlavacek WS, Lipniacki T. A computational model for early events in B cell antigen receptor signaling: analysis of the roles of Lyn and Fyn. J Immunol. 2012;189:646–658. doi: 10.4049/jimmunol.1102003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Tonsing C, Timmer J, Kreutz C. Cause and cure of sloppiness in ordinary differential equation models. Phys Rev E. 2014;90(2) doi: 10.1103/PhysRevE.90.023303. [DOI] [PubMed] [Google Scholar]
- 40.Barua D, Goldstein B. A mechanistic model of early FcεRI signaling: lipid rafts and the question of protection from dephosphorylation. PLoS One. 2012;7:e51669. doi: 10.1371/journal.pone.0051669. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Chylek LA, Holowka DA, Baird BA, Hlavacek WS. An interaction library for the FcεRI signaling network. Front Immunol. 2014;5:172. doi: 10.3389/fimmu.2014.00172. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Barua D, Hlavacek WS. Modeling the effect of APC truncation on destruction complex function in colorectal cancer cells. PLoS Comput Biol. 2013;9:e1003217. doi: 10.1371/journal.pcbi.1003217. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Metropolis N, Rosenbluth AW, Rosenbluth MN, Teller AH, Teller E. Equation of state calculations by fast computing machines. J Chem Phys. 1953;21:1087–1092. [Google Scholar]
- 44.Gunawardena J. Models in biology: 'accurate descriptions of our pathetic thinking'. BMC Biol. 2014;12:29. doi: 10.1186/1741-7007-12-29. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Chelliah V, Laibe C, Le Novère N. BioModels Database: a repository of mathematical models of biological processes. Methods Mol Biol. 2013;1021:189–199. doi: 10.1007/978-1-62703-450-0_10. [DOI] [PubMed] [Google Scholar]
- 46.Hilbert D. Mathematical problems (reprinted) B Am Math Soc. 2000;37:407–436. [Google Scholar]
- 47.Alon U. Network motifs: theory and experimental approaches. Nat Rev Genet. 2007;8:450–461. doi: 10.1038/nrg2102. [DOI] [PubMed] [Google Scholar]
- 48.Ferrell JE, Jr, Pomerening JR, Kim SY, Trunnell NB, Xiong W, Huang CY, Machleder EM. Simple, realistic models of complex biological processes: positive feedback and bistability in a cell fate switch and a cell cycle oscillator. FEBS Lett. 2009;583:3999–4005. doi: 10.1016/j.febslet.2009.10.068. [DOI] [PubMed] [Google Scholar]
- 49.Sessions R. How a 'difficult' composer gets that way. New York Times. 1950 [Google Scholar]
- 50.Beringer J, Arguin JF, Barnett RM, Copic K, Dahl O, Groom DE, Lin CJ, Lys J, Murayama H, Wohl CG, et al. Review of particle physics. Phys Rev D. 2012;86:010001. [Google Scholar]
- 51.Jorissen RN, Walker F, Pouliot N, Garrett TP, Ward CW, Burgess AW. Epidermal growth factor receptor: mechanisms of activation and signalling. Exp Cell Res. 2003;284:31–53. doi: 10.1016/s0014-4827(02)00098-8. [DOI] [PubMed] [Google Scholar]
- 52.Lemmon MA, Schlessinger J. Cell signaling by receptor tyrosine kinases. Cell. 2010;141:1117–1134. doi: 10.1016/j.cell.2010.06.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Wiley HS, Shvartsman SY, Lauffenburger DA. Computational modeling of the EGF-receptor system: a paradigm for systems biology. Trends Cell Biol. 2003;13:43–50. doi: 10.1016/s0962-8924(02)00009-0. [DOI] [PubMed] [Google Scholar]
- 54.Pawson T, Nash P. Assembly of cell regulatory systems through protein interaction domains. Science. 2003;300:445–452. doi: 10.1126/science.1083653. [DOI] [PubMed] [Google Scholar]
- 55.Schlessinger J. SH2/SH3 signaling proteins. Curr Opin Genet Dev. 1994;4:25–30. doi: 10.1016/0959-437x(94)90087-6. [DOI] [PubMed] [Google Scholar]
- 56.Van Roey K, Uyar B, Weatheritt RJ, Dinkel H, Seiler M, Budd A, Gibson TJ, Davey NE. Short linear motifs: ubiquitous and functionally diverse protein interaction modules directing cell regulation. Chem Rev. 2014;114:6733–6778. doi: 10.1021/cr400585q. [DOI] [PubMed] [Google Scholar]
- 57.Walsh CT, Garneau-Tsodikova S, Gatto GJ., Jr Protein posttranslational modifications: the chemistry of proteome diversifications. Angew Chem Int Ed Engl. 2005;44:7342–7372. doi: 10.1002/anie.200501023. [DOI] [PubMed] [Google Scholar]
- 58.Le Novère N, Shimizu TS. STOCHSIM: modelling of stochastic biomolecular processes. Bioinformatics. 2001;17:575–576. doi: 10.1093/bioinformatics/17.6.575. [DOI] [PubMed] [Google Scholar]
- 59.Priami C, Regev A, Shapiro E, Silverman W. Application of a stochastic name-passing calculus to representation and simulation of molecular processes. Inform Process Lett. 2001;80:25–31. [Google Scholar]
- 60.Danos V, Laneve C. Formal molecular biology. Theor Comput Sci. 2004;325:69–110. [Google Scholar]
- 61.Talcott C, Eker S, Knapp M, Lincoln P, Laderoute K. Pathway logic modeling of protein functional domains in signal transduction. Pac Symp Biocomput. 2004:568–580. [PubMed] [Google Scholar]
- 62.Lok L, Brent R. Automatic generation of cellular reaction networks with Moleculizer 1.0. Nat Biotechnol. 2005;23:131–136. doi: 10.1038/nbt1054. [DOI] [PubMed] [Google Scholar]
- 63.Meier-Schellersheim M, Xu X, Angermann B, Kunkel EJ, Jin T, Germain RN. Key role of local regulation in chemosensing revealed by a new molecular interaction-based modeling method. PLoS Comput Biol. 2006;2:e82. doi: 10.1371/journal.pcbi.0020082. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Phillips A, Cardelli L. Efficient, correct simulation of biological processes in the stochastic pi-calculus. Lect Notes Comput Sci. 2007;4695:184–199. [Google Scholar]
- 65.Dematte L, Priami C, Romanel A. The Beta Workbench: a computational tool to study the dynamics of biological systems. Brief Bioinform. 2008;9(5):437–449. doi: 10.1093/bib/bbn023. [DOI] [PubMed] [Google Scholar]
- 66.Moraru II, Schaff JC, Slepchenko BM, Blinov ML, Morgan F, Lakshminarayana A, Gao F, Li Y, Loew LM. Virtual Cell modelling and simulation software environment. IET Syst Biol. 2008:352–362. doi: 10.1049/iet-syb:20080102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Lis M, Artyomov MN, Devadas S, Chakraborty AK. Efficient stochastic simulation of reaction-diffusion processes via direct compilation. Bioinformatics. 2009;25:2289–2291. doi: 10.1093/bioinformatics/btp387. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Maus C, Rybacki S, Uhrmacher AM. Rule-based multi-level modeling of cell biological systems. BMC Syst Biol. 2011;5:166. doi: 10.1186/1752-0509-5-166. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Lopez CF, Muhlich JL, Bachman JA, Sorger PK. Programming biological models in Python using PySB. Mol Syst Biol. 2013;9:646. doi: 10.1038/msb.2013.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Palmisano A, Hoops S, Watson LT, Jones TC, Jr, Tyson JJ, Shaffer CA. Multistate Model Builder (MSMB): a flexible editor for compact biochemical models. BMC Syst Biol. 2014;8:42. doi: 10.1186/1752-0509-8-42. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Chylek LA, Wilson BS, Hlavacek WS. Modeling biomolecular site dynamics in immunoreceptor signaling systems. In: Corey SJ, Kimmel M, Leonard JN, editors. A Systems Biology Approach to Blood. ch. 13. Dordrecht: Springer; 2015. [DOI] [PubMed] [Google Scholar]
- 72.Sekar JA, Faeder JR. Rule-based modeling of signal transduction: a primer. Methods Mol Biol. 2012;880:139–218. doi: 10.1007/978-1-61779-833-7_9. [DOI] [PubMed] [Google Scholar]
- 73.BioNetGen wiki. [ http://bionetgen.org/] [Google Scholar]
- 74.Hogg JS, Harris LA, Stover LJ, Nair NS, Faeder JR. Exact hybrid particle/population simulation of rule-based models of biochemical systems. PLoS Comput Biol. 2014;10:e1003544. doi: 10.1371/journal.pcbi.1003544. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Hu B, Fricke GM, Faeder JR, Posner RG, Hlavacek WS. GetBonNie for building, analyzing and sharing rule-based models. Bioinformatics. 2009;25:1457–1460. doi: 10.1093/bioinformatics/btp173. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Xu W, Smith AM, Faeder JR, Marai GE. RuleBender: a visual interface for rulebased modeling. Bioinformatics. 2011;27:1721–1722. doi: 10.1093/bioinformatics/btr197. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Smith AM, Xu W, Sun Y, Faeder JR, Marai GE. RuleBender: integrated modeling, simulation and visualization for rule-based intracellular biochemistry. BMC Bioinformatics. 2012;13(Suppl 8):S3. doi: 10.1186/1471-2105-13-S8-S3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Wenskovitch JE, Jr, Harris LA, Tapia JJ, Faeder JR, Marai GE. MOSBIE: a tool for comparison and analysis of rule-based biochemical models. BMC Bioinformatics. 2014;15:316. doi: 10.1186/1471-2105-15-316. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Colvin J, Monine MI, Faeder JR, Hlavacek WS, Von Hoff DD, Posner RG. Simulation of large-scale rule-based models. Bioinformatics. 2009;25:910–917. doi: 10.1093/bioinformatics/btp066. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Colvin J, Monine MI, Gutenkunst RN, Hlavacek WS, Von Hoff DD, Posner RG. RuleMonkey: software for stochastic simulation of rule-based models. BMC Bioinformatics. 2010;11:404. doi: 10.1186/1471-2105-11-404. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Sneddon MW, Faeder JR, Emonet T. Efficient modeling, simulation and coarsegraining of biological complexity with NFsim. Nat Methods. 2011;8:177–183. doi: 10.1038/nmeth.1546. [DOI] [PubMed] [Google Scholar]
- 82.MATLAB Documentation. [ http://www.mathworks.com/help/matlab/] [Google Scholar]
- 83.Kholodenko BN, Demin OV, Moehren G, Hoek JB. Quantification of short term signaling by the epidermal growth factor receptor. J Biol Chem. 1999;274(42):30169–30181. doi: 10.1074/jbc.274.42.30169. [DOI] [PubMed] [Google Scholar]
- 84.Chen WW, Schoeberl B, Jasper PJ, Niepel M, Nielsen UB, Lauffenburger DA, Sorger PK. Input-output behavior of ErbB signaling pathways as revealed by a mass action model trained against dynamic data. Mol Syst Biol. 2009;5:239. doi: 10.1038/msb.2008.74. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Blinov ML, Faeder JR, Goldstein B, Hlavacek WS. A network model of early events in epidermal growth factor receptor signaling that accounts for combinatorial complexity. Biosystems. 2006;83(2–3):136–151. doi: 10.1016/j.biosystems.2005.06.014. [DOI] [PubMed] [Google Scholar]
- 86.Creamer MS, Stites EC, Aziz M, Cahill JA, Tan CW, Berens ME, Han H, Bussey KJ, Von Hoff DD, Hlavacek WS, et al. Specification, annotation, visualization and simulation of a large rule-based model for ERBB receptor signaling. BMC Syst Biol. 2012;6:107. doi: 10.1186/1752-0509-6-107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Faeder JR, Blinov ML, Goldstein B, Hlavacek WS. Combinatorial complexity and dynamical restriction of network flows in signal transduction. Syst Biol (Stevenage) 2005;2:5–15. doi: 10.1049/sb:20045031. [DOI] [PubMed] [Google Scholar]
- 88.Perelson AS, Delisi C. Receptor clustering on a cell surface. I. Theory of receptor cross-linking by ligands bearing two chemically identical functional groups. Math Biosci. 1980;48:71–110. [Google Scholar]
- 89.Ullah G, Mak DO, Pearson JE. A data-driven model of a modal gated ion channel: the inositol 1,4,5-trisphosphate receptor in insect Sf9 cells. J Gen Physiol. 2012;140:159–173. doi: 10.1085/jgp.201110753. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.da Silva EZ, Jamur MC, Oliver C. Mast cell function: a new vision of an old cell. J Histochem Cytochem. 2014 doi: 10.1369/0022155414545334. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Holowka D, Baird B. Antigen-mediated IgE receptor aggregation and signaling: a window on cell surface structure and dynamics. Annu Rev Biophys Biomol Struct. 1996;25:79–112. doi: 10.1146/annurev.bb.25.060196.000455. [DOI] [PubMed] [Google Scholar]
- 92.UniProt. [ http://www.uniprot.org/] [Google Scholar]
- 93.Faeder JR, Blinov ML, Hlavacek WS. Graphical rule-based representation of signal-transduction networks. In: Liebrock LM, editor. Proceedings of the 2005 ACM Symposium on Applied Computing; ACM Press; New York. 2005. pp. 133–140. [Google Scholar]
- 94.Chen WW, Niepel M, Sorger PK. Classic and contemporary approaches to modeling biochemical reactions. Genes Dev. 2010;24:1861–1875. doi: 10.1101/gad.1945410. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95.Schnell S. Validity of the Michaelis-Menten equation—steady-state or reactant stationary assumption: that is the question. Febs J. 2014;281:464–472. doi: 10.1111/febs.12564. [DOI] [PubMed] [Google Scholar]
- 96.Chylek LA, Hu B, Blinov ML, Emonet T, Faeder JR, Goldstein B, Gutenkunst RN, Haugh JM, Lipniacki T, Posner RG, et al. Guidelines for visualizing and annotating rule-based models. Mol Biosyst. 2011;7:2779–2795. doi: 10.1039/c1mb05077j. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97.Erickson J, Goldstein B, Holowka D, Baird B. The effect of receptor density on the forward rate constant for binding of ligands to cell surface receptors. Biophys J. 1987;52:657–662. doi: 10.1016/S0006-3495(87)83258-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98.Berg HC, Purcell EM. Physics of chemoreception. Biophys J. 1977;20:193–219. doi: 10.1016/S0006-3495(77)85544-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99.Danos V, Feret J, Fontana W, Harmer R, Krivine J. Rule-based modelling, symmetries, refinements. Lect Notes Comput Sci. 2008;5054:103–122. [Google Scholar]
- 100.Pedersen M, Oury N, Gravill C, Phillips A. Bio Simulators: a web UI for biological simulation. Bioinformatics. 2014;30:1491–1492. doi: 10.1093/bioinformatics/btu050. [DOI] [PubMed] [Google Scholar]
- 101.Danos V, Feret J, Fontana W, Harmer R, Krivme J. Rule-based modelling and model perturbation. Lect Notes Comput Sci. 2009;5750:116–137. [Google Scholar]
- 102.Gruenert G, Ibrahim B, Lenser T, Lohel M, Hinze T, Dittrich P. Rule-based spatial modeling with diffusing, geometrically constrained molecules. BMC Bioinformatics. 2010;11:307. doi: 10.1186/1471-2105-11-307. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 103.Sorokina O, Sorokin A, Douglas Armstrong J, Danos V. A simulator for spatially extended kappa models. Bioinformatics. 2013;29:3105–3106. doi: 10.1093/bioinformatics/btt523. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 104.Angermann BR, Klauschen F, Garcia AD, Prustel T, Zhang F, Germain RN, Meier-Schellersheim M. Computational modeling of cellular signaling processes embedded into dynamic spatial contexts. Nat Methods. 2012;9:283–289. doi: 10.1038/nmeth.1861. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 105.Zhang F, Angermann BR, Meier-Schellersheim M. The Simmune Modeler visual interface for creating signaling networks based on bi-molecular interactions. Bioinformatics. 2013;29:1229–1230. doi: 10.1093/bioinformatics/btt134. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 106.Mjolsness E. Time-ordered product expansions for computational stochastic system biology. Phys Biol. 2013;10:035009. doi: 10.1088/1478-3975/10/3/035009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 107.Bittig AT, Matschegewski C, Nebe J, Stahlke S, Uhrmacher AM. Membrane related dynamics and the formation of actin in cells growing on microtopographies: a spatial computational model. BMC Syst Biol. 2014;8:106. doi: 10.1186/s12918-014-0106-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 108.Tiger CF, Krause F, Cedersund G, Palmer R, Klipp E, Hohmann S, Kitano H, Krantz M. A framework for mapping, visualisation and automatic model creation of signal-transduction networks. Mol Syst Biol. 2012;8:578. doi: 10.1038/msb.2012.12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 109.Cheng HC, Angermann BR, Zhang F, Meier-Schellersheim M. NetworkViewer: visualizing biochemical reaction networks with embedded rendering of molecular interaction rules. BMC Syst Biol. 2014;8:70. doi: 10.1186/1752-0509-8-70. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 110.Blinov ML, Yang J, Faeder JR, Hlavacek WS. Graph theory for rule-based modeling of biochemical networks. Lect Notes Comput Sci. 2006;4230:89–106. [Google Scholar]
- 111.Feret J, Danos V, Krivine J, Harmer R, Fontana W. Internal coarse-graining of molecular systems. Proc Natl Acad Sci U S A. 2009;106:6453–6458. doi: 10.1073/pnas.0809908106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 112.Hindmarsh AC, Brown PN, Grant KE, Lee SL, Serban R, Shumaker DE, Woodward CS. SUNDIALS: suite of nonlinear and differential/algebraic equation solvers. Acm T Math Software. 2005;31:363–396. [Google Scholar]
- 113.Cardelli L, Zavattaro G. Turing universality of the Biochemical Ground Form. Math Struct Comp Sci. 2010;20:45–73. [Google Scholar]
- 114.Danos V, Feret J, Fontana W, Krivine J. Scalable simulation of cellular signaling networks. Lect Notes Comput Sci. 2007;4807:139–157. [Google Scholar]
- 115.Yang J, Monine MI, Faeder JR, Hlavacek WS. Kinetic Monte Carlo method for rule-based modeling of biochemical networks. Phys Rev E. 2008;78:031910. doi: 10.1103/PhysRevE.78.031910. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 116.Gillespie DT. Approximate accelerated stochastic simulation of chemically reacting systems. J Chem Phys. 2001;115:1716–1733. [Google Scholar]
- 117.Monine MI, Posner RG, Savage PB, Faeder JR, Hlavacek WS. Modeling multivalent ligand-receptor interactions with steric constraints on configurations of cell-surface receptor aggregates. Biophys J. 2010;98:48–56. doi: 10.1016/j.bpj.2009.09.043. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 118.Mahajan A, Barua D, Cutler P, Lidke DS, Espinoza FA, Pehlke C, Grattan R, Kawakami Y, Tung CS, Bradbury AR, et al. Optimal aggregation of FcεRI with a structurally defined trivalent ligand overrides negative regulation driven by phosphatases. ACS Chem Biol. 2014;9:1508–1519. doi: 10.1021/cb500134t. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 119.Chylek LA, Akimov V, Dengjel J, Rigbolt KT, Hu B, Hlavacek WS, Blagoev B. Phosphorylation site dynamics of early T-cell receptor signaling. PLoS One. 2014;9:e104240. doi: 10.1371/journal.pone.0104240. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 120.Harris LA, Clancy P. A "partitioned leaping" approach for multiscale modeling of chemical reaction dynamics. J Chem Phys. 2006;125:144107. doi: 10.1063/1.2354085. [DOI] [PubMed] [Google Scholar]
- 121.Yang J, Hlavacek WS. The efficiency of reactant site sampling in networkfree simulation of rule-based models for biochemical systems. Phys Biol. 2011;8:055009. doi: 10.1088/1478-3975/8/5/055009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 122.RuleBender. [ http://visualizlab.org/rulebender/] [Google Scholar]
- 123.Andrews SS. Spatial and stochastic cellular modeling with the Smoldyn simulator. Methods Mol Biol. 2012;804:519–542. doi: 10.1007/978-1-61779-361-5_26. [DOI] [PubMed] [Google Scholar]
- 124.Bartol T, Dittrich M, Faeder J. MCell. In: Jaeger D, Jung R, editors. Encyclopedia of Computational Neuroscience. New York: Springer; 2015. pp. 1–5. [Google Scholar]
- 125.Gardner TS, Cantor CR, Collins JJ. Construction of a genetic toggle switch in Escherichia coli. Nature. 2000;403:339–342. doi: 10.1038/35002131. [DOI] [PubMed] [Google Scholar]
- 126.MatCont. [ http://sourceforge.net/projects/matcont/] [Google Scholar]
- 127.Munsky B, Neuert G, van Oudenaarden A. Using gene expression noise to understand gene regulation. Science. 2012;336:183–187. doi: 10.1126/science.1216379. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 128.Klinke DJ, 2nd, Finley SD. Timescale analysis of rule-based biochemical reaction networks. Biotechnol Prog. 2012;28:33–44. doi: 10.1002/btpr.704. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 129.Borisov NM, Chistopolsky AS, Faeder JR, Kholodenko BN. Domain-oriented reduction of rule-based network models. IET Syst Biol. 2008;2:342–351. doi: 10.1049/iet-syb:20070081. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 130.Conzelmann H, Fey D, Gilles ED. Exact model reduction of combinatorial reaction networks. BMC Syst Biol. 2008;2:78. doi: 10.1186/1752-0509-2-78. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 131.Harmer R, Danos V, Feret J, Krivine J, Fontana W. Intrinsic information carriers in combinatorial dynamical systems. Chaos. 2010;20:037108. doi: 10.1063/1.3491100. [DOI] [PubMed] [Google Scholar]
- 132.Bugenhagen SM, Beard DA. Specification, construction, and exact reduction of state transition system models of biochemical processes. J Chem Phys. 2012;137:154108. doi: 10.1063/1.4758074. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 133.Camporesi F, Feret J, Hayman J. Context-sensitive flow analyses: a hierarchy of model reductions. Lect Notes Bioinformat. 2013;8130:220–233. [Google Scholar]
- 134.Harris LA, Hogg JS, Faeder JR. Compartmental rule-based modeling of biochemical systems. Wint Simul C Proc. 2009:854–865. [Google Scholar]
- 135.Kerr RA, Bartol TM, Kaminsky B, Dittrich M, Chang JCJ, Baden SB, Sejnowski TJ, Stiles JR. Fast Monte Carlo Simulation Methods for Biological Reaction-Diffusion Systems in Solution and on Surfaces. Siam J Sci Comput. 2008;30:3126–3149. doi: 10.1137/070692017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 136.MCell. [ http://www.mcell.org/] [Google Scholar]
- 137.Murphy RF. Cell Organizer: Image-Derived Models of Subcellular Organization and Protein Distribution. Method Cell Biol. 2012;110:179–193. doi: 10.1016/B978-0-12-388403-9.00007-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 138.Buchler NE, Cross FR. Protein sequestration generates a flexible ultrasensitive response in a genetic network. Mol Syst Biol. 2009;5:272. doi: 10.1038/msb.2009.30. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 139.Hogg JS. PhD Thesis. Pittsburgh: University of Pittsburgh; 2013. Advances in Rule-Based Modeling: Compartments, Energy, and Hybrid Simulation, with Application to Sepsis and Cell Signaling. [Google Scholar]
- 140.Ollivier JF, Shahrezaei V, Swain PS. Scalable rule-based modelling of allosteric proteins and biochemical networks. PLoS Comput Biol. 2010;6(11):e1000975. doi: 10.1371/journal.pcbi.1000975. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 141.Danos V, Harmer R, Honorato-Zimmer R. Thermodynamic graph-rewriting. Lect Notes Comput Sci. 2013;8052:380–394. [Google Scholar]
- 142.Lewis GN. A new principle of equilibrium. Proc Natl Acad Sci U S A. 1925;11:179–183. doi: 10.1073/pnas.11.3.179. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 143.Yang J, Bruno WJ, Hlavacek WS, Pearson JE. On imposing detailed balance in complex reaction mechanisms. Biophys J. 2006;91:1136–1141. doi: 10.1529/biophysj.105.071852. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 144.Ederer M, Gilles ED. Thermodynamically feasible kinetic models of reaction networks. Biophys J. 2007;92:1846–1857. doi: 10.1529/biophysj.106.094094. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 145.Gorban AN, Yablonsky GS. Extended detailed balance for systems with irreversible reactions. Chem Eng Sci. 2011;66:5388–5399. [Google Scholar]
- 146.Kiefhaber T, Bachmann A, Jensen KS. Dynamics and mechanisms of coupled protein folding and binding reactions. Curr Opin Struct Biol. 2012;22:21–29. doi: 10.1016/j.sbi.2011.09.010. [DOI] [PubMed] [Google Scholar]
- 147.Python. [ https://http://www.python.org/] [Google Scholar]
- 148.Harcourt AV, Esson W. On the laws of connexion between the conditions of a chemical change and its amount. Proc R Soc Lond. 1865;14:470–474. [Google Scholar]
- 149.van't Hoff JH. Études de Dynamique chimique. Amsterdam: Frederik Muller; 1884. [Google Scholar]
- 150.The q-bio Summer School and Conference wiki; [ http://q-bio.org/] [DOI] [PMC free article] [PubMed] [Google Scholar]
- 151.Goldstein B, Coombs D, Faeder JR, Hlavacek WS. Kinetic proofreading model. Adv Exp Med Biol. 2008;640:82–94. doi: 10.1007/978-0-387-09789-3_8. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
This plain-text file provides a BNGL-encoded model specification. The model specified in this file captures early events in IgE receptor signaling illustrated in Figure 2. It includes the rules of Equations (1)–(3), which are depicted graphically in Figure 3. This file can be processed by BioNetGen to obtain the simulation results shown in Figure 4. The molecules, molecular components, and binding and catalytic interactions considered in the model are visualized in Figure 5.
This file provides a machine-readable version of the model of Figure 7; it includes a parameters block and annotation not shown in Figure 7. This file can be processed by BioNetGen.
This file is a preprocessed/fragmented version of the model of Figure 7 and Supplementary file S3. Processing of this file by BioNetGen produces a smaller reaction network (and smaller corresponding system of ODEs) than when BioNetGen processes the unfragmented model specification.
This file provides an example of a model specified using cBNGL, which is an extension of BNGL that allows different reaction compartments in a system to be explicitly represented. A limitation of cBNGL is that compartments are static, i.e., they cannot be created, destroyed or merged.
This file provides an example of a model specified using eBNGL, which is an extension of BNGL that allows interactions to be represented by minimalist rules together with energy patterns, which capture cooperative effects. A model specified using eBNGL is parameterized in terms of thermodynamic quantities, which guarantees that constraints of detailed balance are satisfied.










