Specification, construction, and exact reduction of state transition system models of biochemical processes

Scott M Bugenhagen; Daniel A Beard

doi:10.1063/1.4758074

. 2012 Oct 17;137(15):154108. doi: 10.1063/1.4758074

Specification, construction, and exact reduction of state transition system models of biochemical processes

Scott M Bugenhagen ¹, Daniel A Beard ^1,^a)

PMCID: PMC3487925 PMID: 23083149

Abstract

Biochemical reaction systems may be viewed as discrete event processes characterized by a number of states and state transitions. These systems may be modeled as state transition systems with transitions representing individual reaction events. Since they often involve a large number of interactions, it can be difficult to construct such a model for a system, and since the resulting state-level model can involve a huge number of states, model analysis can be difficult or impossible. Here, we describe methods for the high-level specification of a system using hypergraphs, for the automated generation of a state-level model from a high-level model, and for the exact reduction of a state-level model using information from the high-level model. Exact reduction is achieved through the automated application to the high-level model of the symmetry reduction technique and reduction by decomposition by independent subsystems, allowing potentially significant reductions without the need to generate a full model. The application of the method to biochemical reaction systems is illustrated by models describing a hypothetical ion-channel at several levels of complexity. The method allows for the reduction of the otherwise intractable example models to a manageable size.

INTRODUCTION

A system model aims to predict the integrated behavior of a number of interacting system components. One class of system with particular relevance to biochemical processes is the discrete-state continuous-time system, where bimolecular association/dissociation reactions and unimolecular conformational changes are represented as discrete events representing the elementary reactions of biochemical systems. With this view, a biochemical network may be represented as a set of discrete reaction events occurring between a set of molecules. The qualitative behavior of a biochemical network may then be modeled as a labeled transition system (LTS), consisting of a set of states and labeled transitions between the states, where the transitions correspond to biochemical reaction events. Additionally, the quantitative behavior may be modeled as a continuous-time Markov chain, which is simply a special case of an LTS where transitions are labeled with rate constants.

A major drawback of modeling a biochemical network as an LTS (or as a CTMC) is the well known state-space explosion problem—a rapid (exponential) growth in the state space of a system with the numbers of molecules and reactions in the system. As the state space of a system becomes large, the process of manually constructing the state-level model (i.e., the LTS or CTMC) for the system becomes tedious and error prone, and memory requirements for storing the state-level model become excessive. For this reason, techniques for rule-based modeling have garnered increasing interest amongst the biochemical modeling community as evidenced by the release of several tools designed for this purpose.¹^,²^,³^,⁴ The basic approach used by many of these tools to deal with exploding state-spaces is to define a high-level model (HLM) describing the reaction rules according to some high-level specification method (HLSM), and to use the HLM along with a kinetic Monte Carlo (KMC) algorithm to simulate sample trajectories of a reaction. Using this approach, states are generated “on-the-fly,” thereby avoiding the need to generate a state-level model.

While the simulation approach (KMC on an HLM) works well for a wide-variety of biochemical systems, there remain certain classes of systems for which this approach leads to significant difficulty. Some examples include: (1) stiff systems, defined as those containing transition rates differing by orders of magnitude; (2) biochemical systems associated with rare events; and (3) multiscale systems integrating biochemical reactions and reaction networks with physiological processes at the cell, tissue, organ, and whole body levels. There exist variations on standard KMC algorithms that are better able to cope with stiff systems;⁵ however, the statistical nature of KMC and its reliance on confidence intervals is a fundamental limitation with regard to its ability to compute bounds on measures associated with rare events. (Since the width of a confidence interval scales by the inverse of the square-root of the sample size, achieving narrow enough confidence intervals on rare events can require huge sample sizes.) In addition, simulating a biochemical network in the context of a multiscale physiological system almost always requires integration with a global system of ODEs and/or PDEs, and it is usually the case that many instances of a single biochemical network occur simultaneously at the physiological scale of interest. For example, a cell may contain thousands of receptor complexes on its membrane, and a tissue or organ may be composed of millions of cells, each expressing a variable number of copies of a gene product associated with a gene regulatory network. Thus, stochastic simulation of a physiological process at a particular scale of interest potentially requires a large number of reaction trajectories per sample run, and therefore is generally not a suitable approach for use in multiscale modeling applications.

Although a large focus on the rule-based modeling of biochemical systems has been geared toward simulation using KMC algorithms, there exists a variety of powerful numerical approaches for the analysis of a HLM that have been developed and applied to many different problems (particularly in the field of operations) over the last few decades. These approaches may be divided into two major classes: (1) those designed to tolerate largeness; and (2) those designed to avoid largeness. Approaches to tolerate largeness are mainly centered upon use of symbolic data structures and associated algorithms,⁶ whereas approaches to avoid largeness are based on either using a HLM to generate only a part of the state-level model (e.g., truncation⁷ and/or probabilistic evaluation⁸^,⁹), or on using a HLM to identify structure in the associated state-level model (e.g., lumpability,¹⁰ invariant manifolds,¹¹ product form solutions,¹² etc.). A major advantage of all these approaches over stochastic simulation is that they allow direct computation of measures of interest (subject to an approximation error) such that rare events are much less of an issue. In addition, since the probability distribution of the states of a system is computed directly, incorporating a biochemical network into a multiscale physiological model is straightforward.

Here, we present an approach for largeness avoidance based on model structure identification from a HLM. Specifically, we present a formal description of a HLSM for models of biochemical systems composed of multiple interacting components, we describe how a HLM specified according to our method may be used to identify symmetries and independent subsystems, and we describe methods for using the identified structure to construct a reduced LTS describing the system. Finally, we illustrate the application of the method with examples.

RELATED METHODS

There exist a number of different classes of HLSMs for the specification of an LTS/CTMC, each with different advantages and disadvantages for particular application domains.¹³ Many techniques developed for applications in the fields of dependability and performability of computer and communication systems have been adopted in the field of computational biology. One popular approach is to use stochastic Petri nets (SPNs). A SPN consists of a set of places P, a set of transitions T, and a set of arcs A ⊆ (P × T) ∪ (T × P) between places and transitions. Each place contains an integer number of tokens, and the number of tokens in a place p_i is its marking. The state of a system specified by a SPN is the vector of markings M = (p₁, …, p_n). In representing a well-mixed biochemical reaction network as an SPN, places represent molecular species, tokens represent individual molecules, and transitions represent molecular reaction events.¹⁴

One drawback to representing a biochemical reaction network using a classical SPN is that it requires that all molecular species be specified and assigned to places. Since many biochemical systems give rise to a combinatorial blow-up of reachable species, enumerating the reachable species may be difficult or impossible. A second drawback is the well-mixed assumption required in the classical SPN representation. Because of this assumption, it is not possible to distinguish individual molecules on the basis of position or to model systems with compartments.

Extensions of SPN models using composition operations allow specification of hierarchical models helping to overcome the above limitations. Using these operators, molecular species can be specified componentwise such that the state of an individual domain of a species is represented by a much simpler atomic-level SPN. Then the global state of a system is given by the composition of the states of each of the individual atomic-level submodels. Interactions between components are accounted for by either synchronization of transitions¹⁵ or state-variable sharing.¹⁶ Using this approach, specification of a biochemical system does not require specification of the molecular species. In addition, replicates of submodels may represent components of different spatial orientations or in different compartments so that the well-mixed assumption need not hold. The structure of a compositional SPN model containing replicate components may give rise to symmetries that may be used to reduce the state space of a model. Such an approach is implemented in the Möbius framework¹⁶ and has been used to model Ca²⁺ signaling complexes.¹⁷

Another approach that avoids the need to enumerate molecular species in a biochemical system is that of rule-based modeling. In the field of biological systems modeling, the term rule-based modeling refers broadly to the class of specification methods that do not require the enumeration of all molecular species in a biochemical system (thus, compositional SPNs may be classified as a rule-based modeling approach under this definition). The methods leverage on the fact that the kinetics of elementary events operating on a molecular species usually do not depend on the species itself (i.e., the global state of a molecular complex), but rather on the limited context of a partially defined species. In this approach, reaction rules are generally specified as

if conditions met then reaction fires with rate λ,where conditions are local to a partially defined species.

There exist a variety of languages and tools to specify and construct a rule-based model.¹^,²^,³^,⁴ Although the rule-based modeling framework avoids complexity associated with system specification, the state-space of networks associated with a rule-based model is often very large due to the combinatorial nature of such a system. Therefore, methods to reduce the state-space of a rule-based model directly from the system specification are necessary to obtain tractable models. Previously, a method and theory for reducing a rule-based model specified in the Kappa language³ based on differential¹⁸ and stochastic fragments¹⁹^,²⁰ has been suggested. In this approach, a dependency analysis performed on an annotated contact map is used to analyze correlations between reaction sites in a molecular complex in order to identify a minimal partitioning (i.e., a partitioning with the finest possible partition classes) of the sites. This partitioning is used to identify independent subsets of reaction sites which are used to define molecular fragments representing aggregations of molecular species.

One drawback with Kappa³^,¹⁹^,²⁰ and other similar rule-based modeling languages is the assumption that the order of agents and reaction sites does not matter. Thus, biochemical systems with spatial structure cannot be specified using these languages. Here, we propose a high-level rule-based specification method that allows for the specification of position-dependent reaction rules. Individual components (agents) in a system are specified in a component state-space graph as finite-state automata, and interactions between components occur as synchronization of transitions and/or as functional dependencies of the rate of transitions on the state of nonlocal components. These functional dependencies are specified as a hypergraph called a dependency graph. The dependency graphs are used to define a system characteristic graph which may be used to identify symmetries among system components, allowing for an automated reduction of the state-space associated with the model.

In a previous application of symmetry identification in compositional models with general components,²¹ interactions between components are based on state-variable sharing. In that approach, a model composition graph is described in which shared state-variables represented as vertices of a graph are used to connect components in the system. The symmetries among components are then given as the automorphism group of the resulting simple graph. Our approach differs from that of Douglas Obal II et al.²¹ in that it does not require a user to define shared state-variables, which can be cumbersome and error prone. Instead, connections between components follow directly from the dependency graphs used to define reaction rules. The dependency graphs are used to construct a hypergraph called a system characteristic graph, the automorphism group of which determines the component symmetries. Because this graph is constructed directly from the reaction rules, it can be mapped to from any rule-based specification language and does not require any additional input from the user to define.

SYSTEM SPECIFICATION AND GENERATION

State-level model

As described in the introduction, a biochemical reaction may be viewed as a discrete event occurring in a molecule or between sets of molecules such that the possible behaviors of a system may be described completely by a set of discrete states and transitions between states. Thus, the qualitative behavior of a biochemical reaction system may be described by a directed graph called an LTS, defined as

L = (F, R, Δ),

where

F is a set of system states;
R is a set of transition labels; and
Δ ⊆ F × R × F is the transition relation.

An example of a simple LTS is shown in Figure 1.

Example LTS. States in F are given as circles and transitions in Δ are given as labeled arrows.

In addition, quantitative measures on a biochemical system may be obtained by mapping L to a homogeneous CTMC using a weighting function $W_{h} : R \to R_{\geq 0}$ in the case of constant transition rates, or to an inhomogeneous CTMC using a weighting function $W_{i} : R \to R_{\geq 0}^{t}$ in the more general case of time-varying rate constants ( $R_{\geq 0}^{t}$ denotes the set of positive real-valued functions of time t). The resulting CTMC may be used to derive the master equation describing the time-evolution of the probabilities of the system occupying each of the states in F. This is done by imposing some order on the system states using an indexing function J such that J: I → F, where I = {1, 2, …, |F|} is the index set. The master equation may then be written as

\frac{d P}{d t} = A P,

(1)

where P = [P_i] is a column vector defined such that P_i, under the restriction that P_i > 0 and $\sum_{i} P_{i} = 1$ , is the probability of the system occupying state J(i) at time t, and A = [A_ij] is a matrix of transition rates given by

A_{i j} = \{\begin{matrix} W (r, t) \\ - \sum_{i \neq j} A_{i j} \\ 0 \end{matrix} \begin{matrix} for J (j) \overset{r}{\to} J (i) \in Δ, \\ for i = j, and \\ otherwise, \end{matrix}

(2)

where W(r, t) is the weight (transition rate) associated with label r ∈ R at time t.

The LTS and associated CTMC are analytical (state-level) representations of a system that may be used directly to analyze its behavior; however, they are far too cumbersome to be of much value in system specification. For that, we rely on a high-level representation of the system as described below.

High-level model

A HLSM allows specification of complex models in a compact and intuitive form with the purpose of simplifying the modeling process and avoiding modeling errors. A HLM should contain all the information required to automatically generate a state-level model of the system. Our approach formulates a system as a set of components that may interact via both synchronizing transitions (transitions occurring simultaneously among a subset of components) and/or functional transitions (transitions in a subset of components with rates that depend on the state of disjoint subsets of components). These types of interactions are common in many biochemical processes, especially in signaling complexes, enzymes, ion channels, etc.

Component state space graph

The first step of specifying a biochemical reaction system using our approach is to specify the components. This is done using a structure called the component state space graph C, defined as

C = (S, Q),

where

S = {s₁, s₂, …, s_n} is a set of n component states;
Q = {Q₁, Q₂, …, Q_m}, with Q_j = (X_j, Y_j, Z_j) for j = 1, …, m, is a set of component transitions where each X_j ⊆ S is a set of source states, each Y_j is a mapping Y_j: X_j → S of source states to destination states, and each Z_j is a mapping Z_j: X_j → N such that Z_j(s_i) is the multiplicity, or number of occurrences, of components in state s_i required to enable transition Q_j.

(Here, the notation $X_{j}^{(1)}, X_{j}^{(2)}, ..., X_{j}^{(l \leq n)} \overset{Z_{j} (X_{j}^{(1)}), Z_{j} (X_{j}^{(2)}), ..., Z_{j} (X_{j}^{(l \leq n)})}{\to} Y_{j} (X_{j}^{(1)}), Y_{j} (X_{j}^{(2)}), ..., Y_{j} (X_{j}^{(l \leq n)})$ is used to represent a component transition Q_j.) The definition above implies a graph topology in which the component states S represent the vertices and the component transitions Q represent connections (arcs) of the graph. An important aspect of the above definition is that it allows specification of synchronized transitions. Any component transition Q_j with more than one element in X_j or with Z_j(s_i) > 1 for any s_i ∈ X_j is a synchronizing transition.

Each component state in S may be associated with a component type by finding the unique partition {S₁, S₂, …, S_r} ⊢ S of S into r maximal connected subsets (i.e., the coarsest partition such that every pair of vertices s ∈ S_i are connected via some transition Q_j ∈ Q) and defining a function T: S → {1, 2, …, r} such that T(s) = i for all s ∈ S_i. Then, T(s) is the component type associated with component state s ∈ S. The usefulness of this definition is described below. (The practical application of these definitions is illustrated below in the section Simple Example).

System domain

The next step in system specification is to define the numbers and positions of system components. The system domain D = {d₁, d₂, …, d_p} is an indexed set of p component positions or places. The state of the system is described by a mapping of each element (position) in D to a component state in S. That is, the system state is a function f from D to S. (For convenience, f may be represented as a p-tuple.) The type of component assigned to each position d_i ∈ D is given by T( f_init(d_i)), where f_init is an initial system state. In general, the set F of system states reachable from some initial system state f_init is a proper subset of S^D (where S^D = {f | f: D → S}) such that the subset of possible component states assigned to d_i is restricted to $S_{T (f_{i n i t} (d_{i}))}$ . F may be further restricted by the existence of synchronizing and/or functional interactions.

Dependency graph

The final step in system specification is to define the functional dependencies of component transitions; in other words, to define a set of rules for mapping a particular component transition to a transition rate. These rules are given in the form of a directed hypergraph with labeled hyperarcs, which we refer to as the dependency graph D(α) of a transition rate α, defined as

D (α) = (D, E, L),

where

D = {d₁, d₂, …, d_p} is the system domain;
E = {E₁, E₂, …, E_q}, with E_y = (A_y, B_y) for y = 1, …, q, is a set of hyperarcs with tail sets A_y ∈ P(D) and head sets B_y ∈ P(D), where P(D) is the power set of D;
L = {L_y ∈ L|L_y: B_y → P(S)} for y = 1, …, q, is a set of labeling functions mapping elements of the head sets of each E_y to elements (subsets of component states) in the power set of S.

The hyperarcs E_y ∈ E represent dependencies of transitions in components pointed to by tail sets A_y on the state of components pointed to by head sets B_y. All A_y are singleton for local (non-synchronizing) transitions. Synchronizing transitions have E with each A_y containing d_x for each component involved in the transition. Each element d_x in each B_y is labeled according to a subset of states required to enable a transition in the components of A_y at a specific rate constant α. Two E_y ∈ E may have identical tail sets if there is more than one condition under which a specific rate constant applies for a particular A_y. The set E is empty if there are no functional dependencies of the corresponding rate constant.

Generating the state-level model

The HLM of a system is used to generate the state-space graph S of a system, which is a modified LTS describing the set of all system states f ∈ S^D that are reachable from an initial system state f_init (i.e., it is the reachability graph spanning the set of all f ∈ S^D such that there exists a path starting from f_init and ending at f ). S is defined as follows:

S = (D, F, R, Δ, C, D, f_{i n i t}),

where

D = {d₁, d₂, …, d_p} is the system domain;
F ⊆ S^D, with S^D = {f | f: D → S}, is a set of system states;
R = {R₁, R₂, …, R_m} is a family of sets of transition labels, each R_j being associated with a component transition Q_j;
$Δ \subseteq F \times ⋃_{j} R_{j} \times F$ is the transition relation;
C is the component state-space graph;
$D = {D (r_{k}^{(j)}) | r_{k}^{(j)} \in ⋃_{j} R_{j}}$ is the set of all dependency graphs; and
f_init ∈ F is an initial system state.

An algorithm (modified from Algorithm 2.1 in Ref. 22 to account for the HLM) for generating the state-space graph S is given below:

An algorithm for generating the full state-space graph.

\begin{matrix} 1 : Set u n p r o c e s s e d = {f_{i n i t}} \\ 2 : Set F = {f_{i n i t}} \\ 3 : Set Δ = ⊘ \\ 4 : while u n p r o c e s s e d \neq ⊘ do \\ 5 : Select any f \in u n p r o c e s s e d and set u n p r o c e s s e d \\ = u n p r o c e s s e d ∖ {f} \\ 6 : for all Q_{j} \in {Q_{j} \in Q | (X_{j}, Z_{j}) \subseteq f [D]} do \\ 7 : for all D^{'} \in \{D^{'} \in P (D) | f [D^{'}] = (X_{j}, Z_{j})\} do \\ 8 : Set f^{'} = f \\ 9 : for all D (r_{k}^{(j)}) \in \{D (r_{k}^{(j)}) \in D | r_{k}^{(j)} \in R_{j}\} do \\ 10 : if E = ⊘ then goto 13 \\ 11 : for all E_{y} \in \{E_{y} \in E | A_{y} = D^{'}\} do \\ 12 : if f (d) \in L_{y} (d) \forall d \in B_{y} then \\ 13 : for all d \in D^{'} do \\ 14 : Set f^{'} (d) = Y_{j} (f (d)) \\ 15 : Set Δ = Δ \cup \{(f, r_{k}^{(j)}, f^{'})\} \\ 16 : if f^{'} \notin F then \\ 17 : Set u n p r o c e s s e d = u n p r o c e s s e d \cup \{f^{'}\} \\ 18 : Set F = F \cup \{f^{'}\} \\ 19 : return S = (D, F, R, Δ, C, D, f_{i n i t}) \end{matrix}

Open in a new tab

In line 6 and 7 of the above algorithm, the notation f [D] and f [D^′] denote the multiset image (rather than the usual set image) of D and D^′, respectively, under f. (A multiset is a pair (A, m) where A is a set, or underlying set of elements, and m: A → N is a multiplicity function mapping elements in A to the number of occurrences of those elements in the multiset.) The relations in lines 6 and 7 compare these multiset images with the multiset (X_j, Z_j). In the following, the image f [A] of a set A under a function f should be understood as the multiset image when given in the context of a relation involving a multiset, and as the usual set image otherwise.

S may be mapped to a CTMC by assigning rates to the transition labels using a weighting function as described above for a general LTS.

Simple example

To illustrate the high-level specification of a simple biochemical system using our method, consider a reaction volume containing three molecules, two labeled A and one labeled B, as depicted in Figure 2a. Molecules A can be in one of two conformational states, folded or unfolded, and when in the folded conformation can bind to molecule B. Molecule B has only one conformation and binds to folded molecules A. The possible molecular states and state transitions (reaction events) of all molecules (A and B) in this reaction volume are given by the component state space graph C illustrated in Figure 2b with transitions tabulated in Table 1. There, the components are defined as molecule A and B binding sites and molecule A conformation. Because, there are two molecules A and one molecule B in the system, the system domain must contain five elements (two molecule A binding sites, two molecule A conformations, and one molecule B binding site), and each element in the domain is assigned to an appropriate component type by the initial state f_init.

Example biochemical reaction system. (a) An illustration of a small reaction volume containing two molecules of type A and one of type B. Molecules A can bind to molecules B, but only when in a folded conformation. (b) An illustration of the component state space graph C associated with the reaction system depicted in panel (a). Circles represent component states S and arrows represent component transitions Q which are labeled with a multiplicity. The system domain D and an initial system state f_init are also shown.

Table 1.

Component transitions Q of the component state-space graph C for the example system illustrated in Figure 2. The mappings Y_j and Z_j are listed such that the first element in a set X_j maps to the first elements listed in Y_j and Z_j, the second element listed in a set X_j maps to the second elements listed in Y_j and Z_j, etc. Also given in the table are the sets of labels R_j associated with each transition j.

	j	X_j	Y_j	Z_j	R_j
	1	{1}	(2)	(1)	{r₁⁽¹⁾}
Q =	2	{2}	(1)	(1)	{r₁⁽²⁾}
	3	{3,5}	(4,6)	(1,1)	{r₁⁽³⁾}
	4	{4,6}	(3,5)	(1,1)	{r₁⁽⁴⁾}

Open in a new tab

This system contains functional transitions and therefore requires rules for the mapping of those transitions to transition rates in the form of dependency graphs. Specifically, the unfolding of a molecule A ( $s_{2} \overset{1}{\to} s_{1}$ ) at rate $r_{1}^{(2)}$ requires that it is unbound, and the binding of a molecule A to B ( $s_{3}, s_{5} \overset{1, 1}{\to} s_{4}, s_{6}$ ) at rate $r_{1}^{(3)}$ requires that A is in the folded conformation. The dependency graphs $D (r_{1}^{(2)})$ and $D (r_{1}^{(3)})$ for the rate constants $r_{1}^{(2)}$ and $r_{1}^{(3)}$ are shown in Figure 3 and tabulated in Table 2. (In this system, it is possible to define a different component state space graph with courser components so that the functional transitions described above become non-functional, or constant transitions.) The component state space graph, system domain, dependency graphs, and initial state completely describe the HLM for this simple system, and may be used to generate a state-space graph S using Algorithm 1. The generated graph is shown in Figure 4a.

Table 2.

Dependency graphs for the rate constants of the example system illustrated in Figure 2.

	y	A_y	B_y	L_y
$D (r_{1}^{(2)})$	1	{1}	{3}	({3})
	2	{2}	{4}	({3})
$D (r_{1}^{(3)})$	1	{3,5}	{1}	({2})
	2	{4,5}	{2}	({2})

Open in a new tab

State-space graphs associated with the system illustrated in Figure 2. (a) Full state-space graph S. Circles represent the system states F and arrows represent the system transitions Δ. (b) Reduced state-space graph $\tilde{S}$ . Circles represent system state equivalence class representatives $\tilde{F}$ and arrows represent the transitions $\tilde{Δ}$ between equivalence classes. The coefficients of the rate labels are the multiplicities of the transitions.

EXACT REDUCTION METHODS

Bisimilarity

A bisimulation of an LTS L = (F, R, Δ) is an equivalence relation ∼ over F such that for any f₁, f₂ ∈ F, f₁ ∼ f₂ implies that f₁ and f₂ behave identically. In this case, f₁ and f₂ are said to be bisimilar.

There are two principle forms of bisimulation on an LTS, namely forward and backward bisimulation.²³ Forward bisimulation requires equivalence of outgoing transitions. More specifically, letting Λ(f₁, f₂) be the multiset of transition labels associated with all transitions from f₁ to f₂ in Δ, ∼_f is a forward bisimulation if for all f₁, f₂ ∈ F such that f₁ ∼ f₂, and for all equivalence classes C ∈ F/ ∼_f, the following holds:

\underset{f^{'} \in C}{⊎} Λ (f_{1}, f^{'}) = \underset{f^{'} \in C}{⊎} Λ (f_{2}, f^{'}),

where ⊎ denotes the multiset union. (The multiset union of two multisets (A,m_A) and (B,m_B) is the multiset (A ∪ B, m_A + m_B).) This condition implies ordinary, or strong, lumpability of any CTMC to which the LTS is mapped.²⁴ An example of an LTS that can be reduced via forward bisimulation (but not via backward bisimulation) is shown in Figure 5a.

Examples of systems that can be reduced via bisimilarity (adapted from Ref. 23). In each example, states f₂ and f₃, and states f₄ and f₅, are equivalent. (a) can be reduced by forward, but not backward bisimulation, (b) can be reduced by backward, but not forward bisimulation, and (c) can be reduced by both forward and backward bisimulation.

The lumping of the CTMC associated with an LTS reduced by forward bisimulation allows for the exact calculation of the probability distribution of the aggregated states and is valid for any initial probability distribution of the states in the original system. However, in general it is not possible to recover the full solution to the original system from the solution to the reduced system (that is, the lumping cannot be inverted).

By contrast, backward bisimulation requires equivalence of incoming transitions. More specifically, ∼_b is a backward bisimulation if for all f₁, f₂ ∈ F such that f₁ ∼_bf₂, and for all equivalence classes C ∈ F/ ∼_b, the following holds:

$\underset{f^{'} \in C}{⊎} Λ (f^{'}, f_{1}) = \underset{f^{'} \in C}{⊎} Λ (f^{'}, f_{2})$ and
$\underset{f^{'} \in F}{⊎} Λ (f_{1}, f^{'}) = \underset{f^{'} \in F}{⊎} Λ (f_{2}, f^{'}) .$

Condition 2 above requires an equivalent multiset of exit transitions of backward bisimilar states (note that this condition is implicit in the definition of forward bisimulation). Condition 1 and 2 together imply exact lumpability of any CTMC to which the LTS is mapped.²⁵ An example LTS that can be reduced via backward bisimulation (but not forward bisimulation) is shown in Figure 5b.

As with forward bisimulation, the lumping of the CTMC associated with an LTS reduced by backward bisimulation allows for the exact calculation of the probability distribution of the aggregated states. However, the solution is only valid when the initial probability distribution of the states in the original system satisfies the condition that for all f₁, f₂ ∈ F such that f₁ ∼_bf₂, P(f₁) = P(f₂), where P(·) is the initial occupancy probability associated with a state. If this condition is not satisfied, the solution will not be valid. One advantage of backward bisimulation over forward bisimulation is that all states belonging to a given equivalence class will have the same occupancy probability so that it is possible to compute the solution to the full original system directly from the solution to the reduced system.

An efficient algorithm exists for computing the coarsest forward (or backward) bisimulation of an LTS;²⁶ however, this algorithm operates on the full state-space and therefore cannot be used to reduce a system a priori (i.e., before state-space generation). It is possible to compute a bisimulation directly from a HLM by considering states that are both forward and backward bisimilar. This is the basis of the symmetry reduction method described below. An example of an LTS containing states that are both forward and backward bisimilar, and can therefore be reduced via the symmetry reduction method, is shown in Figure 5c.

Symmetry reduction method

The symmetry reduction technique is a method for the state-space reduction of a system that works by exploiting symmetries in the system. A complete presentation and detailed overview of the method can be found in the articles of vol. 9, no. 1/2 of the Formal Methods in System Design Journal as well as in T. Junttila's doctoral thesis.²² Here, the necessary theory behind the method is briefly reviewed and then applied to the system description formalism developed above.

A symmetry is an automorphism of the state-space graph S. More formally, it is a permutation π of the set of system states F such that, for any f, f ^′ ∈ F and any $r \in ⋃_{j} R_{j}$ , the following holds:

f \overset{r}{\to} f^{'} \in Δ \Leftrightarrow π (f) \overset{r}{\to} π (f^{'}) \in Δ .

In other words, a symmetry is a permutation of F which preserves Δ. Since symmetries are permutations, it follows that the set of all symmetries of a state-space graph S forms an automorphism group, denoted Aut(S), under the function composition operation. Aut(S) induces an equivalence relation ∼ in F such that for any two states f₁, f₂ ∈ F, f₁ ∼ f₂ implies that π(f₁) = f₂ for some π ∈ Aut(S). Furthermore, f₁ ∼ f₂ implies both forward and backward bisimilarity of f₁ and f₂.

Symmetries can arise in biochemical systems containing replicated components. In the case that the well-mixed assumption applies for a particular subset of components (that is, the position of the components in the system has no effect on dynamics), every permutation of that subset induces a symmetry in the state-space graph. If however the transition rate in one component depends on the state and position of neighboring components, not all permutations of the components will induce a symmetry in the state-space graph, just the subgroup of those permutations corresponding to spatial symmetries among the components.

It is possible to find Aut(S) without generating the full state-space graph (which may be impractical or impossible for large models). This is done by identifying a system description level group G of symmetries from the HLM along with a group action h : G → Sym(F) of G on F satisfying

h (g) = π \Rightarrow f (g (d)) = π (f) (d),

for all g ∈ G, π ∈ Aut(S), f ∈ F and d ∈ D. G can be readily identified from the set of dependency graphs D according to the following procedure:

For each dependency graph $D (r_{k}^{(j)})$ , assign a unique label (e.g., $r_{k}^{(j)}$ ) to its hyperarcs.
Define a new graph, called the system characteristic graph, as $G = (V, U)$ , where
1. V = D and
2. $U = ⋃_{D (r_{k}^{(j)}) \in D} E_{k}^{(j)} .$
Label each d ∈ V by T(f_init(d)).
Find the automorphism group of G.

The characteristic graph describes all dependencies among all components in a system and is used to determine which permutations of the system domain (in other words, which rearrangements of the components) leave system dynamics invariant. The group of all these rearrangements is the automorphism group Aut(G) of the graph. Note that non-identical components (i.e., components of different type T) are labeled differently (see step 3 in the above procedure) so that they cannot be mapped to one another; thus the calculated automorphism group is actually a subgroup of the full automorphism group of the graph. More specifically, G is the subgroup stabilizing the component labels. The characteristic graph G of the example shown in Figure 2 is illustrated in Figure 6.

Characteristic graph G for the example system illustrated in Figure 2.

Aut(G) can be computed using any general purpose graph automorphism group tool (in our implementation, we use the NAUTY²⁷ tool). (As the automorphism group of a graph can be very large, most tools including NAUTY²⁷ return the group in the form of a set of generating permutations.) However, most available general-purpose tools require graphs with simple uncolored edges (though vertex colors are usually allowed). A procedure for converting a characteristic graph G to an equivalent simple (directed) graph $\tilde{G}$ is developed in Appendix A.

Once G is found, it can be used to generate a reduced state-space graph $\tilde{S}$ , defined as

\tilde{S} = (D, \tilde{F}, R, \tilde{Δ}, M, C, D, f_{i n i t}),

where

D = {d₁, d₂, … , d_p} as defined in S,
$\tilde{F}$ is a transversal of the quotient set F/G,
R = {R₁, R₂, …, R_m} is a family of sets of labels as defined in S,
$\tilde{Δ} \subseteq \tilde{F} \times ⋃_{j} R_{j} \times \tilde{F}$ is the transition relation,
$M : \tilde{Δ} \to N$ is a multiplicity function,
C is the component state-space graph,
$D = {D (r_{k}^{(j)}) | r_{k}^{(j)} \in ⋃_{j} R_{j}}$ is the set of all dependency graphs, and
$f_{i n i t} \in \tilde{F}$ is an initial system state.

Note the changes in the definitions of $\tilde{F}$ and $\tilde{Δ}$ and the introduction of the function M. The set of system states $\tilde{F}$ is now a transversal (more specifically, a system of distinct representatives) of the quotient set F/G, and transitions $\tilde{Δ}$ now occur between equivalence class representatives in $\tilde{F}$ . M becomes necessary since multiple combinations of components can participate in a single transition in $\tilde{Δ}$ (whereas only a single combination of components participated in a transition in Δ).

An algorithm (based on Algorithm 2.2 in Ref. 22) for generating the reduced state-space graph $\tilde{S}$ of a system is given below:

An algorithm for generating the reduced state-space graph.

\begin{matrix} 1 : Set u n p r o c e s s e d = \{f_{i n i t}\} \\ 2 : Set \tilde{F} = \{f_{i n i t}\} \\ 3 : Set \tilde{Δ} = ⊘ \\ 4 : while u n p r o c e s s e d \neq ⊘ do \\ 5 : Select any f \in u n p r o c e s s e d and set u n p r o c e s s e d \\ = u n p r o c e s s e d ∖ \{f\} \\ 6 : for all Q_{j} \in \{Q_{j} \in Q | (X_{j}, Z_{j}) \subseteq f [D]\} do \\ 7 : for all D^{'} \in \{D^{'} \in P (D) | f [D^{'}] = (X_{j}, Z_{j})\} do \\ 8 : Set f^{'} = f \\ 9 : for all D (r_{k}^{(j)}) \in \{D (r_{k}^{(j)}) \in D | r_{k}^{(j)} \in R_{j}\} do \\ 10 : if E = ⊘ then go to 13 \\ 11 : for all E_{y} \in \{E_{y} \in E | A_{y} = f [D^{'}]\} do \\ 12 : if f (d) \in L_{y} (d) \forall d \in B_{y} then \\ 13 : for all d \in D^{'} do \\ 14 : Set f^{'} (d) = Y_{j} (f (d)) \\ 15 : if \exists \tilde{f} \in \tilde{F} | \tilde{f} \sim f^{'} then \\ 16 : Set f^{'} = \tilde{f} \\ 17 : else \\ 18 : Set u n p r o c e s s e d = u n p r o c e s s e d \cup \{f^{'}\} \\ 19 : Set \tilde{F} = \tilde{F} \cup \{f^{'}\} \\ 20 : if (f, r_{k}^{(j)}, f^{'}) \notin \tilde{Δ} then \\ 21 : Set \tilde{Δ} = \tilde{Δ} \cup \{(f, r_{k}^{(j)}, f^{'})\} \\ 23 : Set M ((f, r_{k}^{(j)}, f^{'})) = 1 \\ 24 : else \\ 25 : Set M ((f, r_{k}^{(j)}, f^{'})) = M ((f, r_{k}^{(j)}, f^{'})) + 1 \\ 26 : return \tilde{S} = (D, \tilde{F}, R, \tilde{Δ}, M, C, D, f_{i n i t}) \end{matrix}

Open in a new tab

The reduced state-space graph $\tilde{S}$ of the system illustrated in Figure 2 generated using Algorithm 2 is shown in Figure 4b. Compare this with the full state-space graph S generated using Algorithm 1, shown in Figure 4a.

The critical difference between Algorithm 1 and Algorithm 2 lies in line 15 of Algorithm 2. Line 15 tests whether a transition destination state f ^′ is equivalent to any representative system state $\tilde{f} \in \tilde{F}$ under the relation ∼. If it is, then f ^′ is set equal to $\tilde{f}$ rather than adding f ^′ to the set of system states as is done in Algorithm 1. Methods for testing the equivalence of two states f₁ and f₂ are discussed in Appendix B.

The reduced state-space graph $\tilde{S}$ may be mapped to a CTMC using the general procedure described above of assigning rates to the transition labels using a weighting function; however, the multiplicity function M must also be accounted for. In this case, the master equation associated with the resulting CTMC is derived by imposing an order on $\tilde{F}$ using an indexing function $\tilde{J}$ such that $\tilde{J} : \tilde{I} \to \tilde{F}$ , where $\tilde{I} = \{1, 2, ..., | \tilde{F} |\}$ is the index set, and is given by

\frac{d \tilde{P}}{d t} = \tilde{A} \tilde{P},

(3)

where $\tilde{P} = [{\tilde{P}}_{i}]$ is a column vector defined such that ${\tilde{P}}_{i}$ , under the restriction that ${\tilde{P}}_{i} > 0$ and $\sum_{i} {\tilde{P}}_{i} = 1$ , is the probability of the system occupying a state in equivalence class $\tilde{J} (i)$ at time t, and $\tilde{A} = [{\tilde{A}}_{i j}]$ is a matrix of transition rates given by

{\tilde{A}}_{i j} = \{\begin{matrix} M (δ) \cdot W (r, t) \\ - \sum_{i \neq j} {\tilde{A}}_{i j} \\ 0 \end{matrix} \begin{matrix} for δ = \tilde{J} (j) \overset{r}{\to} \tilde{J} (i) \in \tilde{Δ}, \\ for i = j, and \\ otherwise, \end{matrix}

(4)

where W(r, t) is the weight (transition rate) associated with label $r \in ⋃_{j} R_{j}$ at time t. Since symmetries imply backward (as well as forward) bisimilarity of equivalent states under ∼, the lumping of states in the resulting CTMC is exact such that the occupancy probability P_i of any state $J (i) \in \tilde{J} (j)$ is given by

P_{i} = \frac{P_{j}}{| \tilde{J} (j) |},

(5)

where $| \tilde{J} (j) |$ denotes the cardinality of equivalence class $\tilde{J} (j)$ . (This formula only holds if the initial probability of all states of a given equivalence class are equivalent. If this condition is not satisfied, the formula will not be valid for all time t > 0.)

Decomposition of independent subsystems

It may be possible to achieve an exact reduction of Eq. 1 beyond what can be achieved through the symmetry reduction technique alone if there exists independence among subsystems of a system. Independence of subsystems occurs when there are no interactions, neither functional nor synchronizing, between disjoint subsets of system components. Let δ ⊆ D be a subsystem with state-space graph given by

S_{δ} = (δ, F_{δ}, L, Δ_{δ}, C, D, f_{i n i t}) .

If D can be partitioned into a set {δ₁, δ₂, …, δ_n} of subsystems with state spaces $\{F_{δ_{1}}, F_{δ_{2}}, ..., F_{δ_{n}}\}$ such that all components in subsystem δ_i are independent from (i.e., have no functional or synchronizing interactions between) all components in subsystem δ_j for j ≠ i, then the solution to the master equation associated with the CTMC generated on D can be expressed exactly as

P = \prod_{i = 1}^{n} P_{i},

(6)

where each P_i is a probability distribution associated with subsystem i. (Note that Eq. 6 holds for both the steady-state and transient solutions provided that initial transients have decayed.) It is straightforward to test for independence of subsystems using the HLM (see Appendix C for a procedure and associated algorithms).

The symmetry reduction technique and the reduction by decomposition of independent subsystems are not mutually exclusive. A general approach to combine the methods is to first identify independent subsystems, and to then apply symmetry reduction during the state-space generation for each subsystem using Algorithm 2. Applying this approach allows for potentially significant reductions in models of biochemical reaction systems a priori, as illustrated by the examples given below.

EXAMPLES

Example 1: Reduction scalability in a simple example

As a first example, we illustrate how our techniques may be used to alleviate the combinatorial state-space explosion problem in a class of models taken from Ref. 19, and we compare the reductions achieved on this system in that study using the method of stochastic fragments with the reductions achieved using the present methods.

This example is similar to the simple example illustrated in Figure 2 in that it consists of a biochemical reaction system involving two types of particles labeled A and B which can bind to (form complexes with) one another. Both types of particles can be in an activated or an inactivated state. Two cases are considered: one in which the binding rate depends on the activation state of the particles and one in which it does not. A high-level model for the first case specified using our framework is illustrated in Figure 7 and Table 3.

Component state-space graph C for Example 1 case #1 (binding rate depends on activation state). Circles represent component states S and arrows represent component transitions Q (multiplicities are not shown but are 1 for all transitions). The system domain D and an initial system state f_init are also shown. Note that all transitions, except for transitions from s₁ ↔ s₂ and from s₇ ↔ s₈, are synchronizing transitions.

Table 3.

Component transitions Q associated with the component state-space graph C shown in Figure 7. The mappings Y_j and Z_j are listed such that the first element in a set X_j maps to the first elements listed in Y_j and Z_j, the second element listed in a set X_j maps to the second elements listed in Y_j and Z_j, etc. Also given in the table are the sets of labels R_j associated with each transition j.

	j	X_j	Y_j	Z_j	R_j
	1	{1}	(2)	(1)	{r₁⁽¹⁾}
	2	{2}	(1)	(1)	{r₁⁽²⁾}
	3	{7}	(8)	(1)	{r₁⁽³⁾}
	4	{8}	(7)	(1)	{r₁⁽⁴⁾}
	5	{1,7}	(3,9)	(1,1)	{r₁⁽⁵⁾}
	6	{3,9}	(1,7)	(1,1)	{r₁⁽⁶⁾}
	7	{1,8}	(5,10)	(1,1)	{r₁⁽⁷⁾}
	8	{5,10}	(1,8)	(1,1)	{r₁⁽⁸⁾}
	9	{2,7}	(4,11)	(1,1)	{r₁⁽⁹⁾}
Q =	10	{4,11}	(2,7)	(1,1)	{r₁⁽¹⁰⁾}
	11	{2,8}	(6,12)	(1,1)	{r₁⁽¹¹⁾}
	12	{6,12}	(2,8)	(1,1)	{r₁⁽¹²⁾}
	13	{3,9}	(4,11)	(1,1)	{r₁⁽¹³⁾}
	14	{4,11}	(3,9)	(1,1)	{r₁⁽¹⁴⁾}
	15	{5,10}	(6,12)	(1,1)	{r₁⁽¹⁵⁾}
	16	{6,12}	(5,10)	(1,1)	{r₁⁽¹⁶⁾}
	17	{3,9}	(5,10)	(1,1)	{r₁⁽¹⁷⁾}
	18	{5,10}	(3,9)	(1,1)	{r₁⁽¹⁸⁾}
	19	{4,11}	(6,12)	(1,1)	{r₁⁽¹⁹⁾}
	20	(6,12)	{4,11}	(1,1)	{r₁⁽²⁰⁾}

Open in a new tab

Expressions were derived for the number of states of such a system for three different levels of abstraction: identified (distinguishable) particles; anonymous (indistinguishable) particles; and anonymous fragments.¹⁹ The number of states assuming identified particles is given by

2^{m} \cdot 2^{n} \cdot \sum_{k = 0}^{\min (n, m)} k! \cdot C_{k}^{m} \cdot C_{k}^{n},

(7)

where m and n are the total number of particles A and particles B, respectively, k is the number AB complexes, and $C_{k}^{m} = m! / ((m - k)! k!)$ . The summation in Equation 7 is over all possible number of complexes.

The first level of abstraction involves de-identifying particles by assuming that any two states that can be mapped to one-another by a reordering of particle indices are equivalent. (Note that this abstraction is only sound if it is assumed that the system is well-mixed.) Assuming anonymous particles, the number of states is

\sum_{k = 0}^{\min (n, m)} (m - k + 1) \cdot (n - k + 1) \cdot C_{3}^{k + 3} .

(8)

This number of states is obtained by applying symmetry reduction to the model illustrated in Figure 7. There are no functional dependencies among the components, and thus the symmetries in this case consist of all possible permutations of the individual particles of which there are m!n!. In the case that the binding rate of particles depends on activation state, it is not possible to achieve a further abstraction using either our symmetry reduction approach or the stochastic fragments approach.¹⁹

In the case that the binding rate does not depend on the activation state of the particles, we can construct a simpler model of the system as shown in Figure 8 and Table 4. In this case, it is possible using the stochastic fragments approach to abstract the system into anonymous fragments, the number of which is given by¹⁹

\sum_{k} ((m - k + 1) \cdot (n - k + 1) \cdot {(k + 1)}^{2} | k \in N) .

(9)

Our algorithm goes beyond this. Again, in this case there are no functional dependencies between system components and the system characteristic graph is disconnected. However, there is more symmetry in this case than in the first case; in this case we have components corresponding to both activation and binding state of the two types of particles, and the symmetries consist of the group of all permutations of components of a given type of which there are (m!)² · (n!)². Applying our symmetry reduction technique to the model yields

(m + 1) (n + 1) (\min (m, n) + 1)

(10)

system states, a substantial reduction compared to the number of fragments given by Eq. 9.

Component state-space graph C for Example 1 case #2 (binding rate is independent from activation state). Circles represent component states S and arrows represent component transitions Q labeled with multiplicities. The system domain D and an initial system state f_init are also shown.

Table 4.

Component transitions Q associated with the component state-space graph C shown in Figure 8. The mappings Y_j and Z_j are listed such that the first element in a set X_j maps to the first elements listed in Y_j and Z_j, the second element listed in a set X_j maps to the second elements listed in Y_j and Z_j, etc. Also given in the table are the sets of labels R_j associated with each transition j.

	j	X_j	Y_j	Z_j	R_j
	1	{1}	(2)	(1)	{r₁⁽¹⁾}
	2	{2}	(1)	(1)	{r₁⁽²⁾}
	3	{3}	(4)	(1)	{r₁⁽³⁾}
Q =	4	{4}	(3)	(1)	{r₁⁽⁴⁾}
	5	{5,7}	(6,8)	(1,1)	{r₁⁽⁵⁾}
	6	{6,8}	(5,7)	(1,1)	{r₁⁽⁶⁾}

Open in a new tab

Since there exists independence among subsets of components in this case, we can go one step further by decomposing the independent subsystems. All components corresponding to activation state are independent from one another. However, the components corresponding to binding state are correlated due to synchronizing interactions among the components and therefore cannot be decomposed. We can construct models for three independent subsystems and use these models to obtain an exact solution to the full model using Eq. 6. The three subsystems consist of: one subsystem describing the activation state of a particle A; one subsystem describing the activation state of a particle B; and one subsystem describing the binding state of the m + n particles in the system. Applying symmetry reduction to each of the three submodels, we can obtain a reduced model with a total state-space of size

\min (m, n) + 5

(11)

for m, n ≥ 1. Thus, by applying both the symmetry reduction and independent subsystem decomposition techniques to the system, we are able to obtain exact solutions for the stochastic dynamics of a system using a model of dimension that scales (sub)-linearly with the number of particles in the system. A summary of the reductions achieved for this system using the method of stochastic fragments and using the methods described here is illustrated in Figure 9.

Reduction scalability in Example 1. The number of states in case #1 assuming identified particles (solid black) and anonymous particles (dashed black), and the number of states in case #2 obtained by applying stochastic fragments (dashed grey) and symmetry reduction techniques with (dotted black) and without (dotted grey) applying independent subsystem decomposition are plotted against particle number n for m = n.

Example 2: Single ion channel

We next consider an example consisting of a cluster of four identical Na⁺ channels arranged in a ring configuration. Suppose that we first wish to develop a single channel model in which the individual channels are composed of one α subunit and two β subunits, each of which may reside in one of the following possible conformations: inactive; permissive; or open. In addition, the β subunits bind to a sodium ion and can also form dimers by binding to one another. We assume that Na⁺ binds to the intracellular side of the channel.

We begin by modeling the β subunit, which we divide into three functional domains: a Na⁺ binding domain with two possible states (unbound and bound); a β subunit binding domain with two possible states (unbound and bound); and a hinge domain with three possible states (inactive, permissive, open). Thus, we require three system components for each β subunit. Because the α subunit does not have any binding domains, it may be modeled using a single component with three possible states (inactive, permissive, and open). The component state-space graph C for the channel may be formulated as shown in Figure 10 and Table 5. We assign the initial system state f_init of the seven system components as (1,1,4,4,6,6,8), i.e., both β subunits and the α subunit in the inactive state and both binding domains of the two β subunits in the unbound state. According to C, there are 10 possible component transitions for the Na⁺ channel, four of which represent synchronizing transitions (i.e., for j = 2, 3, 7, and 8), and five of which represent functional transitions (i.e., for j = 2, 5, 6, 7, and 8).

Component state-space graph C for Na⁺ channel model. Circles represent component states S and arrows represent component transitions Q which are labeled with a multiplicity. The system domain D and an initial system state f_init are also shown.

Table 5.

Component transitions Q associated with the component state-space graph C shown in Figure 10. The mappings Y_j and Z_j are listed such that the first element in a set X_j maps to the first elements listed in Y_j and Z_j, the second element listed in a set X_j maps to the second elements listed in Y_j and Z_j, etc. Also given in the table are the sets of labels R_j associated with each transition j.

	j	X_j	Y_j	Z_j	R_j
	1	{1}	(2)	(1)	{r₁⁽¹⁾}
	2	{2,9}	(3,10)	(2,1)	{r₁⁽²⁾}
	3	{3,10}	(2,9)	(2,1)	{r₁⁽³⁾}
	4	{2}	(1)	(1)	{r₁⁽⁴⁾}
Q =	5	{4}	(5)	(1)	{r₁⁽⁵⁾,r₂⁽⁵⁾}
	6	{5}	(4)	(1)	{r₁⁽⁶⁾,r₂⁽⁶⁾}
	7	{6}	(7)	(2)	{r₁⁽⁷⁾}
	8	{7}	(6)	(2)	{r₁⁽⁸⁾,r₂⁽⁸⁾,r₂⁽⁸⁾}
	9	{8}	(9)	(1)	{r₁⁽⁹⁾}
	10	{9}	(8)	(1)	{r₁⁽¹⁰⁾}

Open in a new tab

The functional transitions require rules for their mapping to transition labels (rates), given in the form of dependency graphs. The dependency graphs are listed in Table 6. The first dependency graph $D (r_{1}^{(2)})$ encodes the rule that the two β subunits must not be associated in order for the channel to open. $D (r_{1}^{(5)})$ and $D (r_{2}^{(5)})$ encode the rule that Na⁺ binds to a β subunit at rate $r_{1}^{(5)}$ if the channel is not open and at rate $r_{2}^{(5)}$ if the channel is open (the biophysical mechanism being that the local Na⁺ concentration is elevated when the channel is open). $D (r_{1}^{(6)})$ and $D (r_{2}^{(6)})$ encode the rule that Na⁺ dissociates with the rate $r_{1}^{(6)}$ if the β subunit binding domain is unbound and with the rate $r_{2}^{(6)}$ if the β subunits are in a dimer configuration (i.e., the affinity for Na⁺ depends on whether the β subunits are in a dimer configuration or not). $D (r_{1}^{(7)})$ encodes the rule that the β subunits can only form dimers when the channel is closed. Finally, $D (r_{1}^{(8)})$ , $D (r_{2}^{(8)})$ , and $D (r_{3}^{(8)})$ encode the rule that the dissociation rate of the β subunit dimer depends on whether Na⁺ is bound to one, both, or none of the β subunits.

Table 6.

Dependency graphs for rate constants of the single channel model of Example 2. The labeling functions L_y are listed such that the first element listed in a tail set B_y maps to the first element listed in L_y, the second element listed in B_y maps to the second element listed in L_y, etc.

	y	A_y	B_y	L_y
$D (r_{1}^{(2)})$	1	{1,2,7}	{5,6}	({6},{6})
$D (r_{1}^{(5)})$	1	{3}	{1,2,7}	({1,2},{1,2},{8,9})
	2	{4}	{1,2,7}	({1,2},{1,2},{8,9})
$D (r_{2}^{(5)})$	1	{3}	{1,2,7}	({3},{3},{10})
	2	{4}	{1,2,7}	({3},{3},{10})
$D (r_{1}^{(6)})$	1	{3}	{5,6}	({6},{6})
	2	{4}	{5,6}	({6},{6})
$D (r_{2}^{(6)})$	1	{3}	{5,6}	({7},{7})
	2	{4}	{5,6}	({7},{7})
$D (r_{1}^{(7)})$	1	{5,6}	{1,2,7}	({1,2},{1,2},{8,9})
$D (r_{1}^{(8)})$	1	{5,6}	{3,4}	({4},{4})
$D (r_{2}^{(8)})$	1	{5,6}	{3,4}	({4},{5})
	2	{5,6}	{3,4}	({5},{4})
$D (r_{3}^{(8)})$	1	{5,6}	{3,4}	({5},{5})

Open in a new tab

VERNAN, our MATLAB implementation of the method, can be used as a convenient platform for specifying and constructing the model described above (codes are available upon request). MATLAB codes for specifying and constructing the model using VERNAN are given in the supplementary material.²⁸ (In addition, a mat-file (Example1.mat) containing the model specifications is included in the VERNAN package.) VERNAN gives the option of generating the full state-space graph (using Algorithm 1) or the reduced state-space graph (using Algorithm 2). Generating the full state-space graph yields a model with 68 system states and 400 system transitions. (Recall that the full system state-space includes only those states that are reachable from the initial state f_init which is why the full state-space size is much less than the product 3² × 2² × 2² × 3 = 432 of the number of states of each component.) On the other hand, generating the reduced state-space graph generates a model with 39 states and 178 transitions, a substantial reduction compared to the full state-space. There are eight symmetries in this example consisting of the group of permutations generated by independently swapping components 1 with 2, 3 with 4, and 5 with 6. Because the characteristic graph of this system is connected (i.e., there are dependencies between each component, and no two subsets of components are independent), it is not possible to achieve a further reduction of this system using the techniques described here.

The solution to the reduced model can be used to compute an exact solution to the full model using a differential equation of order 39, rather than 68 (or 432), which would be required without applying the reduction technique. To illustrate this, we assign random values in the interval (0,10) to 12 of the 14 rates in the model. We assign time-varying sinusoidal functions to the remaining two rates $r_{1}^{(5)}$ and $r_{2}^{(5)}$ such that $r_{1}^{(5)} \mapsto k_{5} + A (1 + \sin (ω t))$ and $r_{2}^{(5)} \mapsto k_{6} + A (1 + \sin (ω t))$ , where A is set to 5 second⁻¹, ω is set to 2π radians/second, and values for k₅ and k₆ are chosen randomly in the interval (0,10) with the constraint that k₆ be greater than k₅.

Next we numerically solve the master equations associated with the full and reduced systems (i.e., equations 1, 3, respectively) using MATLAB's built-in solver ode15s using an initial probability of P_i = 0 for all i such that J(i) ∈ F\f_init and P_i = 1 for the i satisfying J(i) = f_init in the full system, and of ${\tilde{P}}_{i} = 0$ for all i such that $\tilde{J} (i) \in \tilde{F} ∖ [f_{i n i t}]$ and $\tilde{P_{i}} = 1$ for the i satisfying $\tilde{J} (i) = [f_{i n i t}]$ in the reduced system. (MATLAB codes are given in the supplementary material.²⁸) If we are interested in the open probability of the channel, we identify all states f ∈ F and all states $\tilde{f} \in \tilde{F}$ such that the α subunit and two β subunits are in the open conformation. There are four such f in the full model, given by {(3,3,4,4,6,6,10), (3,3,4,5,6,6,10), (3,3,5,4,6,6,10), (3,3,5,5,6,6,10)}, and three such $\tilde{f}$ in the reduced model, given by {(3,3,4,4,6,6,10), (3,3,4,5,6,6,10), (3,3,5,5,6,6,10)}. The macroscopic open probability is then simply the sum of the occupancy probabilities of these three (four) microscopic open states of the reduced (full) model. Plots of the time-evolution of the macroscopic open probability computed using the full and reduced models are shown in Figure 11a, demonstrating the equivalence of the two solutions.

Transient solutions to the full and reduced versions of the single channel model. Solutions are for the system characterized by the set (ordered as listed in Table 3) of randomly chosen rate constants {4.5054, 0.8382, 2.2897, 9.1333, 0.0463+5*(1+sin(2*pi*t)), 7.7595+5*(1+sin(2*pi*t)), 1.5237, 8.2581, 5.3834, 9.9613, 0.7817, 4.4267, 1.0665, 9.6189}. (a) Time-evolution of the open probability of the single channel model computed numerically from the full (dotted line) and reduced (solid line) versions of the model. (b) Time-evolution of the occupancy probability of state f = (3, 3, 5, 4, 6, 6, 10) computed numerically from the full (dotted line) and reduced (solid line) versions of the model, along with the time-evolution of the occupancy probability of the state pattern [f], computed from the reduced model (dashed line).

Suppose now that we are interested in the occupancy probability of the microscopic state (3,3,5,4,6,6,10) and wish to compute this probability using the reduced model. Although this state is not included (i.e., is not a canonical representative) in the reduced system, we observe that this state is a member of the pattern of states represented by state (3,3,4,5,6,6,10) in the reduced system. The two states (3,3,4,5,6,6,10) and (3,3,5,4,6,6,10) together constitute the entire equivalence class represented by state (3,3,4,5,6,6,10). Thus, we can compute the occupancy probability of state (3,3,5,4,6,6,10) in the full system by simply dividing the occupancy probability of the pattern represented by (3,3,4,5,6,6,10) in the reduced system by two. The time-evolution of state (3,3,5,4,6,6,10) computed using the full and reduced system models, along with the time evolution of the state equivalence class represented by state (3,3,4,5,6,6,10), are plotted in Figure 11b.

Example 3: Ion channel complex with fine components

In the next hypothetical example, the Na⁺ channels modeled above are not physiologically observed alone, but rather as a complex of four such channels arranged in a ring configuration with each neighbor spaced equally apart. Whereas there were seven components in the individual channel model, there are now 28 (7 × 4) components in the complex model. We assign the initial state f_init of the complex to be (1,1,1,1,1,1,1,1,4,4,4,4,4,4,4,4,6,6,6,6,6,6,6,6,8,8,8,8); that is, all four channels are in the inactive, unbound, dissociated state.

We assume that the four channels are close enough together that they are coupled through local Na⁺ concentration elevations caused by the opening and closing of the individual Na⁺ channels. Assuming that local Na⁺ concentrations reach a steady-state on a time-scale much faster than Na⁺ channels openings and closings, we can assume that this coupling is instantaneous. These are the same assumptions that are made in models of intracellular calcium release channel (ryanodine receptor) clusters.²⁹ This coupling depends on the spatial orientation of the channels, and because of our assumed orientation of the channels, there exist symmetries that can be exploited when constructing the Na⁺ channel complex model.

The component state-space graph C for the complex model is the same as the individual channel model. However, there are now more rate constants associated with transition Q₅, i.e., the binding of a sodium ion to a β subunit. Recall that in the single channel model, there were two possible binding rates depending on whether the channel was open or closed. In the channel complex model, there are 12 possible binding rates which depend on the open/closed configuration of all four channels in the complex. Consider the case where a sodium ion binds to a β subunit of the channel labeled #1 in Figure 12. The 12 binding rates then depend on the open/closed configurations of all four channels as depicted in the diagram of Figure 12.

The set of possible rate labels for the binding of Na⁺ to the β subunit of the channel labeled #1 in the Na⁺ channel complex, along with the configurations of open and closed channels associated with each rate constant. Each channel is represented by a circle in the figure, and all channels are labeled as shown for $r_{1}^{(5)}$ . Unfilled circles represent a channel in a closed state whereas filled circles represent a channel in an open state.

Another difference is that transition Q₃ is now a functional transition (recall that it was a constant transition in the single channel model). Q₃ is a synchronizing transition involving two β subunits and one α subunit. The requirement that Q₃ be a functional transition in the complex model is a consequence of the fact that multiple channels may be in the open state simultaneously, allowing for Q₃ to occur between multiple combinations of components (whereas in the single channel model there was only one possible combination of components that could participate in Q₃). However, in our model, we assume that Q₃ can only occur between subunits of the same channel, and this requires a functional dependency to restrict the set of possible combinations of components that may be involved in the transition. Note that all synchronizing transitions in the complex model require a functional dependency for this reason, but all other synchronizing transitions were already functional transitions in the single channel model. Thus, it might have been easier to overlook this requirement for Q₃ compared to the others.

Since the system domain D for the complex model is different from to the single channel model, new rules, along with a new set of dependency graphs D, must be specified. Because the dependency graphs are rather complicated, we do not discuss them in detail here. However, a table listing the graphs can be found in the supplementary material.²⁸ In addition, a .mat-file containing all specifications of the ion channel complex model, including the dependency graphs, as well as VERNAN outputs, is included in the VERNAN toolbox package (Example2.mat). Generating the reduced-state space graph using the VERNAN tool generates a model with 304 590 system states and 5 414 760 system transitions, representing an over 70-fold reduction in state-space when compared to the expected state-space size (68⁴ = 21 381 376) of the full model. There are 32 768 symmetries in this example consisting of the group of permutations generated by rotating the ring, flipping the ring, and independently swapping the components corresponding to the β subunits of each channel. It is not possible to achieve a larger reduction, as the characteristic graph of the model is connected. (For this example, it is not possible to generate the full model on a 32-bit system because the sparse transition matrix is too large to store in the available address space.)

Example 4: Ion channel complex with course components

The large number of components in the ion channel complex model can make model specification a rather tedious task. However, there is flexibility in defining system components, which may be made as large or small as desired. In the preceding examples, the components were essentially specified to be as fine as possible (for the most part, constituting elementary reactions). In the case of the ion channel complex, it would have been possible to construct the model using only four components, where each component represents an individual ion channel. By doing this, the dependency graphs would be much smaller and easier to specify. However, the component state-space graph would be more complex. In addition, certain symmetries or independence between subsets of components that may exist in models specified using finer components may not exist in models specified using coarser components. Keeping this in mind, we next demonstrate the use of coarser components, namely the entire single channel model, in specifying the four channel complex model.

First, we must redefine the component state-space graph C as follows: we let the set of component states S now be the set of state equivalence class representatives $\tilde{F}$ of the reduced single-channel model and the set of component transitions Q be the set of system transitions $\tilde{Δ}$ in the reduced single-channel model. All transitions in the new component state-space graph are local transitions. However, all transitions representing a Na⁺ binding event in the single channel model must be made to be functional transitions in the complex model. Next, the rules and associated dependency graphs need to be defined. One difficulty that arises when specifying the rules for the coarser components is that, although there are many more component transitions, each of these transitions involves only one of a much smaller number of elementary reactions so that the number of labels (rates) is much smaller than the number of component transitions. Also, since we are dealing with the reduced single-channel model, a multiplicity is associated with each transition which must be accounted for. To avoid all of these difficulties, this model composition process can be automated using the VERNAN tool. In VERNAN, a function (SystemToComponent.m) is provided which takes the system state-space graph as input, and produces a new component state-space graph C, a set of rules, and a mapping of transitions to a set of labels as output.

Finally, we define the new dependency graphs in D for the rates associated with the transition Q₅. (A .mat-file, Example3.mat, containing the specifications of this model and VERNAN outputs is included in the VERNAN toolbox package). In this case, there are only six graphs that need to be defined, which are shown in Table 7. Although Na⁺ can bind with twelve different rates depending on the open/closed configuration of the four channels in the complex (as illustrated in Figure 12), any given channel only depends on the other channels, and there are only six possible configurations that these other channels can be in. With the finer components, we also had to account for the dependencies between subunits of the same channel, but these dependencies are implicitly accounted for by using the courser components.

Table 7.

Dependency graphs for the rate constants associated with the binding of Na⁺ to a β subunit in the four component Na⁺ sodium channel complex model. Each graph is associated with more than one rate label (due to the fact that multiple component transitions in the four component complex model correspond to a Na⁺ binding event), which is why rate label superscripts are given as asterisks.

	y	A_y	B_y	L_y
$D (r_{1}^{*})$	1	{1}	{2,3,4}	(C,C,C)
	2	{2}	{1,3,4}	(C,C,C)
	3	{3}	{1,2,4}	(C,C,C)
	4	{4}	{1,2,3}	(C,C,C)
$D (r_{2}^{*})$	1	{1}	{2,3,4}	(O,C,C)
	2	{1}	{2,3,4}	(C,C,O)
	3	{2}	{1,3,4}	(O,C,C)
	4	{2}	{1,3,4}	(C,O,C)
	5	{3}	{1,2,4}	(C,O,C)
	6	{3}	{1,2,4}	(C,C,O)
	7	{4}	{1,2,3}	(O,C,C)
	8	{4}	{1,2,3}	(C,C,O)
$D (r_{3}^{*})$	1	{1}	{2,3,4}	(O,C,O)
	2	{2}	{1,3,4}	(O,O,C)
	3	{3}	{1,2,4}	(C,O,O)
	4	{4}	{1,2,3}	(O,C,O)
$D (r_{4}^{*})$	1	{1}	{2,3,4}	(C,O,C)
	2	{2}	{1,3,4}	(C,C,O)
	3	{3}	{1,2,4}	(O,C,C)
	4	{4}	{1,2,3}	(C,O,C)
$D (r_{5}^{*})$	1	{1}	{2,3,4}	(O,O,C)
	2	{1}	{2,3,4}	(C,O,O)
	3	{2}	{1,3,4}	(O,C,O)
	4	{2}	{1,3,4}	(C,O,O)
	5	{3}	{1,2,4}	(O,O,C)
	6	{3}	{1,2,4}	(O,C,O)
	7	{4}	{1,2,3}	(O,O,C)
	8	{4}	{1,2,3}	(C,O,O)
$D (r_{6}^{*})$	1	{1}	{2,3,4}	(O,O,O)
	2	{2}	{1,3,4}	(O,O,O)
	3	{3}	{1,2,4}	(O,O,O)
	4	{4}	{1,2,3}	(O,O,O)

Open in a new tab

With the new set of model specifications tailored for the larger components, we generate the reduced-state space graph of the system using VERNAN, generating a model with 304 590 system states and 5 414 760 system transitions as before. In this example however, there are only eight symmetries rather than 32,768 as in the previous example. This is because the symmetries corresponding to swapping the β components have already been accounted for in constructing the single channel model, and we are left with only those symmetries that can be generated by rotating or flipping the four-channel ring.

If we index the system states of the two models in the same way, we can verify that the transition matrices associated with the two models are identical (see supplementary material²⁸ for MATLAB codes), validating the use of courser components for specifying the model. There was a slight improvement in efficiency in generating the model using courser components. Where it took ∼140 minutes (on a desktop with 8-core 3.4 GHz Intel Xeon processors and 16 GB RAM) to generate the model using the 28 component specification, it took ∼100 minutes using the 4 component specification. This speed up is attributed to the smaller characteristic graph size when using the larger components which greatly improved the efficiency of the canonical graph labeling routine.

Example 5: Mean-field coupled ion channels

Although it is not possible to achieve a further exact reduction of the ion-channel complex model described above using the techniques described here, it is certainly possible to make simplifying assumptions that would allow a further approximate reduction. One such simplifying assumption that is commonly applied to calcium-release site models is the mean-field coupling approximation,²⁹^,³⁰ in which the local calcium concentration in a cluster of channels is assumed to be uniform across all channels so that the concentration at any one channel depends only on the numbers of open/closed channels in the cluster and not any specific spatial configuration of open/closed channels. We can apply this same approximation to our Na⁺ channel complex model (here, we apply it to our 28 component model; model specifications and VERNAN outputs are included in the file Example4.mat). By doing this, the number of different rates at which Na⁺ binds to the β subunit changes from 12 in the spatially coupled model, to five in the mean-field coupled model. The rules by which transition Q₅ maps to these five rates must be defined, requiring us to re-specify our dependency graphs. The dependency graphs for all other rates remain the same.

As we have seen when specifying the spatially- coupled 28 component model, defining the dependency graphs can be a tedious process when a large number of components are involved. Specifying dependency graphs of a mean-field coupled model can be even more daunting, requiring a hyperarc for every possible configuration of open/closed channels. Although this process may be automated by writing a simple script, VERNAN provides the option of defining dependency graphs in a mean-field coupling format (versus the spatially-coupled format which has been used up to now). Using the mean-field coupling format, we only need to specify one or more classes of component states along with the number of components required to occupy each of the classes of states. Dependency graphs in mean-field coupling format for the five rate constants mapping to Q₅ are given in Table 8 for the 28 component model.

Table 8.

Dependency graphs for the model of Example 5, given in VERNAN's mean-field coupling format. In mean-field coupling format, a dependency y is given as a class of states ρ_y, along with the number of components N_y required to occupy the class of states in order to satisfy the dependency. Note that the dependencies labeled y = 2 in the five graphs are redundant and are not required to completely specify the model of Example 5. We include them here for completeness.

	y	ρ_y	N_y
$D (r_{1}^{(5)})$	1	{10}	0
	2	{8,9}	8
$D (r_{2}^{(5)})$	1	{10}	1
	2	{8,9}	6
$D (r_{3}^{(5)})$	1	{10}	2
	2	{8,9}	4
$D (r_{4}^{(5)})$	1	{10}	3
	2	{8,9}	2
$D (r_{5}^{(5)})$	1	{10}	4
	2	{8,9}	0

Open in a new tab

Generating the reduced state-space graph of the newly specified system yields a model with 111 930 system states and 1 897 480 system transitions, an approximately three-fold reduction compared to the spatially-coupled model. In this example, there are 98,304 symmetries consisting of the group of permutations generated by independently swapping the β components of each channel and by rearranging the four-channels (there are 4! possible rearrangements of the channels). This number of states is simply the number of combinations (with repetition) of the four channels on the set of 39 possible channel states in the reduced single channel model, given by the binomial coefficient

|F| = (\begin{matrix} 39 + 4 - 1 \\ 4 \end{matrix}) .

Although we refer to the mean-field coupled model as an approximation, we emphasize that we are referring to an approximation in the mechanism of coupling, not an approximation in the solution of the master equation. Using the system specification of the mean-field coupled model, the 119 930 state model generated using Algorithm 2 is an exact reduction of the 21 381 376 state model that would be generated using Algorithm 1 given sufficient time and memory capacity.

Example 6: Independent ion channels

Next, we assume that each individual channel in the complex is spaced far enough apart that the channels are effectively uncoupled from one another. In this case, the characteristic graph of the system becomes disconnected, and since no synchronizing transitions between channels exist, each channel is independent. (VERNAN gives the option of testing for these conditions.) Then the channel complex model can be decomposed into four equivalent subsystems, each subsystem being the single channel model. In this case, an exact solution to the 21 382 376 state ion channel complex model may be obtained directly from the solution to the 39 state reduced single channel model using Eq. 6.

Example 7: Gene regulatory circuit

As a final example, we illustrate the regulation of a transcription factor by three different promoters labeled promoter A (P_A), promoter B (P_B), and promoter C (P_C). Each promoter binds to the same transcription factor (labeled TF), but with different binding kinetics. Thus, in this case the promoters are distinguishable whereas the transcription factors are not. The component state-space graph C for the channel may be formulated as shown in Figure 13 and Table 9. We assume that the network contains two of each type of promoter and two transcription factors so that the system domain contains eight elements. We assign the initial system state f_init of the eight components as (1,1,3,3,5,5,7,7), i.e., all promoters and transcription factors in the dissociated state. According to C, there are 6 possible component transitions, all of which represent synchronizing transitions. In this example, there are no functional transitions, so there is no need to specify any dependency graphs for the transitions.

Component state-space graph C for Example 7. Circles represent component states S and arrows represent component transitions Q labeled with multiplicities. The system domain D and an initial system state f_init are also shown. Note that all transitions are synchronizing transitions in this sytem.

Table 9.

Component transitions Q associated with the component state-space graph C shown in Figure 13. The mappings Y_j and Z_j are listed such that the first element in a set X_j maps to the first elements listed in Y_j and Z_j, the second element listed in a set X_j maps to the second elements listed in Y_j and Z_j, etc. Also given in the table are the sets of labels R_j associated with each transition j.

	j	X_j	Y_j	Z_j	R_j
	1	{1,7}	(2,8)	(1,1)	{r₁⁽¹⁾}
	2	{2,8}	(1,7)	(1,1)	{r₁⁽²⁾}
	3	{3,7}	(4,8)	(1,1)	{r₁⁽³⁾}
Q =	4	{4,8}	(3,7)	(1,1)	{r₁⁽⁴⁾}
	5	{5,7}	(6,8)	(1,1)	{r₁⁽⁵⁾}
	6	{6,8}	(5,7)	(1,1)	{r₁⁽⁶⁾}

Open in a new tab

Generating the full state-space graph yields a model with 28 system states and 144 system transitions. Generating the reduced state-space graph yields a model with 10 system states and 24 transitions. In this case, there are 16 symmetries consisting of those generated by independently swapping components corresponding to a given type of promoter or the transcription factor. Although the characteristic graph of the system is disconnected, none of the components are independent from one another because each component participates in a synchronizing transition with disjoint components (see Appendix C for discussion about independence). Therefore, it is not possible to achieve a further reduction on this system using the techniques described here.

The reductions achieved for the above examples and computation times for the examples are summarized in Tables 10, 11, respectively.

Table 10.

State-space size and symmetry summary.

Example	Full	Symm	Red	Rel. size (%)
2	68	8	39	57
3	21 381 376	32 768	304 590	1.4
4	21 381 376	8	304 590	1.4
5	21 381 376	98 304	111 930	0.5
6	21 381 376	8	39	<0.01
7	28	16	10	36

Open in a new tab

Table 11.

Computation times (in seconds) for algorithms.

		Algorithm 2
Example	Algorithm 1	Total	Canonizer	Algorithms 3 and 4
2 and 6	0.155	0.171	0.031	0.095
3	NA	8621	5367	0.86
4	NA	6185	680	0.86
5	NA	2637	1562	0.18
7	0.067	0.08	0.01	0.045

Open in a new tab

DISCUSSION

Summary

The system description formalism presented here offers a convenient and intuitive approach for specifying complex state transition systems with many different types of interactions. The technique allows the transition rules of a system to be systematically defined, helping to minimize the potential for modeling errors that could occur using a more direct manual approach. In addition, rule definitions do not require learning complicated language syntax and semantics, as they are easily understood in terms of simple graph theory. The way in which rules are specified allows a natural and automated application to the high-level model of the symmetry reduction technique and reduction by decomposition of independent subsystems to achieve potentially significant exact reductions of master equations associated with the models.

To illustrate how the method may be applied to the modeling of biochemical systems, we have presented a hypothetical Na⁺ channel complex model involving several different types of components and many different interactions. (All models were constructed using VERNAN, our MATLAB implementation of the method (codes are available upon request).) Using the reduction techniques described here, we generated a reduced model with an over 70-fold reduction in state-space when compared to the full (unreduced) version of the model. In general, highly symmetric dependencies among system components (as with the mean-field coupling approximation), or independence between components, results in large state-space reductions.

Improvements and future directions

Although the reduction techniques described here focused on those that can be done a priori using a HLM, further exact reduction may be possible using techniques that operate directly at the state-level model. This further reduction may be possible for two reasons: (1) symmetries or independence between components exist at the state-space level that are non-identifiable at the system-description level; and (2) states exist that are either forward or backward bisimilar, but not both. The first situation can be avoided by careful modeling at the system-description level (see discussion in Appendix C for discussion). The second situation on the other hand cannot be managed at the system-description level and requires a posteriori processing to account for. In this case, the algorithm of Derisavi et al.²⁶ may be applied to compute the coarsest possible aggregation of the generated state-space.

In addition, it is also possible to compliment the exact reduction strategies described here with one or more approximate reduction strategies to achieve additional reduction of a model. Potential avenues for further reduction using approximation strategies include truncation⁷ and probabilistic evaluation⁸^,⁹ techniques and perturbation methods such as time-scale separation,³¹^,³² aggregation of nearly lumpable states,¹⁰ and decomposition of nearly independent subsystems. Probabilistic evaluation techniques such as the finite state projection algorithm of Munsky and Khammash⁹ are particularly attractive as they offer the potential to automatically truncate the state-space of a very large, possibly infinite, state-space to a much smaller projection space of size just large enough to satisfy a user-specified tolerance in the total probability density error. (Although our system description method is restricted to systems with finite numbers of states and state transitions, it is possible to extend the specification method to allow specification of components with infinite state-spaces.) Algorithm 2 could be implemented directly in the finite state projection algorithm to expand the projection space on each iteration.

Finally, there is room for improvement in the efficiency of the reachability analyses described by Algorithms 1 and 2. In particular, Algorithm 2 would benefit from implementing one of the more efficient algorithms described by Junttila³³^,³⁴ for finding the canonical representative of a system state. (Using NAUTY²⁷ to find a canonical labeling of a characteristic graph is a bottleneck in computing the reachable state-space for systems with large characteristic graphs.) Also, computing the multiplicity for transitions with many possible combinations of reactants is costly using Algorithm 2, as it requires finding a canonical representative for the destination state of each combination. A better algorithm for computing the multiplicity function would therefore result in a significant speed up. Additionally, there is much room for improvement in the speed of our VERNAN implementation of these algorithms by incorporating more efficient data structures and search methods.

ACKNOWLEDGMENTS

This work is supported by the National Institute of Health under Grant No. P50-GM094503.

APPENDIX A: CONVERTING A DIRECTED ARC-LABELED HYPERGRAPH TO AN EQUIVALENT DIRECTED VERTEX-LABELED SIMPLE GRAPH

Reaction dependencies, or rules, are specified in the form of directed arc-labeled hypergraphs. These graphs are used to construct a characteristic graph G for a system, as described in the text. In VERNAN, automorphisms of G are identified using the NAUTY algorithm²⁷ which requires that graphs be given in simple vertex-labeled form. Converting G (a hypergraph) to an equivalent simple graph $\tilde{G}$ is done according to the procedure described as follows:

Generate a partition {U₁, U₂, …, U_n} ⊢ U such that all hyperarcs belonging to a given U_i have the same tail and head sets, and all hyperarcs belonging to U_j for j ≠ i have tail and head sets different from those in U_i
For each U_i, define a labeling function μ_i assigning a multiset of labels to each head d ∈ B_i such that:
$μ_{i} (d) = ⊎_{l} (L_{l} (d), r_{k}^{(j)}),$
where L_l is the labeling function and $r_{k}^{(j)}$ is the rate label associated with hyperarc E_l ∈ U_i.
For each U_i containing hyperarcs with tail sets A_i or head sets B_i with cardinality greater than 1:
1. Remove U_i from U,
2. Add a new vertex h to V,
3. Add arcs connecting all d ∈ A_i to h,
4. Add a new vertex v_d for each d ∈ B_i, label each vertex according to μ_i(d), and add arcs connecting h to v_d and connecting v_d to d.
For each U_i containing hyperarcs with tail sets A_i and head sets B_i both with cardinality equal to 1:
1. Remove U_i from U,
2. Add a new vertex labeled according to μ_i(d), and add arcs connecting A_i to the new vertex and connecting the new vertex to B_i.

Applying this conversion procedure to the characteristic graph G shown in Figure 14 panel A produces the equivalent simple graph $\tilde{G}$ shown in Figure 14 panel B. The automorphism group of $\tilde{G}$ restricted to the vertices labeled with numbers is the same as the automorphism group of G.

Conversion of a hypergraph G to an equivalent vertex-colored simple graph $\tilde{G}$ . (a) A hypothetical characteristic graph G. (b) An equivalent simple graph $\tilde{G}$ obtained by applying the procedure described in Appendix A. The additional vertices in the graph labeled with letters are the result of step 3b in the procedure whereas the smaller unlabeled vertices are the result of steps 3d and 4b. The smaller vertices are colored such that vertices assigned to the same multiset of labels by a function μ_i have the same color and those assigned to different multisets of labels have different colors.

APPENDIX B: STATE EQUIVALENCE TESTING AND CANONIZATION METHOD

There are several different ways of testing the equivalence of two states f₁ and f₂. A simple but naïve approach is to generate the full equivalence class [f₁] of state f₁ by applying each element g ∈ G to f₁, and to then test whether any function in [f₁] is equal to f₂. This method works fine if G is of low order but quickly becomes problematic for larger order G. For example, consider the case where all p components of a system are identical symmetrically (that is, all permutations of the p components are symmetries). Then, the group G is the symmetric group of order p!, which can be very large for even relatively small p.

A better approach is to compute the canonical representatives of the states f₁ and f₂, and to then test whether the canonical representatives are the same. Several alternative methods for computing canonical representatives are presented in Refs. ³³ and ³⁴. Each of these methods exploit a structure known as the Schreier-Sims representation of a group, which is a compact representation of a group in the form of a base and strong generating set that allows linear time computations with groups. In our implementation, we apply the method described in section 3 of Ref. 33, which is based on the use of a graph canonizer to find the canonical version of a system state's characteristic graph.

The characteristic graph of a system state f is defined as a graph G_f whose vertex set contains D such that, for all system states f₁ and f₂, the following holds:³³

if g ∈ G maps f₁ to f₂, then there is an isomorphism γ from $G_{f_{1}}$ to $G_{f_{2}}$ such that γ restricted to D equals g, and
if γ is an isomorphism from $G_{f_{1}}$ to $G_{f_{2}}$ , then γ restricted to D belongs to G and maps f₁ to f₂.

Thus, the characteristic graphs $G_{f_{1}}$ and $G_{f_{2}}$ of f₁ and f₂ are isomorphic if and only if f₁ ∼ f₂.

The bulk of the work for finding characteristic graphs of system states is done by constructing the system characteristic graph G. The characteristic graph G_f of a system state f is simply G with vertices corresponding to D labeled according to f(d) for all d ∈ D. Next, a black-box graph canonizer, such as the one provided in the NAUTY tool,²⁷ is applied to find the canonical version of G_f. A function K is a graph canonizer if, for all graphs G₁ and G₂, the following holds:³³

K(G₁) is isomorphic to G₁ and
K(G₁) = K(G₂) if and only if G₁ and G₂ are isomorphic to each other.

By applying the canonizer K to a system state characteristic graph G_f, we obtain the canonical version K(G_f) of the graph along with an isomorphism γ carrying G_f to K(G_f).

Given a permutation group represented in Schreier-Sims form, Junttila describes the concept of compatible permutations and gives an algorithm (Algorithm 2.2 of Ref. 33) for enumerating permutations in the group which are compatible with a place valuation pval (a function of the form $p v a l : D \to N$ ) under a multiset selector (a function from a multiset to a subset of its underlying set of elements such that each element in the subset is associated with a non-zero multiplicity). Junttila proves that, if pval is an injective function, and the multiset selector used is a function multiset selector (a multiset selector with a singleton image set), then one and only one unique permutation is compatible with pval. Junttila further proves that, by letting pval be equal to the isomorphism γ (restricted to D), and by finding the unique permutation g ∈ G compatible with pval under a function multiset selector, then the canonical form of the system state f may be found by applying the inverse of g to f (Theorem 3.1 of Ref. 33). That is, K( f ) = g⁻¹( f ) is the canonical representative of system state f.

Using the canonical representative approach, Algorithm 2 may be improved. Specifically, in line 15, instead of testing whether each destination state f ^′ is isomorphic to any state in $\tilde{F}$ , the canonical version K( f ^′) of f ^′ may be computed and tested for membership in $\tilde{F}$ .

APPENDIX C: TESTING FOR INDEPENDENCE OF SYSTEM COMPONENTS

Independence of subsets of system components implies the absence of both functional and synchronizing interactions between the subsets of components. Determining whether there are any functional interactions between subsets of components can be done at the system description level by examining the characteristic graph G of the system. Similarly, determining whether there are any synchronizing interactions between component subsets can be done by examining the component graph C in conjunction with f_init. A sufficient condition for the existence of independent subsets of components is then given by the following:

the system characteristic graph G is disconnected and
there exists a maximally connected subgraph of G such that none of its components are involved in a synchronizing transition with the components of any other disjoint maximally connected subgraph of G.

Criterion number 1 is straightforward to test using a standard graph search algorithm such as the one given below:

An algorithm for computing the connectivity of a characteristic graph.

\begin{matrix} 1 : Set u n p r o c e s s e d = V \\ 2 : while u n p r o c e s s e d \neq ⊘ do \\ 3 : Select any s o u r c e \in u n p r o c e s s e d \\ 4 : Set π (s o u r c e) = s o u r c e \\ 5 : Set q u e u e = s o u r c e \\ 6 : while q u e u e \neq ⊘ do \\ 7 : Select any v \in q u e u e and set q u e u e = q u e u e ∖ \{v\} \\ 8 : Set u n p r o c e s s e d = u n p r o c e s s e d ∖ \{v\} \\ 9 : for all E_{y} \in \{E_{y} \in U | v \in A_{y} \cup B_{y}\} do \\ 10 : for all d \in A_{y} \cup B_{y} \cap u n p r o c e s s e d ∖ q u e u e do \\ 11 : Set π (d) = s o u r c e \\ 12 : Set q u e u e = q u e u e \cup \{d\} \\ 13 : return π \end{matrix}

Open in a new tab

This algorithm generates a partition {V₁, V₂, …, V_n}⊢V of the vertices v of G into n maximal connected subsets such that the value of π(d) is the same for all d ∈ V_j and different for all d∉V_j. If the partition is non-trivial (i.e., π[V] is not singleton), then criterion number 1 is satisfied.

Criterion number 2 is more difficult to test, requiring that a reachability analysis be performed on f_init similar to Algorithms 1 and 2. However, a stricter test can be done at the system description level, therefore not requiring a traversal of the reachability graph. This involves testing whether a synchronizing transition involving components of disjoint maximal connected subsets even exists (whereas criterion number 2 allows such a transition to exist but requires that components not participate in it). An algorithm for the test is given below:

An algorithm to merge disconnected subgraphs of G based on the existence of synchronizing transitions involving components of the subgraphs.

\begin{matrix} 1 : Set u n p r o c e s s e d = V \\ 2 : while u n p r o c e s s e d \neq ⊘ do \\ 3 : Select any v \in u n p r o c e s s e d and set u n p r o c e s s e d = u n p r o c e s s e d ∖ \{v\} \\ 4 : for all Q_{j} \in {Q_{j} \in Q | \sum_{s \in X_{j}} Z_{j} (s) > 1 \land T (f_{i n i t} (v)) \\ = T (s) \exists s \in X_{j}} do \\ 5 : Select any s \in \{s \in X_{j} | T (f_{i n i t} (v)) = T (s)\} \\ 6 : for all d \in u n p r o c e s s e d do \\ 7 : if π (d) \neq π (v) \land T (f_{i n i t} (d)) = T (s^{'}) \exists s^{'} \in (X_{j}, Z_{j}) ∖ \{s\} then \\ 8 : for all d^{'} \in \{d^{'} \in V | π (d^{'}) = π (d)\} do \\ 9 : Set π (d^{'}) = π (v) \\ 10 : if π [V] ∖ π (v) = ⊘ then goto 11 \\ 11 : return π \end{matrix}

Open in a new tab

In line 7, (X_j, Z_j)\{s} is the multiset difference rather than the usual set difference.

The above is a union-find algorithm that merges subsets of the partition {V₁, V₂, …, V_n}⊢V (given by π) together when synchronizing transitions exist between subsets. If the updated partition is non-trivial (π[V] is non-singleton), then both criteria of the independence condition are satisfied.

The test given by the above algorithm should be sufficient in most cases. Careful modeling, especially of f_init and of Q, can ensure that the stronger test need never be performed. (Note that criterion number 2 can also be verified directly a posteriori by examining the reachability graph $\tilde{S}$ after it is generated. In general however, it is preferred that the full $\tilde{S}$ is never generated when a reduced version exists.)

When specifying a model at the system description level, we account for all possible interactions between system components. However, as already mentioned in the case of synchronizing transitions, there will be no interaction in the model unless the transition actually occurs in the reachability graph. This is also the case for functional transitions. That is, a functional interaction does not exist in a model unless its associated functional transition actually occurs in the reachability graph.

This subtle but important point has implications regarding both criterion number 1 of the independence condition above as well as in the model symmetry. If we are accounting for a functional interaction in our system characteristic graph, and that interaction does not actually exist in the reachability graph, then it is possible that there are symmetries in the model that we are not accounting for. While it is possible to verify the existence of interactions a posteriori by examining the generated reachability graph $\tilde{S}$ , careful modeling at the system description level can render this step unnecessary. For example, if a particular transition in Q requires that four components of type y be in state x to occur, and there are only three such components specified in f_init, then it is obvious that the transition never occurs and should therefore be removed from Q.

When the independence condition is satisfied, the full system can be decomposed into a number of unique orthogonal subsystems by first partitioning D according to π. That is, D is partitioned into {D₁, D₂, …, D_n} such that the value of π(d) is the same for all d ∈ D_j and different for all d ∉ D_j. Next, each subsystem can be characterized as unique or non-unique by testing the following:

whether any other subsystem contains the same number and type of components and
whether its associated subsystem characteristic graph, defined as the induced subgraph of G with vertex set D_j, is isomorphic to any other subsystem's characteristic graph.

State-level models generated for each unique subsystem may be used to obtain an exact solution to the full model as described in the main text.

References

Le Novere N. and Shimizu T. S., Bioinformatics 17(6), 575 (2001). 10.1093/bioinformatics/17.6.575 [DOI] [PubMed] [Google Scholar]
Blinov M. L., Faeder J. R., Goldstein B., and Hlavacek W. S., Bioinformatics 20(17), 3289 (2004). 10.1093/bioinformatics/bth378 [DOI] [PubMed] [Google Scholar]
Danos V. and Laneve C., Theor. Comput. Sci. 325(1), 69 (2004). 10.1016/j.tcs.2004.03.065 [DOI] [Google Scholar]
Lok L. and Brent R., Nat. Biotechnol. 23(1), 131 (2005). 10.1038/nbt1054 [DOI] [PubMed] [Google Scholar]
Gillespie D. T., Annual Review of Physical Chemistry (Annual Reviews, Palo Alto, 2007), Vol. 58, pp. 35. [DOI] [PubMed] [Google Scholar]
Miner A. and Parker D., Validation of Stochastic Systems: A Guide to Current Research (Springer-Verlag, Berlin, 2004), Vol. 2925, pp. 296. [Google Scholar]
Haverkort B., Perform. Eval. 18, 61 (1993). 10.1016/0166-5316(93)90027-R [DOI] [Google Scholar]
Haverkort B., Comput. J. 38(7), 521 (1995). 10.1093/comjnl/38.7.521 [DOI] [Google Scholar]
Munsky B. and Khammash M., J. Chem. Phys. 124(4), 13 (2006). 10.1063/1.2145882 [DOI] [PubMed] [Google Scholar]
Buchholz P., J. Appl. Probab. 31(1), 59 (1994). 10.2307/3215235 [DOI] [Google Scholar]
Keener J., J. Math. Biol. 58(3), 447 (2009). 10.1007/s00285-008-0199-6 [DOI] [PubMed] [Google Scholar]
Sereno M., Comput. J. 38(7), 622 (1995). 10.1093/comjnl/38.7.622 [DOI] [Google Scholar]
Haverkort B. and Trivedi K., Discrete Event Dyn. Syst. 3(3), 219 (1993). 10.1007/BF01439850 [DOI] [Google Scholar]
Goss P. and Peccoud J., Proc. Ntl. Acad. Sci. 95(12), 6750 (1998). 10.1073/pnas.95.12.6750 [DOI] [PMC free article] [PubMed] [Google Scholar]
Donatelli S., in Proceedings of the 15th International Conference on Application and Theory of Petri Nets, Lecture Notes in Computer Science (Springer-Verlag; Berlin, 1994), Vol. 815, pp. 258.
Devours D. D., Clark G., Courtney T., Daly D., Derisavi S., Doyle J. M., Sanders W. H., and Webster P. G., IEEE Trans. Software Eng. 28(10), 256 (2002). 10.1109/32.991320 [DOI] [Google Scholar]
Lamprecht R., Smith G. D., and Kemper P., Nat. Comput. 10(3), 1045 (2011). 10.1007/s11047-009-9143-y [DOI] [Google Scholar]
Feret J., Danos V., Krivine J., Harmer R., and Fontana W., Proc. Ntl. Acad. Sci. 106(16), 6453–6458 (2009). 10.1073/pnas.0809908106 [DOI] [PMC free article] [PubMed] [Google Scholar]
Feret J., Koeppl H., and Petrov T., “Stochastic fragments: A framework for the exact reduction of the stochastic semantics of rule-based models,” Int. J. Software Inf. (in press), available online at http://infoscience.epfl.ch/record/142570. [Google Scholar]
Feret J., Henzinger T., Koeppl H., and Petrov T., Theor. Comput. Sci. 431(4), 137 (2012). 10.1016/j.tcs.2011.12.059 [DOI] [Google Scholar]
W.Douglas ObalII, McQuinn M. G., and Sanders W. H., IEEE Trans. Reliab. 56(4), 643 (2007). 10.1109/TR.2007.909761 [DOI] [Google Scholar]
Junttila T., Ph.D. dissertation, Helsinki University of Technology, 2003. [Google Scholar]
Sproston J. and Donatelli S., IEEE Trans. Software Eng. 32(8), 531 (2006). 10.1109/TSE.2006.74 [DOI] [Google Scholar]
Kemeny J. G. and Snell J. L., Finite Markov Chains (Van Nostrand, Princeton, NJ, 1960). [Google Scholar]
Schweitzer P. J., In Proceedings of the International Workshop on Computer Performance and Reliability (North-Holland, Amsterdam, The Netherlands, 1984), pp. 275.
Derisavi S., Hermanns H., and Sanders W. H., Inf. Process. Lett. 87(6), 309 (2003). 10.1016/S0020-0190(03)00343-0 [DOI] [Google Scholar]
McKay B. D., NAUTY, version 2.4, June 2010. Downloaded from the internet at http://cs.anu.edu.au/~bdm/nauty/.
See supplementary material at http://dx.doi.org/10.1063/1.4758074 for MATLAB codes and tables for examples.
Nguyen V., Mathias R., and Smith G. D., Bull. Math. Biol. 67(3), 393 (2005). 10.1016/j.bulm.2004.08.010 [DOI] [PubMed] [Google Scholar]
Greenstein J. L., Hinch R., and Winslow R. L., Biophys. J. 90(1), 77 (2006). 10.1529/biophysj.105.065169 [DOI] [PMC free article] [PubMed] [Google Scholar]
Gomez-Uribe C. A., Verghese G. C., and Tzafriri A. R., J. Chem. Phys. 129(24), 244112 (2008). 10.1063/1.3050350 [DOI] [PMC free article] [PubMed] [Google Scholar]
Peles S., Munsky B., and Khammash M., J. Chem. Phys. 125(20), 13 (2006). 10.1063/1.2397685 [DOI] [PubMed] [Google Scholar]
Juntilla T., Research Report A75 (Helsinki University of Technology, Laboratory for Theoretical Computer Science, Espoo, Finland, 2002). [Google Scholar]
Juntilla T., Applications and Theory of Petri Nets, Lecture Notes in Computer Science (Springer-Verlag, Berlin, 2004), Vol. 3099, pp. 258. [Google Scholar]

[c1] Le Novere N. and Shimizu T. S., Bioinformatics 17(6), 575 (2001). 10.1093/bioinformatics/17.6.575 [DOI] [PubMed] [Google Scholar]

[c2] Blinov M. L., Faeder J. R., Goldstein B., and Hlavacek W. S., Bioinformatics 20(17), 3289 (2004). 10.1093/bioinformatics/bth378 [DOI] [PubMed] [Google Scholar]

[c3] Danos V. and Laneve C., Theor. Comput. Sci. 325(1), 69 (2004). 10.1016/j.tcs.2004.03.065 [DOI] [Google Scholar]

[c4] Lok L. and Brent R., Nat. Biotechnol. 23(1), 131 (2005). 10.1038/nbt1054 [DOI] [PubMed] [Google Scholar]

[c5] Gillespie D. T., Annual Review of Physical Chemistry (Annual Reviews, Palo Alto, 2007), Vol. 58, pp. 35. [DOI] [PubMed] [Google Scholar]

[c6] Miner A. and Parker D., Validation of Stochastic Systems: A Guide to Current Research (Springer-Verlag, Berlin, 2004), Vol. 2925, pp. 296. [Google Scholar]

[c7] Haverkort B., Perform. Eval. 18, 61 (1993). 10.1016/0166-5316(93)90027-R [DOI] [Google Scholar]

[c8] Haverkort B., Comput. J. 38(7), 521 (1995). 10.1093/comjnl/38.7.521 [DOI] [Google Scholar]

[c9] Munsky B. and Khammash M., J. Chem. Phys. 124(4), 13 (2006). 10.1063/1.2145882 [DOI] [PubMed] [Google Scholar]

[c10] Buchholz P., J. Appl. Probab. 31(1), 59 (1994). 10.2307/3215235 [DOI] [Google Scholar]

[c11] Keener J., J. Math. Biol. 58(3), 447 (2009). 10.1007/s00285-008-0199-6 [DOI] [PubMed] [Google Scholar]

[c12] Sereno M., Comput. J. 38(7), 622 (1995). 10.1093/comjnl/38.7.622 [DOI] [Google Scholar]

[c13] Haverkort B. and Trivedi K., Discrete Event Dyn. Syst. 3(3), 219 (1993). 10.1007/BF01439850 [DOI] [Google Scholar]

[c14] Goss P. and Peccoud J., Proc. Ntl. Acad. Sci. 95(12), 6750 (1998). 10.1073/pnas.95.12.6750 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c15] Donatelli S., in Proceedings of the 15th International Conference on Application and Theory of Petri Nets, Lecture Notes in Computer Science (Springer-Verlag; Berlin, 1994), Vol. 815, pp. 258.

[c16] Devours D. D., Clark G., Courtney T., Daly D., Derisavi S., Doyle J. M., Sanders W. H., and Webster P. G., IEEE Trans. Software Eng. 28(10), 256 (2002). 10.1109/32.991320 [DOI] [Google Scholar]

[c17] Lamprecht R., Smith G. D., and Kemper P., Nat. Comput. 10(3), 1045 (2011). 10.1007/s11047-009-9143-y [DOI] [Google Scholar]

[c18] Feret J., Danos V., Krivine J., Harmer R., and Fontana W., Proc. Ntl. Acad. Sci. 106(16), 6453–6458 (2009). 10.1073/pnas.0809908106 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c19] Feret J., Koeppl H., and Petrov T., “Stochastic fragments: A framework for the exact reduction of the stochastic semantics of rule-based models,” Int. J. Software Inf. (in press), available online at http://infoscience.epfl.ch/record/142570. [Google Scholar]

[c20] Feret J., Henzinger T., Koeppl H., and Petrov T., Theor. Comput. Sci. 431(4), 137 (2012). 10.1016/j.tcs.2011.12.059 [DOI] [Google Scholar]

[c21] W.Douglas ObalII, McQuinn M. G., and Sanders W. H., IEEE Trans. Reliab. 56(4), 643 (2007). 10.1109/TR.2007.909761 [DOI] [Google Scholar]

[c22] Junttila T., Ph.D. dissertation, Helsinki University of Technology, 2003. [Google Scholar]

[c23] Sproston J. and Donatelli S., IEEE Trans. Software Eng. 32(8), 531 (2006). 10.1109/TSE.2006.74 [DOI] [Google Scholar]

[c24] Kemeny J. G. and Snell J. L., Finite Markov Chains (Van Nostrand, Princeton, NJ, 1960). [Google Scholar]

[c25] Schweitzer P. J., In Proceedings of the International Workshop on Computer Performance and Reliability (North-Holland, Amsterdam, The Netherlands, 1984), pp. 275.

[c26] Derisavi S., Hermanns H., and Sanders W. H., Inf. Process. Lett. 87(6), 309 (2003). 10.1016/S0020-0190(03)00343-0 [DOI] [Google Scholar]

[c27] McKay B. D., NAUTY, version 2.4, June 2010. Downloaded from the internet at http://cs.anu.edu.au/~bdm/nauty/.

[c28] See supplementary material at http://dx.doi.org/10.1063/1.4758074 for MATLAB codes and tables for examples.

[c29] Nguyen V., Mathias R., and Smith G. D., Bull. Math. Biol. 67(3), 393 (2005). 10.1016/j.bulm.2004.08.010 [DOI] [PubMed] [Google Scholar]

[c30] Greenstein J. L., Hinch R., and Winslow R. L., Biophys. J. 90(1), 77 (2006). 10.1529/biophysj.105.065169 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c31] Gomez-Uribe C. A., Verghese G. C., and Tzafriri A. R., J. Chem. Phys. 129(24), 244112 (2008). 10.1063/1.3050350 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c32] Peles S., Munsky B., and Khammash M., J. Chem. Phys. 125(20), 13 (2006). 10.1063/1.2397685 [DOI] [PubMed] [Google Scholar]

[c33] Juntilla T., Research Report A75 (Helsinki University of Technology, Laboratory for Theoretical Computer Science, Espoo, Finland, 2002). [Google Scholar]

[c34] Juntilla T., Applications and Theory of Petri Nets, Lecture Notes in Computer Science (Springer-Verlag, Berlin, 2004), Vol. 3099, pp. 258. [Google Scholar]

PERMALINK

Specification, construction, and exact reduction of state transition system models of biochemical processes

Scott M Bugenhagen

Daniel A Beard

Abstract

INTRODUCTION

RELATED METHODS

SYSTEM SPECIFICATION AND GENERATION

State-level model

Figure 1.

High-level model

Component state space graph

System domain

Dependency graph

Generating the state-level model

Simple example

Figure 2.

Table 1.

Figure 3.

Table 2.

Figure 4.

EXACT REDUCTION METHODS

Bisimilarity

Figure 5.

Symmetry reduction method

Figure 6.

Decomposition of independent subsystems

EXAMPLES

Example 1: Reduction scalability in a simple example

Figure 7.

Table 3.

Figure 8.

Table 4.

Figure 9.

Example 2: Single ion channel

Figure 10.

Table 5.

Table 6.

Figure 11.

Example 3: Ion channel complex with fine components

Figure 12.

Example 4: Ion channel complex with course components

Table 7.

Example 5: Mean-field coupled ion channels

Table 8.

Example 6: Independent ion channels

Example 7: Gene regulatory circuit

Figure 13.

Table 9.

Table 10.

Table 11.

DISCUSSION

Summary

Improvements and future directions

ACKNOWLEDGMENTS

APPENDIX A: CONVERTING A DIRECTED ARC-LABELED HYPERGRAPH TO AN EQUIVALENT DIRECTED VERTEX-LABELED SIMPLE GRAPH

Figure 14.

APPENDIX B: STATE EQUIVALENCE TESTING AND CANONIZATION METHOD

APPENDIX C: TESTING FOR INDEPENDENCE OF SYSTEM COMPONENTS

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases