Abstract
In pharmacology and systems biology, it is a fundamental problem to determine how biological systems change their dose-response behavior upon perturbations. In particular, it is unclear how topologies, reactions, and parameters (differentially) affect the dose response. Because parameters are often unknown, systematic approaches should directly relate network structure and function. Here, we outline a procedure to compare general non-monotone dose-response curves and subsequently develop a comprehensive theory for differential dose responses of biochemical networks captured by non-equilibrium steady-state linear framework models. Although these models are amenable to analytical derivations of non-equilibrium steady states in principle, their size frequently increases (super) exponentially with model size. We extract general principles of differential responses based on a model’s graph structure and thereby alleviate the combinatorial explosion. This allows us, for example, to determine reactions that affect differential responses, to identify classes of networks with equivalent differential, and to reject hypothetical models reliably without needing to know parameter values. We exemplify such applications for models of insulin signaling.
Introduction
Dose-response curves are a classical tool for relating the dose of a biochemically active agent, such as a ligand, enzyme, drug, or toxicant, to its biochemical (e.g., receptor activation in a cell), physiological (concentration of a chemical in a body compartment), or even population-level (mortality) effect (1). Experimentally obtained dose-response curves frequently associate a given dose with its time-independent, steady-state effect and they often have a sigmoid shape when the dose is plotted in log scale (see Fig. 1). Their standard analysis aims to retrieve important characteristics such as the baseline and maximal responses as well as the dose that produces a response halfway between baseline and maximum as a measure of the agent’s potency, alternatively denoted as effective concentration , inhibitory concentration , or infectious dose . Such measures allow one to analyze relative differences between a reference and a perturbed dose-response curve, which we call a “differential response” (or “differential” for short). One can, for example, compare how dose affects different system responses to find an optimal trade-off between therapeutic efficiency and toxicity via the therapeutic index (2). Natural systems also exploit differential responses. For example, insulin receptors recognize various natural ligands, such as insulin and insulin-like growth factor 1 (IGF-1), that have different binding affinities to the receptor to trigger appropriate differential (metabolic or mitogenic) responses (3).
More generally, it is often of interest to understand how perturbations such as mutations, drugs, or natural variations affect dose-response relations. Shifts in quantities such as the easily capture the differential between sigmoid dose-response curves. However, biphasic dose-response curves with low-dose stimulation and high-dose inhibition, so-called hormetic curves, have received renewed attention (4, 5, 6), and non-monotonic dose-response curves with several peaks have been experimentally obtained (7). A recent study of drug responses in cancer cells illustrates that “non-standard” dose-response relations occur in substantial numbers: among 11,650 experimental dose-response curves, 28% were best described by a non-monotonic model, and 12% required two inflection points (6). However, it is unclear how to compare dose-response curves of (potentially) any shape with each other (see Fig. 1).
Like any perturbation experiment, differential analysis can probe the functioning of a biological system to identify the underlying mechanisms of the system’s (steady-state) behavior. However, relating the observed effects to biochemical mechanisms is a key open challenge. Mathematical models in dose-response analysis are frequently empirical (for data interpolation to estimate the characteristics described above), and not mechanistic in the sense of incorporating (hypothesized) relationships between biochemical entities that give rise to the experimental data. There exists no comprehensive mechanistic model-based theory to describe differential responses, despite their fundamental nature and importance in all fields of biology. This is not surprising: many relevant biological network models are non-linear and cannot be analyzed symbolically for non-trivial network sizes to derive design principles. Thus, the characterization of differential responses has to resort to numerical simulations that depend on often unknown network structures and parameter values. Current parameter-free methods such as chemical reaction network theory, which uses the network (model) structure to obtain a qualitative understanding of biological systems by determining their capabilities in terms of number of steady states (8, 9), are not directly applicable to differential responses, because we are interested in quantitative features (e.g., shifts in ).
To tackle such limitations, and to extract general principles of biological systems, we concentrate on models in the graph-theoretical “linear framework” introduced in (10), which we call “linear framework models”. These models represent linear molecular-state transitions but can also explicitly account for certain non-linear kinetics (10). They can be interpreted as deterministic (linear ordinary differential equation (ODE), or linear algebraic for the steady state) or stochastic (master equations of Markov processes) models of biochemical systems (11). The “linear framework” has provided multiple promising applications (12, 13, 14) and it unifies results across many biological areas such as enzyme kinetics, G-protein-coupled receptors, and gene regulation (10). Importantly, the framework connects the symbolic derivation of steady-state expressions to combinatorial objects on graphs (10). This relation between network function and structure makes the framework appealing to analyze principles of differential responses.
Linear framework models result from demonstrated or, more frequently, assumed timescale separation, where a part of a biochemical system reaches a steady state because it operates much faster than the rest of the system (15). Dose-response analysis with mechanistic models, for instance in pharmacology, often assumes that the system is at thermodynamic equilibrium (1). There, the principle of detailed balance imposes structural (all reactions must be microscopically reversible) and parametric (the so-called “cycle condition”) constraints on linear framework models. It leads to simple “history-independent” equilibrium derivations that contain only products of equilibrium constants along any path in the model graph (10). However, for systems that dissipate energy, such as cell signaling pathways and eukaryotic gene regulation, this formalism does not apply. Energy dissipation keeps systems away from equilibrium, thus abolishing detailed balance. As a result, non-equilibrium steady states become “history dependent” and algebraically substantially more complex (14). A paramount challenge in working with non-equilibrium steady states of linear framework models symbolically is their frequently (super)exponentially growing size with model size—making even apparently “small” systems practically intractable.
Here, we develop a comprehensive theory and practically scalable computational methods for studying non-equilibrium steady-state differential dose-response relationships to pinpoint the mechanisms leading to experimentally observed behaviors. We extend the classic comparison of sigmoid dose-response curves and formally define a general notion of the differential. We then exploit connectivity properties of directed graphs representing linear framework models to address challenges such as determining the reactions that affect differential responses, identifying equivalence classes of networks according to their differential, and reliably rejecting hypothetical models without needing to know parameter values. Specifically, the theory helps determine which reactions take part in the differential and how perturbations such as variation of parameter values, deletions and additions of states and reactions affect the differential. In our approach, realistic practical applications are possible, because we do not actually need to derive complete steady-state expressions for quantitative and qualitative symbolic analysis. Our computationally efficient graph algorithms (16; unpublished data) yield compact, factorized steady-state expressions. We illustrate the application of the framework for insulin signaling, covering aspects such as model building and analysis, model rejection, experimental design, and (numerical) bounds on differential dose-response relations.
Materials and Methods
Linear framework models
We focus on the deterministic interpretation of linear framework models, that is, on biochemical reaction networks governed by Laplacian dynamics and modeled by systems of linear ODEs. Consider the example network in Fig. 2 A, in which all reactions follow first-order mass-action kinetics. It comprises three species and four reactions: a receptor, R, can transition to (from) its ligand-bound state, , with rate constant , or the receptor can become irreversibly phosphorylated as with rate constant , and can transition to R with rate constant . This system is a simple model of insulin receptor activation, and its dynamics can be expressed as the ODE system
(1) |
where x denotes the concentration of the respective species in the subscript and is called the Laplacian matrix of the system.
The dynamics of linear framework models can be represented as a diffusion process on directed graphs corresponding to their reaction schemes. Formally, a simple directed graph, , consists of a set of vertices, , and a set of edges (ordered pairs of distinct vertices), ; it has no self loops and no multiple parallel edges. For conciseness, we use “graph” as shorthand for “directed graph” throughout. Here, we concentrate on linear framework models of biological networks corresponding to “strongly connected” graphs, that is, graphs G in which, for any two distinct vertices , there exists a directed path from u to v and from v to u. For example, the graph in Fig. 2 A is strongly connected (whereas the graph in Fig. 2 D is not). Strong connectivity is not a restrictive assumption for our theory, but rather mirrors the cell signaling models we consider. Note that non-strongly-connected graphs could emerge during the analysis procedures, but these are not linear framework models.
We consider the reaction scheme in Fig. 2 A as a labeled graph, G, with , where v denotes the vertex corresponding to the species in the subscript, and . The reaction rate constants are labels of the corresponding edges in G. In such a labeled graph G, we can associate each vertex, , to a non-negative species concentration and each edge to a mass-action reaction. Hence, we obtain a dynamical system in which species concentrations hosted on the vertices of G flow in the direction of the edges at rates proportional to the concentrations on the edges’ source vertices.
Proportionality of reaction rates is defined by edge labels, which have units of inverse time. We denote the label of an edge by . For example, . Edge labels can also host complex algebraic expressions of species concentrations and kinetic parameters. They can exactly account for non-linearities, for example, by containing concentrations of slow species resulting from timescale separation, provided that the uncoupling condition holds that prohibits concentrations corresponding to species from in the edge labels (10). Note that here we mostly take the labels as uninterpreted symbols, that is, we ignore the algebraic expressions to which they correspond and regard them as unique edge names. The only exceptions are edges affected by the dose (of a ligand), because we need explicit dose dependencies to determine differentials.
Linear framework models can be “closed,” not exchanging matter with the environment (as in the illustrative example), or “open,” when synthesis and degradation reactions are present. In general, the dynamics of closed models can be expressed in the form:
(2) |
where is the vector of species concentrations corresponding to each vertex and is the Laplacian matrix of G, defined as
(3) |
and when . In closed models, the total amount of material is conserved and there is a single conservation law, . The system reaches a unique stable steady state that can be derived symbolically for any species from and the kernel of the Laplacian matrix. Analogous definitions for open models can be found in the Supporting Material, and for more details, proofs, and derivations, see (10, 11, 17).
Spanning trees
Non-equilibrium steady states of linear framework models can always be derived in symbolic form, but, in practice, the length of the symbolic steady-state expressions grows (super) exponentially with the size of graph G. To cope with this growth, we introduce concepts intimately connected to both the structure of linear framework models and their steady states. Namely, a certain class of subgraphs, so-called “rooted directed spanning trees,” can be used to generate Kirchhoff polynomials (see next section).
Formally, a graph H is a subgraph of a graph G if and , where every edge in G between vertices in H is also an edge in H. A strongly connected component (SCC) of G is any largest (w.r.t. vertex inclusion) strongly connected induced subgraph of G. It can be shown that no two distinct SCCs can share a vertex, and thus, the SCCs of a graph G induce a unique partition, of . Additionally, for two distinct SCCs and , there can be a directed path from to , or from to , but not both. The existence of such paths between SCCs naturally induces a unique partial order on the SCCs .
A rooted directed spanning tree (spanning tree, for short) A is a subgraph of G spanning its vertex set such that a root vertex is reachable from all vertices along a unique directed path. By we denote the set of all spanning trees of G, and by the set of all spanning trees rooted at vertex v. All spanning trees of the example graph are shown in Fig. 2 B: two spanning trees are rooted at , one is rooted at , and another is rooted at . Let be the graph rooting operation such that is the graph constructed from G by removing all edges outgoing from v (see Fig. 2 D). All spanning trees of are necessarily rooted at v. We will say that a graph G is rooted at a vertex v if v has no outgoing edges and v is reachable from every other vertex in G. Observe that . Note also that a spanning tree of G exists iff the partial order of the SCCs has exactly one maximal element (no other SCC is reachable from a maximal SCC). Such an SCC is called a terminal SCC.
Kirchhoff polynomials and steady states of linear framework models
A spanning tree A of a graph G with n vertices has edges, (for a concise notation, we denote edges with the symbol e when not referring to the pairs of vertices defining them). A spanning tree can also be represented as a monomial in the edge labels , when the edge labels are taken as uninterpreted symbols denoting unique edge names. Correspondingly, one can represent the set of all spanning trees in G by a homogeneous multivariate polynomial over the variables , . This polynomial is called the Kirchhoff polynomial, :
The Kirchhoff polynomial of the example graph is shown in Fig. 2 C. Observe that if G is disconnected or has more than one terminal SCC, then , and if G consists of a single vertex, then .
Kirchhoff polynomials establish a direct connection between model structure, in terms of spanning trees, and function, in terms of steady-state expressions. Briefly, the steady state of a linear framework model can be symbolically obtained from initial conditions and the kernel of the Laplacian matrix by employing Tutte’s Matrix-Tree Theorem (18), which links the minors of the Laplacian matrix to the spanning trees in the model’s graph and their representation as Kirchhoff polynomials (see (10, 11, 17) for details and proofs).
Here, we are interested in the final results, namely that the steady-state concentration of species i in a closed system associated to a vertex can be expressed as a fraction of Kirchhoff polynomials:
(4) |
where we denote the Kirchhoff polynomial of all spanning trees rooted at vertex v by is a shorter notation for . Thus, the steady-state concentration of species associated to vertex in our example system is (see Fig. 2, B–D)
Although these expressions look simple, the number of spanning trees in a graph G often grows (super) exponentially with the size of G (19). Symbolic steady-state expressions of linear framework models as expressed by Kirchhoff polynomials, therefore, face the problem of combinatorial explosion, which makes manipulation and generation of such expressions challenging.
Manipulation of Kirchhoff polynomials
The manipulation of combinatorially complex Kirchhoff polynomials is facilitated by establishing a relation between procedures for their algebraic simplification and operations on their corresponding graphs (unpublished data). Graph operations such as edge deletion-contraction and prime decomposition (16) allow a Kirchhoff polynomial to be written as a sum and a product, respectively, of other Kirchhoff polynomials without the need of explicit Kirchhoff polynomial generation.
For a graph G with and we denote edge deletion by , i.e., the graph obtained from G by deleting e (see Fig. 2 E for an application to the example graph). Further, for a graph G and an edge , we denote by the edge-contracted graph that is constructed from G by 1) removing the edge , if it exists, and all outgoing edges from , i.e., , and 2) fusing vertices and into a single new vertex, w (see Fig. 2 F). Edge contractions may yield graphs with multiple parallel edges between two vertices. We resolve this by replacing m multiple parallel directed edges going from u to v with a single edge such that .
The classic deletion-contraction formula for an edge partitions into two sets, one in which e participates in no spanning trees and one in which e participates in all spanning trees. Equivalently, it decomposes into a sum of Kirchhoff polynomials (20):
(5) |
We call a Kirchhoff polynomial, P, a “factor” of another Kirchhoff polynomial, Q, if there exists a Kirchhoff polynomial R such that . A Kirchhoff polynomial P that cannot be factorized into non-trivial factors is called “prime”. We extend these definitions to graphs by calling a component (a prime component) of G if is a factor (a prime factor) of . The work in (16) provides graph decomposition rules that correspond to factorization steps of the Kirchhoff polynomial and also gives necessary and sufficient primality conditions of the resulting factors expressed by connectivity properties of the corresponding decomposed components. In particular, the exhaustive application of two decomposition rules to a graph G yields in linear time graphs whose Kirchhoff polynomials are prime factors of the Kirchhoff polynomial of the original G:
where are the prime components of G. A prime component can be either 1) strongly connected or 2) rooted at v such that is strongly connected and does not have any non-trivial vertex dominators (using the definition that a vertex u dominates a vertex w if every path from w to v goes through u). Note that the prime factorization is conditional on label uniqueness—when the labels are not unique or contain expressions, the factorization is not guaranteed to be prime. This is why we regard edge labels as unique uninterpreted symbols when defining Kirchhoff polynomials on linear framework models. Further, we call an edge a “prime bridge” if the edge deleted graph has more non-trivial prime components than the original graph G. To efficiently manipulate and generate Kirchhoff polynomials we used the Python package KirchPy (unpublished data).
Results
Formalizing differential responses
Differentials describe how a perturbation transforms a reference dose-response curve. For monotone curves, we can simply quantify this transformation by the difference between points on the reference and the perturbed curve with identical percentage of response between baseline and maximum, such as the distance between values. However, there exists no established approach for comparing dose-response curves of different functional form (sigmoid or multiphasic). Comparisons are particularly ambiguous when they involve non-monotone curves (see Fig. 1), but they are essential, because non-monotone curves have been experimentally observed and receive increasing attention (4, 7).
To provide a general formal procedure for comparison, let and be the functions generating the reference and the perturbed dose-response curves, respectively. Here and in the following, the superscripts α and β denote the specific dissimilar (parametric and structural) features of the systems generating the two curves. Without explicitly specifying α or β, we refer to both identifiers at the same time. To quantify the differential, we propose the following procedure (see the Supporting Material for details):
-
1)
Subdivide the curves into monotone segments. To obtain monotone segments, we subdivide the dose-response curves along the dose coordinate at their critical points, that is, at the doses for which ’s first derivative is zero.
-
2)
Decide which segments to compare. If the curves have equal numbers of segments, it is reasonable to assume that the perturbation shifts and scales the segments such that they can be compared in the order defined by the critical points: the first segment of maps to the first segment in , and so on. When the perturbed curve has more or fewer segments than the reference curve, and without information about which segments fuse or split, we compare the segments in all possible ways while preserving their order. For example, comparing a sigmoid and a biphasic curve amounts to mapping the sigmoid to each of the two monotone segments of the biphasic curve.
-
3)
Determine corresponding points in compared pairs of segments. As for monotone dose-response curves, we relate the doses that have the same proportion h, , of response between the minimal and maximal response in each pair of segments.
-
4)
Quantify the displacement of corresponding points. We quantify the displacement between points with identical proportion of response h for all pairs of mapped segments. In particular, we calculate the dose differential, , and the response differential, , as the difference in the dose and the response components of corresponding points:
with doses d and responses R. The dose differential is expressed in log scale to easily identify fold differences. These quantities have clear biological interpretations. For example, for corresponding segments, is the difference between log values and is the difference between maximal responses.
Differentials cannot be obtained in closed form in general, but we will show next that one can often determine differentials analytically when functional relations between dose and response originate from steady-state expressions of linear framework models.
Differential systems
To derive general expressions for the dose and response differentials, we need to define dose, response, and perturbation in linear framework models (see Materials and Methods for formal concepts), leading to the notion of differential linear framework models.
Here, we assume that the dose variable affects one or more reactions proportionally, which can be interpreted as an input changing the rate constant gradually, or as an input species with constant concentration binding to the educt of the reaction; for example, the effect of a ligand with constant external concentration that binds to a receptor incorporated in this way via the law of mass action. Formally, the input dose variable, d, partakes in the mathematical expressions labeling w edges in the model’s graph G, . We call members of the set “dose edges” of G and their labels are expressions proportional to the input dose variable, d:
where are functions that do not contain d (see Fig. 3 A for the insulin receptor example in which we assume that the dose affects receptor-ligand binding).
In our analysis, the response is a linear combination of the steady-state concentrations at chosen output vertices (such as the phosphorylated receptor-ligand complex, , in Fig. 3 A). The q species eliciting the response are associated with a set of output vertices in closed systems. Then, if we use Eq. 4 to derive a general expression for the steady-state response of closed models using Kirchhoff polynomials, , we obtain
where designates the weight given to the steady-state concentration associated with vertex . The response function for open models is similar (see the Supporting Material). This implies that steady-state dose-response curves of linear framework models are rational functions of the dose variable, for example, .
Perturbations are any changes in the model structure (additions and deletions of species and reactions), parameters (having different values in the reference and perturbed model; such parameters we call “differential parameters”), number and position of the edges affected by the input variable, number and position of the output vertices, and parametrization of the output function. We capture these perturbations by defining two linear framework models for a reference (α) and a perturbed (β) condition, each of which consists of a graph, G, a set of parameters, p (from G’s labels and for closed models), a set of dose edges, I (whose labels contain the dose variable, d), and a set of output vertices, O, the concentrations associated to which are weighted by a to obtain the observed response . Formally, a differential system is then an ordered pair, .
We use our example insulin receptor model from Fig. 2 A to define a first differential system, (Fig. 3 A). Let the receptor, R, transition to its ligand-bound state, RL, upon activation by a ligand with constant concentration d. We account for the dose variable d in the transition rate by changing the label of edge to . To construct a differential Laplacian system, we consider a reference model with graph topology as the example graph, G, dose edge , i.e., with , a single output vertex, , weighted by 1, and parameters . The perturbed model is identical to the reference one, with the exception of the values of parameters and , which we will denote as , ( and ) in the reference (perturbed) system. These differential parameters correspond to stimulation of the system with two different ligands that have different affinities to the receptor R, such as insulin and IGF-1. Since reference and perturbed model are identical except for their differential parameters, we can illustrate the differential system by a single graph and highlight the differential parameters as shown in Fig. 3 A.
Symbolic derivation of the differential and its properties
For tractability, we are first interested in deriving analytical expressions for the differential of systems with a constant input that influences exactly one edge proportionally in both reference and perturbed models, and , to model, for example, a ligand binding once to a receptor.
To express the steady-state response, , explicitly as a function of the dose variable, d, we apply the deletion-contraction property from Eq. 5 to partition the set of spanning trees from the numerator and denominator of the corresponding rational function into two categories—those that contain the dose d in one of their labels and those that do not. This is equivalent to factoring out the edge labels from the monomials in the corresponding Kirchhoff polynomial that contain the dose variable d. After simplification, we obtain the general form of dose-response expressions for models with a graph G:
(6) |
where for closed models (see the Supporting Material for open models),
We call the terms “steady-state coefficients,” although they are symbolic expressions involving parameters and Kirchhoff polynomials of specific graphs obtained from G. The coefficients k might be zero if spanning trees do not exist in the respective graphs.
How to obtain the relevant graphs participating in the coefficients for closed models can be seen in the tree scheme from Fig. 3 B (see the tree scheme for open models in Fig. S1), and an application to the example differential system can be found in Fig. 3 C. With only one edge containing the dose d, the numerator and denominator are at most first degree in d, and thus, the tree schemes in Fig. 3, B and C, have four leaves. Note that the tree scheme is the same for the reference and perturbed systems, except for the differential parameters. The coefficients for the example are (rtRLp(G)\ed), (rtRLp(G)/ed), (G\ed), and . By generating the Kirchhoff polynomials in the respective graphs, we obtain
Following the procedure for deriving the differential expressions and (for details, see the Supporting Material), we obtain
(7) |
When either or is zero, the response differential has a simpler form but the dose differential is not affected. The differential can be degenerate, e.g., when is always zero, implying that , or undefined, e.g., when is either constant or unbounded, leading to or .
The dose differential in Eq. 7 depends not on h, , and but on the Kirchhoff polynomials contained in the coefficients and . It is therefore independent of the choice of output vertices in O, of the vertex weights, a, and of the total conserved amount, , in closed systems (the synthesis reactions, , in open systems) for the reference and perturbed models. The dose differential can be simplified to the logarithm of an irreducible fraction by obtaining the prime factorizations of the numerator and denominator and dividing them by the greatest common divisor, . Then, the necessary and sufficient condition for a reaction to participate in is to be part of a prime component of the relevant graphs that is different in the reference and perturbed models, and not the same in the dose-edge-deleted and dose-edge-contracted graphs of the same condition. To illustrate these points, consider for closed differential systems:
The polynomials (G\ed) are factorizable if and only if is a prime bridge. Edge contraction for could also lead to a factorizable Kirchhoff polynomial if any of the deleted edges during the procedure is a prime bridge. The factor we assume to be prime as part of the label of a single edge. Overall, thus, whether the dose differential is reducible depends exclusively on the perturbation, the connectivity of the dose edge, (in and ), and the connectivity of the edges in the immediate neighborhood of ed. Note that the reducibility characterization of for open differential systems is analogous but includes an additional dependency on the location and connectivity of the synthesis edges in and .
The response differential, , in Eq. 7 is a sum of ratios dependent on all coefficients and includes the conserved in each ratio for closed systems, the synthesis reactions in open systems, and the mapping variable h. However, it does not depend on the dose-edge label function, , because always cancels in the fraction (for both reference and perturbed coefficients). To illustrate, let us focus on the response differential, (see Eq. 7), for closed systems when . To simplify the expression, first, the common factors between and are canceled (keeping in mind that is a linear combination of prime factorized Kirchhoff polynomials) to obtain and . The response differential can be further reduced if by combining the fractions under a common denominator. Note that the response differential could be zero due to the minus sign. The characterization of the response differential in the general case for h not fixed and for open systems is analogous.
For the example differential system, we find that and where the response component of the differential vanishes because and because the differential parameters in and cancel each other. Also, the dose differential is independent of the rate constant .The example illustrates that by deriving the general form of the differentials and , which are algebraic expressions of Kirchhoff polynomials in linear framework models, we establish a direct connection between the structure of the differential system and its function.
Analyzing the insulin receptor life-cycle
Insulin signaling in response to various ligands determines differential cellular responses through complex and incompletely understood mechanisms (3, 21). To unravel the processes at play, it is important to comprehend the role of reactions and species in the observed cellular functions. To illustrate applicability of our theory to such an investigation, we extend the example insulin model stepwise to a model incorporating insulin receptor binding, recycling, and phosphorylation. The resulting model is a subsystem of a more comprehensive insulin receptor signaling model from (22). This subsystem is appropriate for our analysis, because it is a linear framework model, it is away from equilibrium since receptor recycling and phosphorylation dissipate energy (as indicated by irreversible reactions), and the steady state of phosphorylated insulin receptors determines insulin signal transduction (23).
Here, we do not explicitly consider the molecular details of all reactions of this model but adopt a more general structural analysis approach by equating each reaction rate constant to a unique symbol. Specifically, we take the reaction rate constants , , and (see Fig. 6 A) as unique symbols, although they actually depend on the concentration of protein tyrosine phosphatase (PTP) and are thus coupled. This still allows us to analyze the role of each reaction in the differential, but it precludes algebraic simplifications for the concentration of PTP; if more molecular details are of interest, they have to be explicitly included. Again, we define the differential system by assuming that the reference and the perturbed model differ only in the values of a subset of parameters—the differential parameters corresponding to ligands with different affinity toward the insulin receptor—and that all other elements are identical.
We extend the example model (Fig. 2) with two states, an internalized phosphorylated ligand-bound receptor, , and an internalized receptor, (Fig. 4). These states can be reached by reversible reactions from their non-internalized (membrane) counterparts, and R, to represent endocytosis and receptor recycling, and can be dephosphorylated to R by an irreversible reaction. We regard the phosphorylated ligand-bound receptors as output species, such that , with unit weights .
The resulting differential system is what we call a “basic signaling system.” It is a differential system with a closed graph that contains a reversible reaction for which the forward and reverse rates differ between the reference and perturbed model, and for which the forward rate is affected by the dose variable, d. More precisely, we have a graph G with , , and . Also, the reference and perturbed models have no structural differences; they differ only by the functions and (and the parameters contained in them). In such basic signaling models, the dose-edge-contracted graph is the same for the reference and perturbed systems; therefore, its Kirchhoff polynomial always cancels in the dose-differential fraction. The dose differential for closed systems (results and definitions are analogous for open systems) is
The response component of the differential for will always be zero under the stated assumptions, since . When , the response component is not affected by the assumptions for basic signaling systems.
Although the graph of the extended system contains more states and reactions and the steady-state expression is more complicated (see the Supporting Material), the differential is identical to that of our initial example without endocytosis, because the newly added reactions take part in prime components that do not contain differential parameters and therefore cancel out (see Fig. 4). We can formally describe such differential systems giving rise to identical differential expressions by employing the mathematical concept of “equivalence classes.” Let be a set of differential systems and define the equivalence relation, , on such that , , iff and . Thus, the initial and extended models of insulin signaling belong to the same equivalence class with respect to their differential. The concept of equivalence classes has direct applications for model selection and experimental design. For example, assume that we have experimental evidence that changes in the rate constant of a reaction such as , in both the reference and perturbed models, affect the dose differential. If this reaction does not take part in the dose-differential expression of the equivalence class of models we are studying (as for the equivalence class for insulin signaling defined above), we can directly reject all members of the equivalence class, because they are incompatible with the experimental observation. Similarly, we can use the analytic framework to answer questions such as what second perturbation to design (or how to change the first perturbation) to differentiate between models in the same equivalence class (see the Supporting Material).
Next, to analyze long-term effects of insulin receptor trafficking, we need to account for receptor synthesis and degradation. This converts the model from a closed system to an open system. Let us consider a model extension in which the non-internalized free receptor, R, is synthesized and degraded. Then, however, the differential is not defined because the steady-state coefficient is zero. Fig. 5 A shows that the contraction of the dose edge eliminates the degradation edge and, additionally, that the rooting at the environment vertex eliminates the synthesis edge ; this results in a disconnected graph with no spanning trees in . Hence, we can reject this extension just based on structural considerations. We therefore add synthesis and degradation reactions for as a more biologically plausible way to capture receptor trafficking (Fig. 5 B). The steady-state expression changes in this case (see the Supporting Material), but the differential is still identical to that of the initial example model from Fig. 2; the extended model belongs to the same equivalence class due to cancellation of prime factors that do not contain differential parameters in both reference and perturbed models (see Fig. 5 B). Overall, our structural analysis predicts that differential responses for ligands with different affinities always exhibit a zero response differential and a dose differential dependent on the ligand affinities. This result is corroborated by the observation that most natural and modified insulin receptor ligands shift sigmoid dose-response curves leftward or rightward (relative to insulin) but have an identical maximal response (24).
Extension: two dose edges
To extend the framework to cases in which hormesis is possible, we consider that the input dose acts proportionally and simultaneously on two edges, i.e., , , and . To derive the general form of dose-response expressions for closed and open systems, we apply the deletion-contraction formula to partition the set of spanning trees from the numerator and denominator of the response function, , into four categories—those containing no input edges, those containing but not , those containing but not , and those containing both and . After simplification, we obtain
(8) |
where and (see the Supporting Material for all details). For two dose edges, the numerator and denominator polynomials in the dose d can be at most of degree two; for an input acting on w edges simultaneously, the maximal degree is w. However, the spanning tree partitioning determines the graphs contained in the coefficients k (see, for definitions, the tree scheme in Fig. S3). The exponential increase of the number of relevant graphs with number of dose edges ( graphs) makes systems with multiple inputs complex.
Depending on the coefficients , two or three critical points define a sigmoid or a biphasic/hormetic dose-response relationship, respectively. A necessary and sufficient condition for a biphasic dose-response function, which we call the “Hormesis condition,” is
Because the values of the coefficients are not known in general, the number of critical points cannot be determined unambiguously, and we need to distinguish cases depending on the number of segments in the reference and perturbed curves that are mapped to each other to derive the differential and (see the Supporting Material for details).
The differential expressions have a more complicated form (for example, they may involve square roots), and multiple conditions have to be considered, but the expressions are symbolic and (more involved) symbolic analysis with efficient methods for the generation and manipulation of Kirchhoff polynomials is applicable. For example, general properties of the differentials in systems with two dose edges are 1) in contrast to systems with a single dose edge, the dose differential, , depends on the choice of output vertices and their weights; 2) depends on the proportion variable, h, and the synthesis reactions in open systems, but not on the conserved in closed systems; 3) the response differential, , depends on all eight partitions of the set of spanning trees of G, and it includes in closed systems; and 4) the differential can also be degenerate or undefined.
To apply the theory to insulin signaling, we consider a more detailed model for receptor trafficking that includes binding of a second ligand molecule to the receptor, as well as the relevant reactions and internalized species shown in Fig. 6 A. It is a receptor-level subsystem of the more comprehensive model of insulin signaling from (22). The motivation for the analysis is twofold. First, designed insulin analogs that trigger hormetic responses could potentially influence insulin differential signaling, for example, by alleviating mitogenic effects at high ligand doses. Second, biphasic dose-response behaviors of insulin receptor phosphorylation were experimentally observed in cells stimulated with an insulin analog, the peptide S961 (24). We therefore asked if there exist sets of species that need to be active to achieve a robust hormetic response in the model. We derived the steady-state coefficients for all 127 combinations of output vertices in O and used the hormesis condition to classify the resulting models into four groups. The first three groups contain models with robust dose-responses (same qualitative shape irrespective of parameter values), namely, 1) constant response for models with output vertex ; 2) sigmoid for models with output vertices , , and ; and 3) hormetic for models with output vertices , , and (see Fig. 6 B). The fourth group contains the remaining 120 combinations of output vertices; the shapes of dose-response curves generated by these models depend on parameter values (see the Supporting Material for examples of analysis results that account for parameter dependencies). In particular, outputs producing robust hormetic responses comprise only phosphorylated receptors bound to a single ligand molecule, implying that doubly bound phosphorylated receptors should not signal if we are to obtain a robust hormetic dose response. This is consistent with the proposed mechanism of action of S961 in (24), namely, that the first binding of S961 cross-links the receptor, leading to agonist activity, and the second binding results in receptor inactivation.
Extension: numerical analysis
Numerical methods can augment the symbolic analysis when quantitative prior knowledge or experimental data are available. For example, we can directly incorporate known parameter values (or their ratios) to simplify symbolic expressions. We may also account for prior knowledge by defining plausible intervals of parameter values and determining whether the model’s thus bounded differential is consistent with an experimentally observed differential, thereby assessing whether the model agrees with the observations.
Formally, we denote the range of the differential as the interval , where is a box in parameter space. Let a and b be the infimum and supremum of . We want to determine tight outer bounds of the differential, finding and of over the parameter box , where and . In systems with a single dose edge, with edge labels that contain rational expressions, and regarding all parameters as variables, bounding the differential translates to finding the global extrema of a multivariate rational function. This is an NP-hard problem (25), but several numerical methods give certificates for global optimality or find bounds based on polynomial optimization. Here, we focus on Bernstein enclosures of rational functions (26) whose implementation in the Kodiak package, for example, was shown to produce tighter outer bounds for a dose differential expression with 12 free variables compared to the established method of interval arithmetic (27).
To illustrate the approach for the example model from Fig. 3 A (for a two-dose-edge example, see the Supporting Material), we analyze how the bounds of the dose differential change when a parameter is altered; we call the resulting plots “profile differential bounds”. Specifically, we fix each of the free parameters taking part in the differential, and , for every value in their interval of definition, and calculate the bounds of the dose differential (Fig. 7 A). This analysis shows the capacity of individual parameters to control and constrain the possible magnitude of the differential: independent of the value of the other free parameter, can significantly change the lower bound of the differential for values smaller than ∼, and for higher values it starts decreasing the upper bound. If the observed differential is outside the calculated bounds, one can reliably reject the model (and the parameter box ), because no parametrization (in ) can reproduce the observed differential. Conversely, this analysis can help confine such that all considered parameter values are consistent with the experimental observations.
The non-linear differential expressions’ algebraic structures also imply that not every differential value is equally likely when we sample parameter values uniformly from . The non-uniformity of the differential can be interpreted as parametric robustness, because random changes of parameters may not lead to random magnitudes of the differential. For our simple model from Fig. 3 A, uniform sampling of the dose differential yields a few magnitudes of variability but two small regions of most probable values with a marginal density peaked at and (Fig. 7 B). The profile differential distributions in Fig. 7 A show how the free parameters (ligand dissociation rate in the perturbed model) and (receptor phosphorylation) affect not only the bounds but also the peak of the marginal distribution, revealing their potential to control the differential. Hence, the differential system induces an important structural prior on the behavior—system structures (and parameter intervals) define what magnitude of the differential can be expected. This information could be exploited for more detailed model selection against experimental data.
Discussion
Here, we aimed to develop a theory for steady-state dose-response relationships in linear framework models of biochemical reaction networks that is analytic, and therefore also applicable when parameter values are largely unknown. We formalized the concept of differential responses to establish a comprehensive parameter-free framework for analyzing relative responses. It helps to study system behaviors upon perturbations of many features of reaction networks, such as network topology, parameters, and choice and number of inputs and outputs. In particular, the algebraic and numerical methods allow us to explore possible network topologies and perturbations to arrive efficiently at a set of candidate models that are consistent with prior knowledge and experimental data. For example, if it is known from experiments that a particular perturbation leads to a significant dose differential, we can reliably reject all potential models for which the differential expression does not depend on the perturbed variable. Numerical bounding over a predefined region in parameter space provides us with limits for all possible differential magnitudes—which we can use as a certificate to reliably reject models that can never reproduce quantitative experimental data. Another application of our methodology is in experimental design, namely, to determine (optimal) perturbations of a reference system that lead to a desired differential, to invalidate a model, to discriminate between equivalence classes of networks, or for applications such as finding optimal (combinations of) drug targets. Note also that the differential analysis framework extends to applications beyond the ones we covered: in general, every element of the differential system can be perturbed and labels for dose edges can be formulated in a more general form.
The most obvious limitation of our framework is the restriction to systems with Laplacian dynamics; it has to rely on prior, often parameter- and state-dependent simplifications from higher-order kinetics, such as those obtained via timescale separation. In terms of scalability, the exponential growth of the number of graphs to be considered for the analysis and the resulting high-degree polynomials with increasing number of reactions influenced by the dose is a challenge. Our examples demonstrate that systems with up to two dose edges can be analyzed efficiently due to the linear scaling of the prime factorization algorithm and the few relevant graphs. General algebraic solutions for the roots of polynomial equations of degree five or higher with arbitrary coefficients—corresponding to the dose acting proportionally on five or more edges—do not exist. However, depending on the particular graph structure and form of the label expressions, such cases, or cases with labels that are non-linear functions of the dose, could be analytically tractable if the polynomials simplify to a lower degree. Similarly, obtaining exact bounds might only be feasible for simple differential expressions with few variables and convex properties. In practice, therefore, bounding methods provide us with inner bounds, which do not guarantee reliable model rejection. Because bounding approaches are often employed in control theory (28) and methods are being continuously improved (27, 29), these limitations may be less pertinent.
Extensions of the framework could further build on the central insight that prime factors and components (and their similarity in perturbed graphs) are the units, the very characterization, of the differential response function in steady-state linear framework models. For example, additions and deletions of vertices and edges in certain parts of both the reference and perturbed graph may never have an effect on the differential if they belong to a prime component that always cancels in the differential expression. In general, the positions (connectivity) of additions and deletions within the graph and the size distribution of the induced prime components play an important role in the change of the differential upon structural perturbations. This could be exploited to develop a notion of “structural robustness” of the differential to random changes of the network topology. Finally, many of our graph theoretical notions subsume graph theoretical concepts such as distributions of sizes of SCCs, strong bridges, and strong articulation points that are actively researched in computer science; efficient algorithms for their characterization (30, 31, 32, 33, 34) could help to extend the scope of our framework.
In terms of applications, our analysis of the insulin signaling network demonstrates first steps in a direction we believe will become increasingly important: a systematic analysis of differential dose responses in biochemical reaction networks despite prevailing uncertainties as to the networks’ quantitative features. In particular, we showed how to characterize the dose responses of natural ligands, such as insulin and IGF-1, where different affinities influence the dose but not the maximal response component, and we recovered previously hypothesized modes of action of the insulin analog S961. We envisage that our framework will be most useful for the systematic study of mechanisms underlying hormesis, a phenomenon in toxicology and cell signaling that is receiving increasing attention. For example, empirical evidence favored hormetic over threshold models for dose-response relationships in a large-scale yeast anticancer drug screen (35), and hormetic phenomena are frequently observed in stress responses and their relations to aging (36). Corresponding theoretical work has only (re-)started very recently, showing, for example, that non-monotonic dose-response relationships can arise through non-obvious pathway interactions, and that network structures impose fundamental constraints on options for pharmacological treatment (37).
Author Contributions
P.Y. and J.S. designed research, P.Y. performed research, and P.Y. and J.S. wrote the manuscript.
Acknowledgments
We thank Jeremy Gunawardena and anonymous reviewers for critical comments and valuable suggestions.
We gratefully acknowledge financial support from the European Union FP7 project IFNAction (contract 223608).
Editor: Stanislav Shvartsman.
Footnotes
Supporting Materials and Methods, Supporting Results, and five figures are available at http://www.biophysj.org/biophysj/supplemental/S0006-3495(17)35042-7.
Contributor Information
Pencho Yordanov, Email: pencho.yordanov@bsse.ethz.ch.
Jörg Stelling, Email: joerg.stelling@bsse.ethz.ch.
Supporting Material
References
- 1.Tallarida R., Jacob L.S. Springer-Verlag; New York: 1979. The Dose-Response Relation in Pharmacology. [Google Scholar]
- 2.Muller P.Y., Milton M.N. The determination and interpretation of the therapeutic index in drug development. Nat. Rev. Drug Discov. 2012;11:751–761. doi: 10.1038/nrd3801. [DOI] [PubMed] [Google Scholar]
- 3.Jensen M., De Meyts P. Molecular mechanisms of differential intracellular signaling from the insulin receptor. Vitam. Horm. 2009;80:51–75. doi: 10.1016/S0083-6729(08)00603-1. [DOI] [PubMed] [Google Scholar]
- 4.Calabrese E.J., Baldwin L.A. Hormesis: the dose-response revolution. Annu. Rev. Pharmacol. Toxicol. 2003;43:175–197. doi: 10.1146/annurev.pharmtox.43.100901.140223. [DOI] [PubMed] [Google Scholar]
- 5.Mattson M.P. Hormesis defined. Ageing Res. Rev. 2008;7:1–7. doi: 10.1016/j.arr.2007.08.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Di Veroli G.Y., Fornari C., Jodrell D.I. An automated fitting procedure and software for dose-response curves with multiphasic features. Sci. Rep. 2015;5:14701. doi: 10.1038/srep14701. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Cvrčková F., Luštinec J., Žárský V. Complex, non-monotonic dose-response curves with multiple maxima: do we (ever) sample densely enough? Plant Signal. Behav. 2015;10:e1062198. doi: 10.1080/15592324.2015.1062198. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Conradi C., Flockerzi D., Stelling J. Subnetwork analysis reveals dynamic features of complex (bio)chemical networks. Proc. Natl. Acad. Sci. USA. 2007;104:19175–19180. doi: 10.1073/pnas.0705731104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Craciun G., Pantea C. Identifiability of chemical reaction networks. J. Math. Chem. 2008;44:244–259. [Google Scholar]
- 10.Gunawardena J. A linear framework for time-scale separation in nonlinear biochemical systems. PLoS One. 2012;7:e36321. doi: 10.1371/journal.pone.0036321. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Mirzaev I., Gunawardena J. Laplacian dynamics on general graphs. Bull. Math. Biol. 2013;75:2118–2149. doi: 10.1007/s11538-013-9884-8. [DOI] [PubMed] [Google Scholar]
- 12.Thomson M., Gunawardena J. The rational parameterization theorem for multisite post-translational modification systems. J. Theor. Biol. 2009;261:626–636. doi: 10.1016/j.jtbi.2009.09.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Ahsendorf T., Wong F., Gunawardena J. A framework for modelling gene regulation which accommodates non-equilibrium mechanisms. BMC Biol. 2014;12:102. doi: 10.1186/s12915-014-0102-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Estrada J., Wong F., Gunawardena J. Information integration and energy expenditure in gene regulation. Cell. 2016;166:234–244. doi: 10.1016/j.cell.2016.06.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Gunawardena J. Time-scale separation—Michaelis and Menten’s old idea, still bearing fruit. FEBS J. 2014;281:473–488. doi: 10.1111/febs.12532. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Mihal'ák, M., P. Uznański, and P. Yordanov. 2016. Prime factorization of the Kirchhoff polynomial: compact enumeration of arborescences. In Proceedings of the 13th Workshop on Analytic Algorithmics and Combinatorics (ANALCO16), J. A. Fill and M. D. Ward, eds. (SIAM), pp. 93–105.
- 17.Mirzaev I., Bortz D.M. Laplacian dynamics with synthesis and degradation. Bull. Math. Biol. 2015;77:1013–1045. doi: 10.1007/s11538-015-0075-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Tutte W.T. The dissection of equilateral triangles into equilateral triangles. Math. Proc. Camb. Phil. Soc. 1948;44:463–482. [Google Scholar]
- 19.Gabow H.N., Myers E.W. Finding all spanning trees of directed and undirected graphs. SIAM J. Comput. 1978;7:280–287. [Google Scholar]
- 20.Levine L. Sandpile groups and spanning trees of directed line graphs. J. Comb. Theory Ser. A. 2011;118:350–364. [Google Scholar]
- 21.Morcavallo A., Stefanello M., Morrione A. Ligand-mediated endocytosis and trafficking of the insulin-like growth factor receptor I and insulin receptor modulate receptor function. Front. Endocrinol. (Lausanne) 2014;5:220. doi: 10.3389/fendo.2014.00220. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Sedaghat A.R., Sherman A., Quon M.J. A mathematical model of metabolic insulin signaling pathways. Am. J. Physiol. Endocrinol. Metab. 2002;283:E1084–E1101. doi: 10.1152/ajpendo.00571.2001. [DOI] [PubMed] [Google Scholar]
- 23.Goldstein B.J., Bittner-Kowalczyk A., Harbeck M. Tyrosine dephosphorylation and deactivation of insulin receptor substrate-1 by protein-tyrosine phosphatase 1B. Possible facilitation by the formation of a ternary complex with the Grb2 adaptor protein. J. Biol. Chem. 2000;275:4283–4289. doi: 10.1074/jbc.275.6.4283. [DOI] [PubMed] [Google Scholar]
- 24.Knudsen L., Hansen B.F., De Meyts P. Agonism and antagonism at the insulin receptor. PLoS One. 2012;7:e51972. doi: 10.1371/journal.pone.0051972. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Jibetean D., de Klerk E. Global optimization of rational functions: a semidefinite programming approach. Math. Program. 2006;106:93–109. [Google Scholar]
- 26.Narkawicz A., Garloff J., Munoz C.A. Bounding the range of a rational function over a box. Reliab. Comput. 2012;17:34–39. [Google Scholar]
- 27.Smith A.P., Muñoz C.A., Markevicius M. NASA, Hampton, VA; 2015. Kodiak: An Implementation Framework for Branch and Bound Algorithms: TM-2015–218776. [Google Scholar]
- 28.Walter E., Norton J., Milanese M. Plenum Press; New York: 1996. Bounding Approaches to System Identification. [Google Scholar]
- 29.Titi J., Hamadneh T., Garloff J. Convergence of the simplicial rational Bernstein form. In: Thi H.A.L., Dinh T.P., Nguyen N.T., editors. Proceedings of the 3rd International Conference on Modelling, Computation and Optimization in Information Systems and Management Sciences. Springer International; 2015. pp. 433–441. [Google Scholar]
- 30.Italiano G., Laura L., Santaroni F. Finding strong bridges and strong articulation points in linear time. Theor. Comput. Sci. 2012;447:74–84. [Google Scholar]
- 31.Georgiadis L., Italiano G.F., Parotsidis N. 2-edge connectivity in directed graphs. In: Indyk P., editor. Proceedings of the 26th ACM-SIAM Symp. on Discrete Algorithms. SIAM; 2015. pp. 1988–2005. [Google Scholar]
- 32.Georgiadis L., Italiano G.F., Parotsidis N. 2-vertex connectivity in directed graphs. In: Handorsson M.M., Iwama K., Kobayashi N., Speckmann B., editors. Proceedings 42nd Intl. Coll. on Automata, Languages, and Programming. Springer; 2015. pp. 605–616. [Google Scholar]
- 33.Georgiadis, L., G. F. Italiano, and N. Parotsidis, 2015. A new framework for strong connectivity and 2-connectivity in directed graphs. arXiv:1511.02913, https://arxiv.org/abs/1511.02913.
- 34.Firmani D., Georgiadis L., Santaroni F. Strong articulation points and strong bridges in large scale graphs. Algorithmica. 2015;74:1123–1147. [Google Scholar]
- 35.Calabrese E.J., Stanek E.J., 3rd, Hoffmann G.R. Hormesis predicts low-dose responses better than threshold models. Int. J. Toxicol. 2008;27:369–378. doi: 10.1080/10915810802503735. [DOI] [PubMed] [Google Scholar]
- 36.Gems D., Partridge L. Stress-response hormesis and aging: “that which does not kill us makes us stronger”. Cell Metab. 2008;7:200–203. doi: 10.1016/j.cmet.2008.01.001. [DOI] [PubMed] [Google Scholar]
- 37.van Wijk R., Tans S.J., Mashaghi A. Non-monotonic dynamics and crosstalk in signaling pathways and their implications for pharmacology. Sci. Rep. 2015;5:11376. doi: 10.1038/srep11376. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.