Skip to main content
PLOS Computational Biology logoLink to PLOS Computational Biology
. 2020 May 4;16(5):e1007573. doi: 10.1371/journal.pcbi.1007573

Robustness and parameter geography in post-translational modification systems

Kee-Myoung Nam 1, Benjamin M Gyori 2, Silviana V Amethyst 3, Daniel J Bates 4, Jeremy Gunawardena 1,*
Editor: Pedro Mendes5
PMCID: PMC7224580  PMID: 32365103

Abstract

Biological systems are acknowledged to be robust to perturbations but a rigorous understanding of this has been elusive. In a mathematical model, perturbations often exert their effect through parameters, so sizes and shapes of parametric regions offer an integrated global estimate of robustness. Here, we explore this “parameter geography” for bistability in post-translational modification (PTM) systems. We use the previously developed “linear framework” for timescale separation to describe the steady-states of a two-site PTM system as the solutions of two polynomial equations in two variables, with eight non-dimensional parameters. Importantly, this approach allows us to accommodate enzyme mechanisms of arbitrary complexity beyond the conventional Michaelis-Menten scheme, which unrealistically forbids product rebinding. We further use the numerical algebraic geometry tools Bertini, Paramotopy, and alphaCertified to statistically assess the solutions to these equations at ∼109 parameter points in total. Subject to sampling limitations, we find no bistability when substrate amount is below a threshold relative to enzyme amounts. As substrate increases, the bistable region acquires 8-dimensional volume which increases in an apparently monotonic and sigmoidal manner towards saturation. The region remains connected but not convex, albeit with a high visibility ratio. Surprisingly, the saturating bistable region occupies a much smaller proportion of the sampling domain under mechanistic assumptions more realistic than the Michaelis-Menten scheme. We find that bistability is compromised by product rebinding and that unrealistic assumptions on enzyme mechanisms have obscured its parametric rarity. The apparent monotonic increase in volume of the bistable region remains perplexing because the region itself does not grow monotonically: parameter points can move back and forth between monostability and bistability. We suggest mathematical conjectures and questions arising from these findings. Advances in theory and software now permit insights into parameter geography to be uncovered by high-dimensional, data-centric analysis.

Author summary

Biological organisms are often said to have robust properties but it is difficult to understand how such robustness arises from molecular interactions. Here, we use a mathematical model to study how the molecular mechanism of protein modification exhibits the property of multiple internal states, which has been suggested to underlie memory and decision making. The robustness of this property is revealed by the size and shape, or “geography,” of the parametric region in which the property holds. We use advances in reducing model complexity and in rapidly solving the underlying equations, to extensively sample parameter points in an 8-dimensional space. We find that under realistic molecular assumptions the size of the region is surprisingly small, suggesting that generating multiple internal states with such a mechanism is much harder than expected. While the shape of the region appears straightforward, we find surprising complexity in how the region grows with increasing amounts of the modified substrate. Our approach uses statistical analysis of data generated from a model, rather than from experiments, but leads to precise mathematical conjectures about parameter geography and biological robustness.

Introduction

Biological systems are widely acknowledged to be robust, which informally means that some property of a system is insensitive to perturbations. Particular forms of robustness, such as homeostasis in physiology [1], canalisation in development [2], and resilience in ecology [3], have been extensively studied. Robust design has been suggested as a general biological criterion [4] with parallels to engineering [5, 6] and as an important requirement for synthetic biological systems [7]. Furthermore, organismal robustness is often invoked as a form of buffering, to account for the extensive genotypic variation seen in populations, on which natural selection may subsequently act [8, 9]. A better understanding of robustness is therefore relevant to many aspects of biology.

We approach this problem here through mathematical analysis. The perturbations to which a biological system is robust typically arise in the system’s environment. When the system is represented by a mathematical model, it is the model’s parameters which capture the interactions between the system and its environment, so that perturbations are represented by changes in parameter values. This kind of parametric robustness is not the only way in which robustness can be interpreted mathematically—the effect of noise on the dynamics or of changes to conserved quantities may also be important [10]—but parametric robustness has been widely studied.

To clarify this kind of parametric robustness further, it is helpful to keep in mind the relationship between parameters and state variables, as shown in Fig 1A. We have assumed that the underlying mathematical model is that of a system of ordinary differential equations, because we will use this kind of model here, but a similar picture could be drawn for a system of difference equations, or for stochastic or partial differential equations. It is only when the parameters are given numerical values that a dynamics is specified in the state space. If the state variables are in some initial condition, the system follows a trajectory over time and eventually reaches a steady-state or a limit cycle or some more complicated attractor [11]. Crucially, this dynamics in the state space depends on the choice of numerical values in the parameter space. We typically expect the parameter space to break up into regions so that the dynamical portrait varies only quantitatively within a region, but changes qualitatively between regions. Bifurcations arise on the boundaries between regions and give rise to the abrupt change in qualitative dynamics from one parameter region to the next.

Fig 1. Models and their parameter geography.

Fig 1

A: Behaviour of a hypothetical mathematical model, as in a system of ordinary differential equations. When a point is chosen in the parameter space (right), indicated by a number 1, …, 4, a dynamics takes place in the state space (left), shown by the trajectories with arrowheads. The state space and parameter space are shown here as 2-dimensional but could be of any dimension. The parameter space is expected to break up into regions, shown here within a box of finite volume (dotted boundary) following the method adopted in the paper, such that the dynamics remains qualitatively similar within each region, as for parameter points 2 and 3, and becomes qualitatively different between regions. Parameter point 1 gives bistability in the dynamics, parameter points 2 and 3 give monostability, and parameter point 4 gives a stable limit cycle. Stable attractors are magenta; unstable attractors are cyan. B: Hypothetical shapes of regions in parameter space, assumed to be within a finite volume in two dimensions. All except region 2 have nonzero volume in two dimensions in the vicinity of each point. Region 1 is convex; region 2 has one-dimensional subregions, which would not be detectable by random sampling in two dimensions; region 3 has a hole in its interior; region 4 has a narrow “neck;” region 5 is disconnected.

System properties whose robustness is being assessed are typically defined for particular dynamical portraits and are therefore properties of one or more parametric regions. We will focus here on the property of bistability: the existence in the dynamical portrait of two stable steady-states, accompanied by one unstable steady-state, as in parameter point 1 in Fig 1A. Technically speaking, we will work in terms of stationarity—the existence of steady-states—rather than stability, which requires an assessment of local dynamics. However, as a convenience of language, we will continue to use “monostability” and “bistability” in favour of the less euphonious “monostationarity” and “tristationarity,” respectively, and we explain further below the issues involved in distinguishing between these properties.

Bistability has been widely interpreted as the mathematical counterpart of biological decision making, switching, or memory. It has been used, for example, to interpret state switching in single cells, both in unicellular organisms [12, 13] and in individual cells within multi-cellular organisms [1418]; state switching in whole organs [19]; cell lineage choice during organismal development [2025], where the implications of bistability have been widely reviewed [2628]; and memory formation during signal transduction [14, 29, 30] and neuronal learning [3133].

Here, we will consider bistability arising from protein post-translational modification (PTM). PTM is the mechanism by which amino acid residues in a protein are covalently modified in response to physiological conditions, through the catalytic action of forward-modifying and reverse-demodifying enzymes [34]. Phosphorylation is the most widely studied modification, but many others are now known and PTM is a central mechanism in cellular information processing [35]. Models show that bistability, and even multistability with more than two stable steady-states, can emerge in PTM systems, provided a substrate protein is modified on two or more sites by one forward and one reverse enzyme [29, 30]. Such multisite modification is common, and bistability in PTM has been suggested as the basis for cellular memory [36].

The choice of PTM as a bistability mechanism has the advantage that steady-states can be realised as solutions to polynomial equations. This permits analysis by numerical solution of polynomial equations rather than by numerical integration of differential equations. The former is much faster computationally. This allows the robustness problem to be addressed by randomly generating points in parameter space and identifying those which give rise to bistability. In this way, the bistable region can be effectively characterised. Such a statistical approach has interesting parallels with high-dimensional data analysis, although, here, the data arise not from experiments but from a model.

Many kinds of approaches have been taken to quantitatively assess robustness in this way, such as by parametric sensitivity [3740], or by estimating volume and shape [4147]. Algebraic methods can sometimes provide an analytical description of parametric regions [46, 4851], but these methods tend to scale poorly with the complexity of the system. For systems arising from networks of biochemical reactions, methods also exist which give parametric conditions under which bistability occurs [5258], or does not occur [5962], and some of these apply to PTM systems [54, 57, 58, 6366]. Discriminant locus approaches based on more general algebraic geometric techniques have been used to find regions of parameter space for which appropriate generic behaviours occur [67]. Bistable parametric regions have thereby been demarcated in various contexts [49, 53, 56, 58, 64, 66]. However, the relevant conditions for bistability are typically sufficient, but not always necessary, making it difficult in some cases to exactly determine bistable regions. Furthermore, these kinds of results also typically require a complete description of the underlying network, which makes it difficult to rise above the biochemical complexity.

Here, we build upon the approach of exploring the size and shape of parametric regions. Such “parameter geography” seeks to make a global assessment of the bistable region. The first property to consider is the dimension of the region. If the parameter space has dimension m, the bistable region may also be of dimension m (Fig 1B, example 1), of lower dimension, or some combination of the two (Fig 1B, example 2). Lower dimensionality has nearly always been neglected in the biological literature because it would never be found by random sampling. However, we are not aware of theorems that would rule it out for a general dynamical system, and it could conceivably arise from some mathematical constraint or degeneracy among the parameters. Therefore, to be careful, results obtained by parametric sampling should be qualified by the statement “with probability one,” to allow for any subsets of lower local dimension that are invisible to the sampling process. We will take this caveat for granted in what follows.

Assuming the bistable region has full dimension relative to the ambient parameter space, so that points within it can be found by sampling, a concise, global measure of robustness is the m-dimensional volume of the region, as contained within some m-dimensional box of finite extent (Fig 1A). This may be statistically estimated by counting the proportion of points in the region. The larger the volume, the more bistable parameter points and the more robust the property of bistability. However, volume gives no information about the region’s shape [46], which may conceivably exhibit local features, such as interior holes and cavities (Fig 1B, example 3) or waists (Fig 1B, example 4). Relatively small changes to parameter values in such local regions may destroy bistability and compromise robustness. Convexity offers a test for this. A region is convex if, given any two points within the region, the straight line segment connecting the two points also lies within the region (Fig 1B, example 1). Convexity is a strong property of a region but a more nuanced “visibility ratio” can be estimated by randomly choosing pairs of points from the region and estimating the frequency with which the line segment between each pair lies entirely in the region. The higher the visibility ratio, the closer the region is to convexity and the more robust the property of bistability. Another measure of shape is topological connectedness. A region is disconnected if it consists of two or more separated pieces (Fig 1B, example 5). Lack of connectivity may indicate that bistability arises for different reasons, which may have different degrees of robustness. Connectedness may be estimated using a connectivity graph, originally developed for robotic motion planning. These measures of volume, convexity, and connectedness will be the focus of the results presented here, but we note that they are only a first step towards understanding the complexities of shape in high dimensions [68].

Two recent developments, one mathematical and one computational, make the random sampling of high-dimensional parameter space feasible for determining parameter geography under realistic biochemical assumptions. We briefly describe the two developments here, with further details in the main text.

First, we use the graph-based linear framework for timescale separation to describe PTM systems [69, 70]. The framework offers several advantages. To begin with, it allows the enzyme mechanism underlying each modification to be treated in a general and realistic manner, instead of having to assume only the Michaelis-Menten reaction scheme. Specifically, an enzyme E that converts substrate S into product P can follow any mechanism that is built up from the elementary reactions in the following “grammar,”

S+EYi,YjYk,YP+E, (1)

where the Y’s are intermediate enzyme-substrate complexes [71, 72]. This allows a mechanism to take multiple routes with multiple intermediates and to be irreversible (product cannot be converted back into substrate) without being strongly irreversible (product does not rebind to enzyme). Fig 2 shows a linear-framework graph using the reactions (edges) in Eq 1 for an example of a weakly irreversible mechanism, i.e., a mechanism in which product is not converted to substrate but can rebind to enzyme.

Fig 2. Reaction network and example enzyme mechanism for a two-site PTM system.

Fig 2

A two-site PTM system is shown, in which modification and demodification are sequential and in which each enzymatic step yields a single product (“distributivity”). The box on the right shows an example of an enzyme mechanism made up from the elementary reactions in the grammar in Eq 1, illustrating multiple routes and multiple intermediate enzyme-substrate complexes. This example is weakly irreversible: the product S1 can bind to E but cannot be converted back into substrate S0, so that the mechanism is irreversible overall.

The significance of weak irreversibility is frequently overlooked. Forward modification and reverse demodification of a protein may well be effectively irreversible under physiological conditions but this does not imply absence of product rebinding. If the concentration of product is appreciable, as will often be the case in a PTM system, then the product must be expected to rebind to the enzyme that produced it. Indeed, it is a requirement of thermodynamics that binding and unbinding events, which draw their energy from the surrounding thermal bath, must be reversible [70]. Any strongly irreversible mechanism, such as the Michaelis-Menten scheme, fails to satisfy this requirement. (We note, however, that it was a perfectly appropriate assumption for Michaelis and Menten [73].) Despite such difficulties being repeatedly pointed out [70, 74, 75], the Michaelis-Menten scheme remains almost universally used for describing enzyme kinetics. We were particularly interested, therefore, in understanding how the different irreversibility assumptions would influence parameter geography and the assessment of robustness.

Regardless of the complexity of the reaction mechanism built from the grammar in Eq 1, the steady-state behaviour of the mechanism can be summarised with just four generalised parameters, two for the forward direction and two for the reverse direction [71, 72]. These parameters can be thought of as versions of the catalytic efficiency and Michaelis-Menten constant for the simple Michaelis-Menten scheme. By using these generalised parameters, in place of the many individual rate constants for each mechanism, it becomes possible to make general statements about steady-state behaviour for systems in which each enzyme follows its own reaction mechanism subscribing to the grammar in Eq 1 [72]. Due attention can thereby be paid to the behaviour of individual enzymes, which are known to exhibit many different kinds of reaction mechanisms [76]. Note, in particular, that our results, although obtained numerically, are valid for an infinite class of models, corresponding to different choices of mechanisms from the grammar in Eq 1.

For a PTM system, the linear framework further allows the exponential combinatorial complexity arising from multiple modification sites to be eliminated at steady-state [17, 30]. The steady-state behaviour of any PTM system can be reduced in this way to the solution of k polynomial equations in k variables, where k is the number of enzymes in the system. The number of modification sites influences the degrees of these equations but not the number of variables. For the case of a two-site PTM system with one forward and one reverse enzyme, this elimination procedure yields two polynomial equations, each of total degree 4 in two variables (Eq 10). These equations have eight non-dimensional parameters, which are defined in terms of the generalised parameters for the two enzymes, and three conserved quantities, which correspond to the total amounts of substrate and enzymes.

The variables in the polynomial equations are the normalised steady-state concentrations of the (free) enzymes, from which the steady-state concentrations of all other components in the PTM system can be determined. Solutions of the polynomial equations correspond exactly to the steady-states of the PTM system. Numerical integration of the underlying differential equations is thereby avoided. The linear framework allows us to rise above the details of enzyme mechanisms and the combinatorial complexity of PTM, at least for describing the steady-state behaviour [70].

The second development on which we rely are advances in numerical algebraic geometry for solving polynomial equations, implemented in the software tools, Bertini, Paramotopy, and alphaCertified [77, 78]. Algebraic geometry deals with the mathematical structures that arise as solutions to polynomial equations and has already been applied to systems biology [48, 49]. Bertini numerically solves polynomial equations by “homotopy continuation:” it starts from a system of polynomial equations whose solutions are known, then continuously deforms these solutions through a homotopy until they coincide, up to arbitrary numerical precision, with the solutions of the system of interest. The solutions along the homotopy are tracked using predictor-corrector methods. Paramotopy extends this procedure to efficiently track homotopies in parameter space, thereby facilitating the parallel solution of a system of parameterised polynomial equations at many different parameter values. Finally, alphaCertified can be used to rigorously determine whether each approximate numerical solution found by Bertini lies near a true solution to the equations, and thus confirm the accuracy of our calculations [78].

In summary, the linear framework enables model reduction of a realistic PTM system to two polynomial equations, while Bertini, Paramotopy and alphaCertified enable efficient and accurate solution of these equations. Their combination allows us to determine the steady-state behaviour of the two-site PTM system at a total of ∼109 parameter points in five different hypercubes in both an 8-dimensional parameter space for weak irreversibility and a 6-dimensional parameter space for strong irreversibility. We thereby map the parameter geography of bistability, from which several interesting and unexpected conclusions emerge.

We find that the bistable volume increases, in an apparently monotonic and sigmoidal manner, as the substrate grows more abundant relative to the enzymes, and there is a threshold substrate level below which bistability is undetectable by random sampling. Strikingly, we find that the bistable region occupies a much smaller proportion of the sampling domain under weak irreversibility than under strong irreversibility, and we demonstrate a tradeoff between bistability and product rebinding that underlies this discrepancy. We also find that, despite the apparently monotonic growth in the bistable volume, the region itself does not grow monotonically: parameter points can move back and forth between monostability and bistability. We formulate these observations as mathematical conjectures and questions that invite further analysis.

Results

Steady-state polynomial equations

We give an overview here of how the steady-state polynomial equations are derived, focusing on the generalised parameters and the process of model reduction, as described in the Introduction. Full details of the calculation are provided in the Materials and Methods.

We consider a protein, S, that is post-translationally modified at two sites by a forward-modifying enzyme, E, and a reverse-demodifying enzyme, F (Fig 2). We assume that modification takes place in a specific site order and that demodification takes place in the reverse order, so that there are only three modification states, or “modforms” [34]. The modforms will be denoted by Si, where i is the number of modified sites. These assumptions reduce the algebraic complexity of the equations, thereby permitting more extensive parametric exploration, but the methods presented here may be applied more generally.

We assume that E and F follow any reasonable distributive reaction mechanism built up from the grammar in Eq 1. Here, “reasonable” means only that the mechanism should be able to convert substrate to product and not yield only a dead-end complex; see [72] for details. A distributive (“hit-and-run”) reaction is one that yields only a single product with a given substrate. Processive (“bind-and-slide”) reactions, in which the enzyme catalyses multiple modifications while remaining bound to the substrate, can also be accommodated within the grammar, but can yield more complex behaviours [79, 80]. Each enzyme has two substrates—S0 and S1 for E, and S1 and S2 for F—and may use a different mechanism from the grammar on each substrate.

The linear framework shows that the steady-state behaviour of each reaction mechanism can be summarised with just four generalised parameters. For the case of E converting S0 to S1, which we will denote by the shorthand S0ES1, there are two reciprocal total generalised Michaelis-Menten constants (rtgMMCs), κ0,1E and κ1,0E, and two total generalised catalytic efficiencies (tgCEs), c0,1E and c1,0E. One parameter of each pair follows the forward direction in which S0 is converted to S1, indicated by the subscript “0, 1”, while the other parameter follows the reverse direction in which S1 is converted to S0, indicated by the subscript “1, 0”.

The rtgMMCs, κ0,1E and κ1,0E, respectively determine the extent to which S0 and S1 bind to E to form the intermediate complexes in the reaction mechanism:

κ0,1E[E][S0]+κ1,0E[E][S1]=S0ES1[Y*]. (2)

Here, and in what follows in the rest of the paper, [X] denotes the steady-state concentration of X, and Y* is a shorthand for those intermediate complexes appearing in the reaction mechanism given in the subscript of the summation. This avoids having to introduce notation for the individual intermediates when these details are not necessary. The rtgMMCs have units of (concentration)−1. The parameter κ1,0E measures the extent to which the product of the reaction, S1, can bind to E, thereby sequestering the enzyme from its substrate, S0, and giving rise to product inhibition [81].

The tgCEs, c0,1E and c1,0E, determine the rate at which E converts S0 to S1 and the rate at which E converts S1 to S0, respectively. The reaction S0ES1 incurs the following rate contributions:

ddt[S0]=+c1,0E[E][S1]-c0,1E[E][S0]+ddt[S1]=+c0,1E[E][S0]-c1,0E[E][S1]+, (3)

where the dots indicate similar rate contributions from the other three reactions (Eq 22). The tgCEs have units of (concentration ⋅ time)−1.

The generalised parameters are given by rational expressions in the rate constants of the corresponding reaction mechanisms. These expressions can be explicitly described once these mechanisms are specified in the grammar of Eq 1 [72]. Different mechanisms yield different expressions for the generalised parameters, but the steady-state behaviour of the mechanism is independent of the details of these expressions.

Modification and demodification in PTM systems are energy-dissipating and regarded as irreversible under physiological conditions [35]. We therefore assume that the enzymes operate irreversibly, so that, using S0ES1 as an example,

κ0,1E>0,c0,1E>0,κ1,0E0,c1,0E=0. (4)

This ensures positive flux of substrate S0 into product S1 (c0,1E>0), which also requires binding of substrate to enzyme (κ0,1E>0), but no flux of product into substrate (c1,0E=0), so that the reaction is irreversible overall. Product rebinding is permitted (κ1,0E0) and strong irreversibility arises when κ1,0E=0. Weak irreversibility corresponds to κ1,0E>0.

The PTM system has four separate reactions, each of which has three nonzero generalised parameters, giving 12 parameters in all. In reducing the system to two polynomial equations in two variables, the number of parameters is further reduced from 12 to 8. We briefly summarise here the three key steps in the model reduction, leaving full details to the Materials and Methods.

The first step arises from the steady-state assumption. Because modification and demodification are assumed to be ordered (Fig 2), the net flux through each modification loop must be zero [82]. Hence, using Eqs 3 and 4 (see also Eq 22),

c0,1E[E][S0]=c1,0F[F][S1]andc1,2E[E][S1]=c2,1F[F][S2].

Hence, [S1] and [S2] can be determined in terms of [S0], [E], and [F], as

[S1]=α([E][F])[S0]and[S2]=β([E][F])[S1], (5)

where

α=c0,1Ec1,0Fandβ=c1,2Ec2,1F (6)

are new non-dimensional parameters. This reduces the number of parameters from 12 to 10.

The second step arises from conservation of the substrate, which leads to the equation,

Stot=[S0]+[S1]+[S2]+S0ES1[Y*]+S1FS0[Y*]+S1ES2[Y*]+S2FS1[Y*], (7)

for some positive constant Stot. Using Eqs 2 and 5, this allows [S0], and therefore [S1] and [S2], to be determined in terms of [E] and [F]. All the state variables have now been eliminated in favour of [E] and [F]. (See Eqs 2325).

The third and final step arises from the conservation of the enzymes, which leads to the two equations,

Etot=[E]+S0ES1[Y*]+S1ES2[Y*]Ftot=[F]+S1FS0[Y*]+S2FS1[Y*], (8)

for positive constants Etot and Ftot. Using Eq 2 and the expressions for [S0], [S1], and [S2] described above, this yields two equations for [E] and [F], which fully determine the steady-state. The remaining state variables can be expressed as rational functions of the steady-state values of [E] and [F]. (See Eqs 26 and 27).

This result is a particular instance of the general theorem that, if a PTM system has k enzymes operating on a single substrate, then, irrespective of the number of sites and the mechanisms of the enzymes, the steady-state of each state variable is a rational function of the k steady-state enzyme concentrations, and these concentrations can be obtained as the solutions to a system of k equations in k unknowns [17]. Here, k = 2.

The three conserved totals, Stot, Etot, and Ftot, are different in character from the parameters of the system because they are determined by the initial conditions. In the biological interpretation, these conserved totals can be modulated by changes in physiological conditions. We therefore seek to understand the parameter geography of the system as these totals are varied.

The elimination process above requires only the composite parameters κ1E=κ1,0E+κ1,2E and κ1F=κ1,0F+κ1,2F (Materials and methods), which summarise the binding of the intermediate modform, S1, to E and F, respectively. This further reduces the number of parameters from 10 to 8.

It is always more convenient to work with non-dimensional parameters, and α and β are already non-dimensional. The other six parameters involve the rtgMMCs and we choose to non-dimensionalise them using the corresponding enzyme totals,

ϵ0=κ0,1EEtot>0,ϵ1=κ1EEtot>0,ϵ2=κ2,1EEtot0,ϕ0=κ0,1FFtot0,ϕ1=κ1FFtot>0,ϕ2=κ2,1FFtot>0. (9)

The ϵ’s summarise the binding characteristics of the reactions catalysed by E; the ϕ’s summarise the binding characteristics of the reactions catalysed by F; and α and β are ratios that compare the catalytic efficiencies of the reactions catalysed by E and F.

The constraints on the parameters in Eq 9 arise from an interesting asymmetry between the reactions in which S1 is a product, S0ES1 and S2FS1, and the reactions in which S1 is a substrate, S1ES2 and S1FS0. Strong irreversibility of the latter reactions influences the parameters in Eq 9: ϵ2 = 0 if, and only if, S1ES2 is strongly irreversible; and ϕ0 = 0 if, and only if, S1FS0 is strongly irreversible. However, strong irreversibility of the former reactions has no such effect: if S0ES1 is strongly irreversible, so that κ1,0E=0, it is still the case that κ1,2E>0 (Eq 4), so that κ1E>0. In other words, even if S1 is unable to sequester E as the product of the reaction S0ES1, it is still able to bind to E by being the substrate of the reaction S1ES2. Similarly, κ1F>0, irrespective of whether or not the reaction S2FS1 is strongly irreversible. It follows that the four parameters, ϵ0, ϵ1, ϕ1 and ϕ2, in Eq 9 are always positive, while the remaining two parameters, ϵ2 and ϕ0, are non-negative.

The three conserved totals can be non-dimensionalised as follows,

σ=StotEtot,λ=StotFtot,ζ=EtotFtot.

Finally, the two state variables can be non-dimensionalised using the corresponding enzyme totals,

u=[E]Etot,v=[F]Ftot.

Non-dimensionalisation can be performed in different ways, which can lead to different insights; the method adopted here works well for this particular analysis. The non-dimensional parameters and non-dimensional totals are all assumed to be positive, except when strong irreversibility is imposed on S1ES2 or S1FS0, in which case ϵ2 = 0 or ϕ0 = 0, respectively. The other parameters and variables are always taken to be positive.

The Materials and Methods show that, once the dust of calculation has settled, we arrive at the equations Φ1(u, v) = 0 and Φ2(u, v) = 0, where

Φ1(u,v)=αv2+ζuv+βζ2u2+(αϵ0-αϵ0σ-α+ϕ1ζ)uv2+(ϵ1ζ-ϵ1ζσ-ζ+βϕ2ζ2)u2v+(βϵ2ζ2-βϵ2ζ2σ-βζ2)u3+αϕ0v3-(αϵ0+ϕ1ζ)u2v2-(ϵ1ζ+βϕ2ζ2)u3v-βϵ2ζ2u4-αϕ0uv3Φ2(u,v)=αv2+ζuv+βζ2u2+(αϕ0-αϕ0λ-α)v3+(ϕ1ζ-ϕ1ζλ-ζ+αϵ0)uv2+(βϕ2ζ2-βϕ2ζ2λ-βζ2+ϵ1ζ)u2v+βϵ2ζ2u3-(αϵ0+ϕ1ζ)uv3-(ϵ1ζ+βϕ2ζ2)u2v2-βϵ2ζ2u3v-αϕ0v4. (10)

Here, Φ1 and Φ2 are each polynomial of total degree 4 in the non-dimensional variables u and v, with eight non-dimensional parameters and three non-dimensional totals. The polynomial equations in Eq 10 will be the object of analysis in the rest of the paper.

General approach to parameter geography

We describe here the general approach we take to exploring the parameter geography of the bistable region, which is then used in all subsequent sections of the paper. To keep the analysis relatively simple, we assume that Etot = Ftot, so that ζ = 1 and σ = λ, and take σ to be the parameter that varies. If σ > 1, so that Stot > Etot = Ftot, then both enzymes approach saturation by the substrate, which is known to promote bistability. We therefore began our analysis by examining parameter geography for 15 values of σ,

σ=1.0,1.5,2.0,2.5,3.0,4.0,5.0,7.0,10,15,20,50,100,200,500. (11)

We order the eight non-dimensional parameters so that, if θR8 is a point in parameter space, then

θ1=α,θ2=β,θ3=ϵ0,θ4=ϵ1,θ5=ϵ2,θ6=ϕ0,θ7=ϕ1,θ8=ϕ2.

Throughout most of the analysis which follows, we consider a finite-volume box in parameter space, H=[0.1,10]8R8, which constrains each parameter to lie in the interval [0.1, 10]. Each non-dimensional parameter is therefore positive. As previously noted, this means that the reactions in which S1 is a substrate, S1ES2 and S1FS0, are weakly irreversible, although the reactions in which S1 is a product, S0ES1 and S2FS1, may be either strongly or weakly irreversible. Weak irreversibility is the physically realistic assumption and we focus on that first.

The range [0.1, 10] sets the nominal value of each non-dimensional parameter to 1. This is appropriate for θ1 = α and θ2 = β because they are ratios of tgCEs for E and F (Eq 6). The other non-dimensional parameters, however, are products of rtgMMCs and conserved totals (Eq 9) and their nominal values are harder to judge. While some experimental data are available, estimated values can vary widely. In the absence of broadly acknowledged values, we chose 1 as the nominal value for all non-dimensional parameters.

Bézout’s Theorem from algebraic geometry tells us that the typical number of solutions of a system of polynomial equations is given by the product of the total degrees of the polynomials [83]. This gives 16 solutions for the two equations in Eq 10, which each have total degree 4. However, “solution” has to be interpreted carefully. Bézout’s Theorem holds over the field of complex numbers. The equation x2 + 1 = 0 has two complex solutions, x = ±i, but no real solutions. Bézout’s Theorem also requires the use of projective space, which allows solutions at infinity, like that of the equations x + y = 1 and x + y = 2, which do not intersect in Euclidean space. Finally, solutions may be repeated, as in the case of the equation (x − 1)2 = 0, in which case they must be counted with the appropriate multiplicity.

In practice, we found that, given ζ = 1, a fixed value of σ, and a randomly chosen point θH, the software tool Bertini yields the following 16 complex solutions for Eq 10: the zero solution, u = v = 0, which is always a solution of Eq 10, has multiplicity 6; seven additional finite solutions; and three solutions which are projectively at infinity. This pattern of solutions is generic: departures from the pattern can only occur on a subset of probability zero (Lebesgue measure zero) in R8 [83]. Accordingly, departures do not occur for randomly selected parameter points in R8. However, genericity can sometimes be lost during numerical homotopy continuation in Bertini. We developed a systematic procedure for addressing this (Materials and methods), which may be of interest in other studies. We also used the software tool alphaCertified to confirm that representative random samples of our numerical solutions were in the vicinity of true solutions, thereby greatly reducing the possibility of numerical artifacts (Materials and methods). Importantly, of the seven finite, nonzero solutions of Eq 10 at each parameter point, we always found either one or three positive real solutions.

Throughout this analysis, we refer to the occurrence of one positive real solution as monostability, and the occurrence of three positive real solutions as bistability (assuming two stable steady-states and one unstable one). We use this language only as a convenience and note the importance of distinguishing between stationarity and stability. For example, recent work on “mixed-mechanism” PTM systems, which incorporate both distributive and processive mechanisms, has shown the existence of a single unstable steady-state with limit-cycle oscillations [79, 80]. Such behaviours are not known to be a feature of PTM systems that employ only distributive mechanisms but checking for them requires testing for stability. This is not straightforward within the algebraic approach taken here. Eq 10 does not provide information about the stability of its solutions, which depends on the transient behaviour of the system near a steady-state. To determine stability, it is necessary to fix the mechanism of each enzyme, as built up from the grammar in Eq 1, and analyse the corresponding system of differential equations. If the steady-state is hyperbolic, asymptotic stability can be determined from the eigenvalues of the Jacobian matrix [11]. However, this would leave open the question of whether the same stability would be found for other choices of enzyme mechanism. We decided, therefore, to set aside the stability question and to focus on what can be deduced algebraically about steady-states from Eq 10. With that in mind, as mentioned above and in the Introduction, we use the terms “monostability” and “bistability” only for convenience, in place of “monostationarity” and “tristationarity,” respectively.

Bistable volume increases sigmoidally with σ

Having explained our general approach in the previous section, we begin the analysis by introducing the parametric region of interest. Recall that H=[0.1,10]8R8 is the box in parameter space in which we will work. Given ζ = 1 and a value of σ, let MσH be the subset of parameter points in H at which the system is bistable,

Mσ={θHthreepositiverealsolutionstoEq.10}.

Our main goal in the paper is to explore how the size and shape of Mσ changes as σ takes the values listed in Eq 11. We start our exploration with the volume of the bistable region.

Let ισ:H{0,1} be the indicator function of Mσ:

ισ(θ)={1ifθMσ0otherwise.

Let VH=28=256 denote the (base ten) logarithmic volume of H, and let Vσ denote the logarithmic volume of Mσ, normalised to that of H. This is conveniently defined by the integral of the indicator function,

Vσ=1VHθHισ(θ)dlogθ. (12)

The integral in Eq 12 is taken with respect to the logarithmic measure on R8. It cannot be evaluated analytically but lends itself to efficient unbiased statistical estimation by Monte Carlo methods, in which parameter points in H are randomly sampled. Specifically, Vσ can be approximated as the proportion of randomly sampled points in H that lie in Mσ, where “random” means with respect to the logarithmic measure. This amounts to independently sampling the logarithm of each parameter, log θi, from the uniform distribution on [−1, 1]. We refer to this as ILR (Independent Logarithmic Random) sampling.

Given a set S of parameter points, randomly chosen in this way, an unbiased statistical estimator for Vσ is given by

V^σ=1#SθSισ(θ), (13)

where #X denotes the number of elements in the finite set X. Confidence intervals for the estimator in Eq 13 may be computed using the central limit theorem (Materials and methods).

To perform this estimation, we ran Bertini and Paramotopy on a large computing cluster (Materials and methods, S1 Appendix). We generated an ILR sample of 4 × 106 points in H to calculate V^σ for all values of σ in Eq 11 greater than 2.5, for each of which we found sufficiently many bistable points to achieve good statistical confidence. For σ = 2.5, we generated an additional ILR sample of 2 × 106 points in H and calculated V^σ using the combined sample of 6 × 106 points; and, for σ = 1.0, 1.5, 2.0, we generated a third ILR sample of 4 × 106 points in H and calculated V^σ using the combined sample of 107 points.

The results of the estimation are shown in Fig 3. We first found that the estimated normalised volume of the bistable region is zero at σ = 1.0. This suggests the existence of a threshold in σ, below which there is no bistability; we explore this possibility further below. The normalised volume then appears to increase smoothly in a sigmoidal (“S-shaped”) manner and saturate at large values of σ. Saturation was not expected on mathematical grounds (Discussion) and we found that it is more apparent under assumptions of weak irreversibility than under strong irreversibility (below). The value of σ at which saturation is established, and the saturating volume itself, are difficult to determine precisely, but our analysis suggests that the saturating volume is close to 1.1% of the volume of H. We were surprised by how small this was. It suggests that, under realistic assumptions of weak irreversibility, bistability is robust but rare. We return to this point in the Discussion.

Fig 3. Volume of the bistable region under weak irreversibility.

Fig 3

The 8-dimensional volume of the bistable region, normalised as a proportion of the volume of the box H, is plotted against the values of σ in Eq 11, for the case when the reactions S1ES2 and S1FS0 are weakly irreversible. The accompanying table lists the number of bistable points found for each value of σ, together with the percentage of the box H occupied by the bistable region. The error bars give 95% confidence intervals for each estimate (Materials and methods). The estimates have been joined by line segments.

A threshold for bistability

The curve in Fig 3 raises several questions. First, it is unclear whether the lack of any observed bistability at σ = 1.0 reflects the existence of a bona fide threshold for bistability, σ*, below which there is only monostability, or rather arises as a consequence of undersampling. To clarify this point, we sought further evidence that M1 is indeed empty. We reasoned that, if bistability does exist for values of σ near 1, it is more likely to occur near those parameter points at which bistability occurs for larger values of σ (see also the section below on “blinking”). To find such points systematically, we turned to importance sampling, as implemented in the VEGAS algorithm, introduced originally for Monte Carlo estimation of multi-dimensional integrals [84].

VEGAS starts from an initial sample of points in a subregion of a multi-dimensional space and adaptively constructs augmented samples that are preferentially drawn from regions of higher sample density. Specifically, the section of each coordinate axis containing the projection of the sample is partitioned into a specified number, M, of bins whose lengths are chosen so that each bin contains the same number of projected sample points, up to some smoothing of the bin frequencies (Materials and methods). There are, therefore, proportionately more smaller bins in regions of higher sample density. Subsequently, a new sample of N points is generated one coordinate at a time, with each coordinate of each point chosen uniformly at random from a bin along the corresponding axis, such that the N values are roughly evenly partitioned among the M bins. This results in a new sample of N points biased towards high-density regions of the initial sample. The initial sample is then augmented with the new sample, and the entire process repeated T times.

We first generated 12 VEGAS samples, one for each of the following values of σ,

σ=2.5,3.0,4.0,5.0,7.0,10,15,20,50,100,200,500. (14)

For each value of σ listed in Eq 14, we initialised the VEGAS algorithm using the set of bistable points gathered through ILR sampling at the given value of σ. For instance, the set of 27508 bistable points found through ILR sampling at σ = 10 (Fig 3) was used as an initial sample for a VEGAS sample at σ = 10. We generated each VEGAS sample over T = 6 iterations, during each of which we sampled N = 106 parameter points and determined the subset of bistable points at the given value of σ using Bertini and Paramotopy. All sampling was performed over logarithmic coordinates, with each coordinate sampled from M = 50 bins covering the interval [−1, 1]. Thus, we obtained 12 VEGAS samples, one for each value of σ listed in Eq 14, each containing 6 × 106 points.

As expected for importance sampling, the proportion of bistable points found within each VEGAS sample at the corresponding value of σ was much larger than V^σ. The number of bistable points in each VEGAS sample ranged between ∼2.45 × 106 (at σ = 2.5) and ∼3.35 × 106 (at σ = 500), out of a possible 6 × 106. In marked contrast, running Bertini and Paramotopy on all 12 VEGAS samples at σ = 1.0 found zero bistable points, out of 7.2 × 107 points. This further suggests that M1 is indeed empty.

As a further test for this hypothesis, we generated one additional VEGAS sample for smaller values of σ. Here, we reasoned that the few bistable points found through ILR sampling for σ = 1.5 and σ = 2.0 (69 and 2476, respectively) rendered these sets inadequate for initialising the VEGAS algorithm. Thus, we instead opted to use the set of ∼2.45 × 106 bistable points found in the VEGAS sample for σ = 2.5 as an initial sample, and generated a VEGAS sample of M = 6 × 106 parameter points in a single (T = 1) iteration, again sampling each parameter from M = 50 bins covering the interval [−1, 1] in logarithmic coordinates. This VEGAS sample contained numerous bistable points at σ = 1.5 and σ = 2.0 (∼1.79 × 105 and ∼1.48 × 106, respectively), but did not contain any bistable points at σ = 1.0.

In sum, the above results suggest that M1 is indeed empty, and that a threshold for bistability, 1 ≤ σ* < 1.5, within H does exist. We believe this is an instance of a more general mathematical result (Discussion). We note that this conclusion is limited by the choice of box, H, we used to bound the parameter values. Indeed, it is entirely possible that there exist parameter points outside H that exhibit bistability at σ = 1, or even at arbitrarily small values of σ. We address this possibility in a subsequent section.

Bistable regions are connected

As explained in the Introduction, the positive volume of the bistable region confirms the simplest requirement for robustness but does not tell us about the shape of the region (Fig 1B). We therefore sought evidence for whether the bistable region Mσ is topologically connected, at least for σ ≥ 1.5 for which we know it is non-empty (Fig 3). The challenge here is that we have to assess connectedness from finite samples of parameter points, SH, which provide only discrete approximations, M^σ=SMσ, of the bistable region.

To address this, we constructed the connectivity graph, GΔ(M^σ), associated with the sample. We adapted this idea from previous studies of robotic motion planning, in which the graph is used to determine connected, obstacle-free regions of a robot’s configuration space [85, 86]. The vertices of the connectivity graph are the bistable points, M^σ, in the sample, and there is an undirected edge between two vertices if they are within Euclidean distance Δ of each other in logarithmic coordinates. Here, Δ > 0 is an adjustable threshold. Such a graph can be partitioned into connected components. Any two vertices within the same connected component can be joined by a path of contiguous edges, while no such path exists between vertices in different connected components.

Consider a sufficiently large finite sample of some connected region in a multi-dimensional space. If Δ is larger than the maximum distance between two points, then every point is connected to every other point and the graph consists of a single connected component. If Δ is smaller than the minimum distance between two points, then no point is connected to any other and there are as many connected components as there are vertices. In between these extremes, however, we would expect a different behaviour, with a single very large connected component, comprising most of the points in the interior of the region, together with many much smaller components, typically comprising points close to the boundary of the region that fail to be within Δ of the largest component.

In contrast, consider a region that is disconnected and consists of several connected components. We would then expect that for intermediate values of Δ, the corresponding connectivity graph would break up into several large connected components, along with many much smaller boundary components. Unless some of the connected components of the region were much smaller than others, we would expect each such component to manifest as a large connected component in the graph. We can thereby estimate the number of the former by counting instances of the latter.

It is not straightforward to assess the statistical accuracy of such estimates. To do so requires specifying a prior expectation for the region’s connectivity, and we have little to guide us in knowing what to expect of parameter geography. It is easy to imagine complicated regions, such as those with one very large part and many very small ones, that would confuse a connectivity graph analysis. Also, sampling is oblivious to regions of lower dimension, as noted in the Introduction, and we build the graphs from points within the finite box H, both of which issues could compromise the conclusions drawn from a connectivity graph analysis. With these caveats in mind, we believe the structure of the connectivity graph for intermediate values of Δ provides helpful preliminary evidence for the connectivity of the bistable region. There are also further tests that we can undertake, as explained below.

The connectivity graph is most informative when there are many sample points, so we used the bistable sets obtained from the VEGAS samples described above to build connectivity graphs, GΔ(M^σ), for each of the values

σ=1.5,2.0,2.5,3.0,4.0,5.0,7.0,10,15,20,50,100,200,500. (15)

The graphs for σ = 1.5 and σ = 2.0 were built from the single-iteration VEGAS sample gathered for small σ; for each remaining value of σ, the graph was built from the corresponding six-iteration VEGAS sample.

Constructing the graph in its entirety is computationally intractable if the sample size is large because the number of edges scales quadratically with the number of vertices. However, it is sufficient to construct a spanning forest, an acyclic subgraph that includes every vertex of the full graph. This is because the connected components of the spanning forest are individual spanning trees that are in bijective correspondence with the connected components of the full graph. A full description of this algorithm is given in the Materials and Methods.

Given a set of bistable points gathered from an ILR sample, SH, it is straightforward to estimate a value of Δ for which, given a point in S, there is a large probability, say 0.99, that there is at least one other point in S within distance Δ. This seems a reasonable choice for an intermediate value of Δ because it allows most vertices in the graph to have an incident edge. It is shown in the Materials and Methods how to calculate such a Δ for ILR samples of a given size. The VEGAS samples were not constructed by ILR sampling but each augmented VEGAS sample SA is generated from an initial set of bistable points obtained from an ILR sample, SI. We therefore chose an effective sample size, N′, that scaled proportionally with the increase in the number of bistable points,

N=(#bistablepointsinSA#bistablepointsinSI)#SI,

and used N′ in place of #S as the sample size in calculating Δ. We found using this approach that Δ = 0.15 is a suitable choice for all values of σ given in Eq 15 (Materials and methods).

The results of the connectivity graph analysis are shown in Table 1. For each value of σ in Eq 15, the largest connected component of the graph contains the vast majority of vertices. For instance, for σ = 3.0, there are 2705883 bistable points in the sample, which decompose into 122672 connected components. Of these, the largest component contains 2556447 vertices, or ∼94% of the total, while the second largest component contains a mere 21 vertices. While the proportion of vertices in the largest component decreases with σ—at σ = 500, only ∼74% of the vertices lie in the largest component—the size of the second largest connected component remains tiny, relative to that of the largest component, as σ increases. Furthermore, the number of components rapidly increases with σ, and a roughly constant percentage of them, between ∼83% and ∼87%, are singletons, indicating that the connectivity graph decomposes into one huge component and increasing numbers of very tiny components. This is exactly what would be expected if Mσ were connected.

Table 1. Connectivity of the bistable region.

Details of the initial connectivity graphs for the indicated values of σ in column 1. Column 2 gives the number of bistable points in the set used to construct the graph; column 3 gives the number of connected components in the resulting connectivity graph; columns 4 and 5 give the sizes of the largest and second-largest components of the graph, respectively; column 6 gives the number of singleton components; and column 7 gives the size of the largest component as a percentage of the number of bistable points in the sample. The numbers show that the connectivity graphs consist of one large component and many very small components, suggesting that the bistable region is connected for all examined values of σ. Details of the connectivity graphs obtained through refinement with additional sampling (Results, Materials and methods) are given in S1 Appendix.

σ #M^σ # components #C1 #C2 # singletons #C1#/#M^σ
1.5 179130 4519 173773 12 3923 97%
2.0 1481421 29484 1446437 16 25788 98%
2.5 2456156 76896 2362892 22 66180 96%
3.0 2705883 122672 2556447 21 105260 94%
4.0 2932035 209165 2672497 36 177833 91%
5.0 3050788 278684 2702553 28 235930 89%
7.0 3146912 373650 2672992 35 314273 85%
10 3236189 452663 2659036 33 379762 82%
15 3271113 527437 2592576 38 440444 79%
20 3303594 563899 2576185 55 470983 78%
50 3366306 631931 2548744 54 527579 76%
100 3382039 658003 2529828 42 548689 75%
200 3401104 669140 2533572 62 558110 74%
500 3396987 677710 2519055 38 565368 74%

If this is in fact the case, we should be able to connect the vertices in the smaller connected components to the largest component by paths of bistable points. Accordingly, we undertook a further test of the initial connectivity graphs described above (Table 1). For each graph, we considered each of the non-largest components and chose a vertex, θ, uniformly at random in that component. We then selected the K (approximate) nearest neighbours, ν(1), …, ν(K), to θ from the largest component, and sampled along the straight line segments between θ and ν(j), for j = 1, …, K, by sub-dividing each line segment into sub-intervals of length 0.98Δ and collecting the endpoints of these sub-intervals. We used Bertini and Paramotopy to determine which of these sampled points were in the bistable region. We then recomputed the connectivity graph with these newfound bistable points added to the original bistable set, and iterated this procedure until the proportion of vertices in the largest component remained constant between consecutive iterations.

The computationally intensive step here is determining which point from the largest component is nearest to each randomly selected vertex, θ. For this, we used the Approximate Nearest Neighbour (ANN) algorithm [87], which builds a hierarchical data structure to efficiently select K points, ν(1), …, ν(K), which are approximately nearest to θ in the sense that

d(θ,ν(j))(1+ϵ)d(θ,μ(j))forj=1,,K, (16)

where d is the distance metric being used (in our case, Euclidean distance over logarithmic coordinates) and μ(j) is the true jth nearest neighbour to θ, for j = 1, …, K. The approximation factor, ϵ, can be chosen to be as small as required and the algorithm is optimal in an appropriate sense [87]. Experience with the open-source C++ ANN library suggests that this is an efficient procedure for samples of order 105 in spaces of dimension up to 20, and that the probability of choosing a non-nearest neighbour is relatively small in practice [88].

For our analysis, we used an approximation factor of ϵ = 0.001, increasing K after each iteration (Materials and methods). Within two iterations of this procedure, we were able to obtain, for each of the values of σ in Eq 15, a set of bistable points whose connectivity graph consisted of a single connected component. This provides yet further evidence that the bistable region is connected. The full results of this analysis are given in S1 Appendix, Table C.

Bistable regions are not convex but have high visibility ratios

As described in the Introduction, convexity tells us about the shape of a region and rules out many features that compromise robustness, like the waists and holes in Fig 1B. It is not difficult to show, by randomly sampling pairs of points within the bistable regions, that none of these regions are convex. But how far do they depart from convexity? The visibility ratio of a region offers a measure of this. Informally, it is the probability that the line joining two points drawn at random from the region lies entirely in the region. Formally, we define the visibility ratio of Mσ as

vis(Mσ)=1Vσ2(θ,μ)Mσ×Mσν(θ,μ)d(logθ,logμ), (17)

where ν:Mσ×Mσ{0,1} is the indicator function of the set of pairs of bistable points for which the straight line segment (over logarithmic coordinates) between the two points lies entirely within the bistable region,

ν(θ,μ)={1ifισ(10tlogθ+(1-t)logμ)=1for0t10otherwise.

With the normalisation in Eq 17, it is easy to see that, if Mσ is convex, then vis(Mσ)=1.

Since computing ν(θ, μ) for even a single choice of θ and μ requires evaluating ισ on infinitely many parameter points, we cannot directly estimate vis(Mσ). Therefore, we sought to approximate ν(θ, μ) by sub-dividing the straight line segment between θ and μ into K + 1 sub-intervals of equal length (over logarithmic coordinates), and checking bistability only on the K endpoints of the intervals. In other words, we computed the following indicator function,

νK(θ,μ)={1ifισ(10tlogθ+(1-t)logμ)=1forallt=jd(θ,μ)K+1,j=1,,K0otherwise.

This quantity converges to ν(θ, μ) as K → ∞. Integrating over Mσ×Mσ and normalising, we obtain the K-fold visibility ratio,

vis(Mσ,K)=1Vσ2(θ,μ)Mσ×MσνK(θ,μ)d(logθ,logμ), (18)

which converges to vis(Mσ) as K → ∞.

Since νK(θ, μ) = 0 implies that ν(θ, μ) = 0, vis(Mσ,K)vis(Mσ) for all K > 0. The K-fold visibility ratio will therefore tend to overestimate the true visibility ratio. With this caveat in mind, it offers a computationally feasible estimate for the extent of departure from convexity.

We estimated the K-fold visibility ratio by randomly choosing M pairs of bistable points

(θ(1),μ(1)),,(θ(M),μ(M))M^σ×M^σ

without replacement, where M^σ is a set of bistable points gathered with ILR sampling from H, and computing the estimator

vis^(Mσ,K)=1Mj=1MνK(θ(j),μ(j)).

We undertook this estimation by choosing K = 10 and sampling M = 20000 pairs of points, without replacement, from the bistable points gathered through ILR sampling for each of the values of σ given in Eq 11, except for σ = 1.0 and σ = 1.5, at which too few bistable points were found through ILR sampling (Fig 3; see also the previous section on the bistable volume). Confidence intervals for these estimates were computed using standard results in finite population sampling statistics (Materials and methods).

The results of this analysis are shown in Fig 4A. We found that although the bistable region is not convex for any value of σ, it is close to convex for all considered values of σ, with 10-fold visibility ratios exceeding 0.95. This suggests that, for a large majority of pairs of bistable parameter points, the straight line connecting them in parameter space consists also of bistable points.

Fig 4. Visibility ratio of the bistable region under weak irreversibility.

Fig 4

A: The 10-fold visibility ratio of the bistable region under weak irreversibility is plotted as a function of σ. B: Three families of 2-dimensional regions, with length parameters a and R, as shown, along with plots of their 10-fold visibility ratios as functions of a/R, with R = 1 and a varying. 10-fold visibility ratios were numerically computed using M = 20000 random pairs of points, as in the main text. The dotted red lines in the plots indicate the values of a/R at which the visibility ratio is 0.9.

The visibility ratio is a global measure of the region’s departure from convexity but offers little information regarding what local geometric features may influence the loss of convexity. Different shapes can have the same visibility ratio and how the visibility ratio depends on shape is difficult to describe in general. Low-dimensional examples suggest that interior holes reduce the visibility ratio to a greater extent than boundary indentations (Fig 4B). The fact that the visibility ratio appears to be largely independent of σ suggests two possibilities. Either the bistable region grows radially outward in parameter space, in such a way that the size of any local geometric feature which compromises convexity scales with that of the entire bistable region; or these local geometric features appear and disappear in a manner that approximately preserves the visibility ratio as σ increases. The latter possibility may appear less plausible but the next section suggests that the growth of the bistable region with σ may indeed exhibit phenomena of this kind.

Non-monotonic growth of the bistable region

The volume of the bistable region, Vσ, appears to increase monotonically as σ increases (Fig 3). This monotonicity would follow naturally if, once a parameter point θH exhibits bistability at some value σ = a, it continues to do so for all σ > a. The bistable region would then increase in extent with σ, so that if a < b, then MaMb. We therefore tested this behaviour and were surprised to find, in contrast to our initial expectation, that parameter points do not behave this way. Instead, they can transition back and forth between monostability and bistability.

To illustrate the possibilities, Fig 5 shows four parameter points which exhibit different behaviours as σ takes the values 3.0, 4.0, 5.0, 7.0, and 10. Each plot shows the “pseudo-nullclines,” Φ1(u, v) = 0 and Φ2(u, v) = 0 from Eq 10, for which the intersections of the two curves give the steady-states of the PTM system [30]. (The zero solution, u = v = 0, is isolated from these curves in the real uv-plane and is omitted for simplicity.) The first parameter point,

θ(1)=(7.239380624,0.1047069812,0.1045129736,1.114909161,2.017259480,0.1057288524,2.544613812,0.1021803160),

is monostable at all five values of σ. The second parameter point,

θ(2)=(4.684802750,0.5200773771,1.589343505,0.1804775460,0.1270218587,0.1129848293,0.1816907000,5.708851611),

becomes bistable between σ = 4.0 and σ = 5.0, and remains so at σ = 5.0, σ = 7.0, and σ = 10. The third parameter point,

θ(3)=(6.865206499,0.2105440222,0.2524064999,0.1025041753,0.1178107732,0.1017610135,0.4495453146,5.796119139),

becomes bistable between σ = 4.0 and σ = 5.0, then reverts to monostability between σ = 5.0 and σ = 7.0. Finally, the fourth parameter point,

θ(4)=(4.885859541,0.4985163876,0.7212641184,2.119877596,0.1655484166,0.1246565927,0.7180476375,8.912153306),

becomes bistable between σ = 3.0 and σ = 4.0, reverts to monostability between σ = 4.0 and σ = 5.0, then reverts back to bistability between σ = 7.0 and σ = 10.

Fig 5. Parameter points move between monostability and bistability.

Fig 5

Pseudo-nullcline plots for the steady-state equations, Φ1(u, v) = 0 (blue) and Φ2(u, v) = 0 (green), in Eq 10 are shown for the four parameter values given in the main text, at five values of σ, showing how steady-states (red dots) appear and disappear at the intersections of the pseudo-nullclines. The zero solution, u = v = 0 is isolated from the pseudo-nullclines in the real uv-plane and is not shown for simplicity.

We examined the common ILR sample of 4 × 106 points in H at which solutions to Eq 10 were obtained for each value of σ in Eq 11. We determined for each value of σ the subset of bistable parameter points which become monostable at some larger value of σ (Table 2). We refer to this phenomenon as “blinking.” We were able to alphaCertify the solutions to Eq 10 at each of these parameter points for all values of σ, thereby confirming that the designations of monostability or bistability were mathematically correct. Blinking is therefore not a numerical artifact (Materials and methods).

Table 2. Blinking parameter points.

For each value of σ in column 1, the number of bistable points in the 4 × 106 samples obtained by independent logarithmic random (ILR) sampling is shown in column 2 (see also Fig 3). Of these bistable points, the number of blinking points, or those which become monostable at some larger value of σ, is shown in column 3 (titled BP). The subsequent columns indicate, for each value of σ, the number of blinking points that lie outside the largest component in the corresponding initial connectivity graph (BPSC, column 4); the number of blinking points that never enter the largest component of the initial connectivity graph for larger values of σ (BPNL, column 5); and the number of blinking points that are monostable at σ = 500 (“asymptotically monostable”, BPAM, column 6). The numbers show that blinking points are found primarily on the boundary of the bistable region and become monostable for large values of σ, as discussed further in the text.

σ #M^σ # BP # BPSC # BPNL # BPAM
1.5 23 0 0 0 0
2.0 992 8 8 3 2
2.5 3451 62 62 40 36
3.0 6213 94 94 67 70
4.0 11592 230 230 180 205
5.0 15942 308 308 260 290
7.0 22061 381 381 313 370
10 27508 396 396 348 392
15 32242 337 337 288 334
20 34832 299 299 281 298
50 39737 169 169 157 169
100 41490 82 82 76 82
200 42381 34 34 34 34

If blinking points occur within the interior of the bistable region, it suggests that the region can develop interior holes. This would be surprising in view of the high visibility ratio found previously, as such features tend to lower the visibility ratio (Fig 4B). We therefore examined the blinking points in relation to the connectivity graph GΔ(Mσ) of the bistable region (Table 1, S1 Appendix) and summarised the findings in Table 2. The number of blinking parameter points increases from 8 at σ = 2 to a maximum of 396 at σ = 10 but then decreases to 34 at σ = 200 (Table 2, column 3), despite the monotonic increase in the volume of the bistable region (Fig 3). Every blinking point was found to lie outside the largest connected component of the corresponding connectivity graph for every value of σ (Table 2, column 4). For any given value of σ, a sizeable majority of blinking points never enter the largest connected component of the graph at larger values of σ (Table 2, column 5). For instance, among the 308 blinking points at σ = 5, 260 (84%) never enter the largest component. These findings suggest that bistable points that become monostable at larger values of σ lie on the boundary of the bistable region, rather than in its interior, and tend to remain near the boundary as they exit and enter the bistable region as σ increases. Finally, most blinking points become monostable at σ = 500 (Table 2, column 6). For instance, of the 308 blinking points at σ = 5, 290 (94%) are monostable at σ = 500. This may indicate that many blinking points become asymptotically monostable as σ → ∞. We return to these interesting findings in the Discussion.

The strongly irreversible case

Up to now, we have assumed that the enzymes in the PTM system are irreversible and that the reactions in which S1 is a substrate, S1ES2 and S1FS0, are also weakly irreversible, with nonzero affinity for product rebinding. (Recall that the reactions in which S1 is a product, S0ES1 and S2FS1, may be either strongly or weakly irreversible without affecting the signs of the non-dimensional parameters (Eq 9) and the results deduced so far.) As explained in the Introduction, weak irreversibility is the realistic assumption for enzymes in a PTM system. However, in view of the nearly universal reliance in the literature on the Michaelis-Menten reaction scheme, we wanted to understand the impact of strong irreversibility on parameter geography.

Accordingly, we assume now that the reactions in which S1 is a substrate are strongly irreversible, so that ϵ2 = ϕ0 = 0, and take the parameter point ηR6 to be defined by

η1=α,η2=β,η3=ϵ0,η4=ϵ1,η5=ϕ1,η6=ϕ2.

We consider solutions within the finite-volume box H*=[0.1,10]6. The steady-state polynomial equations in Eq 10 still have degree 4 and Bézout’s Theorem tells us that there are 16 complex solutions in projective space. Given ζ = 1, a fixed value of σ and a randomly chosen parameter point in H*, the generic solutions are as follows: the zero solution, u = v = 0, which has multiplicity 6; five additional finite solutions; and five solutions which are projectively at infinity. Of the five nonzero, finite solutions, we always find either one or three positive real solutions, which we refer to as monostability and bistability, respectively, as explained previously. Let Mσ*H* denote the subset of bistable parameter points at a given value of σ.

The volume, Vσ*, of Mσ* and a statistical estimator of this volume, V^σ*, can be defined in a similar way to Eqs 12 and 13. Here, we used a single sample of 4 × 106 parameter points, generated by ILR sampling from H*, to compute V^σ*, as we found sufficiently many bistable points at all values of σ to ensure high statistical confidence. The results are shown in Fig 6. As in the weakly irreversible case (Fig 3), the volume appears to increase monotonically with σ. The threshold for bistability appears, as before, to be at or slightly above σ = 1. There are, however, two important differences between the weakly and strongly irreversible cases. First, saturation is less apparent under strong irreversibility, even with additional volumes evaluated at σ = 1000, 2000, and 5000. Second, under strong irreversibility the bistable volume increases much more rapidly with increasing σ: at σ = 500, the bistable region occupies 20% of H* under strong irreversibility in contrast to only 1.1% of H under weak irreversibility.

Fig 6. Volume of the bistable region under strong irreversibility.

Fig 6

The 6-dimensional volume of the bistable region, normalised as a proportion of the volume of the box H*, is plotted against the values of σ in Eq 11, along with the additional values σ = 1000, 2000, 5000, for the case when the reactions S1ES2 and S1FS0 are strongly irreversible. The accompanying table lists the number of bistable points found for each value of σ, together with the percentage of the box H* occupied by the bistable region. The error bars give 95% confidence intervals for each estimate (Materials and methods). The estimates have been joined by line segments.

Volumes scale in different ways with the dimension of the ambient space. The volume of a ball of radius r goes to zero as n → ∞. In contrast, the volume of a hypercube of length r scales as rn. Because we have used the volume of a hypercube with sides [0.1, 10] to normalise the volume of the bistable region, we expect this to compensate for the change in the ambient space from six dimensions to eight, even if we do not know how to accommodate the shape of the bistable region in the normalisation. With this caveat, the decrease in the normalised volume from strong to weak irreversibility still seems dramatic. Bistability appears to be a far more robust property under the unrealistic assumption of strong irreversibility, such as with Michaelis-Menten enzyme mechanisms, than it does under the realistic assumption of weak irreversibility.

How does such a large relative volume of bistable parameter points in H* become such a small relative volume in H? To address this question, we examined more closely the values of ϵ2 and ϕ0 at which bistability occurs in the weakly irreversible case. Fig 7A shows the parameter values, θ5 = ϵ2 and θ6 = ϕ0, for the bistable points used to calculate V^σ in Fig 3. We immediately notice a striking tradeoff between the two parameters: bistability appears to be confined to the region in which

ϵ2ϕ0<K, (19)

for some constant K whose optimal value increases with σ. There appears, in other words, to be a tradeoff between bistability and product rebinding: the more of the latter, as given by ϵ2 ϕ0 being large, the less of the former.

Fig 7. Tradeoff in weak irreversibility for bistability.

Fig 7

A: The bistable regions under weak irreversibility, whose volumes are shown in Fig 3, are shown here projected onto the parameters θ5 = ϵ2 and θ6 = ϕ0 for the indicated values of σ (top-right corner). In the bottom-right corner of each plot is the approximate value of the bound, K, on ϵ2 ϕ0, with the hyperbola ϵ2 ϕ0 = K is shown in red. B: A 3-dimensional schematic of the tradeoff between ϵ2 and ϕ0. The other six parameters are depicted as spanning a single dimension (the vertical axis). The 6-dimensional hypercube H* is therefore a line segment along this vertical axis; the bistable region is shown occupying around 20% of this line segment (with respect to the logarithmic measure). The tradeoff between ϵ2 and ϕ0 yields a region that occupies a small volume in the 8-dimensional hypercube H.

The constraint given by Eq 19 explains the dramatic decrease in volume from the strongly irreversible case in H* to the weakly irreversible case in H. The bistable region in the latter case is confined to the “thin” subregion of H in which ϵ2 and ϕ0 are not simultaneously large, as illustrated in Fig 7B.

The bistable region outside H

We have focused so far on the bistable region within boxes with sides [0.1, 10]. To assess how far these observations remain valid in larger regions of parameter space, we estimate here the volume of the bistable region in the boxes, Hp=[0.1p,10p]8 for weak irreversibility, and Hp*=[0.1p,10p]6, for strong irreversibility, with p running through the values p = 2, 3, 4, 5. As previously, we define Mσ,p, Vσ,p, V^σ,p to be, respectively, the bistable region in Hp, its normalised volume and its estimated volume by Monte Carlo sampling and use an asterisk for the corresponding quantities Mσ,p*, Vσ,p*, V^σ,p* for the bistable region in Hp*. In normalising the volumes, we note that the logarithmic volumes of the respective boxes are given by VHp=(2p)8 and VHp*=(2p)6.

We generated samples of 106 points by ILR sampling in each of the boxes H2,,H5 and H2*,,H5* and ran Bertini and Paramotopy to find the bistable proportion of each sample at the 15 values of σ listed in Eq 11, as well as six additional values of σ between 1.0 and 1.5 which are discussed below (Eq 20). Despite the smaller sample sizes, we were able to find sufficiently many bistable points in each sample to obtain high-confidence volume estimates.

The results for weak irreversibility are shown in Fig 8. We found that, for each value of p, the volume of the bistable region restricted to Hp increases monotonically with σ, as we found previously for p = 1. The monotonic increase appears to be sigmoidal for each value of p, though saturation at large values of σ becomes less evident as p increases. We also found that the bistable proportion of Hp at a fixed value of σ > 1 increases monotonically with p, so that Vσ,a < Vσ,b whenever a < b. This suggests that, as we expand the domain of sampling in parameter space, the rate at which we lose bistable points due to lower sampling density in Ha is exceeded by the rate at which we find bistable points in the complement, Hb-Ha. Moreover, we observed that, for all values of σ greater than a threshold near σ = 2.0, the difference between volumes for successive values of p, Vσ,p+1Vσ,p, decreases with p, suggesting the possibility that Vσ,p increases as p → ∞ towards an asymptotic functional dependence on σ, which is also monotonic and sigmoidal.

Fig 8. Volume of the bistable region outside H under weak irreversibility.

Fig 8

The 8-dimensional volume, V^σ,p, of the bistable region under weak irreversibility, normalised as a proportion of the volume of the box, Hp=[0.1p,10p]8, is plotted for the 21 values of σ in Eqs 11 and 20, for p = 2, 3, 4, 5. The curve for p = 1 is taken from Fig 3. The error bars give 95% confidence intervals for each estimate (Materials and methods). The estimates have been joined by line segments.

Strikingly, we also found that none of the points sampled in any of the boxes exhibits bistability at σ = 1.0, further supporting our prediction that a threshold for bistability exists. To find a more precise estimate of this threshold, we ran Bertini and Paramotopy on ILR samples acquired for H2,,H5 at each of the following six additional values of σ,

σ=1.0078125,1.015625,1.03125,1.0625,1.125,1.25. (20)

We found bistable points at each of these values. For instance, we found 35 bistable points at σ = 1.0078125 among the sample in H4, and as many as 372 bistable points at σ = 1.0078125 among the sample in H5. This implies that, if a threshold for bistability exists, then it must be less than 1.0078125. Our failure to find any bistable points at σ = 1.0 leads us to conjecture that the threshold is in fact exactly 1; we pose this as a conjecture for further study.

The results for strong irreversibility are shown in Fig 9. As in the weakly irreversible case, we found that, for each value of p, the volume of the bistable region in Hp* increases monotonically with σ. We found no bistable points at σ = 1 for any p, again consistent with the threshold conjecture.

Fig 9. Volume of the bistable region outside H* under strong irreversibility.

Fig 9

The 6-dimensional volume, V^σ,p*, of the bistable region under strong irreversibility, normalised as a proportion of the volume of the box, Hp*=[0.1p,10p]6, is plotted for the 21 values of σ in Eqs 11 and 20, for p = 2, 3, 4, 5. The curve for p = 1 is taken from Fig 6. The error bars give 95% confidence intervals for each estimate (Materials and methods). The estimates have been joined by line segments. The inset shows the curves in the vicinity of σ = 10, at which point all five curves are close to intersecting. The 95% confidence interval for V^10,1* does not overlap with that for V^10,p* for p = 2, 3, 4, 5, indicating that the difference between the former and the latter is statistically significant at the 0.05 level.

However, we also observed three interesting differences between the weakly and strongly irreversible cases. First, although the asymptotic behaviour of the bistable volume as σ → ∞ is difficult to resolve in the weakly irreversible case, the absence of saturation becomes conspicuous in the strongly irreversible case. The curves in Fig 9 suggest that under strong irreversibility the bistable volume increases more linearly with σ when σ is large. Second, we observed that, as for p = 1, the bistable volume increases more rapidly with σ under strong irreversibility than under weak irreversibility. However, this disparity becomes less prominent as p increases: a roughly 20-fold difference between the bistable volumes at p = 1 (V500,1 ≈ 1.1% and V500,1*20%) decreases to a roughly 2.5-fold difference between the corresponding bistable volumes at p = 5 (V500,5 ≈ 4.3% and V500,5*11%).

Third, in contrast to the monotonically increasing dependence of Vσ,p on p for fixed σ, we find that Vσ,p* exhibits two qualitatively distinct patterns of dependence on p. For smaller values of σ, Vσ,p* monotonically increases with p; for larger values of σ, Vσ,p* monotonically decreases with p. The transition from the first to the second behaviour occurs close to σ = 10, at which value all five volume curves are close to intersecting. Among the five bistable volume estimates at σ = 10, V^10,1* appears to deviate the most from the others, and this difference is statistically significant at the 0.05 level (note the non-overlapping 95% confidence intervals at σ = 10 in Fig 9). From this, it is difficult to say with confidence whether there is an interval of values for σ, near to or including σ = 10, in which Vσ,p* is non-monotonic in p. With that said, our analysis suggests that, for values of σ outside such an interval, Vσ,p* depends monotonically on p but is increasing for σ smaller than the interval and decreasing for σ greater than the interval. Here too, it seems that Vσ,p* may converge as p → ∞ to an asymptotic functional dependence on σ but in a seemingly different manner to the weakly irreversible case.

In summary, some of the observations made previously for the bistable region in H1=H generalise to larger sampling domains in parameter space: the volume of the bistable region increases monotonically with σ from a threshold, possibly toward a saturating volume in the weakly irreversible case. There is further compelling evidence for a bistability threshold which we conjecture to be at σ = 1. However, the size of the sampling domain affects bistable volumes in markedly different ways under weak and strong irreversibility.

Discussion

Parametric robustness of a model—the maintenance of a property in the face of parametric change—offers a mathematical window onto the broader concept of biological robustness. We have approached this problem here through parameter geography, which offers an integrated, global view of the parametric region in which a property holds. We have been able to analyse the parameter geography of PTM systems with two sites but arbitrary enzymatic complexity and to estimate the size and shape of regions in 8-dimensional parameter space. We briefly summarise our most interesting findings and comment on their broader significance.

The approach we have taken has relied on randomly sampling parameter points and is therefore limited to features which have positive measure in the space being sampled. We have also conflated monostability and bistability with monostationarity and tristationarity, respectively, but merely as a convenience of language. With these caveats in mind, we found compelling evidence for a threshold in total substrate below which bistability does not exist. We found this both for weak irreversibility in all boxes Hp=[0.1p,10p]8 and for strong irreversibility in all boxes Hp*=[0.1p,10p]6, with p = 1, 2, 3, 4, 5. In the light of other calculations not discussed here, we believe such a threshold may hold in greater generality and therefore put forward the following

Conjecture: Given a PTM system of the type studied in [17], consisting of a single substrate and multiple enzymes, Ei, each acting through any mechanism specified in the grammar of Eq 1, there exists a threshold level, T({Ei,tot}) > 0, depending on the total amounts of each enzyme, such that the system shows no multistationarity if Stot < T({Ei,tot}).

For the two-site, distributive system studied here, in which we restricted attention to Etot = Ftot, we found bistable points in H4 at Stot/Etot = σ = 1.0078125, which suggests that the threshold lies at σ = 1, so that T(Etot, Ftot) = Etot when Etot = Ftot. The conjecture above is consistent with recent work which showed that, when all enzymes follow the Michaelis-Menten reaction scheme, if Stot < Etot or Stot < Ftot, parameter values can be found at which there are multiple steady-states but such multistationarity does not exist if Stot > Etot and Stot > Ftot [58, 89, 90].

Once past the threshold value, the bistable region acquires positive measure in the appropriate parameter space and thereby a volume. When σ is large, the volume appears to saturate, although this is more pronounced for weak irreversibility and smaller boxes (Figs 8 and 9). However, there is a striking difference between the largest volumes reached under different enzymatic assumptions. The volume of the bistable region is substantially smaller under weak irreversibility than under strong irreversibility. We were able to identify the reason for this as a constraint on enzyme parameters (Fig 7). This leads us to make another

Conjecture: For the two-site PTM system considered here, under the assumption of weak irreversibility, there exists a constant K(σ), which increases with σ, such that there is no bistability when ϵ2 ϕ0 > K.

The parameters ϵ2 and ϕ0 depend on the ability of S2 to rebind to the forward enzyme E, ϵ2=κ2,1EEtot, and the ability of S0 to rebind to the reverse enzyme F, ϕ0=κ0,1FFtot, respectively (Eq 9). The above conjecture suggests there is a tradeoff between total rebinding and bistability: bistability is abrogated if both E and F exhibit high rebinding.

To our knowledge, there is no indication of such a relationship in the current literature. This is not surprising in view of the widespread fixation with the strongly irreversible Michaelis-Menten mechanism. As pointed out in the Introduction, one of our concerns has been to understand the impact of this unrealistic assumption on parametric robustness. Our results show that it can be very misleading. Under strong irreversibility, bistability appears robust, with the bistable region occupying 20% of [0.1,10]6=H1* at σ = 500. Under weak irreversibility, bistability appears far rarer, occupying only 1.1% of [0.1,10]8=H1 at σ = 500. The discrepancy becomes less marked for larger boxes, as noted above, but remains significant. A further difference between weak and strong irreversibility arises in the dependence of the bistable volume on the exponent, p, in the interval [0.1p, 10p], which determines the size of the box: under weak irreversibility, volume increases monotonically with p for each σ (Fig 8); under strong irreversibility, volume increases monotonically for σ < ∼10 and decreases monotonically for σ > ∼10 (Fig 9). The reasons for this marked difference remain unclear but, here too, we see that unrealistic assumptions have significant consequences.

The implication of these findings is that a multisite PTM mechanism of the kind studied here is unlikely to act as a genuine biological memory or switch. It may require additional features to improve its robustness, such as scaffolding [91], to suggest only one possibility. Moreover, future studies will need to be more careful in their biochemical assumptions before making claims as to robustness.

The linear framework allows such care to be exercised by enabling realistic, complex enzyme mechanisms to be analysed at steady-state. This has not only brought out the differences between weak and strong irreversibility but also led to a potential explanation for the reduction in robustness with strong irreversibility, which lies in the tradeoff between rebinding and bistability expressed in the conjecture above. We hope these demonstrations will encourage less reliance on unrealistic assumptions and more focus on actual enzyme mechanisms.

Between low and high σ, the volume of the bistable region appears to increase monotonically: if σ1 and σ2 are taken from the list in Eq 11 and σ1 < σ2, then V^σ1<V^σ2. This holds for both weak (Figs 3 and 8) and strong (Figs 6 and 9) irreversibility. It is natural to expect that such monotonicity arises because the bistable region itself increases with σ, so that Mσ1Mσ2. However, this is not the case: parameter points can move back and forth between monostability and bistability (Fig 5). The pattern of blinking, in which a bistable point becomes monostable, exhibits several interesting features (Table 2), which suggest that blinking may be restricted to the boundary of the bistable region.

These findings are perplexing and suggest surprising complexity in how the steady-state manifold is positioned relative to the hypersurfaces defined by conservation of total substrate and total enzymes. One way to focus on the problem is to ask whether or not the volume of the bistable region, Vσ, is a monotonic function of σ. If it is monotonic, this makes it difficult to account for blinking points, whose disappearance from Mσ must then be compensated by the appearance of other bistable points. If it is not monotonic, this makes it difficult to account for the monotonicity of the estimated volume, V^σ, as shown in Fig 3. It is, of course, possible that departure from monotonicity of Vσ does occur but at a scale that is too small to be observed by random sampling. If so, it becomes an interesting problem to determine the scale of these volume fluctuations. Such challenges lead us to pose the following

Question: How does Vσ depend on σ? If the relationship is monotonic, how is blinking compensated? If it is not monotonic, on what scale does it depart from monotonicity so that the estimated volume, V^σ, depends monotonically on σ?

The volume of the bistable region indicates its size but not its shape. This becomes a subtle problem in high dimensions. Even in two dimensions, it is evident that local variation in shape can greatly compromise robustness (Figs 1B and 4B). It is interesting, therefore, that the bistable region appears to be well behaved for the simplest shape measures. It is both connected (Table 1) and nearly convex, with a visibility ratio over 95% for all values of σ (Fig 4A). As far as these limited measures of shape are concerned, the bistable region exhibits good robustness. To know more, it would be necessary to use methods like persistent homology, which give access to higher-dimensional topological invariants [92, 93]. While this lies beyond the scope of the present paper, the datasets arising from our analysis are available for others to explore this question (S1 File; see also S1 Appendix). It would be of considerable interest to have an estimate of the topological type of the bistable region, which could stimulate further mathematical conjectures like those above.

Exploring the conjectures and question above presents an interesting challenge. Approaches based on chemical reaction network theory [56, 5862] may be useful for clarifying whether our conjectures hold for particular choices of enzyme mechanism but they may need to be adapted to rise above the details of such mechanisms. Discriminant locus approaches have been used to identify parameter regions for multistationarity but have so far been limited to parameter spaces of low dimension [67].

The analysis undertaken here, while based on numerical calculations and statistical analysis of data, has led to precise mathematical conjectures and questions about parameter geography. This has been possible by bringing together two advances: the linear framework, which enables steady-state reduction of PTM systems to polynomial equations, and numerical algebraic geometry, in the form of Bertini and related software tools, which allow efficient sampling of high-dimensional parameter spaces. This suggests that a new kind of exploratory mathematics is becoming feasible to study those biological systems which can be treated in this way, through timescale separation and polynomial specification. It may now be possible to accommodate more of the molecular complexity that is found in biology, as we have done here for enzyme mechanisms, but to also elicit mathematical insights which rise above that complexity, as described in the conjectures above. Perhaps a theory of parameter geography may eventually crystallise from studies of this kind.

Materials and methods

Derivation of Eq 10

We use the linear framework to show that the steady-state of the two-site PTM system described in Fig 2 is given by the solutions of the two polynomial equations given in Eq 10.

Each of the four modifications in the system is of the form, SiXSj, where the indices i, j ∈ {0, 1, 2} enumerate the modforms and X ∈ {E, F} is an enzyme (Fig 2). We assume that the aggregate mechanism SiXSj is described by a graph of elementary reactions in the grammar in Eq 1, with a finite number of enzyme-substrate intermediate complexes. We further assume that the sets of enzyme-substrate complexes involved in distinct enzyme mechanisms are disjoint. Then, by Proposition 1 in [17], the steady-state concentration of any enzyme-substrate complex, Y, is given by

[Y]=(μi,X[Si]+μj,X[Sj])[X],

where μi,X and μj,X are reciprocal generalised Michaelis-Menten constants (rgMMCs). The linear framework guarantees that these parameters are positive for each Y, unless SiXSj is strongly irreversible, in which case there is no flux from Sj to any of the enzyme-substrate complexes, and so μj,X=0 for every Y involved in the mechanism for SiXSj.

We may aggregate the rgMMCs by summing the concentrations of the enzyme-substrate complexes over each modification, to yield Eq 2,

S0ES1[Y*]=(κ0,1E[S0]+κ1,0E[S1])[E]S1ES2[Y*]=(κ1,2E[S1]+κ2,1E[S2])[E]S2FS1[Y*]=(κ2,1F[S2]+κ1,2F[S1])[F]S1FS0[Y*]=(κ1,0F[S1]+κ0,1F[S0])[F],

where

κ0,1E=S0ES1μ0,*Eκ1,0E=S0ES1μ1,*Eκ1,2E=S1ES2μ1,*Eκ2,1E=S1ES2μ2,*Eκ0,1F=S1FS0μ0,*Fκ1,0F=S1FS0μ1,*Fκ1,2F=S2FS1μ1,*Fκ2,1F=S2FS1μ2,*F

are the reciprocal total generalised Michaelis-Menten constants (rtgMMCs). When the four mechanisms are each strongly irreversible, we have

κ1,0E=κ2,1E=κ0,1F=κ1,2F=0,

since the rgMMCs in the corresponding sums are identically zero.

We may further aggregate the rtgMMCs as follows,

S0ES1[Y*]+S1ES2[Y*]=(κ0E[S0]+κ1E[S1]+κ2E[S2])[E]S2FS1[Y*]+S1FS0[Y*]=(κ0F[S0]+κ1F[S1]+κ2F[S2])[F], (21)

where

κ0X=κ0,1X,κ1X=κ1,0X+κ1,2X,κ2X=κ2,1XforX=E,F.

When the four mechanisms are strongly irreversible, we have κ2E=κ0F=0.

Note that Eq 21 and the enzyme conservation laws, Eq 8, imply that [E] = [F] = 0 if, and only if, Etot = Ftot = 0, which we assume is not the case. Therefore, [E] and [F] must be positive. (This is relevant to the steps that follow in which we divide by [E] or [F].)

Proposition 1 in [17] implies that under mass action kinetics, the dynamics of the substrate mod-forms may be written as follows,

d[S0]dt=(c1,0E[E]+c1,0F[F])[S1]-(c0,1E[E]+c0,1F[F])[S0]d[S1]dt=(c0,1E[E]+c0,1F[F])[S0]+(c2,1E[E]+c2,1F[F])[S2]-(c1,0E[E]+c1,0F[F]+c1,2E[E]+c1,2F[F])[S1]d[S2]dt=(c1,2E[E]+c1,2F[F])[S1]-(c2,1E[E]+c2,1F[F])[S2], (22)

where ci,jX are the tgCEs. Assuming the forward-modification tgCEs are positive, c0,1E,c1,2E,c1,0F,c2,1F>0, and the reverse-demodification tgCEs are zero, c1,0E=c2,1E=c0,1F=c1,2F=0, we have,

c0,1E[E][S0]=c1,0F[F][S1]andc1,2E[E][S1]=c2,1F[F][S2]

at steady-state. This yields Eq 5,

[S0]=α([F][E])[S1]and[S2]=β([E][F])[S1],

where α and β are as defined in Eq 6,

α=c1,0Fc0,1Eandβ=c1,2Ec2,1F.

Under our assumptions, α, β, [E], and [F] are all positive. Hence, Eq 5 implies that, if any one of the modform concentrations is zero, then all of them are. This implies in turn, via Eq 21 and the substrate conservation law Eq 7, that Stot is zero. Therefore, the modform and enzyme concentrations are positive at steady-state, as long as we assume that the following are positive: the conserved quantities Stot, Etot, Ftot; any one of the rtgMMCs for E, κ0E,κ1E,κ2E; any one of the rtgMMCs for F, κ0F,κ1F,κ2F; and all of the forward-modification tgCEs c0,1E,c1,2E,c2,1F,c1,0F. The strongly and weakly irreversible cases both satisfy these assumptions.

We can combine Eqs 5 and 21 to write

[S0]+[S1]+[S2]=(α[F][E]+1+β[E][F])[S1]S0ES1[Y*]+S1ES2[Y*]=(κ0Eα[F][E]+κ1E+κ2Eβ[E][F])[S1][E]S2FS1[Y*]+S1FS0[Y*]=(κ0Fα[F][E]+κ1F+κ2Fβ[E][F])[S1][F]. (23)

Define the rational functions

ψ1([E],[F])=α[F][E]+1+β[E][F]ψ2([E],[F])=Etot(κ0Eα[F][E]+κ1E+κ2Eβ[E][F])ψ3([E],[F])=Ftot(κ0Fα[F][E]+κ1F+κ2Fβ[E][F]),

and substitute them into Eq 23:

[S0]+[S1]+[S2]=[S1]ψ1S0ES1[Y*]+S1ES2[Y*]=[S1][E]Etotψ2S2FS1[Y*]+S1FS0[Y*]=[S1][F]Ftotψ3. (24)

Substituting Eq 24 into the substrate conservation law, Eq 7, gives

Stot=(ψ1+[E]Etotψ2+[F]Ftotψ3)[S1]; (25)

and substituting Eq 24 into the enzyme conservation laws, Eq 8, gives

Etot=(1+[S1]Etotψ2)[E]andFtot=(1+[S1]Ftotψ3)[F]. (26)

These substitutions eliminate all of the state variables except for [S1], [E], and [F]. We then eliminate [S1] by substituting Eq 25 into Eq 26, to get

Etot=(1+Stot/Etotψ1+([E]/Etot)ψ2+([F]/Ftot)ψ3ψ2)[E]Ftot=(1+Stot/Ftotψ1+([E]/Etot)ψ2+([F]/Ftot)ψ3ψ3)[F]. (27)

Finally, we substitute the non-dimensional quantities,

u=[E]Etot,v=[F]Ftot,ϵ0=κ0EEtot,ϵ1=κ1EEtot,ϵ2=κ2EEtot,σ=StotEtot,λ=StotFtot,ϕ0=κ0FFtot,ϕ1=κ1FFtot,ϕ2=κ2FFtot,

into ψ1, ψ2, and ψ3 to get

ψ1(u,v)=αζvu+1+βζuvψ2(u,v)=ϵ0αζvu+ϵ1+ϵ2βζuvψ3(u,v)=ϕ0αζvu+ϕ1+ϕ2βζuv,

and into Eq 27 to get

1u=1+σψ1+uψ2+vψ3ψ21v=1+λψ1+uψ2+vψ3ψ3. (28)

Cross-multiplying Eq 28 gives

(1-u)(ψ1+uψ2+vψ3)-σuψ2=0(1-v)(ψ1+uψ2+vψ3)-λvψ3=0. (29)

Finally, we note that the two left-hand-side expressions in Eq 29 are not polynomials in u and v, since ψ1, ψ2, and ψ3 include terms with both u/v and v/u. So we multiply both sides by ζuv to get,

Φ1(u,v)=ζuv((1-u)(ψ1+uψ2+vψ3)-σuψ2)=0Φ2(u,v)=ζuv((1-v)(ψ1+uψ2+vψ3)-λvψ3)=0. (30)

One can check that expanding Eq 30 and substituting in the expressions for ψ1, ψ2, and ψ3 gives Eq 10.

Solving Eq 10 with Bertini and Paramotopy

We briefly describe how homotopy continuation works and discuss the practical issues we encountered in using Bertini and Paramotopy. For more complete details, see [77]. A complete description of our workflow, along with details on the supplemental code and datasets, is given in S1 Appendix.

Homotopy continuation with Bertini

Consider a system of n polynomial equations in n variables,

f(x)=f(x1,,xn)=[f1(x1,,xn)f2(x1,,xn)fn(x1,,xn)]=0.

We are interested in solutions of such systems which consist of finitely many isolated points. Homotopy continuation uses two steps. First, another system of n polynomial equations, the start system, g(x) = 0, is selected, whose set of solutions can be easily computed. Second, a continuous map H:n×[0,1]n is defined so as to give a homotopy between f and g,

H(x1,,xn,1)=g(x1,,xn)andH(x1,,xn,0)=f(x1,,xn).

The idea is that, as t is changed from 1 to 0, H(x1, …, xn, t) = 0 traces a continuous deformation, or path, from each of the known solutions of g(x1, &, xn) = 0 to the desired solutions of f(x1, …, xn) = 0.

Bertini’s default choice of homotopy H is the total-degree homotopy, given by,

H(x1,,xn,t)=(1-t)f(x1,,xn)+γtg(x1,,xn),

where γ ≠ 0 is a random complex number and g is a total-degree start system, which is one that has the maximal number of finite isolated solutions given by Bézout’s Theorem. For example, we may set gi, for i = 1, …, n, to the polynomial [77]

gi(x1,,xn)=xidi-1,

where di is the degree of fi. We found this choice of start system and homotopy to be adequate for our analysis.

The homotopy H defines a complex-valued path from each solution of g(x1, …, xn) = 0 to a solution of f(x1, …, xn) = 0. Bertini tracks these paths using numerical predictor-corrector methods. More sophisticated algorithms, called endgames, are used to track the paths with enhanced precision once t is close to zero. The use of projective coordinates, which introduce additional points at infinity, allows the reliable tracking of divergent paths to arbitrary precision.

A solution x* is non-singular if it has multiplicity 1; otherwise, the solution is singular. Bertini determines the endpoint x* of a homotopy path to be singular if the condition number of the Jacobian matrix of f at x* is large. A non-singular solution, if reasonably accurate, can be sharpened to arbitrarily many digits by performing additional post-endgame iterations of Newton’s method. This provides a rapid alternative to re-tracking the homotopy with more stringent Bertini settings to obtain highly accurate non-singular solutions. It also guarantees a desired level of accuracy irrespective of the path-tracking behaviour. However, this method is not useful for obtaining highly accurate singular solutions, near which Newton’s method can converge more slowly, or not at all [77]. Bertini incorporates a large number of customisable settings which control its path-tracking behaviour; for a comprehensive discussion see [77].

Parameter homotopy continuation with Paramotopy

The polynomial systems of interest to us contain parameters and can be written as f(x;θ) = 0, where x = (x1, …, xn) and θ=(θ1,θm)m. We typically have a set of parameter points, θ(1),,θ(N)m, at which solutions of f(x;θ(i)) = 0 are required. Paramatopy is a software package built on Bertini which allows such parameterised polynomial systems to be solved efficiently in parallel through homotopies in parameter space. It uses the following two-step process.

Step 1. Randomly sample a parameter point, θ* ∈ ℂm, and find all the isolated solutions of f(x; θ*) = 0 using homotopy continuation in Bertini.

Step 2. For each j = 1, …, N, solve the system f(x; θ(j)) = 0 by tracking in parallel the solutions of the parameter homotopy,

f(x;p(t))=0,wherep(t)=(1-t)θ(j)+tθ*.

Paramotopy exploits the fact that a parameterised polynomial system has the same number of non-singular solutions at any parameter point outside a set of measure zero [77]. Therefore, if the system has k non-singular solutions, and dk is the maximal number of finite isolated solutions given by Bézout’s Theorem, then Paramotopy would track d paths during Step 1 and kN paths during Step 2. This can offer significant computational savings in comparison with running Bertini N times, once for each parameter point θ(j), which would require tracking a total of dN paths.

As noted in the main text, we have d = 16 for Eq 10. In the weakly irreversible case, we found k = 7 “proper” (meaning nonzero, non-singular and non-projectively infinite) complex solutions at each parameter point. In the strongly irreversible case, we found k = 5 proper solutions.

Despite having a wide range of path-tracking methods, Bertini may still fail to track a path to t = 0 if numerical difficulties arise. Paramotopy collects all such instances of early path tracking termination, called path failures, and can re-track these failed homotopies with a new Step 1 parameter point. This can be repeated as many times as necessary until all path failures are resolved.

Use of cluster computing

We ran Bertini and Paramotopy on the Orchestra and O2 high-performance computing clusters at Harvard Medical School. The vast majority of the Paramotopy runs were performed on Orchestra, with the Step 2 path-tracking performed in parallel over 10–20 cores per batch of 2.5 × 105 parameter points; the remaining computations were performed on O2, which succeeded Orchestra in March 2018. O2 currently consists of more than 11000 cores across 300 Intel Xeon x86 multi-core processors of various specifications.

The Paramotopy runs are generally time-consuming: solving the system in Eq 10 on a batch of 2.5 × 105 parameter points in a single Paramotopy run usually requires a runtime of several hours. The exact runtime depends on the values of the parameters and conserved quantities, the number of cores used, and details of the underlying numerics, such as the choice of Step 1 parameter point θ* (see above) and the Bertini settings (see below). See S1 Appendix for details.

Classifying solutions and numerical settings

Bertini and Paramotopy incorporates various tunable settings for distinguishing between zero and nonzero endpoints (ImagThreshold) and between finite and projectively infinite endpoints (EndpointFiniteThreshold). However, enforcing a fixed threshold can lead to misleading conclusions, as the accuracy of a solution obtained through parameter homotopy continuation depends on many factors. We therefore used a different strategy for distinguishing between zero, nonzero finite, and projectively infinite values for each solution obtained via Paramotopy.

A reported numerical solution coordinate, x* = a* + ib* ∈ ℂ, was categorised as one of the following, depending on three positive thresholds, Tzmin, Tzmax, and T, chosen such that 0 < TzminTzmax ≪ 1 ≪ T:

x*is{small_if|a*|,|b*|<Tzmaxambiguous_ifTzmin<|b*|<Tzmaxinfinite_if|a*|>Tor|b*|>Tnonzeroreal_ifTzmax<|a*|<Tand|b*|<Tzminnonzeronon-real_otherwise. (31)

A reported solution, (u*, v*) ∈ ℂ2, was determined to be proper if (1) (u*, v*) was reported by Paramotopy as non-singular, and (2) both u and v* were categorised as nonzero real or nonzero non-real.

We also determined a reported solution coordinate, x* = a* + ib*, to be insufficiently precise if Paramotopy specified either a* or b* with fewer than Td digits, for some positive integer Td. By using Bertini’s sharpening module, most non-singular solutions were specified with the desired precision but insufficiently precise solutions could occasionally arise in one of two ways. On the one hand, a path failure in Step 2 that was not subsequently resolved with sufficiently stringent Bertini settings occasionally manifested as an insufficiently precise solution. On the other hand, an ill-conditioned Jacobian matrix (whose condition number exceeds an internal threshold) was occasionally encountered during sharpening for a small minority of solutions—many of which were at parameter points sampled from H4* or H5*—at which point Bertini terminated sharpening prematurely. In view of these numerical issues, we set Td to slightly (usually five) fewer digits than the value of SharpenDigits (Table 3 and below) for the corresponding Paramotopy run.

Table 3. First-pass Bertini settings.

The principal settings used for the computational analysis are listed here. Every first-pass Paramotopy run was performed using these Bertini settings; all other settings were set to their default values [77]. The column title PFR refers to “path failure resolution” and this column gives Bertini setting values used for the first iteration of this process; the Bertini settings were modulated appropriately for subsequent iterations. A complete list of all Bertini settings used for every Paramotopy run and re-run, as well as all path failure resolution iterations, is given in S1 Dataset, with further details in S1 Appendix.

Setting Step 1 Step 2 PFR
CondNumThreshold 108 1014 1014
FinalTol 10−11 10−11 10−8
MaxNorm 105 105 108
MaxNumberSteps 10000 10000 20000
SecurityLevel 0 1 1
SharpenDigits 0 25 0
SharpenOnly 0 1 0
TrackTolBeforeEG 10−5 10−5 10−6
TrackTolDuringEG 10−6 10−6 10−7
UserHomotopy 0 2 2

Using the classification in Eq 31, we were able to find the generic number of proper solutions to Eq 10 with sufficient precision—seven under weak irreversibility and five under strong irreversibility—for the majority of parameter points on the first attempt, using the first-pass Bertini settings in Table 3 and the threshold values Tzmin = 10−25, Tzmax = 10−10, T = 108, and Td = 20. However, each such first-pass Paramotopy run resulted in between ∼100 and ∼105 parameter points with at least one questionable solution (i.e., a solution reported by Paramotopy as singular, or a solution with at least one small, ambiguous, infinite, or insufficiently precise coordinate). We therefore re-ran Paramotopy on all such points with a new choice of Step 1 point and more stringent Bertini settings. Upon obtaining a new solution set for each of these parameter points, we repeated this process—collecting all parameter points with at least one questionable solution and re-running Paramotopy on these points with increasingly stringent Bertini settings—until we identified the generic number of proper solutions for each point in the entire sample. We collected together a final set of proper solutions for each point in the sample, with which we performed all downstream analyses. The list of all Paramotopy runs and re-runs, and the Bertini settings used for each, are given in S1 Dataset; see S1 Appendix for further details.

We found by manual exploration that setting Tzmin = 10−25, Tzmax = 10−10, and T = 108 was appropriate for most Paramotopy runs. For a small subset of Paramotopy re-runs, we found that alternative values for the thresholds were more appropriate, based on seeing repeated convergence to questionable values despite stringent Bertini settings. For instance, we found that re-running Paramotopy on certain parameter points with more stringent Bertini settings yielded solutions with imaginary parts with absolute value less than 10−10 but greater than 10−11, so that Tzmax = 10−11 is a more appropriate choice. Likewise, certain parameter points also exhibited seemingly finite solutions with absolute value greater than 108, necessitating the use of larger values for T.

As mentioned above, we set Td to (usually) five fewer digits than the value of SharpenDigits for the corresponding run or re-run. For instance, each first-pass Paramotopy run was performed with SharpenDigits set to 25 (Table 3), and we set Td = 20 when classifying the solutions reported from these runs. Subsequent re-runs were performed with incrementally increasing values for SharpenDigits (S1 Dataset), with Td also increasing proportionately.

The list of values used for Tzmin, Tzmax, T, and Td for each Paramotopy run and re-run is given in S1 Dataset, with further details in S1 Appendix.

Certifying solutions

The solutions reported by Bertini and Paramotopy are approximate. The software package alphaCertified, which implements Smale’s α-theory [77, 78, 94], can determine if an approximate non-singular solution x* to the polynomial system f(x) = 0 would converge under repeated application of Newton’s method to an exact solution ξ. If so, we have a guarantee that the numerically obtained approximate solution is in the vicinity of an actual solution and we say that x* is a certified approximate solution to f(x) = 0 with associated solution ξ. This method cannot be applied to singular solutions, which do not behave well with respect to Newton’s method. Accordingly, we sought to certify only the seven or five proper solutions identified for each parameter point at each value of σ.

To exploit this capability, we randomly chose 5% of each collection of proper solution sets associated with each Paramotopy run, yielding a total of ∼2.5 × 107 proper solution sets across 24 values of σ (S1 Appendix), and used alphaCertified to certify each of these solutions. alphaCertified comes with a built-in module for sharpening non-singular solutions by additional Newton iterations, so we implemented a certify-and-sharpen procedure, in which any uncertified solution would undergo further sharpening before being passed to alphaCertified for another certification attempt.

With this procedure, we were able to certify almost every proper solution among the chosen ∼2.5 × 107 proper solution sets within five iterations of certification and four iterations of sharpening (totalling eight iterations of Newton’s method). A tiny minority of 106 solution sets, all from parameter points sampled in H5*, exhibited at least one uncertifiable proper solution, even after four iterations of sharpening. Among these 106 solution sets, 99 exhibited only one uncertifiable solution. Manual inspection of the alphaCertified output suggests that the uncertifiable solutions are in fact close to singular: applying successive iterations of Newton’s method on these solutions, (u*, v*), fails to show convergence in one or both of |u*| or |v*| (S1 Appendix). These apparently singular solutions evaded the tests imposed in Bertini and Paramotopy. However, they are extremely rare and were found only for the bistable region in H5*. Accordingly, we do not believe they affect any of the quantitative conclusions we have drawn, even for H5*.

In addition, we followed the same procedure to certify every proper solution, for every value of σ in Eq 11, associated with each parameter point found to “blink” at some value of σ (Results, Table 2). Each of these proper solutions was successfully certified within five iterations of certification and four iterations of sharpening.

A full description of our certification procedure, along with a discussion of uncertifiable solutions, is given in S1 Appendix.

Confidence estimates for bistable volumes

Given an ILR sample SH of N parameter points, it follows from Eq 13 that the unbiased volume estimator is given by

V^σ=1NθSNισ(θ).

In other words, V^σ is the sample mean of a sequence of independent and identically distributed Bernoulli random variables, ισ(θ), with success probability Vσ. Therefore, the statistical properties of V^σ, in the limit of large N, are determined by the central limit theorem, as

limNPr(-ϵ<N(V^σ-Vσ)δ<ϵ)=Φ(ϵ)-Φ(-ϵ)=2Φ(ϵ)-1

for any ϵ > 0, where Φ is the standard normal cumulative distribution function, and δ=Vσ(1-Vσ) is the standard deviation of ισ over H. Since the value of δ is unknown, we introduce the sample standard deviation of ισ over S,

δ^=1N-1θS(ισ(θ)-V^σ)2,

which converges to δ as N → ∞. Therefore, we can write

limNPr(-ϵ<N(V^σ-Vσ)δ^<ϵ)=2Φ(ϵ)-1

for any ϵ > 0. In particular, for any α > 0, we have

limNPr(|V^σ-Vσ|<Φ-1(1-α2)δ^N)=1-α.

Hence, the 100(1 − α)% confidence interval for V^σ is given by

V^σ±Φ-1(1-α2)δ^N,

provided that N is large.

Confidence estimates for visibility ratios

In contrast to the volume estimator, V^σ, the estimator for the K-fold visibility ratio, vis^(Mσ,K), was computed by generating a random sample without replacement of M = 20000 pairs of bistable points,

(θ(1),μ(1)),,(θ(M),μ(M)),

from a finite population M^σ×M^σ, where M^σ is a set of bistable points gathered with ILR sampling from H, and evaluating

vis^(Mσ,K)=1Mj=1MνK(θ(j),μ(j)).

Let N=#(M^σ×M^σ). Provided that M and NM are sufficiently large, an approximate 100(1 − α)% confidence interval for vis^(Mσ,K) is given by [95]

vis^(Mσ,K)±Φ-1(1-α2)δ^M1-MN,

where δ^ is the sample standard deviation of νK over S,

δ^=1M-1j=1M(νK(θ(j),μ(j))-vis^(Mσ,K))2,

and 1-M/N is a correction factor accounting for the finiteness of the population M^σ×M^σ. It is important to note that M^σ×M^σ is fixed throughout this calculation. Running this calculation repeatedly with many distinct samples of size M from M^σ×M^σ, roughly 100(1 − α)% of the resulting confidence intervals would contain the value,

1N(θ,μ)M^σ×M^σνK(θ,μ).

As such, this confidence interval does not directly measure the accuracy of the estimate, vis^(Mσ,K), relative to the K-fold visibility ratio, vis(Mσ,K); such a measurement would require, at a minimum, generating many distinct samples of bistable points, M^σ, with which to compute vis^(Mσ,K).

We also note that MN for all values of σ, so that the correction factor becomes negligible. Thus, we simply compute the 100(1 − α)% confidence interval as

vis^(Mσ,K)±Φ-1(1-α2)δ^M.

The VEGAS sampling algorithm

We outline here the VEGAS sampling algorithm described in the Results. Let T be the total number of iterations, N the size of each sample, M the number of intervals, and K the smoothing factor. We begin with an initial bistable sample, M^σ(0)[0.1,10]8, gathered through some other sampling process (such as ILR sampling).

  1. For each j = 1, …, 8, partition the interval [−1, 1] into M bins I1(j),,IM(j) of equal length.

  2. For each t = 1, …, T, do the following:

    1. For each j = 1, …, 8, do the following:

      1. Compute the histogram of values in the projection M^σ,j(t-1)={θj:θM^σ(t-1)}, according to the partition of [−1, 1] given by I1(j),,IM(j).

      2. Re-normalise the bin frequencies, f1(j),,fM(j), as follows:
        fi(j)Kfi(j)#M^σ(t-1)+1,
        where ⌈x⌉ denotes the ceiling of x, i.e., the least integer greater than x.
      3. Re-size the bins, {Ii(j)=[ai(j),bi(j)]:i=1,,M}, to have length proportional to fi(j).

      4. Sample a number, r, uniformly from [0, 1], then partition the N values to be sampled among the M bins, so that the bin from which to generate the jth coordinate of the nth point, θj(n), in S(t) is given by
        I(n,r)=Ii(j),
        where i is the least integer such that i/M ≥ (r + n)/N.
      5. For each n = 1, …, N, do the following:

        1. Sample a value, s, from the uniform distribution on I(n, r).

        2. Set θj(n)=10s.

    2. Determine the bistable subset, MσS(t), and update the total bistable sample as M^σ(t)M^σ(t-1)(MσS(t)).

Step 2a, ii is a smoothing which we introduced for the following reason. If the bistable region, Mσ, has very small volume relative to the bounding box then a sample M^σ can have many empty bins in the histograms over each projection. This can lead to heavy bias in future iterations of the sampling. With this in mind, we incorporated the smoothing factor, K, to re-normalise the bin frequencies and incremented each bin frequency by 1, so that each bin has a nonzero frequency. This slightly shifts the sampling probability along each parameter axis towards regions of lower sample density; this effect grows stronger as we accumulate more points in Mσ. We fixed M = 50 and K = 1000 throughout our analysis, across all VEGAS samples generated for all values of σ. The values of T and N are described in the Results. A full description of our VEGAS implementation is given in S1 Appendix.

Building the connectivity graph

We outline here the algorithm for building the connectivity graph, GΔ(M^σ).

Choosing Δ

We first consider the task of choosing Δ for a bistable sample M^σ coming from an ILR sample SH. We reasoned that a suitable threshold for determining whether two points in M^σ are directly connected should be that, for any point θS, the probability that there is a second point μS such that d(θ, μ) ≤ Δ (where d is Euclidean distance over logarithmic coordinates) is large, say, 0.99. That is, we want to choose Δ such that

Pr(μSsuchthatμθandd(θ,μ)Δ)0.99.

This is clearly 1 minus the probability that there exists no such point μ. The probability that no point in S (other than θ) is within distance Δ of θ, assuming that S is an ILR sample, is given by

(1-V8(Δ)VH)#S-1,

where V8(Δ) is the volume of an 8-dimensional ball of radius Δ,

V8(Δ)=π4Δ824.

So we want to choose Δ such that

1-(1-π4Δ824VH)#S-10.99.

Rearranging, we find that a suitable value for Δ is given by

Δ(24VHπ4(1-0.011/(#S-1)))1/8.

Now suppose that M^σ was built from an augmented VEGAS sample SA obtained from an initial (ILR) sample SI. We can estimated an effective sample size, N′, for SA as follows,

N=(#bistablepointsin#SA#bistablepointsin#SI)#SI.

For instance, the VEGAS sample for σ = 10 was initialised with an initial set of 27508 bistable points from an ILR sample of 4 × 106 points (Fig 3), and, after T = 6 iterations, consisted of 6 × 106 points, of which 3208681 were bistable (Table 1). This gives an effective sample size of N′ ≈ 4.67 × 108, which gives a value of Δ ≈ 0.17. Performing this calculation for each value of σ given in Eq 15 and the corresponding bistable sample, we found that suitable values of Δ range between ∼0.10 and ∼0.18. Accordingly, we used Δ = 0.15 for our connectivity analysis.

Constructing GΔ(M^σ)

Given a set M^σ of bistable points and a constant Δ > 0, chosen as above, the graph GΔ(M^σ) is built as a spanning forest which contains each point in M^σ as a vertex, and connects two points with an edge if, and only if, the Euclidean distance between the two points is less than Δ. Such a spanning forest is not unique.

Suppose M^σ={θ(j):j=1,,N}. The algorithm below returns an N × N adjacency matrix A and a vector L of N component labels. The matrix satisfies Aij = 1 if points θ(i) and θ(j) are connected by an edge and Aij = 0 otherwise. The vector L assigns each point to the label of its connected component.

  1. Initialise unvisitedM^σ, label ← 1, Li ← 0 for all i = 1, …, N, and Aij ← 0 for all i, j = 1, …, N. Initialise an empty queue, pointqueue.

  2. While unvisited is not empty, do the following:

    1. Choose a point θ(i)unvisited.

    2. Remove θ(i) from unvisited.

    3. Push θ(i) onto the end of pointqueue.

    4. While pointqueue is not empty, do the following:

      1. Pop the first point θ(j) from the start of pointqueue.

      2. Set Ljlabel.

      3. For each θ(k)unvisited such that d(θ(j), θ(k))< Δ, do the following:

        1. Remove θ(k) from unvisited.

        2. Set Ajk ← 1.

        3. Push θ(k) onto the end of pointqueue.

    5. Update labellabel + 1.

A complete description of our implementation of this algorithm is given in S1 Appendix.

Refining the connectivity graph

The refinement procedure was described in the main text. We used an approximation factor of ϵ = 0.001 (Eq 16) throughout the analysis and sought to speed up this refinement process by increasing K between iterations. Specifically, for each value of σ listed in Eq 15, we updated K as follows,

K{j+1if#C2>103(j+1)if3<#C21010(j+1)otherwise,

where j = 0, 1, 2, … is the iteration number, and C2 is the second-largest component in the graph. This procedure yielded a single-component graph in two iterations for each value of σ listed in Eq 15. Full details are given in S1 Appendix.

Supporting information

S1 Appendix. Supplemental methods.

This document provides a comprehensive description of our implementations of the methods described in the paper and guidelines for navigating the supplemental code and datasets. Supplemental figures: (A) Workflow for computing and parsing solutions with Paramotopy. Supplemental tables: (A) Seeds used to initialise the MATLAB pseudo-random number generator for sampling; (B) Seeds used to initialise the MATLAB pseudo-random number generator for VEGAS sampling; (C) Details of the refined connectivity graphs.

(PDF)

S1 Code. Scripts used to process Paramotopy output.

A detailed description of each script in this collection is given in S1 Appendix.

(GZ)

S1 Dataset. Summary of Paramotopy runs and associated Bertini settings.

This dataset contains: (1) a tab-delimited text file (metadata.tsv) enumerating all Paramotopy runs and re-runs performed in this analysis, along with the samples on which they were performed and the Bertini settings employed for each run and re-run; (2) a directory of tab-delimited text files (thresholds/) enumerating all Paramotopy runs and re-runs with their corresponding values of the thresholds Tzmin, Tzmax, T, and Td; and (3) an XML file (defaultprefs.xml), passed as input into Paramotopy, that enumerates the default Bertini settings given in Table 3. See S1 Appendix for details.

(GZ)

S1 File. Dataset DOIs.

All datasets are available on Mendeley Data under the given DOIs. See S1 Appendix for details.

(TSV)

Data Availability

All datasets referenced in the paper (outside of the files in S1 Dataset) are available on Mendeley Data. Their DOIs are listed in S1 File.

Funding Statement

K-MN and JG were supported by National Science Foundation award #1462629 (https://www.nsf.gov/). DJB was supported by National Science Foundation award #1719658. SVA and DJB were supported by National Science Foundation award #1115668. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1. Cannon WB. Organization for physiological homeostasis. Physiol Rev. 1929;9(3):399–431. 10.1152/physrev.1929.9.3.399 [DOI] [Google Scholar]
  • 2. Waddington CH. Canalization of development and the inheritance of acquired characters. Nature. 1942;150(3811):563–5. 10.1038/150563a0 [DOI] [Google Scholar]
  • 3. Holling CS. Resilience and stability of ecological systems. Annu Rev Ecol Syst. 1973;4(1):1–23. 10.1146/annurev.es.04.110173.000245 [DOI] [Google Scholar]
  • 4. Savageau MA. Parameter sensitivity as a criterion for evaluating and comparing the performance of biochemical systems. Nature. 1971;229(5286):542–4. 10.1038/229542a0 [DOI] [PubMed] [Google Scholar]
  • 5. Kitano H. Biological robustness. Nat Rev Genet. 2004;5(11):826–37. 10.1038/nrg1471 [DOI] [PubMed] [Google Scholar]
  • 6. Stelling J, Sauer U, Szallasi Z, Doyle FJ III, Doyle J. Robustness of cellular functions. Cell. 2004;118(6):675–85. 10.1016/j.cell.2004.09.008 [DOI] [PubMed] [Google Scholar]
  • 7. Andrianantoandro E, Basu S, Karig DK, Weiss R. Synthetic biology: new engineering rules for an emerging discipline. Mol Syst Biol. 2006;2(1):2006.0028. 10.1038/msb4100073 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Masel J, Trotter MV. Robustness and evolvability. Trends Genet. 2010;26(9):406–14. 10.1016/j.tig.2010.06.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Félix MA, Barkoulas M. Pervasive robustness in biological systems. Nat Rev Genet. 2015;16(8):483–96. 10.1038/nrg3949 [DOI] [PubMed] [Google Scholar]
  • 10. Gunawardena J. Models in systems biology: the parameter problem and the meanings of robustness In: Lodhi HM, Muggleton SH, editors. Elements of Computational Systems Biology. John Wiley & Sons, Ltd; 2010. p. 19–47. [Google Scholar]
  • 11. Strogatz SH. Nonlinear Dynamics and Chaos: With Applications to Physics, Biology, Chemistry, and Engineering. Perseus Books; 2001. [Google Scholar]
  • 12. Novick A, Weiner M. Enzyme induction as an all-or-none phenomenon. Proc Natl Acad Sci USA. 1957;43(7):553–66. 10.1073/pnas.43.7.553 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Dubnau D, Losick R. Bistability in bacteria. Mol Microbiol. 2006;61(3):564–72. 10.1111/j.1365-2958.2006.05249.x [DOI] [PubMed] [Google Scholar]
  • 14. Xiong W, Ferrell JE. A positive-feedback-based bistable ‘memory module’ that governs a cell fate decision. Nature. 2003;426(6965):460–5. 10.1038/nature02089 [DOI] [PubMed] [Google Scholar]
  • 15. Pomerening JR, Sontag ED, Ferrell JE. Building a cell cycle oscillator: hysteresis and bistability in the activation of Cdc2. Nat Cell Biol. 2003;5(4):346–51. 10.1038/ncb954 [DOI] [PubMed] [Google Scholar]
  • 16. Legewie S, Blüthgen N, Herzel H. Mathematical modeling identifies inhibitors of apoptosis as mediators of positive feedback and bistability. PLOS Comput Biol. 2006;2(9):e120 10.1371/journal.pcbi.0020120 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Thomson M, Gunawardena J. The rational parameterization theorem for multisite post-translational modification systems. J Theor Biol. 2009;261(4):626–36. 10.1016/j.jtbi.2009.09.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Ho KL, Harrington HA. Bistability in apoptosis by receptor clustering. PLOS Comput Biol. 2010;6(10):e1000956 10.1371/journal.pcbi.1000956 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Bernard BA. The human hair follicle, a bistable organ? Exp Dermatol. 2012;21(6):401–3. 10.1111/j.1600-0625.2012.01457.x [DOI] [PubMed] [Google Scholar]
  • 20. Laurent M, Kellershohn N. Multistability: a major means of differentiation and evolution in biological systems. Trends Biochem Sci. 1999;24(11):418–22. 10.1016/s0968-0004(99)01473-5 [DOI] [PubMed] [Google Scholar]
  • 21. Ingolia NT. Topology and robustness in the Drosophila segment polarity network. PLOS Biol. 2004;2(6):e123 10.1371/journal.pbio.0020123 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Graziani S, Silar P, Daboussi MJ. Bistability and hysteresis of the ‘Secteur’ differentiation are controlled by a two-gene locus in Nectria haematococca. BMC Biol. 2004;2:18 10.1186/1741-7007-2-18 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Chang HH, Oh PY, Ingber DE, Huang S. Multistable and multistep dynamics in neutrophil differentiation. BMC Cell Biol. 2006;7:11 10.1186/1471-2121-7-11 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Lasio P, Spooner CJ, Warmflash A, Lancki DW, Lee HJ, Sciammas R, et al. Multilineage transcriptional priming and determination of alternate hematopoietic cell fates. Cell. 2006;126(4):755–66. 10.1016/j.cell.2006.06.052 [DOI] [PubMed] [Google Scholar]
  • 25. Chickarmane V, Troein C, Nuber UA, Sauro HM, Peterson C. Transcriptional dynamics of the embryonic stem cell switch. PLOS Comput Biol. 2006;2(9):e123 10.1371/journal.pcbi.0020123 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Huang S. Non-genetic heterogeneity of cells in development: more than just noise. Development. 2009;136(23):3853–62. 10.1242/dev.035139 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Hanna JH, Saha K, Jaenisch R. Pluripotency and cellular reprogramming: facts, hypotheses, unresolved issues. Cell. 2010;143(4):508–25. 10.1016/j.cell.2010.10.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Ferrell JE. Bistability, bifurcations, and Waddington’s epigenetic landscape. Curr Biol. 2012;22(11):R458–66. 10.1016/j.cub.2012.03.045 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Markevich NI, Hoek JB, Kholodenko BN. Signaling switches and bistability arising from multisite phosphorylation in protein kinase cascades. J Cell Biol. 2004;164(3):353–9. 10.1083/jcb.200308060 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Thomson M, Gunawardena J. Unlimited multistability in multisite phosphorylation systems. Nature. 2009;460(7252):274–7. 10.1038/nature08102 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Crick F. Memory and molecular turnover. Nature. 1984;312(5990):101 10.1038/312101a0 [DOI] [PubMed] [Google Scholar]
  • 32. Lisman JE. A mechanism for memory storage insensitive to molecular turnover: a bistable autophosphorylating kinase. Proc Natl Acad Sci USA. 1985;82(9):3055–7. 10.1073/pnas.82.9.3055 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Ogasawara H, Kawato M. The protein kinase Mζ network as a bistable switch to store neuronal memory. BMC Syst Biol. 2010;4:181 10.1186/1752-0509-4-181 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Prabakaran S, Lippens G, Steen H, Gunawardena J. Post-translational modification: nature’s escape from genetic imprisonment and the basis for dynamic information coding. Wiley Interdiscip Rev Syst Biol Med. 2012;4(6):565–83. 10.1002/wsbm.1185 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Walsh CT. Posttranslational Modification of Proteins: Expanding Nature’s Inventory. Roberts and Company Publishers; 2006. [Google Scholar]
  • 36. Lisman J, Schulman H, Cline H. The molecular basis of CaMKII function in synaptic and behavioural memory. Nat Rev Neurosci. 2002;3(3):175–90. 10.1038/nrn753 [DOI] [PubMed] [Google Scholar]
  • 37. Gutenkunst RN, Waterfall JJ, Casey FP, Brown KS, Myers CR, Sethna JP. Universally sloppy parameter sensitivities in systems biology models. PLOS Comput Biol. 2007;3(10):e189 10.1371/journal.pcbi.0030189 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Varma A, Morbidelli M, Wu H. Parametric Sensitivity in Chemical Systems. Cambridge University Press; 1999. [Google Scholar]
  • 39. Stricker J, Cookson S, Bennett MR, Mather WH, Tsimring LS, Hasty J. A fast, robust and tunable synthetic gene oscillator. Nature. 2008;456(7221):516–9. 10.1038/nature07389 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Zi Z. Sensitivity analysis approaches applied to systems biology models. IET Syst Biol. 2011;5(6):336–46. 10.1049/iet-syb.2011.0015 [DOI] [PubMed] [Google Scholar]
  • 41. von Dassow G, Meir E, Munro EM, Odell GM. The segment polarity network is a robust developmental module. Nature. 2000;406(66792):188–92. 10.1038/35018085 [DOI] [PubMed] [Google Scholar]
  • 42. Ma W, Trusina A, El-Samad H, Lim WA, Tang C. Defining network topologies that can achieve biochemical adaptation. Cell. 2009;138(4):760–73. 10.1016/j.cell.2009.06.013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Shah NA, Sarkar CA. Robust network topologies for generating switch-like cellular responses. PLOS Comput Biol. 2011;7(6):e1002085 10.1371/journal.pcbi.1002085 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44. Chau AH, Walter JM, Gerardin J, Tang C, Lim WA. Designing synthetic regulatory networks capable of self-organizing cell polarization. Cell. 2012;151(2):320–32. 10.1016/j.cell.2012.08.040 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Zamora-Sillero E, Hafner M, Ibig A, Stelling J, Wagner A. Efficient characterization of high-dimensional parameter spaces for systems biology. BMC Syst Biol. 2011;5:142 10.1186/1752-0509-5-142 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Chaves M, Sengupta A, Sontag ED. Geometry and topology of parameter space: investigating measures of robustness in regulatory networks. J Math Biol. 2009;59(3):315–58. 10.1007/s00285-008-0230-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47. Dayarian A, Chaves M, Sontag ED, Sengupta AM. Shape, size, and robustness: feasible regions in the parameter space of biochemical networks. PLOS Comput Biol. 2009;5(1):e1000256 10.1371/journal.pcbi.1000256 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Manrai AK, Gunawardena J. The geometry of multisite phosphorylation. Biophys J. 2008;95(12):5533–43. 10.1529/biophysj.108.140632 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Gross E, Harrington HA, Rosen Z, Sturmfels B. Algebraic systems biology: a case study for the Wnt pathway. Bull Math Biol. 2016;78(1):21–51. 10.1007/s11538-015-0125-1 [DOI] [PubMed] [Google Scholar]
  • 50. Siegal-Gaskins D, Franco E, Zhou T, Murray RM. An analytical approach to bistable biological circuit discrimination using real algebraic geometry. J R Soc Interface. 2015;12(108):20150288 10.1098/rsif.2015.0288 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Bradford R, Davenport JH, England M, Errami H, Gerdt V, Grigoriev D, et al. A case study on the parametric occurrence of multiple steady states. In: Burr M, editor. Proceedings of the 2017 ACM on International Symposium on Symbolic and Algebraic Computation. ISSAC’17. ACM; 2017. p. 45–52.
  • 52. Feinberg M. Chemical reaction network structure and the stability of complex isothermal reactors—I. The deficiency zero and deficiency one theorems. Chem Eng Sci. 1987;42(10):2229–68. 10.1016/0009-2509(87)80099-4 [DOI] [Google Scholar]
  • 53. Otero-Muras I, Banga JR, Alonso AA. Characterizing multistationarity regimes in biochemical reaction networks. PLOS ONE. 2012;7(7):e39194 10.1371/journal.pone.0039194 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54. Pérez Millán M, Dickenstein A, Shiu A, Conradi C. Chemical reaction systems with toric steady states. Bull Math Biol. 2012;74(5):1027–65. 10.1007/s11538-011-9685-x [DOI] [PubMed] [Google Scholar]
  • 55. Otero-Muras I, Yordanov P, Stelling J. Chemical reaction network theory elucidates sources of multistability in interferon signaling. PLOS Comput Biol. 2017;13(4):e1005454 10.1371/journal.pcbi.1005454 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56. Conradi C, Feliu E, Mincheva M, Wiuf C. Identifying parameter regions for multistationarity. PLOS Comput Biol. 2017;13(10):e1005751 10.1371/journal.pcbi.1005751 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57. Pérez Millán M, Dickenstein A. The structure of MESSI biological systems. SIAM J Appl Dyn Syst. 2018;17(2):1650–82. 10.1137/17M1113722 [DOI] [Google Scholar]
  • 58.Bihan F, Dickenstein A, Giaroli M. Lower bounds for positive roots and regions of multistationarity in chemical reaction networks; 2018. Available from: https://arxiv.org/abs/1807.05157.
  • 59. Craciun G, Feinberg M. Multiple equilibria in complex chemical reaction networks: I. The injectivity property. SIAM J Appl Math. 2005;65(5):1526–46. 10.1137/S0036139904440278 [DOI] [Google Scholar]
  • 60. Joshi B, Shiu A. Simplifying the Jacobian criterion for precluding multistationarity in chemical reaction networks. SIAM J Appl Math. 2012;72(3):857–76. 10.1137/110837206 [DOI] [Google Scholar]
  • 61. Feliu E. Injectivity, multiple zeros and multistationarity in reaction networks. Proc R Soc A. 2015;471:20140530 10.1098/rspa.2014.0530 [DOI] [Google Scholar]
  • 62. Müller S, Feliu E, Regensburger G, Conradi C, Shiu A, Dickenstein A. Sign conditions for injectivity of generalized polynomial maps with applications to chemical reaction networks and real algebraic geometry. Found Comput Math. 2016;16(1):69–97. 10.1007/s10208-014-9239-3 [DOI] [Google Scholar]
  • 63. Holstein K, Flockerzi D, Conradi C. Multistationarity in sequential distributed multisite phosphorylation networks. Bull Math Biol. 2013;75(11):2028–58. 10.1007/s11538-013-9878-6 [DOI] [PubMed] [Google Scholar]
  • 64. Conradi C, Mincheva M. Catalytic constants enable the emergence of bistability in dual phosphorylation. J R Soc Interface. 2014;11(95):20140158 10.1098/rsif.2014.0158 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65. Pérez Millán M, Turjanski AG. MAPK’s networks and their capacity for multistationarity due to toric steady states. Math Biosci. 2015;262:125–137. 10.1016/j.mbs.2014.12.010 [DOI] [PubMed] [Google Scholar]
  • 66. Giaroli M, Bihan F, Dickenstein A. Regions of multistationarity in cascades of Goldbeter–Koshland loops. J Math Biol. 2018. 10.1007/s00285-018-1304-0 [DOI] [PubMed] [Google Scholar]
  • 67.Harrington HA, Mehta D, Byrne HM, Hauenstein JD. Decomposing the parameter space of biological networks via a numerical discriminant approach. In: Gerhard J, Kotsireas I, editors. Maple in Mathematics Education and Research. MC 2019. vol. 1125 of Communications in Computer and Information Science. Springer Cham; 2020. p. 114–31.
  • 68. Lum PY, Singh G, Lehman A, Ishkanov T, Vejdemo-Johansson M, Alagappan M, et al. Extracting insights from the shape of complex data using topology. Sci Rep. 2013;3:1236 10.1038/srep01236 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69. Gunawardena J. A linear framework for time-scale separation in nonlinear biochemical systems. PLOS ONE. 2012;7(5):e36321 10.1371/journal.pone.0036321 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70. Gunawardena J. Time-scale separation: Michaelis and Menten’s old idea, still bearing fruit. FEBS J. 2014;281(2):473–88. 10.1111/febs.12532 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71. Xu Y, Gunawardena J. Realistic enzymology for post-translational modification: zero-order ultrasensitivity revisited. J Theor Biol. 2012;311:139–52. 10.1016/j.jtbi.2012.07.012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72. Dasgupta T, Croll DH, Owen JA, Vander Heiden MG, Locasale JW, Alon U, et al. A fundamental trade-off in covalent switching and its circumvention by enzyme bifunctionality in glucose homeostasis. J Biol Chem. 2014;289(19):13010–25. 10.1074/jbc.M113.546515 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73. Gunawardena J. Some lessons about models from Michaelis and Menten. Mol Biol Cell. 2012;23(4):517–9. 10.1091/mbc.E11-07-0643 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74. Ortega F, Acerenza L, Westerhoff HV, Mas F, Cascante M. Product dependence and bifunctionality compromise the ultrasensitivity of signal transduction cascades. Proc Natl Acad Sci USA. 2002;99(3):1170–5. 10.1073/pnas.022267399 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75. Blüthgen N, Bruggeman FJ, Legewie S, Herzel H, Westerhoff HV, Kholodenko BN. Effects of sequestration on signal transduction cascades. FEBS J. 2006;273(5):895–906. 10.1111/j.1742-4658.2006.05105.x [DOI] [PubMed] [Google Scholar]
  • 76. Fersht A. Enzyme Structure and Mechanism. W. H. Freeman & Company; 1985. [Google Scholar]
  • 77. Bates DJ, Sommese AJ, Hauenstein JD, Wampler CW. Numerically Solving Polynomial Systems with Bertini Software, Environment, and Tools. SIAM; 2013. [Google Scholar]
  • 78. Hauenstein JD, Sottile F. Algorithm 921: alphaCertified: certifying solutions to polynomial systems. ACM Trans Math Softw. 2012;38(4):28 10.1145/2331130.2331136 [DOI] [Google Scholar]
  • 79. Suwanmajo T, Krishnan J. Mixed mechanisms of multi-site phosphorylation. J R Soc Interface. 2015;12(107):20141405 10.1098/rsif.2014.1405 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80. Rubinstein BY, Mattingly HH, Berezhkovskii AM, Shvartsman SY. Long-term dynamics of multisite phosphorylation. Mol Biol Cell. 2016;27(14):2331–40. 10.1091/mbc.E16-03-0137 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81. Cornish-Bowden A. Fundamentals of Enzyme Kinetics. Wiley-Blackwell; 2012. [Google Scholar]
  • 82. Gunawardena J. Multisite protein phosphorylation makes a good threshold but can be a poor switch. Proc Natl Acad Sci USA. 2005;102(41):14617–22. 10.1073/pnas.0507322102 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83. Sommese AJ, Wampler CW. The Numerical Solution of Systems of Polynomials Arising in Engineering and Science. World Scientific; 2005. [Google Scholar]
  • 84. Lepage GP. A new algorithm for adaptive multidimensional integration. J Comput Phys. 1978;27(2):192–203. 10.1016/0021-9991(78)90004-9 [DOI] [Google Scholar]
  • 85. Hsu D, Latombe JC, Motwani R, Kavraki LE. Capturing the connectivity of high-dimensional geometric spaces by parallelizable random sampling techniques In: Pardalos P, Rajasekaran S, editors. Advances in Randomized Parallel Computing. vol. 5 of Combinatorial Optimization. Springer; US; 1999. p. 159–82. [Google Scholar]
  • 86. Geraerts R, Overmars MH. A comparative study of probabilistic roadmap planners In: Boissonnat JD, Burdick J, Goldberg K, Hutchinson S, editors. Algorithmic Foundations of Robotics V. vol. 7 of Springer Tracts in Advanced Robotics. Springer–Verlag; 2004. p. 43–57. [Google Scholar]
  • 87. Arya S, Mount DM, Netanyahu NS, Silverman R, Wu AY. An optimal algorithm for approximate nearest neighbour searching in fixed dimensions. J ACM. 1998;45(6):891–923. 10.1145/293347.293348 [DOI] [Google Scholar]
  • 88.Mount DM, Arya S. ANN: a library for approximate nearest neighbor searching; 2010. Available from: https://www.cs.umd.edu/~mount/ANN/.
  • 89. Conradi C, Iosif A, Kahle T. Multistationarity in the space of total concentrations for systems that admit a monomial parametrization. Bull Math Biol. 2019. 10.1007/s11538-019-00639-4 [DOI] [PubMed] [Google Scholar]
  • 90.Giaroli M, Rischter R, Pérez Millán M, Dickenstein A. Parameter regions that give rise to 2⌊n/2⌋ + 1 positive steady states in the n-site phosphorylation system; 2019. Available from: https://arxiv.org/abs/1904.11633. [DOI] [PubMed]
  • 91. Malleshaiah MK, Shahrezaei V, Swain PS, Michnick SW. The scaffold protein Ste5 directly controls a switch-like mating decision in yeast. Nature. 2010;465(7294):101–5. 10.1038/nature08946 [DOI] [PubMed] [Google Scholar]
  • 92. Ghrist R. Barcodes: the persistent topology of data. Bull Amer Math Soc. 2008;45(1):61–75. 10.1090/S0273-0979-07-01191-3 [DOI] [Google Scholar]
  • 93. Edelsbrunner H, Harer JL. Computational Topology: An Introduction. American Mathematical Society; 2010. [Google Scholar]
  • 94. Smale S. Newton’s method estimates from data at one point In: Ewing RE, Gross KI, Martin CF, editors. The Merging of Disciplines: New Directions in Pure, Applied, and Computational Mathematics. Springer; New York; 1986. p. 185–96. [Google Scholar]
  • 95. Lohr SL. Sampling: Design and Analysis. Duxbury Press; 2009. [Google Scholar]
PLoS Comput Biol. doi: 10.1371/journal.pcbi.1007573.r001

Decision Letter 0

Pedro Mendes, Mark Alber

25 Feb 2020

Dear Dr. Gunawardena,

Thank you very much for submitting your manuscript "Robustness and parameter geography in post-translational modification systems" for consideration at PLOS Computational Biology. As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. The reviewers appreciated the attention to an important topic. Based on the reviews, we are likely to accept this manuscript for publication, providing that you modify the manuscript according to the review recommendations.

Please prepare and submit your revised manuscript within 30 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. 

When you are ready to resubmit, please upload the following:

[1] A letter containing a detailed list of your responses to all review comments, and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out

[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).

Important additional instructions are given below your reviewer comments.

Thank you again for your submission to our journal. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Pedro Mendes, PhD

Associate Editor

PLOS Computational Biology

Mark Alber

Deputy Editor

PLOS Computational Biology

***********************

A link appears below if there are any accompanying review attachments. If you believe any reviews to be missing, please contact ploscompbiol@plos.org immediately:

[LINK]

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: In their article, Nam and coworkers, computationally explore the steady-state landscale of a dual phosphorylation cycle, and in particular scan for parameter areas of bistability (or better, areas that exhibit 3 steady states, as they don’t formally prove that these areas are bistable. They compare a system in which enzymes operate fully independent of their product, and systems in which the product inhibits the enzyme. More specifically, they derive an angebraic equation (a polynome) that describes the steady-state, and after scaling the parameters/total concentrations, they scan the parameter using sophisticated sampling algorithms to quantify areas of bistability. They find that bistability occurs only if enzyme (kinase) concentration is smaller then the substrate, and that product inhibition reduces the areas of bistability.

The major contribiton of the paper is two-fold: First, they provide a detailed algorithmic approach that can serve as a blueprint to explore mono/bistability in a complex reaction system systematically, and they provide detailed material in the supplement including scripts that may guide how to do this. Second, they find that bistability is less likely to occur when product inhibition is allowed.

Overall, the methods and analyses seem sound.

While I value these contributions, I still find it very difficult to really understand the implications. What are really the biologically significant findings? Can we understand it mechanistically, why bistability requires that substrate and enzyme concentrations are like described in the paper? I think the authors need to work on interpreting and also presenting their work such that it is more clear what the results mean in terms of signaling biology.

Reviewer #2: Review of

Robustness and parameter geography in post-translational modification systems

The authors build on their respective previous work on post-translational modification (PTM) systems by the Harvard lab and that of numerical algebraic geometry developers. They set out to create a general framework for (1) writing arbitrary enzyme kinetics of PTMs, in terms of polynomial functions Phi1 and Phi2, and (2) analysing the parameter space within Eq 10 using numerical algebraic geometry, and then quantifying the volume of the bistable region of this space. The authors find that the volume of bistability of the parameter geography varies significantly between different biological mechanisms.

Overall, the paper is well written, I found no typos while reading it. The additional information was useful for understanding the analysis performed.

I recommend it to be published, but first a few clarifications to improve the paper.

- The visibility ratio sounds like it is approximating geodesics. Can numerical algebraic geometry approximate this variety in parameter space, and provide some indication of the degree of this “visibility” curve?

- The ``grammar” sounds similar to studying the graph rather than the resulting equations, which is a feature of chemical reaction network theory. It would be useful to explain the connection when presenting chemical Eqs (1). For example, this grammar sounds similar to generalized mass action kinetics.

- The authors should discuss how their parameter geography compares to the work below, and whether the methods described below could provide insight to prove their conjectures:

Joshi and Shiu (SIAM Journal on Applied Mathematics), conditions on parameter

Conradi, Feliu, et al (PLOS Computational Biology) conditions on parameter

Harrington, Mehta et al (Communications in Computer and Information Science) stability and parameter geography

- For the parameter values that could not be certified, are these near the discriminant locus?

- It would be helpful to have more details in table captions.

Overall this is a good balance between application, theory and computation and recommend it for publication in PLOS Computational Biology.

Reviewer #3: Many biological systems depend upon parameters and it is important to understand the behavior of the system as the parameters change. The parameter space is partitioned into regions where the qualitative behavior of the steady-states of the system remain unchanged on each region. When the steady-state system is polynomial, tools from numerical algebraic geometry (such as Bertini, Paramotopy, and alphaCertified) can be used to analyze the "geography" of parameter space.

This paper considers a steady-state system which is polynomial in 2 variables dependent upon 8 parameters. By using random sampling of the set of parameters which are biologically meaningful, the authors analyze the "geography" of the parameter space. Overall, the setup for the computational experiments described in the paper appear to be reasonable while the scale of these computations is quite remarkable.

My only comment on the paper is in regards to decomposing the parameter space: Was there an attempt to symbolically compute the discriminant polynomial of (10)? Or compute it using numerical algebraic geometry? What is its degree?

Overall, this paper should be accepted with a minor revision based on the answers to these questions regarding the discriminant.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Computational Biology data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

Figure Files:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Data Requirements:

Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example in PLOS Biology see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5.

Reproducibility:

To enhance the reproducibility of your results, PLOS recommends that you deposit laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see http://journals.plos.org/ploscompbiol/s/submission-guidelines#loc-materials-and-methods

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1007573.r003

Decision Letter 1

Pedro Mendes, Mark Alber

2 Apr 2020

Dear Dr. Gunawardena,

We are pleased to inform you that your manuscript 'Robustness and parameter geography in post-translational modification systems' has been provisionally accepted for publication in PLOS Computational Biology.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology. 

Best regards,

Pedro Mendes, PhD

Associate Editor

PLOS Computational Biology

Mark Alber

Deputy Editor

PLOS Computational Biology

***********************************************************

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #3: The discriminant for the positive real roots is indeed algebraic -- the number of such roots can only change if a root becomes singular (classical discriminant) or by having a root intersect a coordinate axis. Lazard and Rouillier (JSC, 2007) call this the "minimal discriminant variety" for the corresponding problem -- see [Section 1.2, 67]. Nonetheless, I agree with the authors that this may be difficult to compute for this problem, so I am recommending acceptance.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Computational Biology data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #3: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #3: No

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1007573.r004

Acceptance letter

Pedro Mendes, Mark Alber

23 Apr 2020

PCOMPBIOL-D-19-02084R1

Robustness and parameter geography in post-translational modification systems

Dear Dr Gunawardena,

I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Laura Mallard

PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Appendix. Supplemental methods.

    This document provides a comprehensive description of our implementations of the methods described in the paper and guidelines for navigating the supplemental code and datasets. Supplemental figures: (A) Workflow for computing and parsing solutions with Paramotopy. Supplemental tables: (A) Seeds used to initialise the MATLAB pseudo-random number generator for sampling; (B) Seeds used to initialise the MATLAB pseudo-random number generator for VEGAS sampling; (C) Details of the refined connectivity graphs.

    (PDF)

    S1 Code. Scripts used to process Paramotopy output.

    A detailed description of each script in this collection is given in S1 Appendix.

    (GZ)

    S1 Dataset. Summary of Paramotopy runs and associated Bertini settings.

    This dataset contains: (1) a tab-delimited text file (metadata.tsv) enumerating all Paramotopy runs and re-runs performed in this analysis, along with the samples on which they were performed and the Bertini settings employed for each run and re-run; (2) a directory of tab-delimited text files (thresholds/) enumerating all Paramotopy runs and re-runs with their corresponding values of the thresholds Tzmin, Tzmax, T, and Td; and (3) an XML file (defaultprefs.xml), passed as input into Paramotopy, that enumerates the default Bertini settings given in Table 3. See S1 Appendix for details.

    (GZ)

    S1 File. Dataset DOIs.

    All datasets are available on Mendeley Data under the given DOIs. See S1 Appendix for details.

    (TSV)

    Attachment

    Submitted filename: response-to-reviewers.pdf

    Data Availability Statement

    All datasets referenced in the paper (outside of the files in S1 Dataset) are available on Mendeley Data. Their DOIs are listed in S1 File.


    Articles from PLoS Computational Biology are provided here courtesy of PLOS

    RESOURCES