Skip to main content
iScience logoLink to iScience
. 2020 Sep 15;23(10):101565. doi: 10.1016/j.isci.2020.101565

Diverse Cell Stimulation Kinetics Identify Predictive Signal Transduction Models

Hossein Jashnsaz 1, Zachary R Fox 2,3,4, Jason J Hughes 1, Guoliang Li 1, Brian Munsky 4,5,, Gregor Neuert 1,6,7,8,∗∗
PMCID: PMC7549069  PMID: 33083733

Summary

Computationally understanding the molecular mechanisms that give rise to cell signaling responses upon different environmental, chemical, and genetic perturbations is a long-standing challenge that requires models that fit and predict quantitative responses for new biological conditions. Overcoming this challenge depends not only on good models and detailed experimental data but also on the rigorous integration of both. We propose a quantitative framework to perturb and model generic signaling networks using multiple and diverse changing environments (hereafter “kinetic stimulations”) resulting in distinct pathway activation dynamics. We demonstrate that utilizing multiple diverse kinetic stimulations better constrains model parameters and enables predictions of signaling dynamics that would be impossible using traditional dose-response or individual kinetic stimulations. To demonstrate our approach, we use experimentally identified models to predict signaling dynamics in normal, mutated, and drug-treated conditions upon multitudes of kinetic stimulations and quantify which proteins and reaction rates are most sensitive to which extracellular stimulations.

Subject Areas: Bioinformatics, Complex System Biology, Systems Biology

Graphical Abstract

graphic file with name fx1.jpg

Highlights

  • Diverse kinetic cell stimulations result in distinct signaling response dynamics

  • Diverse kinetics compared with step stimulations better constrain model parameters

  • Diverse kinetic stimulations improve predictions of WT and mutant pathway responses

  • Diverse kinetic stimulations compared with steps discriminate competing signaling models


Bioinformatics; Complex System Biology; Systems Biology

Introduction

One of the longest standing challenges of modeling in systems biology has been to make accurate quantitative predictions for cell signaling responses over time, upon genetic mutations, when subjected to variable drug concentrations, and under time-varying changes of environment (Fujita et al., 2010; Handly et al., 2016; Rowland et al., 2011; Sorre et al., 2014; Lim and Bruce Mayer, 2017). Examples of environmental perturbations that change over time that cells experience (defined as “kinetic stimulations”) include changes in levels of hormones (Kubota et al., 2012; Steiner et al., 1982), growth factors (Fujita et al., 2010; Sorre et al., 2014), morphogens (Briscoe and Small, 2015; Huang et al., 2017), cytokines (Oyler-Yaniv et al., 2017), or extracellular stressors such as oxidative stress (Goulev et al., 2017) or osmolarity (Granados et al., 2017; Mitchell et al., 2015; Young et al., 2013). These environmental changes can modulate signaling dynamics in pathways in both healthy and disease tissue (Akhurst and Hata, 2012; Cildir et al., 2016; Hanahan and Weinberg, 2000; Hata and Chen, 2016; Hotamisligil and Davis, 2016). Therefore, understanding and predicting signal transduction network behavior will be a critical step to identify unknown regulatory mechanisms, to distinguish between proteins and reaction rates that are sensitive to kinetic stimulations, and to detect and treat abnormal regulation that occurs in a large number of human diseases.

A key obstacle that prevents predictive modeling of cell signaling is the gross mismatch between the preponderance of biological complexity and the sparsity of quantitative experimental data (Handly et al., 2016; Janes and Lauffenburger, 2013; Vanhaelen et al., 2017). As a consequence, most current models of signal transduction pathways suffer from lack of dynamic richness in the data resulting in either too simple (Adler and Alon, 2018; Csete and Doyle, 2002; Muzzey et al., 2009) or too complex (Groβ et al., 2019; Klipp et al., 2005; Romers et al., 2020) models with limited predictive power (Handly et al., 2016; Janes and Lauffenburger, 2013). To address the disparity between biological complexity and lack of richness in experimental data, one paradigm has been to devise experiments with higher content (e.g., sequencing or multiplexed single-cell imaging) or higher throughput (e.g., flow cytometry or parallelized microfluidics) in hope that large amounts of data will eventually fill the gap between mechanistic and predictive understanding (Efremova et al., 2020; Labib and Kelley, 2020). These approaches have been helpful to quantify RNA or protein abundances or phosphorylation states of many proteins using the minimal number of time points to better constrain complex models (Kinney et al., 2019). Another approach has been to fit computational models to experiments at steady state in different environments (Hao and O'Shea, 2012), to dose-response curves measured with sudden step-like perturbations (Shaw et al., 2019), to repetitive pulsatile perturbations of different frequencies (Albeck et al., 2013; Ashall et al., 2009; Hersen et al., 2008; Mettetal et al., 2008; Mitchell et al., 2015; Rahi et al., 2017; Wang et al., 2012; Zhang et al., 2019), or to individual spatial gradients (Huang et al., 2017) or design optimal experiments that constrain parameters for a specific perturbation profile (Apgar et al., 2008; Bandara et al., 2009; Casey et al., 2006; Faller et al., 2003; Hagen et al., 2013). In addition, pioneering studies have further demonstrated that individual and different kinetic stimulations can dramatically affect intracellular signaling dynamics to create distinct cell phenotypes (Fujita et al., 2010; Granados et al., 2017; Kubota et al., 2012; Muzzey et al., 2009; Shimizu et al., 2010; Sorre et al., 2014; Thiemicke et al., 2019; Twohig et al., 2019; Wang et al., 2012; Young et al., 2013). However, from these previous studies, it is not understood how parameter uncertainty, model predictions, and model identification quantitatively depend on the amount and type of experimental data.

Here we systematically explore how multiple, kinetically distinct inputs can activate signaling pathways to achieve diverse signaling responses, and we show that this diversity of input-output relationships is key to the identification of predictive models for complex biological pathways (Figure 1). To demonstrate the feasibility of this task, we infer several models from single-cell time-lapse microscopy data for cell signaling in the conserved High Osmolarity Glycerol (HOG/p38) pathway, a prototypical mitogen-activated protein kinase (MAPK) pathway in the yeast Saccharomyces cerevisiae that is important for stress adaptation and cell survival (Figure 1A) (Cuadrado and Nebreda, 2010; Saito and Posas, 2012; Thiemicke et al., 2019). This eukaryotic model system includes the most relevant features of signaling networks, including a terminal signaling protein (Hog1) and a branched protein network that comprises membrane sensors, phosphorelays, kinases, phosphatases, autoregulation, and feedback and feedforward loops that are found in the majority of signaling pathways (Lim and Bruce Mayer, 2017). With this system, it has been previously demonstrated experimentally that changing the extracellular osmolyte concentrations over time results in distinct Hog1 activation dynamics as pathway response output (Figures 1B and 1C) (Ferrigno et al., 1998; Hersen et al., 2008; Mettetal et al., 2008; Mitchell et al., 2015; Thiemicke et al., 2019). Because of these attributes that are common in many signal transduction pathways in eukaryotic cells, we now seek to explore what general implications the diversity of multiple input-to-output dynamics can have on the possibility to identify predictive signaling models and the regulatory mechanisms that are sensitive to kinetic environmental perturbations.

Figure 1.

Figure 1

Kinetic Stimulation of Signaling Pathways Is Required to Identify Predictive Models and Phenotypes

(A) Different extracellular kinetic stimulations (left) activate a signaling pathway (middle) and result in distinct dynamic kinase signaling and nuclear localization (right).

(B) MAPK Hog1 nuclear localization dynamics upon step increases in NaCl to different final concentrations.

(C) Hog1 dynamics upon step, linear, or quadratic increases from 0 to 0.4 M NaCl. Lines are means, and shaded areas are the standard deviation of multiple biological replicates (Transparent Methods).

(D–F) (D) Schematic overview of model identification based on models' predictions utilizing kinetic cell stimulations. (E) Schematic overview of signaling models of different complexity where pathways are modeled with ordinary differential equations (ODE) (Figure S1; Transparent Methods). The middle model is defined as the “true model” to simulate synthetic data for pathway response (defined as the activation of node x4). Different models were fitted to the same data to identify parameters. (F) These models were then used to make predictions for pathway activation, depicted as black solid line, upon a different kinetic input, which resulted in different responses for different models (colored lines).

(G) Diverse kinetic perturbations corresponding to step (t0), root (t), linear (t1), quadratic (t2), quint (t5), and heptic (t7) changes over time (left) are applied to the true model (middle) resulting in synthetic signaling activation dynamics (right). In synthetic signaling data, lines are means and shaded areas are the standard deviation of simulated replicates (Transparent Methods).

(H) Diverse kinetic inputs reaching different final concentrations plus four alternating stimulations result in 58 distinct simulated datasets for each model. These datasets are used to train models and test their predictions. The alternating stimulations include a staircase input, two pulsatile inputs, and a sinusoidal oscillatory input (Figures S3E–S3H).

See also Figures S1–S3 and Table S1.

Results

Parametrizing Signaling Models with Experimental Data Enables Predictions of Pathway Responses upon Kinetic Stimulations

As the HOG model pathway combines universal signaling network features as outlined earlier, it serves as a blueprint to build predictive signaling models of varying complexity (Figures 1A–1E). Previous published models have limitations in that they are too simple and lack molecular detail (Hersen et al., 2008; Muzzey et al., 2009) or that they are too complex having too many parameters to estimate (Klipp et al., 2005). Furthermore, other models are too specific, and their complexity is difficult to adapt when seeking to identify predictive models from finite amounts of experimental data (Granados et al., 2017; Schaber et al., 2012; Zi et al., 2010). To circumvent the challenge of variable complexity, we specify a class of expandable network topologies outlined in Figures 1E and S1K. In this class, model nodes are defined based on branching points in the Hog1 pathway starting from the Hog1 kinase, and complexity of the pathway can be increased by separating nodes into sub-nodes (Figure 1E, left to right) (Transparent Methods). For example, the representative model in Figure 1E (middle) resembles a simplified branched signaling pathway consisting of four nodes, including one activating and one repressing sensor protein, constant basal regulators, and a negative feedback loop from the terminal kinase to an upstream signaling branch (Figures S1A). In the context of the HOG pathway, the node x1 represents the SLN1 branch including the proteins Sln1, Ypd1, Ssk1, and Ssk2/Ssk22. The SLN1 branch utilizes a two-component phosphorelay mechanism to transmit its signal, with b1 representing the constant deactivation of the SLN1 branch (Hohmann et al., 2007; Maeda et al., 1994). The node x2 describes the SHO1 branch of the HOG pathway that utilizes protein kinases to relay its information. The SHO1 branch consists of the proteins Sho1, Msb2, Hkr1, Opy2, Cdc42, Ste20/Cla4, Ste11, and Ste50, with b2 modeling the basal deactivation of the SHO1 branch (Tatebayashi et al., 2015). The SHO1 branch is also regulated by the Hog1 kinase through a feedback loop (Hao et al., 2007; O'Rourke and Herskowitz, 1998; Westfall and Thorner, 2006). The node x3 represents an MAPKK such as Pbs2 that integrates information flow from two branches and has basal regulation (b3) through phosphatases such as Ptc1/2/3. Last, x4 represents a terminal kinase such as Hog1 that is activated by Pbs2. Hog1 is deactivated through phosphatases Ptc1/2/3 and Ptp2/3 (Mattison and Ota, 2000; Warmka et al., 2001; Young et al., 2002). Deactivation of Hog1 via constitutively active phosphatases is modeled as the act of basal deactivator b4 on x4.

We parametrized these representative biologically inspired models (Transparent Methods) by fitting them to experimental Hog1 nuclear localization data (Figures 1B and S1K–S1M). Using experimentally constrained parameters for each model, we predicted Hog1 signaling dynamics upon different kinetic stimulation profiles and evaluated each model's ability to predict independent experimental Hog1 dynamics. The model (Figure 1E, middle) and the parameter set (Table S1) resulting in the smallest prediction error was defined as the “true model” and the “true parameters,” respectively. From this model, we simulate synthetic signaling data upon diverse kinetic stimulations for the remainder of this study. Using a known model for this task, rather than additional experiments, allows us to systematically and quantitatively establish how diverse kinetic stimulations impact model identifiability and predictive power in a controlled setting where ground truth knowledge is available to check performance.

Diverse Kinetic Cell Stimulations Result in Distinct Pathway Activation Dynamics

With a specified “true model,” we can now rigorously explore diverse kinetic inputs to pathway dynamics (Figure 1G). We simulated 54 synthetic datasets (Figures 1H, and S2) under a wide range of physiologically feasible and mutually independent kinetic stimulation profiles (see Transparent Methods) such that each of the 54 profiles stimulates the pathway to express a unique temporal behavior. Our simulated data (Figures 2 and S2) qualitatively and quantitatively capture the main characteristics of measured dynamics for Hog and other signaling pathways, including modulation of activation level, diversity of signaling dynamics, delayed onset of activation, variable time to reach maximum activation, perfect adaptation, and measurement error observed in experiments (Figures 1B and 1C). Each signaling dataset covers the same duration and sampling range and has the same amount of data. However, different datasets may constrain model parameters to different extents that, in turn, could lead to predictions of different accuracy. To compare the performance of kinetic stimulations, we also simulated signaling activation dynamics upon staircase, pulsatile, and sinusoidal stimulation inputs (Figures 1H, datasets 55–58, and S3).

Figure 2.

Figure 2

Models Trained Using Same Kinetic-Type Inputs Fail to Predict Pathway Response to Other Kinetics

(A–F) Models trained using step inputs fail to predict pathway response to kinetic stimulations. Gray lines show synthetic pathway activation dynamics over time at different kinetic inputs as indicated inside each panel: step (t0, A), root (t, B), linear (t1, C), quadratic (t2, D), quint (t5, E), and heptic (t7, F) input kinetics over time each to increasing final concentrations of 0.10, 0.20, 0.30, and 0.50 M. (A) Model fit simultaneously to steps of 0.2 and 0.3 M data are shown in red. (A–F) Predictions under all other conditions are shown in blue. Predictions in (E) and (F) of all four concentrations overlap. Thick lines and shaded areas show median and interquartile range out of 10 independent fits and their corresponding predictions, respectively. As shown as an inset in (A), the first, second, and third quartiles are used to plot shaded error bars where the thick line and upper and lower shaded areas represent Q2, (Q3-Q2), and (Q2-Q1), respectively. This convention is used throughout the article.

(G–L) Similar to (A–F), models trained using quadratic inputs fail to predict pathway response to other kinetics.

See also Figure S2.

Lack of Kinetic Stimulation Diversity Limits Model Prediction Power

To explore what effects different stimulations have on model predictive power, we fit the true model to many independent sets of simulated data with experimentally realistic noise (Transparent Methods). The results of model fit simultaneously for two steps of 0.2 and 0.3 M NaCl is shown in Figure 2A (red). After confirming that these fits had converged to maximize the likelihood of observing the training data (Figures S1G–S1J; Transparent Methods), we then predicted the remaining 54 datasets for steps as well as all other kinetics (blue) (Figure 2). Comparison of the fits and predictions to the corresponding training and testing data shows that the quality of predictions is nearly as good as fits for the same type of kinetics (Figures 2A and 2J, blue), but predictions become worse for different types of kinetics (Figures 2B–2F, 2G–2I, 2K, and 2L). This raises the question if the lack of predictability is due to the lack of data or due to limited kinetic diversity in the training data.

To address the possibility of having too little data, we fit the model to six-step (t0) data simultaneously (Figure 3A, red), and we predicted the signaling dynamics upon all remaining kinetic stimulations (48 datasets). We compared model predictions of signaling dynamics for linear kinetic stimulations (t1) or nonlinear kinetic stimulations (t9) of different final concentrations to their corresponding synthetic data (Figure 3A, blue or green, respectively, compared to gray). After convergence, each set of model fits resulted in poor predictions for all datasets except for testing data collected using the same kinetic type as the training data. This observation was the same for subsequent training datasets with homogeneous input types (Figures S3A–S3C). Comparing how model predictability depends on the amount of training data of the same type illustrates that simply collecting more data of the same type does not automatically result in improved predictability (Figures 3B and S3A–S3C, right). Rather, the specific kinetics upon which the cells are stimulated may be of greater importance.

Figure 3.

Figure 3

Kinetic Stimulation Improves Model Predictions

(A) Simultaneous fits (red) to six simulated step input response of different concentrations (gray) and subsequent model predictions of signaling dynamics upon different concentrations of linear input stimulations (predictions1 in blue, testdata1 in gray) or different concentrations of nonlinear inputs of the shape t9 (predictions3 in green, testdata3 in gray).

(B) Box plots of fit and prediction errors when an increasing number of step inputs is used to train the model.

(C) Simultaneous fit (red) to six different kinetic input stimulations of the same final concentration (gray) and model predictions for different concentrations of linear input stimulations (predictions1 in blue, testdata1 in gray) or different concentrations of nonlinear input stimulations of the shape t9 (predictions3 in green, testdata3 in gray). In (A and C), thick lines and shaded areas in gray show the mean and the standard deviation of synthetic data. Thick lines and shaded areas in red, blue, and green show median and interquartile range of 10 independent fits and their corresponding predictions, respectively.

(D) Box plots of fit and prediction errors when an increasing number of diverse kinetics (t0 t7) is used to train the model. For (B and D), squares and error bars show median and first and third quartiles, respectively. Horizontal dotted black line denotes the standard deviation of the simulated data. Fit or prediction errors (FE or PE1-PE4) are the mean absolute difference of the fit or predictions and their corresponding synthetic data over time (Transparent Methods). Fit error (FE) statistics are drawn from ntrain datasets over 10 independent fits (10 × ntrain errors) where ntrain = 1,2, …6 is the number of datasets used to train the model. Similarly, PE1, PE2, and PE3 are drawn from prediction errors of testdata1 (36-ntrain datasets over 10 independent fits), testdata2 (12 datasets over 10 independent fits), and testdata3 (6 datasets over 10 independent fits), respectively (Figure 1H).

(E–G) (E) Upon a sinusoidal oscillatory stimulation input (Figure S3H), model predictions (examples of P4 in purple) compared with testdata4 (gray) under (F) six steps versus (G) six diverse kinetics training.

(H) Full boxplots of fit (FE) and predictions errors (PE1, PE2, PE3, and PE4) compared with standard deviation of the synthetic data (horizontal dashed line) when six datasets of each kinetics (or equivalent) are used to train the model. Quantifications are from 10 independent fits for each condition. PE4 is prediction errors quantified from alternating stimulations that consist of staircase, pulsatile1, pulsatile2, and sinusoidal oscillatory inputs (Figures S3E–S3H). When either of the alternating stimulations is used as training data, PE4 is quantified over the other three alternating stimulations.

See also Figure S3 and Video S1.

Diversified Kinetic Stimulations Better Constrain Model Parameters and Improve Model Predictions

To address the importance of kinetic diversity in training data, we fit signaling dynamics for different kinetic stimulations (t0t7) of a given final concentration (Figure 3C, red), and we predicted the signaling dynamics for the remaining kinetic stimulations (Figure 3C, blue and green). Comparing model predictions of signaling dynamics for linear kinetic stimulations (t1) or nonlinear kinetic stimulations (t9) of several different final concentrations with their corresponding synthetic data (Figures 3C and S3D, blue or green, respectively, compared with gray) indicates that all the predictions are substantially improved and are nearly as good as the fits, whereas the amount of training data is still the same as in Figure 3A. Quantitatively comparing how model predictability depends on the amount of different types of training data illustrates that kinetically diverse training data substantially improve predictability under all test kinetics (Figure 3D, blue, yellow, or green quantified from testdata1, testdata2, or testdata3).

Our results indicate that a minimum of five diverse training datasets are sufficient to achieve average prediction errors within 75% of the standard deviation of data, and six diverse datasets result in average prediction errors within 51% of the standard deviation of the data. For a more stringent evaluation of generality and extensibility, we then challenged the various trained models with entirely different sets of novel alternating stimulations including staircase, pulsatile, or complex sinusoidal oscillatory inputs (testdata4) (Figures 3E–3G and S3E–S3H). After training using six steps (Figures 3A and S3I), six t5 kinetics (Figure S3J), or six diverse kinetics (Figures 3C and S3K), we compared model predictions with their corresponding simulated data for four alternative stimulations (S3E–S3K, purple compared with gray). Finally, we fit each of the 11 different training datasets and predict the remaining data (Figure 3H). Each of the 11 different training datasets contains the same amount of data and fits equally well to its respective training data, but training using diverse kinetics led to far better predictions in comparison to any of the other sets of perturbations (Figure 3H). A comprehensive comparison of model predictions with all testdata under each of the 11 different training conditions is given in Video S1. A detailed analysis demonstrates that when using training data restricted to a single type, prediction errors increase as test data deviate away from kinetically similar training data (Figures 4A and S4A–S4I).

Figure 4.

Figure 4

Kinetically Diverse Stimulations Constrain Model Parameters Substantially Better than Homogeneous Kinetic Types

(A) Comprehensive quantification of prediction errors of each kinetic stimulation type when five datasets of any given type is used to train the model (lighter colors denote smaller errors).

(B) Comparison of prediction errors (Predictions3) under increasing amounts of training data of the same kinetic type (e.g., step, linear, quadratic) or diverse kinetic types (0.3 and 0.7 M). Horizontal dotted black line denotes the standard deviation of the simulated data.

(C–E) Verification of FIM using MLEs. Under each of six step versus six diverse 0.7 M kinetics training, total model uncertainty is estimated by determinant of (C) FIM−1, (D) covariance matrix of MLEs, and (E) covariance matrix of Bayesian posterior. Analyses in (C–E) are all performed with four free model parameters (λ11, λ13, λ15, λ21) (see Transparent Methods and Figures S4J and S4K).

(F) Parameter uncertainty of the model estimated as inverse of determinant of FIM (i.e., D-Optimality) when the model is fit to all six of each dataset.

(G) Ellipses are representative 95% confidence intervals for a representative pair of parameters estimated from FIM−1. Colors correspond to the five different sets of experiments considered in (F).

(H) FIM optimal experimental design by minimizing the determinant of inverse FIM determines D-Optimal experiments for increasing number of datasets (n = 1,2, …6) among all possible combinations (36 choose n) (purple) compared with steps (black) or diverse 0.7 M (blue).

(I) Comparison of prediction errors (Predictions1, Predictions2, and Predictions3) under increasing amounts of training data of the steps, diverse 0.7 M, and D-Optimal experiments. Results are from three independent model fits and their corresponding predictions. Horizontal dotted black line denotes average standard deviation of the simulated data.

(J–L) Bayesian analysis to quantify model parameters posterior and predictions under different kinetics. Under (J) six steps, (K) six diverse 0.7 M, and (L) six D-Optimal training kinetics, model fits (red) and predictions (green) for 100 parameters sets sampled from Bayesian posterior are compared with their corresponding training and test data (gray). Results are from multiple independent chains (steps 3, diverse 3, and D-Optimal 7 chains) (see Transparent Methods). The values for the determinant of covariance of posterior (average over all independent chains), average fit errors (<FE>), and average prediction errors (<PE>) are given for each condition. The <FE> and <PE > quantifications are from 100 samples from posteriors of each independent chain. FE are from six training datasets; PE are from the remaining 52 test datasets (Figure 1H).

See also Figure S4.

Video S1. Diversified Kinetic Stimulations Disproportionately Improve Model Predictions, Related to Figures 3 and S3

Model fits (red) simultaneously to six data (or equivalent amount of data) of each kinetic (gray) and model predictions of the other kinetics (blue, yellow, green, and purple) are compared with their corresponding test data (gray); Training data are as follows:

Frame 1: six steps (t0, A), 6 × 50 min training data,

Frame 2: six roots (t, B), 6 × 50 min training data,

Frame 3: six linears (t1, C), 6 × 50 min training data,

Frame 4: six quadratics (t2, D), 6 × 50 min training data,

Frame 5: six quints (t5, E), 6 × 50 min training data,

Frame 6: six heptics (t7, F), 6 × 50 min training data,

Frame 7: staircase (J), 1 × 300 min training data,

Frame 8: Paulsatile1 (K), 1 × 300 min training data

Frame 9: Paulsatile2 (L), 1 × 300 min training data

Frame 10: Oscillatory (M), 1 × 300 min training data,

Frame 11: six diverse kinetics (one each of t0, t, t1, t2, t5, t7), 6 × 50 min training data,

Frame 12: repeat of Frame 11 except the YScale in (N) is linear in Frame 12.

Each condition is from 10 independent fits. Shaded plots (red, blue, yellow, green, purple) present 10, 50, and 90 percentiles out of 10 independent fits and their corresponding predictions over time. (N) Full boxplots of fit (FE) and predictions errors (PE1, PE2, PE3, and PE4) compared with standard deviation of the synthetic data (black dotted line) at each frame. Fit error (FE) statistics are drawn from ntrain datasets over 10 independent fits (10 × ntrain errors) where ntrain = 6 in Frames 1–6 and Frame 11, whereas ntrain = 1 in Frames 7–10 is the number of datasets used to train the model. Similarly, PE1, PE2, and PE3 are drawn from prediction errors of testdata1 (30 datasets over 10 independent fits), testdata2 (12 datasets over 10 independent fits), and testdata3 (6 datasets over 10 independent fits), respectively (Figure 1H). PE4 is prediction errors quantified from alternating stimulations that consist of staircase, pulsatile1, pulsatile2, and sinusoidal oscillatory inputs (Figures S3E–S3H). When either of the alternating stimulations is used as training data (Frames 7–10), PE4 is quantified over the other three alternative stimulations.

Download video file (2.2MB, flv)

To better understand why multiple diverse kinetic stimulations result in better predictions and reduced predictions uncertainties, we next sought to analyze the effects of different training inputs on prediction errors (Figures 4B and S3A–S3D). By comparing predictions for an increasing number of training data, we find that prediction errors decrease with increase in the amount of training data regardless of the training data type, but the prediction improvement is much faster when using diverse training data (Figure 4B). We applied both frequentist and Bayesian analyses to explore how variability in parameters and predictions arises from different experimental settings. For the frequentist analysis, we developed a Fisher Information Matrix (FIM) analysis framework to directly estimate the expected uncertainties of maximum likelihood estimations (MLE) when performing multiple independent replicas of the same experiment designs (Apgar et al., 2010; Fox and Munsky, 2019; Hagen et al., 2013; Komorowski et al., 2011) (Transparent Methods). We verified the FIM computation by performing MLE parameter searches for 1,000 simulated datasets per experiment design (Transparent Methods), and we found that 95% confidence intervals of MLE pairwise parameters match their 95% confidence intervals predicted by the FIM analysis (Figures 4C, 4D, S4J, and S4K). The verified FIM analysis shows that MLE estimates will have the least uncertainty when trained with diverse kinetic profiles when compared with homogeneous profiles (compare ellipses in Figures S4K and S4J, respectively). This improvement is explained by the fact that different kinetic profiles constrain different combinations of parameters resulting in different parameters uncertainties (Figures 4F and S4L–S4O), as shown for a specific pair of parameters (λ1112) that defines the regulation of the terminal kinase through the activity of phosphatases (Figure 4G). Next, we employed a complementary Bayesian analysis to quantify the posterior uncertainty of parameters given a single set of training data (see Transparent Methods). In agreement with the frequentist analysis, we found that posteriors for homogeneous inputs were much broader than those found for the diverse input signals (Figure 4E).

Because the frequentist FIM analysis requires no data generation or fitting, it is many orders of magnitude more computationally efficient than the Bayesian analysis. Using the verified FIM to estimate the expected uncertainty for a given combination of input kinetics, we could next explore all possible combinations (36 choose n) for up to n kinetic inputs (2,391,495 total). Through this exhaustive search, we found that the D-Optimal (i.e., the set of experiments expected to minimize the determinant of FIM−1) combination of n = 6 kinetic inputs was a diverse combination composed of inputs of two steps to 0.5 and 0.7 M (t0), one root to 0.7 M (t), one quadratic to 0.5 M (t2), and two heptics to 0.5 and 0.7 M (t7). This optimal experiment further reduces total parameter uncertainty compared with steps or the original diverse kinetics (Figure 4H). Comparing the determinants of covariance of the posterior (Det(cov)) for all parameters using the Bayesian approach, we find that the posterior for the FIM D-Optimal inputs was tighter when compared with diverse and homogeneous step input signals (compare Det(cov) in Figures 4J–4L).

We next applied these frequentist and Bayesian perspectives to ask how parameter uncertainty affects prediction accuracy. To answer this question from the frequentist perspective (i.e., how sensitive are predictions to the spread of MLE-based estimations corresponding to different replicas of the same experiment), we sampled parameter sets from a multivariate distribution centered at the true parameters and distributed with a covariance matrix equal to the inverse FIM. To answer this question from the Bayesian perspective (i.e., how sensitive are predictions to parameter uncertainty after constraining to a single replica experiment), we used Monte Carlo to sample parameter sets from the posterior distribution (see Transparent Methods). In both cases, we found that predictions based upon models fit to the diverse data as well as the FIM D-Optimal design provided much tighter and more accurate estimates of behaviors in unseen testing conditions (Figures 4I–4L). Our results indicate that six-step training datasets achieve an average prediction error within 143% of the standard deviation of data. In comparison, six diverse and six D-Optimal datasets result in average prediction errors within 79% and 87% of the standard deviation of the data (compare <PE>/std in Figures 4J–4L). We find that with the optimally designed experiments, a combination of three diverse experiments (comprised of one step (t0), one quadratic (t2), and one heptic (t7) input), constrain the parameters sufficiently well to make predictions within 70% of the standard deviation of the data (Figure 4I). For clarity, results in Figure 4I are obtained using the frequentist approach while those in Figures 4J–4L are obtained using the Bayesian approach.

These results indicate that signaling models built based on one type of kinetic stimulations may be predictive under different intensities of that same kinetic input, but they are likely to fail to predict pathway responses upon other types of kinetic stimulations. This is important to consider because most computational models to date are often parametrized with measurements performed under constant stimulation profiles (Hao and O'Shea, 2012; Shaw et al., 2019). We find that training the model simultaneously with diverse kinetics constrains parameters better and improves predictions substantially.

Diverse Kinetic Stimulations Improve Identification of Model Structures

Next, we examined how kinetically different cell stimulations affect model structure identifiability (e.g., the number and mechanisms of interacting signaling proteins), and we sought to elucidate the contribution of specific signaling proteins to overall dynamic signaling responses. Figures 5 and S5 show fits (red) and predictions (blue, yellow, green, and purple) of five models with varying complexity to six different signaling response dynamics. This analysis is performed using datasets that are simulated from model M3. Simpler models were built by removing one or two regulatory elements from M3 to form M2 or M1, respectively, to simulate two mutants of the true model where the corresponding kinase activities are removed resulting in loss of feedback regulations (Figures S5A–S5C). A more complex model was built by adding two extra regulation elements to the true model, which could represent unknown regulatory elements yet to be discovered such as feedbacks from the terminal kinase to the upstream kinases (Figure S5D, model M4) (Suzuki et al., 2020). Finally, another complex model is generated by adding an entirely new signaling branch consisting of a third sensor node and introducing three additional regulatory elements to the true model that could correspond to SHO1 sub-branches in the HOG pathway (Figure S5E, model M5) (Tatebayashi et al., 2015). As expected, the simplest model cannot fit all data simultaneously (Figures 5A, S5A, and S5B, red), whereas the true model and the more complex models both fit well to the simulated data (Figures 5B, 5C, and S5C–S5E, red). In each case, multiple MLEs were performed and models were then used to make four sets of predictions (Figures 5D–5I, blue, green, yellow, purple) (see Transparent Methods). As expected, the simple model does not predict well (Figures 5D and 5G), whereas the medium and complex model predictions are consistent with the true responses, but with an increasing level of prediction uncertainties (Figures 5E, 5F, 5H, and 5I).

Figure 5.

Figure 5

Kinetically Diverse Stimulation Profiles Enable Unprecedented Model Predictions

(A–I) Predictions enable identification of the true model among models of increasing complexities. (A–C) Three models with increasing complexity from left to right (M1 in A, M3 in B, and M5 in C) each are trained with six diverse kinetics datasets that are simulated from M3 (see Transparent Methods). Model fits are shown in red and compared with training data in gray. (D–I) Model predictions (examples of Predictions1 in blue, Predictions2 in yellow, Predictions3 in green, and Predictions4 in purple) are compared with their corresponding test data (gray) indicated with stars in the table of train/test data in (J). Thick lines and shaded areas in gray show the means and the standard deviations of synthetic data, respectively. Thick lines and shaded areas in red, blue, yellow, green, and purple show 10, 50, and 90 percentiles out of independent MLE fits of each model and their corresponding predictions over time.

(J) An overview of sets of training and testing data that are used for fits and predictions. Red, blue, yellow, green, and purple squares indicate the datasets used in (K), and stars indicate predictions that are presented in (D–I). (K) Quantification (boxplots) of fit and prediction errors for five models of increasing complexity after training each model with six diverse kinetics (see Transparent Methods and Figure S5 for model definitions and further analysis). Horizontal dotted black line denotes the standard deviation of the simulated data. Fit errors (FE) statistics are drawn from six train datasets over multiple independent trainings. Similarly, PE1, PE2, PE3, and PE4 are drawn from prediction errors of testdata1 (30 datasets, blue in J), testdata2 (12 datasets, yellow in J), testdata3 (6 datasets, green in J), and testdata4 (4 datasets, purple in J with stimulation profiles given in Figures S3E–S3H), respectively, each collected over multiple independent fits. Predictions for M4 and M5 are all specifically collected from MLEs that have objective of at least as low as that of converged M3 MLEs.

(L) Quantification of fit and prediction errors for five models of increasing complexity after training each model with six step kinetics. See also Figure S5.

We next performed systematic cross-validation of fits to specific input-output responses and prediction validation analyses using other datasets to explore how identification of a predictive model structure (e.g., selecting among models M1 to M5) depends on the specific types of training and validation data (Figures 5J–5L, S5F, and S5G). In all cases, as the model complexity increases from M1 to M5, the fits to training data improve monotonically (Figures 5K and 5L, S5F, and S5G, red), but the validation errors should reach a minimum when using the model structure that is most predictive given the available data (Figures 5K and 5L, S5F, and S5G, non-red markers). When training data consist of steps, simpler models (M1 and M2) fit well to the simulated data, therefore it is not possible to distinguish the true model using only steps data (Figure 5L). By quantifying predictions of each model upon all kinetic inputs (Figure 5J), it is possible on average to identify the true model (M3), but the certainty of structure identification depends heavily on the type of validation data (compare different colors in Figure 5L). In comparison, similar to diverse training data (Figure 5K), when using optimal experiments suggested by the FIM analysis, the over-simplified models fit less well to training data (Figures S5F and S5G), and validation data allowed for more certain selection of the true model. These results demonstrate that different kinetic cell stimulations can improve predictability in the process of complex model structure identification.

Diverse Kinetic Stimulations Improve Predictions of Mutant Responses

Finally, we examined how diverse kinetic stimulations affect predictive performance for other in silico biologically realistic mutations to specific signaling proteins in signal transduction pathways (Figures 6A and S6) (Hohmann et al., 2007; Mattison and Ota, 2000; O'Rourke and Herskowitz, 1998; Westfall and Thorner, 2006; Young et al., 2002). Sensitivity analysis of the wild-type (WT) model that was trained with diverse kinetics allowed us to categorize model parameters into two main groups of insensitive (sensitivity = 0) and sensitive (sensitivity = 1) parameters that we then use to predict putative mutations (Figures 6A, 6B, and S6). To determine whether diverse kinetic cell stimulations can identify biological mechanisms, we computationally introduced six mutations in our true model (Transparent Methods). These include three knockout mutants that remove basal deactivators on x1 (Δb1, blue cross), x2 (Δb2, red cross), or x3 (Δb3, teal cross). Mutants Δb1 and Δb2 can be interpreted as removing the constitutive deactivation of SLN1 or SHO1 branches, respectively, which could change the half-life of their active states. Mutant Δb3 could represent regulation of Pbs2 through deletion of Ptc1/2/3 phosphatases. The node x4 can be mutated by removing kinase activity of x4 (ΔK, purple cross, e.g., kinase dead or kinase activity inhibited) and can be regulated through overexpression (b4, OE, orange), and underexpression (b4, UE, brown) of the basal deactivator on x4 such as the Ptp2/3 phosphatases (Figure 6A).

Figure 6.

Figure 6

Kinetically Diverse Stimulations Elucidate Dynamic Effects of Mutants

Training the WT true model (M3 in Figure 5B) on its signaling dynamics upon diverse kinetics (red in Figure 5J) reveals insights into the response of several mutated models under all tested kinetic inputs. (A) The six mutants of the WT model correspond to deletions of basal regulators (e.g., phosphatases) on x1 (Δb1, blue), x2 (Δb2, red), or x3 (Δb3, teal); removal of the kinase activity (e.g., kinase dead or inhibited MAPK) of x4 (ΔK, purple); and under- or overexpressing b4 (e.g. phosphatase) that regulates x4 (UE in orange and OE in brown). (B) Sensitivity analysis of WT model with respect to model parameters around their best value from the fits predicts insensitive (e.g., λ3 and λ4 corresponding to Δb1) and sensitive (λ15 and λ16 corresponding to ΔK) mutants (see Figure S6). (C) Comparing the activation dynamics of mutants (simulated using Λ0 in Table S1) with WT under a representative kinetic input (t9 to 0.2 M, inset). Thick line and shaded area (colors) show the mean and the standard deviation of synthetic data for the corresponding strains. (D) Mutation severity, defined as the difference in activation dynamics of a mutant from that of the WT (Transparent Methods). (E) Mutation severity is shown for two representative mutant strains, Δb1 (insensitive) and ΔK (sensitive) over all kinetic types summed over all their final concentrations. See also Figure S6.

We simulated corresponding synthetic pathway activations from all six mutants upon all 54 kinetic stimulations (Transparent Methods). Pathway activations upon a representative kinetic stimulation input of t9 are shown for all six mutants compared with the WT in Figure 6C. To quantify the differences in signaling dynamics between normal and mutant cells, we define severity as the difference in the activation dynamics of a mutant compared with that of the WT (Figure 6D and Transparent Methods). We observed that pathway activation in some mutant strains shows no difference from WT under all kinetic inputs (Δb1 and Δb3, mutation “severity” = 0), whereas other mutants with non-zero severity (Δb2, ΔK, UE, and OE) are different from WT (Figure 6C). Comparing the sensitivity with respect to specific parameters in the model (Figure 6B) with the severity of the corresponding mutants' effect on signaling dynamics (Figure 6E) highlights that sensitive parameters are indicators of how much specific mutations will affect signaling under different kinetics of specific types (Figure S6). Furthermore, constraining the parameters of WT model on its synthetic data under diverse kinetic stimulations enabled us to accurately predict the activation responses of all the six mutants over time under all 54 kinetic inputs tested (Figures 7 and S7). These results are quantitatively summarized in Figure 7E, showing that the prediction errors for simulated mutations are comparable to prediction errors of non-mutated WT cells.

Figure 7.

Figure 7

Kinetically Diverse Stimulations Enable to Predict Mutants' Response Dynamics

Training the WT true model on its signaling dynamics upon diverse kinetics enables predictions for the response of mutants upon all tested kinetic inputs.

(A) A mutant where the kinase activity of x4 is eliminated (e.g., kinase dead or inhibited MAPK, purple cross) leads to a loss of feedback regulation from x4 on x2.

(B) Extracellular stimulation of the models results in elongated response adaptation in ΔK mutant compared with WT.

(C and D) Example pathway activation predictions (predictions3 in green) are compared with their corresponding synthetic data for WT (C, gray) and ΔK (D, purple) under representative t9 kinetic inputs.

(E) Prediction errors (PE1, PE2, PE3) are quantified over all 54 kinetics (Figure 1H) for each of the six mutants (Figure 6A) compared with WT. These predictions are made using parameters constrained from five independent fits of WT model tarined on its signaling dynamics upon six diverse kinetics in Figure 5J.

See also Figure S7.

Discussion

Our experimental and simulation results demonstrate that different kinetic cell stimulations of a pathway give rise to distinct signaling activation dynamics (Figure 1). When compared with the same amount of any type of homogeneous kinetics, kinetically diverse cell stimulations perform much better to constrain complex model parameter sets and result in substantially reduced predictions errors (Figures 3 and 4). The FIM analysis approach provides a rigorous and clear mathematical interpretation of this effect. Specifically, the eigenvector of the FIM corresponding to the greatest eigenvalue indicates the parameter combination that is most likely to be accurately identified using a given kinetic stimulation (Figure S4O). By comparing eigenvectors corresponding to large FIM eigenvalues, it is easy to see which kinetic type may be most effective to constrain specific parameters of a complex regulatory network. Similarly, by examining eigenvectors corresponding to small FIM eigenvalues, it becomes apparent which parameter combinations cannot be precisely identified using a specific kinetic input. By choosing diverse and complementary input kinetics, such that the full parameter space is spanned by high-eigenvalue FIM eigenvectors from one or multiple kinetic inputs, it becomes possible to constrain the entire parameter set (Figures 4F, 4G, and S4L–S4O).

Better constrained parameters make it easier to identify predictive models of signal transduction (Figures 5 and S5). This also enables improved predictions of pathway activation dynamics for protein mutant strains upon kinetic stimulations (Figures 6, S6, and S7). To illustrate these predictions, we revisited the HOG pathway as an example (Figure 6). We predicted that mutations that alter the SLN1 branch (b1) will not impact Hog1 signaling dynamics, which is consistent with previously published observation that changes in Ypd1 do not alter Hog1 signaling (Mukherji, 2010). Next we focused on the node x2 that describes the SHO1 branch in which mutating b2 could result in an increase or decrease in Hog1 signaling. We predicted that Δb2 results in increased Hog1 signaling amplitude. In addition, the SHO1 branch can be altered in its activity by the feedback regulation from Hog1 kinase. In our model, we predicted that removing Hog1 kinase activity or inhibiting Hog1 kinase activity (ΔK) results in increased and prolonged Hog1 activation, which is consistent with previous published experimental work on small molecule inhibition of Hog1 activity (Muzzey et al., 2009; Westfall and Thorner, 2006). Next, we focused on the node x3, which represents Pbs2. Our model predicted that deletions of individual phosphatases such as Ptc1/2/3 would not alter Hog1 signaling dynamics through changed Pbs2 activity (Δb3) (Mapes and Ota, 2004; Warmka et al., 2001). Last, x4 represents the terminal kinase Hog1 that can be deactivated through phosphatases Ptc1/2/3 and Ptp2/3 (b4) (Martín et al., 2005; Murakami et al., 2008; Warmka et al., 2001; Young et al., 2002). The model predicted that changing the value of b4 through over- or underexpression has a strong impact on Hog1 signaling intensity. The experimental observations of the effect of above-mentioned mutants on the HOG pathway dynamics under step profiles qualitatively aligns with our simulated data for all mutants validating our modeling approach for mutants (Figure 6) (O'Rourke and Herskowitz, 1998; Saito and Posas, 2012; Tatebayashi et al., 2015; Warmka et al., 2001). Using simulated data, we verified our mutant findings by quantitatively comparing predictions from each mutant to their corresponding synthetic data (Figure 7).

These results demonstrate that kinetic cell stimulations are ideally suited to discover novel regulatory interactions, reveal key functional proteins, identify predictive and biologically meaningful models, and help to gain novel insights to cell function or dysfunction. The potential for generality of kinetic stimulations is supported by emerging studies in many pathways and in different cell types. Examples are differential cellular responses upon step, pulsatile, or linear stimulations of EGF (Akt pathway, PC12 cells, Fujita et al., 2010), TGF-β (TGF-β pathways, myoblast progenitor C2C12 cell line, Sorre et al., 2014), cytokines IL-6 (STAT1/3, lymphocytes, Twohig et al., 2019) and TNF (NF-κB signaling, HeLa/neuroblastoma cells, Ashall et al., 2009; Mokashi et al., 2019), insulin (Akt/S6K pathway, rat hepatoma Fao cells, Kubota et al., 2012), H2O2 (Yap1 signaling, yeast cells (Goulev et al., 2017), or diacetyl in tetramisole hydrochloride (C. elegans AWA neurons, Rahi et al., 2017). We believe that implementing diverse kinetic stimulations may provide new opportunities in constraining complex model parameters in situations where experimental design methods based on instantaneous changes in the environment such as steps or pulses of varying heights or frequencies have not provided great success (Billings, 2013; Isermann and Münchhof, 2011). Given the complexity of signal transduction networks (Akhurst and Hata, 2012; Cildir et al., 2016; Hanahan and Weinberg, 2000; Hata and Chen, 2016; Hotamisligil and Davis, 2016) and their limited response bandwidths (Hersen et al., 2008), steps or pulsatile stimulations of even varying intensities or frequencies may not provide enough kinetics to efficiently probe the rich dynamics underlying these networks (Billings, 2013; Isermann and Münchhof, 2011). Our approach on the other hand is widely applicable to many biological pathways that respond to a kinetic cell stimulation with a dynamic signaling response, and our approach may have far-reaching implications for predicting pathway response upon specific mutations or drugs (Akhurst and Hata, 2012; Cildir et al., 2016; Hanahan and Weinberg, 2000; Handly et al., 2016; Hata and Chen, 2016; Hotamisligil and Davis, 2016; Janes and Lauffenburger, 2013; Rowland et al., 2011; Vanhaelen et al., 2017; Lim and Bruce Mayer, 2017). Being able to predict the pathway activation dynamics upon mutations or upon environmental changes may help design better drug treatment regimes. In addition, these results could benefit our understanding of human biology, particularly in areas such as optogenetics, gene regulatory networks, or synthetic biology, where predictive understanding of the system behavior with respect to extracellular kinetics or intercellular genetic perturbations is of immense interest (Aoki et al., 2019; Bashor et al., 2019; Gardner, 2013; Harrigan et al., 2018).

Limitations of the Study

The basic requirement for applying this framework to other systems is (1) that different environmental gradients can be generated, (2) that the pathway response changes over time, (3) that the pathway response time is homogeneous from cell to cell, and (4) that the central limit theorem is fulfilled in single cells.

Resource Availability

Lead Contact

Further information and requests for resources should be directed to and will be fulfilled by the Lead Contact, Gregor Neuert (gregor.neuert@vanderbilt.edu).

Materials Availability

This study did not generate new unique reagents.

Data and Code Availability

The datasets and code generated during this study are available at https://github.com/neuertlab/Jashnsaz_iScience_2020. The published article includes all datasets generated and analyzed during this study.

Methods

All methods can be found in the accompanying Transparent Methods supplemental file.

Acknowledgments

G.N. is supported by NIH, United States, DP2 GM11484901, NIH R01GM115892 and Vanderbilt Startup Funds. J.J.H. is supported by NIH, United States, T32DK101003 and NSF, United States, GRFP. Z.R.F. and B.M. are supported by NIH, United States, R35 GM124747. ZRF was also supported by the Agence Nationale de la Recherche, France, ANR-18-CE91-0002, CyberCircuits. The authors thank Amanda Johnson, Alexander Thiemicke, Benjamin Kesler, Rohit Venkat, Joseph Cleland, Eric Ron, and Stuart McKnight for comments on the manuscript. This study used resources at the Advanced Computing Center for Research and Education (ACCRE) at Vanderbilt University, Nashville, TN (NIH, United States, S10 Shared Instrumentation Grant 1S10OD023680-01 [Meiler]).

Author Contributions

Conceptualization, G.N., B.M., H.J., and Z.R.F.; Methodology, G.N., B.M., H.J., and Z.R.F.; Software, G.N., B.M., H.J., Z.R.F., and J.J.H.; Validation, H.J. and Z.R.F.; Formal Analysis, H.J.; Investigation, H.J. and G.L.; Data Curation, H.J.; Writing – Original Draft, H.J., B.M., and G.N.; Writing – Review & Editing, H.J., B.M., G.N., Z.R.F., and J.J.H.; Visualization, H.J.; Supervision, G.N. and B.M.; Project Administration, G.N. and H.J., Funding Acquisition, G.N. and B.M.

Declaration of Interests

The authors declare that they have no conflict of interest.

Published: October 23, 2020

Footnotes

Supplemental Information can be found online at https://doi.org/10.1016/j.isci.2020.101565.

Contributor Information

Brian Munsky, Email: brian.munsky@colostate.edu.

Gregor Neuert, Email: gregor.neuert@vanderbilt.edu.

Supplemental Information

Document S1. Transparent Methods, Figures S1–S7, and Table S1
mmc1.pdf (12.4MB, pdf)

References

  1. Adler M., Alon U. Fold-change detection in biological systems. Curr. Opin. Syst. Biol. 2018;8:81–89. [Google Scholar]
  2. Akhurst R.J., Hata A. Targeting the TGFβ signalling pathway in disease. Nat. Rev. Drug Discov. 2012;11:790–811. doi: 10.1038/nrd3810. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Albeck J.G., Mills G.B., Brugge J.S. Frequency-modulated pulses of ERK activity transmit quantitative proliferation signals. Mol. Cell. 2013;49:249–261. doi: 10.1016/j.molcel.2012.11.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Aoki S.K., Lillacci G., Gupta A., Baumschlager A., Schweingruber D., Khammash M. A universal biomolecular integral feedback controller for robust perfect adaptation. Nature. 2019;570:533–537. doi: 10.1038/s41586-019-1321-1. [DOI] [PubMed] [Google Scholar]
  5. Apgar J.F., Toettcher J.E., Endy D., White F.M., Tidor B. Stimulus design for model selection and validation in cell signaling. PLoS Comput. Biol. 2008;4:e30. doi: 10.1371/journal.pcbi.0040030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Apgar J.F., Witmer D.K., White F.M., Tidor B. Sloppy models, parameter uncertainty, and the role of experimental design. Mol. Biosyst. 2010;6:1890. doi: 10.1039/b918098b. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Ashall L., Horton C.A., Nelson D.E., Paszek P., Harper C.V., Sillitoe K., Ryan S., Spiller D.G., Unitt J.F., Broomhead D.S. Pulsatile stimulation determines timing and specificity of NF-κB-dependent transcription. Science. 2009;324:242–246. doi: 10.1126/science.1164860. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Bandara S., Schlöder J.P., Eils R., Bock H.G., Meyer T. Optimal experimental design for parameter estimation of a cell signaling model. PLoS Comput. Biol. 2009;5:e1000558. doi: 10.1371/journal.pcbi.1000558. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Bashor C.J., Patel N., Choubey S., Beyzavi A., Kondev J., Collins J.J., Khalil A.S. Complex signal processing in synthetic gene circuits using cooperative regulatory assemblies. Science. 2019;364:593–597. doi: 10.1126/science.aau8287. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Billings S.A. John Wiley & Sons; 2013. Nonlinear System Identification: NARMAX Methods in the Time, Frequency, and Spatio-Temporal Domains. [Google Scholar]
  11. Briscoe J., Small S. Morphogen rules: design principles of gradient-mediated embryo patterning. Dev. 2015;142:3996–4009. doi: 10.1242/dev.129452. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Casey F.P., Baird D., Feng Q., Gutenkunst R.N., Waterfall J.J., Myers C.R., Brown K.S., Cerione R.A., Sethna J.P. Optimal Experimental Design in an EGFR Signaling and Down-Regulation Model. ArXiv. 2006 doi: 10.1049/iet-syb:20060065. http://arxiv.org/abs/q-bio/0610024 [DOI] [PubMed] [Google Scholar]
  13. Cildir G., Low K.C., Tergaonkar V. Noncanonical NF-κB signaling in health and disease. Trends Mol. Med. 2016;22:414–429. doi: 10.1016/j.molmed.2016.03.002. [DOI] [PubMed] [Google Scholar]
  14. Csete M.E., Doyle J.C. Reverse engineering of biological complexity. Science. 2002;295:1664–1669. doi: 10.1126/science.1069981. [DOI] [PubMed] [Google Scholar]
  15. Cuadrado A., Nebreda A.R. Mechanisms and functions of p38 MAPK signalling. Biochem. J. 2010;429:403–417. doi: 10.1042/BJ20100323. [DOI] [PubMed] [Google Scholar]
  16. Efremova M., Vento-Tormo R., Park J.-E., Teichmann S.A., James K.R. Immunology in the era of single-cell technologies. Annu. Rev. Immunol. 2020;38:727–757. doi: 10.1146/annurev-immunol-090419-020340. [DOI] [PubMed] [Google Scholar]
  17. Faller D., Klingmüller U., Timmer J. Simulation methods for optimal experimental design in systems biology. Simulation. 2003;79:717–725. [Google Scholar]
  18. Ferrigno P., Posas F., Koepp D., Saito H., Silver P.A. Regulated nucleo/cytoplasmic exchange of HOG1 MAPK requires the importin β homologs NMD5 and XPO1. EMBO J. 1998;17:5606–5614. doi: 10.1093/emboj/17.19.5606. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Fox Z.R., Munsky B. The finite state projection based Fisher information matrix approach to estimate information and optimize single-cell experiments. PLoS Comput. Biol. 2019;15:e1006365. doi: 10.1371/journal.pcbi.1006365. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Fujita K.A., Toyoshima Y., Uda S., Ozaki Y.I., Kubota H., Kuroda S. Decoupling of receptor and downstream signals in the Akt pathway by its low-pass filter characteristics. Sci. Signal. 2010;3:ra56. doi: 10.1126/scisignal.2000810. [DOI] [PubMed] [Google Scholar]
  21. Gardner T.S. Synthetic biology: from hype to impact. Trends Biotechnol. 2013;31:123–125. doi: 10.1016/j.tibtech.2013.01.018. [DOI] [PubMed] [Google Scholar]
  22. Goulev Y., Morlot S., Matifas A., Huang B., Molin M., Toledano M.B., Charvin G. Nonlinear feedback drives homeostatic plasticity in H2O2 stress response. Elife. 2017;6:e23971. doi: 10.7554/eLife.23971. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Granados A.A., Crane M.M., Montano-Gutierrez L.F., Tanaka R.J., Voliotis M., Swain P.S. Distributing tasks via multiple input pathways increases cellular survival in stress. Elife. 2017;6:e21415. doi: 10.7554/eLife.21415. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Groß A., Kracher B., Kraus J.M., Kühlwein S.D., Pfister A.S., Wiese S., Luckert K., Pötz O., Joos T., Van Daele D. Representing dynamic biological networks with multi-scale probabilistic models. Commun. Biol. 2019;2:1–12. doi: 10.1038/s42003-018-0268-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Hagen D.R., White J.K., Tidor B. Convergence in parameters and predictions using computational experimental design. Interf. Focus. 2013;3:20130008. doi: 10.1098/rsfs.2013.0008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Hanahan D., Weinberg R.A. The hallmarks of cancer. Cell. 2000;100:57–70. doi: 10.1016/s0092-8674(00)81683-9. [DOI] [PubMed] [Google Scholar]
  27. Handly L.N., Yao J., Wollman R. Signal transduction at the single-cell level: approaches to study the dynamic nature of signaling networks. J. Mol. Biol. 2016;428:3669–3682. doi: 10.1016/j.jmb.2016.07.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Hao N., O’Shea E.K. Signal-dependent dynamics of transcription factor translocation controls gene expression. Nat. Struct. Mol. Biol. 2012;19:31–39. doi: 10.1038/nsmb.2192. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Hao N., Behar M., Parnell S.C., Torres M.P., Borchers C.H., Elston T.C., Dohlman H.G. A systems-biology analysis of feedback inhibition in the Sho1 osmotic-stress-response pathway. Curr. Biol. 2007;17:659–667. doi: 10.1016/j.cub.2007.02.044. [DOI] [PubMed] [Google Scholar]
  30. Harrigan P., Madhani H.D., El-Samad H. Real-time genetic compensation defines the dynamic demands of feedback control. Cell. 2018;175:877–886.e10. doi: 10.1016/j.cell.2018.09.044. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Hata A., Chen Y.G. TGF-β signaling from receptors to smads. Cold Spring Harb. Perspect. Biol. 2016;8:a022061. doi: 10.1101/cshperspect.a022061. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Hersen P., McClean M.N., Mahadevan L., Ramanathan S. Signal processing by the HOG MAP kinase pathway. Proc. Natl. Acad. Sci. U S A. 2008;105:7165–7170. doi: 10.1073/pnas.0710770105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Hohmann S., Krantz M., Nordlander B. Yeast osmoregulation. Methods Enzymol. 2007;428:29–45. doi: 10.1016/S0076-6879(07)28002-4. [DOI] [PubMed] [Google Scholar]
  34. Hotamisligil G.S., Davis R.J. Cell signaling and stress responses. Cold Spring Harb. Perspect. Biol. 2016;8:a006072. doi: 10.1101/cshperspect.a006072. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Huang Y., Liu A.A., Lafon B., Friedman D., Dayan M., Wang X., Bikson M., Doyle W.K., Devinsky O., Parra L.C. Measurements and models of electric fields in the in vivo human brain during transcranial electric stimulation. Elife. 2017;6:e18834. doi: 10.7554/eLife.18834. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Isermann R., Münchhof M. Springer; 2011. Identification of Dynamic Systems. [Google Scholar]
  37. Janes K.A., Lauffenburger D.A. Models of signalling networks-what cell biologists can gain from them and give to them. J. Cell Sci. 2013;126:1913–1921. doi: 10.1242/jcs.112045. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Kinney M.A., Vo L.T., Frame J.M., Barragan J., Conway A.J., Li S., Wong K.-K., Collins J.J., Cahan P., North T.E. A systems biology pipeline identifies regulatory networks for stem cell engineering. Nat. Biotechnol. 2019;37:810–818. doi: 10.1038/s41587-019-0159-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Klipp E., Nordlander B., Krüger R., Gennemark P., Hohmann S. Integrative model of the response of yeast to osmotic shock. Nat. Biotechnol. 2005;23:5. doi: 10.1038/nbt1114. [DOI] [PubMed] [Google Scholar]
  40. Komorowski M., Costa M.J., Rand D.A., Stumpf M.P.H. Sensitivity, robustness, and identifiability in stochastic chemical kinetics models. Proc. Natl. Acad. Sci. U S A. 2011;108:8645–8650. [Google Scholar]
  41. Kubota H., Noguchi R., Toyoshima Y., Ozaki Y.I., Uda S., Watanabe K., Ogawa W., Kuroda S. Temporal coding of insulin action through multiplexing of the AKT pathway. Mol. Cell. 2012;46:820–832. doi: 10.1016/j.molcel.2012.04.018. [DOI] [PubMed] [Google Scholar]
  42. Labib M., Kelley S.O. Single-cell analysis targeting the proteome. Nat. Rev. Chem. 2020;4:143–158. doi: 10.1038/s41570-020-0162-7. [DOI] [PubMed] [Google Scholar]
  43. Lim W., Bruce Mayer T.P. Garland Science, Taylor and Francis Group, LLC; 2017. Cell Signaling: Principles and Mechanisms. [Google Scholar]
  44. Maeda T., Wurgler-Murphy S.M., Saito H. A two-component system that regulates an osmosensing MAP kinase cascade in yeast. Nature. 1994;369:242–245. doi: 10.1038/369242a0. [DOI] [PubMed] [Google Scholar]
  45. Mapes J., Ota I.M. Nbp2 targets the Ptc1-type 2C Ser/Thr phosphatase to the HOG MAPK pathway. EMBO J. 2004;23:302–311. doi: 10.1038/sj.emboj.7600036. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Martín H., Flández M., Nombela C., Molina M. Protein phosphatases in MAPK signalling: we keep learning from yeast. Mol. Microbiol. 2005;58:6–16. doi: 10.1111/j.1365-2958.2005.04822.x. [DOI] [PubMed] [Google Scholar]
  47. Mattison C.P., Ota I.M. Two protein tyrosine phosphatases, Ptp2 and Ptp3, modulate the subcellular localization of the Hog1 MAP kinase in yeast. Genes Dev. 2000;14:1229–1235. [PMC free article] [PubMed] [Google Scholar]
  48. Mettetal J.T., Muzzey D., Gómez-Uribe C., van Oudenaarden A., Bustamante C., van Oudenaarden A. The frequency dependence of osmo-adaptation in Saccharomyces cerevisiae. Science. 2008;319:482–484. doi: 10.1126/science.1151582. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Mitchell A., Wei P., Lim W.A. Oscillatory stress stimulation uncovers an Achilles’ heel of the yeast MAPK signaling network. Science. 2015;350:1379–1383. doi: 10.1126/science.aab0892. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Mokashi C.S., Schipper D.L., Qasaimeh M.A., Lee R.E.C. A System for analog control of cell culture dynamics to reveal capabilities of signaling networks. iScience. 2019;19:586–596. doi: 10.1016/j.isci.2019.08.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Mukherji S. Massachusetts Institute of Technology; 2010. Robustness and Tunability in Biological Networks. [Google Scholar]
  52. Murakami Y., Tatebayashi K., Saito H. Two adjacent docking sites in the yeast Hog1 Mitogen-Activated Protein (MAP) Kinase differentially interact with the Pbs2 MAP Kinase Kinase and the Ptp2 protein tyrosine phosphatase. Mol. Cell. Biol. 2008;28:2481–2494. doi: 10.1128/MCB.01817-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Muzzey D., Gómez-Uribe C.A., Mettetal J.T., van Oudenaarden A. A systems-level analysis of perfect adaptation in yeast osmoregulation. Cell. 2009;138:160–171. doi: 10.1016/j.cell.2009.04.047. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Oyler-Yaniv A., Oyler-Yaniv J., Whitlock B.M., Liu Z., Germain R.N., Huse M., Altan-Bonnet G., Krichevsky O. A tunable diffusion-consumption mechanism of cytokine propagation enables plasticity in cell-to-cell communication in the immune system. Immunity. 2017;46:609–620. doi: 10.1016/j.immuni.2017.03.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. O’Rourke S.M., Herskowitz I. The Hog1 MAPK prevents cross talk between the HOG and pheromone response MAPK pathways in Saccharomyces cerevisiae. Genes Dev. 1998;12:2874–2886. doi: 10.1101/gad.12.18.2874. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Rahi S.J., Larsch J., Pecani K., Katsov A.Y., Mansouri N., Tsaneva-Atanasova K., Sontag E.D., Cross F.R. Oscillatory stimuli differentiate adapting circuit topologies. Nat. Methods. 2017;14:1010–1016. doi: 10.1038/nmeth.4408. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Romers J., Thieme S., Münzner U., Krantz M. A scalable method for parameter-free simulation and validation of mechanistic cellular signal transduction network models. Npj Syst. Biol. Appl. 2020;6:1–13. doi: 10.1038/s41540-019-0120-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Rowland M., Peck C., Tucker G. Physiologically-based pharmacokinetics in drug development and regulatory science. Annu. Rev. Pharmacol. Toxicol. 2011;51:45–73. doi: 10.1146/annurev-pharmtox-010510-100540. [DOI] [PubMed] [Google Scholar]
  59. Saito H., Posas F. Response to hyperosmotic stress. Genetics. 2012;192:289–318. doi: 10.1534/genetics.112.140863. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Schaber J., Baltanas R., Bush A., Klipp E., Colman-Lerner A. Modelling reveals novel roles of two parallel signalling pathways and homeostatic feedbacks in yeast. Mol. Syst. Biol. 2012;8:622. doi: 10.1038/msb.2012.53. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Shaw W.M., Yamauchi H., Mead J., Gowers G.O.F., Bell D.J., Öling D., Larsson N., Wigglesworth M., Ladds G., Ellis T. Engineering a model cell for rational tuning of GPCR signaling. Cell. 2019;177:782–796.e27. doi: 10.1016/j.cell.2019.02.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Shimizu T.S., Tu Y., Berg H.C. A modular gradient-sensing network for chemotaxis in Escherichia coli revealed by responses to time-varying stimuli. Mol. Syst. Biol. 2010;6:382. doi: 10.1038/msb.2010.37. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Sorre B., Warmflash A., Brivanlou A.H., Siggia E.D. Encoding of temporal signals by the TGF-β pathway and implications for embryonic patterning. Dev. Cell. 2014;30:334–342. doi: 10.1016/j.devcel.2014.05.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Steiner R.A., Bremner W.J., Clifton D.K. Regulation of luteinizing hormone pulse frequency and amplitude by testosterone in the adult male rat. Endocrinology. 1982;111:2055–2061. doi: 10.1210/endo-111-6-2055. [DOI] [PubMed] [Google Scholar]
  65. Suzuki S.K., Errede B., Dohlman H.G., Elston T.C. Model-driven experimental design identifies counter-acting feedback regulation in the osmotic stress response of yeast. BioRxiv. 2020:051599. doi: 10.1101/2020.04.20.051599. [DOI] [Google Scholar]
  66. Tatebayashi K., Yamamoto K., Nagoya M., Takayama T., Nishimura A., Sakurai M., Momma T., Saito H. Osmosensing and scaffolding functions of the oligomeric four-transmembrane domain osmosensor Sho1. Nat. Commun. 2015;6:6975. doi: 10.1038/ncomms7975. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Thiemicke A., Jashnsaz H., Li G., Neuert G. Generating kinetic environments to study dynamic cellular processes in single cells. Sci. Rep. 2019;9:10129. doi: 10.1038/s41598-019-46438-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Twohig J.P., Cardus Figueras A., Andrews R., Wiede F., Cossins B.C., Derrac Soria A., Lewis M.J., Townsend M.J., Millrine D., Li J. Activation of naïve CD4+ T cells re-tunes STAT1 signaling to deliver unique cytokine responses in memory CD4+ T cells. Nat. Immunol. 2019;20:458–470. doi: 10.1038/s41590-019-0350-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Vanhaelen Q., Aliper A.M., Zhavoronkov A. A comparative review of computational methods for pathway perturbation analysis: dynamical and topological perspectives. Mol. Biosyst. 2017;13:1692–1704. doi: 10.1039/c7mb00170c. [DOI] [PubMed] [Google Scholar]
  70. Wang C.J., Bergmann A., Lin B., Kim K., Levchenko A. Diverse sensitivity thresholds in dynamic signaling responses by social amoebae. Sci. Signal. 2012;5:ra17. doi: 10.1126/scisignal.2002449. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Warmka J., Hanneman J., Lee J., Amin D., Ota I. Ptc1, a type 2C Ser/Thr phosphatase, inactivates the HOG pathway by dephosphorylating the mitogen-activated protein kinase Hog1. Mol. Cell. Biol. 2001;21:51–60. doi: 10.1128/MCB.21.1.51-60.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Westfall P.J., Thorner J. Analysis of mitogen-activated protein kinase signaling specificity in response to hyperosmotic stress: use of an analog-sensitive HOG1 allele. Eukaryot. Cell. 2006;5:1215–1228. doi: 10.1128/EC.00037-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Young C., Mapes J., Hanneman J., Al-Zarban S., Ota I. Role of Ptc2 type 2C Ser/Thr phosphatase in yeast high-osmolarity glycerol pathway inactivation. Eukaryot. Cell. 2002;1:1032–1040. doi: 10.1128/EC.1.6.1032-1040.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Young J.W., Locke J.C.W., Elowitz M.B. Rate of environmental change determines stress response specificity. Proc. Natl. Acad. Sci. U S A. 2013;110:4140–4145. doi: 10.1073/pnas.1213060110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Zhang C., Tu H.-L., Jia G., Mukhtar T., Taylor V., Rzhetsky A., Tay S. Ultra-multiplexed analysis of single-cell dynamics reveals logic rules in differentiation. Sci. Adv. 2019;5:eaav7959. doi: 10.1126/sciadv.aav7959. [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Zi Z., Liebermeister W., Klipp E. A quantitative study of the Hog1 MAPK Response to fluctuating osmotic stress in saccharomyces cerevisiae. PLoS One. 2010;5:e9522. doi: 10.1371/journal.pone.0009522. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Video S1. Diversified Kinetic Stimulations Disproportionately Improve Model Predictions, Related to Figures 3 and S3

Model fits (red) simultaneously to six data (or equivalent amount of data) of each kinetic (gray) and model predictions of the other kinetics (blue, yellow, green, and purple) are compared with their corresponding test data (gray); Training data are as follows:

Frame 1: six steps (t0, A), 6 × 50 min training data,

Frame 2: six roots (t, B), 6 × 50 min training data,

Frame 3: six linears (t1, C), 6 × 50 min training data,

Frame 4: six quadratics (t2, D), 6 × 50 min training data,

Frame 5: six quints (t5, E), 6 × 50 min training data,

Frame 6: six heptics (t7, F), 6 × 50 min training data,

Frame 7: staircase (J), 1 × 300 min training data,

Frame 8: Paulsatile1 (K), 1 × 300 min training data

Frame 9: Paulsatile2 (L), 1 × 300 min training data

Frame 10: Oscillatory (M), 1 × 300 min training data,

Frame 11: six diverse kinetics (one each of t0, t, t1, t2, t5, t7), 6 × 50 min training data,

Frame 12: repeat of Frame 11 except the YScale in (N) is linear in Frame 12.

Each condition is from 10 independent fits. Shaded plots (red, blue, yellow, green, purple) present 10, 50, and 90 percentiles out of 10 independent fits and their corresponding predictions over time. (N) Full boxplots of fit (FE) and predictions errors (PE1, PE2, PE3, and PE4) compared with standard deviation of the synthetic data (black dotted line) at each frame. Fit error (FE) statistics are drawn from ntrain datasets over 10 independent fits (10 × ntrain errors) where ntrain = 6 in Frames 1–6 and Frame 11, whereas ntrain = 1 in Frames 7–10 is the number of datasets used to train the model. Similarly, PE1, PE2, and PE3 are drawn from prediction errors of testdata1 (30 datasets over 10 independent fits), testdata2 (12 datasets over 10 independent fits), and testdata3 (6 datasets over 10 independent fits), respectively (Figure 1H). PE4 is prediction errors quantified from alternating stimulations that consist of staircase, pulsatile1, pulsatile2, and sinusoidal oscillatory inputs (Figures S3E–S3H). When either of the alternating stimulations is used as training data (Frames 7–10), PE4 is quantified over the other three alternative stimulations.

Download video file (2.2MB, flv)
Document S1. Transparent Methods, Figures S1–S7, and Table S1
mmc1.pdf (12.4MB, pdf)

Data Availability Statement

The datasets and code generated during this study are available at https://github.com/neuertlab/Jashnsaz_iScience_2020. The published article includes all datasets generated and analyzed during this study.


Articles from iScience are provided here courtesy of Elsevier

RESOURCES