Skip to main content
PLOS Computational Biology logoLink to PLOS Computational Biology
. 2020 Jul 15;16(7):e1007909. doi: 10.1371/journal.pcbi.1007909

Perturbation biology links temporal protein changes to drug responses in a melanoma cell line

Elin Nyman 1,2,3,4,*,#, Richard R Stein 2,5,6,7,¤,*,#, Xiaohong Jing 8, Weiqing Wang 8, Benjamin Marks 1, Ioannis K Zervantonakis 1, Anil Korkut 9, Nicholas P Gauthier 1,2,7,‡,*, Chris Sander 1,2,7,‡,*
Editor: James R Faeder10
PMCID: PMC7384681  PMID: 32667922

Abstract

Cancer cells have genetic alterations that often directly affect intracellular protein signaling processes allowing them to bypass control mechanisms for cell death, growth and division. Cancer drugs targeting these alterations often work initially, but resistance is common. Combinations of targeted drugs may overcome or prevent resistance, but their selection requires context-specific knowledge of signaling pathways including complex interactions such as feedback loops and crosstalk. To infer quantitative pathway models, we collected a rich dataset on a melanoma cell line: Following perturbation with 54 drug combinations, we measured 124 (phospho-)protein levels and phenotypic response (cell growth, apoptosis) in a time series from 10 minutes to 67 hours. From these data, we trained time-resolved mathematical models that capture molecular interactions and the coupling of molecular levels to cellular phenotype, which in turn reveal the main direct or indirect molecular responses to each drug. Systematic model simulations identified novel combinations of drugs predicted to reduce the survival of melanoma cells, with partial experimental verification. This particular application of perturbation biology demonstrates the potential impact of combining time-resolved data with modeling for the discovery of new combinations of cancer drugs.

Author summary

Data-driven mathematical modeling of biological systems has enormous potential to understand and predict the interplay between molecular and phenotypic response to perturbation, and provides a rational approach to the nomination of therapy. In cancer, intense effort has focused on drugs that specifically target the machinery involved in tumor development, maintenance and response to therapy. Although many drugs have clinical efficacy in a fraction of patients, the response is rarely durable and patients often develop resistance within months. We believe that the robustness and complexity of living cells, as well as inaccurate assumptions about drug specificity in model systems, underlie the inadequacy of single-agent targeted therapy. In this work, we developed a framework to derive mathematical models of biological systems from molecular and phenotypic temporal responses, and test this framework on melanoma cells. These models are computationally executable and can be used to predict drug combinations likely to be effective at slowing growth or killing cancer cells. Our framework has several advantages: (1) drug specificity is learned, not assumed, during model training, (2) training on temporal (not static) response data improves predictive power, (3) data-driven dynamic models have the potential to accurately reflect a cellular system’s behavior in a context-specific manner.

Introduction

Targeted therapies are an important component of precision oncology as these agents—as opposed to standard chemotherapy—aim to counteract specific activating genetic or signaling pathway alterations and often have fewer side effects than conventional cytotoxic chemotherapy. Many targeted therapies have been approved by regulatory agencies for the treatment of various cancers [1]. However, initial response rates are generally not durable and tumors eventually develop resistance.

In melanoma, tumors with the common BRAF V600E/K gain-of-function mutation have been shown to have a remarkable response to drugs that specifically target the mutated protein kinase, such as the RAF inhibitor vemurafenib [2]. However, not all patients with a BRAF V600 mutation respond to targeted therapies, and the patients that do respond often develop resistance after only a few months. There are several known mechanisms of this resistance including primary resistance from loss-of-function mutations in PTEN that increase AKT signaling and reduce apoptosis, and CDK4 mutations and CyclinD1 amplification that promote cell cycle progression [3]. Mechanisms of acquired resistance include reactivation of the MEK/ERK pathway with new mutations in BRAF or NRAS and hyper-activation of receptor tyrosine kinases [3]. There is an urgent need for a more comprehensive understanding of resistant tumor cells in order to identify non-trivial therapeutic opportunities beyond targeting single genes.

We have previously developed a perturbation biology approach that simulates data-driven models to find combination vulnerabilities. These models are derived from measurements of the proteomic and phenotypic response of cancer cells in vitro to numerous drug combinations. For example, in a melanoma cell line, we predicted that a combination of a RAF or MEK inhibitor with a bromodomain inhibitor would be effective and synergistic and thereby reduce viability [4]. Such a combination (MEK with bromodomain inhibitor), has been subsequently proposed for a phase I/II clinical trial for small cell lung cancer and solid tumors with mutations in RAS [5]. Our approach has also been useful in identifying a synergistic drug combination (CDK4i with IGF1Ri) in dedifferentiated liposarcoma [6]. In these studies, we measured the protein response at a single time point and were thus not able to directly capture the transient cellular response to the drugs.

Several modeling frameworks to study cellular responses to perturbations over time have been proposed, for instance, ordinary differential equation models, dynamic Bayesian networks, and Boolean networks. Reaction-based models are assuming continuous dynamics and their kinetic parameters are inferred from data. Based on these fitted parameters they are executable and can be used to simulate the effect of unseen perturbations which can then be tested experimentally. They have been deployed to understand cellular responses to perturbations (e.g., [4, 711]). Dynamic Bayesian networks are stochastic and time-discrete and estimate probabilities of interactions based on data for the previous point in time, and have, for example, been used to study time-series observations in cancer [12]. Boolean networks are based on binary switches, and are discrete in time and state. However, this simplification of the underlying network, even though fuzzy logic variants allow for a softer discretization, is a rather strong one. Their benefit lies in the fact that less data is needed. Boolean networks have also been used to model cellular response data [1315]. Only few of the above mentioned approaches qualify to be called large scale (i.e., including more than 100 states) [4, 7, 13]. However, none of these—in contrast to the work presented here—is based on time-series observations.

We hypothesize that short-term responses to therapeutic intervention, implemented by adjustments in signaling networks, already reflect the shifts in cellular processes ultimately implemented in cellular long-term memory via genetic changes. Typically these long-term adaptations are amino acid changing mutations (‘missense mutations’) or amplification/deletion of DNA fragments (‘copy number changes’). It may therefore be possible to obtain evidence for the dominant resistance mechanisms in response to targeted intervention by observing the short term molecular signalling adaptations. In addition, time-series observations of response provide informative input to parameter inference for dynamic models, such as the one developed here, including the directionality, strength and sign of interactions [16, 17]. In order to study the short-term response to targeted therapies in melanoma, we produced time-resolved antibody-based measurements of protein and phospho-protein levels as A2058 cancer cells respond and adapt to drug perturbations. A2058 is resistant to both RAF and MEK inhibitors [18], and therefore represents melanoma patients that would fail one of the standard treatments. The obtained response profiles are used to infer interaction parameters of time-resolved mathematical models, which are selected for sparsity and for the ability to predict left-out data at reasonable accuracy. We simulated the consequences of all possible perturbations to these models when it comes to reduction of cell growth and induction of apoptosis. The results of these simulations were ranked by their predicted effectiveness. The most promising candidates, which are expected to reduce cell growth and increase apoptosis in melanoma, involve targeting the combination of IRS1 and EGFR. We experimentally tested the drug combination of NT157 (IRS1 inhibitor) with gefitinib (EGFR inhibitor) in A2058 cells, and find that (1) the drugs alone reduces cell growth substantially at high doses, and (2) the drugs in combination reduces cell growth further than any of the drugs alone.

In contrast to models of cell biological processes based on response data at a single time point (assumed to be steady state), deriving models from time-series data may allow one to better capture the dynamic contributions of molecular signaling processes to complex phenotypes such as cell proliferation and apoptosis. This systems biology paradigm of data-driven predictive dynamic models can be applied to other cancers, especially useful for tumors that are resistant to targeted therapies.

Results

The goal of this data-driven systems biology study is to identify actionable cellular vulnerabilities and as well as to nominate novel vulnerabilities for future drug development. Conceptually, we generated network models that link protein signaling to phenotype. These models are similar to static textbook pathway diagrams, but have the added benefit of being (1) highly specific for the system under study and (2) based on a well-defined mathematical formalism. In this study we produced dynamic (time-resolved) antibody-based measurements of protein and phospho-protein levels as the melanoma cell line A2058 adapts and responds to drugs (alone and in combination). These molecular and phenotypic data were used to train differential equations models that capture cellular dynamics. Using these models, we predicted and optimized the effects of untested molecular perturbations on cell growth and apoptosis (Fig 1).

Fig 1. Systematic experimental perturbation of cells leads to predictive network models.

Fig 1

We perturbed cells with combinations of targeted drugs and measured the time-resolved cellular response (step 1). These measurements were used as input to derive network models of the response to arbitrary combinatorial perturbations (step 2). Using these models, we identified drug combination targets that optimally reduce cell growth and increase apoptosis in a melanoma cell line (step 3).

Experimental workflow and data

We characterized the temporal response of proteins, phosphorylation sites, cell growth and apoptosis, after application of drugs alone and in combination. A2058 cells were systematically perturbed with single drugs alone (both low and high doses) or in combination (low doses only), which resulted in 54 total drug conditions (Fig 2).

Fig 2. Phenotypic and proteomic response to single and drug combinations.

Fig 2

The melanoma cell line A2058 was subjected to 54 drug combinations and the response of 124 (phospho-)proteins and two phenotypes (cell number and apoptosis) was measured at eight time points from 10 minutes to 67 hours. These data were used as input for model inference and prediction. The most informative data involved a subset of the proteins and phospho-proteins (black lines) that underwent the largest changes upon perturbation (AKT-pS473/474, ERK1/2-pT202/T204, cJUN-pS73, and p21) as well as apoptosis (blue line). The remaining 120 proteins and phospho-proteins together with cell counts had a less pronounced response to perturbation. Temporal response (vertical axis) is defined as log2(xperturbed(t)/xunperturbed(t)) where x(t) are concentrations or counts as in Eqs 1 and 2 and DMSO is the unperturbed control.

Selection of drug concentrations

For each drug, the concentration was chosen by literature review or experimental dose response measurements by western blot. The low dose was selected to reduce activity on a known target by 50%. We also used a high dose, which was defined as double the absolute concentration of the low dose.

Molecular measurements

Cells were collected at 8 time points after drug addition in logarithmic progression (10, 27 and 72 minutes and 3, 9, 24, 48, and 67 hours). The temporal proteomic response of the cells to the different conditions was derived using reverse phase protein array (RPPA) measurements in 124 total and phospho-protein levels [19] from minimally three biological replicates across two data sets. Antibodies were selected to broadly cover signaling pathways with known involvement in cancer (e.g., AKT, ERK, and JAK/STAT pathways).

Phenotypic measurements

We used live cell imaging to follow A2058 cells as they responded to drug treatment (Incucyte, Essen BioScience, Ann Arbor, MI, U.S.A.). We acquired GFP, mCherry, and phase images of all conditions every 3 hours for 72 hours following drug addition. Cell number was determined at each time point by counting using image segmentation software (Incucyte), which identified the transgenic H2B-mCherry fluoraphore. Apoptosis was determined by image segmentation and counting of the GFP channel, which measured a Caspase-3/7 fluorescent activity reagent (Essen BioScience). Data were sub-sampled to select time points that closely matched collection of the proteomic data (1, 3, 9, 24, 48, and 67 hours).

Agreement of experimental results with literature

As most of the selected drugs have well described signaling effects, i.e., kinase substrates whose activity is directly affected by the drugs (S1 Table), we asked whether each drug elicited the expected molecular changes. The drugs that target MEK, AKT, and JNK, resulted in the expected decrease of their known targets ERK1/2-pT202/T204, AKT-pS473/pT308, and CJUN-pS73, respectively (S2 Fig). RAF and mTOR inhibition did not result in a strong decrease in their direct targets (MEK1/2-pS217/221 and S6K-pT389, respectively), but they did significantly decrease signaling in further downstream nodes ERK1/2-pT202/T204 and S6-pS235/236. For the STAT3 and SRC inhibitors, the proteins’ highest-ranked responders are not known to be downstream targets, but may be indirectly affected via multiple steps, signaling feedback or crosstalk.

The live-cell imaging data (see Materials and methods) revealed the bromodomain inhibitor CPI203 (BET inhibitor) to be the most potent drug, at the chosen concentrations, in terms of both decrease in cell growth and increase in apoptosis after three days of treatment (S1 Table, S1 Fig). These data are in agreement with results on the effect of this inhibitor class in the drug-resistant melanoma cell line SkMel-133, which also strongly impaired cell growth in response to the bromodomain inhibitor JQ1 in combination with MEK and ERK inhibitors in our earlier work [4].

Network model construction and simulation

To analyze the dynamics of molecular and phenotypic response to drug perturbation and to predict the effect of unseen perturbations, we inferred and simulated network models from our rich experimental data (Fig 1). These data were divided into three disjoint subsets (top left of Fig 3). We first inferred the parameters of Eqs 24 on a subset of data (responses to single drugs, training dataset) by minimizing Eq 5 with added regularization term (Eq 6). The optimal regularization parameter λ* was then identified on a second subset of data (validation dataset) as that which minimized Eq 5 using the previously inferred model parameters. Finally, the model accuracy for the selected parameter set was estimated on a third dataset (test dataset). We then derived multiple models on the full dataset to predict the effect of unseen perturbations and rank them by their desired effect on cellular phenotype.

Fig 3. Model selection and error estimation.

Fig 3

Top left: The full dataset was divided into subsets: (i) a training dataset (gray) that contained single drug control measurements (DMSO), single drugs in low (first row) and high dose (diagonal, two times low dose), (ii) a validation dataset (green) was used to estimate the optimal regularization parameter λ*, and (iii) a test dataset (blue) was used to estimate model performance. Top right: calculated values for the Bayesian Information Criterion (BIC, gray) and number of non-zero interactions (magenta) as a function of the regularization parameter λ. Bottom left: The residual sum of squares on the validation dataset was used to identify the optimal regularization parameter λ*. The best predictive model was obtained for λ* = 3 according to lowest BIC and minimal error on the validation dataset. Error bars indicate the standard deviation from 10 independent runs. Bottom right: Agreement of measured and predicted protein and phospho-protein (dots) and phenotype levels (triangles) on the test dataset. The Pearson correlation coefficient on left-out data for the combined set of molecular and phenotype nodes is 0.54, and 0.79 for phenotypic nodes alone. The mean RSS for the combined phenotypic and molecular nodes is 0.181, and 0.118 for the phenotypic nodes alone.

Network models

To model the cellular processes, we used the previously proposed nonlinear multiple input–multiple output model [4, 20, 21]. This model has the ability to capture a wide range of biological and kinetic effects, and has been applied to diverse biological problems including to predict the effect of novel drug combinations [4, 6].

dxiμ(t)dt=εitanh(jwijxjμ(t)+uiμ(t))-αixiμ(t). (1)

The time-dependent variable xiμ describes the dynamics of node i (in our context a protein level or measured phenotype) and uiμ the impact of a perturbation on node i (e.g., the application of a single drugs or drug combinations) in experimental condition μ. Here, wij denotes the interaction between the nodes i and j (more specifically, molecules, phenotypes, drugs or processes). Intuitively, wij > 0 corresponds to activation of node i by node j, and wij < 0 to inhibition. The prefactor αi quantifies the rate at which node i returns to its initial state xi = 0. The sigmoidal transfer function tanh is used to cap the reaction rates and to account for saturation and noise. Consequently, the prefactor εi > 0 determines the dynamic range of node i as values of tanh are limited to the one-dimensional unit ball [−1, 1]. To obtain data-derived network models, we used Eq 1 and constrained its parameters to the proteomic and phenotypic measurements in 54 drug combination conditions and at 8 time points. Our full model contained 124 molecular and two phenotypic nodes. For a complete description of the model equations, see Materials and methods.

Model selection and error estimation

Datasets were subdivided into training, validation and test sets (gray, green and blue boxes, respectively, in Fig 3 top left). The training sets contained all data from single-drug experiments; the validation and test sets were constructed such that (i) they contain all pairwise drug combinations and (ii) every drug has the same number of occurrences in both sets. Model parameter estimation was performed on the training dataset. We varied the regularization parameter λ on a linear grid and trained 10 network models per grid value. As expected, higher values of λ resulted in sparser networks with fewer interactions (Fig 3 top right, magenta) and higher error on the training data (Fig 3 bottom left, gray). The optimal regularization strength was selected using the validation dataset by (i) Bayesian Information Criterion (BIC) computed on the training set (Fig 3 top right, gray) and (ii) by residual sum of squares (RSS) (Fig 3 bottom left, green). This regularization parameter was determined to be λ* = 3 for both the BIC and RSS metric (Fig 3, bottom left). The same result was obtained when using Pearson’s correlation coefficient (S4 Fig). We studied the accuracy of the final models on the independent test dataset (Fig 3 bottom right). The resulting generalization error for optimal λ* on left-out data had a mean of 0.181 across the 10 models and of 0.118 for phenotypic data alone, and the corresponding Pearson correlation coefficient between model prediction and the experimentally measured data was 0.54 (molecular and phenotypic) and 0.79 for the phenotypic data alone. The correlation was found to be higher in the later time points of molecular and phenotypic data. In particular, the Pearson correlation coefficient between predictions and measurements of the combined molecular and phenotypic data was 0.71 on the combined last three time points (24, 48, and 67 hours) and 0.74 for the last time point alone (S5 Fig). Hence, the inferred temporal model had the highest accuracy at the 67-hour time point. We conclude that the method for network modeling has a reasonably accurate predictive power, especially for predictions in the 24–67 hour range and that the models are useful for predicting the effects of new perturbations as hypotheses. Reasonable accuracy means that an affordable number of experiments would lead to a positively validated result (a ‘hit’) that can be advanced to pre-clinical investigation.

Analysis of network models

We used the optimal regularization parameter λ* = 3 to infer 101 network models on all available data. To have reasonable diversity, the models were inferred without using prior information, i.e., known biological interactions. Several key model interactions agreed with interactions reported in the literature. The inferred effect of selected drugs on proteins (Fig 4) was in agreement with known drug–protein interaction patterns (e.g., MEKi inhibits the phorphorylation of ERK1/2 at T202/Y204 and RAFi inhibits MEK1/2 phorphorylation at S217/221) as well as unknown interactions (e.g., PKCi inhibits phorphorylation of CREB at S133). In addition, these model-derived drug–protein effects were in agreement with the observed drug effects from single-drug RPPA measurements (S2 Fig).

Fig 4. Model-inferred effect of drugs on proteins and phospho-proteins.

Fig 4

The effect of drug treatment on (phospho-)protein levels is captured as edges between drugs (colored circles) and proteins (gray circles) in the model (represented by the drug–(phospo-)protein interaction strength dil from Eq 2). Some of the edges are well known (e.g., MEKi inhibits ERK1/2-pT202/T204) and some appear to be novel or indirect (e.g., inhibitory effect of PKCi on CREB-pS133). Based on the distribution of edge values over the 101 network models, only the strongest drug–protein edges are displayed for visualization purposes (85th percentile for positive/activating interactions and 15th percentile for negative/inhibiting interactions, by absolute value).

Saturation effects of drugs are taken into account by the model using the open parameter δi. For values of 0 < δi < 1, we have an almost linear drug effect; values of δi > 1 do not result in any significant change as drug concentrations increase. For the case of PKC and SRC inhibitors, we find a δi in the linear range, i.e., that a doubling of the low dose results in almost the double response, while RAF and JNK inhibitors have almost reached saturation, i.e., no significant difference between the drug effect in low and high dose of the drugs (S3 Fig).

Predicted effect of drug perturbations and nomination of targets for drug testing

Using the 101 inferred network models, we predicted the phenotypic response (cell growth and apoptosis) as a function of network node inhibition (proteins and phospho-proteins). In particular, we systematically simulated a wide spectrum of inhibition strengths (cpert) targeting every molecular node (Eq 7) and the phenotypic response according to Eq 8 at t = 72 h (see Materials and methods). The mean resulting dose response curves across all models when inhibiting each molecular node was used to estimate the half maximal effective concentrations (EC50) for cell growth (S6 Fig) and apoptosis (S7 Fig). We further used these EC50 concentrations to simulate the effect of pairwise combination perturbations of the 124 proteins and phospho-proteins on each phenotypic node independently across all models.

The predicted phenotypic responses were averaged across all models and the first 20 unique nodes were selected from the highest ranking combinations (Fig 5 left: cell growth reduction, right: apoptosis increase). In total, we predicted cell growth and apoptosis for 124 ⋅ 123 = 15, 252 conditions for 101 networks, resulting in 1, 540, 452 simulated responses (S8 and S9 Figs).

Fig 5. Model predicts the effect of combination perturbations and suggests optimal inhibitor combinations.

Fig 5

The top 20 × 20 predictions of pairwise inhibition of molecular nodes (i.e., proteins and phospho-proteins) that decrease cell growth (bottom left, blue) and increase apoptosis (bottom right, red). Cell growth and apoptosis were computed for each target combination. These values were log2-transformed, normalized to the unperturbed steady state, and the average value over 101 network model predictions is presented (diagonal represents predictions for inhibition of single targets). Combinations nominated for drug testing are highlighted by dark-rimmed squares. For complete heatmaps of all tested predictions, see S8 and S9 Figs.

The most effective predicted combination for decreasing cell number was found to be inhibition of phospho-tyrosine at position 992 on EGFR together with phospho-serine at positions 636/639 on IRS1 (Fig 5 left). Simultaneous inhibition of the protein IRS1 together with reduction of phospho-serine at positions 636/639 on IRS1 was found to the most effective combination for increasing apoptosis (Fig 5 right). Note that predicted perturbation of IRS1 alone had the nearly the same effect on apoptosis as in the combination.

EGFR is involved in growth signaling through the RAS/MEK pathway, and EGFR-pY992 is an activating auto-phosphorylation site at the receptor [22]. There are many available drugs that specifically inhibit EGFR and are expected to decrease EGFR-pY992, for example the drugs lapatinib [23], gefinitib, and erlotinib. Inhibitors of EGFR alone do not have a strong anti-proliferative effect in melanoma cell lines [24].

IRS1 transmits signals from IGFR to PI3K/AKT. The protein can be inhibited by the compound NT157 [25], which has two mechanisms of action: 1) reduce IGF1 signaling through dissociation of IRS1 from the IGF1 receptor, and 2) reduce levels of IRS1 by phosphorylation at serine sites that lead to proteosomal degradation [25]. IRS1-pS636/639 is one such serine phosphorylation site [25]. The inhibitor NT157 is reported to first increase IRS1-pS636/639, which induces degradation of IRS1 (within hours) [25]. It has been reported that NT157 inhibits proliferation in the RAF inhibitor resistant melanoma cell line A375, both through reduced IGF1-signaling and reduced STAT3-signaling [26]. The inhibitor NT157 has also been shown to increase the tyrosine-phosphorylation 1172 at EGFR in the melanoma cell line A375 [27]. Using an EGFR inhibitor to abrogate this increase in EGFR-pY1172 after NT157 treatment may provide a mechanism for increased drug efficacy. In summary, simultaneous inhibition of IRS1 and EGFR is predicted to both reduce cell growth and increase apoptosis. In theory, this effect can be achieved by using the drugs NT157 (IRS1 inhibitor) and gefitinib (EGFR inhibitor).

Experimental testing of model predicted drug effects

To test the predictive potential of our model, we selected a subset of the 124 molecular nodes for experimental testing. Nodes were chosen based on i) those that had the strongest predicted effect i.e., EGFR-pY992 and IRS1-pS636/639, ii) those that cover a wide range of predicted response patterns from substantial growth reduction to no effect on growth (Fig 6 bottom), and iii) those that were targetable by available drugs (Fig 6 top). Based on these criteria, we selected gefitinib and NT157 to target EGFR-pY992 and IRS1-pS636/639, respectively. The drug CAS285986, which inhibits STAT5b, was chosen as a proxy for STAT5a inhibition, and the p38-MAPK inhibitor SB203580 was selected to target p38-MAPK-pT180/T182. We applied seven different concentrations of each drug, alone and in selected pairwise combinations (Fig 6), and measured cell number in three-hour steps from time 0 until 96 h in three replicates (Materials and Methods).

Fig 6. Experimental testing of model predictions using single and pairwise drug combinations.

Fig 6

Several single and pairwise perturbations that were predicted to have a differing phenotypic effects were experimentally tested in the melanoma cell line A2058 (highlighted in Fig 5). Cells were perturbed with drugs that target nodes in the computational model (top). During perturbation, cells were subjected to live-cell imaging, the resulting images were segmented, and cell count was quantified, normalized relative to no-drug control and log2-transformed (see growth curves in middle panel). Agreement between model prediction (log2-transformed, normalized and averaged growth) and experiment is summarized (bottom panel).

The drug pair with the strongest predicted effect, NT157 and gefitinib, reduced cell growth substantially relative to no-drug control, and the combination is more effective than the effect of either drug alone (Fig 6 middle). The drug pair predicted to have the second strongest effect, NT157 and CAS285986, also reduced cell growth, but to a lesser extent than NT157 with gefitinib, and only for drug combinations that included the highest dose of NT157 (10 μM) (Fig 6 middle, 2nd row). NT157 and gefitinib were both predicted to reduce growth alone, and experimental tests resulted in a reduced cell number with the highest dose tested (10 μM) (Fig 6 middle, 2nd row). CAS285986 alone was predicted to cause a slight reduction in cell number, and no effect was observed at any dose of the drug (Fig 6 middle, 3rd row). SB203580 was predicted to increase growth, but no effect of the drug was seen at any concentration (Fig 6 middle, 3rd row). Although we predicted the combination of NT157 and SB203580 would have little-to-no effect on cell growth (with SB203580 antagonizing the effect of NT157), the experimental data did not support this hypothesis. However, a substantial reduction at the highest dose of NT157 (10 μM) was measured (Fig 6 middle, 1st row). The combination of gefitinib and CAS285986 failed quality control and results were inconclusive. In summary, with the exception of the NT157–SB203580 combination, the experimental results largely agreed with the model-based predictions.

Comparison with literature data

We compared our predictions for cell growth with measured drug sensitivities from literature. From the rich perturbation dataset of [28] we selected those data that (1) referred to A2058 and (2) applied drug perturbations that specified target proteins in our model (and measured input dataset). The selected data and the corresponding model predictions are shown in S2 Table. Many of our predictions of drug effects on cell growth are in agreement with previous measurements, e.g., ERK is highlighted as a potential target in both the experimental data and in our predictions. Further, we do not observe any gross false positive predictions in this comparison, i.e., where the model predicts perturbation of a protein target to be effective, and the experimental data show the opposite. However, we do observe several discrepancies between model prediction and experimental evidence in the literature, e.g., the inhibition of the receptor FGFR1 is predicted by our models to increase cell growth, while the measurements in [28] show that a drug that targets FGFR1 results in decreased cell growth. Comparison of model predictions and experimental drug sensitivities is shown in S2 Table and S10 Fig.

Discussion

We present a method to infer network models from time-resolved molecular and phenotypic perturbation biology data. These data consist of multiple drug perturbations, applied as single drugs and drug combinations in a melanoma cell line. The computational method utilizes the full time-series data, and outputs network models that characterize the interactions between drug perturbations, (phospho-)proteins, and phenotypic changes. The inferred models allow for the prediction of phenotypic responses—cell growth and apoptosis—for unseen perturbations, and can therefore be used to generate large-scale drug discovery hypotheses. As an application, we use the method to predict effective drug target combinations in melanoma. In order to estimate the confidence of the proposed approach, we experimentally test the effect of a subset of the nominated drug combinations on cell growth. Given the complexity and resource requirements of experimental testing, we believe that this method provides a rational means to test many more perturbations than can be experimentally explored.

Inhibition of proteins EGFR and IRS1, which was predicted to be the most effective combination, resulted in a reduced number of cells only at high doses (10 μM) of the selected drugs. There are several possible reasons that a high dose was required to reduce cell growth even though the model predicted the drugs to be very effective. First, drug mode-of-action is likely different than inhibiting proteomic nodes in the computational models. For example, the drug gefitinib inhibits the entire tyrosine kinase domain rather than the single modeled phosphorylation EGFR-pY992. There are several other EGFR-related nodes in our network models. For instance, EGFR-pY845 is predicted to increase cell number when reduced (S6 Fig). This difference in predicted effect between inhibiting EGFR-pY845 and EGFR-pY992 might hint at a stronger growth reduction of more specific downstream targets when EGFR is phosphorylated at Y992. Second, antibody cross-reactivity may be introducing noise in our training data leading to inaccuracies in the final predictions. For example, the EGFR-pY992 antibody can also react with HER2, and therefore the values of EGFR-pY992 in the training data may be confounded by HER2 expression. The drug we tested, gefitinib, is not a dual EGFR–HER2 inhibitor. There are other drugs that simultaneously inhibit both EGFR and HER2 (e.g., erlotinib or peletinib) and are more efficient in melanoma cell lines [24]. Third, although we attempted to select drugs that would mimic model node perturbation, the actual molecular effect of each drug in our cell line may not alter the same nodes as in the model. For example, the model predicted that reduction in IRS1-pS636/639 would alter cell growth, but the drug that we used to target IRS1 (NT157) may not decrease this phosphorylation site. In fact, NT157 has been shown to increase serine phosphorylations that mark IRS1 for proteosomal degradation in the melanoma cell line A375 [25]. Even though NT157 decreases IRS1 protein, we have no knowledge of whether NT157 has any effect on IRS1-pS636/639. For these reasons, the model-based predicted vulnerability—inhibition of EGFR-pY992 and IRS1/IRS1-pS636/639 in A2058 cells—might be more potent when alternative drugs are chosen or with other methods of perturbation. Of note, perturbations including the p38 and MAPK inhibitor SB203580 showed the strongest disagreement between prediction and experimental validation. In particular, SB203580 alone did not increase cell growth, and significantly reduced cell growth in combination with NT157. We do not have a data-driven hypothesis for this discrepancy, but speculate that it could be due to off-target effects.

Our modeling framework is data-driven, which means that network interactions are directly inferred from data without the use of prior knowledge on drug–protein, protein–protein, and protein–phenotype interactions. We believe that data-driven discovery of drug–protein interactions is an important novelty as many drugs have unspecific or even unknown molecular activity. Even kinase inhibitors—which are commonly considered specific and are therefore used to draw conclusions about the physiological roles of molecules—have been shown to have significant off-target specificity. For example, when panels of kinase inhibitors have been profiled against panels of kinases, multiple on- and off-target effects were found [2932]. Our framework is useful regardless of the specificity of inhibitors/drugs since specificity is “learned” and all perturbation data can be utilized as information for model training.

As the predictions are inferred from context-specific data without any use of prior knowledge, it is tempting to fully assess model accuracy by comparing inferred parameters to interactions found in the literature. However, a full quantitative comparison between network models and literature is difficult for several reasons: (1) The relationship between model transfer function and experimental dose response is not straightforward to establish as their shape and magnitude are unique to each modality. (2) Our models are derived in a single experimental context (e.g., A2058 cells after perturbation in our hands), while literature contains an aggregate of knowledge across many different contexts from many different labs. (3) It is not rare that drugs that target the same molecule have opposing downstream effects, e.g., PI2K alpha inhibitors alpelisib and GNE-317 have positive and negative growth effects, respectively (see S2 Table). These opposing effects may be due to dose differences or incorrectly annotated specificity, both of which are difficult to tease apart. (4) The dynamic nature of drug response is often not well described in the literature and so full comparisons with our dynamic models are difficult or impossible. To facilitate a comparison, we generally access response at a steady state timepoint, which not correspond to literature-selected timepoints. (5) Although one can take many simulation strategies, our model simulation approach applied highly specific perturbations (dose and target) that would be impossible to achieve experimentally. (6) During model training, we took a simplified view of drug–target interactions, modeling the effect of each drug as a reduction in the activity of a single corresponding target. The relationship between drugs and targets is more complex and is often is many-to-many (see S2 Table). Even with these caveats, when well-established molecular interactions overlap with strong predictions, we feel confident that the model is capturing true biological relationships.

Similar to other data-driven modeling methods, our framework is heavily dependent on both the quality and quantity of data. We measured the response to 54 drug combinations at 8 time points, resulting in 432 data points for each node. Even with these rich data, we observed large differences in individual models in the prediction of new drug targets (S11 Fig). These results suggest that the data may not be sufficiently informative to generate unique models. To avoid mis-interpreting individual predictions, we believe it is important to study a representative set of network models, i.e., not only the single best solution. In this study, we generally aggregated the responses from 101 models. Another challenge for large models is determination of unique parameters. In principle, uncertainty of parameter determination can be quantified using, e.g., profile likelihood assessment [33, 34], but this is computationally prohibitive for the relatively large models derived here.

The optimization problem dealt with in this study is of exponential complexity, with an exponent of N2, where N is the number of nodes. This severely limits the number of nodes that can be included in the model. Using the 124 (phospho-)protein nodes, each model requires inference of approximately 16,000 parameters, which required the use of high-performance compute facilities. In addition, the choice of nodes that are measured experimentally and used for model building is key to overall predictability and interpretability, and interactions between nodes should not necessarily be interpreted as direct biochemical interactions, but logical indirect interactions. In theory, more comprehensive and specific experimental measurements can be used to generate models that contain more direct interactions and predict with higher accuracy. Future studies will aim to generate such data using methods like CRISPR-Cas9 gene knockouts for perturbation and mass spectrometry for molecular characterization.

Materials and methods

Genetic features of melanoma cell line A2058

The selection of cell lines that are good models for observed alteration patterns in human tumor tissue is a non-trivial task [35]. In this study, we use the melanoma cell line A2058 for the derivation of optimal drug combinations based on patient-representative genomic profiles [36]. A2058 carries the BRAFV600E mutation that resembles the patient segment which is treatable with the RAF inhibitor vemurafenib and MEK inhibitors such as trametinib. However, A2058 is a RAF/MEK inhibitor resistant cell line with concurrent activation of ERK, PI3K/AKT and cell cycle pathways and TP53 mutation. Specifically, it carries BRAFV600E, MAP2K1P124S and TP53V274F mutations and is altered in tumor suppressor genes like CDKN2A, RB and PTEN [37]. As in most melanoma samples, the cell line A2058 carries a large number of additional genomic alterations other than those listed here [36, 3840]. For A2058 cells, a synergistic effect of drug combinations has been shown using the combination of BRAF and PI3K inhibition (drugs: PLX4720 and GDC0941, respectively) [38].

Model equations

For the molecular response model, molecular nodes represent concentrations of proteins and phospho-proteins. We start with Eq 1. Applied perturbations only affect molecular nodes. All interactions between molecular nodes are directional. Self-interaction terms are excluded from the model formulation. The resulting model of the temporal dynamics of the i-th molecular node i = 1, …, Nmolec in experimental condition μ is given by,

dxiμ(t)dt=εitanh(j=1,jiNmolecwijxjμ(t)+l=1Ndrugdilulμ(t))-αixiμ(t). (2)

In particular, xiμ(t) denotes the temporal log2-control-normalized protein level (see defining equation in Data normalization below), which is subjected to drug perturbations as defined by the superposition of drug nodes ulμ(t), l = 1, …, Ndrug (see below). In order to model saturation effects and cap the time derivatives, we use the hyperbolic tangent as the sigmoidal transfer function. By doing so, the interaction-driven differential change of variable i is bounded to values in [−εi, εi]. The interaction parameters wij characterize the effect of protein level j on i and αi describes how quickly the i-th protein level returns to the zero-level steady-state in the absence of any interaction or drug. The specific effect of the drug l on molecular node i is quantified by dil. In contrast to our previous perturbation modeling approaches [4, 21] in which the interaction between drug node and target were defined based on prior knowledge, in this work we determine the drug–molecular node interactions dil as a result of model inference. In addition to dil, the parameters αi, εi and wij are also data-derived.

Using Eq 1, we define the l-th drug node, l = 1, …, Ndrug, by,

dulμ(t)dt=10·tanh(δlclμ)-10·ulμ(t), (3)

where we enforce a fast-acting drug effect by choosing ε = α = 10 in Eq 1. Here, clμ describes the drug concentration relative to the single-drug concentration (i.e., clμ=0 when the drug not present, clμ=1 when drug is present at low dose concentration and clμ=2 when the drug is present at high dose (i.e., 2 × low dose) in experimental condition μ. Eq 3 has the analytical solution,

uiμ(t)=tanh(δlclμ)(1-e-10t)

and δl is the effective impact, which is inferred in the parameter inference process.

We also use Eq 2 to define the two phenotypic nodes cell growth (cg) and apoptosis (ap). These nodes are modeled in the same way as molecular nodes except that interactions are only unidirectional (i.e., from molecular to phenotypic node). These dynamics are described as,

dxcgμ(t)dt=εgctanh(j=1Nmolecwcg,jxjμ(t))αgcxgcμ(t),dxapμ(t)dt=εaptanh(j=1Nmolecwap,jxjμ(t))αapxapμ(t), (4)

where xcg/apμ(t) denotes the temporal log2-control-normalized apoptotic readout and cell number, respectively (see defining equation in Data normalization below).

Definition of loss function and parameter estimation

The residual sum of squares for a given parameter set Θ, RSS(Θ), is used as measure of agreement between model simulations and observations,

RSS(Θ):=μi=1Nmeask(xiμ(tk;Θ)-xi,kμ)2, (5)

where all free parameters are grouped into Θ((εi)i=1Nmeas,(αi)i=1Nmeas,(wij)i,j=1Nmeas,(dil)i=1,l=1Nmeas,Ndrug(δi)i=1Ndrug) and Nmeas = Nmolec + Nphen = Nmolec + 2 is the number of measured observables, i.e., the number of proteins and phospho-proteins (molecular nodes), and the two phenotypes. Moreover, xiμ(Θ;tk) represents the simulated response given the parameters Θ, μ is the experimental condition and tk is the k-th time point. The corresponding observed data is denoted by xi,kμ. In addition to the minimization in Eq (5), regularization was applied to reduce the number of edges of non-zero edges. For this purpose, an 1-norm on the interaction parameters wij was added to the problem of minimizing the residual sum of squares as function of Θ:

λi=1Nmeasj=1,jiNmolec|wij|, (6)

which defines the to-be minimized loss function for our parameter estimation. Here λ > 0 sets the regularization strength and was chosen based on data holdout and Bayesian Information Criterion considerations.

The loss function was minimized using Adam, an implementation of a stochastic algorithm for first-order gradient-based optimization [41]. We used the version of Adam included in TensorFlow (version 1.2.1, Google Brain team, Mountain View, CA, U.S.A.), and the entire simulation and optimization procedure was performed in a Python environment. For all minimization runs, we used a fixed learning rate of 0.01 and evaluated the results when 10,000 iterations or 24 hour run time were reached, whichever occurred first. Failed optimization runs with all wij = 0 were discarded.

The Adam minimizer does not offer soft thresholding, so no exact zeros of parameters are obtained when the loss function (i.e., the sum of Eqs (5) and (6)) is minimized. We adopt an alternative strategy to introduce parameter sparsity by cropping the interaction parameter wij in each update step nn + 1, for interactions from protein/phenotypic nodes: wijn+1=0 if wijn<0.01, and for interactions from drugs nodes: diln+1=0 if diln<0.05. For our specific set-up, this strategy to enforce sparsity in the inferred parameters produced stable parameter estimates with a decrease in computational run time relative to full representations.

Recently, other perturbation biology modeling strategies based on ours and others [4, 21] have explored approaches for better scaling data-driven models using TensorFlow [42].

Model selection

We adopted two strategies to select the most informative model without overfitting to the given data using (1) the Bayesian information criterion and (2) by division of the input data into training, validation, and test datasets.

Bayesian information criterion

Assuming independent and identically distributed model errors drawn from a normal distribution, the maximum loglikelihood ln(l(Θ^)) can be expressed in terms of the minimum RSS(Θ^), where Θ^ is the parameter set that maximizes the likelihood function. We use the following simplified definition of the Bayesian information criterion, BIC,

BIC(k)=-2ln(l(Θ^))+kln(n)nln(RSS(Θ^)/n)+kln(n)

where k is the number of non-zero model parameters and n is the number of data points used in the parameter estimation. The number of non-zero parameters is a function of the regularization applied via λ, i.e., k = k(λ). We select the optimal regularization parameter λ as the minimum of the BIC,

λ*=arg minλBIC(λ)argminλ{nln(RSS(Θ^))+k(λ)ln(n)},

where we omitted contributions from constants that are dependent on the number of data points n. For the current data set, the optimal regularization parameter is found as λ* = 3 for the Bayesian information criterion (Fig 3 top right). The Akaike information criterion did not have a clear minimum.

Dataset division

In addition to selecting λ using the Bayesian Information Criterion, the optimal regularization parameter λ* is evaluated by splitting the data into a training dataset to estimate the model parameters, a validation dataset to determine an optimal λ, and a test dataset to evaluate model performance. The training set is chosen to contain all single drug conditions, and the test sets are chosen as a split between all combinations of drugs (Fig 3 top left). As with the Bayesian Information Criterion, the optimal regularization parameter is found to be λ* = 3 (Fig 3 bottom left).

Systematic in silico perturbation of molecular nodes

In order to identify new potential drug target combinations, we combine simulations using models built with all the data with an in silico inhibitor of each of molecular node, xipert(t). The in silico inhibitor of molecular node i is added to the model equations as a step concentration cipert0 for t ≥ 0 and then we simulate the protein and phospho-protein dynamics to the last measured time point t = 72 h. The resulting dynamics for the perturbed/inhibited molecular node i = 1, … Nmolec are found from,

dxipert(t)dt=εitanh(j=1,jiNmolecwijxjpert(t)cipert)αixipert(t),dxipert(t)dt=εitanh(j=1,jiNmolecwijxjpert(t))αixipert(t) (7)

with i′ = 1, …, i − 1, i + 1, …, Nmolec. The response of the phenotypic nodes, xcgpert(t) and xappert(t), are determined as,

dxcgpert(t)dt=εgctanh(j=1Nmolecwcg,jxjpert(t))αgcxgcpert(t),dxappert(t)dt=εaptanh(j=1Nmolecwap,jxjpert(t))αapxappert(t). (8)

Selection of inhibition strength for in silico combination perturbations

We assume the solution of Eqs 7 and 8 at t = 72 h to have the following shape when molecular node i is perturbed with a single in silico inhibitor applied with concentration cipert0,

xcg/appert(cipert)=E2-E11+(cipertc)n+E1,

where xcg/appert()=E1 and xcg/appert(0)=E2 represent the maximum and minimum change in each phenotype, respectively. cipert is the concentration of the in silico inhibitor specifically targeting molecular node i. n and c are open parameters. n is the Hill coefficient and c is the in silico-derived 50%-effect concentration, EC50. In our framework, we focus on the effect on cell growth and apoptosis by single molecular node inhibition. As the data are all log2-normalized to control data, xcg/appert(0)=E2=0 and so the equation above simplifies to,

xcg/appert(cipert)=E1(1-11+(cipertc)n) (9)

with xcg/appert(0)=0 and xcg/appert()=E1 (see S6 and S7 Figs for the mean individual dose response of apoptosis and cell growth as a function of inhibiting single molecular nodes). These in silico-derived EC50 are subsequently used for molecular node inhibition simulations. These simulations are systematically carried out such that all single and pairwise molecular nodes are perturbed, and their effect on cell growth and apoptosis is calculated (Fig 5).

The scale of concentration in the simulations, and hence the estimated in silico EC50 concentration (parameter c in Eq (9), S6 Fig), is expressed in arbitrary units, but can nevertheless be related to the experimental EC50 concentration. This can be done by equalising the estimated in silico and experimentally determined EC50 values. This assumes however that the inhibitor used in the experiments is of perfect specificity.

Numerical solution and simulation of model equations and systematic perturbations

The above specified ordinary differential equations were numerically solved with a second-order Runge–Kutta method with two stages and a fixed step size in a Python environment. As a quality check, these results were compared with those using the ode15s solver for stiff differential equations in MATLAB R2018a (Mathworks, Natick, MA, U.S.A.) and were in agreement.

Experimental and data normalization protocols

Molecular measurements in response to perturbation

The melanoma cell line A2058 was seeded in 6-well plates at 50,000 cells per well. Each biological replicate was run on separate occasions. The physical difficulties and logistical complexity of the experiments required that each time point be run in batches with the early time points being harvested immediately and the late time points being setup immediately following time point harvesting. Perturbed cells were lysed in CLB1 buffer (Bayer Technology Services, Leverkusen, Germany; now NMI TT Pharmaservices, Reutlingen, Germany). For Dataset 1, we performed a 4-fold dilution series of the samples using the Biomek FXP Laboratory Automation Workstation (Beckman Coulter Inc., Brea, CA, U.S.A.) automatic pipetting system in one technical replicate [19]. For Dataset 2, [a single lysate sample was printed twice (technical replicate)] each lysate sample was printed at the respective protein concentration in two technical replicates. Diluted samples were printed onto tantalum pentoxide-coated glass chips and then blocked with an aerosol BSA solution. The protein array chips are then washed in double-distilled H2O and dried before measurement. For the immunoassay, we incubated the chips with primary antibodies for 24 hours followed by 2.5 hours incubation with Alexa Fluor-647 conjugated secondary antibody detection reagents. Antibodies are diluted in CAB1 buffer (Bayer Technology Services, Leverkusen, Germany). The immuno-stained chips were imaged using the ZeptoREADER (Zeptosens/Bayer, Witterswil, Switzerland). The ZeptoView 3.1 software (Zeptosens/Bayer, Witterswil, Switzerland) was used to output the reference net spot fluorescence intensity, RNFI. Included standard global and local normalization of sample signal to the reference BSA grid is used.

Sample lysis and RPPA chip layout and analysis

All experimental perturbations and sample preparations were carried out in the Sander lab. Cell lysate was aliquoted into multiple samples, and RPPA measurement using the zeptosens platform was carried out in both the Sander lab (MSKCC, Dataset 1) and the Pawlak lab (NMI TT Pharmaservices, Dataset 2).

Differences between each dataset

Dataset 1 consists of 71 antibody measurements of three biological replicates at eight time points (10 min, 27 min, 72 min, 3 h, 9 h, 24 h, 48 h, 67 h). For Dataset 1, each RFI data point, x^i,kμ, was derived from a four-spot protein dilution series of 0.2, 0.15, 0.1, and 0.05 mgml. Dataset 2 consists of 86 antibody measurements of three biological replicates at (a) eight time points for biological replicate 1 and (b) two time points (48 h, 67 h) for biological replicate 2 and 3. For Dataset 2, protein lysate was diluted to 3 μg/μl, when concentration was higher than 3.4 μg/μl, and each RFI data point was derived from the mean of two measured spots. Median-centered protein factors, determined for each sample by NMI TT Pharmaservices, were used to normalize protein content.

Similarities between each dataset

All data were acquired from the same RPPA platform (Zeptosens/Bayer, Witterswil, Switzerland). 33 antibodies were measured in both datasets as controls, which resulted in 71 + 86 − 33 = 124 unique antibodies. After outlier detection and loading normalization (see above), the median of the three biological replicates from Dataset 1 and the median of the data (one biological replicate in six time points and three biological replicates in two time points) from Dataset 2 was combined and used in the downstream analysis and modeling steps.

Fluorescence signal evaluation

Each spot measured on the Zeptosens array corresponds to the relative concentration of a specific protein or phospho-protein in a single condition. This value is detected by measuring the secondary antibody fluorescence that is bound to the (phospho)protein-specific antibody. The relationship between this signal and the absolute concentration of the (phospho-)protein is assumed to be linear in the observed range, i.e.,

cix^i

given constant exposure times across all the experimental conditions and time points in each antibody measurement. In Dataset 1, for each (phospho-)protein i one measures four antibody fluorescence signals x^i(1), x^i(2), x^i(3) and x^i(4), corresponding to the four serial dilutions of cell lysate ci(1)=ci, ci(2)=34ci, ci(3)=12ci and ci(4)=14ci. These values are referred to as Referenced Net spot Fluorescence Intensity (RNFI). This allows to quantify the degree of “non-linearity” of the signal by computing the discrepancy between linear fit and actual signal. Assuming a linear dependency, then x^i(c)=β0+β1c, the fitting parameters are found in a least-squares estimation,

(β0,β1)T=argminβ0,β1m=14|x^i(m)-(β0+β1m4ci)|2=(cTc)-1cTy=(10.50-0.5-65ci-25ci25ci65ci)y,

where,

c=(10.25ci1ci),y=(x^i(1)x^i(4)).

For ideal profiles, i.e., without intercept or x^i(0)=β00, we have x^i(ci)β1ci (and consequently, x^i(ci)ciβ1). A single value, the Referenced sample Fluorescence Intensity (RFI), corresponding to the linear fit of the 4-spot array corresponding to the mean concentration value, c=0.625c¯, is chosen,

x^ix^i(0.625c¯)=β0+β1·0.625c¯.

Zeptosens calls the RFI using the reference concentration of the total protein content per spot (c¯=0.2mgml). This concentration was chosen to be roughly the protein content of a eukaryote cell. Ideally, the concentrations for each sample are cic¯, but in practice ci varies due to experimental uncertainties. We could estimate a posteriori the actual protein loading quantification with the reference loading chip and subsequently adjust x^i for each sample by setting c¯=mediani{ci}. For Dataset 2, we adopt this strategy and use on-chip measured median-centered protein factors determined from protein stain assays. However, for Dataset 1 we had no measurements of the protein loading and therefore applied double-median normalization (see [39]) for protein loading normalization.

Spotting error detection

For Dataset 1, due to stochastic sampling, low concentrations or saturated sample, as well as occasional robotic errors, some lysate spots in the serial dilutions do not follow the expected linear and monotonically decreasing dilution profile. We identify outliers that deviate strongly from the linear interpolation of the RNFI values using the Cook’s distance [43]. We then use a Cook’s distance of 0.8 as cutoff. For Dataset 2, the spotting quality appeared to be more consistent and no spotting error removal was required.

Protein loading normalization

Occasionally, samples have total protein concentrations that deviate from the optimal loading concentration. To normalize for this uneven loading in the we perform double-median normalization (sometimes referred to as “loading control”) as described in [44]. For Dataset 1, the log2-transformed and normalized value for the i-th antibody in time-point k and under experimental condition μ is then calculated as,

xi,kμlog2x^i,kμ-medimadk,μ.

Here, x^i,kμ denotes the linear raw data of antibody i measured in time point k under experimental condition μ, medimediank,μ(log2(x^i,kμ)) the median of the log2-transformed values in each antibody across time points and experimental conditions and madk,μmediani(log2(x^i,kμ)-medi) its median absolute deviation. In Dataset 2, in contrast to Dataset 1, we have an estimate on the relative deviation from the target protein loading concentration in each sample of 0.3 μg/μl. The observed protein loading was lower and found to have a median concentration of 0.22 μg/μl). Based on these measurements, we normalize each lysate sample by the corresponding on-chip measured median-centered protein factors.

Data normalization

Before applying any further modeling and inference, we subtract the log2-transformed and double-median normalized RPPA data in the DMSO control condition (without perturbation) from all previously log2-transformed and double-median normalized RPPA data, xi,kμ:

xi,kμxi,kμ-xi,kDMSO.

Phenotypic measurements in response to perturbation

At the start of the experiment, 1,000 A2058 cells were seeded in three biological replicates in 96-well plates. After 24 hours, drugs were added to each well and cells were subjected to real-time imaging using the IncuCyte (Essen BioScience, Ann Arbor, MI, U.S.A.) live-cell imaging system. Images were taken every 3 hours in 3 channels (phase, GFP, RFP) at 4 regions on the plate (12 images per well at each time point). The RFP channel was used to detect nuclear-localized mCherry (constitutively expressed). To detect apoptosis, a Caspase-3/7 green reagent (Essen BioScience) was added to each well. Image analysis using the Incucyte software was used to segment and count all cells (RFP channel) and cells going through apoptosis (GFP channel) at each time point. These counts were log2-transformed and normalized relative to DMSO control condition.

Cell number measurements of optimal drug combinations

Cell line

Cell line A2058 with H2B-mCherry was thawed, tested for mycoplasma, and passaged twice prior to testing. The cell line was passaged fewer than 15 times and screened with the compounds within a month of thawing. The cell line was passaged and cultured on DMEM media with 10% FBS and 1% Penicillin/Streptomycin.

Screening protocol

Cells were harvested, counted, and deposited into 384 well plates at a total volume of 50 μl per well with a t0 seeding density of 750 cells/well using a Thermo Multidrop Combi dispenser (Thermo Fisher Scientific, Waltham, MA, U.S.A.). Cells were immediately placed into the Incucyte microscope culture and allowed to attach and proliferate for 24 hours. Four compounds (NT157, CAS285986, gefitinib, SB203580) were dissolved into 10 mM solutions two hours prior to dosing. The compounds were dispensed into the previously seeded 384 well plates using a HP D300e Digital Dispenser 24H after cell seeding in a dose matrix testing each concentration point of each drug in combination with each concentration point of the other drugs, with three of the drugs (NT157, gefitinib, and SB203580) titrated from 0.01 μM to 10 μM and CAS285986 titrated from 0.075 μM to 75 μM. The plates were then placed back into the Incucyte and cell measurements were recorded every 3 hours for 96 hours.

Supporting information

S1 Table. The drugs applied in this study together with each drugs’ literature-described downstream target(s), the downstream target(s) measured in this study, and literature references.

(XLSX)

S2 Table. Comparison of literature-derived effect of single drugs on cell growth to model predictions.

A comparison of drug sensitivity data from the A2058 melanoma cell line [28] is based on known target proteins of each drug from [28]. Positive values represent an increase in growth when the drug is given (highlighted in gray). The corresponding model-predicted drug pair values are from the diagonal of S8 Fig.

(XLSX)

S1 Fig. The simulated effect of each drug and drug pair on cell growth and apoptosis.

After 72 hours of treatment drug treatment, cell count and apoptosis was measured using live-cell imaging (Incucyte). Color intensity reflects cell count (left) and apoptosis (right) relative to untreated cells (log2-normalized). The cellular response to single drugs in both high (2 × low dose) and low dose as well as to all pair drug combinations (in low doses) were measured. The low dose effect of the drugs are depicted in the first row/column, and the high dose in the diagonal.

(EPS)

S2 Fig. Drug responses of proteins and phospho-proteins.

For each drug, the six (phospho-)proteins depicted are those that exhibited the largest magnitude of response to single drug perturbations. The data is ranked by the absolute median response over time.

(EPS)

S3 Fig. Temporal patterns of drug node dynamics.

The means and standard deviations of the simulated drug nodes for the high dose (solid line) and low dose (dashed line) of several in silico inhibitors across the 101 created network models.

(EPS)

S4 Fig. Model selection and error estimation.

Mean and standard deviation of computed correlations for the validation dataset as a function of the regularization parameter λ. In agreement with the previous analysis, the best predictive model is obtained for λ* = 3. Error bars indicate the standard deviation from 10 independent runs. Related to Fig 3.

(EPS)

S5 Fig. The correlation between model simulation and experimental data.

Comparison between prediction and experiment for the last three measured time points, 24, 48, and 67 hours, (left) and for the last measured time point alone, 67 hours (right). This result, compared with Fig 3, suggests that the model predictions are less reliable in earlier time points, potentially due to the transient nature of the drug response and / or experimental noise at earlier time points in the data.

(EPS)

S6 Fig. The effect on predicted cell growth due to single node inhibition.

All individual network model were simulated with different levels of input strength of an in silico inhibitor for each target present in the model. From these simulations, the mean effects on cell growth were extracted. Highlighted are the nodes that give at least 2% of the maximal effect. Inhibited nodes that give the desired effect (growth reduction) are depicted in blue, and inhibited nodes with the opposite effect (growth increase) are depicted in yellow.

(EPS)

S7 Fig. The effect on predicted apoptosis due to single node inhibition.

All individual network models were simulated under the effect of different levels of the input strength of an in silico inhibitor for each target present in the model. From these simulations, the mean effects on apoptosis were extracted. Highlighted are the nodes that result in at least 2% of the maximal effect. Inhibited nodes that give the desired effect (increase in apoptosis) are depicted in red, inhibited nodes with the opposite effect (reduction in apoptosis) are depicted in yellow.

(EPS)

S8 Fig. Predicted effect of pairwise node inhibition on cell growth.

The effect on cell growth is computed for each target combination averaged over 101 network model predictions. The complete set of predictions of pairwise inhibition of molecular nodes (proteins and phospho-proteins) is displayed in the heatmap. The diagonal elements represent predictions of single target inhibition. This heatmap contains the complete data, a subset of which was included in Fig 5.

(EPS)

S9 Fig. Predicted effect of pairwise node inhibition on apoptosis.

The effect on apoptosis is computed for each target combination averaged over 101 network model predictions. The complete set of predictions of pairwise inhibition of molecular nodes (proteins and phospho-proteins) is displayed in the heatmap. The diagonal elements represent predictions of single target inhibition. This heatmap contains the complete data, a subset of which was included in Fig 5.

(EPS)

S10 Fig. Comparison between mean values for drug sensitivity from [28] and model-based predictions of the effect on cell growth.

The means and standard deviations per target protein (data from S2 Table) Table are compared (left). The same mean values without errorbars (right).

(EPS)

S11 Fig. Predicted temporal patterns of growth and apoptosis.

The mean predicted growth response (top row, blue line) and apoptosis response (bottom row, red line) as well as the standard deviation (gray area) from simulation of 101 created network models to the pairwise perturbations of EGFR-pY992/IRS1, EGFR-pY992/IRS1-pS636/639, and IRS1/IRS1-pS636/639.

(EPS)

Acknowledgments

We thank Debbie Bemis for support, Berthold Gierke and Michael Pawlak at NMI Reutlingen for generating Dataset 2, John Ingraham for suggesting the use of the Adam optimizer in TensorFlow, as well as Emek Demir, Özgun Babur, and Augustin Luna for insightful discussions on protein–protein interactions.

Data Availability

All data is available from https://gitlab.liu.se/eliny61/perturbation-biology-time-resolved.

Funding Statement

EN acknowledges support from the Swedish Research Council (Dnr: 2016-00244), RRS and CS from the National Resource for Network Biology (NRNB) within the U.S. National Institutes of Health (grant P41 GM103504). CS acknowledges additional support from the U.S. National Institutes of Health (Center for Cancer Systems Biology, grant U54 CA148967). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Richard G Abramson. Overview of Targeted Therapies for Cancer; 2017. Available from: https://www.mycancergenome.org/content/molecular-medicine/overview-of-targeted-therapies-for-cancer/.
  • 2. Chapman P, Hauschild A, Robert C, Haanen J, Ascierto P, Larkin J, et al. Improved Survival with Vemurafenib in Melanoma with BRAF V600E Mutation. New England Journal of Medicine. 2011;364(26):2507–2516. 10.1056/NEJMoa1103782 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Manzano J, Layos L, Buges C, de Los Llanos Gil M, Vila L, Martinez-Balibrea E, et al. Resistant mechanisms to BRAF inhibitors in melanoma. Annals of translational medicine. 2016;4(12):237 10.21037/atm.2016.06.07 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Korkut A, Wang W, Demir E, Aksoy BA, Jing X, Molinelli E, et al. Perturbation biology nominates upstream–downstream drug combinations in RAF inhibitor resistant melanoma cells. Elife. 2015;4:e04640 10.7554/eLife.04640 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.ClinicalTrial.gov;. https://clinicaltrials.gov/ct2/show/NCT03266159.
  • 6. Miller M, Molinelli E, Nair J, Sheikh T, Samy R, Jing X, et al. Drug Synergy Screen and Network Modeling in Dedifferentiated Liposarcoma Identifies CDK4 and IGF1R as Synergistic Drug Targets. Science Signaling. 2013;6(294):ra85–ra85. 10.1126/scisignal.2004014 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Fröhlich F, Kessler T, Weindl D, Shadrin A, Schmiester L, Hache H, et al. Efficient Parameter Estimation Enables the Prediction of Drug Response Using a Mechanistic Pan-Cancer Pathway Model. Cell Systems. 2018;7(6):567–579.e6. 10.1016/j.cels.2018.10.013 [DOI] [PubMed] [Google Scholar]
  • 8. Raia V, Schilling M, Bohm M, Hahn B, Kowarsch A, Raue A, et al. Dynamic Mathematical Modeling of IL13-Induced Signaling in Hodgkin and Primary Mediastinal B-Cell Lymphoma Allows Prediction of Therapeutic Targets. Cancer Research. 2011;71(3):693–704. 10.1158/0008-5472.CAN-10-2987 [DOI] [PubMed] [Google Scholar]
  • 9. Becker V, Schilling M, Bachmann J, Baumann U, Raue A, Maiwald T, et al. Covering a broad dynamic range: information processing at the erythropoietin receptor. Science (New York, NY). 2010;328(5984):1404–8. 10.1126/science.1184913 [DOI] [PubMed] [Google Scholar]
  • 10. Lucarelli P, Schilling M, Kreutz C, Vlasov A, Boehm ME, Iwamoto N, et al. Resolving the Combinatorial Complexity of Smad Protein Complex Formation and Its Link to Gene Expression. Cell systems. 2018;6(1):75–89.e11. 10.1016/j.cels.2017.11.010 [DOI] [PubMed] [Google Scholar]
  • 11. Nyman E, Rajan M, Fagerholm S, Brännmark C, Cedersund G, Strålfors P. A single mechanism can explain network-wide insulin resistance in adipocytes from obese patients with type 2 diabetes. Journal of Biological Chemistry. 2014;289(48). 10.1074/jbc.M114.608927 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Hill S, Lu Y, Molina J, Heiser L, Spellman P, Speed T, et al. Bayesian Inference of Signaling Network Topology in a Cancer Cell Line. Bioinformatics. 2012;28(21):2804–2810. 10.1093/bioinformatics/bts514 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Münzner U, Klipp E, Krantz M. A comprehensive, mechanistically detailed, and executable model of the cell division cycle in Saccharomyces cerevisiae. Nature Communications. 2019;10(1):1308 10.1038/s41467-019-08903-w [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Liu H, Zhang F, Mishra SK, Zhou S, Zheng J. Knowledge-guided fuzzy logic modeling to infer cellular signaling networks from proteomic data. Scientific Reports. 2016;6(1):35652 10.1038/srep35652 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Bernardo-Faura M, Massen S, Falk C, Brady N, Eils R. Data-Derived Modeling Characterizes Plasticity of MAPK Signaling in Melanoma. PLoS Computational Biology. 2014;10(9):e1003795 10.1371/journal.pcbi.1003795 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Sontag E, Kiyatkin A, Kholodenko B. Inferring dynamic architecture of cellular networks using time series of gene expression, protein and metabolite data. Bioinformatics. 2004;20(12):1877–1886. 10.1093/bioinformatics/bth173 [DOI] [PubMed] [Google Scholar]
  • 17. Sontag ED. Network reconstruction based on steady-state data. Essays in Biochemistry. 2008;45:161–176. 10.1042/bse0450161 [DOI] [PubMed] [Google Scholar]
  • 18. Boussemart L, Malka-Mahieu H, Girault I, Allard D, Hemmingsson O, Tomasic G, et al. eIF4F is a nexus of resistance to anti-BRAF and anti-MEK cancer therapies. Nature. 2014;513 10.1038/nature13572 [DOI] [PubMed] [Google Scholar]
  • 19. Jing X, Wang W, Gauthier N, Kaushik P, Root A, Stein R, et al. Protein Profiling In Cancer Cell Lines And Tumor Tissue Using Reverse Phase Protein Arrays. bioRxiv. 2017; p. 144535. [Google Scholar]
  • 20. Nelander S, Wang W, Nilsson B, She Q, Pratilas C, Rosen N, et al. Models from experiments: combinatorial drug perturbations of cancer cells. Molecular Systems Biology. 2008;4(1):216 10.1038/msb.2008.53 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Molinelli E, Korkut A, Wang W, Miller ML, Gauthier N, Jing X, et al. Perturbation biology: inferring signaling networks in cellular systems. PLoS Computional Biology. 2013;9(12):e1003290 10.1371/journal.pcbi.1003290 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Downward J, Parker P, Waterfield M. Autophosphorylation sites on the epidermal growth factor receptor. Nature. 1984;311(5985):483–485. 10.1038/311483a0 [DOI] [PubMed] [Google Scholar]
  • 23. Xia W, Mullin R, Keith B, Liu L, Ma H, Rusnak D, et al. Anti-tumor activity of GW572016: a dual tyrosine kinase inhibitor blocks EGF activation of EGFR/erbB2 and downstream Erk1/2 and AKT pathways. Oncogene. 2002;21(41):6255–6263. 10.1038/sj.onc.1205794 [DOI] [PubMed] [Google Scholar]
  • 24. Garay T, Molnár E, Juhász É, László V, Barbai T, Dobos J, et al. Sensitivity of Melanoma Cells to EGFR and FGFR Activation but Not Inhibition is Influenced by Oncogenic BRAF and NRAS Mutations. Pathology & Oncology Research. 2015;21(4):957–968. 10.1007/s12253-015-9916-9 [DOI] [PubMed] [Google Scholar]
  • 25. Reuveni H, Flashner-Abramson E, Steiner L, Makedonski K, Song R, Shir A, et al. Therapeutic destruction of insulin receptor substrates for cancer treatment. Cancer Research. 2013;73(14):4383–94. 10.1158/0008-5472.CAN-12-3385 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Flashner-Abramson E, Klein S, Mullin G, Shoshan E, Song R, Shir A, et al. Targeting melanoma with NT157 by blocking Stat3 and IGF1R signaling. Oncogene. 2016;35(20):2675–2680. 10.1038/onc.2015.229 [DOI] [PubMed] [Google Scholar]
  • 27. Su S, Flashner-Abramson E, Klein S, Gal M, Lee RS, Wu J, et al. Impact of the Anticancer Drug NT157 on Tyrosine Kinase Signaling Networks. Molecular Cancer Therapeutics. 2018. 10.1158/1535-7163.MCT-17-0377 [DOI] [PubMed] [Google Scholar]
  • 28. Garnett M, Edelman E, Heidorn S, Greenman C, Dastur A, Lau K, et al. Systematic identification of genomic markers of drug sensitivity in cancer cells. Nature. 2012;483(7391):570–575. 10.1038/nature11005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Klaeger S, Heinzlmeir S, Wilhelm M, Polzer H, Vick B, Koenig P, et al. The target landscape of clinical kinase drugs. Science. 2017;358(6367):eaan4368 10.1126/science.aan4368 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Davies S, Reddy H, Caivano M, Cohen P. Specificity and mechanism of action of some commonly used protein kinase inhibitors. Biochemical Journal. 2000;351:95–105. 10.1042/0264-6021:3510095 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Bain J, Mclauchlan H, Elliott M, Cohen P. The specificities of protein kinase inhibitors: an update. Biochemical Journal. 2003;371:199–204. 10.1042/BJ20021535 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Bain J, Plater L, Elliott M, Shpiro N, Hastie C, Mclauchlan H, et al. The selectivity of protein kinase inhibitors: a further update. Biochemical Journal. 2007;408(3). 10.1042/BJ20070797 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Raue A, Schilling M, Bachmann J, Matteson A, Schelke M, Kaschek D, et al. Lessons learned from quantitative dynamical modeling in systems biology. PloS One. 2013;8(9). 10.1371/journal.pone.0074335 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Fröhlich F, Theis F, Hasenauer J. Uncertainty analysis for non-identifiable dynamical systems: Profile likelihoods, bootstrapping and more. In: International Conference on Computational Methods in Systems Biology. Springer; 2014. p. 61–72.
  • 35. Domcke S, Sinha R, Levine D, Sander C, Schultz N. Evaluating cell lines as tumour models by comparison of genomic profiles. Nature Communications. 2013;4 10.1038/ncomms3126 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Network TCGA, et al. Genomic classification of cutaneous melanoma. Cell. 2015;161(7):1681–1696. 10.1016/j.cell.2015.05.044 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.The cBioPortal for Cancer Genomics;. http://www.cbioportal.org/patient?sampleId=A2058_SKIN&studyId=cellline_ccle_broad.
  • 38. Van Allen E, Wagle N, Sucker A, Treacy D, Johannessen C, Goetz E, et al. The genetic landscape of clinical resistance to RAF inhibition in metastatic melanoma. Cancer Discovery. 2014;4(1):94–109. 10.1158/2159-8290.CD-13-0617 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. Xing F, Persaud Y, Pratilas C, Taylor B, Janakiraman M, She Q, et al. Concurrent loss of the PTEN and RB1 tumor suppressors attenuates RAF dependence in melanomas harboring V600EBRAF. Oncogene. 2012;31(4):446–457. 10.1038/onc.2011.250 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Boussemart L, Malka-Mahieu H, Girault I, Allard D, Hemmingsson O, Tomasic G, et al. eIF4F is a nexus of resistance to anti-BRAF and anti-MEK cancer therapies. Nature. 2014;513(7516):105–109. 10.1038/nature13572 [DOI] [PubMed] [Google Scholar]
  • 41.Kingma D, Ba J. Adam: A method for stochastic optimization. arXiv preprint arXiv:14126980. 2014.
  • 42. Yuan B, Shen C, Luna A, Korkut A, Marks D, Ingraham J, et al. Interpretable Machine Learning for Perturbation Biology. bioRxiv. 2019; p. 746842. [DOI] [PubMed] [Google Scholar]
  • 43. Cook R, Weisberg S. Residuals and Influence in Regression Monographs on statistics and applied probability. Chapman & Hall; 1982. Available from: https://books.google.com/books?id=MVSqAAAAIAAJ. [Google Scholar]
  • 44. Liu W, Ju Z, Lu Y, Mills G, Akbani R. A comprehensive comparison of normalization methods for loading control and variance stabilization of reverse-phase protein array data. Cancer Informatics. 2014;13:CIN–S13329. 10.4137/CIN.S13329 [DOI] [PMC free article] [PubMed] [Google Scholar]
PLoS Comput Biol. doi: 10.1371/journal.pcbi.1007909.r001

Decision Letter 0

Douglas A Lauffenburger, James R Faeder

27 Jan 2020

Dear Dr Stein,

Thank you very much for submitting your manuscript "Perturbation biology links temporal protein changes to drug responses in a melanoma cell line" for consideration at PLOS Computational Biology. As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. The reviewers appreciated the attention to an important topic. Based on the reviews, we are likely to accept this manuscript for publication, providing that you modify the manuscript according to the review recommendations.

Please prepare and submit your revised manuscript within 30 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. 

When you are ready to resubmit, please upload the following:

[1] A letter containing a detailed list of your responses to all review comments, and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out

[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).

Important additional instructions are given below your reviewer comments.

Thank you again for your submission to our journal. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

James R. Faeder

Associate Editor

PLOS Computational Biology

Douglas Lauffenburger

Deputy Editor

PLOS Computational Biology

***********************

A link appears below if there are any accompanying review attachments. If you believe any reviews to be missing, please contact ploscompbiol@plos.org immediately:

[LINK]

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: This is a well executed study covers a good mix of both computational and biological aspects of perturbation biology and is thus a good fit for PLoS Comp Bio. The authors introce an RPPA dataset covering the time resolved response of 126 markers to 54 drug combinations and train, validate and test a semi-mechanistic, datadriven ODE-type model on this dataset. They illustrate how the model rederives known drug-target interactions and introduce a measure of importance for individual nodes. The then go on to validate the importance of individual nodes and node combinations experimentally. The paper is well written and easy to follow, altough the some aspects of the study are not covered in sufficient detail, which I outlined below. Besides that I have noted a couple minor technical aspects that I think should be adressed.

Major Comments

1) Why did the authors decide to evaluate the model on the test set based on correlation in contrast to RSS in the previous analysis? I think looking at correlation is certainly valuable, but seems a bit inconsistent with the previous analysis. Both correlation and RSS have its pros and cons so analysing both would be valueable. Moreover it would be interesting to see the dependence of model predictions on drug/marker to complement the timepoint analysis.

2) How much is RSS biased based on these strong outliers with values < -4? Overall I am bit surprised by the number of datapoints with values smaller than -3 given that the number such datapoints in figure 2 seems to be very small.

3) How do the model predictions compare to simple null models for drug interaction such as bliss independence or highest single agent?

4) Regarding the optimization method I was wondering what the motivation was to crop interaction parameters during the optimization process. Using a gradient based method, this may lead to poor optimizer performance as it introduces discontinuities in the objective function. Why not just crop values after optimization is done? Its also unclear how derivatives are computed for cropped parameters?

5) I am not entirely sure what the authors are doing for the analysis presented in figure 5. The methods section is a bit vague on what was actually done. Were curves to compute EC50 values fit to simulation results? How were individual EC50 values extrapolated to combinations handled? I believe both methods description and results section need to be a bit more verbose about methods and rationale.

6) figure 6 needs better labels or colorcoding, it is very difficult to track what is what. Its unclear how observed maximal response values were computed (bottom/right). Is this log2 fold change on the cell growth node? Shouldnt normalized cell count be the result of cell growth and cell death? The methods section mentions something about "equivalencing", but I am not sure what this is supposed to mean and whether this is relevant to this figure.

7) Although I don't want to question it the equivalence of drugs with their target nodes in the model is nontrivial and deserves a bit more explanation/justification.

8) The level of technical documentation about the employed ode solver seems a bit shallow for a computational journal. It may be adequate to cite the recent preprint on Interpretable Machine Learning for Perturbation Biology by the authors as more detailed reference for the employed methodology.

The tensorflow optimization seems to use an explicit euler scheme without stepsize control for integration, which seems a bit unorthodox given the fact that many biological systems exhibit timescale separations which renders the underlying equations stiff. The model formulation and parameter boundaries may prevent this from happening, but it would be reassuring to validate the correctness of solutions with a state of the art ode solver with implicit integration scheme and adaptive step-size control such as scipy.integrate.solve_ivp (python) or ode15s (matlab)

9) In the discussion the authors claim. "Therefore, to avoid mis-interpretation of predictions, it is important to always study a set of obtained network models, and not only the single best solution". Although I agree with this notion, the authors don't seem to follow their own advice to closely, at least according to what was described in the paper. I would be good to know at which point uncertainty of predictions was evaluated in this study and when model averaging etc was performed. The only figure in which I could spot errorbars for multiple model realizations was in figure 3.

The authors should also note that just running multiple optimization runs does not warrant proper uncertainty analysis (c.f. "Uncertainty Analysis for Non-identifiable Dynamical Systems: Profile Likelihoods, Bootstrapping and More"). I am aware that proper uncertainty analysis using profile likelihood or bayesian methods is not realistic for models of the considered size and the authors approach is thus justified, but I think the paper should explicitely state this in the discussion.

Minor Comments:

Some supplemental figures are blurry in preview, I believe thats a matlab artefact with known workaround?

The importance of EGFR in the model highlights the importance of paracrine signaling in drug response. Accordingly, the authors may want to reconsider treating cell death/growth as terminal nodes in the model given that both may affect the degree of paracrine signaling. I don't think this needs to be adressed within the scope of this paper though.

There are still a couple of typos in the manuscript, the authors may want to recheck the manuscript.

Reviewer #2: In this manuscript titled Perturbation biology links temporal protein changes to drug responses in a melanoma cell line, Nyman et al. presented a completely data-driven approach of drug response prediction, building upon their previously developed framework of modeling the dynamic changes of cellular molecules under perturbations with a series of coupled nonlinear ordinary differential equations. They fitted the model using time-dependent RPPA data under various drug (combination) treatments in the A2058 melanoma cell line, applied the model to propose efficient novel drug treatments, and experimentally validated the proposed treatments. Overall, we find the work quite novel and interesting.

We have previously reviewed this manuscript during the authors’ submission to another journal. We are glad to see and appreciate that the authors have properly corrected some of the issues we brought up last time and largely improved the manuscript. However, we feel that some of our previous concerns have not been sufficiently addressed and therefore suggest a further revision. Here we re-discuss these issues as follows.

First, to recapitulate our previous major comments:

The authors made predictions on drug effects based on both cell growth and apoptosis, however it seems that the authors selected and tested particular drugs (and drug combinations) only based on the effect on cell growth (“normalized cell count” as in Figure 6). For completeness it is desirable to have additional validations specifically of the predicted drug effects on apoptosis (with caspase fluorescent assay). This can also be important since some perturbations (e.g. EGFR inhibition, Figure 5) were predicted to strongly suppress cell growth but only weakly trigger apoptosis. If validated it may testify the additional value of the model in revealing the context-specific mechanism of the drug action.

Most of the predictions the authors chose for validation are positive cases, i.e. where the treatment is predicted to be effective. While this can provide measures of sensitivity of the predictions, the specificity of the predictions is not well-accessed. It can be desired to test a few more cases of negative prediction to evaluate the false positive rate.

Essentially, our consideration underlying both of the above two comments is that experimentally testing the “negative cases”, although not interesting for application, is nonetheless important in thoroughly evaluating the method. Based on the limited one or two negative cases the authors have already tested, we have a concern that the method may suffer from low specificity. Moreover, testing the negative cases are not entirely biologically meaningless, e.g. in comment 1, if neutral effect on apoptosis can be validated, it will provide mechanistic insight for the action of the drug (since it does inhibit cell growth), and will help to demonstrate extra values of the authors’ method. In summary, we therefore think that it’s desirable to address at least one of the above comments. If additional biological experiments are not feasible, we think the authors may at least try to validate some of these by comparing to published data or literature and provide a proper discussion of these issues.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Computational Biology data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

Reviewer #2: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Figure Files:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Data Requirements:

Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example in PLOS Biology see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5.

Reproducibility:

To enhance the reproducibility of your results, PLOS recommends that you deposit laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see http://journals.plos.org/ploscompbiol/s/submission-guidelines#loc-materials-and-methods

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1007909.r003

Decision Letter 1

Douglas A Lauffenburger, James R Faeder

24 Apr 2020

Dear Dr Stein,

We are pleased to inform you that your manuscript 'Perturbation biology links temporal protein changes to drug responses in a melanoma cell line' has been provisionally accepted for publication in PLOS Computational Biology.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology. 

Best regards,

James R. Faeder

Associate Editor

PLOS Computational Biology

Douglas Lauffenburger

Deputy Editor

PLOS Computational Biology

***********************************************************

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: The authors have addressed all my concerns in an adequate fashion and I recommend the acceptance for publication.

Reviewer #2: We thank the authors for addressing our comments. The manuscript is now suitable to be published.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Computational Biology data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

Reviewer #2: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1007909.r004

Acceptance letter

Douglas A Lauffenburger, James R Faeder

17 Jun 2020

PCOMPBIOL-D-19-02028R1

Perturbation biology links temporal protein changes to drug responses in a melanoma cell line

Dear Dr Stein,

I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Laura Mallard

PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Table. The drugs applied in this study together with each drugs’ literature-described downstream target(s), the downstream target(s) measured in this study, and literature references.

    (XLSX)

    S2 Table. Comparison of literature-derived effect of single drugs on cell growth to model predictions.

    A comparison of drug sensitivity data from the A2058 melanoma cell line [28] is based on known target proteins of each drug from [28]. Positive values represent an increase in growth when the drug is given (highlighted in gray). The corresponding model-predicted drug pair values are from the diagonal of S8 Fig.

    (XLSX)

    S1 Fig. The simulated effect of each drug and drug pair on cell growth and apoptosis.

    After 72 hours of treatment drug treatment, cell count and apoptosis was measured using live-cell imaging (Incucyte). Color intensity reflects cell count (left) and apoptosis (right) relative to untreated cells (log2-normalized). The cellular response to single drugs in both high (2 × low dose) and low dose as well as to all pair drug combinations (in low doses) were measured. The low dose effect of the drugs are depicted in the first row/column, and the high dose in the diagonal.

    (EPS)

    S2 Fig. Drug responses of proteins and phospho-proteins.

    For each drug, the six (phospho-)proteins depicted are those that exhibited the largest magnitude of response to single drug perturbations. The data is ranked by the absolute median response over time.

    (EPS)

    S3 Fig. Temporal patterns of drug node dynamics.

    The means and standard deviations of the simulated drug nodes for the high dose (solid line) and low dose (dashed line) of several in silico inhibitors across the 101 created network models.

    (EPS)

    S4 Fig. Model selection and error estimation.

    Mean and standard deviation of computed correlations for the validation dataset as a function of the regularization parameter λ. In agreement with the previous analysis, the best predictive model is obtained for λ* = 3. Error bars indicate the standard deviation from 10 independent runs. Related to Fig 3.

    (EPS)

    S5 Fig. The correlation between model simulation and experimental data.

    Comparison between prediction and experiment for the last three measured time points, 24, 48, and 67 hours, (left) and for the last measured time point alone, 67 hours (right). This result, compared with Fig 3, suggests that the model predictions are less reliable in earlier time points, potentially due to the transient nature of the drug response and / or experimental noise at earlier time points in the data.

    (EPS)

    S6 Fig. The effect on predicted cell growth due to single node inhibition.

    All individual network model were simulated with different levels of input strength of an in silico inhibitor for each target present in the model. From these simulations, the mean effects on cell growth were extracted. Highlighted are the nodes that give at least 2% of the maximal effect. Inhibited nodes that give the desired effect (growth reduction) are depicted in blue, and inhibited nodes with the opposite effect (growth increase) are depicted in yellow.

    (EPS)

    S7 Fig. The effect on predicted apoptosis due to single node inhibition.

    All individual network models were simulated under the effect of different levels of the input strength of an in silico inhibitor for each target present in the model. From these simulations, the mean effects on apoptosis were extracted. Highlighted are the nodes that result in at least 2% of the maximal effect. Inhibited nodes that give the desired effect (increase in apoptosis) are depicted in red, inhibited nodes with the opposite effect (reduction in apoptosis) are depicted in yellow.

    (EPS)

    S8 Fig. Predicted effect of pairwise node inhibition on cell growth.

    The effect on cell growth is computed for each target combination averaged over 101 network model predictions. The complete set of predictions of pairwise inhibition of molecular nodes (proteins and phospho-proteins) is displayed in the heatmap. The diagonal elements represent predictions of single target inhibition. This heatmap contains the complete data, a subset of which was included in Fig 5.

    (EPS)

    S9 Fig. Predicted effect of pairwise node inhibition on apoptosis.

    The effect on apoptosis is computed for each target combination averaged over 101 network model predictions. The complete set of predictions of pairwise inhibition of molecular nodes (proteins and phospho-proteins) is displayed in the heatmap. The diagonal elements represent predictions of single target inhibition. This heatmap contains the complete data, a subset of which was included in Fig 5.

    (EPS)

    S10 Fig. Comparison between mean values for drug sensitivity from [28] and model-based predictions of the effect on cell growth.

    The means and standard deviations per target protein (data from S2 Table) Table are compared (left). The same mean values without errorbars (right).

    (EPS)

    S11 Fig. Predicted temporal patterns of growth and apoptosis.

    The mean predicted growth response (top row, blue line) and apoptosis response (bottom row, red line) as well as the standard deviation (gray area) from simulation of 101 created network models to the pairwise perturbations of EGFR-pY992/IRS1, EGFR-pY992/IRS1-pS636/639, and IRS1/IRS1-pS636/639.

    (EPS)

    Attachment

    Submitted filename: Response_to_Reviewers_PLoSCB_resub_1st_round.pdf

    Data Availability Statement

    All data is available from https://gitlab.liu.se/eliny61/perturbation-biology-time-resolved.


    Articles from PLoS Computational Biology are provided here courtesy of PLOS

    RESOURCES