Skip to main content
PLOS Computational Biology logoLink to PLOS Computational Biology
. 2021 Feb 12;17(2):e1008653. doi: 10.1371/journal.pcbi.1008653

TranSynergy: Mechanism-driven interpretable deep neural network for the synergistic prediction and pathway deconvolution of drug combinations

Qiao Liu 1, Lei Xie 1,2,3,4,*
Editor: Avner Schlessinger5
PMCID: PMC7906476  PMID: 33577560

Abstract

Drug combinations have demonstrated great potential in cancer treatments. They alleviate drug resistance and improve therapeutic efficacy. The fast-growing number of anti-cancer drugs has caused the experimental investigation of all drug combinations to become costly and time-consuming. Computational techniques can improve the efficiency of drug combination screening. Despite recent advances in applying machine learning to synergistic drug combination prediction, several challenges remain. First, the performance of existing methods is suboptimal. There is still much space for improvement. Second, biological knowledge has not been fully incorporated into the model. Finally, many models are lack interpretability, limiting their clinical applications. To address these challenges, we have developed a knowledge-enabled and self-attention transformer boosted deep learning model, TranSynergy, which improves the performance and interpretability of synergistic drug combination prediction. TranSynergy is designed so that the cellular effect of drug actions can be explicitly modeled through cell-line gene dependency, gene-gene interaction, and genome-wide drug-target interaction. A novel Shapley Additive Gene Set Enrichment Analysis (SA-GSEA) method has been developed to deconvolute genes that contribute to the synergistic drug combination and improve model interpretability. Extensive benchmark studies demonstrate that TranSynergy outperforms the state-of-the-art method, suggesting the potential of mechanism-driven machine learning. Novel pathways that are associated with the synergistic combinations are revealed and supported by experimental evidences. They may provide new insights into identifying biomarkers for precision medicine and discovering new anti-cancer therapies. Several new synergistic drug combinations have been predicted with high confidence for ovarian cancer which has few treatment options. The code is available at https://github.com/qiaoliuhub/drug_combination.

Author summary

The number of anti-cancer drugs has been consistently and quickly growing. They are mainly used as standardized mono-therapy. Drug combinations show substantial advantages over the anti-cancer mono-therapy. Cancer cells treated with the mono-therapy could later activate bypassing pathways and harbor drug resistances. Drug combinations can alleviate this issue by using a smaller doses of each anti-cancer drug or targeting multiple oncogenic pathways. However, the investigation of all anti-cancer drug combinations using experimental methods is costly and time-consuming. Machine learning provides an attractive solution to screening synergistic drug combinations, but it is a black-box and not easy to explain. We have developed a knowledge-enabled deep learning model, TranSynergy, to predict synergistic drug combinations and have demonstrated that our model outperformed other state-of-the-art methods. A novel Shapley Additive Gene Set Enrichment Analysis (SA-GSEA) method is introduced to improve the interpretability of the machine learning model. Using TransSynergy and SA-GSEA, we can deconvolute genes responsible for the synergistic drug combination, suggesting the potential of machine learning in developing precision anti-cancer therapy.

Introduction

With the advance of understandings of cancer cell disorders, more and more anti-cancer drugs have been designed and are under investigation. However, the "one drug, one target" drug monotherapy suffers limited efficiency due to inherent or acquired resistance [13]. Drug combination therapy is a more effective strategy to solve this challenging problem [48]. In addition to cancer, synergistic drug combinations have several successful applications in the treatment of other diseases, such as AIDS [9,10], and fungal or bacterial infections [1113]. Thus, the selection of efficient drug combination therapy for pathogens emerges as a compelling treatment strategy. Considering that the number of anti-cancer drugs has increased drastically, the possible combination of all these drugs has also become enormous [14,15]. The existing experimental method requires a large number of samples with different drug doses and cancer cells [16], thus it is infeasible to exhaust all the possible drug combinations. The computational method can be used to pre-select drug combinations with high synergy at lower cost and with more efficiency. The recent advancement of computational modeling, especially the deep learning technique, has dramatically increased the prediction power of computational models and has many promising applications in the biomedical field. The combination of computational and experimental methods can improve the effectiveness of the drug combination discovery.

The deep learning model has been shown to have superior performance to conventional machine learning algorithms in many biomedical applications [17,18]. High-quality experimental drug combination datasets are necessary for the success of deep learning. With the advancement of high throughput drug combination screening tests, the number of samples grows fast so that the data size limitation is considerably alleviated [1923]. DeepSynergy is a state-of-the-art deep learning-based prediction model for the prediction of the synergistic drug combination. It has been trained using the dataset released by Merck [24]. In addition to suboptimal performance, the issue for this model is that the interpretation of the model is limited by the way adopted to represent drugs and cell lines as well as the model architecture. For instance, it is not easy to associate the contribution or feature importance of drug descriptors, including toxophores, physicochemical properties, and fingerprints, with the mechanism of drug action in cells using a feedforward neural network [2527].

Recent studies have shown that gene-gene interaction and drug mechanism of action drew more and more attention in the synergistic drug combination study [28]. In addition, cell line drug sensitivity strongly depends on whether the drug directly or indirectly inhibits the essential gene of the cell line [22]. Thus, it is desirable to incorporate information from the gene-gene interaction network, gene dependency, and drug-target association into the computational model. To this end, we implemented a mechanism-driven and self-attention boosted deep learning model TranSynergy for the prediction of synergistic drug combinations and the deconvolution of cellular mechanisms contributing to them. We applied the random walk with restart algorithm (RWR) on a protein-protein interaction (PPI) network to infer a novel drug-target profile as the drug features. For the features of each cell line, we used gene expressions or gene dependencies profile. These mechanism related features make the model readily interpretable. Furthermore, we applied the self-attention transformer to encode the gene-gene interactions responsible for the synergistic drug combination. Attention mechanisms have been widely used in image processing and natural language processing [2931] and has shown promise in the predictive modeling of nucleic acid sequences [32]. When combining the network propagated drug target profile, gene dependency, and gene expression, TranSynergy outperforms the state-of-the-art model. To reveal novel genes that are associated with the synergistic drug combination from the learned biological relations in the TranSynergy model, we developed a novel Shapley Additive Gene Set Enrichment Analysis (SA-GSEA) based on SHAP [33]. The revealed novel gene set may serve as a patient-specific biomarker for precision medicine or drug targets for discovering new cancer combination therapy. We further applied the model for the prediction of novel synergistic drug combinations targeting cancers that have few treatment options. Given the emergence of next-generation sequencing technology, the transcriptome of patient-derived cancer cells can be readily obtained [34]. The TranSynergy can be used to predict and interpret the synergistic drug combination in distinct patient-derived cancer cells. Our study shows the potential of a mechanism-driven interpretable machine learning model in the application of personalized cancer treatment.

Results

TranSynergy architecture

TranSynergy is a transformer boosted deep learning model for the prediction of drug combination synergy. It includes three major components, input dimension reduction component, self-attention transformer component, and output fully connected component (Fig 1). The input features are composed of three vectors. Each vector has 2401 dimensions, forming a 2401x3 matrix. The first two columns are the representations of two drugs. The third column is the representation of the cell lines. The drug feature is a drug-target interaction profile on 2401 selected genes. The cell line feature includes the gene expression or gene dependency of 2401 genes. It is worth mentioning that the number of columns changes to 4 when both gene expression and dependency are used for the representation of cells. In the matrix, each row corresponds to a gene or protein, and encodes the impact of drug on the gene. The input dimension reduction component is a single-layer neural network to reduce the dimension of input. The modified transformer component takes the output from the first component and includes a scaled dot product based self-attention mechanism module. Here, the self-attention is applied to model gene-gene interactions. It is also worth noting that we customized the transformer model by removing the positional encoding layer since the order input feature dimensions should be irrelevant to the final prediction. Then the final output of the predicted synergy score comes from a fully connected neural network. The hyperparameters used in each component and training process are listed in S1 Table.

Fig 1. The architecture of TranSynergy.

Fig 1

The input features include vector representations of Drug A, Drug B, and cell line vector, respectively. The first input dimension reduction component reduces the input dimension from 2401x3 to 512x3. The second component is a scaled dot product self-attention transformer. The third component is a fully connected neural network. Be noted that input matrix dimension changes to 2401x4 when both gene expression and gene dependency profiles are used for cell line representation.

The input of our deep learning model includes the vector representations of two drug molecules in the drug combination and a cell line that is treated by the drug combination. One popular strategy is to use the physicochemical properties, toxicophores, or extended connected fingerprints (ECFP) that are derived from the molecular structure of chemical compounds for the representation of drugs [24]. The disadvantage of using the chemical structure as the feature is that it is not straightforward to establish a relationship between the physicochemical properties of drugs and the cellular mechanism of drug action. Biological representation based on drug-target interaction profile is an alternative strategy to infer the drug representation vector [35]. Drug target information that is collected from databases, including DrugBank and ChEMBL [36,37], are mainly proteins that can directly associate with drugs. We also need to encode the effect of drugs on down-stream non-target proteins and the whole biological system. The protein-protein interaction network is utilized to infer the drug response of the non-target proteins considering that the protein-protein interaction mediates information transmission in the biological system. We apply the RWR algorithm to simulate this network propagation process (Fig 2). Compared with the chemical information-based approaches for drug representation, the target-based representation of drug molecules has the following advantages. Firstly, drug target information is closely related to the cellular response to the drug treatment at both the molecular level and system level. Secondly, it makes it possible to explain the model output, drug combination synergy, in terms of the contribution of each protein or gene.

Fig 2. Illustration of genome-wide drug-target profile.

Fig 2

Observed drug target profile is processed with RWR to infer drug effects on both targets and non-target proteins.

Because drug-combination therapy has cell line-specific responses, another important component of inputs is the cell line vector representation. The DeepSynergy uses gene expression profile as the cell line vector representation [24]. We have applied a novel alternative strategy to infer cell line vector representation. The gene essentiality varies in the different cell lines and plays a critical role in anti-cancer drug sensitivity. Intuitively, drugs that affect essential proteins will cause the cell to have a more devastating response. Thus, we take gene essentiality or gene dependence into account and this information is one of the inputs into our model. This information has been collected by the BROAD institution using experimental methods [38]. They have performed a genome-wide loss-of-function screening with pooled RNAi or CRISPR library and have investigated the resulted cellular response.

TranSynergy outperforms state-of-the-art model and shows superior performance across different tissues

To evaluate the TranSynergy architecture on drug combination synergy score prediction, we tested our model in three different scenarios and compared it with the state-of-the-art model DeepSynergy.

We first assessed the model performance in the setting of leave drug combination out (Fig 3A). We performed a nested five-fold cross-validation using the same data for both models. We split the data into five folds based on drug combinations such that the drug combination in one fold would not appear in another fold. Four folds of data were used in the training and hyperparameter tuning stage and one fold was held out to test model performance. We performed the training/testing procedure five times with each fold held out. TranSynergy significantly outperformed the DeepSynergy model for the synergy score prediction (Table 1). The mean square error (MSE) of the TranSyerngy model was 232, significantly lower than that of DeepSynergy (p-value < 0.001). The accuracy improved by 3% and 5% when measured with Pearson’s correlation and Spearman’s correlation, respectively. We also showed that TranSynergy presented a superior performance in classification scenarios (Table 2). PR-AUC improved by 5.6% when comparing TranSynergy with DeepSynergy, although the improvement of ROC-AUC was by a small margin. It is noted that other genomic features of cell lines (mutations and CNVs) were not used in both TranSynergy and DeepSynergy. Considering that they could provide additional information relevant to the drug mode of action, especially for targeted therapy, it would be interesting to incorporate extra genomic features into TranSynergy in the future.

Fig 3. Illustration of three training/test dataset splitting scenarios.

Fig 3

Each data point is composed of two drugs and a cell line. The data samples in white and red are the training/validation dataset and test dataset, respectively. A) Leave drug combination out scenario. In this example, drug pairs, including drug1+drug2 and drug3+drug4, are colored red and held out as test dataset. B) Leave cell line out scenario. In this example, the cell line 2 is colored red and held out as test dataset. C) Leave drug out scenario. In this example, the drug 3 and all drug pairs including the drug 3 are colored red and held out as test dataset.

Table 1. Performance comparison of TranSynergy and DeepSynergy models in regression scenarios.

Model Drug features Cell line features MSE Spearman Correlation Pearson Correlation
TranSynergy Network propagated drug target profile Gene dependency + Gene expression 231 ± 21 0.730±0.016 0.746±0.018
DeepSynergy Physicochemical properties, toxicophores and fingerprints Gene expression 243 ± 17 0.698±0.016 0.726±0.015
Statistical significance (p-value) <0.0001 <0.0001 0.0001

Table 2. Performance comparison of TranSynergy with different cell line features to DeepSynergy model in classification scenarios.

Models Drug features Cell line features PR-AUC ROC-AUC
TranSynergy Network propagated drug target profile Gene dependency 0.625±0.020 0.908±0.011
Gene expression 0.618±0.016 0.901±0.009
Gene expression + Gene dependency 0.627±0.013 0.907±0.011
DeepSynergy Physicochemical properties, toxicophores and fingerprints Gene expression 0.594±0.021 0.900±0.009
Statistical significance*
(p-value)
<0.0001 0.0134

*TranSynergy that used gene expression+Gene dependency cell line features was used to compute the statistical significance of performance difference between TranSynergy and DeepSynergy.

We further explored the performance of the TranSynergy model on 6 different tissues: colon, breast, melanoma, ovarian, prostate, and lung. The average Pearson correlation coefficient between ground truth synergy scores and predicted scores were 0.741 for colon cancer cells, 0.742 for breast cancer cells, 0.705 for melanoma, 0.788 for ovarian cancer cells, 0.660 for prostate cancer cells, and 0.738 for lung cancer cells (Fig 4A and 4B). The Pearson correlation coefficients of the TranSynergy across cell lines ranged from 0.616 to 0.858, and the Spearman correlation coefficients ranged from 0.592 to 0.863 for (Fig 4C and 4D). Compared with DeepSynergy, the superior performance of TranSynergy was consistent across tissues except for the prostate. The prostate tissue had the lowest Pearson correlation coefficient among all tissues included in our data. VCAP, which is the only prostate cancer cell line, had the third-lowest Pearson correlation coefficient and Spearman correlation coefficient across all cell lines. The mediocre performance of the VCAP cell line could be due to that it is not closely similar to other cell lines. To reflect relationships between cell lines, we performed the t-SNE analysis to visualize the high dimensional vector representation of cell lines mapped to a 2D space with the first two t-SNE components (Fig 5). It showed that the VCAP cell line is located on the edge of one cluster, which indicates the VCAP cell line might have distinctive features. We also noticed that five cell lines were isolated from the remaining cell lines in the t-SNE plot. They included T47D and OCUBM from the breast tissue, HCT116 from the colon tissue, as well as UWB1289 and OVCAR3 from ovarian tissue. The variances of these three tissues were higher than those of the other three tissues. These larger variances among cell line features within these three tissues might contribute to the observed larger performance variances.

Fig 4. Tissue-specific and cell line-specific prediction performances of TranSynergy and DeepSynergy.

Fig 4

The top two panels (A and B) are tissue-specific performance with Pearson correlation and Spearman correlation as the metrics. The bottom two panels (C and D) are cell line-specific performance with Pearson correlation and Spearman correlation as the metrics.

Fig 5. Visualization of different cell lines with t-SNE analysis.

Fig 5

High dimensional cell line vector representations are projected into 2-D space with the first two t-SNE components. A) Different colors indicate assigned clusters of each cell line by the affinity propagation method. B) Different colors indicate different tissues of each cell line.

We additionally tested our models in the most challenging leave cell out scenario (Fig 3B). We used an affinity propagation method to divide the cell lines into 4 clusters based on cell line features. Since 28 cell lines are in the biggest cluster, we used them as the training dataset and held out the remaining cell lines as the test dataset. In this way, the cell lines in the test dataset are the most different from those in the training dataset. All the cell lines have a similar amount of data (S4 Table). The leave cell out is a robust method to test the generalization power of the models. As shown in Tables 3 and 4, the performance of the TranSynergy is slightly better than DeepSynergy but both are lower than the first scenario.

Table 3. Performance of TranSynergy and DeepSyenrgy on leave cell out scenario and leave drug out scenario for the regression task.

Model Leave cells out Leave drugs out
Spearman Correlation Pearson Correlation Spearman Correlation Pearson Correlation
TranSynergy 0.552 ± 0.013 0.513 ± 0.019 0.477 ± 0.040 0.457 ± 0.031
DeepSynergy 0.547 ± 0.010 0.501 ± 0.010 0.453 ± 0.025 0.437 ± 0.020
Statistical significance (p-value) 0.0716 0.0013 0.0032 0.0001

Table 4. Performance of TranSynergy and DeepSyenrgy on leave cell out scenario and leave drug out scenario for the classification task.

Model Leave cells out Leave drugs out
ROC-AUC PR-AUC ROC-AUC PR-AUC
TranSynergy 0.808 ± 0.017 0.372 ± 0.022 0.778 ± 0.032 0.406 ± 0.106
DeepSynergy 0.805 ± 0.017 0.360 ± 0.022 0.753 ± 0.030 0.378 ± 0.075
Statistical significance (p-value) 0.3328 0.0908 0.0034 0.1379

Similar to the leave cell out scenario, we split the drugs into 5 clusters with the affinity propagation method in leave drugs out scenario (Fig 6). We chose drugs in one cluster and held out all drug pairs including those selected drugs as the test set (Fig 3C). The remaining drug pairs are the training/validation set. In this scenario, TranSynergy significantly outperformed DeepSynergy, but both were lower than the first two scenarios (Tables 3 and 4).

Fig 6. Visualization of drug features with t-SNE analysis.

Fig 6

High dimensional ECFP representations are projected into 2D space with the first two t-SNE components. Different colors indicate assigned clusters of each drug by affinity propagation method.

Mechanism-driven drug representations are critical for superior model performance

To investigate whether the gene-gene interaction network propagation step is essential for the model performance, we compared the TranSynergy model trained with only observed drug target information with that trained with network propagated drug target information. The observed drug target vector is a binary 2401-dimension vector that indicates whether a drug physically binds to the corresponding protein. The cell line vector representation used in this comparison is based on the cell line dependency. Models trained with the network propagated drug target vector representation showed superior performance in comparison to those with observed drug target information (Table 5). Given that the model excluding cell line features had inferior performance in comparison to other models with cell line features, we concluded that the cell line feature contributed to the synergy score prediction (Table 5). We then investigated three different representations for cell lines, gene dependency, gene expression, or gene dependency and gene expression. When we represented cell lines with gene dependency profile only or both gene dependency and gene expression profiles, the model exhibited superior performance to that representing cells with gene expression only (Table 5).

Table 5. Ablation study of TranSynergy models.

Drug features Cell line features MSE Spearman Correlation Pearson Correlation
Network propagated drug target profile Gene dependency 232 ± 21 0.725±0.016 0.746±0.017
Gene expression 241 ± 19 0.721±0.021 0.741±0.025
Gene dependencies + Gene expression 231 ± 21 0.730±0.016 0.746±0.018
None 390 ± 11 0.505±0.007 0.400±0.012
Observed drug target profile Gene dependency 305 ± 34 0.634±0.035 0.635±0.048
Gene expression 307 ± 33 0.634±0.038 0.632±0.039
Gene dependencies + Gene expression 300 ± 31 0.645±0.036 0.634±0.038

Furthermore, we implemented two novel infrastructures, TranSynergyCI and TranSynergyGNF, to investigate whether the prediction could be improved further by integrating extra chemical information. TranSynergyCI united the physiochemical, toxophores, and fingerprint information of each drug. The difference between TranSynergy and TranSynergyCI is that the latter concatenated the aforementioned features with the output of the Transformer as the input of the last fully connected component (S1 Fig). It is worth mentioning that the DeepSynergy used the same features for the drug representation. TranSynergyGNF also integrated chemical information to the model input, however, it applied graph convolutional neural network (GCN) to extract graphical neural fingerprints from drug chemical structures (S2 Fig). More specifically, the GCN layer took drug’s atoms and bonds as inputs. Intuitively, the drug was represented as a graph structure comprised of atoms as the nodes of the graph, and bonds as the edges of the graph. The output of this GCN layer is used as the representation of each drug. The same with TranSyenrgyCI, the outputs of GCN layers are concatenated with the output of the transformer and served as the input of the last fully connected component. Both these two models showed the inferior performance to TranSynergy (S2 and S3 Tables). This illustrates adding extra chemical information may not be necessary to boost the performance any further in this experimental setup.

Shapley additive gene set enrichment analysis reveals novel oncogenic signatures associated with synergistic drug combinations

We propose the use of Shapley Additive Gene Set Enrichment Analysis (SA-GSEA) to determine the oncogenic signature and the underlying mechanisms associated with each synergistic drug combination (see methods for details). Shapley additive value is a powerful way to characterize the feature contribution to the final prediction in each instance [33]. Because each of the features in TranSynergy corresponds to a gene, the Shapley value essentially indicates the attribute of each gene to the synergy prediction. Moreover, genes could be ranked based on the Shapley value of each input gene feature. This makes it feasible to perform GSEA analysis. As an example, we explored seven samples, which had high synergistic scores and were accurately predicted (Table 6 and S3S9 Figs). These seven samples are from three cell lines, T47D, CAOV3, and MSTO-211H. For the synergistic drug pairs in T47D, BEZ-235 is one of the notable drugs. The BEZ-235 can inhibit PI3K/AKT/mTOR pathway, which is dysregulated in breast cancer [39]. In the SA-GSEA of cell line gene dependencies profile, the RAF oncogenic signature is significantly enriched. It has been established that the RAS/RAF/MEK/ERK pathway and PI3K/AKT/mTOR pathway are closely interconnected components and form feedback loops in breast cancer [40]. This may also suggest the underlying mechanism following the inhibition of the PI3K/AKT/mTOR pathway by BEZ-235 is related to the RAS/RAF/MEK/ERK pathway. The drug combination of ETOPOSIDE and MK-8669 is synergistic in the CAOV3 cancer cell line. ETOPOSIDE targets TOP2A and TOP2B, two topoisomerase components. The inhibition of them was believed to cause a DNA double-strand break [41]. MK-8669 targets mTOR, a crucial component in the PI3K/AKT/mTOR pathway. TUBB also shows surprising high importance in the SA-GSEA of MK-8669 drug targets (S3 Fig). This indicates that inhibition of mTOR or TUBB in combination with DNA damage can have a synergistic effect. In the SA-GSEA of cell line features, the JAK2 oncogenic signature is enriched in the ranked gene list based on their Shapley values. As suggested in other studies, the JAK2 oncogenic signature implies a novel pathway affected by this drug combination therapy [42]. For synergistic drug pairs in the MSTO-211H cell line, one of the drugs is DASATINIB. The DASATINIB targets BCRABL, SRC, Ephrins, and GFR. For the combination of PACLITAXEL and DASATINIB in the MSTO-211H cell line, the PACLITAXEL targets tubulin and microtubule-associated proteins. From the SA-GSEA of cell line genes, genes in the ESR1 and BMI1 oncogenic signature gene sets are top-ranked (S7 Fig) [43]. BMI1 was shown to be involved in the DNA-damage-repair process [44]. The inhibition of these pathways combining the restrain on mitosis can have high anti-tumor activities in Mesothelioma cancer [45]. For the combination of DASATINIB and ABT-888 in MSTO-211H, the SNF5 pathway is ranked on the top (S9 Fig). It is worth mentioning that both SNF5 and PARP1/2 play key roles in the DNA repair process [46,47].

Table 6. The most important oncogenic signatures revealed by SA-GSEA.

Drug combination (drug 1, drug 2) Cell line Targets with high SHAP values The most enriched gene set in the cell line
Drug 1 Drug 2
BEZ-235, ERLOTINIB T47D Targets in PI3K/AKT/mTOR pathway EGFR oncogenic signature for RAF overexpressed cells
BEZ-235,
MK-4827
T47D Targets in PI3K/AKT/mTOR pathway PARP1/2 oncogenic signature for RAF overexpressed cells
BEZ-235,
L-778123
T47D Targets in PI3K/AKT/mTOR pathway FPTase/ GGPTase oncogenic signature for RAF overexpressed cells
BEZ-235, DINACICLIB T47D Targets in PI3K/AKT/mTOR pathway CDKs oncogenic signature for RAF overexpressed cells
ETOPOSID, MK-8669 CAOV3 TOP2A/TOP2B mTOR oncogenic signature for JAK2 knock-down cells
PACLITAXEL, DASATINIB MSTO-211H Tubulin/microtubule associated proteins BCRABL, SRC, Ephrins and GFR oncogenic signature for ESR1- cells
ABT-888, DASATINIB MSTO-211H PARP1/2 BCRABL, SRC, Ephrins and GFR oncogenic signature for SNF5 know-down cells

Novel drug combination prediction

Although the majority of drug combinations composed of the drugs available in this study have been explored experimentally [20], there were still 3650 novel drug pairs that had yet to be tested. For these 3650 novel samples, Table 7 lists the top 10 pairs which have higher cell line-wise z-scores (S5 Table). It is noted that ETOPOSIDE, which targets TOP2A and TOP2B, exists in most drug pairs that are synergistic on the CAOV3 cancer cell line. It is consistent with the results of SA-GSEA in which the target combination of TOP2A, TOP2B, and TUBB shows high feature importance. Two samples show synergistic effects on OV90, another ovarian cancer cells. For ETOPOSIDE and VINORELBINE combination, drugs also target TOP2A, TOP2B, and TUBB. For PACLITAXEL and VINORELBINE, both drugs target tubulin or microtubule-associated proteins. DEXAMETHASONE exists in synergistic drug combinations in three cell lines, OCUBM, SKMES1, and KPL1.

Table 7. Examples of predicted synergistic novel drug pairs.

Drug combination Cell line Predicted Synergy score Z-score
DEXAMETHASONE and ETOPOSIDE CAOV3 94.520215 3.07259459
ETOPOSIDE and METFORMIN CAOV3 70.556332 2.21009253
ETOPOSIDE and SN-38 CAOV3 69.380475 2.16777138
DEXAMETHASONE and VINBLASTINE OCUBM 38.6095235 1.94930657
DEXAMETHASONE and PACLITAXEL SKMES1 37.9697725 1.91635083
ETOPOSIDE and VINORELBINE OV90 33.844834 1.89400207
PACLITAXEL and VINORELBINE OV90 32.5451685 1.81387084
ETOPOSIDE and VINBLASTINE CAOV3 56.799048 1.71494381
DEXAMETHASONE and VINBLASTINE KPL1 37.8084735 1.6825975
ETOPOSIDE and PACLITAXEL CAOV3 53.1252895 1.58271882

Discussion

In this study, we have presented a novel deep learning model TranSynergy for the synergy score prediction and mechanism deconvolution of drug combination cancer therapy. We have demonstrated that the network propagated drug target profile, which indicated both drug-target interaction and drug effect on non-targeted proteins, was crucial for the comprehensive representation of drug features. The use of drug target information as drug features in the model might cause limitation when novel drugs, whose molecular targets are unknown, are involved in the drug pair. However, this limitation can be partially overcome by using recently developed methods for the accurate prediction of drug targets, e.g. [48]. In addition, drugs used in the drug combinations are mainly existing drugs whose targets are largely known, at least, it is true in the dataset used in this study. Because the drug target provides crucial information on the mechanism of drug action, it not only improves the accuracy of the machine learning model for the prediction of drug combination synergy but also facilitates the model interpretation. We also show that gene essentiality in the cancer cells is a more desirable cell line representation than raw gene expression profile. Due to the limited data size, we only selected the minimum number of genes for the representation of drugs and cell lines, including only drug targets and annotated cancer-related genes. When too many input features are used, the model could suffer overfitting problems during the training stage due to the curse of dimensionality [49,50]. The performance can be further improved when more data are available or by utilizing unlabeled data sets so that more genes can be included in the representations. Furthermore, we demonstrated that the performance in cold-start problems, including leave cell out and drug out scenarios, is relatively lower than the leave drug combination out scenario. We believe that the relatively lower performance in the cold-start setting could be improved by applying pretraining strategies.

With more and more high throughput drug combination screening datasets becoming available, we have to figure out the inconsistency problem in both the experimental and quantification methods for the determination of drug combination effects. Firstly, the combinational spaces for the drug doses used to generate the drug dose matrix diversify in different studies. Secondly, several distinguished methods were proposed to calculate the expected drug combination effect from experimental data, such as combination index (CI)-isobologram equation [5153], Bliss independence (BI) method [5456], and Loewe Additivity (LA) model [57,58]. The calculated drug synergy scores are not the same when different quantification methods are utilized. To further improve the data quality, it is necessary to develop new methods to harmonize different data sets.

Deep learning-based computational models have made promising breakthroughs in many biomedical areas. Interpretation of deep learning models becomes critical to overcoming the skepticism of it being a black-box [59]. Recently, many explainable AI methods have been proposed, such as input perturbation methods [60,61], backpropagation based methods [62], and the calculation of SHAP values [33]. Attention-based approaches have been proposed to interpret the models using attention mechanisms. However, previous work has shown that the attention weights learned from self-attentions may not provide a meaningful explanation for the final prediction [63]. As a result, we carefully design the input features so that each feature dimension is corresponding to a gene and is easily interpretable. Through examining the SHAP values of gene-wise input feature, we extract the information regarding the effect of drug-target interactions and gene-gene interactions on the cancer cell response. Considering that evidence has shown synergistic effects could be attributed to either pathway cross-talking [64], nonoverlapping pathways [28], or same pathways, the explanation of drug combination synergy score learned with our model could come from either of instances. More conclusive statements for the underlying mechanisms might require additional experimental evidence, which is out of our work’s scope. However, we suggest that this potentially provides a method to generate testable hypothesis for studying the underlying mechanism of the drug combinations therapy.

Drug combinations can be a more efficient therapeutic strategy for cancer by targeting multiple proteins to defer the rapid emergence of drug resistance. The exploration of effective and synergistic drug combinations is hindered by the costly and time-consuming experimental preclinical investigation. Computational methods can be a cheaper and faster alternative approach to facilitate the development of drug combination therapy for cancer patients [65]. Nowadays, more emphasis is put on personalized medicine, which requires the consideration of the heterogeneity of each patient’s cancer types and genomics information to find more efficient therapy. Given a large amount of data for patients’ genome information, the development of an accurate and interpretable computational model is critical for the realization of personalized medicine. Mechanism-driven machine learning as demonstrated in this study is a promising direction to address challenges in precision medicine of combination therapy.

Methods

Drug combination synergy score dataset

The large-scale drug combination screening dataset was initially published by Merck [20] and preprocessed to calculate the synergy scores [24]. The screening test was performed with 38 drugs and 39 cancer cell lines. In total, 583 pairs of drug combinations were investigated and 23062 data points were collected. Among them, two drugs lack target information, and the gene dependency and gene expression data of four cells are not available. These drugs and cell lines were excluded from our data set. We finally selected 36 drugs that targeted at least one protein and 35 cell lines. The final dataset has 18553 data points and 523 pairs of drug combinations.

Drug representation

The observed drug target profile was collected from two datasets, Drugbank and ChEMBL, for the 36 drugs [36,37]. The observed drug target matrix is a 36*2401 binary matrix that indicates whether a drug targets a protein. The observed drug target profile was processed with the RWR algorithm to obtain a novel network propagated drug targets profile. We solved the RWR problem with the Fast RWR methods [66]. Following is the formal equation:

r=αWr+(1α)e,

α is a hyperparameter equal to 1—restart rate. To avoid information leaking and overfitting, we performed nested cross-validation for hyperparameter tuning and model selection. In each iteration, the data were split into training/validation/test sets. After the model was fitted using the training data set, optimal hyperparameter α in RWR was selected by the grid search over the validation set. The grid search determined the optimal α = 0.5 (S6 Table) over the validation set. W is the transition matrix denoting the transition probability between nodes. We used the protein-protein interaction network matrix from STRING as the transition probability matrix [67]. The edge weight between nodes is the protein-protein association confidence score. e is the seed vector, a drug target binary vector for each drug in this Eq 1, denotes that the drug is targeting the protein. r is the final probability distribution of each node in the network. Intuitively, r[i] denotes the effect of the drug on each protein. Drug’s atoms and edges information were extracted with rdkit from the drug’s SMILES string. We then used the multi-hot encoder method to represent each node and edge. The final representation includes 62 atoms and 6 bonds information.

Cell line representation

Gene expression profiles were downloaded from Harmonizome [68] and were initially collected in Broad Institute Cancer Cell Line Encyclopedia (CCLE) and Genomics of Drug Sensitivity in Cancer (GDSC) [6971]. Gene dependencies profile dataset is a combined dataset from the Broad Institute Project Achilles [7274] and Sanger CRISPR data from Wellcome Trust Sanger Institute and Broad Institute[75,76]. The dataset was downloaded from the DepMap portal [38]. The linear regression imputation method was used to fill in the missing value in the dataset with the MICE package [77].

Model architecture

Transformer

The transformer has been widely used and shown promising performance in many different applications, including natural language processing and image processing [2931]. It includes two major components, encoder and decoder. The input for both encoder and decoder has three matrices, query, key and values. Both encoder and decoder contain many sublayers. Each sublayer is comprised of three stages, attention mechanism, add & norm stage, and feed forward stage. The add & norm stage contains a residual structure and a layer normalization structure. The attention mechanism is the module to encode the interaction of different features with the following equation:

Attention(Q,K,V)=softmax(Q*KT/(d))*V

where Q, K, and V are the query, key and, values matrices, d is the dimension of hidden vector representation and softmax is the activation function.

Graph convolutional neural network

We applied the graph convolutional network (GCN) module to extract drug neural fingerprint in our TranSynergyGNF model (S2 Fig). Each drug is parsed as a graph, consisting of nodes that are the chemical atoms, and edges that are the chemical bonds. The GCN was used to extract and integrate the information of nodes and edges in each node’s neighborhood. The output hidden representation can be interpreted as a representation of chemical substructure within the multi-hop distances from each node. The output of each GCN layer can be calculated with the following equation:

GCN(Hl,A)=softmax(D1(A+I)HlWl),

where A is the adjacency matrix for the graph structure, I is an identity matrix, D is the diagonal node degree matrix of A+I, H is the hidden representation from the previous layer, W is the learnable parameter matrix and softmax is the activation function. To construct the hidden representation of the whole drug, we concatenate the hidden representation for each node to a single vector.

Model evaluation

Model evaluations were performed in three scenarios. a) In the leave drug combination out scenario, we applied the nested cross-validation method. The data was split into five folds such that the drug pair in one fold does not overlap with the drug pairs in other folds (Fig 3A). Because the drug combination of drug A with drug B and drug B with drug A should have the same drug combination synergy score, the size of training data was doubled by swapping the drug A and drug B. The model was iteratively trained/validated in four folds and then tested in the remaining one fold. b) We further tested our models in the most challenging leave cell out scenario. We used affinity propagation methods to group the cell lines into 4 clusters based on cell line features similarity (Fig 3B). Since 28 cell lines are in the biggest cluster, we used them as the training/validation dataset and held out the remaining cell lines in other clusters as the test dataset (Fig 5). In this way, the cell lines in the testing set are significantly different from those in the training/validation set. c) In the leave drug out scenario, we grouped the drugs into 5 clusters with the affinity propagation method based on their structural similarities (Figs 3C and 6). We chose drugs in one cluster and held out all drug pairs including these three drugs as the test set. The remaining drug pairs are the training/validation set. We performed three tests with keeping out the drugs in each of the three smallest clusters. In each test, the ratio of samples in the training and the testing data is around 4:1.

In all the above scenarios, the MSE was used as the training loss. We then investigated the Spearman correlation and Pearson correlation between predicted data and ground-truths. ROC-AUC and PR-AUC were also used to evaluate the model performance for the classification task. Drug pairs with synergy scores larger than 30 were classified as positive pairs and those with lower synergy scores were classified as negative pairs.

2.5 Shapley additive gene set enrichment analysis

We used the GradientExplainer and DeepExplainer in the SHAP package to calculate Shapley value that characterizes the contribution of each input feature to the final prediction [33]. We used k-means to summarize the total dataset as the background dataset. The final Shapley value of each input feature was the average value of 10 tests for each sample data. We then ranked genes based on the Shapley values for the gene-wise features of cell line representation and conducted gene set enrichment analysis to unveil the enriched gene sets with GSEA [78,79].

Supporting information

S1 Table. The hyperparameters of TranSynergy model with the best performance.

(XLSX)

S2 Table. Performance of TranSynergy integrated with Chemical Information (TranSynergyCI).

(XLSX)

S3 Table. Performance of TranSynergy integrated with Graphical Neural Fingerprint (TranSynergyGNF).

(XLSX)

S4 Table. Sample amounts in each cell line.

(XLSX)

S5 Table. Predicted drug pairs synergy score and z-scores.

(XLSX)

S6 Table. Grid search results of the restart rates α, a hyperparameter in random walk with restart for network propagated drug target features inference, over the validation set.

(XLSX)

S1 Fig. The architecture of TranSynergy integrated with Chemical Information (TranSynergyCI).

Extra drug features, including ECFP, physico-chemical and toxicophore features were concatenated with the output of the Transformer and input into the last fully connected component. These chemical features are the same as those used in deepSynergy.

(PDF)

S2 Fig. The architecture of TranSynergy integrated with Graphical Neural Fingerprint (TranSynergyGNF).

Extra drug features, graphical neural fingerprint for drug A and drug B extracted with a GCN layer, were concatenated with the output of the Transformer and input into the last fully connected component.

(PDF)

S3 Fig. Shapley Additive Gene Set Enrichment Analysis result of the prediction model for Etoposide and MK-8669 drug pair.

The left panel shows the SHAP values of drug targets, while the right panel shows the SHAP values of 20 genes with the most significant impact.

(PDF)

S4 Fig. Shapley Additive Gene Set Enrichment Analysis result of the prediction model for Erlotinib and BEZ-235 drug pair.

The left panel shows the SHAP values of drug targets, while the right panel shows the SHAP values of 20 genes with the most significant impact.

(PDF)

S5 Fig. Shapley Additive Gene Set Enrichment Analysis result of the prediction model for MK-4827 and BEZ-235 drug pair.

The left panel shows the SHAP values of drug targets, while the right panel shows the SHAP values of 20 genes with the most significant impact.

(PDF)

S6 Fig. Shapley Additive Gene Set Enrichment Analysis result of the prediction model for L778123 and BEZ-235 drug pair.

The left panel shows the SHAP values of drug targets, while the right panel shows the SHAP values of 20 genes with the most significant impact.

(PDF)

S7 Fig. Shapley Additive Gene Set Enrichment Analysis result of the prediction model for Paclitaxel and Dasatinib drug pair.

The left panel shows the SHAP values of drug targets, while the right panel shows the SHAP values of 20 genes with the most significant impact.

(PDF)

S8 Fig. Shapley Additive Gene Set Enrichment Analysis result of the prediction model for BEZ-235 and Dinaciclib drug pair.

The left panel shows the SHAP values of drug targets, while the right panel shows the SHAP values of 20 genes with the most significant impact.

(PDF)

S9 Fig. Shapley Additive Gene Set Enrichment Analysis result of the prediction model for Dasatinib and ABT-888 drug pair.

The left panel shows the SHAP values of drug targets, while the right panel shows the SHAP values of 20 genes with the most significant impact.

(PDF)

Data Availability

All scripts and data are available from https://github.com/qiaoliuhub/drug_combination. Other relevant results data are within the manuscript and its Supporting Information files.

Funding Statement

This work was supported by Grant Number R01GM122845 from the National Institute of General Medical Sciences (NIGMS, to LX) and Grant Number R01AD057555 of National Institute on Aging of the National Institute of Health (NIH, to LX and QL) as well as CUNY High Performance Computing Center (to LX). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Mansoori B, Mohammadi A, Davudian S, Shirjang S, Baradaran B. The Different Mechanisms of Cancer Drug Resistance: A Brief Review. Adv Pharm Bull. 2017;7(3):339–48. Epub 2017/10/27. 10.15171/apb.2017.041 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Housman G, Byler S, Heerboth S, Lapinska K, Longacre M, Snyder N, et al. Drug resistance in cancer: an overview. Cancers (Basel). 2014;6(3):1769–92. Epub 2014/09/10. 10.3390/cancers6031769 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Nikolaou M, Pavlopoulou A, Georgakilas AG, Kyrodimos E. The challenge of drug resistance in cancer treatment: a current overview. Clin Exp Metastasis. 2018;35(4):309–18. Epub 2018/05/26. 10.1007/s10585-018-9903-0 . [DOI] [PubMed] [Google Scholar]
  • 4.Bayat Mokhtari R, Homayouni TS, Baluch N, Morgatskaya E, Kumar S, Das B, et al. Combination therapy in combating cancer. Oncotarget. 2017;8(23):38022–43. Epub 2017/04/15. 10.18632/oncotarget.16723 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Raghavendra NM, Pingili D, Kadasi S, Mettu A, Prasad S. Dual or multi-targeting inhibitors: The next generation anticancer agents. Eur J Med Chem. 2018;143:1277–300. Epub 2017/11/12. 10.1016/j.ejmech.2017.10.021 . [DOI] [PubMed] [Google Scholar]
  • 6.Ramsay RR, Popovic-Nikolic MR, Nikolic K, Uliassi E, Bolognesi ML. A perspective on multi-target drug discovery and design for complex diseases. Clin Transl Med. 2018;7(1):3. Epub 2018/01/18. 10.1186/s40169-017-0181-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Fitzgerald JB, Schoeberl B, Nielsen UB, Sorger PK. Systems biology and combination therapy in the quest for clinical efficacy. Nat Chem Biol. 2006;2(9):458–66. Epub 2006/08/22. 10.1038/nchembio817 . [DOI] [PubMed] [Google Scholar]
  • 8.Lehar J, Krueger AS, Avery W, Heilbut AM, Johansen LM, Price ER, et al. Synergistic drug combinations tend to improve therapeutically relevant selectivity. Nat Biotechnol. 2009;27(7):659–66. Epub 2009/07/08. 10.1038/nbt.1549 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Henkel J. Attacking AIDS with a ’cocktail’ therapy? FDA Consum. 1999;33(4):12–7. Epub 1999/08/12. . [PubMed] [Google Scholar]
  • 10.Murphy EM, Jimenez HR, Smith SM. Current clinical treatments of AIDS. Adv Pharmacol. 2008;56:27–73. Epub 2007/12/19. 10.1016/S1054-3589(07)56002-3 . [DOI] [PubMed] [Google Scholar]
  • 11.Chen X, Ren B, Chen M, Wang Q, Zhang L, Yan G. NLLSS: Predicting Synergistic Drug Combinations Based on Semi-supervised Learning. PLoS Comput Biol. 2016;12(7):e1004975. Epub 2016/07/16. 10.1371/journal.pcbi.1004975 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Groll AH, Tragiannidis A. Recent advances in antifungal prevention and treatment. Semin Hematol. 2009;46(3):212–29. Epub 2009/06/25. 10.1053/j.seminhematol.2009.03.003 . [DOI] [PubMed] [Google Scholar]
  • 13.Tamma PD, Cosgrove SE, Maragakis LL. Combination therapy for treatment of infections with gram-negative bacteria. Clin Microbiol Rev. 2012;25(3):450–70. Epub 2012/07/06. 10.1128/CMR.05041-11 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Ali R, Mirza Z, Ashraf GM, Kamal MA, Ansari SA, Damanhouri GA, et al. New anticancer agents: recent developments in tumor therapy. Anticancer Res. 2012;32(7):2999–3005. Epub 2012/07/04. . [PubMed] [Google Scholar]
  • 15.Falzone L, Salomone S, Libra M. Evolution of Cancer Pharmacological Treatments at the Turn of the Third Millennium. Front Pharmacol. 2018;9:1300. Epub 2018/11/30. 10.3389/fphar.2018.01300 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Mott BT, Eastman RT, Guha R, Sherlach KS, Siriwardana A, Shinn P, et al. High-throughput matrix screening identifies synergistic and antagonistic antimalarial drug combinations. Sci Rep. 2015;5:13891. Epub 2015/09/26. 10.1038/srep13891 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Mater AC, Coote ML. Deep Learning in Chemistry. J Chem Inf Model. 2019;59(6):2545–59. Epub 2019/06/14. 10.1021/acs.jcim.9b00266 . [DOI] [PubMed] [Google Scholar]
  • 18.Kalinin AA, Higgins GA, Reamaroon N, Soroushmehr S, Allyn-Feuer A, Dinov ID, et al. Deep learning in pharmacogenomics: from gene regulation to patient stratification. Pharmacogenomics. 2018;19(7):629–50. Epub 2018/04/27. 10.2217/pgs-2018-0008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Bulusu KC, Guha R, Mason DJ, Lewis RP, Muratov E, Kalantar Motamedi Y, et al. Modelling of compound combination effects and applications to efficacy and toxicity: state-of-the-art, challenges and perspectives. Drug Discov Today. 2016;21(2):225–38. Epub 2015/09/12. 10.1016/j.drudis.2015.09.003 . [DOI] [PubMed] [Google Scholar]
  • 20.O’Neil J, Benita Y, Feldman I, Chenard M, Roberts B, Liu Y, et al. An Unbiased Oncology Compound Screen to Identify Novel Combination Strategies. Mol Cancer Ther. 2016;15(6):1155–62. Epub 2016/03/18. 10.1158/1535-7163.MCT-15-0843 . [DOI] [PubMed] [Google Scholar]
  • 21.Li H, Li T, Quang D, Guan Y. Network Propagation Predicts Drug Synergy in Cancers. Cancer Res. 2018;78(18):5446–57. Epub 2018/07/29. 10.1158/0008-5472.CAN-18-0740 . [DOI] [PubMed] [Google Scholar]
  • 22.Menden MP, Wang D, Mason MJ, Szalai B, Bulusu KC, Guan Y, et al. Community assessment to advance computational prediction of cancer drug combinations in a pharmacogenomic screen. Nat Commun. 2019;10(1):2674. Epub 2019/06/19. 10.1038/s41467-019-09799-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Holbeck SL, Camalier R, Crowell JA, Govindharajulu JP, Hollingshead M, Anderson LW, et al. The National Cancer Institute ALMANAC: A Comprehensive Screening Resource for the Detection of Anticancer Drug Pairs with Enhanced Therapeutic Activity. Cancer Res. 2017;77(13):3564–76. Epub 2017/04/28. 10.1158/0008-5472.CAN-17-0489 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Preuer K, Lewis RPI, Hochreiter S, Bender A, Bulusu KC, Klambauer G. DeepSynergy: predicting anti-cancer drug synergy with Deep Learning. Bioinformatics. 2018;34(9):1538–46. Epub 2017/12/19. 10.1093/bioinformatics/btx806 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Challen R, Denny J, Pitt M, Gompels L, Edwards T, Tsaneva-Atanasova K. Artificial intelligence, bias and clinical safety. BMJ Qual Saf. 2019;28(3):231–7. Epub 2019/01/14. 10.1136/bmjqs-2018-008370 PubMed Central PMCID: PMC6560460. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Benitez JM, Castro JL, Requena I. Are artificial neural networks black boxes? IEEE Trans Neural Netw. 1997;8(5):1156–64. Epub 1997/01/01. 10.1109/72.623216 . [DOI] [PubMed] [Google Scholar]
  • 27.Castelvecchi D. Can we open the black box of AI? Nature. 2016;538(7623):20–3. Epub 2016/10/07. 10.1038/538020a . [DOI] [PubMed] [Google Scholar]
  • 28.Cheng F, Kovacs IA, Barabasi AL. Network-based prediction of drug combinations. Nat Commun. 2019;10(1):1197. Epub 2019/03/15. 10.1038/s41467-019-09186-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, et al. Attention Is All You Need. 2017. [Google Scholar]
  • 30.Luong M-T, Pham H, Manning CD. Effective Approaches to Attention-based Neural Machine Translation. 2015. [Google Scholar]
  • 31.Bahdanau D, Cho K, Bengio Y. Neural Machine Translation by Jointly Learning to Align and Translate. 2014. [Google Scholar]
  • 32.Liu Q, He D, Xie L. Prediction of off-target specificity and cell-specific fitness of CRISPR-Cas System using attention boosted deep learning and network-based gene feature. PLoS Comput Biol. 2019;15(10):e1007480. Epub 2019/10/29. 10.1371/journal.pcbi.1007480 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Lundberg S, Lee S-I. A Unified Approach to Interpreting Model Predictions. 2017. [Google Scholar]
  • 34.Buermans HP, den Dunnen JT. Next generation sequencing technology: Advances and applications. Biochim Biophys Acta. 2014;1842(10):1932–41. Epub 2014/07/06. 10.1016/j.bbadis.2014.06.015 . [DOI] [PubMed] [Google Scholar]
  • 35.Ayed M, Lim H, Xie L. Biological representation of chemicals using latent target interaction profile. BMC Bioinformatics. 2019;20(Suppl 24):674. Epub 2019/12/22. 10.1186/s12859-019-3241-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Wishart DS, Feunang YD, Guo AC, Lo EJ, Marcu A, Grant JR, et al. DrugBank 5.0: a major update to the DrugBank database for 2018. Nucleic Acids Res. 2018;46(D1):D1074–D82. Epub 2017/11/11. 10.1093/nar/gkx1037 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Gaulton A, Hersey A, Nowotka M, Bento AP, Chambers J, Mendez D, et al. The ChEMBL database in 2017. Nucleic Acids Res. 2017;45(D1):D945–D54. Epub 2016/12/03. 10.1093/nar/gkw1074 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.McFarland JM, Ho ZV, Kugener G, Dempster JM, Montgomery PG, Bryan JG, et al. Improved estimation of cancer dependencies from large-scale RNAi screens using model-based normalization and data integration. Nat Commun. 2018;9(1):4610. Epub 2018/11/06. 10.1038/s41467-018-06916-5 PubMed Central PMCID: PMC6214982. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Verret B, Cortes J, Bachelot T, Andre F, Arnedos M. Efficacy of PI3K inhibitors in advanced breast cancer. Ann Oncol. 2019;30(Suppl_10):x12–x20. Epub 2019/12/21. 10.1093/annonc/mdz381 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Saini KS, Loi S, de Azambuja E, Metzger-Filho O, Saini ML, Ignatiadis M, et al. Targeting the PI3K/AKT/mTOR and Raf/MEK/ERK pathways in the treatment of breast cancer. Cancer Treat Rev. 2013;39(8):935–46. Epub 2013/05/07. 10.1016/j.ctrv.2013.03.009 . [DOI] [PubMed] [Google Scholar]
  • 41.Nitiss JL. Targeting DNA topoisomerase II in cancer chemotherapy. Nat Rev Cancer. 2009;9(5):338–50. Epub 2009/04/21. 10.1038/nrc2607 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Bartalucci N, Tozzi L, Bogani C, Martinelli S, Rotunno G, Villeval JL, et al. Co-targeting the PI3K/mTOR and JAK2 signalling pathways produces synergistic activity against myeloproliferative neoplasms. J Cell Mol Med. 2013;17(11):1385–96. Epub 2013/11/19. 10.1111/jcmm.12162 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Bhattacharyya J, Mihara K, Yasunaga S, Tanaka H, Hoshi M, Takihara Y, et al. BMI-1 expression is enhanced through transcriptional and posttranscriptional regulation during the progression of chronic myeloid leukemia. Ann Hematol. 2009;88(4):333–40. Epub 2008/09/11. 10.1007/s00277-008-0603-8 . [DOI] [PubMed] [Google Scholar]
  • 44.Ginjala V, Nacerddine K, Kulkarni A, Oza J, Hill SJ, Yao M, et al. BMI1 is recruited to DNA breaks and contributes to DNA damage-induced H2A ubiquitination and repair. Mol Cell Biol. 2011;31(10):1972–82. Epub 2011/03/09. 10.1128/MCB.00981-10 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Sekido Y. Molecular pathogenesis of malignant mesothelioma. Carcinogenesis. 2013;34(7):1413–9. Epub 2013/05/17. 10.1093/carcin/bgt166 . [DOI] [PubMed] [Google Scholar]
  • 46.Ribeiro-Silva C, Vermeulen W, Lans H. SWI/SNF: Complex complexes in genome stability and cancer. DNA Repair (Amst). 2019;77:87–95. Epub 2019/03/22. 10.1016/j.dnarep.2019.03.007 . [DOI] [PubMed] [Google Scholar]
  • 47.Javle M, Curtin NJ. The role of PARP in DNA repair and its therapeutic exploitation. Br J Cancer. 2011;105(8):1114–22. Epub 2011/10/13. 10.1038/bjc.2011.382 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Cai T, Lim H, Abbu KA, Qiu Y, Nussinov R, Xie L. A deep learning framework for elucidating whole-genome chemical interaction space. bioRxiv. 2020:2020.10.08.332346. 10.1101/2020.10.08.332346 [DOI] [Google Scholar]
  • 49.Clarke R, Ressom HW, Wang A, Xuan J, Liu MC, Gehan EA, et al. The properties of high-dimensional data spaces: implications for exploring gene and protein expression data. Nat Rev Cancer. 2008;8(1):37–49. Epub 2007/12/22. 10.1038/nrc2294 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Danaee P, Ghaeini R, Hendrix DA. A Deep Learning Approach for Cancer Detection and Relevant Gene Identification. Pac Symp Biocomput. 2017;22:219–29. Epub 2016/11/30. 10.1142/9789813207813_0022 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Chou T-C, Talalay P. Analysis of combined drug effects: a new look at a very old problem. Trends in Pharmacological Sciences. 1983;4:450–4. 10.1016/0165-6147(83)90490-X. [DOI] [Google Scholar]
  • 52.Chou TC, Talalay P. Quantitative analysis of dose-effect relationships: the combined effects of multiple drugs or enzyme inhibitors. Adv Enzyme Regul. 1984;22:27–55. Epub 1984/01/01. 10.1016/0065-2571(84)90007-4 . [DOI] [PubMed] [Google Scholar]
  • 53.Chou TC. Drug combination studies and their synergy quantification using the Chou-Talalay method. Cancer Res. 2010;70(2):440–6. Epub 2010/01/14. 10.1158/0008-5472.CAN-09-1947 . [DOI] [PubMed] [Google Scholar]
  • 54.Bliss CI. THE TOXICITY OF POISONS APPLIED JOINTLY1. Annals of Applied Biology. 1939;26(3):585–615. 10.1111/j.1744-7348.1939.tb06990.x [DOI] [Google Scholar]
  • 55.Berenbaum MC. A method for testing for synergy with any number of agents. J Infect Dis. 1978;137(2):122–30. Epub 1978/02/01. 10.1093/infdis/137.2.122 . [DOI] [PubMed] [Google Scholar]
  • 56.Greco WR, Bravo G, Parsons JC. The search for synergy: a critical review from a response surface perspective. Pharmacol Rev. 1995;47(2):331–85. Epub 1995/06/01. . [PubMed] [Google Scholar]
  • 57.Loewe S, Muischnek H. Über Kombinationswirkungen. Naunyn-Schmiedebergs Archiv für experimentelle Pathologie und Pharmakologie. 1926;114(5):313–26. 10.1007/BF01952257 [DOI] [Google Scholar]
  • 58.Loewe S. The problem of synergism and antagonism of combined drugs. Arzneimittelforschung. 1953;3(6):285–90. Epub 1953/06/01. . [PubMed] [Google Scholar]
  • 59.Ching T, Himmelstein DS, Beaulieu-Jones BK, Kalinin AA, Do BT, Way GP, et al. Opportunities and obstacles for deep learning in biology and medicine. J R Soc Interface. 2018;15(141). Epub 2018/04/06. 10.1098/rsif.2017.0387 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Heaton J, McElwee S, Fraley J, Cannady J, editors. Early stabilizing feature importance for TensorFlow deep neural networks. 2017 International Joint Conference on Neural Networks (IJCNN); 2017 14–19 May 2017.
  • 61.Zhou J, Troyanskaya OG. Predicting effects of noncoding variants with deep learning-based sequence model. Nat Methods. 2015;12(10):931–4. Epub 2015/08/25. 10.1038/nmeth.3547 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Springenberg JT, Dosovitskiy A, Brox T, Riedmiller M. Striving for Simplicity: The All Convolutional Net. 2014. [Google Scholar]
  • 63.Jain S, Wallace BC. Attention is not Explanation. 2019. [Google Scholar]
  • 64.Jaeger S, Igea A, Arroyo R, Alcalde V, Canovas B, Orozco M, et al. Quantification of Pathway Cross-talk Reveals Novel Synergistic Drug Combinations for Breast Cancer. Cancer Res. 2017;77(2):459–69. Epub 2016/11/24. 10.1158/0008-5472.CAN-16-0097 . [DOI] [PubMed] [Google Scholar]
  • 65.Yin Z, Deng Z, Zhao W, Cao Z. Searching Synergistic Dose Combinations for Anticancer Drugs. Front Pharmacol. 2018;9:535. Epub 2018/06/07. 10.3389/fphar.2018.00535 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Tong H, Faloutsos C, Pan J, editors. Fast Random Walk with Restart and Its Applications. Sixth International Conference on Data Mining (ICDM’06); 2006 18–22 Dec. 2006.
  • 67.Szklarczyk D, Gable AL, Lyon D, Junge A, Wyder S, Huerta-Cepas J, et al. STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res. 2019;47(D1):D607–D13. Epub 2018/11/27. 10.1093/nar/gky1131 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Rouillard AD, Gundersen GW, Fernandez NF, Wang Z, Monteiro CD, McDermott MG, et al. The harmonizome: a collection of processed datasets gathered to serve and mine knowledge about genes and proteins. Database (Oxford). 2016;2016. Epub 2016/07/05. 10.1093/database/baw100 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Barretina J, Caponigro G, Stransky N, Venkatesan K, Margolin AA, Kim S, et al. The Cancer Cell Line Encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature. 2012;483(7391):603–7. Epub 2012/03/31. 10.1038/nature11003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Yang W, Soares J, Greninger P, Edelman EJ, Lightfoot H, Forbes S, et al. Genomics of Drug Sensitivity in Cancer (GDSC): a resource for therapeutic biomarker discovery in cancer cells. Nucleic Acids Res. 2013;41(Database issue):D955–61. Epub 2012/11/28. 10.1093/nar/gks1111 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Iorio F, Knijnenburg TA, Vis DJ, Bignell GR, Menden MP, Schubert M, et al. A Landscape of Pharmacogenomic Interactions in Cancer. Cell. 2016;166(3):740–54. Epub 2016/07/12. 10.1016/j.cell.2016.06.017 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Dempster JM, Rossen J, Kazachkova M, Pan J, Kugener G, Root DE, et al. Extracting Biological Insights from the Project Achilles Genome-Scale CRISPR Screens in Cancer Cell Lines. bioRxiv. 2019:720243. 10.1101/720243 [DOI] [Google Scholar]
  • 73.Broad D. DepMap 20Q1 Public2020.
  • 74.Meyers RM, Bryan JG, McFarland JM, Weir BA, Sizemore AE, Xu H, et al. Computational correction of copy number effect improves specificity of CRISPR-Cas9 essentiality screens in cancer cells. Nat Genet. 2017;49(12):1779–84. Epub 2017/10/31. 10.1038/ng.3984 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Broad D. Project SCORE processed with CERES2019.
  • 76.Behan FM, Iorio F, Picco G, Goncalves E, Beaver CM, Migliardi G, et al. Prioritization of cancer therapeutic targets using CRISPR-Cas9 screens. Nature. 2019;568(7753):511–6. Epub 2019/04/12. 10.1038/s41586-019-1103-9 . [DOI] [PubMed] [Google Scholar]
  • 77.van Buuren S, Groothuis-Oudshoorn K. mice: Multivariate Imputation by Chained Equations in R. 2011. 2011;45(3):67. Epub 2011-12-12. 10.18637/jss.v045.i03 [DOI] [Google Scholar]
  • 78.Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A. 2005;102(43):15545–50. Epub 2005/10/04. 10.1073/pnas.0506580102 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Liberzon A, Subramanian A, Pinchback R, Thorvaldsdottir H, Tamayo P, Mesirov JP. Molecular signatures database (MSigDB) 3.0. Bioinformatics. 2011;27(12):1739–40. Epub 2011/05/07. 10.1093/bioinformatics/btr260 [DOI] [PMC free article] [PubMed] [Google Scholar]
PLoS Comput Biol. doi: 10.1371/journal.pcbi.1008653.r001

Decision Letter 0

Feilim Mac Gabhann, Avner Schlessinger

22 Sep 2020

Dear Dr. Xie,

Thank you very much for submitting your manuscript "TranSynergy: Mechanism-Driven Interpretable Deep Neural Network for the Synergistic Prediction and Pathway Deconvolution of Drug Combinations" for consideration at PLOS Computational Biology.

As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. In light of the reviews (below this email), we would like to invite the resubmission of a significantly-revised version that takes into account the reviewers' comments.

We cannot make any decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript is also likely to be sent to reviewers for further evaluation.

When you are ready to resubmit, please upload the following:

[1] A letter containing a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).

Important additional instructions are given below your reviewer comments.

Please prepare and submit your revised manuscript within 60 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. Please note that revised manuscripts received after the 60-day due date may require evaluation and peer review similar to newly submitted manuscripts.

Thank you again for your submission. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Avner Schlessinger

Associate Editor

PLOS Computational Biology

Feilim Mac Gabhann

Editor-in-Chief

PLOS Computational Biology

***********************

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: In this paper, the authors proposed a novel knowledge-enabled and self-attention boosted deep learning model to investigate synergistic drug combinations. They used random walk with restart (RWR) algorithm on a PPI network (drug features) and PPI derived NetExpress scores (gene dependency) as input to their model. They also perform gene set enrichment analysis to reveal novel biological pathways associated with the drug combinations using genes ranked based on the shapely additive value. TranSynergy has three components in its architecture, feature reduction, self-attention transformer that models gene-gene interaction and a fully connected neural network for final drug combination synergy score prediction. They compared their algorithm to another one named “Deepsynergy” and used the same train test split to attain complete compatibility. Their algorithm shows significant improvement in performance over DeepSynergy. The paper is well written and organized. However, the reviewer has the following concerns.

Major:

(1) Alpha is an important hyperparameter in RWR method. The authors show the impact of this parameter on the prediction results in Table S3 and use alpha = 0.5 in the experiments. Is this an 'overfitting'? The authors used the best results without tuning this hyperparameter to compare it to the other baseline methods. A detailed description is needed.

(2) The authors used RWR on a PPI network to represent the drug features instead of chemical information. It will be interesting to see the prediction results by integrating both drug features and chemical information in the TranSynergy model.

(3) It is also better to show the baseline performance in Figure 3, since the improvement from TranSynergy can be cancer specific.

Minor:

(1) It is better to show that whether the improvement by TranSynergy in Table 1 and Table 2 are statistical significance.

Reviewer #2: Summary

Authors Liu and Xie create a machine learning algorithm, TransSynergy, that aims to identify synergistic drug combinations by using information derived from drug modulation of protein-protein interaction networks. The authors claim to outperform drug-drug synergy predictions from a model created by called DeepSynergy and demonstrated how their Shapley additive gene set enrichment analysis could be used to deconvolute the transynergy black box and identify pathways associated with synergistic drug combinations.

Review:

The approach the authors describe is quite novel. Encoding drugs not as chemical descriptors but as a weighted network of their protein protein interactions is very innovative; considering most other approaches that are used in this field utilize descriptors such as ECFP4 and etc. As the authors correctly point out, it may be difficult to explain causal relationships and is not indicative of drug action at a cellular level.

In terms of performance, the authors claim that TranSynergy along with certain features can outperform deepsynergy. However there are some issues, first the main text does not include the number of cross validation steps the authors performed. Second, deepsynergy includes several other benchmarks other than correlation coefficients, such as mean squared error, ROC AUC, precision, recall, etc. that the authors do not include in this analysis. Because of this, their approach is not an apples-to-apples comparison, as the authors claim. This issue is persistent through to figure 3 and table 4, which only includes correlation coefficients and does not include other metrics of classifier performance.

Next the authors attempt to describe the cell line specific performance of their approach in figure 3. It is unclear in the main text the approach they used to predict cell-line specific performance. Did they train transsynergy on all other cell line data and test on a new cell line, or did they take 80% of all cell line data, and they are showing the performance on the hold out set? If it’s the latter, then confidence intervals should be shown. It would also be important to show the number of samples available for each cell line. Perhaps the low performance on prostate tissues may be due to low training data available for prostate cells, or the model is overfit to other cell lines.

Finally the authors show that using their variant of GSEA can identify pathways involved in the PPI of drugs. What would be helpful here would be a comparison of their shapely method and typical GSEA results. And observing the differences between the top pathways associated with either.

Other general comments:

While the author’s approach for representing drugs as networks of their PPI is interesting, there might be important chemical properties that the authors might be missing by not incorporating chemical features.

The deepsynergy paper uses more robust cross validation strategies than presented in this paper. Why wasn’t those strategies used in this work?

There are numerous grammatical errors present throughout the paper.

Reviewer #3: This manuscript introduced the development of a deep learning-based method for cell line-specific synergy prediction for drug combinations. It also introduced the application of Shapley Additive Gene Set Enrichment Analysis that can deconvolute the prediction to determine the synergy-associated biological pathways. The major contributions are as follows:

A. Compared to the earlier work of DeepSynergy, this study has adopted unique input features for both drugs and cell lines. Specifically, drugs are not represented as chemical features of drugs themselves but as gene targets (with gene-gene network propagation) of drugs, which helps the post-hoc interpretability of the model. Cell lines are not represented as gene expressions but experimentally determined or computationally predicted gene dependencies of cell lines, which again helps model interpretability.

B. The deep learning model uses self attention-empowered transformer. A post-hoc interpretability analysis further reveals gene sets associated with predicted synergy.

Numerical experiments show that the resulting TranSynergy made improvement compared to DeepSynergy (Pearson's r increased from 0.73 to 0.77 and Spearman's rho increased from 0.69 to 0.75). Ablation studies show that both the cell line features and the self attention modules contributed to the improvement. TransSynergy was split-analyzed based on tissues, interpreted with the gene enrichment analysis, and prospectively applied to predict for 3650 novel samples.

Major comments:

1> The use of targets to featurize drugs, although helping model interpretability, limits the applicability of the model. Unlike DeepSynergy that only assumes chemical information of the drugs themselves, TranSynergy demands the drug-target association whereas such data is not always available to drugs or compounds. Meanwhile, just using the new compound features with neither new cell line features nor self attention (Ablation Study) led to similar performance as DeepSynergy.

2> Please include more details about what DeepSynergy model was used and how it was derived. For fair comparison, DeepSynergy is supposed to be re-trained using the same training set (the original study used cross validation instead) and the training procedures ought to be provided as well.

3> Is it possible to examine the generalizability of the model under various scenarios of the test set, including new drugs (one or both in the pair), new targets (one or both in the pair), and new cell lines not seen in the training set. Performances on a new dataset (for instance, part of DrugComb) with such scenario splits would be useful. If not, performances on the existing test set with such split could help as well.

4> The performance analyses on different tissues are very interesting and worthwhile further interpretation. To help understand model behaviour, what could have made predictions be high mean and high variance for ovarian cell lines whereas low(er) mean and low variance for prostate cell lines?

5> Interpretation of the model predictions.

a. Self attention was mentioned for encoding drug-target interactions on Page 9 but for representing gene-gene interactions on Page 11. Is there a conflict or the two statements are equal because drugs are represented as target genes (and others in biological networks)?

b. Moreover, why not use learned self-attentions to interpret the model behavior but use the post-hoc Shapley Additive Gene Set Enrichment Analysis? Or maybe self-attentions were indeed used (although not documented in the enrichment analysis part of the methods)?

c. The interpretability results emphasize "pathways" associated with the synergy. But the results seem more about genes. It is not clear that pathway information was used for associated pathways, after finding those genes. Also, conceptually, could any explanation be introduced about what pathways are associated with synergy? The same pathway for both drugs? Two overlapping or cross-talking or nonoverlapping pathways covering the same disease module (consistent with the cell line)? Or something else?

Minor comments:

6> Please include more details about the 3650 novel samples in the section of methods.

7> With the current readme of the github repo, it is hard to figure out the structure of many source code files laid at the same top level. In addition, to help achieve more impact of the work, please consider including in readme how to train the model with the codes and which data files are what.

8> There were light grammar issues.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Computational Biology data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

Figure Files:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Data Requirements:

Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example in PLOS Biology see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5.

Reproducibility:

To enhance the reproducibility of your results, PLOS recommends that you deposit laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions, please see http://journals.plos.org/compbiol/s/submission-guidelines#loc-materials-and-methods

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1008653.r003

Decision Letter 1

Feilim Mac Gabhann, Avner Schlessinger

21 Dec 2020

Dear Dr. Xie,

We are pleased to inform you that your manuscript 'TranSynergy: Mechanism-Driven Interpretable Deep Neural Network for the Synergistic Prediction and Pathway Deconvolution of Drug Combinations' has been provisionally accepted for publication in PLOS Computational Biology.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology. 

Best regards,

Avner Schlessinger

Associate Editor

PLOS Computational Biology

Feilim Mac Gabhann

Editor-in-Chief

PLOS Computational Biology

***********************************************************

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: The authors have addressed most of the concerns I had with the previous submission. I don't have any further comments

Reviewer #2: Authors sufficiently addressed my comments

Reviewer #3: The reviewer would like to thank the authors for the thorough revision. The additional experiments, analysis, clarification, discussion, and GitHub repo restructuring have adequately addressed the earlier comments.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Computational Biology data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1008653.r004

Acceptance letter

Feilim Mac Gabhann, Avner Schlessinger

2 Feb 2021

PCOMPBIOL-D-20-01357R1

TranSynergy: Mechanism-Driven Interpretable Deep Neural Network for the Synergistic Prediction and Pathway Deconvolution of Drug Combinations

Dear Dr Xie,

I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Alice Ellingham

PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Table. The hyperparameters of TranSynergy model with the best performance.

    (XLSX)

    S2 Table. Performance of TranSynergy integrated with Chemical Information (TranSynergyCI).

    (XLSX)

    S3 Table. Performance of TranSynergy integrated with Graphical Neural Fingerprint (TranSynergyGNF).

    (XLSX)

    S4 Table. Sample amounts in each cell line.

    (XLSX)

    S5 Table. Predicted drug pairs synergy score and z-scores.

    (XLSX)

    S6 Table. Grid search results of the restart rates α, a hyperparameter in random walk with restart for network propagated drug target features inference, over the validation set.

    (XLSX)

    S1 Fig. The architecture of TranSynergy integrated with Chemical Information (TranSynergyCI).

    Extra drug features, including ECFP, physico-chemical and toxicophore features were concatenated with the output of the Transformer and input into the last fully connected component. These chemical features are the same as those used in deepSynergy.

    (PDF)

    S2 Fig. The architecture of TranSynergy integrated with Graphical Neural Fingerprint (TranSynergyGNF).

    Extra drug features, graphical neural fingerprint for drug A and drug B extracted with a GCN layer, were concatenated with the output of the Transformer and input into the last fully connected component.

    (PDF)

    S3 Fig. Shapley Additive Gene Set Enrichment Analysis result of the prediction model for Etoposide and MK-8669 drug pair.

    The left panel shows the SHAP values of drug targets, while the right panel shows the SHAP values of 20 genes with the most significant impact.

    (PDF)

    S4 Fig. Shapley Additive Gene Set Enrichment Analysis result of the prediction model for Erlotinib and BEZ-235 drug pair.

    The left panel shows the SHAP values of drug targets, while the right panel shows the SHAP values of 20 genes with the most significant impact.

    (PDF)

    S5 Fig. Shapley Additive Gene Set Enrichment Analysis result of the prediction model for MK-4827 and BEZ-235 drug pair.

    The left panel shows the SHAP values of drug targets, while the right panel shows the SHAP values of 20 genes with the most significant impact.

    (PDF)

    S6 Fig. Shapley Additive Gene Set Enrichment Analysis result of the prediction model for L778123 and BEZ-235 drug pair.

    The left panel shows the SHAP values of drug targets, while the right panel shows the SHAP values of 20 genes with the most significant impact.

    (PDF)

    S7 Fig. Shapley Additive Gene Set Enrichment Analysis result of the prediction model for Paclitaxel and Dasatinib drug pair.

    The left panel shows the SHAP values of drug targets, while the right panel shows the SHAP values of 20 genes with the most significant impact.

    (PDF)

    S8 Fig. Shapley Additive Gene Set Enrichment Analysis result of the prediction model for BEZ-235 and Dinaciclib drug pair.

    The left panel shows the SHAP values of drug targets, while the right panel shows the SHAP values of 20 genes with the most significant impact.

    (PDF)

    S9 Fig. Shapley Additive Gene Set Enrichment Analysis result of the prediction model for Dasatinib and ABT-888 drug pair.

    The left panel shows the SHAP values of drug targets, while the right panel shows the SHAP values of 20 genes with the most significant impact.

    (PDF)

    Attachment

    Submitted filename: ResponseToReviewers.pdf

    Data Availability Statement

    All scripts and data are available from https://github.com/qiaoliuhub/drug_combination. Other relevant results data are within the manuscript and its Supporting Information files.


    Articles from PLoS Computational Biology are provided here courtesy of PLOS

    RESOURCES