Abstract
Drug combinations have demonstrated great potential in cancer treatments. They alleviate drug resistance and improve therapeutic efficacy. The fast-growing number of anti-cancer drugs has caused the experimental investigation of all drug combinations to become costly and time-consuming. Computational techniques can improve the efficiency of drug combination screening. Despite recent advances in applying machine learning to synergistic drug combination prediction, several challenges remain. First, the performance of existing methods is suboptimal. There is still much space for improvement. Second, biological knowledge has not been fully incorporated into the model. Finally, many models are lack interpretability, limiting their clinical applications. To address these challenges, we have developed a knowledge-enabled and self-attention transformer boosted deep learning model, TranSynergy, which improves the performance and interpretability of synergistic drug combination prediction. TranSynergy is designed so that the cellular effect of drug actions can be explicitly modeled through cell-line gene dependency, gene-gene interaction, and genome-wide drug-target interaction. A novel Shapley Additive Gene Set Enrichment Analysis (SA-GSEA) method has been developed to deconvolute genes that contribute to the synergistic drug combination and improve model interpretability. Extensive benchmark studies demonstrate that TranSynergy outperforms the state-of-the-art method, suggesting the potential of mechanism-driven machine learning. Novel pathways that are associated with the synergistic combinations are revealed and supported by experimental evidences. They may provide new insights into identifying biomarkers for precision medicine and discovering new anti-cancer therapies. Several new synergistic drug combinations have been predicted with high confidence for ovarian cancer which has few treatment options. The code is available at https://github.com/qiaoliuhub/drug_combination.
Author summary
The number of anti-cancer drugs has been consistently and quickly growing. They are mainly used as standardized mono-therapy. Drug combinations show substantial advantages over the anti-cancer mono-therapy. Cancer cells treated with the mono-therapy could later activate bypassing pathways and harbor drug resistances. Drug combinations can alleviate this issue by using a smaller doses of each anti-cancer drug or targeting multiple oncogenic pathways. However, the investigation of all anti-cancer drug combinations using experimental methods is costly and time-consuming. Machine learning provides an attractive solution to screening synergistic drug combinations, but it is a black-box and not easy to explain. We have developed a knowledge-enabled deep learning model, TranSynergy, to predict synergistic drug combinations and have demonstrated that our model outperformed other state-of-the-art methods. A novel Shapley Additive Gene Set Enrichment Analysis (SA-GSEA) method is introduced to improve the interpretability of the machine learning model. Using TransSynergy and SA-GSEA, we can deconvolute genes responsible for the synergistic drug combination, suggesting the potential of machine learning in developing precision anti-cancer therapy.
Introduction
With the advance of understandings of cancer cell disorders, more and more anti-cancer drugs have been designed and are under investigation. However, the "one drug, one target" drug monotherapy suffers limited efficiency due to inherent or acquired resistance [1–3]. Drug combination therapy is a more effective strategy to solve this challenging problem [4–8]. In addition to cancer, synergistic drug combinations have several successful applications in the treatment of other diseases, such as AIDS [9,10], and fungal or bacterial infections [11–13]. Thus, the selection of efficient drug combination therapy for pathogens emerges as a compelling treatment strategy. Considering that the number of anti-cancer drugs has increased drastically, the possible combination of all these drugs has also become enormous [14,15]. The existing experimental method requires a large number of samples with different drug doses and cancer cells [16], thus it is infeasible to exhaust all the possible drug combinations. The computational method can be used to pre-select drug combinations with high synergy at lower cost and with more efficiency. The recent advancement of computational modeling, especially the deep learning technique, has dramatically increased the prediction power of computational models and has many promising applications in the biomedical field. The combination of computational and experimental methods can improve the effectiveness of the drug combination discovery.
The deep learning model has been shown to have superior performance to conventional machine learning algorithms in many biomedical applications [17,18]. High-quality experimental drug combination datasets are necessary for the success of deep learning. With the advancement of high throughput drug combination screening tests, the number of samples grows fast so that the data size limitation is considerably alleviated [19–23]. DeepSynergy is a state-of-the-art deep learning-based prediction model for the prediction of the synergistic drug combination. It has been trained using the dataset released by Merck [24]. In addition to suboptimal performance, the issue for this model is that the interpretation of the model is limited by the way adopted to represent drugs and cell lines as well as the model architecture. For instance, it is not easy to associate the contribution or feature importance of drug descriptors, including toxophores, physicochemical properties, and fingerprints, with the mechanism of drug action in cells using a feedforward neural network [25–27].
Recent studies have shown that gene-gene interaction and drug mechanism of action drew more and more attention in the synergistic drug combination study [28]. In addition, cell line drug sensitivity strongly depends on whether the drug directly or indirectly inhibits the essential gene of the cell line [22]. Thus, it is desirable to incorporate information from the gene-gene interaction network, gene dependency, and drug-target association into the computational model. To this end, we implemented a mechanism-driven and self-attention boosted deep learning model TranSynergy for the prediction of synergistic drug combinations and the deconvolution of cellular mechanisms contributing to them. We applied the random walk with restart algorithm (RWR) on a protein-protein interaction (PPI) network to infer a novel drug-target profile as the drug features. For the features of each cell line, we used gene expressions or gene dependencies profile. These mechanism related features make the model readily interpretable. Furthermore, we applied the self-attention transformer to encode the gene-gene interactions responsible for the synergistic drug combination. Attention mechanisms have been widely used in image processing and natural language processing [29–31] and has shown promise in the predictive modeling of nucleic acid sequences [32]. When combining the network propagated drug target profile, gene dependency, and gene expression, TranSynergy outperforms the state-of-the-art model. To reveal novel genes that are associated with the synergistic drug combination from the learned biological relations in the TranSynergy model, we developed a novel Shapley Additive Gene Set Enrichment Analysis (SA-GSEA) based on SHAP [33]. The revealed novel gene set may serve as a patient-specific biomarker for precision medicine or drug targets for discovering new cancer combination therapy. We further applied the model for the prediction of novel synergistic drug combinations targeting cancers that have few treatment options. Given the emergence of next-generation sequencing technology, the transcriptome of patient-derived cancer cells can be readily obtained [34]. The TranSynergy can be used to predict and interpret the synergistic drug combination in distinct patient-derived cancer cells. Our study shows the potential of a mechanism-driven interpretable machine learning model in the application of personalized cancer treatment.
Results
TranSynergy architecture
TranSynergy is a transformer boosted deep learning model for the prediction of drug combination synergy. It includes three major components, input dimension reduction component, self-attention transformer component, and output fully connected component (Fig 1). The input features are composed of three vectors. Each vector has 2401 dimensions, forming a 2401x3 matrix. The first two columns are the representations of two drugs. The third column is the representation of the cell lines. The drug feature is a drug-target interaction profile on 2401 selected genes. The cell line feature includes the gene expression or gene dependency of 2401 genes. It is worth mentioning that the number of columns changes to 4 when both gene expression and dependency are used for the representation of cells. In the matrix, each row corresponds to a gene or protein, and encodes the impact of drug on the gene. The input dimension reduction component is a single-layer neural network to reduce the dimension of input. The modified transformer component takes the output from the first component and includes a scaled dot product based self-attention mechanism module. Here, the self-attention is applied to model gene-gene interactions. It is also worth noting that we customized the transformer model by removing the positional encoding layer since the order input feature dimensions should be irrelevant to the final prediction. Then the final output of the predicted synergy score comes from a fully connected neural network. The hyperparameters used in each component and training process are listed in S1 Table.
The input of our deep learning model includes the vector representations of two drug molecules in the drug combination and a cell line that is treated by the drug combination. One popular strategy is to use the physicochemical properties, toxicophores, or extended connected fingerprints (ECFP) that are derived from the molecular structure of chemical compounds for the representation of drugs [24]. The disadvantage of using the chemical structure as the feature is that it is not straightforward to establish a relationship between the physicochemical properties of drugs and the cellular mechanism of drug action. Biological representation based on drug-target interaction profile is an alternative strategy to infer the drug representation vector [35]. Drug target information that is collected from databases, including DrugBank and ChEMBL [36,37], are mainly proteins that can directly associate with drugs. We also need to encode the effect of drugs on down-stream non-target proteins and the whole biological system. The protein-protein interaction network is utilized to infer the drug response of the non-target proteins considering that the protein-protein interaction mediates information transmission in the biological system. We apply the RWR algorithm to simulate this network propagation process (Fig 2). Compared with the chemical information-based approaches for drug representation, the target-based representation of drug molecules has the following advantages. Firstly, drug target information is closely related to the cellular response to the drug treatment at both the molecular level and system level. Secondly, it makes it possible to explain the model output, drug combination synergy, in terms of the contribution of each protein or gene.
Because drug-combination therapy has cell line-specific responses, another important component of inputs is the cell line vector representation. The DeepSynergy uses gene expression profile as the cell line vector representation [24]. We have applied a novel alternative strategy to infer cell line vector representation. The gene essentiality varies in the different cell lines and plays a critical role in anti-cancer drug sensitivity. Intuitively, drugs that affect essential proteins will cause the cell to have a more devastating response. Thus, we take gene essentiality or gene dependence into account and this information is one of the inputs into our model. This information has been collected by the BROAD institution using experimental methods [38]. They have performed a genome-wide loss-of-function screening with pooled RNAi or CRISPR library and have investigated the resulted cellular response.
TranSynergy outperforms state-of-the-art model and shows superior performance across different tissues
To evaluate the TranSynergy architecture on drug combination synergy score prediction, we tested our model in three different scenarios and compared it with the state-of-the-art model DeepSynergy.
We first assessed the model performance in the setting of leave drug combination out (Fig 3A). We performed a nested five-fold cross-validation using the same data for both models. We split the data into five folds based on drug combinations such that the drug combination in one fold would not appear in another fold. Four folds of data were used in the training and hyperparameter tuning stage and one fold was held out to test model performance. We performed the training/testing procedure five times with each fold held out. TranSynergy significantly outperformed the DeepSynergy model for the synergy score prediction (Table 1). The mean square error (MSE) of the TranSyerngy model was 232, significantly lower than that of DeepSynergy (p-value < 0.001). The accuracy improved by 3% and 5% when measured with Pearson’s correlation and Spearman’s correlation, respectively. We also showed that TranSynergy presented a superior performance in classification scenarios (Table 2). PR-AUC improved by 5.6% when comparing TranSynergy with DeepSynergy, although the improvement of ROC-AUC was by a small margin. It is noted that other genomic features of cell lines (mutations and CNVs) were not used in both TranSynergy and DeepSynergy. Considering that they could provide additional information relevant to the drug mode of action, especially for targeted therapy, it would be interesting to incorporate extra genomic features into TranSynergy in the future.
Table 1. Performance comparison of TranSynergy and DeepSynergy models in regression scenarios.
Model | Drug features | Cell line features | MSE | Spearman Correlation | Pearson Correlation |
---|---|---|---|---|---|
TranSynergy | Network propagated drug target profile | Gene dependency + Gene expression | 231 ± 21 | 0.730±0.016 | 0.746±0.018 |
DeepSynergy | Physicochemical properties, toxicophores and fingerprints | Gene expression | 243 ± 17 | 0.698±0.016 | 0.726±0.015 |
Statistical significance (p-value) | <0.0001 | <0.0001 | 0.0001 |
Table 2. Performance comparison of TranSynergy with different cell line features to DeepSynergy model in classification scenarios.
Models | Drug features | Cell line features | PR-AUC | ROC-AUC |
---|---|---|---|---|
TranSynergy | Network propagated drug target profile | Gene dependency | 0.625±0.020 | 0.908±0.011 |
Gene expression | 0.618±0.016 | 0.901±0.009 | ||
Gene expression + Gene dependency | 0.627±0.013 | 0.907±0.011 | ||
DeepSynergy | Physicochemical properties, toxicophores and fingerprints | Gene expression | 0.594±0.021 | 0.900±0.009 |
Statistical significance* (p-value) |
<0.0001 | 0.0134 |
*TranSynergy that used gene expression+Gene dependency cell line features was used to compute the statistical significance of performance difference between TranSynergy and DeepSynergy.
We further explored the performance of the TranSynergy model on 6 different tissues: colon, breast, melanoma, ovarian, prostate, and lung. The average Pearson correlation coefficient between ground truth synergy scores and predicted scores were 0.741 for colon cancer cells, 0.742 for breast cancer cells, 0.705 for melanoma, 0.788 for ovarian cancer cells, 0.660 for prostate cancer cells, and 0.738 for lung cancer cells (Fig 4A and 4B). The Pearson correlation coefficients of the TranSynergy across cell lines ranged from 0.616 to 0.858, and the Spearman correlation coefficients ranged from 0.592 to 0.863 for (Fig 4C and 4D). Compared with DeepSynergy, the superior performance of TranSynergy was consistent across tissues except for the prostate. The prostate tissue had the lowest Pearson correlation coefficient among all tissues included in our data. VCAP, which is the only prostate cancer cell line, had the third-lowest Pearson correlation coefficient and Spearman correlation coefficient across all cell lines. The mediocre performance of the VCAP cell line could be due to that it is not closely similar to other cell lines. To reflect relationships between cell lines, we performed the t-SNE analysis to visualize the high dimensional vector representation of cell lines mapped to a 2D space with the first two t-SNE components (Fig 5). It showed that the VCAP cell line is located on the edge of one cluster, which indicates the VCAP cell line might have distinctive features. We also noticed that five cell lines were isolated from the remaining cell lines in the t-SNE plot. They included T47D and OCUBM from the breast tissue, HCT116 from the colon tissue, as well as UWB1289 and OVCAR3 from ovarian tissue. The variances of these three tissues were higher than those of the other three tissues. These larger variances among cell line features within these three tissues might contribute to the observed larger performance variances.
We additionally tested our models in the most challenging leave cell out scenario (Fig 3B). We used an affinity propagation method to divide the cell lines into 4 clusters based on cell line features. Since 28 cell lines are in the biggest cluster, we used them as the training dataset and held out the remaining cell lines as the test dataset. In this way, the cell lines in the test dataset are the most different from those in the training dataset. All the cell lines have a similar amount of data (S4 Table). The leave cell out is a robust method to test the generalization power of the models. As shown in Tables 3 and 4, the performance of the TranSynergy is slightly better than DeepSynergy but both are lower than the first scenario.
Table 3. Performance of TranSynergy and DeepSyenrgy on leave cell out scenario and leave drug out scenario for the regression task.
Model | Leave cells out | Leave drugs out | ||
---|---|---|---|---|
Spearman Correlation | Pearson Correlation | Spearman Correlation | Pearson Correlation | |
TranSynergy | 0.552 ± 0.013 | 0.513 ± 0.019 | 0.477 ± 0.040 | 0.457 ± 0.031 |
DeepSynergy | 0.547 ± 0.010 | 0.501 ± 0.010 | 0.453 ± 0.025 | 0.437 ± 0.020 |
Statistical significance (p-value) | 0.0716 | 0.0013 | 0.0032 | 0.0001 |
Table 4. Performance of TranSynergy and DeepSyenrgy on leave cell out scenario and leave drug out scenario for the classification task.
Model | Leave cells out | Leave drugs out | ||
---|---|---|---|---|
ROC-AUC | PR-AUC | ROC-AUC | PR-AUC | |
TranSynergy | 0.808 ± 0.017 | 0.372 ± 0.022 | 0.778 ± 0.032 | 0.406 ± 0.106 |
DeepSynergy | 0.805 ± 0.017 | 0.360 ± 0.022 | 0.753 ± 0.030 | 0.378 ± 0.075 |
Statistical significance (p-value) | 0.3328 | 0.0908 | 0.0034 | 0.1379 |
Similar to the leave cell out scenario, we split the drugs into 5 clusters with the affinity propagation method in leave drugs out scenario (Fig 6). We chose drugs in one cluster and held out all drug pairs including those selected drugs as the test set (Fig 3C). The remaining drug pairs are the training/validation set. In this scenario, TranSynergy significantly outperformed DeepSynergy, but both were lower than the first two scenarios (Tables 3 and 4).
Mechanism-driven drug representations are critical for superior model performance
To investigate whether the gene-gene interaction network propagation step is essential for the model performance, we compared the TranSynergy model trained with only observed drug target information with that trained with network propagated drug target information. The observed drug target vector is a binary 2401-dimension vector that indicates whether a drug physically binds to the corresponding protein. The cell line vector representation used in this comparison is based on the cell line dependency. Models trained with the network propagated drug target vector representation showed superior performance in comparison to those with observed drug target information (Table 5). Given that the model excluding cell line features had inferior performance in comparison to other models with cell line features, we concluded that the cell line feature contributed to the synergy score prediction (Table 5). We then investigated three different representations for cell lines, gene dependency, gene expression, or gene dependency and gene expression. When we represented cell lines with gene dependency profile only or both gene dependency and gene expression profiles, the model exhibited superior performance to that representing cells with gene expression only (Table 5).
Table 5. Ablation study of TranSynergy models.
Drug features | Cell line features | MSE | Spearman Correlation | Pearson Correlation |
---|---|---|---|---|
Network propagated drug target profile | Gene dependency | 232 ± 21 | 0.725±0.016 | 0.746±0.017 |
Gene expression | 241 ± 19 | 0.721±0.021 | 0.741±0.025 | |
Gene dependencies + Gene expression | 231 ± 21 | 0.730±0.016 | 0.746±0.018 | |
None | 390 ± 11 | 0.505±0.007 | 0.400±0.012 | |
Observed drug target profile | Gene dependency | 305 ± 34 | 0.634±0.035 | 0.635±0.048 |
Gene expression | 307 ± 33 | 0.634±0.038 | 0.632±0.039 | |
Gene dependencies + Gene expression | 300 ± 31 | 0.645±0.036 | 0.634±0.038 |
Furthermore, we implemented two novel infrastructures, TranSynergyCI and TranSynergyGNF, to investigate whether the prediction could be improved further by integrating extra chemical information. TranSynergyCI united the physiochemical, toxophores, and fingerprint information of each drug. The difference between TranSynergy and TranSynergyCI is that the latter concatenated the aforementioned features with the output of the Transformer as the input of the last fully connected component (S1 Fig). It is worth mentioning that the DeepSynergy used the same features for the drug representation. TranSynergyGNF also integrated chemical information to the model input, however, it applied graph convolutional neural network (GCN) to extract graphical neural fingerprints from drug chemical structures (S2 Fig). More specifically, the GCN layer took drug’s atoms and bonds as inputs. Intuitively, the drug was represented as a graph structure comprised of atoms as the nodes of the graph, and bonds as the edges of the graph. The output of this GCN layer is used as the representation of each drug. The same with TranSyenrgyCI, the outputs of GCN layers are concatenated with the output of the transformer and served as the input of the last fully connected component. Both these two models showed the inferior performance to TranSynergy (S2 and S3 Tables). This illustrates adding extra chemical information may not be necessary to boost the performance any further in this experimental setup.
Shapley additive gene set enrichment analysis reveals novel oncogenic signatures associated with synergistic drug combinations
We propose the use of Shapley Additive Gene Set Enrichment Analysis (SA-GSEA) to determine the oncogenic signature and the underlying mechanisms associated with each synergistic drug combination (see methods for details). Shapley additive value is a powerful way to characterize the feature contribution to the final prediction in each instance [33]. Because each of the features in TranSynergy corresponds to a gene, the Shapley value essentially indicates the attribute of each gene to the synergy prediction. Moreover, genes could be ranked based on the Shapley value of each input gene feature. This makes it feasible to perform GSEA analysis. As an example, we explored seven samples, which had high synergistic scores and were accurately predicted (Table 6 and S3–S9 Figs). These seven samples are from three cell lines, T47D, CAOV3, and MSTO-211H. For the synergistic drug pairs in T47D, BEZ-235 is one of the notable drugs. The BEZ-235 can inhibit PI3K/AKT/mTOR pathway, which is dysregulated in breast cancer [39]. In the SA-GSEA of cell line gene dependencies profile, the RAF oncogenic signature is significantly enriched. It has been established that the RAS/RAF/MEK/ERK pathway and PI3K/AKT/mTOR pathway are closely interconnected components and form feedback loops in breast cancer [40]. This may also suggest the underlying mechanism following the inhibition of the PI3K/AKT/mTOR pathway by BEZ-235 is related to the RAS/RAF/MEK/ERK pathway. The drug combination of ETOPOSIDE and MK-8669 is synergistic in the CAOV3 cancer cell line. ETOPOSIDE targets TOP2A and TOP2B, two topoisomerase components. The inhibition of them was believed to cause a DNA double-strand break [41]. MK-8669 targets mTOR, a crucial component in the PI3K/AKT/mTOR pathway. TUBB also shows surprising high importance in the SA-GSEA of MK-8669 drug targets (S3 Fig). This indicates that inhibition of mTOR or TUBB in combination with DNA damage can have a synergistic effect. In the SA-GSEA of cell line features, the JAK2 oncogenic signature is enriched in the ranked gene list based on their Shapley values. As suggested in other studies, the JAK2 oncogenic signature implies a novel pathway affected by this drug combination therapy [42]. For synergistic drug pairs in the MSTO-211H cell line, one of the drugs is DASATINIB. The DASATINIB targets BCRABL, SRC, Ephrins, and GFR. For the combination of PACLITAXEL and DASATINIB in the MSTO-211H cell line, the PACLITAXEL targets tubulin and microtubule-associated proteins. From the SA-GSEA of cell line genes, genes in the ESR1 and BMI1 oncogenic signature gene sets are top-ranked (S7 Fig) [43]. BMI1 was shown to be involved in the DNA-damage-repair process [44]. The inhibition of these pathways combining the restrain on mitosis can have high anti-tumor activities in Mesothelioma cancer [45]. For the combination of DASATINIB and ABT-888 in MSTO-211H, the SNF5 pathway is ranked on the top (S9 Fig). It is worth mentioning that both SNF5 and PARP1/2 play key roles in the DNA repair process [46,47].
Table 6. The most important oncogenic signatures revealed by SA-GSEA.
Drug combination (drug 1, drug 2) | Cell line | Targets with high SHAP values | The most enriched gene set in the cell line | |
---|---|---|---|---|
Drug 1 | Drug 2 | |||
BEZ-235, ERLOTINIB | T47D | Targets in PI3K/AKT/mTOR pathway | EGFR | oncogenic signature for RAF overexpressed cells |
BEZ-235, MK-4827 |
T47D | Targets in PI3K/AKT/mTOR pathway | PARP1/2 | oncogenic signature for RAF overexpressed cells |
BEZ-235, L-778123 |
T47D | Targets in PI3K/AKT/mTOR pathway | FPTase/ GGPTase | oncogenic signature for RAF overexpressed cells |
BEZ-235, DINACICLIB | T47D | Targets in PI3K/AKT/mTOR pathway | CDKs | oncogenic signature for RAF overexpressed cells |
ETOPOSID, MK-8669 | CAOV3 | TOP2A/TOP2B | mTOR | oncogenic signature for JAK2 knock-down cells |
PACLITAXEL, DASATINIB | MSTO-211H | Tubulin/microtubule associated proteins | BCRABL, SRC, Ephrins and GFR | oncogenic signature for ESR1- cells |
ABT-888, DASATINIB | MSTO-211H | PARP1/2 | BCRABL, SRC, Ephrins and GFR | oncogenic signature for SNF5 know-down cells |
Novel drug combination prediction
Although the majority of drug combinations composed of the drugs available in this study have been explored experimentally [20], there were still 3650 novel drug pairs that had yet to be tested. For these 3650 novel samples, Table 7 lists the top 10 pairs which have higher cell line-wise z-scores (S5 Table). It is noted that ETOPOSIDE, which targets TOP2A and TOP2B, exists in most drug pairs that are synergistic on the CAOV3 cancer cell line. It is consistent with the results of SA-GSEA in which the target combination of TOP2A, TOP2B, and TUBB shows high feature importance. Two samples show synergistic effects on OV90, another ovarian cancer cells. For ETOPOSIDE and VINORELBINE combination, drugs also target TOP2A, TOP2B, and TUBB. For PACLITAXEL and VINORELBINE, both drugs target tubulin or microtubule-associated proteins. DEXAMETHASONE exists in synergistic drug combinations in three cell lines, OCUBM, SKMES1, and KPL1.
Table 7. Examples of predicted synergistic novel drug pairs.
Drug combination | Cell line | Predicted Synergy score | Z-score |
---|---|---|---|
DEXAMETHASONE and ETOPOSIDE | CAOV3 | 94.520215 | 3.07259459 |
ETOPOSIDE and METFORMIN | CAOV3 | 70.556332 | 2.21009253 |
ETOPOSIDE and SN-38 | CAOV3 | 69.380475 | 2.16777138 |
DEXAMETHASONE and VINBLASTINE | OCUBM | 38.6095235 | 1.94930657 |
DEXAMETHASONE and PACLITAXEL | SKMES1 | 37.9697725 | 1.91635083 |
ETOPOSIDE and VINORELBINE | OV90 | 33.844834 | 1.89400207 |
PACLITAXEL and VINORELBINE | OV90 | 32.5451685 | 1.81387084 |
ETOPOSIDE and VINBLASTINE | CAOV3 | 56.799048 | 1.71494381 |
DEXAMETHASONE and VINBLASTINE | KPL1 | 37.8084735 | 1.6825975 |
ETOPOSIDE and PACLITAXEL | CAOV3 | 53.1252895 | 1.58271882 |
Discussion
In this study, we have presented a novel deep learning model TranSynergy for the synergy score prediction and mechanism deconvolution of drug combination cancer therapy. We have demonstrated that the network propagated drug target profile, which indicated both drug-target interaction and drug effect on non-targeted proteins, was crucial for the comprehensive representation of drug features. The use of drug target information as drug features in the model might cause limitation when novel drugs, whose molecular targets are unknown, are involved in the drug pair. However, this limitation can be partially overcome by using recently developed methods for the accurate prediction of drug targets, e.g. [48]. In addition, drugs used in the drug combinations are mainly existing drugs whose targets are largely known, at least, it is true in the dataset used in this study. Because the drug target provides crucial information on the mechanism of drug action, it not only improves the accuracy of the machine learning model for the prediction of drug combination synergy but also facilitates the model interpretation. We also show that gene essentiality in the cancer cells is a more desirable cell line representation than raw gene expression profile. Due to the limited data size, we only selected the minimum number of genes for the representation of drugs and cell lines, including only drug targets and annotated cancer-related genes. When too many input features are used, the model could suffer overfitting problems during the training stage due to the curse of dimensionality [49,50]. The performance can be further improved when more data are available or by utilizing unlabeled data sets so that more genes can be included in the representations. Furthermore, we demonstrated that the performance in cold-start problems, including leave cell out and drug out scenarios, is relatively lower than the leave drug combination out scenario. We believe that the relatively lower performance in the cold-start setting could be improved by applying pretraining strategies.
With more and more high throughput drug combination screening datasets becoming available, we have to figure out the inconsistency problem in both the experimental and quantification methods for the determination of drug combination effects. Firstly, the combinational spaces for the drug doses used to generate the drug dose matrix diversify in different studies. Secondly, several distinguished methods were proposed to calculate the expected drug combination effect from experimental data, such as combination index (CI)-isobologram equation [51–53], Bliss independence (BI) method [54–56], and Loewe Additivity (LA) model [57,58]. The calculated drug synergy scores are not the same when different quantification methods are utilized. To further improve the data quality, it is necessary to develop new methods to harmonize different data sets.
Deep learning-based computational models have made promising breakthroughs in many biomedical areas. Interpretation of deep learning models becomes critical to overcoming the skepticism of it being a black-box [59]. Recently, many explainable AI methods have been proposed, such as input perturbation methods [60,61], backpropagation based methods [62], and the calculation of SHAP values [33]. Attention-based approaches have been proposed to interpret the models using attention mechanisms. However, previous work has shown that the attention weights learned from self-attentions may not provide a meaningful explanation for the final prediction [63]. As a result, we carefully design the input features so that each feature dimension is corresponding to a gene and is easily interpretable. Through examining the SHAP values of gene-wise input feature, we extract the information regarding the effect of drug-target interactions and gene-gene interactions on the cancer cell response. Considering that evidence has shown synergistic effects could be attributed to either pathway cross-talking [64], nonoverlapping pathways [28], or same pathways, the explanation of drug combination synergy score learned with our model could come from either of instances. More conclusive statements for the underlying mechanisms might require additional experimental evidence, which is out of our work’s scope. However, we suggest that this potentially provides a method to generate testable hypothesis for studying the underlying mechanism of the drug combinations therapy.
Drug combinations can be a more efficient therapeutic strategy for cancer by targeting multiple proteins to defer the rapid emergence of drug resistance. The exploration of effective and synergistic drug combinations is hindered by the costly and time-consuming experimental preclinical investigation. Computational methods can be a cheaper and faster alternative approach to facilitate the development of drug combination therapy for cancer patients [65]. Nowadays, more emphasis is put on personalized medicine, which requires the consideration of the heterogeneity of each patient’s cancer types and genomics information to find more efficient therapy. Given a large amount of data for patients’ genome information, the development of an accurate and interpretable computational model is critical for the realization of personalized medicine. Mechanism-driven machine learning as demonstrated in this study is a promising direction to address challenges in precision medicine of combination therapy.
Methods
Drug combination synergy score dataset
The large-scale drug combination screening dataset was initially published by Merck [20] and preprocessed to calculate the synergy scores [24]. The screening test was performed with 38 drugs and 39 cancer cell lines. In total, 583 pairs of drug combinations were investigated and 23062 data points were collected. Among them, two drugs lack target information, and the gene dependency and gene expression data of four cells are not available. These drugs and cell lines were excluded from our data set. We finally selected 36 drugs that targeted at least one protein and 35 cell lines. The final dataset has 18553 data points and 523 pairs of drug combinations.
Drug representation
The observed drug target profile was collected from two datasets, Drugbank and ChEMBL, for the 36 drugs [36,37]. The observed drug target matrix is a 36*2401 binary matrix that indicates whether a drug targets a protein. The observed drug target profile was processed with the RWR algorithm to obtain a novel network propagated drug targets profile. We solved the RWR problem with the Fast RWR methods [66]. Following is the formal equation:
α is a hyperparameter equal to 1—restart rate. To avoid information leaking and overfitting, we performed nested cross-validation for hyperparameter tuning and model selection. In each iteration, the data were split into training/validation/test sets. After the model was fitted using the training data set, optimal hyperparameter α in RWR was selected by the grid search over the validation set. The grid search determined the optimal α = 0.5 (S6 Table) over the validation set. W is the transition matrix denoting the transition probability between nodes. We used the protein-protein interaction network matrix from STRING as the transition probability matrix [67]. The edge weight between nodes is the protein-protein association confidence score. e is the seed vector, a drug target binary vector for each drug in this Eq 1, denotes that the drug is targeting the protein. r is the final probability distribution of each node in the network. Intuitively, r[i] denotes the effect of the drug on each protein. Drug’s atoms and edges information were extracted with rdkit from the drug’s SMILES string. We then used the multi-hot encoder method to represent each node and edge. The final representation includes 62 atoms and 6 bonds information.
Cell line representation
Gene expression profiles were downloaded from Harmonizome [68] and were initially collected in Broad Institute Cancer Cell Line Encyclopedia (CCLE) and Genomics of Drug Sensitivity in Cancer (GDSC) [69–71]. Gene dependencies profile dataset is a combined dataset from the Broad Institute Project Achilles [72–74] and Sanger CRISPR data from Wellcome Trust Sanger Institute and Broad Institute[75,76]. The dataset was downloaded from the DepMap portal [38]. The linear regression imputation method was used to fill in the missing value in the dataset with the MICE package [77].
Model architecture
Transformer
The transformer has been widely used and shown promising performance in many different applications, including natural language processing and image processing [29–31]. It includes two major components, encoder and decoder. The input for both encoder and decoder has three matrices, query, key and values. Both encoder and decoder contain many sublayers. Each sublayer is comprised of three stages, attention mechanism, add & norm stage, and feed forward stage. The add & norm stage contains a residual structure and a layer normalization structure. The attention mechanism is the module to encode the interaction of different features with the following equation:
where Q, K, and V are the query, key and, values matrices, d is the dimension of hidden vector representation and softmax is the activation function.
Graph convolutional neural network
We applied the graph convolutional network (GCN) module to extract drug neural fingerprint in our TranSynergyGNF model (S2 Fig). Each drug is parsed as a graph, consisting of nodes that are the chemical atoms, and edges that are the chemical bonds. The GCN was used to extract and integrate the information of nodes and edges in each node’s neighborhood. The output hidden representation can be interpreted as a representation of chemical substructure within the multi-hop distances from each node. The output of each GCN layer can be calculated with the following equation:
where A is the adjacency matrix for the graph structure, I is an identity matrix, D is the diagonal node degree matrix of A+I, H is the hidden representation from the previous layer, W is the learnable parameter matrix and softmax is the activation function. To construct the hidden representation of the whole drug, we concatenate the hidden representation for each node to a single vector.
Model evaluation
Model evaluations were performed in three scenarios. a) In the leave drug combination out scenario, we applied the nested cross-validation method. The data was split into five folds such that the drug pair in one fold does not overlap with the drug pairs in other folds (Fig 3A). Because the drug combination of drug A with drug B and drug B with drug A should have the same drug combination synergy score, the size of training data was doubled by swapping the drug A and drug B. The model was iteratively trained/validated in four folds and then tested in the remaining one fold. b) We further tested our models in the most challenging leave cell out scenario. We used affinity propagation methods to group the cell lines into 4 clusters based on cell line features similarity (Fig 3B). Since 28 cell lines are in the biggest cluster, we used them as the training/validation dataset and held out the remaining cell lines in other clusters as the test dataset (Fig 5). In this way, the cell lines in the testing set are significantly different from those in the training/validation set. c) In the leave drug out scenario, we grouped the drugs into 5 clusters with the affinity propagation method based on their structural similarities (Figs 3C and 6). We chose drugs in one cluster and held out all drug pairs including these three drugs as the test set. The remaining drug pairs are the training/validation set. We performed three tests with keeping out the drugs in each of the three smallest clusters. In each test, the ratio of samples in the training and the testing data is around 4:1.
In all the above scenarios, the MSE was used as the training loss. We then investigated the Spearman correlation and Pearson correlation between predicted data and ground-truths. ROC-AUC and PR-AUC were also used to evaluate the model performance for the classification task. Drug pairs with synergy scores larger than 30 were classified as positive pairs and those with lower synergy scores were classified as negative pairs.
2.5 Shapley additive gene set enrichment analysis
We used the GradientExplainer and DeepExplainer in the SHAP package to calculate Shapley value that characterizes the contribution of each input feature to the final prediction [33]. We used k-means to summarize the total dataset as the background dataset. The final Shapley value of each input feature was the average value of 10 tests for each sample data. We then ranked genes based on the Shapley values for the gene-wise features of cell line representation and conducted gene set enrichment analysis to unveil the enriched gene sets with GSEA [78,79].
Supporting information
Data Availability
All scripts and data are available from https://github.com/qiaoliuhub/drug_combination. Other relevant results data are within the manuscript and its Supporting Information files.
Funding Statement
This work was supported by Grant Number R01GM122845 from the National Institute of General Medical Sciences (NIGMS, to LX) and Grant Number R01AD057555 of National Institute on Aging of the National Institute of Health (NIH, to LX and QL) as well as CUNY High Performance Computing Center (to LX). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Mansoori B, Mohammadi A, Davudian S, Shirjang S, Baradaran B. The Different Mechanisms of Cancer Drug Resistance: A Brief Review. Adv Pharm Bull. 2017;7(3):339–48. Epub 2017/10/27. 10.15171/apb.2017.041 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Housman G, Byler S, Heerboth S, Lapinska K, Longacre M, Snyder N, et al. Drug resistance in cancer: an overview. Cancers (Basel). 2014;6(3):1769–92. Epub 2014/09/10. 10.3390/cancers6031769 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Nikolaou M, Pavlopoulou A, Georgakilas AG, Kyrodimos E. The challenge of drug resistance in cancer treatment: a current overview. Clin Exp Metastasis. 2018;35(4):309–18. Epub 2018/05/26. 10.1007/s10585-018-9903-0 . [DOI] [PubMed] [Google Scholar]
- 4.Bayat Mokhtari R, Homayouni TS, Baluch N, Morgatskaya E, Kumar S, Das B, et al. Combination therapy in combating cancer. Oncotarget. 2017;8(23):38022–43. Epub 2017/04/15. 10.18632/oncotarget.16723 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Raghavendra NM, Pingili D, Kadasi S, Mettu A, Prasad S. Dual or multi-targeting inhibitors: The next generation anticancer agents. Eur J Med Chem. 2018;143:1277–300. Epub 2017/11/12. 10.1016/j.ejmech.2017.10.021 . [DOI] [PubMed] [Google Scholar]
- 6.Ramsay RR, Popovic-Nikolic MR, Nikolic K, Uliassi E, Bolognesi ML. A perspective on multi-target drug discovery and design for complex diseases. Clin Transl Med. 2018;7(1):3. Epub 2018/01/18. 10.1186/s40169-017-0181-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Fitzgerald JB, Schoeberl B, Nielsen UB, Sorger PK. Systems biology and combination therapy in the quest for clinical efficacy. Nat Chem Biol. 2006;2(9):458–66. Epub 2006/08/22. 10.1038/nchembio817 . [DOI] [PubMed] [Google Scholar]
- 8.Lehar J, Krueger AS, Avery W, Heilbut AM, Johansen LM, Price ER, et al. Synergistic drug combinations tend to improve therapeutically relevant selectivity. Nat Biotechnol. 2009;27(7):659–66. Epub 2009/07/08. 10.1038/nbt.1549 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Henkel J. Attacking AIDS with a ’cocktail’ therapy? FDA Consum. 1999;33(4):12–7. Epub 1999/08/12. . [PubMed] [Google Scholar]
- 10.Murphy EM, Jimenez HR, Smith SM. Current clinical treatments of AIDS. Adv Pharmacol. 2008;56:27–73. Epub 2007/12/19. 10.1016/S1054-3589(07)56002-3 . [DOI] [PubMed] [Google Scholar]
- 11.Chen X, Ren B, Chen M, Wang Q, Zhang L, Yan G. NLLSS: Predicting Synergistic Drug Combinations Based on Semi-supervised Learning. PLoS Comput Biol. 2016;12(7):e1004975. Epub 2016/07/16. 10.1371/journal.pcbi.1004975 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Groll AH, Tragiannidis A. Recent advances in antifungal prevention and treatment. Semin Hematol. 2009;46(3):212–29. Epub 2009/06/25. 10.1053/j.seminhematol.2009.03.003 . [DOI] [PubMed] [Google Scholar]
- 13.Tamma PD, Cosgrove SE, Maragakis LL. Combination therapy for treatment of infections with gram-negative bacteria. Clin Microbiol Rev. 2012;25(3):450–70. Epub 2012/07/06. 10.1128/CMR.05041-11 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Ali R, Mirza Z, Ashraf GM, Kamal MA, Ansari SA, Damanhouri GA, et al. New anticancer agents: recent developments in tumor therapy. Anticancer Res. 2012;32(7):2999–3005. Epub 2012/07/04. . [PubMed] [Google Scholar]
- 15.Falzone L, Salomone S, Libra M. Evolution of Cancer Pharmacological Treatments at the Turn of the Third Millennium. Front Pharmacol. 2018;9:1300. Epub 2018/11/30. 10.3389/fphar.2018.01300 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Mott BT, Eastman RT, Guha R, Sherlach KS, Siriwardana A, Shinn P, et al. High-throughput matrix screening identifies synergistic and antagonistic antimalarial drug combinations. Sci Rep. 2015;5:13891. Epub 2015/09/26. 10.1038/srep13891 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Mater AC, Coote ML. Deep Learning in Chemistry. J Chem Inf Model. 2019;59(6):2545–59. Epub 2019/06/14. 10.1021/acs.jcim.9b00266 . [DOI] [PubMed] [Google Scholar]
- 18.Kalinin AA, Higgins GA, Reamaroon N, Soroushmehr S, Allyn-Feuer A, Dinov ID, et al. Deep learning in pharmacogenomics: from gene regulation to patient stratification. Pharmacogenomics. 2018;19(7):629–50. Epub 2018/04/27. 10.2217/pgs-2018-0008 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Bulusu KC, Guha R, Mason DJ, Lewis RP, Muratov E, Kalantar Motamedi Y, et al. Modelling of compound combination effects and applications to efficacy and toxicity: state-of-the-art, challenges and perspectives. Drug Discov Today. 2016;21(2):225–38. Epub 2015/09/12. 10.1016/j.drudis.2015.09.003 . [DOI] [PubMed] [Google Scholar]
- 20.O’Neil J, Benita Y, Feldman I, Chenard M, Roberts B, Liu Y, et al. An Unbiased Oncology Compound Screen to Identify Novel Combination Strategies. Mol Cancer Ther. 2016;15(6):1155–62. Epub 2016/03/18. 10.1158/1535-7163.MCT-15-0843 . [DOI] [PubMed] [Google Scholar]
- 21.Li H, Li T, Quang D, Guan Y. Network Propagation Predicts Drug Synergy in Cancers. Cancer Res. 2018;78(18):5446–57. Epub 2018/07/29. 10.1158/0008-5472.CAN-18-0740 . [DOI] [PubMed] [Google Scholar]
- 22.Menden MP, Wang D, Mason MJ, Szalai B, Bulusu KC, Guan Y, et al. Community assessment to advance computational prediction of cancer drug combinations in a pharmacogenomic screen. Nat Commun. 2019;10(1):2674. Epub 2019/06/19. 10.1038/s41467-019-09799-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Holbeck SL, Camalier R, Crowell JA, Govindharajulu JP, Hollingshead M, Anderson LW, et al. The National Cancer Institute ALMANAC: A Comprehensive Screening Resource for the Detection of Anticancer Drug Pairs with Enhanced Therapeutic Activity. Cancer Res. 2017;77(13):3564–76. Epub 2017/04/28. 10.1158/0008-5472.CAN-17-0489 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Preuer K, Lewis RPI, Hochreiter S, Bender A, Bulusu KC, Klambauer G. DeepSynergy: predicting anti-cancer drug synergy with Deep Learning. Bioinformatics. 2018;34(9):1538–46. Epub 2017/12/19. 10.1093/bioinformatics/btx806 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Challen R, Denny J, Pitt M, Gompels L, Edwards T, Tsaneva-Atanasova K. Artificial intelligence, bias and clinical safety. BMJ Qual Saf. 2019;28(3):231–7. Epub 2019/01/14. 10.1136/bmjqs-2018-008370 PubMed Central PMCID: PMC6560460. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Benitez JM, Castro JL, Requena I. Are artificial neural networks black boxes? IEEE Trans Neural Netw. 1997;8(5):1156–64. Epub 1997/01/01. 10.1109/72.623216 . [DOI] [PubMed] [Google Scholar]
- 27.Castelvecchi D. Can we open the black box of AI? Nature. 2016;538(7623):20–3. Epub 2016/10/07. 10.1038/538020a . [DOI] [PubMed] [Google Scholar]
- 28.Cheng F, Kovacs IA, Barabasi AL. Network-based prediction of drug combinations. Nat Commun. 2019;10(1):1197. Epub 2019/03/15. 10.1038/s41467-019-09186-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, et al. Attention Is All You Need. 2017. [Google Scholar]
- 30.Luong M-T, Pham H, Manning CD. Effective Approaches to Attention-based Neural Machine Translation. 2015. [Google Scholar]
- 31.Bahdanau D, Cho K, Bengio Y. Neural Machine Translation by Jointly Learning to Align and Translate. 2014. [Google Scholar]
- 32.Liu Q, He D, Xie L. Prediction of off-target specificity and cell-specific fitness of CRISPR-Cas System using attention boosted deep learning and network-based gene feature. PLoS Comput Biol. 2019;15(10):e1007480. Epub 2019/10/29. 10.1371/journal.pcbi.1007480 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Lundberg S, Lee S-I. A Unified Approach to Interpreting Model Predictions. 2017. [Google Scholar]
- 34.Buermans HP, den Dunnen JT. Next generation sequencing technology: Advances and applications. Biochim Biophys Acta. 2014;1842(10):1932–41. Epub 2014/07/06. 10.1016/j.bbadis.2014.06.015 . [DOI] [PubMed] [Google Scholar]
- 35.Ayed M, Lim H, Xie L. Biological representation of chemicals using latent target interaction profile. BMC Bioinformatics. 2019;20(Suppl 24):674. Epub 2019/12/22. 10.1186/s12859-019-3241-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Wishart DS, Feunang YD, Guo AC, Lo EJ, Marcu A, Grant JR, et al. DrugBank 5.0: a major update to the DrugBank database for 2018. Nucleic Acids Res. 2018;46(D1):D1074–D82. Epub 2017/11/11. 10.1093/nar/gkx1037 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Gaulton A, Hersey A, Nowotka M, Bento AP, Chambers J, Mendez D, et al. The ChEMBL database in 2017. Nucleic Acids Res. 2017;45(D1):D945–D54. Epub 2016/12/03. 10.1093/nar/gkw1074 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.McFarland JM, Ho ZV, Kugener G, Dempster JM, Montgomery PG, Bryan JG, et al. Improved estimation of cancer dependencies from large-scale RNAi screens using model-based normalization and data integration. Nat Commun. 2018;9(1):4610. Epub 2018/11/06. 10.1038/s41467-018-06916-5 PubMed Central PMCID: PMC6214982. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Verret B, Cortes J, Bachelot T, Andre F, Arnedos M. Efficacy of PI3K inhibitors in advanced breast cancer. Ann Oncol. 2019;30(Suppl_10):x12–x20. Epub 2019/12/21. 10.1093/annonc/mdz381 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Saini KS, Loi S, de Azambuja E, Metzger-Filho O, Saini ML, Ignatiadis M, et al. Targeting the PI3K/AKT/mTOR and Raf/MEK/ERK pathways in the treatment of breast cancer. Cancer Treat Rev. 2013;39(8):935–46. Epub 2013/05/07. 10.1016/j.ctrv.2013.03.009 . [DOI] [PubMed] [Google Scholar]
- 41.Nitiss JL. Targeting DNA topoisomerase II in cancer chemotherapy. Nat Rev Cancer. 2009;9(5):338–50. Epub 2009/04/21. 10.1038/nrc2607 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Bartalucci N, Tozzi L, Bogani C, Martinelli S, Rotunno G, Villeval JL, et al. Co-targeting the PI3K/mTOR and JAK2 signalling pathways produces synergistic activity against myeloproliferative neoplasms. J Cell Mol Med. 2013;17(11):1385–96. Epub 2013/11/19. 10.1111/jcmm.12162 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Bhattacharyya J, Mihara K, Yasunaga S, Tanaka H, Hoshi M, Takihara Y, et al. BMI-1 expression is enhanced through transcriptional and posttranscriptional regulation during the progression of chronic myeloid leukemia. Ann Hematol. 2009;88(4):333–40. Epub 2008/09/11. 10.1007/s00277-008-0603-8 . [DOI] [PubMed] [Google Scholar]
- 44.Ginjala V, Nacerddine K, Kulkarni A, Oza J, Hill SJ, Yao M, et al. BMI1 is recruited to DNA breaks and contributes to DNA damage-induced H2A ubiquitination and repair. Mol Cell Biol. 2011;31(10):1972–82. Epub 2011/03/09. 10.1128/MCB.00981-10 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Sekido Y. Molecular pathogenesis of malignant mesothelioma. Carcinogenesis. 2013;34(7):1413–9. Epub 2013/05/17. 10.1093/carcin/bgt166 . [DOI] [PubMed] [Google Scholar]
- 46.Ribeiro-Silva C, Vermeulen W, Lans H. SWI/SNF: Complex complexes in genome stability and cancer. DNA Repair (Amst). 2019;77:87–95. Epub 2019/03/22. 10.1016/j.dnarep.2019.03.007 . [DOI] [PubMed] [Google Scholar]
- 47.Javle M, Curtin NJ. The role of PARP in DNA repair and its therapeutic exploitation. Br J Cancer. 2011;105(8):1114–22. Epub 2011/10/13. 10.1038/bjc.2011.382 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Cai T, Lim H, Abbu KA, Qiu Y, Nussinov R, Xie L. A deep learning framework for elucidating whole-genome chemical interaction space. bioRxiv. 2020:2020.10.08.332346. 10.1101/2020.10.08.332346 [DOI] [Google Scholar]
- 49.Clarke R, Ressom HW, Wang A, Xuan J, Liu MC, Gehan EA, et al. The properties of high-dimensional data spaces: implications for exploring gene and protein expression data. Nat Rev Cancer. 2008;8(1):37–49. Epub 2007/12/22. 10.1038/nrc2294 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Danaee P, Ghaeini R, Hendrix DA. A Deep Learning Approach for Cancer Detection and Relevant Gene Identification. Pac Symp Biocomput. 2017;22:219–29. Epub 2016/11/30. 10.1142/9789813207813_0022 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Chou T-C, Talalay P. Analysis of combined drug effects: a new look at a very old problem. Trends in Pharmacological Sciences. 1983;4:450–4. 10.1016/0165-6147(83)90490-X. [DOI] [Google Scholar]
- 52.Chou TC, Talalay P. Quantitative analysis of dose-effect relationships: the combined effects of multiple drugs or enzyme inhibitors. Adv Enzyme Regul. 1984;22:27–55. Epub 1984/01/01. 10.1016/0065-2571(84)90007-4 . [DOI] [PubMed] [Google Scholar]
- 53.Chou TC. Drug combination studies and their synergy quantification using the Chou-Talalay method. Cancer Res. 2010;70(2):440–6. Epub 2010/01/14. 10.1158/0008-5472.CAN-09-1947 . [DOI] [PubMed] [Google Scholar]
- 54.Bliss CI. THE TOXICITY OF POISONS APPLIED JOINTLY1. Annals of Applied Biology. 1939;26(3):585–615. 10.1111/j.1744-7348.1939.tb06990.x [DOI] [Google Scholar]
- 55.Berenbaum MC. A method for testing for synergy with any number of agents. J Infect Dis. 1978;137(2):122–30. Epub 1978/02/01. 10.1093/infdis/137.2.122 . [DOI] [PubMed] [Google Scholar]
- 56.Greco WR, Bravo G, Parsons JC. The search for synergy: a critical review from a response surface perspective. Pharmacol Rev. 1995;47(2):331–85. Epub 1995/06/01. . [PubMed] [Google Scholar]
- 57.Loewe S, Muischnek H. Über Kombinationswirkungen. Naunyn-Schmiedebergs Archiv für experimentelle Pathologie und Pharmakologie. 1926;114(5):313–26. 10.1007/BF01952257 [DOI] [Google Scholar]
- 58.Loewe S. The problem of synergism and antagonism of combined drugs. Arzneimittelforschung. 1953;3(6):285–90. Epub 1953/06/01. . [PubMed] [Google Scholar]
- 59.Ching T, Himmelstein DS, Beaulieu-Jones BK, Kalinin AA, Do BT, Way GP, et al. Opportunities and obstacles for deep learning in biology and medicine. J R Soc Interface. 2018;15(141). Epub 2018/04/06. 10.1098/rsif.2017.0387 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Heaton J, McElwee S, Fraley J, Cannady J, editors. Early stabilizing feature importance for TensorFlow deep neural networks. 2017 International Joint Conference on Neural Networks (IJCNN); 2017 14–19 May 2017.
- 61.Zhou J, Troyanskaya OG. Predicting effects of noncoding variants with deep learning-based sequence model. Nat Methods. 2015;12(10):931–4. Epub 2015/08/25. 10.1038/nmeth.3547 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Springenberg JT, Dosovitskiy A, Brox T, Riedmiller M. Striving for Simplicity: The All Convolutional Net. 2014. [Google Scholar]
- 63.Jain S, Wallace BC. Attention is not Explanation. 2019. [Google Scholar]
- 64.Jaeger S, Igea A, Arroyo R, Alcalde V, Canovas B, Orozco M, et al. Quantification of Pathway Cross-talk Reveals Novel Synergistic Drug Combinations for Breast Cancer. Cancer Res. 2017;77(2):459–69. Epub 2016/11/24. 10.1158/0008-5472.CAN-16-0097 . [DOI] [PubMed] [Google Scholar]
- 65.Yin Z, Deng Z, Zhao W, Cao Z. Searching Synergistic Dose Combinations for Anticancer Drugs. Front Pharmacol. 2018;9:535. Epub 2018/06/07. 10.3389/fphar.2018.00535 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Tong H, Faloutsos C, Pan J, editors. Fast Random Walk with Restart and Its Applications. Sixth International Conference on Data Mining (ICDM’06); 2006 18–22 Dec. 2006.
- 67.Szklarczyk D, Gable AL, Lyon D, Junge A, Wyder S, Huerta-Cepas J, et al. STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res. 2019;47(D1):D607–D13. Epub 2018/11/27. 10.1093/nar/gky1131 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Rouillard AD, Gundersen GW, Fernandez NF, Wang Z, Monteiro CD, McDermott MG, et al. The harmonizome: a collection of processed datasets gathered to serve and mine knowledge about genes and proteins. Database (Oxford). 2016;2016. Epub 2016/07/05. 10.1093/database/baw100 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Barretina J, Caponigro G, Stransky N, Venkatesan K, Margolin AA, Kim S, et al. The Cancer Cell Line Encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature. 2012;483(7391):603–7. Epub 2012/03/31. 10.1038/nature11003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Yang W, Soares J, Greninger P, Edelman EJ, Lightfoot H, Forbes S, et al. Genomics of Drug Sensitivity in Cancer (GDSC): a resource for therapeutic biomarker discovery in cancer cells. Nucleic Acids Res. 2013;41(Database issue):D955–61. Epub 2012/11/28. 10.1093/nar/gks1111 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Iorio F, Knijnenburg TA, Vis DJ, Bignell GR, Menden MP, Schubert M, et al. A Landscape of Pharmacogenomic Interactions in Cancer. Cell. 2016;166(3):740–54. Epub 2016/07/12. 10.1016/j.cell.2016.06.017 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Dempster JM, Rossen J, Kazachkova M, Pan J, Kugener G, Root DE, et al. Extracting Biological Insights from the Project Achilles Genome-Scale CRISPR Screens in Cancer Cell Lines. bioRxiv. 2019:720243. 10.1101/720243 [DOI] [Google Scholar]
- 73.Broad D. DepMap 20Q1 Public2020.
- 74.Meyers RM, Bryan JG, McFarland JM, Weir BA, Sizemore AE, Xu H, et al. Computational correction of copy number effect improves specificity of CRISPR-Cas9 essentiality screens in cancer cells. Nat Genet. 2017;49(12):1779–84. Epub 2017/10/31. 10.1038/ng.3984 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Broad D. Project SCORE processed with CERES2019.
- 76.Behan FM, Iorio F, Picco G, Goncalves E, Beaver CM, Migliardi G, et al. Prioritization of cancer therapeutic targets using CRISPR-Cas9 screens. Nature. 2019;568(7753):511–6. Epub 2019/04/12. 10.1038/s41586-019-1103-9 . [DOI] [PubMed] [Google Scholar]
- 77.van Buuren S, Groothuis-Oudshoorn K. mice: Multivariate Imputation by Chained Equations in R. 2011. 2011;45(3):67. Epub 2011-12-12. 10.18637/jss.v045.i03 [DOI] [Google Scholar]
- 78.Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A. 2005;102(43):15545–50. Epub 2005/10/04. 10.1073/pnas.0506580102 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Liberzon A, Subramanian A, Pinchback R, Thorvaldsdottir H, Tamayo P, Mesirov JP. Molecular signatures database (MSigDB) 3.0. Bioinformatics. 2011;27(12):1739–40. Epub 2011/05/07. 10.1093/bioinformatics/btr260 [DOI] [PMC free article] [PubMed] [Google Scholar]