Skip to main content
. 2021 Sep 27;13:152. doi: 10.1186/s13073-021-00968-x

Table 1.

Summary of deep learning methods, their relevant applications and brief technical descriptions of each DL model

Application DL method Reference Description
Microscopy-based assessment of cancer CNN

Ruy et al. [40]

Nir et al. [41]

Ström et al. [42]

Ehteshami Bejnordi et al. [43]

Vuong et al. [44]

El Achi and Khoury [45]

Trained CNNs on pathology images to predict grading of prostate [4042], breast [43], colon cancer [44] and lymphoma [45]
CNN & explainability Hägele et al. [46] LRP used to assigned feature contribution for cancer grade for each pixel of WSIs
Semantic segmentation Poojitha and Lal Sharma [47] A semantic segmentation technique called GAN was used to segment tissue maps for prostate cancer grade prediction
Molecular subtyping MLP DeepCC [48] Gene set enrichment analysis used to transform gene expression input into functional spectra
CNN

imCMS [49],

Sirinukuwattana et al. [50], Stalhammar et al. [51], Couture et al. [52]

Woerl et al. [53]

Models trained on histopathology images to classify molecular subtypes of of lung [49], colorectal [50], breast [51, 52] and bladder cancer [53]
GCNN Rhee et al. [18] Utilised a hybrid GCNN model to organise input gene expression profiles into STRING PPI network [16] and predict breast cancer molecular subtypes
Multimodal learning Islam et al. [54]

Two CNN models used to predict breast cancer molecular subtypes from CNAs and gene expression;

Outputs of the last fully connected layer of each model concatenated for a final subtype prediction

Cancer of unknown primary MLP Jiao et al. [55] Model trained to predict origins of 24 cancer types using somatic mutation patterns and driver genes
CNN

SCOPE [56],

CUP-AI-Dx [57]

Both studies trained models to predict different cancer types from gene expression
RNN & explainability TOAD [58]

RNN-based model called Attention was trained on WSIs to predict metastasis and cancer origin;

Attention algorithm reveal image regions contributing most to predictions were mostly cancer cells

Prognosis prediction MLP

Cox-nnet [59],

DeepSurv [60], RankedDeepSurv [61]

Cox regression used as the last layer of MLP models for prognosis prediction
MLP & AEs AECOX [62] AE used to “compress” gene expression into low-dimensional embedding vector and used as an input for Cox-regression
Explainability

PASNET [63],

Cox-PASNET [64]

A pathway layer used between the input and the hidden layers with each node representing a known pathway;

Analysis of weight differences in pathway layers reveal clinically actionable genetic traits

MesoNet [65]

Histopathology images split into tiles and scored by survival prediction contributions;

Scores used to identify top-contributing regions, reviewed by pathologists

GCNN & explainability Chereda et al. [19] Combine GCNN and explainability method LRP to identify biologically and therapeutically relevant genes in predicting metastasis of breast cancer
Explainability with multimodal learning PAGE-Net [66]

CNN used to compress features from WSIs;

Cox-PASNet used to incorporate gene pathway and provide cross-modal analysis with image features extracted by CNN

PathME [67]

AEs used to compress features from four omics modalities, which are combined to predict survival;

SHAP used to assign each omics feature survival prediction contribution score

Precision Oncology MLP HER2RNA [68]

Transcriptomic profiles inferred from histopathology images divided into tiles;

Predictions added up for all tiles and compared with ‘ground truth’ transcriptomic profiles

CNN Image2TMB [69]

Ensemble of three CNNs to extract features from histopathological images at different resolutions (x5, x10 and x20);

Extracted features are combined to infer TMB

Kather et al. [70] TCGA histopathology images used to predict mutational status of key genes, molecular subtypes and gene expression of standard biomarkers
Tumour microenvironment MLP Scaden [71]

Ensemble of three models with different filter sizes to predict TME composition from gene expression;

Predictions from the models are averaged into a final prediction

Explainability with MLP MethylNet [72]

MLP and AE used to ‘compress’ CpG beta values into an embedding vector for predicting TME composition;

SHAP used to assign feature contribution to each CpG site

Semantic segmentation Saltz et al. [20] Semantic segmentation model used on H&E images to localise spatial heterogeneity patterns of TIL and necrosis
Spatial transcriptomics CNN ST-Net [73]

Images split into tiles centred on spatial transcriptomics spots;

Tiles used to train a CNN to predict expression of 250 target genes

Pharmacogenomics CNN CDRscan [74]

Two models used to extract features from somatic mutational fingerprints and molecular profiles of drugs (cell lines);

Feature vectors combined to predict efficacy of drugs based on genomic profiles

MLP DeepSynergy [75]

Cell line gene expression and chemical features of drugs in drug combinations used as input;

Predicts ‘synergy score’ between the drug combinations and transcriptomic profiles

GCNN Jiang et al. [76] Utilised graph structure to integrate protein-protein, drug-drug and drug-protein interactions to predict synergistic drug combination for specific cell lines
Multimodal learning DeepDR [77] Collection of ten AEs to integrate ten drug-disease networks, which predict drug-disease associations
CNN DeepDTI [78] Protein sequence and drug fingerprint as input to predict drug protein-binding sites

AE: autoencoder, CNA: copy number alterations, CNN: convolutional neural network, DL: deep learning, GCNN: graph convolutional neural network, H&E: haematoxylin and eosin, LRP: layer-wise relevance propagation, MLP: multilayer perceptron, RNN: recurrent neural netowrk, SHAP: SHapley Additive exPlanations, TIL: tumour-infiltrating lymphocytes, TMB: tumour mutational burden, WSI: whole slide image