Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2020 Sep 1.
Published in final edited form as: Cancer Immunol Res. 2019 Dec 23;8(3):396–408. doi: 10.1158/2326-6066.CIR-19-0464

High-throughput prediction of MHC class I and class II neoantigens with MHCnuggets

Xiaoshan M Shao 1,2,*, Rohit Bhattacharya 1,3,*, Justin Huang 1,3,*, IK Ashok Sivakumar 1,3,4,*, Collin Tokheim 1,2, Lily Zheng 1,5, Dylan Hirsch 1,2, Benjamin Kaminow 1,6, Ashton Omdahl 1,2, Maria Bonsack 7,8,9, Angelika B Riemer 7,8, Victor E Velculescu 1,5,10, Valsamo Anagnostou 10, Kymberleigh A Pagel 1,2, Rachel Karchin 1,2,10,
PMCID: PMC7056596  NIHMSID: NIHMS1547573  PMID: 31871119

Abstract

Computational prediction of binding between neoantigen peptides and major histocompatibility complex (MHC) proteins can be used to predict patient response to cancer immunotherapy. Current neoantigen predictors focus on in silico estimation of MHC binding affinity and are limited by low predictive value for actual peptide presentation, inadequate support for rare MHC alleles and poor scalability to high-throughput data sets. To address these limitations, we developed MHCnuggets, a deep neural network method that predicts peptide-MHC binding. MHCnuggets can predict binding for common or rare alleles of MHC class I or II with a single neural network architecture. Using a long short-term memory network (LSTM), MHCnuggets accepts peptides of variable length and is faster than other methods. When compared to methods that integrate binding affinity and MHC-bound peptide (HLAp) data from mass spectrometry, MHCnuggets yields a fourfold increase in positive predictive value on independent HLAp data. We applied MHCnuggets to 26 cancer types in TCGA, processing 26.3 million allele-peptide comparisons in under 2.3 hours, yielding 101,326 unique candidate immunogenic missense mutations (IMMs). Predicted IMM hotspots occurred in 38 genes, including 24 driver genes. Predicted IMM load was significantly associated with increased immune cell infiltration (P < 2 × 10−16) including CD8+ T cells. Only 0.16% of predicted immunogenic missense mutations were observed in more than two patients, with 61.7% of these derived from driver mutations. Thus, we describe a method for neoantigen prediction and its performance characteristics and demonstrate its utility in data sets representing multiple human cancers.

Introduction:

The presentation of peptides bound to major histocompatibility complex (MHC) proteins on the surface of antigen-presenting cells and subsequent recognition by T-cell receptors is fundamental to the mammalian adaptive immune system. Neoantigens derived from somatic mutations are targets of immunoediting and drive therapeutic responses in cancer patients treated with immunotherapy (1,2). Because experimental characterization of neoantigens is both costly and time-consuming, computational methods have been developed to predict peptide-MHC binding and subsequent immune response (3,4). Supervised neural network machine learning approaches have performed the best (57) and are the most widely used in silico methods. Despite advances in computational approaches, improvements in predictive performance have been minimal, due in part to a lack of sufficiently large sets of experimentally characterized peptide binding affinities for most MHC alleles.

Although neoantigen prediction for common MHC class I alleles is well studied (8), predictive accuracy on rare and less characterized MHC alleles remains poor (9,10). class II predictors are scarce(11). Current estimates suggest that class II antigen lengths primarily range from 13–25 amino acids (12), and this diversity has been an obstacle to developing in silico neoantigen predictors (11,13). As most neural network architectures are designed for fixed-length inputs, methods such as NetMHC (1417) and MHCflurry (18) require pre-processing of peptide sequences or training of separate classifiers for each peptide length.

Clinical application of MHC-peptide binding predictors, to identify biomarkers for cancer immunotherapy, requires scalability to large patient cohorts and low false positive rates (19). A cancer may contain hundreds of candidate somatically altered peptides, but few will actually bind to MHC proteins and elicit an immune response (20). For many years, most neoantigen predictors were trained primarily on quantitative peptide-HLA binding affinity data from in vitro experiments (21). Advances in immunopeptidomics technologies have enabled identification of thousands of naturally presented MHC bound peptides (HLAp) from cancer patient samples and cell lines (22) (19). Several neoantigen predictors are trained only on HLAp data for class I, and only for a limited number of peptide lengths (21) (23). The EDGE neural network is trained primarily on multi-allelic HLAp and RNAseq data from 74 cancer patients; ForestMHC is a random forest trained on HLAp from publicly available mono-allelic and deconvoluted multi-allelic cell lines. The potential to improve neoantigen predictors by integrating binding affinity and HLAp data (19) has motivated hybrid approaches (14,18). However, most methods predict more candidate neoantigens than are actually immunogenic in patients (11,19).

Here we present a long short-term memory (LSTM) neural network method, MHCnuggets, a neoantigen predictor designed for MHC class I and II alleles in a single framework. The method leverages transfer learning and allele clustering to accommodate both common, well-characterized MHC alleles and rare, less-studied alleles. Although existing computational neoantigen predictors generate a ranked list of candidate peptides, maximizing the number of predictions that identify immunogenic peptides would be preferred in many applications (18). We demonstrate that MHCnuggets’ predictive performance is competitive with widely-used methods on binding affinity benchmark datasets. In comparison to hybrid methods that integrate binding affinity and HLAp data, MHCnuggets shows fewer false positives and increased positive predictive value in a held-out cell line data set of ligands identified by mass spectrometry (7,22). To demonstrate the clinical utility and applicability of MHCnuggets to large patient cohorts, we investigated candidate immunogenic mutations from 26 tumor types in The Cancer Genome Atlas (TCGA). MHCnuggets yielded 101,326 candidate immunogenic missense mutations, observed in at least one individual (out of 1,124,266) in less than 2.3 hours. These mutations were correlated with increased lymphocyte infiltration, however only 0.16% were observed in more than 2 patients.

Methods:

Implementation

MHCnuggets uses a long short-term memory (LSTM) neural network architecture (24) (Fig. 1A). LSTM architectures excel at handling variable length sequence inputs and can learn long-term dependencies between non-contiguous elements, enabling an input encoding that does not require peptide shortening or splitting (Fig. 1B). LSTMs are capable of handling peptides of any length. In practice, a maximum peptide length should be selected for network training. We set maximum peptide input length of 15 for class I and 30 for class II, for computational efficiency purposes. These values cover the majority of lengths observed in naturally presented MHC bound peptides (12). The networks were trained with transfer learning (25), which allows networks for less well-characterized alleles to leverage information from extensively studied alleles (Fig. 1C). Transfer learning was also used to train networks combining binding affinity and HLAp datasets. In addition, MHCnuggets architectures can be trained using either continuous binding affinity measurements from in vitro experiments (half maximal affinity or IC50) and/or immunopeptidomic (HLAp) binary labels. The former utilizes a mean-squared error (MSE) loss, whereas the latter utilizes binary cross-entropy (BCE) loss for training. For each MHC allele, we trained a neural network model consisting of an LSTM layer of 64 hidden units, a fully connected layer of 64 hidden units, and a final output layer of a single sigmoid unit (Fig. 1A).

Figure 1. A) MHCnuggets’ architecture.

Figure 1.

A network is trained for each MHC allele. Each network has a LSTM layer with 64 hidden units, a Fully Connected (FC) layer with 64 hidden units and a final output layer of a single sigmoid unit. B) Input scheme for peptides with variable lengths. MHCnuggets architecture is capable of handling peptides of any length, but in practice a maximum length should be selected. Peptides are extended with padding until they reach the maximum length, prior to input into the neural network. The example shows padding for class II peptides with maximum length set to 30 amino acids. C) Transfer learning protocol for parameter sharing among alleles. A base allele-specific network is trained for each MHC class, with an allele selected by largest number of training examples. Transfer learning is applied to train networks for the remaining alleles with initial network weights set to final base network weights. A fine-tuning step identifies alleles that can be leveraged for a second round of transfer learning to produce a final network (Methods).

For the 16 alleles where allele-specific HLAp training data was available (26), we trained networks on both binding affinity and HLAp data (MHCnuggets). Next, we trained networks only with binding affinity measurements (MHCnuggets without mass spectrometry data or noMS) for all MHC class I alleles. Due to the lack of allelic-specific HLAp training data for class II, all MHC class II networks were trained only on binding affinity measurements. In total, we trained 148 class I and 136 class II allele-specific networks. Common alleles comprise a small fraction (< 1%) of all known MHC alleles (27). To handle binding predictions for rare alleles, MHCnuggets selects a network by searching for the closest allele, based on previously published supertype clustering approaches. We prioritized approaches based on binding pocket biochemical similarity when available. Briefly, HLA-A and HLA-B alleles were clustered by MHC binding pocket amino acid residue composition (28), and HLA-C and all MHC II alleles were hierarchically clustered based upon experimental mass spectrometry and binding assay results (29,30). For alleles with no supertype classification, the closest allele was from the same HLA gene, and allele group if available, with preference for alleles with the largest number of characterized binding peptides. All networks were implemented with the Keras Python package (TensorFlow back-end) (31,32). Open source software is available at https://github.com/KarchinLab/mhcnuggets, installable via pip or Docker, and has been integrated into the PepVacSeq (33), pvactools (34) and Neoepiscope (35) pipelines.

Transformation of peptide binding affinities.

Predicted binding affinity can be transformed into a range of values well suited for neural network learning by selecting a logarithmic base to match the weakest binding affinity of interest (36). For most benchmarks in this work, we used the standard upper limit of 50,000 nM, so that predicted binding affinity was

y=max(0,1log50k(IC50))

For the Bonsack et al. dataset (8), the upper limit was changed to 100,000 nM because in their experiments, as described in O’Donnell et al. (18), binders were defined as peptides with IC50<100,000 nM. As binding affinity was determined based on in vitro HLA binding-competition vs. a known strong binder (reported IC50 <50nM) experimental IC50 values were in μM range.

Selection of final network weights.

To minimize overfitting, network training was stopped after 100 epochs but if the best PPVn was reached earlier, network weights from that earlier epoch were used in the final network. Notably, although we chose to optimize the networks on PPVn, an alternative approach could optimize on auROC, Kendall’s tau or Pearson’s r correlation. For the two alleles in IEDB with the most training examples in their respective class, HLA-A*02:01 for Class I and HLA-DRB2*01 for Class II, training was stopped after 200 epochs.

Network training.

Mean-squared error loss LMSE was used to train networks with continuous-valued binding affinity data and binary cross-entropy loss LBCE for binary HLAp data. For a dataset with n samples,

LMSE(y^,y) =1ni=1n(y(i)y^(i))2
LBCE(y^,y)=1ni=1ny(i)log(y^(i))+(1y(i))log(1y^(i))

All training used backpropagation with the Adam optimizer (37) and learning rate of 0.001. Regularization was performed with dropout and recurrent dropout (38) probabilities of 0.2. The number of hidden units, dropout rate, and number of training epochs was estimated by three-fold cross-validation on MHC Class I A*02:01, a common allele with a large number of experimentally characterized binding peptides.

One-hot encoding:

Peptides were represented to the network as a series of amino acids; each amino acid was represented as a 21-dimensional smoothed, one-hot encoded vector (0.9 and 0.005 replace 1 and 0, respectively).

Peptide padding:

MHCnuggets’ architecture is capable of handling peptides of any length, but in practice a maximum length should be selected, which in this work was 15 for class I and 30 for class II). Peptides that are less than the maximum length are padded at the end with a character (“Z” which is not in the amino acid alphabet) until they reach the maximum length.

Transfer Learning Protocol for binding affinity data only.

We used transfer learning to improve network learning for MHC alleles with limited characterized peptides available for training. We first trained base allele-specific networks for class I and class II, using alleles with the most training examples in IEDB (HLA-A*02:01 for class I and HLA-DRB2*01 for class II). For all other alleles, the final weights of the base network for its respective class were used to initialize network training, and then an allele-specific network was trained for each allele. Next, we assessed prediction performance of each allele-specific network on the training examples for each of the alleles. For each allele, if the network that performed best was not the HLA-A*02:01 network (for class I alleles) or HLA-DRB1*01:01 network (for class II alleles), we did a second round of training, with the best performing network’s weights used in the initialization step.

Transfer Learning Protocol for binding affinity and HLAp data.

To integrate HLAp data into the class I networks we initially trained each network with binding affinity data as described above, transferred the final weights to a new network, and then continued training with the HLAp data as positive examples augmented with random peptide decoys as negative examples.

Performance Assessment

To accurately assess the performance of MHCnuggets on a variety of MHC-peptide binding prediction tasks, we utilized six benchmark sets: MHC class I alleles, MHC class II alleles, common alleles with a trained model (allele-specific prediction) and rare alleles (pan-allele prediction) (Fig. 2, Supplementary Table S1). To compare to the HLA ligand prediction tools from the NetMHC group (NetMHC3.0, NetMHC 4.0, NetMHCpan2.0, NetMHCpan 4.0) (16,17), which can be trained only by their developers, as well as the open source MHCflurry tools, we employed multiple benchmarking strategies: 1) independent benchmark test set of peptides not included as training data for any of the methods; 2) a previously published paired training/testing benchmark; 3) five-fold cross-validation benchmark; 4) leave-one-molecule-out (LOMO) benchmark.

Figure 2. MHCnuggets’ features.

Figure 2.

A) Venn diagram representation of the MHC-peptide binding prediction functions of MHCnuggets and similar tools. B) Training and MHC allele model selection scheme for MHCnuggets.

We evaluated six MHC class I predictors on independent binding affinity and HLAp datasets (7,8,22). First, we compared MHCnuggets to several class I predictors that incorporate both binding affinity and HLAp data: MHCflurry 1.2.0, MHCflurry (train-MS), NetMHC 4.0, and NetMHCpan 4.0. Each method was benchmarked using an independent set of MHC-bound peptides identified by mass spectrometry across seven cell-lines for six MHC I alleles. For testing, HLAp hits were combined with random decoy peptides sampled from the human proteome in a 1:999 hit-decoy ratio, as described by Abelin et al. (26), totaling 23,971,000 peptides. Next, four MHC class I predictors trained only on binding affinity data (MHCnuggets (noMS) and MHCflurry (noMS), NetMHC 3.0 and NetMHCpan 2.0) were evaluated with the Kim et al. dataset (5), in which each predictor was trained with the BD2009 data and tested on BLIND data. It was possible to compare NetMHC3.0 and NetMHCpan2.0 performance on Kim et al., because they have previously published predicted IC50s for all peptide-MHC pairs in BLIND. This allowed us to calculate their PPVn, area under the ROC curve (auROC), Kendall’s tau, and Pearson’s r correlations.

Next, we compared MHCnuggets’ class II ligand prediction performance with self-reported performance statistics of NetMHC group’s MHC class II methods (39). We used the Jensen et al. five-fold cross-validation benchmark to assess allele-specific MHC class II prediction of MHCnuggets and NetMHCII 2.3, for 27 alleles. NetMHCII 2.3 reported the average auROC for five-fold cross-validation, and we report MHCnugget’s positive predictive value for each of the 27 alleles as well as the average auROC, Pearson’s r and Kendall-Tau correlations.

The leave-one-molecule-out (LOMO) benchmarks are a type of cross-validation designed to estimate the performance of peptide binding prediction with respect to rare MHC alleles. Given training data for n MHC alleles, the data for a single allele is held out and networks are trained for the remaining n-1 alleles. Then for each peptide, predictions are generated by the remaining networks. We designed a LOMO benchmark to evaluate MHC class I rare allele prediction, by selecting 20 alleles with 30 to 100 characterized peptides in the Immune Epitope Database (IEDB) (40). For class II rare allele prediction, we used the Jensen et al. (39) LOMO benchmark. We were unable to assess rare allele prediction for NetMHC class I methods, as no published results were available. For the NetMHC class II methods, we compared MHCnuggets to their self-reported auROCs.

Performance metrics.

We calculated positive predictive value with respect to the top-ranked n peptides, where n is the number of true binders in the ranked list, denoted as PPVn. PPV = NTP / (NTP+NFP), where NTP=number of true positives and NFP=number of false positives. We calculated PPV with respect to the top-ranked n peptides, where n is the number of true binders in the ranked list, denoted as PPVn. For the BST benchmark, we also calculated PPV over the top 50 and 500 ranked peptides.

Dataset collection and curation

Data sources for network training and testing, TCGA somatic mutations, TCGA tumor gene expression, and haplotype calling are shown in Supplementary Table S1 and Supplementary Table S2. A curated version of the IEDB database 2018 (40) and the sixteen class I monoallelic B-cell line immunopeptidomes (26) was provided by Tim O’Donnell (https://data.mendeley.com/datasets/8pz43nvvxh/2), binding affinity assays of HPV-derived peptides were provided by Maria Bonsack and Angelika Riemer (8), BST = immunopeptidomes from six cell lines with multiallelic MHCs (26 MHC class I alleles) (22)and from soluble HLA(sHLA)-transfected HeLa cells separated by allele (4 MHC class I alleles) (7). Decoy random peptides were sampled from the human proteome.

Training Data Sets.

The networks were trained with data from a curated version of the Immune Epitope database (IEDB 2018) (40), containing chemical binding affinity measurements for 241,553 peptide-allele pairs covering 217 class I alleles and 96,211 peptide-allele pairs covering 135 class II alleles. Additional training data consisted of 16 class I mon-allelic B-cell line immunopeptidomes (26). The immunopeptidome data is limited to HLAp binders and was supplemented by decoy random peptides sampled from the human proteome (https://data.mendeley.com/datasets/8pz43nvvxh/2).

Benchmark Data Sets.

Kim et al: This benchmark contained 53 MHC Class I alleles and 137,654 IC50 measurements published prior to 2009 (training set) and 53 unique MHC class I alleles with 26,888 IC50 measurements, published from 2009–2013 (test set). Three alleles (HLA-B*27:03, HLA-B*38:01, HLA-B*08:03) did not contain sufficient training data, and two alleles (HLA-A*46:01, and HLA-B*27:03) did not contain any peptides defined as binders in this work (IC50 < 500 nM). Therefore, a total of four alleles (HLA-A*46:01,HLA-B*27:03, HLA-B*38:01, and HLA-B*08:03) were dropped from the analysis. All peptides in this benchmark set consisted of 8–11 amino acid residues.

Bonsack et al.

This dataset contains 475 synthetic peptides derived from model protein sequences HPV16 E6 and E7 tested for binding to 7 alleles (HLA-A*01:01, HLA-A*02:01, HLA-A*03:01, HLA-A*11:01, HLA-A*24:02, HLA-B*07:02 and HLA-B*15:01). Each peptide was tested in competition-based cellular binding assays with a known high-affinity fluorescein-labeled reference peptide. EBV-transformed B-lymphoblastic cells were stripped of their naturally-bound peptides and mixed with serially diluted test peptides and 150 nM of reference peptide. Each synthetic peptide was tested at 8 different concentrations ranging from 780 nM to 100,000 nM. Mixture fluorescence at each synthetic peptide concentration was measured with flow cytometry, and a nonlinear regression analysis was used to find the test peptide concentration that inhibited 50% of the reference peptide binding (IC50). Peptides were classified as binders (IC50 ≤ 100,000 nM) or nonbinders (IC50 > 100,000 nM). Peptides in this independent benchmark set do not have IEDB entries.

Bassani-Sternberg et al. 2015.

This dataset contains 22,598 unique peptides eluted from 6 cell lines with multi-allelic MHCs. Out of the total 6 cell lines, a total of 26 alleles were reported. For each multiallelic cell line, peptide/MHC pairs were found through deconvolution, following the protocol described by (26), with the difference that we used MHCnuggets rather than NetMHCpan2.8 (41) to predict IC50 values for each peptide-MHC pair. For each cell line, each peptide was initially assigned as a binder to all expressed alleles. Then, for each allele, we filtered out any peptide predicted to bind with IC50 > 1000 nM to that allele, and with IC50 < 150 nM to any other allele. Peptides found for 6 alleles (HLA-A*01:01, HLA-A*02:01, HLA-A*03:01, HLA-A*24:02, HLA-A*31:01, and HLA-B*51:01), were selected for allele-specific prediction testing. Trained networks were available for these alleles from all the methods that we compared.

Trolle et al.

This dataset contains 15,524 unique peptides eluted from soluble HLA(sHLA) transfected HeLa cells, a process that allowed for separating binding peptides to a single MHC allele. This dataset reports peptides for 5 MHC alleles. Peptides found for 4 alleles (HLA-A*01:01, HLA-A*02:01, HLA-A24*02, HLA-B*51:01) were selected for testing. Peptide lengths in this dataset range from 8–15 amino acid residues.

BST.

This benchmark consists of 23,971 HLAp hits for 6 alleles, from Bassani-Sternberg et al. 2015 (22) and Trolle et al. (42) plus 23,947,029 random decoy peptides sampled from the human proteome. Any peptides found to overlap with the training HLAp data (26) were removed.

Jensen et al.

This benchmark was designed to assess both allele-specific and rare MHC class II binding affinity predictors. Allele-specific prediction was tested with a five-fold cross validation experiment on peptides found in IEDB in 2016 but not 2013. Rare allele predictions were tested with the LOMO protocol.

IEDB Class I rare alleles.

This dataset was designed to apply the LOMO protocol to class I alleles. It included 20 “pseudo-rare” alleles with 30–100 binding affinity peptide measurements in IEDB.

All datasets used in this work are available at http://dx.doi.org/10.17632/8c26kkrfpr.2.

TCGA analysis pipeline

To assess candidate immunogenic somatic mutations in patients from the TCGA cohort, we developed and implemented a basic pipeline based on whole-exome and RNA sequencing data (Supplementary Fig. S1). Our analysis builds upon work from the TCGA PanCancer Analysis teams for drivers (43), mutation calling (44) and cancer immune landscapes (45). We obtained somatic mutation calls for all cancer types from Multi-Center Mutation Calling in Multiple Cancers (MC3) (v0.2.8) (7775 patients). Tumor-specific RNA expression values from Broad TCGA Firehose were standardized across tumor types using the RSEM Z-score (46). MHC allele calls were obtained from the TCGA cancer immune landscape publication, in which up to six MHC class I alleles (HLA-A, HLA-B, and HLA-C) were identified for each patient using OptiType (47) (Supplementary Table S2). We included patients for which mutation calls, MHC allele calls and RNA expression values were available from TCGA. After these considerations, the analysis included 6613 patients from 26 TCGA tumor types. Six cancer types were not included in our analysis, because 15 or fewer patients met this requirement: lymphoid neoplasm diffuse large B-cell lymphoma (DLBC), esophageal carcinoma (ESCA), mesothelioma (MESO), skin cutaneous melanoma (SKCM), stomach adenocarcinoma (STAD), ovarian serous cystadenocarcinoma (OV).

The somatic missense mutations identified in each patient were filtered to include only those with strong evidence of mutant gene RNA expression in that patient (Z > 1.0). For each mutation that passed this filter, we used the transcript assigned by MC3 to pull flanking amino acid residues from the SwissProt database (48), yielding a 21 amino acid residue sequence fragment centered at the mutated residue. All candidate peptides of length 8, 9, 10 and 11 that included the mutated residue were extracted from each sequence fragment. Next binding affinity predictions were generated for each mutated peptide for up to six MHC class I alleles, depending on the patient’s HLA genotypes. In total, each somatic mutation was represented by 38 mutated peptides for up to 6 possible MHC pairings.

We applied a permissive filter to select candidate immunogenic peptides, requiring mutated peptides to have binding affinity of IC50 < 500 nM for at least one MHC allele. Somatic missense mutations that generated neoantigens meeting these criteria were considered candidate immunogenic missense mutations (IMMs). For a given patient, if a mutation was predicted to be a candidate IMM for multiple alleles, it was counted only once using the MHC allele with the lowest predicted IC50. Finally, for each patient we counted the number of predicted IMMs found in their exome and stratified by tumor type. We then identified predicted IMMs that were harbored by more than one patient.

We sought to ascertain whether predicted IMMs occurred preferentially in particular gene or protein regions. Using the HotMaps 1D algorithm v1.2.2 (49), we clustered primary amino acid residue sequence to identify regions where mutations were frequently predicted as IMM, with statistical significance (q < 0.01, Benjamini-Hochberg method (50)). In this analysis, mutations were stratified by cancer type, and we considered enrichment within linear regions of 50 amino acid residues.

We considered that mutation immunogenicity might be associated with potential driver status of a mutation. Driver status was inferred by CHASMplus (51), a random forest classifier that utilizes a multi-faceted feature set to predict driver missense mutations and is effective at identifying both common and rare driver mutations. For each mutation, immunogenicity was represented as a binary response variable and driver status was used as a covariate. Mutations with CHASMplus q-value < 0.01 were considered drivers (51). We modeled the relationship with univariate logistic regression (R glm package with binomial link logit function).

To assess whether the total number of predicted IMMs per patient was associated with changes in tumor immune infiltrates, we performed Poisson regression (R glm package with Poisson link log function). All estimates of immune infiltrates were obtained from Thorsson et al. (45,52). We fit two univariate models in which the response variable was the predicted IMM count and the covariate was either total leukocyte fraction or fraction of CD8+ T-cells.

MC3 mutation filtering:

MC3 TCGA somatic mutation calls were filtered for missense mutations.

Regression models:

We applied two univariate Poisson regression models. In the first model, each patient’s predicted immunogenic missense mutation load was the response variable and the independent variable X was the total leukocyte fraction. The fitted coefficient β=0.75 (P < 2 × 10−16, Wald test) indicated that increased predicted IMM load was significantly associated with increased leukocyte fraction in a patient’s cancer. In a second model, X was the proportion of CD8+ T cells inferred by CIBERSORT (53). The fitted coefficient β=5.9 (P < 2 × 10−16, Wald test) indicated that increased predicted IMM load was associated with increased tumor-infiltrating CD8+ T cells. Total lymphocyte and (Aggregate3) CD8+ T-cell fractions were estimated in Thorsson et al. (45).

Runtime analysis

To assess the speed and scalability of the tested methods, we selected one million peptides sampled from the Abelin et al. dataset (26) for class I alleles, and one million peptides sampled from the IEDB (curated dataset 2018 (54)) for class II alleles. Sampling was done with replacement. For each method listed in Fig. 1A, networks for three class I MHC alleles (HLA-A*02:01, HLA-A*02:07, HLA-A*01:01) and three class II MHC alleles (HLA-DRB1*01:01, HLA-DRB1*11:01, HLA-DRB1*04:01) were used to predict binding over a range of input sample sizes (102, 103, 104, 105, 106 ). All methods were run on a single GPU compute node (one NVIDIA TESLA K80 GPU plus six 2.50GHz Intel Xeon E5–2680v3 CPUs, 20GB memory).

Results:

High-throughput MHCnuggets breaks the MHC ligand prediction plateau

The MHCnuggets LSTM neural network architecture accepts peptides of variable lengths as inputs so that ligand binding prediction can be performed for both MHC class I and class II alleles. To enable binding prediction for rare MHC alleles that have limited associated experimental data, we designed a method that leverages networks built for closely-related common alleles with extensive data. When available, we utilize a transfer learning protocol to integrate binding affinity and HLAp results in a single network model, to better represent the natural diversity of MHC-binding peptides.

To assess the baseline performance of MHCnuggets allele-specific networks on binding affinity data, we compared our approach with widely used MHC class I ligand prediction methods using two validation sets of binding affinity measurements (5, 8). We trained and tested MHCnuggets (noMS) and MHCflurry (noMS) using the Kim et al dataset (5), and evaluated the predictions provided by NetMHC 3.0 and NetMHCpan 2.0. We observed that MHCnuggets’ performance (PPVn = 0.829, auROC=0.924) was comparable to these methods (Fig. 3a) (PPVn of all methods=0.825 +/− 0.005, auROC of all methods = 0.928 +/− 0.0031). MHCnuggets was also comparable (PPVn = 0.633, auROC=0.794) to these methods when tested on the Bonsack et al. dataset (8) (PPVn of all methods = 0.625 +/− 0.008, auROC of all methods = 0.77 +/− 0.02) (Fig. 3A) (+/− refers to standard deviation) (Supplementary Tables S3a and b, S4a and b).

Figure 3. MHC class I benchmark comparisons.

Figure 3.

A) PPVn for MHC class I allele-specific prediction on binding affinity test sets from Bonsack et al. (7 alleles) and Kim et al. (53 alleles) (5,8) B) PPVn for MHC class I allele-specific prediction on HLAp BST data set (Bassani-Sternberg et al. and Trolle et al. (7,22)), stratified by allele (6 alleles). C) PPVn for MHC class I allele-specific prediction on HLAp BST data set (from B) stratified by peptide sequence length. D) True and false positives for each method on the top 50 ranked peptides from the HLAp BST data set. PPVn = positive predictive value on the top n ranked peptides, where n is the number of true binders. TP=true positives. FP=false positives.

Earlier neoantigen prediction methods focused on class I and trained on binding affinity data from IEDB (54). More recent work incorporated both binding affinity and HLAp data into network training (14,18). We compared MHCnuggets with several class I predictors that used both binding affinity and HLAp data: MHCflurry 1.2.0, MHCflurry (train-MS), NetMHC 4.0, and NetMHCpan 4.0. We selected the Bassani-Sternberg/Trolle (BST) HLAp dataset (7,22,26) as an independent benchmark, as it was not used as training data by any of these methods. For all alleles tested, MHCnuggets achieved an overall PPVn of 0.42 and auROC of 0.82 (Fig. 3B). On average, MHCnuggets’ PPVn was more than three times higher than MHCflurry 1.2.0, MHCflurry (train-MS), NetMHC 4.0, and NetMHCpan 4.0. For all alleles, MHCnuggets predicted fewer binders than other methods, resulting in fewer false positive predictions. Stratifying by peptide length, MHCnuggets’ increased PPVn was most evident for peptides of length 9, 10, and 11 (Fig. 3C). The length distribution of predicted binders was commensurate with the observed distribution of naturally occurring binders in the HLAp benchmark tests (7) Supplementary Tables S5ad).

For some clinical applications, it may be desirable to minimize the number of false positives among a small number of top-scored peptides. We also compared PPV of the methods listed above on their top 50 and 500 ranked peptides from the BST dataset (six MHC class I alleles, Supplementary Tables S5e and S5f). MHCnuggets exhibited the highest PPV in the top 50 for all alleles except HLA-B*51:01 and the highest PPV in the top 500 for all alleles (Fig. 3D).

Prediction of peptide-MHC binding for class II and rare alleles

We assessed baseline performance of MHCnuggets class II allele-specific networks on binding affinity data. To enable comparison with the class II methods from the NetMHC group, we used a five-fold cross validation benchmark derived from IEDB that was included in the publication describing NetMHCII-2.3 and NetMHCIIpan-3.2 (39). First, we computed PPVn for each of the 27 allele-specific networks separately (Fig. 4A) (mean PPVn = 0.739). Next, we computed the overall auROC, Pearson r and Kendall Tau correlations for all 27 class II alleles. MHCnuggets overall auROC (0.849) was comparable to that of the NetMHCII-2.3 (0.861) and NetMHCIIpan-3.2 (0.861). Comparison to NetMHC class II methods was limited to overall auROC as published in (39), because their PPVn results are not publicly available (Fig. 4B) (Supplementary Table S6a and b).

Figure 4. MHC class II benchmark comparisons.

Figure 4.

A) PPVn for MHC class II allele-specific prediction on binding affinity test set from Jensen et al. (27 alleles, stratified by allele). B) auROC, K-Tau, Pearson r scores for MHC class II alleles from five-fold cross-validation. NetMHCII2.3 performance is from their self-reported auROC. auROC= area under the receiving operator characteristic curve. K-Tau = Kendall’s tau correlation. PPVn = positive predictive value on the top n ranked peptides, where n is the number of true binders.

We estimated performance for those class I and class II MHC alleles for which we were unable to train allele-specific networks, using leave-one-molecule-out (LOMO) cross-validation (39). In this LOMO protocol, MHC-peptide binding is assessed for a well-characterized allele that has been held out from training, to approximate prediction performance for a rare allele (Fig. 5A). For the 20 class I alleles, the mean PPVn was 0.65 and the mean auROC was 0.671. For the 27 class II alleles, the mean PPVn was 0.65 and the mean auROC was 0.792. In comparison, the class II mean auROC of NetMHCIIpan-3.2 was 0.781 (Fig. 5B, C, D, Supplementary Tables S7, S8a and b). NetMHCpan rare allele predictor LOMO test results for class I are not publicly available, therefore we were unable to compare with them.

Figure 5. MHC class I and II benchmark comparisons to estimate rare allele performance.

Figure 5.

A) Schematic representation of leave one molecule out (LOMO) testing. B) PPVn for MHC class I rare allele prediction on IEDB pseudo-rare alleles binding affinity test set (20 alleles, stratified by allele). C) PPVn for MHC class II rare allele prediction on binding affinity test set from Jensen et al. (27 alleles, stratified by allele) (39). D) auROC for MHC class II rare allele prediction on LOMO binding affinity test set from Jensen et al. (27 alleles, stratified by allele) (39). NetMHCIIpan3.2 results are from their self-reported auROC. auROC = area under the receiving operator characteristic curve. PPVn = positive predictive value on the top n ranked peptides, where n is the number of true binders.

Fast and scalable computation

When run on a graphics processing unit (GPU) architecture, MHCnuggets was faster and scaled more efficiently than MHC ligand predictors from the NetMHC family and MHCflurry. Given an input of one million peptides randomly selected from Abelin et al. (26), MHCnuggets runtime was 4.5, 3.2, and 18 times faster than MHCflurry 1.2.0, NetMHC 4.0, NetMHCpan 4.0, respectively (Fig. 6A). The improvement was more pronounced for class II peptides, for which an input of one million peptides to MHCnuggets ran 65.6 times and 126 times faster than NetMHCII2.3 and NetMHCIIpan 3.2, respectively (Fig. 6B). As the total number of input peptides was increased from 0 to one million, the runtime per peptide plateaued for other methods but decreased exponentially for MHCnuggets.

Figure 6. Timing and scalability.

Figure 6.

Runtime benchmark of tested methods using versions available on October 1, 2019 over a range of inputs (up to 1 million peptides). A) MHC class I prediction. B) MHC class II prediction

Candidate MHC class I immunogenic missense mutations in TCGA patients

To illustrate the utility of MHCnuggets’ improvements in scalability and positive predictive value for the analysis of very large patient cohorts, we predicted class I immunogenic missense mutations (IMMs) in patients whose exomes were sequenced by the TCGA consortium (Methods). In our analysis pipeline, patient exomes were split into 21 amino acid residue sequence fragments, centered on each somatic missense mutation. For each sequence fragment, MHCnuggets predicted the MHC binding for all possible 8-, 9-, 10- and 11-length peptide windows. Peptides that passed filters of predicted IC50 threshold (<500nM) and gene expression (Z>1.0) (Methods) for at least one patient-specific MHC allele were classified as predicted IMMs (Supplementary Table S9a). Finally, we characterized driver status and positional hotspot propensity of the predicted IMMs.

Total processing time for 26,284,638 allele-peptide comparisons supported by RNAseq expression was under 2.3 hours. First, we sought to ascertain the extent of variability in predicted IMM count among individuals with different cancer types. Next, we identified predicted IMMs and protein regions enriched for predicted IMMs that were shared across patients, because these might be informative for neoantigen-based therapeutic applications. Then we considered whether predicted IMMs were more or less likely to be driver mutations. Finally, we assessed the associations between predicted patient IMM load and computationally estimated immune cell infiltrates.

After applying a gene expression filter, we identified 101,326 unique predicted IMMs in 26 TCGA cancer types, with a mean of 15.6 per patient. We found that the majority of patients harbored fewer than 6 predicted IMMs, and 197 patients had none. Seventy-two percent of patients had from 1 and 10 predicted IMMs, compared to 1.9% of patients with more than 100, and nine patients with more than 1000 (Fig. 7A). Cancer types with the highest number of predicted IMMs were uterine corpus endometrial carcinoma (UCEC), colon adenocarcinoma (COAD), and lung adenocarcinoma (LUAD), all three of which are known for high mutation burden and immunogenicity (45). UCEC and COAD are also known to have a high frequency of microsatellite-instable (MSI) tumors. The lowest number were found in uveal melanoma (UVM), paraganglioma and pheochromocytoma (PCPG), and testicular germ cell cancer (TGCT) (Fig. 7B, Supplementary Table S9b).

Figure 7. MHC class I IMMs in TCGA patients.

Figure 7.

A). Number of predicted immunogenic missense mutations (IMMs) identified in 6,613 TCGA patients. Dotted line = mean IMMs per patient (15.6). Note, 123 patients had >100 predicted IMMs but are not included for visual clarity. B) Number of predicted IMMs by cancer type. C) IMMs shared by three or more patients and the cancer types in which they occurred. Each row represents a cancer type and each column illustrates the overlap of IMMs seen in a single cancer type or multiple cancer types. For example, the first column shows the number of IMMs shared among patients with colorectal adenocarcinoma (COAD) and uterine corpus endometrial carcinoma (UCEC). Bars to the left show the total number of unique IMMs in each cancer type. *Bar heights reflect count of unique shared IMMs, not total number of patients in which the IMM was observed. Cancer type abbreviations are in Methods. Image generated with UpSetR. D) Fibroblast growth factor receptor (FGFR3) IMM hot region identified by HotMAPs in bladder cancer (BLCA). IMMs shown and number of BLCA patients with the IMM: p.E216K (1), p.D222N (1), p.G235D (1) p.R248C (3) and p.S249C (24). Except for p.G235D, these IMMs are proximal to the interface of FGFR3 protein and the light and heavy chains of an antibody fragment designed for therapeutic application in bladder cancer (PDB ID: 3GRW) (61). ACC, adrenocortical carcinoma; BLCA, bladder urothelial carcinoma; BRCA, breast invasive carcinoma; CESC, cervical squamous cell carcinoma and endocervical adenocarcinoma; CHOL, cholangiocarcinoma; COAD, colon adenocarcinoma; GBM, glioblastoma multiforme; HNSC, head and neck squamous cell carcinoma; KICH, kidney chromophobe; KIRC; kidney renal clear cell carcinoma; KIRP, kidney renal papillary cell carcinoma; LGG, brain lower grade glioma; LIHC, liver hepatocellular carcinoma; LUAD, lung adenocarcinoma; LUSC, lung squamous cell carcinoma; PAAD, pancreatic adenocarcinoma; PCPG, pheochromocytoma and paraganglioma; PRAD, prostate adenocarcinoma; READ, rectum adenocarcinoma; SARC, sarcoma.

Across all cancer types, we identified 1,393 predicted IMMs harbored by two or more patients, of which 167 were identified in three or more patients. Of these 167 only 11.5% occurred exclusively in a single cancer type (Fig. 7C). The predicted IMMs identified in the largest number of patients were IDH1 R132H (62), FGFR3 S249C (24), PIK3CA E545K (23), KRAS G12D (18), PIK3CA E542K (18), TP53 R175H (18), TP53 R248Q (18), TP53 R273C (17), and KRAS G12V (16), which are known recurrent oncogenic driver mutations (55,56). Of the 1071 genes harboring predicted IMMs in 2 or more patients, the ones containing the most included TP53 (68), CTNNB1 (18), PIK3CA (16), HRAS (8), KRAS (7), PTEN (7), FBXW7 (6), EGFR (5), MDN1 (5), POLE (5), TRRAP (5), and VPS13C (5) (Supplementary Table S9c). Six missense mutations harbored by patients in the TCGA cohorts were previously validated by CD8+ T cell response assays (57) (58) (59). Of the six missense mutations, TP53R248Q, TP53Y220C, TP53R175H, TP53R248W and KRASG12D were predicted to be IMMs by our MHCnuggets pipeline and were shared by three or more of the TCGA patients.

Furthermore, 61.7% of the 167 predicted IMMs shared by three or more patients were classified as driver missense mutations by CHASMplus (q<0.01). This percentage is significantly higher than the number of predicted drivers among all TCGA missense mutations (9,821 out of 791,637 or 1.2%). Although many shared IMMs were predicted to be driver missense mutations, the percentage of predicted IMMs predicted to be drivers was ~0.1% of total predicted IMMs in our study. When compared to the OncoKB database of experimentally confirmed driver mutations (60), 53.9% of the shared predicted IMMs identified as “oncogenic” or “likely oncogenic” driver mutations. The percentage is lower (25.7%) if “likely oncogenic” mutations are excluded.

Although we observed a limited number of shared IMMs, we reasoned that protein regions enriched for predicted IMMs could present a therapeutic opportunity in certain cancer types. Using HotMaps 1D, we identified clusters of residues within protein regions having statistically significant enrichment of predicted IMMs (q<0.01). These included CIC in low grade glioma (LGG); NFE2L2 and FGFR3 (Fig. 7D) in bladder cancer (BLCA)(61), KRAS in pancreatic adenocarcinoma (PAAD), KIT in testicular germ cell tumors (TGCT), HRAS in head and neck squamous carcinoma (HNSC); PTEN, POLE and PPP2R1A in uterine corpus endometrial carcinoma (UCEC) and GNAQ and SF3B1 in uveal melanoma (UVM). Three genes harbored predicted immunogenic regions in more than one cancer type: P53 in BLCA, BRCA, HNSC, LGG and UCEC, PIK3CA in HNSC and cervical squamous cell carcinoma (CESC) and CTNNB1 in liver hepatocellular carcinoma (LIHC) and UCEC. (Supplementary Table S9d)

We explored the relationship between mutation driver status predicted by CHASMplus, and IMM status using logistic regression. The log-odds of being an IMM was significantly decreased for drivers (β=0.66, Wald test p<2e-16), which is consistent with previous work suggesting that negative evolutionary selection eliminates MHC class I immunogenic oncogenic mutations early in tumor development (62).

Finally, we considered whether a patient’s predicted IMM load was associated with changes in immune cell infiltrates as estimated from RNA sequencing of bulk cancer tissue. IMM load was significantly associated with increased total leukocyte fraction (β=0.75, Wald test P < 2 × 10−16) and with increased CD8+ T-cell fraction (β=5.9, Wald test P < 2 × 10−16) (Supplementary Table S9e).

These findings suggest that IMMs drive tumor immunoediting and may be informative for the interpretation of clinical responses to immunotherapy.

Discussion

MHCnuggets provides a flexible open-source platform for MHC-peptide binding prediction that can handle common MHC class I and class II alleles as well as rare alleles of both classes. The LSTM network architecture can handle peptide sequences of arbitrary length, without shortening or splitting. The single neural network architecture requires fewer hyperparameters than more complex architectures and simplifies network training. In addition, our neural network transfer learning protocols allow for parameter sharing among allele-specific, binding affinity -and HLAp-trained networks. When trained on binding affinity data, MHCnuggets performs as well as other current methods. When trained on both binding affinity and HLAp data, we demonstrate improved PPVn on an independent HLAp test set, with respect to other methods that use both binding affinity and HLAp data. Although PPVn was lowest for the independent HLAp test set for all methods, this result is likely due to systematic differences between training HLAp data (monoallelic B-cell lines) (26) and the test data comprised of seven multi-allelic cell lines (HeLA, HTC116, JY, fibroblasts, SupB15, HCC1937, HCC1143) (7,22), yielding a more challenging prediction problem. We attribute MHCnuggets’ improvement on the independent test set with respect to other methods to: 1) optimization of PPVn in our network training protocol and 2) our implementation of transfer learning to integrate information from binding affinity and HLAp measurements. The performance of all methods is generally highest when both training and test data come from similar binding affinity experiments, but performance improvement on HLAp data is more biologically relevant (63).

We demonstrate improved scalability by comparing the runtime of MHCnuggets on 1 million peptides to comparable methods, and further by processing over 26 million expressed peptide-allele pairs across TCGA samples in under 2.3 hours. We identified 101,326 unique immunogenic missense mutations (IMMs) harbored by patients using 26 cancer types sequenced by the TCGA, based on transcriptional abundance and differential binding affinity compared to reference peptides. These results contrast with a previous report of neoantigens in TCGA patients in several respects. Rech et al. (64) applied a minimum expression threshold of 1 RNA sequencing read count, an IEDB-recommended combination of neoantigen predictors derived primarily from different versions of NetMHC, and IC50 threshold of 50nM to identify strong MHC binders. Their approach yielded 495,793 predicted class I classically defined neoantigen peptides (each harboring a single immunogenic mutation) from 6,324 patients in 26 cancer types. As in our study, high variability in neoantigen burden across cancer types was observed. The difference between IMM and neoantigen burden in the two studies is likely due to differences in RNA expression threshold and the low false positive rate of MHCnuggets compared to IEDB-recommended tools.

Based on our conservative thresholds, IMMs were almost exclusively private to individual TCGA patients, with only 1,393 IMMs observed in more than one patient. Although more than 61% of IMMs shared by more than two patients were predicted to be driver mutations, the overall log odds of immunogenicity decreased for predicted driver mutations, indicating immunogenicity might shape the driver mutation landscape. Patient IMM counts were also associated with increase in total leukocyte fraction and fraction of CD8+ T-cells, suggesting that they may be relevant to immune system response to cancer.

This work has several limitations. First, our analyses are limited to missense mutations, which, although numerous, cannot account for the various somatic gene fusions, frameshift indels, splice variants etc. in tumors that also generate neoantigens. Although MHCnuggets can handle peptide sequences regardless of their mutational origins, we prioritized missense mutations in this study. Indeed, the context of a peptide sequence, such as what sequences are flanking, its source protein and the expression of the source protein, is informative for MHC ligand prediction (21,26). This type of information is only available for a limited number of HLAp data sets, which were unavailable to us for training purposes. As more well-characterized HLAp datasets become available, we will further develop MHCnuggets to include these features. We did not address T-cell receptor (TCR) binding to bound peptide-MHC complexes or T-cell activation upon complex binding. Although we are pursuing this more complex modeling problem, we believe that improved prediction of peptide binding to MHC is also therapeutically relevant (21). Finally, we are unable to directly compare performance to the MHC class II prediction methods from the NetMHC group, except for self-reported auROC. Although we are not able to do a rigorous comparison of MHCnuggets class II prediction, our benchmark comparisons suggested that MHCnuggets was competitive with NetMHCII2.3 and that MHCnuggets class II rare allele performance was competitive with NetMHCIIpan3.2., suggesting that further work in this area is warranted.

In summary, we present MHCnuggets, an open source software package for MHC ligand prediction that performs better than existing methods with respect to positive predictive value by leveraging transfer learning to integrate binding affinity and HLAp data. In contrast to previous methods, MHCnuggets handles both MHC class I and class II ligand prediction and both common and rare HLA alleles, all within a single framework. We demonstrated the utility of MHCnuggets as a basic pipeline to analyze mutation immunogenicity, shared IMMs and the relationship between mutation immunogenicity, driver potential and immune infiltrates from large-scale cancer patient sequencing data from TCGA.

Supplementary Material

1
2
3

Acknowledgments

Part of this research project was conducted using computational resources at the Maryland Advanced Research Computing Center (MARCC).

Financial support

This work was supported in part by a William R. Brody Faculty Scholarship to R.K., Dr. Miriam and Sheldon G. Adelson Medical Research Foundation, the Stand Up to Cancer–Dutch Cancer Society International Translational Cancer Research Dream Team Grant (SU2C-AACR-DT1415), the Commonwealth Foundation, U.S. NIH (grants CA121113, CA006973, and CA180950), the V Foundation and LUNGevity to V.E.V. Stand Up To Cancer is a program of the Entertainment Industry Foundation administered by the American Association for Cancer Research.

V.A receives research funding from Bristol-Myers Squibb. V.E.V. is a founder of Personal Genome Diagnostics, a member of its Scientific Advisory Board and Board of Directors, and owns Personal Genome Diagnostics stock, which are subject to certain restrictions under university policy. V.E.V. is an advisor to Takeda Pharmaceuticals. Within the last five years, V.E.V. has been an advisor to Daiichi Sankyo, Janssen Diagnostics, and Ignyta.

Footnotes

Disclosure of potential conflicts of interest

The terms of these arrangements are managed by Johns Hopkins University in accordance with its conflict of interest policies.

Potential Conflicts of Interest: The terms of these arrangements are managed by Johns Hopkins University in accordance with its conflict of interest policies.

References

  • 1.Anagnostou V, Smith KN, Forde PM, Niknafs N, Bhattacharya R, White J, et al. Evolution of Neoantigen Landscape during Immune Checkpoint Blockade in Non–Small Cell Lung Cancer. Cancer Discovery 2017 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Yarchoan M, Johnson BA, Lutz ER, Laheru DA, Jaffee EM. Targeting neoantigens to augment antitumour immunity. Nature reviews Cancer 2017;17:209–22 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Lundegaard C, Lund O, Buus S, Nielsen M. Major histocompatibility complex class I binding predictions as a tool in epitope discovery. Immunology 2010;130:309–18 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Andreatta M, Nielsen M. Gapped sequence alignment using artificial neural networks: application to the MHC class I system. Bioinformatics 2016;32:511–7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Kim Y, Sidney J, Buus S, Sette A, Nielsen M, Peters B. Dataset size and composition impact the reliability of performance benchmarks for peptide-MHC binding predictions. BMC Bioinformatics 2014;15:241- [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Kim Y, Sidney J, Pinilla C, Sette A, Peters B. Derivation of an amino acid similarity matrix for peptide: MHC binding and its application as a Bayesian prior. BMC Bioinforma 2009;10 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Trolle T, McMurtrey CP, Sidney J, Bardet W, Osborn SC, Kaever T, et al. The Length Distribution of Class I-Restricted T Cell Epitopes Is Determined by Both Peptide Supply and MHC Allele-Specific Binding Preference. J Immunol 2016;196:1480–7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Bonsack M, Hoppe S, Winter J, Tichy D, Zeller C, Kupper MD, et al. Performance Evaluation of MHC Class-I Binding Prediction Tools Based on an Experimentally Validated MHC-Peptide Binding Data Set. Cancer Immunol Res 2019;7:719–36 [DOI] [PubMed] [Google Scholar]
  • 9.Gfeller D, Bassani-Sternberg M, Schmidt J, Luescher IF. Current tools for predicting cancer-specific T cell immunity. OncoImmunology 2016;5:e1177691-e [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Liu XS, Mardis ER. Applications of Immunogenomics to Cancer. Cell 2017;168:600–12 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Editorial NB. The problem with neoantigen prediction. Nature Biotechnology 2017;35:97. [DOI] [PubMed] [Google Scholar]
  • 12.Wieczorek M, Abualrous ET, Sticht J, Alvaro-Benito M, Stolzenberg S, Noe F, et al. Major Histocompatibility Complex (MHC) Class I and MHC Class II Proteins: Conformational Plasticity in Antigen Presentation. Front Immunol 2017;8:292. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Lu YC, Robbins PF. Targeting neoantigens for cancer immunotherapy. Int Immunol 2016;28:365–70 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Jurtz V, Paul S, Andreatta M, Marcatili P, Peters B, Nielsen M. NetMHCpan-4.0: Improved Peptide–MHC Class I Interaction Predictions Integrating Eluted Ligand and Peptide Binding Affinity Data. The Journal of Immunology 2017 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Karosiene E, Rasmussen M, Blicher T, Lund O, Buus S, Nielsen M. NetMHCIIpan-3.0, a common pan-specific MHC class II prediction method including all three human MHC class II isotypes, HLA-DR, HLA-DP and HLA-DQ. Immunogenetics 2013;65:711–24 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Lundegaard C, Lamberth K, Harndahl M, Buus S, Lund O, Nielsen M. NetMHC-3.0: accurate web accessible predictions of human, mouse and monkey MHC class I affinities for peptides of length 8–11. Nucleic Acids Research 2008;36:509–12 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Nielsen M, Lundegaard C, Blicher T, Lamberth K, Harndahl M, Justesen S, et al. NetMHCpan, a Method for Quantitative Predictions of Peptide Binding to Any HLA-A and -B Locus Protein of Known Sequence. PLOS ONE 2007;2:1–10 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.O’Donnell TJ, Rubinsteyn A, Bonsack M, Riemer AB, Laserson U, Hammerbacher J. MHCflurry: Open-Source Class I MHC Binding Affinity Prediction. Cell Syst 2018;7:129–32 e4 [DOI] [PubMed] [Google Scholar]
  • 19.Bassani-Sternberg M, Coukos G. Mass spectrometry-based antigen discovery for cancer immunotherapy. Curr Opin Immunol 2016;41:9–17 [DOI] [PubMed] [Google Scholar]
  • 20.Schumacher TN, Schreiber RD. Neoantigens in cancer immunotherapy. Science 2015;348:69–74 [DOI] [PubMed] [Google Scholar]
  • 21.Bulik-Sullivan B, Busby J, Palmer CD, Davis MJ, Murphy T, Clark A, et al. Deep learning using tumor HLA peptide mass spectrometry datasets improves neoantigen identification. Nat Biotechnol 2018 [DOI] [PubMed] [Google Scholar]
  • 22.Bassani-Sternberg M, Pletscher-Frankild S, Jensen LJ, Mann M. Mass spectrometry of human leukocyte antigen class I peptidomes reveals strong effects of protein abundance and turnover on antigen presentation. Mol Cell Proteomics 2015;14:658–73 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Boehm KM, Bhinder B, Raja VJ, Dephoure N, Elemento O. Predicting peptide presentation by major histocompatibility complex class I: an improved machine learning approach to the immunopeptidome. BMC Bioinformatics 2019;20:7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Hochreiter S, Schmidhuber J. Long short-term memory. Neural computation 1997;9:1735–80 [DOI] [PubMed] [Google Scholar]
  • 25.Tan C, Sun F, Kong T, Zhang W, Yang C, Liu C. A survey on deep transfer learning. 2018. Springer; p 270–9. [Google Scholar]
  • 26.Abelin JG, Keskin DB, Sarkizova S, Hartigan CR, Zhang W, Sidney J, et al. Mass Spectrometry Profiling of HLA-Associated Peptidomes in Mono-allelic Cells Enables More Accurate Epitope Prediction. Immunity 2017;46:315–26 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Lefranc MP, Giudicelli V, Duroux P, Jabado-Michaloud J, Folch G, Aouinti S, et al. IMGT(R), the international ImMunoGeneTics information system(R) 25 years on. Nucleic Acids Res 2015;43:D413–22 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Sidney J, Assarsson E, Moore C, Ngo S, Pinilla C, Sette A, et al. Quantitative peptide binding motifs for 19 human and mouse MHC class I molecules derived using positional scanning combinatorial peptide libraries. Immunome Res 2008;4:2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Gfeller D, Guillaume P, Michaux J, Pak HS, Daniel RT, Racle J, et al. The Length Distribution and Multiple Specificity of Naturally Presented HLA-I Ligands. J Immunol 2018;201:3705–16 [DOI] [PubMed] [Google Scholar]
  • 30.Greenbaum J, Sidney J, Chung J, Brander C, Peters B, Sette A. Functional classification of class II human leukocyte antigen (HLA) molecules reveals seven different supertypes and a surprising degree of repertoire sharing across supertypes. Immunogenetics 2011;63:325–35 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Abadi Mn, Agarwal A, Barham P, Brevdo E, Chen Z, Citro C, et al. TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems. CoRR 2016;abs/1603.04467 [Google Scholar]
  • 32.Chollet F, others Keras. GitHub; 2015. [Google Scholar]
  • 33.Hundal J, Carreno BM, Petti AA, Linette GP, Griffith OL, Mardis ER, et al. pVAC-Seq: A genome-guided in silico approach to identifying tumor neoantigens. Genome Med 2016;8:11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Hundal J, Kiwala S, McMichael J, Miller CA, Wollam AT, Xia H, et al. pVACtools: a computational toolkit to identify and visualize cancer neoantigens. bioRxiv 2019:501817 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Wood MA, Paralkar M, Paralkar MP, Nguyen A, Struck AJ, Ellrott K, et al. Population-level distribution and putative immunogenicity of cancer neoepitopes. BMC Cancer 2018;18:414. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Nielsen M, Lundegaard C, Worning P, Lauemoller SL, Lamberth K, Buus S, et al. Reliable prediction of T-cell epitopes using neural networks with novel sequence representations. Protein Sci 2003;12:1007–17 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Kingma DP, Ba J. Adam: A Method for Stochastic Optimization. CoRR 2014;abs/1412.6980 [Google Scholar]
  • 38.Gal Y. A Theoretically Grounded Application of Dropout in Recurrent Neural Networks. arXiv:151205287 2015 [Google Scholar]
  • 39.Jensen KK, Andreatta M, Marcatili P, Buus S, Greenbaum JA, Yan Z, et al. Improved methods for predicting peptide binding affinity to MHC class II molecules. Immunology 2018;154:394–406 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Vita R, Mahajan S, Overton JA, Dhanda SK, Martini S, Cantrell JR, et al. The Immune Epitope Database (IEDB): 2018 update. Nucleic Acids Res 2019;47:D339–D43 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Hoof I, Peters B, Sidney J, Pedersen LE, Sette A, Lund O, et al. NetMHCpan, a method for MHC class I binding prediction beyond humans. Immunogenetics 2009;61:1–13 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Trolle T, Metushi IG, Greenbaum JA, Kim Y, Sidney J, Lund O, et al. Automated benchmarking of peptide-MHC class I binding predictions. Bioinformatics 2015;31:2174- [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Bailey MH, Tokheim C, Porta-Pardo E, Sengupta S, Bertrand D, Weerasinghe A, et al. Comprehensive Characterization of Cancer Driver Genes and Mutations. Cell 2018;173:371–85 e18 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Ellrott K, Bailey MH, Saksena G, Covington KR, Kandoth C, Stewart C, et al. Scalable Open Science Approach for Mutation Calling of Tumor Exomes Using Multiple Genomic Pipelines. Cell Syst 2018;6:271–81 e7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Thorsson V, Gibbs DL, Brown SD, Wolf D, Bortone DS, Ou Yang TH, et al. The Immune Landscape of Cancer. Immunity 2018;48:812–30 e14 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Li B, Dewey CN. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinformatics 2011;12:323. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Szolek A, Schubert B, Mohr C, Sturm M, Feldhahn M, Kohlbacher O. OptiType: precision HLA typing from next-generation sequencing data. Bioinformatics 2014;30:3310–6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.UniProt C. Activities at the Universal Protein Resource (UniProt). Nucleic Acids Res 2014;42:D191–8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Tokheim C, Bhattacharya R, Niknafs N, Gygax DM, Kim R, Ryan M, et al. Exome-Scale Discovery of Hotspot Mutation Regions in Human Cancer Using 3 D Protein Structure. Cancer Res 2016;76:3719–31 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. Journal of the Royal statistical society: series B (Methodological) 1995;57:289–300 [Google Scholar]
  • 51.Tokheim C, Karchin R. CHASMplus Reveals the Scope of Somatic Missense Mutations Driving Human Cancers. Cell Syst 2019 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Newman AM, Liu CL, Green MR, Gentles AJ, Feng W, Xu Y, et al. Robust enumeration of cell subsets from tissue expression profiles. Nat Methods 2015;12:453–7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Chen B, Khodadoust MS, Liu CL, Newman AM, Alizadeh AA. Profiling Tumor Infiltrating Immune Cells with CIBERSORT. Methods Mol Biol 2018;1711:243–59 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Vita R, Overton JA, Greenbaum JA, Ponomarenko J, Clark JD, Cantrell JR, et al. The immune epitope database (IEDB) 3.0. Nucleic Acids Research 2015;43:D405-D [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Karakas B, Bachman KE, Park BH. Mutation of the PIK3CA oncogene in human cancers. Br J Cancer 2006;94:455–9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Tomlinson DC, Hurst CD, Knowles MA. Knockdown by shRNA identifies S249C mutant FGFR3 as a potential therapeutic target in bladder cancer. Oncogene 2007;26:5889–99 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Tran E, Ahmadzadeh M, Lu YC, Gros A, Turcotte S, Robbins PF, et al. Immunogenicity of somatic mutations in human gastrointestinal cancers. Science 2015;350:1387–90 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Malekzadeh P, Pasetto A, Robbins PF, Parkhurst MR, Paria BC, Jia L, et al. Neoantigen screening identifies broad TP53 mutant immunogenicity in patients with epithelial cancers. J Clin Invest 2019;129:1109–14 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Parkhurst MR, Robbins PF, Tran E, Prickett TD, Gartner JJ, Jia L, et al. Unique Neoantigens Arise from Somatic Mutations in Patients with Gastrointestinal Cancers. Cancer Discov 2019;9:1022–35 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Chakravarty D, Gao J, Phillips SM, Kundra R, Zhang H, Wang J, et al. OncoKB: A Precision Oncology Knowledge Base. JCO Precis Oncol 2017;2017 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Qing J, Du X, Chen Y, Chan P, Li H, Wu P, et al. Antibody-based targeting of FGFR3 in bladder carcinoma and t(4;14)-positive multiple myeloma in mice. J Clin Invest 2009;119:1216–29 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Marty R, Kaabinejadian S, Rossell D, Slifker MJ, van de Haar J, Engin HB, et al. MHC-I Genotype Restricts the Oncogenic Mutational Landscape. Cell 2017;171:1272–83 e15 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Bassani-Sternberg M, Chong C, Guillaume P, Solleder M, Pak H, Gannon PO, et al. Deciphering HLA-I motifs across HLA peptidomes improves neo-antigen predictions and identifies allostery regulating HLA specificity. PLoS Comput Biol 2017;13:e1005725 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Rech AJ, Balli D, Mantero A, Ishwaran H, Nathanson KL, Stanger BZ, et al. Tumor Immunity and Survival as a Function of Alternative Neopeptides in Human Cancer. Cancer Immunol Res 2018 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1
2
3

RESOURCES