Accurate modeling of peptide-MHC structures with AlphaFold

Victor Mikhaylov; Chad A Brambley; Grant L J Keller; Alyssa G Arbuiso; Laura I Weiss; Brian M Baker; Arnold J Levine

doi:10.1016/j.str.2023.11.011

. Author manuscript; available in PMC: 2025 Feb 1.

Published in final edited form as: Structure. 2023 Dec 18;32(2):228–241.e4. doi: 10.1016/j.str.2023.11.011

Accurate modeling of peptide-MHC structures with AlphaFold

Victor Mikhaylov ^1,^3,^*, Chad A Brambley ², Grant L J Keller ², Alyssa G Arbuiso ², Laura I Weiss ², Brian M Baker ², Arnold J Levine ¹

PMCID: PMC10872456 NIHMSID: NIHMS1954172 PMID: 38113889

SUMMARY

Major histocompatibility complex (MHC) proteins present peptides on the cell surface for T-cell surveillance. Reliable in silico prediction of which peptides would be presented and which T-cell receptors would recognize them is an important problem in structural immunology. Here, we introduce an AlphaFold-based pipeline for predicting the three-dimensional structures of peptide-MHC complexes for class I and class II MHC molecules. Our method demonstrates high accuracy, outperforming existing tools in class I modeling accuracy and class II peptide register prediction. We validate its performance and utility with new experimental data on a recently described cancer neoantigen/wild-type peptide pair and explore applications towards improving peptide-MHC binding prediction.

eTOC Blurb

Mikhaylov et al. developed an AlphaFold-based pipeline for accurate structure modeling of peptides bound to major histocompatibility complex proteins. The structures of these protein complexes determine their recognition by T-cell receptors which in turn drives the immune response in infection, cancer and autoimmunity.

Graphical Abstract

graphic file with name nihms-1954172-f0001.jpg

INTRODUCTION

T-cells play a crucial role in effecting and regulating the immune response in the context of infection, cancer, and autoimmunity. Their ability to recognize and respond to infected or abnormal cells is mediated by the interaction between T-cell receptors and peptides presented by MHC proteins on the cell surface. Predicting which peptides will bind to MHC, and understanding the properties of resulting peptide-MHC (pMHC) complexes such as immunogenicity and repertoires of cognate T-cell receptors, is essential for designing effective vaccines and immunotherapies. All of these properties ultimately depend on the structure of the peptide-MHC complex, making accurate in silico prediction of these structures an important task.

A number of pMHC structure prediction tools have been developed based on homology modeling, conformation sampling, and empirical energy minimization.1 Protein modeling tools such as Rosetta² and MODELLER³ were adapted to pMHC structure prediction in the context of immunogenicity prediction⁴ and the modeling of pMHC-TCR complexes⁵. An example of an automated user-friendly tool is PANDORA⁶, a MODELLER-based pipeline. Since the deep learning revolution in protein structure prediction^7,8, AlphaFold^8,9 (AF) has been applied to pMHC structure and binding prediction¹⁰, and custom neural nets were built specifically for this task^11,12.

Our goal in this paper is to create and test an automated AlphaFold-based pipeline for pMHC modeling that would produce accurate models and be convenient to use. The main components of TFold, our pipeline, are paired pMHC template assignment, paired pMHC multiple-sequence alignments, and peptide register filtering with a sequence-based neural net.

TFold works for peptides of different lengths in class I and class II structures with MHC alleles from human, mouse, and a few other species. It demonstrates high accuracy, for class I pMHCs significantly outperforming PANDORA⁶. For class II pMHCs, it outperforms state-of-the-art methods netMHCIIpan 3.2 and 4.0^13,14 in peptide register prediction. Building on these results, we also explore applications of our pipeline to improved prediction of pMHC binding. Lastly, we demonstrate a potential use case in evaluating the properties of cancer neoantigens by evaluating its performance and utility with new pMHC structural data.

Results

A dataset of pMHC structures: peptide registers and geometric features.

We collected structures representing 928 unique class I and class II peptide-MHC protein complexes from the PDB¹⁵. Most of these have human or mouse MHC alleles, but a number of structures with class I alleles for other species were also included (Table [S1]). In our workflow, the pMHCs for which at least one structure was deposited prior to the AlphaFold training cutoff date (2018–04-30) were assigned to a discovery dataset, which we used to explore pMHC features and optimize the pipeline’s hyperparameters. (No training of continuous parameters was done on the discovery dataset.) The rest of the data were assigned to a held out test set. To reduce redundancy, we clustered pMHCs by sequence distance (hierarchical clustering, tree cut at pMHC sequence mismatch equal to four) and chose one representative per cluster. The statistics of the resulting non-redundant dataset are plotted in Figure 1A. (Please see [Online Methods] for the details of data pre-processing.)

Figure 1. — Structure dataset and the modeling pipeline.

(A) Counts of non-redundant pMHCs in the discovery and test datasets, for class I and class II.

(B) A schematic of a peptide position relative to the MHC binding groove. (For class I, positions P2 and P9 are the primary anchors. For class II, positions P1 and P9 are the two ends of the peptide core.) Peptide registers can be parameterized by the lengths of the C-terminal and N-terminal regions $(n_{ℓ}, n_{r})$ . The sets of class I and class II registers observed in the discovery dataset can be characterized by a few simple rules.

(C) The four registers that are possible for a class I 9-mer peptide, according to our register selection rules.

(D) For a pMHC sequence and a choice of peptide register, our pipeline assigns templates and a paired peptide-MHC multiple sequence alignment. From these data, AlphaFold (AF) produces a model and an error score. (We use 100-pLDDT averaged over the peptide core as the score.)

(E) A neural net *seqnn* predicts the pMHC dissociation constant $K_{d}$ for each peptide register. Only registers with $K_{d}$ within a certain factor of the lowest $K_{d}$ for a given pMHC are then considered in modeling.

For a pMHC structure, we define the peptide binding core (equivalently, peptide binding register) as the portion of the peptide that is positioned within the MHC binding groove. The binding register can be characterized by the lengths $(n_{ℓ}, n_{r})$ of the N-terminal and C-terminal peptide flanking regions (Figure 1B). Misidentifying the register in structure prediction leads to grossly incorrect models, and therefore, it is important to define the set of registers that are to be considered in modeling.

Among non-redundant class I pMHCs in the discovery dataset, around 95% of the structures assume the canonical register $(n_{ℓ}, n_{r}) = (0,0)$ with the peptide termini buried in the binding groove. The set of registers in the remaining 5% can be characterized by three simple rules (Figure 1B): (1) $n_{ℓ} \in {- 1,0, 1}$ and $n_{r} \geq 0$ . (The case $n_{ℓ} = - 1$ corresponds to the N-terminal residue of the peptide being the P2 primary anchor position); (2) the peptide core must be of length at least 8; (3) only one of $n_{ℓ}$ or $n_{r}$ can be non-zero. The third rule is likely probabilistic in origin: having a non-canonical terminal region is unlikely, and having both of them non-canonical is extremely unlikely. The single exception to these rules in the discovery dataset is a murine H2-Kd structure 5trz with register (−1,1), violating rule (3). In modeling class I structures, for peptide of any length we only consider registers that are allowed by the three rules above. Figure 1C illustrates the four registers that should be considered for a 9-mer peptide.

For class II structures that appear in the discovery dataset, the set of registers can be characterized by two rules (Figure 1B): (1) $n_{ℓ}, n_{r} \geq 0$ , and (2) the core length is nine. There exists one exception, a HLA-DRB1*01:01 structure 4gbx with register (−1,2), i.e., residue in position P1 missing. However, in modeling, we will assume that such exceptions are rare and will only consider registers allowed by the rules above. Recently, it has been demonstrated that HLA-DP alleles commonly bind peptides in reverse binding mode^16,17. We excluded two such structures from our dataset, and we leave incorporating these binding modes into our pipeline for future work. We also do not consider class II binding modes with overstretched or bulged peptides that have been predicted in silico¹⁸, as we find no such experimental structures in the PDB. (See STAR★Methods for more details on our register identification method.)

In modeling class I pMHC structures, our primary metric will be peptide RMSD (pRMSD), and more specifically, alpha-carbon peptide RMSD ( $C_{α}$ -pRMSD). We define it by superimposing a model onto a true structure by MHC chains only, and then computing the error for the peptide. Unlike class I, for class II structures, the peptide core lies flat in the binding groove, while the terminal regions can adopt diverse conformations (Figures S3A,B). The pRMSD metric would then be dominated by errors in the flanking regions, which typically have few contacts with a T-cell receptor, except for residue P0 (Figures S3C,D). To focus on the most important geometric features, for class II models our primary metric will be peptide core RMSD (cRMSD), which only includes ten residues P0-P9 of the (extended) core of the peptide.

TFold pipeline.

The standard AlphaFold pipeline takes a list of protein sequences, builds a multiple-sequence alignment (MSA), and, optionally, finds templates for each chain in the PDB. The resulting data along with the chain sequences constitute the inputs to the AlphaFold neural net^8,9. For modeling pMHC structures, this standard pipeline produces poor results^6,12. We introduce a custom AlphaFold-based pipeline TFold (Figure 1D,E) for modeling pMHC structures that is tailored to this particular problem and shows much better performance. We briefly describe our pipeline below, leaving the details for STAR Methods.

To guide the network on the mutual positioning of the peptide and the MHC, we provide peptide-MHC templates aligned to the whole pMHC sequence. The pMHC sequence is inputed as a single chain using the residue gap trick¹⁹. This way of sequence input allows the information about relative chain orientation in the templates to be used, unlike when using AlphaFold in the multimer regime. Given a pMHC to be modeled, we consider all possible peptide registers subject to the selection rules described above. A register choice defines how the target peptide is aligned to the structural template. MHC sequences are aligned according to the IMGT numbering²⁰. Aligned templates are then sorted by total pMHC sequence mismatch, and the top few are used in modeling.

It is computationally costly to create models for all possible registers for a given peptide. Furthermore, for class I, we know that the canonical register is the right answer for ~95% of the structures, and we would like to impose this prior. To do so, we train a sequence-based pMHC binding and register predictor seqnn, and only consider peptide registers with high predicted score.

AlphaFold can use multiple sequence alignments (MSA) to extract residue co-occurrence patterns. We reasoned that an alignment of sequences of binding peptide-MHC pairs could serve as a substitute for co-evolutionary MSA for our problem. Indeed, adding such paired pMHC MSA lead to a slightly better class I modeling accuracy and substantially improved register prediction accuracy for class II structures, as described below.

Multiple models are produced by our pipeline for each pMHC. To select the best one, and in particular, to predict the peptide register, we use the AlphaFold pLDDT score averaged over the peptide core. It is sometimes convenient to plot 100-pLDDT, to which we refer as the error score. We experimented with assigning different weights to pLDDT for anchor and non-anchor residues or adding scores that quantify template quality, but did not observe any improvement in model discrimination.

Modeling class I pMHC structures.

To establish a reference which modeling accuracy can be compared to, we assessed variation in peptide geometry in pairs of experimental structures obtained for the same pMHC sequence. For each class I pMHC sequence for which more than one PDB entry is available, we chose one pair of entries and computed peptide RMSD between them. We did this separately for pairs of unbound structures and for unbound vs TCR-bound structures (Figure 3A). For pairs of unbound structures, median $C_{α}$ -pRMSD is very low, at 0.38 Å, and for unbound vs bound structures it is slightly higher, at 0.57 Å.

Figure 3. — Additional details on structures and modeling results.

(A) Comparison of peptide geometry in pairs of PDB entries that share the same class I pMHC sequence. Data for pairs of TCR-unbound pMHC structures and TCR-bound vs unbound pMHC structures are shown separately. Here and below, box plots show median value and first quartiles, whiskers show the rest of the distribution.

(B), (C) TFold models compared to TCR-bound and unbound pMHC structures, for class I pMHCs for which both bound and unbound experimental structures exist.

(D), (E) Comparison of peptide $C_{α}$ -RMSD for TFold models selected by best pLDDT or best RMSD and pMHC templates selected by best sequence match or best RMSD.

(F)-(I) Modeling accuracy as a function of different features, for the class I discovery dataset. The features include peptide length, MHC locus and species, MHC sequence mismatch of the best available template, and dissociation constant as predicted by netMHCpan 4.1.

(J) Modeling accuracy for the class I difficult pMHC pairs (see also Figure 2H,I). The two box plots are for models selected by predicted accuracy (“best pLDDT”) and for the best models (“best RMSD”).

(K) Score vs accuracy plots for TFold models for the class I difficult pMHC pairs. (See caption for figure 2D for a description of such plots.)

(L) MHC sequence mismatch for the best template, for different MHC species and loci. Each point is a pMHC from the class I discovery dataset.

(M) Fraction of incorrect registers as predicted by different algorithms for class II pMHCs from the discovery and test datasets, stratified by HLA-DQ vs all other loci. Error bars show 95% confidence intervals (Agresti-Coull estimates).

(N) MHC sequence mismatch for the best template, for HLA-DQ vs all other loci or species. Each point is a pMHC from the class II discovery dataset.

To evaluate the performance of the TFold pipeline for class I complexes, we model structures for non-redundant pMHCs from the discovery and test datasets. They mostly represent human loci HLA-A and B, as well as MHC alleles from mouse and other species, and a few HLA-C,E,G structures (Figure 2A). In modeling the discovery dataset, in order to ensure that the chosen templates are not closely related to the targets, we only allow templates from sequence clusters different from the target. For the test dataset, we allow arbitrary templates from the discovery dataset, which mimics the real life scenario where previously deposited structures are used to model a new target.

Figure 2. — Results of pMHC modeling for class I structures.

(A) Numbers of non-redundant pMHCs for different MHC loci and species in the class I dataset.

(B) $C_{α}$ -pRMSD (alpha-carbon peptide RMSD) for TFold models in the class I discovery and test datasets. RMSD is computed upon superimposing MHC chains of the model and the experimental structure. Results shown for models selected by predicted LDDT (“best pLDDT”) and by peptide RMSD (“best RMSD”). Here and below, box plots show median value and first quartiles, whiskers show the rest of the distribution.

(C) Fractions of incorrect predictions of peptide registers for class I models in the discovery and test datasets, for different methods. The first method (“assign canonical”) assigns the canonical register to all pMHCs. Error bars show 95% confidence intervals (Agresti-Coull estimate).

(D) Score vs accuracy plots for TFold models in the discovery and test datasets. Four accuracy groups of models based on $C_{α}$ -pRMSD are denoted by color: sub-angstrom (<1 Å), good (1–1.5 Å), poor (1.5–2.5 Å), and unacceptable (>2.5 Å). For every score cutoff (100-pLDDT plotted along the horizontal axis), the plot shows fractions of pMHCs with models in the four accuracy groups among the pMHCs with score below the cutoff. The fractions are computed relative to the total number of pMHCs to illustrate what fraction of targets is retained for each cutoff. A vertical dashed line marks the median score. For models below and above the median score, percentages of models with $C_{α}$ -pRMSD>1.5Å are shown to illustrate the score’s discrimination ability. Spearman’s $ρ$ for the score vs RMSD is also printed on the plots.

(E) Detailed diagram of register errors made by different algorithms on the discovery and test datasets. Rows correspond to algorithms, and columns to structures, with PDB IDs indicated below. Columns are colored by MHC locus and species. Each filled square indicates that the corresponding algorithm predicted the register incorrectly for the corresponding pMHC structure.

(F) Comparison of $C_{α}$ -pRMSDs for class I models produced by PANDORA and TFold. Percentages in the left plot are fractions of pMHCs above and below the diagonal. Both algorithms were run on the subset of the test set that only includes human and mouse MHC proteins.

(G) Score vs accuracy plots for models produced by PANDORA and TFold. (See caption for figure 2D for a description of such plots.) For PANDORA, the scores in the plot are values of the MODELLER³ *molpdf* energy function.

(H) TFold modeling results for the set of class I pMHC pairs that are similar in sequence but differ in geometry (“difficult pMHC pairs”). Each point in the scatterplot is a modeled pMHC, and the coordinates are $C_{α}$ -pRMSD of the model relative to the native structure (“true RMSD”) or to the experimental structure for the other pMHC in the pair (“cross RMSD”). Percentages indicate fractions of points above and below the diagonal. Points are colored by error score (100-pLDDT) of the models.

(I) Details on the modeling results for the class I difficult pMHC pairs. Each column corresponds to a pair of pMHCs similar in sequence. (Some of them differ only by mutations in the MHC sequence, which are not shown.) Markers indicate $C_{α}$ -pRMSDs for models w.r.t. their experimental structures, between the two experimental structures, and between the models. Markers for model to native RMSDs are colored by the error scores, and average scores for each pair are used to sort the columns left to right. A perfect modeling algorithm would have low error for the models (colored markers near zero) and similar RMSDs between models and between true structures (crosses and empty circles overlapping).

TFold demonstrates sub-angstrom accuracy with median $C_{α}$ -pRMSD of 0.73 Å on the discovery dataset and 0.77 Å on the test dataset, showcasing its consistent performance (Figure 2B). (Median all-atom pRMSD is 1.55 Å and 1.77 Å, respectively.) If the best model is chosen for each pMHC instead of the model with the highest predicted accuracy, median $C_{α}$ -pRMSD improves slightly to 0.64 Å on both datasets.

For 37 pMHCs for which both an unbound and a TCR-bound structure is available, we compared TFold models to both. The model is closer to the unbound structure for 68% of pMHCs, but the difference is small, and median $C_{α}$ -pRMSD shows the opposite trend, with 0.85 Å and 0.79 Å for unbound and bound structures, respectively (Figures 3B,C). Given that the median RMSD between unbound and bound experimental structures is below TFold error, this is not surprising.

On the discovery dataset, TFold predicts peptide registers better than netMHCpan 4.1, and also improves over seqnn, which is used for register pre-filtering (Figure 2C). TFold makes incorrect prediction for two structures, for which the other algorithms also fail (Figure 2E). One of these (5trz) has register (−1,1) and is the only exception in the discovery dataset to our register selection rules of Figure 1B. The other structure (5ymv) has a chicken MHC allele. We caution that the discovery dataset was used to choose hyperparameters for seqnn and TFold, and therefore these comparisons should not be over-interpreted. But notably, in all hyperparameter configurations that we have tested, TFold improved register prediction over seqnn.

On the test set, all algorithms show similar accuracy in register prediction (Figure 2C), however, these data include only three examples with non-canonical register. One of them (6zkx) has an HLA-E allele, with the register predicted correctly by all algorithms. The other two (6lf8, 6lf9) have swine MHC alleles, with registers predicted wrong by all algorithms (Figure 2E). TFold also predicts incorrectly a non-canonical register for a structure (6lup) with a shark MHC allele, while netMHCpan fails for a pMHC with a bat allele (6j2h).

Despite sub-angstrom median accuracy, TFold produces models of unacceptable quality for a substantial fraction of pMHCs. Many of these can be filtered out by setting a threshold on the predicted error score (100-pLDDT) which correlates well with $C_{α}$ -pRMSD (Spearman’s 𝜌 equal to 0.55 on the discovery and 0.42 on the test data). We define four accuracy groups of models according to $C_{α}$ -pRMSD: sub-angstrom (<1Å), good (1–1.5Å), poor (1.5–2.5Å), and unacceptable (≥2.5Å). In Figure 2D, we plot fractions of pMHCs in these accuracy groups as a function of the score cutoff. This figure can guide the choice of a score cutoff in a modeling application according to the desired accuracy and fraction of pMHCs to be retained. For a low enough threshold, almost all modeled pMHC structures are sub-angstrom, while unacceptable models appear only at the highest values of the score. Separating the test set by median value of the score, only 10% of pMHCs in the top half are in the poor and unacceptable groups, compared to 39.2% in the bottom half, illustrating the score’s ability to enrich for high quality models. (For the discovery dataset, these fractions are even better, at 4.4% and 33.3%.)

Next we explore whether TFold improves the peptide backbone conformation over a baseline model which simply takes the backbone from one of the templates. (For this baseline model we use register predictions from netMHCpan-4.1. Further, we restrict to pMHCs for which at least one aligned template fully covers the peptide sequence.) A summary of TFold vs baseline performance is plotted in Figures 3D,E. Comparing to the template with the best sequence match, TFold achieves higher backbone accuracy for 73% of pMHCs on the discovery dataset (median $C_{α}$ -pRMSD 0.70 Å for TFold vs 1.10 Å for the baseline) and for 65% of pMHCs on the test set (median $C_{α}$ -pRMSD 0.76 Å for TFold vs 1.13 Å for the baseline). However, an oracle that could identify the best RMSD template would perform comparably or slightly outperform TFold (median $C_{α}$ -pRMSD 0.71 Å on the discovery dataset, 0.63 Å on the test set), and TFold models selected by best RMSD are only slightly better (median $C_{α}$ -pRMSD 0.62 Å on the discovery dataset, 0.61 Å on the test set).

We further explore what features of a pMHC are predictive of model accuracy (Figures 3A–D). Peptide length is the strongest predictor (Figure 3A), with median $C_{α}$ -pRMSD increasing from 0.46 Å for 8-mers to 2.14 Å for peptides longer than eleven residues. This is as expected, since longer peptides have a longer flexible loop region bulging from the MHC binding groove (Figure S3A). (Notably, this middle region that is hard to model is the most important one for TCR recognition, Figure S3C.) Next, we observe that structures with HLA-B alleles are on average harder to model than HLA-A (p=0.03, two-sided t-test) or HLA-C,E,G alleles (Figure 3B). Still, about half of HLA-B pMHCs are modeled with sub-angstrom accuracy. We do not observe any pattern in allele representation among HLA-B pMHCs with high vs low accuracy models. The lower accuracy for HLA-B could possibly be explained by worse template matching. Indeed, HLA-B templates on average have higher MHC sequence mismatch than HLA-A templates (Figure 3G). However, HLA-C templates have even higher mismatch, but modeling results for such pMHCs are no worse than for HLA-A. Furthermore, across all loci, MHC mismatch of the template is not a good predictor of modeling accuracy (Figure 3C). The last feature that we consider is peptide-MHC binding affinity. One may surmise that peptides that do not bind strongly have a less stable conformation and therefore are harder to model. A plot of netMHCpan-predicted binding affinity versus model accuracy demonstrates a weak relationship (Figure 3D). The set of peptide-MHC complexes that are selected for crystallization is enriched for good binders, and therefore a substantial fraction of such pMHCs for which netMHCpan predicts poor may be false negatives. It is therefore possible that the relation between binding affinity and modeling accuracy would be more pronounced if it were plotted for the experimentally-measured $K_{d}$ . Also, only a few pMHCs in our data are predicted to be poor binders.

TFold significantly outperforms PANDORA, a MODELLER-based pipeline.

We compare TFold to PANDORA⁶, a state-of-the-art automatic pipeline for modeling pMHC structures. PANDORA pipeline is based on MODELLER³ and comes with a structure database. Given a pMHC sequence, it finds a suitable template and performs anchor-restrained structure refinement. It uses netMHCpan to identify peptide anchor residues.

We compare the algorithms on the set of human and mouse structures from our class I test set. For fair benchmarking, we removed from the PANDORA template database structures deposited after the AlphaFold training date. We remind that the same date cutoff was used to define our discovery dataset, which is the set of templates that TFold is allowed to use in this benchmark.

Peptide $C_{α}$ -RMSDs for models produced by the two algorithms are plotted in Figure 2F. TFold outperforms PANDORA on 79% of pMHCs (p-value < 10⁻⁷, Wilcoxon signed-rank test). TFold achieves $C_{α}$ -pRMSD of 0.77 Å, compared to 1.48 Å for PANDORA. (The original paper⁶ reports median backbone pRMSD of 0.70 Å. This is likely due to the large number of close template-target matches, see Figure S3C of that reference.) Thus, our pipeline demonstrates a large and significant improvement in modeling accuracy.

PANDORA uses MODELLER’s score function molpdf to rank models. In Figure 2G, we compare its ability to discriminate structures to AlphaFold’s pLDDT. The Spearman’s correlation coefficient of molpdf with $C_{α}$ -pRMSD is 0.07, compared to 0.36 for the TFold score. Separating by the median molpdf score allows to filter out one out of three unacceptable models. For TFold, setting the threshold at median score filters out one out of one unacceptable models. Among PANDORA models with molpdf below the median value (highest predicted accuracy), 50% of the models are poor or unacceptable, which is more than 43.9% in the other half (lowest predicted accuracy). For TFold, these fractions are 12.5% and 31.7%, respectively. We conclude that TFold pipeline outperforms PANDORA in predicting model quality.

Difficult pairs of class I pMHCs.

Two pMHCs that are similar in sequence are typically close in structure. There exist exceptions to this rule — pMHCs where the change of one or a few residues substantially alters the peptide conformation. Such pMHCs may be of interest in the context of cancer. Indeed, if a single amino-acid mutant pMHC has peptide conformation significantly different from the corresponding wild type, it may be a good candidate for an immunogenic neoantigen²². Predicting effects of a minute sequence change is a challenging benchmark for a modeling algorithm.

In the class I discovery dataset, we identified sixteen pairs of pMHCs in which the two sequences belong to the same sequence cluster but the two structures have $C_{α}$ -pRMSD greater than 1.5 Å. (At most one pMHC in each pair was part of the non-redundant discovery dataset discussed in previous sections.) In some of these pairs, the pMHCs differ only in the peptide (1–3 substitutions), and in others, they have different but similar MHC alleles. We modeled this dataset withTFold, with the number of models per register increased from 5 (default) to 10 due to expected difficulty of the task.

Median $C_{α}$ -pRMSD on this dataset was 1.37 Å, which is quite a bit higher than 0.77 Å on the test dataset (Figure 3E). It is plausible that when a small sequence change can substantially disturb the structure, the conformation is not stable, making modeling harder. (Correcting for the different distribution of peptide lengths using per length RMSD from Figure 3A does not explain the difference.) Notably, the pLDDT score can still predict model quality well (Spearman’s $ρ = 0.70$ , Figure 3F). Let us define high predicted accuracy as error score below 6.8, the median score on the test set (Figure 2D). Restricting to models of high predicted accuracy, median $C_{α}$ -pRMSD becomes 0.84 Å, which is not far from 0.64 Å for high predicted accuracy models in the test set.

For a basic test of TFold’s ability to account for small changes in the sequence, for each model, we can compare its $C_{α}$ -pRMSD to the true structure (true RMSD) vs to the other structure in the pair (cross RMSD), see Figure 2H. For 59% of the models (19 of 32), true RMSD is smaller than cross RMSD, and if we restrict to models of high predicted accuracy, this number increases to 77% (10 of 13).

For a more detailed look into the modeling results, in Figure 2I, for each pMHC pair we plot $C_{α}$ -pRMSD between the true structures, between the models, and between the models and their respective true structures. Ignoring models with low predicted accuracy, two approximate patterns can be discerned in the data. For some pMHCs, the models are close to each other, i.e. TFold fails to account for the sequence difference. Both models can be close to one of the true structures, or they may interpolate somewhere in the middle. For other pMHCs, the models are far from each other and close to their respective true structures. These are modeling successes. For a closer look at specific examples, see Figure S4 and STAR Methods.

Modeling class II pMHC structures.

We modeled non-redundant class II structures from the discovery and test datasets. These structures represent all class II MHC loci for human and mouse, although notably almost half of the discovery dataset is HLA-DR, and the test dataset contains only eleven structures (Figure 4A).

For the task of identifying the peptide binding register, we compared TFold to netMHCIIpan versions 3.2 and 4.0. (See Figure 4C. It also includes data for our pre-filtering neural net seqnn and its variant seqnn-f, which will be described below.) Of the two netMHCIIpan algorithms, version 3.2 performed the best, predicting incorrect register for 22.6% (14/62) of pMHCs on the discovery set and 27.3% (3/11) on the test set. (NetMHCIIpan errors are not due to a different definition of registers or shifted register patterns for different MHC alleles, see Figure S2D.) TFold makes a large improvement with 1.6% (1/62) and 9.1% (1/11) mistakes on the two datasets. For the discovery dataset, the difference with netMHCIIpan 3.2 is statistically significant (p-value 5e-4, Fisher’s exact test).

TFold register error rate on the test set is higher than on the discovery set. This could be just a fluctuation because the difference is not statistically significant (p-value 0.28, Fisher’s exact test). But there is another possible explanation. For all algorithms, register prediction for the HLA-DQ locus is harder (Figure 3H). The fraction of HLA-DQ pMHCs is higher in the test set than in the discovery set (Figure 4A), and this is enough to explain the difference: TFold error rates for DQ and non-DQ structures separately are consistent between the two datasets (Figure 3H).

There may be several reasons why structure prediction for HLA-DQ is harder. The HLA-DQ dimers are more diverse than HLA-DR due to the polymorphic $α$ -chain. There are also less HLA-DQ than HLA-DR structures available for use as templates. In combination, these factors lead to worse template matching (Figure 3I). It has also been observed²³ in mass spectrometry data for naturally presented peptides that binding motifs for HLA-DQ alleles are hard to discern in the absence of the peptide exchange chaperon HLA-DM. If HLA-DQ proteins on their own have lower specificity, that would explain why identifying cores for HLA-DQ complexes is harder for a modeling algorithm that is not aware of HLA-DM. (This argument may also apply to the cores captured in experimentally determined structures, which we use as ground truth. Indeed, in crystallographic experiments peptides usually are not loaded via the native antigen presentation machinery.) We further note that TFold’s ability to predict class II binding cores is markedly reduced in the absence of paired peptide-MHC MSA input: the number of register errors increases from two to seven, six of which are in HLA-DQ examples. Thus, adding sequence information in the form of paired MSA is important for HLA-DQ register prediction.

The error score (100-pLDDT) is predictive of model accuracy (Figure 4D). The median score for the discovery dataset is 3.0, and the model with incorrect register appears at score 5.4 (ranked 61/62 by the score). In the test dataset, the model with register error appears at score 4.8 (ranked 8/11 by the score). However, only one out of nine HLA-DQ models in the discovery set and zero out of six HLA-DQ models in the test have score below 3.0, indicating that TFold has low confidence for HLA-DQ structures.

Alpha-carbon peptide core RMSD ( $C_{α}$ -cRMSD) is plotted in Figure 4B. Median $C_{α}$ -cRMSD is 0.46 Å for both the discovery and the test set pMHCs. Median all-atom cRMSDs are 1.18 Å and 1.07 Å. Among pMHCs for which the register is predicted correctly, 93% in the discovery set and 100% in the test set are modeled with sub-angstrom accuracy ( $C_{α}$ -cRMSD), highlighting the fact that flat peptide geometry of class II complexes (Figure S3B) is easy to model, once the register is identified.

In the description of the class II discovery dataset, we mentioned an HLA-DRB1*01:01 pMHC (PDB ID: 4gbx) that has peptide register (−1,2). This structure does not conform to the rules of Figure 1B, because one residue is missing at the N-terminus. Because of that, the register for 4gbx is predicted incorrectly by all sequence-based algorithms, but notably, TFold does not make a mistake (Figure 4E). The correct model is produced from the templates with register (0,1), demonstrating that AlphaFold is not confined to the immediate neighborhood of the template and may be able to recover a correct structure even when the template is misleading.

Using TFold to identify registers at training improves performance of a sequence-based class II pMHC binding predictor.

The data used in training sequence-based tools such as netMHCIIpan does not have labeled registers. During training, the neural net has to simultaneously solve the tasks of learning the motifs and selecting the registers where these motifs are the most prominent. Notably, the number of registers to consider is large, e.g. seven for a typical peptide length of fifteen. Fairly high register error (above 20% for both versions of netMHCIIpan and for seqnn, Figure 4C) attests to the difficulty of the problem.

We sought to leverage TFold advantage in class II register identification towards improving sequence-based binding predictors. We modeled top binders ( $K_{d}$ up to 285nM) from the seqnn training set with TFold and selected pMHCs with models with high predicted accuracy (score below 1.3). This yielded a set of about 25000 pMHCs with predicted peptide register. We then trained a neural net seqnn-f, a version of seqnn, on the same data as before, but with the register fixed when a prediction is available. (The architecture and hyperparameters for seqnn and seqnn-f were optimized independently. The training process was two-step with additional register filtering. Please see Online Methods for the details.)

Seqnn-f outperformed seqnn but didn’t quite reach TFold accuracy in terms of register prediction, both on the discovery and test datasets (Figure 4C,E). To test its ability to predict the dissociation constant, we collected 472 pMHCs with measured $K_{d}$ , recently deposited in IEDB and not present in our or netMHCIIpan training set. On these data, seqnn-f demonstrated improvement over seqnn in terms of Spearman’s correlation between the predicted and measured $K_{d}$ values. Both seqnn and seqnn-f outperformed netMHCIIpan version 3.2 but not version 4.0 (Figure 4F).

Due to computational resource limitations, TFold modeling of pMHCs for the seqnn-f training set was performed without paired MSAs, which substantially decreases the accuracy of register prediction (register error rate 7/73 instead of 2/73 on the discovery plus test datasets). More importantly, netMHCIIpan version 4.0 leverages the massive dataset of naturally presented ligands¹⁴, extending its training set by almost a factor of ten compared to version 3.2 (counting positive examples), while for seqnn and seqnn-f, we only used the binding constant assays. Competitive performance of seqnn-f relative to netMHCIIpan-3.2 suggests that with elution data included and with MSAs used in TFold runs, tools similar to seqnn-f may improve over state-of-the-art methods in class II pMHC binding prediction.

Assessing TFold with new experimental structures of a class I MHC neoantigen/wild type pair.

To assess TFold’s performance with novel data, we determined new crystallographic structures of two class I pMHC complexes and compared these to the TFold predictions, demonstrating a potential use case for evaluating the structural properties of cancer neoantigens (Table S3, Figure S5). The neoantigen KLSHQLVLL, derived from the SNX24 gene, was identified in a melanoma patient and used to generate antigen-specific CD8+ T cells in a healthy donor²⁴. The KLSHQLVLL neoantigen incorporates a mutation of proline-to-leucine at position 6 and is restricted by HLA-A*02:01. The best-scoring TFold model of the pMHC complex showed a bulged nonameric structure, with the mutant leucine at position 6 extending away from the base of the binding groove rather than serving as a secondary anchor. Exposure of central hydrophobic side chains in neoantigens have been associated with immunogenicity^4,25, suggesting a mechanism for the activity of the KLSHQLVLL neoantigen.

The model of KLSHQLVLL bound to HLA-A2 was in excellent agreement with the 2.9 Å structure of the complex (PDB ID 8U9G), with a peptide $C_{α}$ RMSD of 0.6 Å after alignment of the HLA-A2 binding groove (Figure 5A). The mutant leucine was indeed pointed away from the binding groove, albeit at a more pronounced angle due to a slightly underpredicted peptide bulge. The side chains of K1, H4, and L8 were also correctly modeled as exposed (with rotamer variances as expected for exposed amino acids), and the side chains of L2 and V7 were correctly modeled as partially buried in the groove. For comparison, the PANDORA model was less accurate, with a $C_{α}$ RMSD of 1.0 Å, due primarily to more pronounced mis-modeling of the bulge at positions 5 and 6 (Figure 5B).

Figure 5. — Structures and models of the KLSHQLVLL neoantigen and wild-type peptides bound to HLA-A2.

(A) Comparison of the KLSHQLVLL neoantigen peptide/HLA-A2 structure with the TFold model, colored as indicated. The left image shows a structural overview, the right image shows a comparison of the peptide at the atomic level with the Cα RMSD indicated (replicated for all panels below). The TFold model was in excellent agreement with the structure.

(B) Comparison of the KLSHQLVLL neoantigen peptide/HLA-A2 structure with the PANDORA model. PANDORA performed less favorably than TFold, mismodeling the peptide’s central bulge as shown.

(C) Comparison of the KLSHQLVLL neoantigen peptide/HLA-A2 structure with the KLSHQPVLL wild-type peptide/HLA-A2 structure. The conformations of the two peptides are nearly identical.

(D) Comparison of the KLSHQPVLL wild-type peptide/HLA-A2 structure with the TFold model. While the path of the peptide was captured, TFold misplaced the orientation of the proline at position 6.

(E) Comparison of the KLSHQPVLL wild-type peptide/HLA-A2 structure with the PANDORA model. Notably, PANDORA made the same error as TFold with regard to the orientation of proline at position 6, although the overall prediction is slightly better.

We compared the KLSHQLVLL neoantigen structure to that of its wild-type counterpart, KLSHQPVLL, with a proline rather than leucine at position 6. The 2.1 Å structure of the wild-type pMHC structure (PDB ID 8TBW) was essentially identical to the neoantigen structure, with the two peptides exhibiting a $C_{α}$ RMSD of 0.6 Å (Figure 5C). Here the best-scoring TFold model performed more poorly, with a $C_{α}$ RMSD of 1.2 Å, owing to TFold’s placement of the proline at position 6 in a “down” conformation and possibly highlighting a challenge for central prolines in class I pMHC structure prediction (Figure 5D). Notably though, the error score for this model was 6.72, just below the threshold for high accuracy, illustrating this metric’s utility in assigning model confidence (by comparison, the error score was 4.84 for the better-modeled neoantigen). The PANDORA prediction for the wild-type peptide was only slightly better with a $C_{α}$ RMSD of 1.1 Å. Interestingly, PANDORA made the same error (Figure 5E), further suggesting a general difficulty in modeling central prolines in class I MHC presented peptides, potentially related to sparse template structures within the PDB.

Discussion

In this work, we developed TFold, an AlphaFold-based automated pipeline for modeling peptide-MHC structures. It can predict pMHC conformations for various class I and class II MHC loci from human, mouse, and a few other species for which templates are available (Table S1, Figures 2A, 3A). We analyzed peptide registers that occur in pMHC structures and incorporated this knowledge into the algorithm. The key elements of our pipeline are paired pMHC template assignment, paired multiple-sequence alignments derived from the binding data, and selecting subsets of peptide registers with a custom sequence-based neural net.

TFold shows competitive performance. For class I pMHC complexes, it achieves median peptide $C_{α}$ -RMSD of 0.77 Å, outperforming by a large margin a state-of-the-art MODELLER-based pipeline⁶ (Figure 2B,F), and demonstrated by successful application to a new experimental neoantigen pMHC structure (Figure 5A). We also demonstrated that the AlphaFold pLDDT score is a useful predictor of model quality (Figure 2D,G), further reinforced by application to a new experimental structure where an inaccurate model was reflected by a high error score (Figure 5D). We analyzed pairs of pMHCs that are similar in sequence but have divergent peptide geometry. Such pMHCs, when their peptides are derived from the human proteome and differ by a point mutation, may be interesting candidates for cancer neoantigens. Correctly translating a small change in sequence into an altered peptide conformation is a challenging task for a structure modeling algorithm. TFold is able to produce models that are closer to the true structure than to the structure of the other pMHC complex in the pair in 10 out of 13 cases, when restricted to pMHCs with high predicted accuracy. It is able to achieve some remarkable successes, such as catching the secondary anchor switch within the pair of very similar HLA-A*02:01-presented CMV pp65 epitope variants (Figure S4A). However, it also produces some fairly inaccurate models (Figure S4C), and the pLDDT score for them is not lower. There is room for improvement, and we suggest using this challenging benchmark in the evaluation of future pMHC structure modeling algorithms.

For class II pMHCs, TFold produces highly accurate models with median peptide core $C_{α}$ -RMSD of 0.46 Å (Figure 4B). (Our accuracy metric is focused on the peptide core residues P0-P9 because they make the most contacts with the MHC and T-cell receptors.) The main challenge in class II structure modeling is identifying the binding core, and in this task, TFold outperforms netMHCIIpan (Figure 4C). Moreover, setting a reasonable threshold for the predicted accuracy score allows to filter out register errors completely (Figure 4D). Predicting the binding core for peptides presented by MHC from the HLA-DQ locus is the most challenging both for TFold and for the sequence-based algorithms (Figure 3H). This can be related to worse template matching (Figure 3I) or to intrinsic properties of HLA-DQ proteins²³.

TFold ability to identify class II registers can be used to improve conventional sequence-based pMHC binding predictors by labeling registers in the training set. We demonstrated that this indeed improves our algorithm seqnn in terms of both register and dissociation constant prediction. (Figure 4C,F; the version of seqnn with TFold register filtering is labeled seqnn-f.) Notably, this way of leveraging the power of AlphaFold for binding prediction does not increase the runtime at inference. In training seqnn or seqnn-f in this paper, we did not use the elution data, which is critical for state-of-the-art prediction of binding¹⁴, as we recapitulate in Figure 4F. Training a neural net with elution data included, and with registers in the training set labeled by TFold is a promising avenue for the improvement of class II pMHC binding predictors.

We envision that accurate in silico modeling of pMHC structures will facilitate the development of structure-based peptide:MHC and pMHC:TCR binding predictors. While this paper was in preparation, a related work¹⁰ appeared that has substantial overlap with our work. It uses AlphaFold for accurate class I and class II pMHC structure prediction, making the crucial step of providing paired peptide-MHC templates. It does not use paired MSAs, which we observed to be helpful for accurate register identification in class II structures, but does AlphaFold fine-tuning on structural and binding data. It would be very interesting to see how AlphaFold fine-tuning would improve the accuracy of our pipeline. (Some direct evidence of such improvement in structure prediction has been demonstrated recently²⁶.) The focus of the publication¹⁰ is on binding prediction, while in the present paper, we focus on thoroughly evaluating the pMHC modeling abilities of AlphaFold, therefore we hope that our work will usefully complement these recent developments. In another exciting recent work²⁶, AlphaFold modeling is successfully applied to matching pMHCs to their cognate TCR repertoires, even for epitopes with no TCR training data.

Finally, we would like to highlight the difference between modeling class I and class II pMHC structures. In class I, apart from the rare cases (about 5%) of peptides with non-canonical binding registers, it is easy to position the primary anchor residues in the corresponding MHC pockets, and the challenge lies in modeling the peptide middle, including possible secondary anchors (Figure S3A). These middle residues are the most important for TCR recognition (Figure S3C), and even for 9-mer peptides, inaccurate models can substantially misrepresent the molecular features seen by a TCR (Figures S4B,C). For class II pMHCs, on the other hand, the peptide lies flat in the binding groove (Figure S3B) and is easy to model with consistent sub-angstrom accuracy (Figure 4B), once the binding register is identified and so long as we only focus on the peptide core. These observations may have implications for the accuracy of structure-based peptide:MHC and pMHC:TCR binding predictors.

STAR★Methods

Resource Availability

Lead contact

Further information and requests for resources should be directed to and will be fulfilled by the Lead Contact, Victor Mikhaylov (vmikhayl@ias.edu).

Materials availability

This study did not generate new unique reagents.

Data and code availability

Structural data for the neoantigen/WT pMHC pair have been submitted to the PDB with accession codes 8U9G and 8TBW.

All original code has been deposited at https://github.com/v-mikhaylov/tfold-release and is publicly available as of the date of publication. DOIs are listed in the key resources table.

Key resources table.

REAGENT or RESOURCE	SOURCE	IDENTIFIER
Deposited data
Structure of human Class I MHC HLA-A2 bound to sorting nexin 24 (127–135) neoantigen KLSHQLVLL	This paper	PDB: 8U9G
Structure of human Class I MHC HLA-A2 in complex with sorting nexin 24 (127–135) peptide KLSHQPVLL	This paper	PDB: 8TBW
Software and algorithms
TFold	This paper	https://github.com/v-mikhaylov/tfold-release, DOI:10.5281/zenodo.10073699

Open in a new tab

Any additional information required to reanalyze the data reported in this paper is available from the lead contact upon request.

Method Details

Preparing sequence data.

MHC protein sequences for species other than mouse were downloaded from the Immuno Polymorphism Database^27,28 on 2021–09-17. Mouse MHC sequences were manually curated from UniProt²⁹ and PDB¹⁵. Each sequence was aligned to an IMGT-numbered sequence²⁰ for the closest species and locus to establish IMGT numbering. For class I $G_{α} 2$ domains, the numbering was shifted by 1000 to distinguish from $G_{α} 1$ domains, e.g., a residue in the middle of an MHC alpha helix would be numbered 66 and 1066 for $α 1$ and $α 2$ helices, respectively. For consistency, we similarly shifted the IMGT numbering for class II G $β$ domains. TCR $α, β, γ, δ$ V- and J-gene sequences were downloaded from IMGT³⁰. TCR V-regions were numbered according to the IMGT system²⁷.

Preparing structures.

Using the AlphaFold template search pipeline, we searched the PDB database (downloaded on 2022–07-05) for entries containing an MHC chain. In brief, MSAs were constructed for representatives of class I and both chains of class II MHC by applying Jackhmmer³² on the Uniref90³³ database. The MSAs were then used to search the PDB with hmmsearch³⁴.

For each chain fragment in a PDB file, protein type (MHC class and chain, TCR V and J regions, $β 2 m$ ), locus and allele were identified with a BLAST search. (Since allele assignments were done automatically, they may differ from alleles reported in PDB for the structures.) MHC and TCR segments were then realigned to numbered sequences using the BioPython³⁵ tool pairwise2.align to assign IMGT numbering. MHC chains were truncated to G-domains.

Class II MHC chains were matched together by proximity of $β_{1}$ - strand residues 4–11 and 1004–1011. TCR chains were matched together by proximity of fragments 49–53 and 115–118. TCRs and pMHC were matched into complexes by proximity of CDR3 $β$ to the peptide core. Chains were renamed into A (TCR $α$ or $γ$ ), B (TCR $β$ or $δ$ ), M (MHC $α$ ), N (MHC class II $β$ ), and P (peptide; see the next paragraph for peptide identification method).

Chain fragments that were not mapped to MHC, TCR or $β 2 m$ were considered as potential MHC-bound peptides. For each MHC protein in a query structure, we superimposed the structure onto a reference structure (3mre for class I, 4×4w for class II) by MHC chains. For each residue in each candidate peptide fragment, we found the closest ( $C_{α}$ distance) residue in the reference structure peptide. Fragments with minimal residue-pair distance below 2 $Å$ were further considered. For them, we looked at consecutive residue pairs that mapped to residues (8,9) and either (1,2) or (2,3) in the reference structure peptide. If two such pairs were found, the corresponding fragment in the query structure was identified as a peptide bound to the MHC by which we superimposed. The same procedure allowed to identify peptide binding cores and therefore the registers.

This peptide identification procedure failed on 35 PDB structures that are listed in Table S2. We next comment on some of these exceptions. The reference structures to which the queries were superimposed have human MHC alleles, nevertheless, the procedure works surprisingly well even for MHCs from other species (Table S1), attesting to the evolutionary conservation of MHC geometry. However, Table S2 does contain one swine MHC structure. It also contains a human HLA-F structure, in which the peptide N-terminus is extended from the binding groove. It is known that HLA-F binds peptides differently than classic MHC class I proteins³⁶, and this geometry cannot be processed by our pipeline. Table S2 also contains an interesting example of a class II structure with the peptide bound in reverse. (There are two such structures, but in one of them the reverse orientation is enforced by a linker.)

In pMHC complexes made for crystallization, the peptide is sometimes connected to the MHC by a linker (1.3% of class I and 25% of class II PDB entries in our data). For such structures, we trimmed the corresponding peptide flanking region to zero in class I and to three residues beyond the binding core in class II complexes.

Contact counts for Figure S3 were computed as follows. Two atoms at distance $d$ were considered in contact if $d < 1.1 \cdot (r_{1} + r_{2})$ , where $r_{1}$ and $r_{2}$ are their van der Waals radii taken from³⁷. For a pair of residues, the contact count is the number of pairs of heavy atoms in contact.

Structure manipulation, including superimposing and RMSD computations, was done using BioPython.PDB³⁸.

Aggregating structure data and selecting pMHCs for benchmarks.

The PDB often contains multiple structures for the same pMHC. Therefore, we merged structure records with identical pMHC sequences and peptide registers into pMHC records. (Before merging, gaps in peptide sequence were imputed from SEQRES entries, and gaps in MHC from the MHC sequence database.) For each pMHC record, the structure with the minimal number of missing peptide residues, with no linker (if available), and with the best resolution was chosen as a representative.

pMHC records were clustered into “sequence clusters” by sums of edit distances between peptide core sequences and between MHC sequences, separately for discovery and test datasets, with hierarchical clustering tree cut at distance 4. A representative pMHC was chosen for each cluster by the same criteria as in the structure per pMHC choice above. These representatives constitute our non-redundant discovery and test datasets of Figure 1A.

Separately, pMHC records were clustered by peptide backbone geometry (“geometry clusters”). For that, structures were transformed to the same frame by superimposing them onto the reference structures (3mre for class I, 4×4w for class II) by MHC chains. Vectors of peptide $C_{α}$ coordinates were collected and brought to the same dimension by restricting to residues P1-P9 (class I) or P0-P10 (class II) and imputing from neighboring residues. These vectors were clustered by k-means, with k set to the number of sequence clusters, and centroids initialized at average $C_{α}$ positions in the sequence clusters. In assigning templates in the TFold pipeline, in order to ensure template diversity, no more than one representative from each geometry cluster is allowed among templates for each pMHC-register pair.

For modeling and benchmarking, we used all non-redundant pMHCs from the discovery and test datasets, subject to the following restrictions: no missing residues in the peptide (class I) or in peptide positions P0-P9 (class II); no non-canonical residues in the peptide; no pMHCs with unstable peptide, i.e. for which available structures demonstrate more than one register.

The set of difficult class I pMHCs was chosen as follows. All records from the discovery dataset (not only non-redundant representatives) were filtered by the same rules as in the previous paragraph. Then we grouped pMHCs by sequence cluster and peptide length, and retained only groups with representatives of more than one geometry cluster. In each group, the matrix of $C_{α}$ -cRMSD distances was computed and the pair of pMHCs with the largest distance was retained. Further, we dropped pairs with peptide sequence mismatch greater than three or $C_{α}$ -cRMSD less than 1.5 $Å$ . This resulted in the 16 pMHC pairs reported in the main text.

Preparing peptide:MHC binding data.

IEDB data for peptide:MHC affinity were downloaded on 2022–04-06. We only kept assays for peptidic antigens with no non-canonical amino-acids, presented by MHC alleles for which we have a sequence, with no mutations in MHC chains. Assay group “half maximal effective concentration (EC50)” was excluded. We further downloaded netMHCpan-4.1/IIpan-4.0 training data, which largely overlaps with the data from IEDB. For each unique pMHC, assays were merged by taking the geometric mean of the dissociation constants, and $K_{d}$ values were clipped to 1–50000 nM. Only pMHCs with peptides of length 8–15 (for class I) and 9–25 (for class II) were kept. IEDB pMHCs with the earliest assay deposited no earlier than 2020, and which do not appear in the netMHC training set, were set aside as a test set. The rest of the data, with the test set pMHCs excluded, were used as the training set. For training seqnn models, five training-validation splits of the training data were prepared.

Paired multiple sequence alignments.

We construct paired pMHC MSAs, separately for class I and class II, from pMHCs identified as good binders in IEDB data and in netMHCpan/IIpan training sets¹⁴. MHC sequences are aligned according to their IMGT numbering²⁰, and peptides according to the numbering induced by their registers.

For class I, registers were predicted using seqnn. We restricted to pMHCs with both measured and predicted dissociation constant below 100 nM, and further subsampled to no more than 100 peptides per MHC allele. This gave the final class I paired MSA of 8232 sequences.

For class II, register prediction with sequence-based neural nets is unreliable. Instead, we modeled the top 42413 binders ( $K_{d}$ up to 285 nM) with TFold with no MSA. The top 10000 pMHCs by predicted accuracy were aligned for the MSA.

Details on the modeling pipeline.

For single chain input, we join peptide and MHC sequences into a single string but introduce a 200-residue gap in the internal AlphaFold sequence numbering¹⁹.

For template alignment, contact analysis, and other purposes, we introduce a peptide numbering system that assigns indices P1 and P9 to the N- and C-termini of the binding core. (If the register has $n_{t} = - 1$ then residue P1 is missing and indexing starts with P2.) This numbering system is illustrated in Figure S1A. For class I structures with binding core longer than nine, we use single digit insertion codes after residue P5, and for shorter cores we omit residue P6, if needed. Index P0 with insertion codes is used for residues in the N-terminal flanking region.

Given a pMHC to be modeled, we consider all possible peptide registers subject to the selection rules and the pre-filtering process. A register choice defines a peptide numbering. In template assignment, this numbering is used to align the peptides, allowing when necessary to use templates with non-matching peptide length. MHC sequences are aligned according to the IMGT numbering²⁰. Templates are then sorted by total pMHC sequence mismatch. AlphaFold takes four templates per run, and in default settings, we create five models per register, thus using the top 20 templates.

For register pre-filtering, we considered using netMHCpan/IIpan¹⁴ and only creating models for the single register that it predicts. However, netMHCpan/IIpan register prediction is not perfect, especially for class II, therefore, we created our own sequence-based neural net seqnn for pMHC binding prediction. It was trained on IEDB²¹ data and netMHCpan/IIpan training data, separately for class I and class II, as described below. This algorithm gives access to $K_{d}$ predictions for each register separately. For modeling, we retain all registers that have predicted $K_{d}$ within a chosen threshold of the best predicted $K_{d}$ for a given pMHC. The threshold was set to $10 \cdot K_{d}^{b e s t}$ for class I and $100 \cdot K_{d}^{b e s t}$ for class II. This cuts the average number of registers from 4.2 to 1.2 for class I, and from 6.9 to 4.2 for class II (data for the discovery dataset). On an NVIDIA A100 GPU, creating a single AF model takes about 30 seconds, and therefore, at five models per register, the average runtime is about 3 min and 10 min per class I and class II pMHC, respectively. For class II, the number of retained registers, and hence the runtime, may be higher for poor binders.

AlphaFold.

We used AlphaFold version 2.1.0 with parameter set “model_1”.

Training seqnn and seqnn-f.

Affinity-predicting neural nets seqnn were trained separately for class I and class II pMHC data. Their architecture is shown in Figures S1B, S2A. For each peptide register, one-hot encoded amino-acids are placed into input vectors according to the residue position number induced by the register choice. For class I, one residue beyond the binding core is included on each side. For class II, the 9mer binding core is provided to the network, as well as 3-neuron encodings of lengths of both flanking regions. The MHC is provided as a pseudo-sequence of 26 (class I) or 30 (class II) residues from positions with maximal peptide contact numbers in the discovery dataset. The input layer is followed by a number of fully-connected layers interspersed with batch normalization. The output neuron predicts logarithm of the dissociation constant. Minimal pooling over $K_{d}$ for different peptide registers is used to select a single value during training, but the network provides access to $K_{d}$ predictions for all registers on inference. Model architecture and hyperparameters were selected using the validation pMHC data and the discovery dataset of structures.

For class I, 40 models for each of the five train-validation splits were trained for 15 epochs. In the first 10 epochs, the canonical register was imposed for a randomly chosen half of 9mer peptides, to force a choice of the input neurons used for the binding core residues. We observed that the trained models clearly fall into two clusters with low/high register error on the discovery dataset, irrespectively of their $K_{d}$ prediction accuracy (Figure S1C). This indicates that some models learn a shifted pattern of anchors, for some or all MHC alleles. Therefore, we only retained models from the cluster with low register error by imposing a threshold of 23 errors on the discovery dataset. Geometric mean of the $K_{d}$ predictions of the resulting ensemble of 135 models is the output of seqnn for class I. The resulting predictor has slightly lower accuracy for (Figures S1D,E) and similar accuracy for register prediction (Figure 2C), compared to $K_{d}$ netMHCpan-4.1.

For class II, we first trained 40 models for 25 epochs for each of the five train-validation splits (200 models in total). Averaged register predictions of these models were used to label registers in the training set, and another 150 models were trained on these labeled data. Geometric average of predictions of these 150 models is the seqnn output. The resulting predictor performs similarly to netMHCIIpan-3.2 for $K_{d}$ prediction (Figures 3F, S2B) and similarly to netMHCIIpan-4.0 for register prediction (Figure 4C).

Our pipeline uses seqnn to pre-filter peptide registers before modeling. For each pMHC, we keep registers with predicted $K_{d}$ within a certain factor of the lowest predicted $K_{d}$ for that pMHC. If this factor is set too high, the filter would not eliminate any registers, and if it is set too low, true registers will be eliminated too often. This tradeoff is illustrated in Figures S1F and S2C for pMHCs from the class I and class II discovery datasets. We choose the thresholds at x10 and x100 for class I and class II predictors, respectively.

The network seqnn-f was trained on the same data as seqnn, but with registers partially labeled by TFold. We utilized models from the TFold run used to build the MSA (hence no MSA was used in the run), as described above. Therefore, models were available for 42413 pMHCs. The optimal neural network architecture was found to be similar to seqnn, but with more hidden neurons (four or five hidden layers with 512 neurons), and with dropout regularization (dropout fraction 0.6). Like for seqnn, the training procedure was two-step. In the first step, TFold models for pMHCs with predicted error score (100-pLDDT) below 1.3 were used to label registers on the training set, and 150 models were trained. In the second step, for pMHCs with no model of accuracy below 1.3, we labeled registers using the models from the first step ( $K_{d}$ threshold set to x10). In this second step, 150 models were trained. The geometric mean of their output is the prediction of seqnn-f.

Analysis of predicted structures for difficult pMHC pairs (Figure S4).

Consider a pair of variant CMV pp65 epitopes NLVPMGATV and NLVPMVAAV, presented by HLA-A*02:01 (Figure S4A). For NLVPMVAAV, residue V6 is a secondary anchor, while the side chain of M5 is facing up and is available for TCR recognition. For the other variant NLVPMGATV, G6 has no side chain and instead M5 becomes a secondary anchor, turning away from the TCR. TFold correctly recognizes this conformational change, producing models that are far from each other and close to their corresponding native structures. Another example of a modeling success is a pair of HLA-B*53:01-presented HIV-1 Gag-Pol epitope variants QASQEVKNW and QATQEVKNW (Figure S4B). A single mutation adding a methyl group to the side chain in position three leads to a 1.92 Å change in the backbone and turns residue K7 towards the TCR. This is correctly reproduced by TFold models. However, for the same pair of peptides presented by HLA-B*57:01, the two TFold models generally follow the backbone of QASQEVKNW, failing to account for the sequence difference (Figure S4C). They both also have incorrect orientation of the K7 side chain. Similarly, for the pair of HCV NS3 epitope variants CINMWCWTV and CISGVCWTV presented by HLA-A*02:01, the two TFold models have similar backbones that are close to the native conformation for the second peptide, not being able to reproduce the difference in native structures. All models discussed in this paragraph have predicted error score below 6.8 (median score of the test set), and the score cannot distinguish the successes from the failures.

Protein crystallization, data collection, and structure determination.

Complexes of the KLSHQLVLL and KLSHQLVLL peptides with HLA-A2 were produced by refolding from bacterially (E. coli) expressed inclusion bodies, followed by purification via size exclusion chromatography as previously described³⁹. Peptides were obtained from GenScript at >90% purity. Crystallization was performed using a Mosquito robot via hanging drop/vapor diffusion. For the KLSHQLVLL/HLA-A2 complex, protein was concentrated to 7.4 mg/mL and crystals grown at 4 °C in 12.5% polyethylene glycol 4000 and 0.1 M MES (pH 6.5). Crystals were harvested in a cryoprotectant solution of mother liquor supplemented with 8% glycerol and flash frozen in liquid nitrogen. For the KLSHQLVLL/HLA-2 complex, protein was concentrated to 5.25 mg/mL and crystals grown at 4 °C in 20% polyethylene glycol 3350 and 0.2 M potassium nitrate. Crystals were harvested in a cryoprotectant solution of mother liquor supplemented with 12 % glycerol and flash frozen in liquid nitrogen. X-ray diffraction data was collected at beamline 24-ID-E of the Advanced Photon Source at Argonne National Laboratory. Indexing and scaling was carried out in DIALS⁴⁰. Molecular replacement and automated refinement were performed in Phenix⁴¹; coordinates of PDB 3PWL with the peptide removed were used as a search model for molecular replacement⁴². Manual refinement was performed in Coot⁴³. As both structures had two copies of the molecules in their asymmetric units, all comparisons and RMSD calculations were performed with both molecules and averaged. As the two copies are nearly identical in both structures, this had a negligible impact on the results.

Quantification and Statistical Analysis

P-values were computed by Fisher’s exact test, and 95% confidence intervals for proportions were computed using the Agresti-Coull estimate.

Supplementary Material

NIHMS1954172-supplement-1.pdf^{(17.8MB, pdf)}

Highlights.

TFold is an AlphaFold-based pipeline for peptide-MHC structure modeling
TFold outperforms state-of-the-art in accuracy and register prediction
Performance was validated on a neoantigen/wild-type peptide pair
Improves fast sequence-based register prediction in class II pMHCs

Acknowledgements

We are grateful to Daniel Mattox, Damon May, Matthew Noakes, Ravi Pandya, and Jeremy Shaver for helpful discussions, to Dario Marzella for help with setting up PANDORA, and Jiaqi Ma for advice on structure refinement. This work was supported by NIH-NCI grant 5PO1CA087497–20 and NIH-NIGMS grant 3R35GM118166–08. X-ray diffraction data were collected at the Northeastern Collaborative Access Team beamlines, which are funded by the National Institute of General Medical Sciences from the National Institutes of Health (grant P30GM124165). The Eiger 16M detector on the 24-ID-E beam line was funded by a NIH-ORIP HEI grant (S10OD021527). This research used resources of the Advanced Photon Source, a U.S. Department of Energy (DOE) Office of Science User Facility operated for the DOE Office of Science by Argonne National Laboratory under Contract No. DE-AC02–06CH11357.

Footnotes

Declaration of interests

Arnold Levine is a founder, director, shareholder and receives fees for these activities of PMV Pharma. He also is a consultant for Chugai Pharma and receives a fee for that position. Neither company works in the topic of this manuscript. Victor Mikhaylov is an employee and shareholder of BioNTech US Inc. All other authors declare no competing interests.

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

References

[1].Antunes D, Abella JR, Devaurs D, Rigo MM, Kavraki LE (2018). Structure-based methods for binding mode and binding affinity prediction for peptide-MHC complexes. Curr. Top. Med. Chem. 18(26), 2239–2255, 10.2174/1568026619666181224101744. [DOI] [PMC free article] [PubMed] [Google Scholar]
[2].Chaudhury S, Lyskov S, Gray JJ (2010). PyRosetta: a script-based interface for implementing molecular modeling algorithms using Rosetta. Bioinformatics 26, 689–91, 10.1093/bioinformatics/btq007. [DOI] [PMC free article] [PubMed] [Google Scholar]
[3].Webb B, Sali A (2016). Comparative protein structure modeling using MODELLER. Curr. Protoc. Bioinformatics 54, 5.6.1–5.6.37, 10.1002/cpbi.3. [DOI] [PMC free article] [PubMed] [Google Scholar]
[4].Riley TP, Keller GLJ, Smith AR, Davancaze LM, Arbuiso AG, Devlin JR, Baker BM (2019). Structure based prediction of neoantigen immunogenicity. Front. Immunol. 10, 2047, 10.3389/fimmu.2019.02047. [DOI] [PMC free article] [PubMed] [Google Scholar]
[5].Jensen KK, Rantos V, Jappe EC, Olsen TH, Jespersen MC, Lanzarotti E, Mahajan S, Peters B, Nielsen M, Marcatili P, et al. (2019). TCRpMHCmodels: structural modelling of TCR-pMHC class I complexes. Scientific Reports 9, 14530, 10.1038/s41598-019-50932-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
[6].Marzella DF, Parizi FM, van Tilborg D, Renaud N, Sybrandi D, Buzatu R, Rademaker DT, ‘t Hoen PAC, Xue LC (2022). PANDORA: a fast, anchor-restrained modelling protocol for peptide: MHC complexes. Front. Immunol. 13, 10.3389/fimmu.2022.878762. [DOI] [PMC free article] [PubMed] [Google Scholar]
[7].Baek M, Dimaio F, Anishchenko I, Dauparas J, Ovchinnikov S, Lee GR, Wang J, Cong Q, Kinch LN, Baker D, et al. (2021). Accurate prediction of protein structures and interactions using a three-track neural network. Science 373, 871–876, 10.1126/science.abj875. [DOI] [PMC free article] [PubMed] [Google Scholar]
[8].Jumper J, Evans R, Pritzel A, Green T, Figurnov M, Ronneberger O, Tunyasuvunakool K, Bates R, Žídek A, Hassabis D, et al. (2021). Highly accurate protein structure prediction with AlphaFold. Nature 596(7873), 583–589, 10.1038/s41586-021-03819-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
[9].Evans R, O’Neill M, Pritzel A, Antropova N, Senior A, Green T, Žídek A, Bates R, Jumper J, Hassabis D, et al. (2021). Protein complex prediction with AlphaFold-Multimer. Preprint at bioRxiv, 10.1101/2021.10.04.463034. [DOI] [Google Scholar]
[10].Motmaen A, Dauparas J, Baek M, Abedi MH, Baker D, Bradley P (2023). Peptide-binding specificity prediction using fine-tuned protein structure prediction networks. PNAS 120(9), 10.1073/pnas.2216697120. [DOI] [PMC free article] [PubMed] [Google Scholar]
[11].Delaunay AP, Fu Y, Bégué A, McHardy R, Djermani BA, Rooney M, Tovchigrechko A, Lang M, Beguir K, Şahin U, et al. (2022). Peptide-MHC structure prediction with mixed residue and atom graph neural network. Preprint at bioRxiv, 10.1101/2022.11.23.517618 [DOI] [Google Scholar]
[12].Aronson A, Hochner T, Cohen T, Schneidman-Duhovny D (2022). Structure modeling and specificity of peptide-MHC class I interactions using geometric deep learning. Preprint at bioRxiv, 10.1101/2022.12.15.520566. [DOI] [Google Scholar]
[13].Jensen KK, Andreatta M, Marcatili P, Buus S, Greenbaum JA, Yan Z, Sette A, Peters B, Nielsen M (2018). Improved methods for predicting peptide binding affinity to MHC class II molecules. Immunology 154(3), 394–406, 10.1111/imm.12889. [DOI] [PMC free article] [PubMed] [Google Scholar]
[14].Reynisson B, Alvarez B, Paul S, Peters B, Nielsen M (2020). NetMHCpan-4.1 and NetMHCIIpan-4.0: improved predictions of MHC antigen presentation by concurrent motif deconvolution and integration of MS MHC elited ligand data. Nucleic Acids Res. 48(W1), W449–W454, 10.1093/nar/gkaa379. [DOI] [PMC free article] [PubMed] [Google Scholar]
[15].Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE (2000). The Protein Data Bank. Nucleic Acids Res. 28(1), 235–42, 10.1093/nar/28.1.235. [DOI] [PMC free article] [PubMed] [Google Scholar]
[16].Klobuch S, Lim JJ, van Balen P, Kester MGD, de Klerk W, de Ru AH, Pothast CR, Jedema I, Falkenburg JHF, Heemskerk MHM, et al. (2022). Human T cells recognize HLA-DP–bound peptides in two orientations. PNAS 119 (49), 10.1073/pnas.2214331119. [DOI] [PMC free article] [PubMed] [Google Scholar]
[17].Racle J, Guillaume P, Schmidt J, Michaux J, Larabi A, Lau K, Perez MAS, Bassani-Sternberg M, Harari A, Gfeller D, et al. (2023). Machine learning predictions of MHC-II specificities reveal alternative binding mode of class II epitopes. Immunity 56(3), 1359–1375, 10.1016/j.immuni.2023.03.009. [DOI] [PubMed] [Google Scholar]
[18].Andreatta M, Jurtz VI, Kaever T, Sette A, Peters B, Nielsen M (2017). Machine learning reveals a non-canonical mode of peptide binding to MHC class II molecules. Immunology 152(2), 255–264, 10.1111/imm.12763. [DOI] [PMC free article] [PubMed] [Google Scholar]
[19].Bryant P, Pozzati G, Elofsson A (2022) Improved prediction of protein-protein interactions using AlphaFold2. Nat. Commun. 13, 1265, 10.1038/s41467-022-28865-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
[20].Lefranc M-P, Duprat E, Kaas Q, Tranne M, Thiriot A, Lefranc G (2005). IMGT unique numbering for MHC groove G-DOMAIN and MHC superfamily (MhcSF) G-LIKE-DOMAIN. Dev. Comp. Immunol. 29(11), 917–38, 10.1016/j.dci.2005.03.003. [DOI] [PubMed] [Google Scholar]
[21].Vita R, Mahajan S, Overton JA, Dhanda SK, Martini S, Cantrell JR, Wheeler DK, Sette A, Peters B (2018). The Immune Epitope Database (IEDB): 2018 update. Nucleic Acids Res. 47(D1):D339–D343, 10.1093/nar/gky1006. [DOI] [PMC free article] [PubMed] [Google Scholar]
[22].Devlin JR, Alonso JA, Ayres CM, Keller GLJ, Bobisse S, Vander Kooi CW, Coukos G, Gfeller D, Harari A, Baker BM (2020). Structural dissimilarity from self drives neoepitope escape from immune tolerance. Nat. Chem. Biol. 16 (11), 1269–1276, 10.1038/s41589-020-0610-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
[23].Abelin JG, Harjanto D, Malloy M, Suri P, Colson T, Goulding S, Creech A, Serrano LR, Nasir G, Rooney MS, et al. (2019). Defining HLA-II ligand processing and binding rules with mass spectrometry enhances cancer epitope prediction. Immunity 51(4), 766–779, 10.1016/j.immuni.2019.08.012. [DOI] [PubMed] [Google Scholar]
[24].Strønen E, Toebes M, Kelderman S, van Buuren MM, Yang W, van Rooij N, Donia M, Böschen M-L, Lund-Johansen F, Olweus J, Schumacher TN (2016). Targeting of cancer neoantigens with donor-derived T cell receptor repertoires. Science 352 (6291), 1337–1341, 10.1126/science.aaf2288. [DOI] [PubMed] [Google Scholar]
[25].Schmidt J, Smith AR, Magnin M, Racle J, Devlin JR, Bobisse S, Cesbron J, Bonnet V, Carmona SJ, Gfeller D (2021). Prediction of neo-epitope immunogenicity reveals TCR recognition determinants and provides insight into immunoediting. Cell Rep. Med. 2 (2), 100194, 10.1016/j.xcrm.2021.100194. [DOI] [PMC free article] [PubMed] [Google Scholar]
[26].Bradley P (2023). Structure-based prediction of T cell receptor:peptide-MHC interactions. eLife 12:e82813, 10.7554/eLife.82813 [DOI] [PMC free article] [PubMed] [Google Scholar]
[27].Robinson J, Halliwell JA, McWilliam H, Lopez R, Marsh SGE (2013). IPD - the Immuno Polymorphism Database. Nucleic Acids Res. 41, D1234–40, 10.1093/nar/gks1140. [DOI] [PMC free article] [PubMed] [Google Scholar]
[28].Barker DJ, Maccari G, Georgiou X, Cooper MA, Flicek P, Robinson J, Marsh SGE (2023). IPD-IMGT/HLA Database. Nucleic Acids Res. 51, D1053–60, 10.1093/nar/gkac1011. [DOI] [PMC free article] [PubMed] [Google Scholar]
[29].The UniProt Consortium. (2023). UniProt: the Universal Protein Knowledgebase in 2023. Nucleic Acids Res. 51, D523–D531, 10.1093/nar/gkac1052. [DOI] [PMC free article] [PubMed] [Google Scholar]
[30].Lefranc M-P (2011). IMGT, the International ImMunoGeneTics Information System. Cold Spring Harb Protoc. 6, 10.1101/pdb.top115. [DOI] [PubMed] [Google Scholar]
[31].Lefranc M-P, Pommié C, Ruiz M, Giudicelli V, Foulquier E, Truong L, Thouvenin-Contet V, Lefranc G (2003). IMGT unique numbering for immunoglobulin and T cell receptor variable domains and Ig superfamily V-like domains. Dev. Comp. Immunol. 27(1), 55–77, 10.1016/s0145-305x(02)00039-3. [DOI] [PubMed] [Google Scholar]
[32].Johnson LS, Eddy SR, Portugaly E (2010). Hidden Markov model speed heuristic and iterative HMM search procedure. BMC Bioinformatics, 11(1), 1–8, 10.1186/1471-2105-11-431. [DOI] [PMC free article] [PubMed] [Google Scholar]
[33].Suzek BE, Wang Y, Huang H, McGarvey PB, Wu CH, and UniProt Consortium. (2015). Uniref clusters: a comprehensive and scalable alternative for improving sequence similarity searches. Bioinformatics 31(6), 926–932, 10.1093/bioinformatics/btu739. [DOI] [PMC free article] [PubMed] [Google Scholar]
[34].Eddy SR (2011). Accelerated profile HMM searches. PLoS Comput. Biol, 7(10):e1002195, 10.1371/journal.pcbi.1002195. [DOI] [PMC free article] [PubMed] [Google Scholar]
[35].Cock PA, Antao T, Chang JT, Chapman BA, Cox CJ, Dalke A, Friedberg I, Hamelryck T, Kauff F, Wilczynski B and de Hoon MJL (2009). Biopython: freely available Python tools for computational molecular biology and bioinformatics. Bioinformatics, 25, 1422–1423, 10.1093/bioinformatics/btp163. [DOI] [PMC free article] [PubMed] [Google Scholar]
[36].Dulberger CL, McMurtney CP, Holzemer A, Neu KE, Liu V, Steinbach AM, Garcia-Beltran WF, Sulak M, Jabri B, Lynch VJ, Altfeld M, Hildebrand WH, Adams EJ (2017). Human leukocyte antigen F presents peptides and regulates immunity through interactions with NK cell receptors. Immunity 46, 1018–1029, 10.1016/j.immuni.2017.06.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
[37].Alvarez S (2013). A cartography of the van der Waals territories. Dalton Trans, 42, 8617–8636, 10.1039/C3DT50599E. [DOI] [PubMed] [Google Scholar]
[38].Hamerlyck T, Manderick B (2003). PDB file parser and structure class implemented in Python. Bioinformatics, 22, 2308–2310, 10.1093/bioinformatics/btg299. [DOI] [PubMed] [Google Scholar]
[39].Davis-Harrison RL, Armstrong KM, Baker BM (2005). Two different T cell receptors use different thermodynamic strategies to recognize the same peptide/MHC ligand. J. Mol. Biol. 346 (2), 533–550, 10.1016/j.jmb.2004.11.063. [DOI] [PubMed] [Google Scholar]
[40].Winter G, Waterman DG, Parkhurst JM, Brewster AS, Gildea RJ, Gerstel M, Fuentes-Montero L, Vollmar M, Michels-Clark T, Young ID, et al. (2018). DIALS: implementation and evaluation of a new integration package. Acta Crystallogr. D Struct. Biol. 74 (2), 85–97, 10.1107/S2059798317017235. [DOI] [PMC free article] [PubMed] [Google Scholar]
[41].Afonine PV, Grosse-Kunstleve RW, Echols N, Headd JJ, Moriarty NW, Mustyakimov M, Terwilliger TC, Urzhumtsev A, Zwart PH, Adams PD (2012). Towards automated crystallographic structure refinement with phenix.refine. Acta Crystallogr. D Biol. Crystallogr. 68 (4), 352–367, 10.1107/S0907444912001308. [DOI] [PMC free article] [PubMed] [Google Scholar]
[42].Borbulevych OY, Piepenbrink KH, Baker BM (2011). Conformational melding permits a conserved binding geometry in TCR recognition of foreign and self molecular mimics. J. Immunol. 186 (5), 2950–8, 10.4049/jimmunol.1003150. [DOI] [PMC free article] [PubMed] [Google Scholar]
[43].Emsley P, Lohkamp B, Scott WG, Cowtan K (2010). Features and development of Coot. Acta Crystallogr. D Biol. Crystallogr., 66 (4), 486–501, 10.1107/S0907444910007493. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

NIHMS1954172-supplement-1.pdf^{(17.8MB, pdf)}

Data Availability Statement

Structural data for the neoantigen/WT pMHC pair have been submitted to the PDB with accession codes 8U9G and 8TBW.

All original code has been deposited at https://github.com/v-mikhaylov/tfold-release and is publicly available as of the date of publication. DOIs are listed in the key resources table.

Key resources table.

REAGENT or RESOURCE	SOURCE	IDENTIFIER
Deposited data
Structure of human Class I MHC HLA-A2 bound to sorting nexin 24 (127–135) neoantigen KLSHQLVLL	This paper	PDB: 8U9G
Structure of human Class I MHC HLA-A2 in complex with sorting nexin 24 (127–135) peptide KLSHQPVLL	This paper	PDB: 8TBW
Software and algorithms
TFold	This paper	https://github.com/v-mikhaylov/tfold-release, DOI:10.5281/zenodo.10073699

Open in a new tab

Any additional information required to reanalyze the data reported in this paper is available from the lead contact upon request.

[R1] [1].Antunes D, Abella JR, Devaurs D, Rigo MM, Kavraki LE (2018). Structure-based methods for binding mode and binding affinity prediction for peptide-MHC complexes. Curr. Top. Med. Chem. 18(26), 2239–2255, 10.2174/1568026619666181224101744. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R2] [2].Chaudhury S, Lyskov S, Gray JJ (2010). PyRosetta: a script-based interface for implementing molecular modeling algorithms using Rosetta. Bioinformatics 26, 689–91, 10.1093/bioinformatics/btq007. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R3] [3].Webb B, Sali A (2016). Comparative protein structure modeling using MODELLER. Curr. Protoc. Bioinformatics 54, 5.6.1–5.6.37, 10.1002/cpbi.3. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R4] [4].Riley TP, Keller GLJ, Smith AR, Davancaze LM, Arbuiso AG, Devlin JR, Baker BM (2019). Structure based prediction of neoantigen immunogenicity. Front. Immunol. 10, 2047, 10.3389/fimmu.2019.02047. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R5] [5].Jensen KK, Rantos V, Jappe EC, Olsen TH, Jespersen MC, Lanzarotti E, Mahajan S, Peters B, Nielsen M, Marcatili P, et al. (2019). TCRpMHCmodels: structural modelling of TCR-pMHC class I complexes. Scientific Reports 9, 14530, 10.1038/s41598-019-50932-4. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R6] [6].Marzella DF, Parizi FM, van Tilborg D, Renaud N, Sybrandi D, Buzatu R, Rademaker DT, ‘t Hoen PAC, Xue LC (2022). PANDORA: a fast, anchor-restrained modelling protocol for peptide: MHC complexes. Front. Immunol. 13, 10.3389/fimmu.2022.878762. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R7] [7].Baek M, Dimaio F, Anishchenko I, Dauparas J, Ovchinnikov S, Lee GR, Wang J, Cong Q, Kinch LN, Baker D, et al. (2021). Accurate prediction of protein structures and interactions using a three-track neural network. Science 373, 871–876, 10.1126/science.abj875. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R8] [8].Jumper J, Evans R, Pritzel A, Green T, Figurnov M, Ronneberger O, Tunyasuvunakool K, Bates R, Žídek A, Hassabis D, et al. (2021). Highly accurate protein structure prediction with AlphaFold. Nature 596(7873), 583–589, 10.1038/s41586-021-03819-2. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R9] [9].Evans R, O’Neill M, Pritzel A, Antropova N, Senior A, Green T, Žídek A, Bates R, Jumper J, Hassabis D, et al. (2021). Protein complex prediction with AlphaFold-Multimer. Preprint at bioRxiv, 10.1101/2021.10.04.463034. [DOI] [Google Scholar]

[R10] [10].Motmaen A, Dauparas J, Baek M, Abedi MH, Baker D, Bradley P (2023). Peptide-binding specificity prediction using fine-tuned protein structure prediction networks. PNAS 120(9), 10.1073/pnas.2216697120. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R11] [11].Delaunay AP, Fu Y, Bégué A, McHardy R, Djermani BA, Rooney M, Tovchigrechko A, Lang M, Beguir K, Şahin U, et al. (2022). Peptide-MHC structure prediction with mixed residue and atom graph neural network. Preprint at bioRxiv, 10.1101/2022.11.23.517618 [DOI] [Google Scholar]

[R12] [12].Aronson A, Hochner T, Cohen T, Schneidman-Duhovny D (2022). Structure modeling and specificity of peptide-MHC class I interactions using geometric deep learning. Preprint at bioRxiv, 10.1101/2022.12.15.520566. [DOI] [Google Scholar]

[R13] [13].Jensen KK, Andreatta M, Marcatili P, Buus S, Greenbaum JA, Yan Z, Sette A, Peters B, Nielsen M (2018). Improved methods for predicting peptide binding affinity to MHC class II molecules. Immunology 154(3), 394–406, 10.1111/imm.12889. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R14] [14].Reynisson B, Alvarez B, Paul S, Peters B, Nielsen M (2020). NetMHCpan-4.1 and NetMHCIIpan-4.0: improved predictions of MHC antigen presentation by concurrent motif deconvolution and integration of MS MHC elited ligand data. Nucleic Acids Res. 48(W1), W449–W454, 10.1093/nar/gkaa379. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R15] [15].Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE (2000). The Protein Data Bank. Nucleic Acids Res. 28(1), 235–42, 10.1093/nar/28.1.235. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R16] [16].Klobuch S, Lim JJ, van Balen P, Kester MGD, de Klerk W, de Ru AH, Pothast CR, Jedema I, Falkenburg JHF, Heemskerk MHM, et al. (2022). Human T cells recognize HLA-DP–bound peptides in two orientations. PNAS 119 (49), 10.1073/pnas.2214331119. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R17] [17].Racle J, Guillaume P, Schmidt J, Michaux J, Larabi A, Lau K, Perez MAS, Bassani-Sternberg M, Harari A, Gfeller D, et al. (2023). Machine learning predictions of MHC-II specificities reveal alternative binding mode of class II epitopes. Immunity 56(3), 1359–1375, 10.1016/j.immuni.2023.03.009. [DOI] [PubMed] [Google Scholar]

[R18] [18].Andreatta M, Jurtz VI, Kaever T, Sette A, Peters B, Nielsen M (2017). Machine learning reveals a non-canonical mode of peptide binding to MHC class II molecules. Immunology 152(2), 255–264, 10.1111/imm.12763. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R19] [19].Bryant P, Pozzati G, Elofsson A (2022) Improved prediction of protein-protein interactions using AlphaFold2. Nat. Commun. 13, 1265, 10.1038/s41467-022-28865-w. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R20] [20].Lefranc M-P, Duprat E, Kaas Q, Tranne M, Thiriot A, Lefranc G (2005). IMGT unique numbering for MHC groove G-DOMAIN and MHC superfamily (MhcSF) G-LIKE-DOMAIN. Dev. Comp. Immunol. 29(11), 917–38, 10.1016/j.dci.2005.03.003. [DOI] [PubMed] [Google Scholar]

[R21] [21].Vita R, Mahajan S, Overton JA, Dhanda SK, Martini S, Cantrell JR, Wheeler DK, Sette A, Peters B (2018). The Immune Epitope Database (IEDB): 2018 update. Nucleic Acids Res. 47(D1):D339–D343, 10.1093/nar/gky1006. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R22] [22].Devlin JR, Alonso JA, Ayres CM, Keller GLJ, Bobisse S, Vander Kooi CW, Coukos G, Gfeller D, Harari A, Baker BM (2020). Structural dissimilarity from self drives neoepitope escape from immune tolerance. Nat. Chem. Biol. 16 (11), 1269–1276, 10.1038/s41589-020-0610-1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R23] [23].Abelin JG, Harjanto D, Malloy M, Suri P, Colson T, Goulding S, Creech A, Serrano LR, Nasir G, Rooney MS, et al. (2019). Defining HLA-II ligand processing and binding rules with mass spectrometry enhances cancer epitope prediction. Immunity 51(4), 766–779, 10.1016/j.immuni.2019.08.012. [DOI] [PubMed] [Google Scholar]

[R24] [24].Strønen E, Toebes M, Kelderman S, van Buuren MM, Yang W, van Rooij N, Donia M, Böschen M-L, Lund-Johansen F, Olweus J, Schumacher TN (2016). Targeting of cancer neoantigens with donor-derived T cell receptor repertoires. Science 352 (6291), 1337–1341, 10.1126/science.aaf2288. [DOI] [PubMed] [Google Scholar]

[R25] [25].Schmidt J, Smith AR, Magnin M, Racle J, Devlin JR, Bobisse S, Cesbron J, Bonnet V, Carmona SJ, Gfeller D (2021). Prediction of neo-epitope immunogenicity reveals TCR recognition determinants and provides insight into immunoediting. Cell Rep. Med. 2 (2), 100194, 10.1016/j.xcrm.2021.100194. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R26] [26].Bradley P (2023). Structure-based prediction of T cell receptor:peptide-MHC interactions. eLife 12:e82813, 10.7554/eLife.82813 [DOI] [PMC free article] [PubMed] [Google Scholar]

[R27] [27].Robinson J, Halliwell JA, McWilliam H, Lopez R, Marsh SGE (2013). IPD - the Immuno Polymorphism Database. Nucleic Acids Res. 41, D1234–40, 10.1093/nar/gks1140. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R28] [28].Barker DJ, Maccari G, Georgiou X, Cooper MA, Flicek P, Robinson J, Marsh SGE (2023). IPD-IMGT/HLA Database. Nucleic Acids Res. 51, D1053–60, 10.1093/nar/gkac1011. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R29] [29].The UniProt Consortium. (2023). UniProt: the Universal Protein Knowledgebase in 2023. Nucleic Acids Res. 51, D523–D531, 10.1093/nar/gkac1052. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R30] [30].Lefranc M-P (2011). IMGT, the International ImMunoGeneTics Information System. Cold Spring Harb Protoc. 6, 10.1101/pdb.top115. [DOI] [PubMed] [Google Scholar]

[R31] [31].Lefranc M-P, Pommié C, Ruiz M, Giudicelli V, Foulquier E, Truong L, Thouvenin-Contet V, Lefranc G (2003). IMGT unique numbering for immunoglobulin and T cell receptor variable domains and Ig superfamily V-like domains. Dev. Comp. Immunol. 27(1), 55–77, 10.1016/s0145-305x(02)00039-3. [DOI] [PubMed] [Google Scholar]

[R32] [32].Johnson LS, Eddy SR, Portugaly E (2010). Hidden Markov model speed heuristic and iterative HMM search procedure. BMC Bioinformatics, 11(1), 1–8, 10.1186/1471-2105-11-431. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R33] [33].Suzek BE, Wang Y, Huang H, McGarvey PB, Wu CH, and UniProt Consortium. (2015). Uniref clusters: a comprehensive and scalable alternative for improving sequence similarity searches. Bioinformatics 31(6), 926–932, 10.1093/bioinformatics/btu739. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R34] [34].Eddy SR (2011). Accelerated profile HMM searches. PLoS Comput. Biol, 7(10):e1002195, 10.1371/journal.pcbi.1002195. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R35] [35].Cock PA, Antao T, Chang JT, Chapman BA, Cox CJ, Dalke A, Friedberg I, Hamelryck T, Kauff F, Wilczynski B and de Hoon MJL (2009). Biopython: freely available Python tools for computational molecular biology and bioinformatics. Bioinformatics, 25, 1422–1423, 10.1093/bioinformatics/btp163. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R36] [36].Dulberger CL, McMurtney CP, Holzemer A, Neu KE, Liu V, Steinbach AM, Garcia-Beltran WF, Sulak M, Jabri B, Lynch VJ, Altfeld M, Hildebrand WH, Adams EJ (2017). Human leukocyte antigen F presents peptides and regulates immunity through interactions with NK cell receptors. Immunity 46, 1018–1029, 10.1016/j.immuni.2017.06.002. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R37] [37].Alvarez S (2013). A cartography of the van der Waals territories. Dalton Trans, 42, 8617–8636, 10.1039/C3DT50599E. [DOI] [PubMed] [Google Scholar]

[R38] [38].Hamerlyck T, Manderick B (2003). PDB file parser and structure class implemented in Python. Bioinformatics, 22, 2308–2310, 10.1093/bioinformatics/btg299. [DOI] [PubMed] [Google Scholar]

[R39] [39].Davis-Harrison RL, Armstrong KM, Baker BM (2005). Two different T cell receptors use different thermodynamic strategies to recognize the same peptide/MHC ligand. J. Mol. Biol. 346 (2), 533–550, 10.1016/j.jmb.2004.11.063. [DOI] [PubMed] [Google Scholar]

[R40] [40].Winter G, Waterman DG, Parkhurst JM, Brewster AS, Gildea RJ, Gerstel M, Fuentes-Montero L, Vollmar M, Michels-Clark T, Young ID, et al. (2018). DIALS: implementation and evaluation of a new integration package. Acta Crystallogr. D Struct. Biol. 74 (2), 85–97, 10.1107/S2059798317017235. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R41] [41].Afonine PV, Grosse-Kunstleve RW, Echols N, Headd JJ, Moriarty NW, Mustyakimov M, Terwilliger TC, Urzhumtsev A, Zwart PH, Adams PD (2012). Towards automated crystallographic structure refinement with phenix.refine. Acta Crystallogr. D Biol. Crystallogr. 68 (4), 352–367, 10.1107/S0907444912001308. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R42] [42].Borbulevych OY, Piepenbrink KH, Baker BM (2011). Conformational melding permits a conserved binding geometry in TCR recognition of foreign and self molecular mimics. J. Immunol. 186 (5), 2950–8, 10.4049/jimmunol.1003150. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R43] [43].Emsley P, Lohkamp B, Scott WG, Cowtan K (2010). Features and development of Coot. Acta Crystallogr. D Biol. Crystallogr., 66 (4), 486–501, 10.1107/S0907444910007493. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Accurate modeling of peptide-MHC structures with AlphaFold

Victor Mikhaylov

Chad A Brambley

Grant L J Keller

Alyssa G Arbuiso

Laura I Weiss

Brian M Baker

Arnold J Levine

SUMMARY

eTOC Blurb

Graphical Abstract

INTRODUCTION

Results

A dataset of pMHC structures: peptide registers and geometric features.

Figure 1.

TFold pipeline.

Modeling class I pMHC structures.

Figure 3.

Figure 2.

TFold significantly outperforms PANDORA, a MODELLER-based pipeline.

Difficult pairs of class I pMHCs.

Modeling class II pMHC structures.

Figure 4.

Using TFold to identify registers at training improves performance of a sequence-based class II pMHC binding predictor.

Assessing TFold with new experimental structures of a class I MHC neoantigen/wild type pair.

Figure 5.

Discussion

STAR★Methods

Resource Availability

Lead contact

Materials availability

Data and code availability

Key resources table.

Method Details

Preparing sequence data.

Preparing structures.

Aggregating structure data and selecting pMHCs for benchmarks.

Preparing peptide:MHC binding data.

Paired multiple sequence alignments.

Details on the modeling pipeline.

AlphaFold.

Training seqnn and seqnn-f.

Analysis of predicted structures for difficult pMHC pairs (Figure S4).

Protein crystallization, data collection, and structure determination.

Quantification and Statistical Analysis

Supplementary Material

Highlights.

Acknowledgements

Footnotes

References

Associated Data

Supplementary Materials

Data Availability Statement

Key resources table.

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases