Skip to main content
. 2021 Apr 1;118(14):e2023141118. doi: 10.1073/pnas.2023141118

Fig. 5.

Fig. 5.

Selection-based prediction of epitope specificity for TCR. TCRs are classified based on their reactivity to three pathogenic epitopes (columns), using three classification methods: TCRex, log-likelihood ratio (Eq. 5), and linear logistic regression (Eq. 6). (A–C) ROC curves and (D–F) precision-recall curves for (A and D) influenza epitope GILGFVFTL (N=3,107 TCR), (B and E) CMV epitope NLVPMVATV (N=4,812), and (C and F) SARS-CoV-2 epitope YLQPRTFLL (N=315) are shown. (G–I) Comparison between log-likelihood scores R(x) and logistic regression scores Rlog(x) for the three epitopes. Red points are TCRs that bind the specific epitope (positive set), and black points are TCRs from bulk sequencing (negative set). r is Pearson’s correlation. For all panels, we used pooled data from ref. 43 as the negative set. We used 10 times more negative data than positive data for training. Performance was quantified using fivefold cross-validation.