Abstract
Proteasomes are responsible for the production of the majority of cytotoxic T lymphocyte (CTL) epitopes. Hence, it is important to identify correctly which peptides will be generated by proteasomes from an unknown protein. However, the pool of proteasome cleavage data used in the prediction algorithms, whether from major histocompatibility complex (MHC) I ligand or in vitro digestion data, is not identical to in vivo proteasomal digestion products. Therefore, the accuracy and reliability of these models still need to be improved. In this paper, three types of proteasomal cleavage data, constitutive proteasome (cCP), immunoproteasome (iCP) in vitro cleavage, and MHC I ligand data, were used for training cleave-site predictive methods based on the kernel-function stabilized matrix method (KSMM). The predictive accuracies of the KSMM+pair coefficients were 75.0%, 72.3%, and 83.1% for cCP, iCP, and MHC I ligand data, respectively, which were comparable to the results from support vector machine (SVM). The three proteasomal cleavage methods were combined in turn with MHC I-peptide binding predictions to model MHC I-peptide processing and the presentation pathway. These integrations markedly improved MHC I peptide identification, increasing area under the receiver operator characteristics (ROC) curve (AUC) values from 0.82 to 0.91. The results suggested that both MHC I ligand and proteasomal in vitro degradation data can give an exact simulation of in vivo processed digestion. The information extracted from cCP and iCP in vitro cleavage data demonstrated that both cCP and iCP are selective in their usage of peptide bonds for cleavage.
Keywords: Cytotoxic T lymphocyte epitopes, Kernel function, Proteasome, Stabilized matrix method
1. Introduction
The proteasome plays an important role in the preservation of protein homeostasis as its proteolytic activity can efficiently clear unneeded foreign viral proteins or damaged intracellular self-proteins. The proteasome has also been associated with the supply of the majority of cytotoxic T lymphocyte (CTL) epitopes for antigen presentation. There are three classes of vertebrate proteasomes: constitutive proteasomes (cCPs), immunoproteasomes (iCPs), and thymoproteasomes (tCPs). cCPs are expressed in most tissues (e.g., hematopoietic cells, lymphocytes, and monocytes). iCPs are developed from cCPs: with the stimulation of interferon (IFN)-γ, the three catalytic subunits β 1, β 2, and β 5 of cCPs are replaced by their homologous counterparts β 1i, β 2i, and β 5i to form iCPs. tCPs are found in thymic epithelial cells and participate in T cell positive selection (Huber et al., 2012). Relative to cCPs, iCPs exhibit altered cleavage preferences, resulting in increased peptide supply for epitope presentation.
Several methods for computational recognition of proteasomal cleavage sites have been presented, e.g., PAProC (Nussbaum et al., 2001), support vector machines (SVMs) (Bhasin and Raghava, 2005), kinetic models (Holzhütter and Kloetzel, 2000), and artificial neural networks (Keşmir et al., 2002). There is also a method for predicting the production probability of an entire peptide using a linear score function (Ginodi et al., 2008). All of these methods are based on experimental proteasomal cleavage sites and statistical analysis of the flanking regions of such sites. Kernel-function stabilized matrix method (KSMM) used in this paper is a modified version of the stabilized matrix method (SMM) (Peters et al., 2003). SMM is a linear method which has been successfully applied to the prediction of MHC-peptide binding affinities, antigenic peptides presented by transporters associated with antigen processing (TAPs) and proteasomal cleavage sites (Peters et al., 2003; Peters and Sette, 2005; Nielsen et al., 2007). Its online software can be found at http://70.167.3.42/smm/. However, some studies found that the contributions of amino acids (AAs) to the predictive values (like binding affinities) are not linearly related (Jacob and Vert, 2008). This led to a lower prediction accuracy of SMM (Nielsen et al., 2007). Thus, it is reasonable to develop a nonlinear method KSMM by integrating kernel functions into SMM. In addition, the kernel functions are incorporated into pair-coefficients estimation between AAs in different peptide positions. The benchmark for cleavage data shows that KSMM is consistently better than SMM and that KSMM+pair coefficients are comparable to SVMs.
There are usually two types of data used in these models, proteasomal in vitro degradation data and MHC I ligand data. However, these two types of proteasomal digestion data do not necessarily bear enough resemblance to natural proteasomal cleavage. The proteasomal in vitro degradation data are generated by activated 20S proteasomes, but the bulk of cellular proteins (80%–90%) are degraded by 26S proteasomes (Sorokin et al., 2009). The 20S proteasome forms the catalytic core of the 26S proteasome, and generates a different pool of products from 26S proteasomes (Emmerich et al., 2000). As the C-termini of MHC I ligands are mostly cleaved by proteasomes (Heinemeyer et al., 2004), MHC I ligands are also used for studying the specificity of the proteasome. However, MHC I ligands represent only a subset of in vivo degradation products: in view of the length distribution of degradation products, possible CTL epitopes must be 8–15 AAs long, accounting for 15% of the cleavage products (Keşmir et al., 2003). As 1%–2% of them are MHC I epitopes (Kosmrlj et al., 2010), MHC I ligands account for at most 0.3% of the cleavage products. Therefore, the accuracy and reliability of these models still need to be improved (Diez-Rivero et al., 2010). In this study, the two types of data, from experimentally identified cCP and iCP digestion products and naturally processed MHC I ligands, both of which are mainly from viruses, were used for training and evaluating KSMM. These three cleavage models, singly or combined with peptide binding prediction, were tested for CTL epitope identification. cCP and iCP cleavage specificities were predicted computationally through reduction of the noise from the experimental data, and the profiles of the cCP and iCP interacting with their substrate were analyzed from our research results. This approach might lead to a better understanding of the processing of MHC I antigens and to the design of peptide vaccines with longer half-life.
2. Materials and methods
2.1. Data
2.1.1. cCP and iCP in vitro cleavage training data
Firstly, we trained the cCP predictive model. For the purpose of extracting comprehensive experimental information and reducing the risk of over-training the model with a limited sample size, proteasomal digestion products of both whole proteins and short peptides were included in the cCP and iCP training sets. We made the assumption that there was no difference in the proteasomal proteolysis degradation of whole proteins and peptides. We collected four fully quantified cCP in vitro digests of whole proteins (Emmerich et al., 2000; Toes et al., 2001; Lucchiari-Hartz et al., 2003; Tenzer et al., 2004): yeast enolase, bovine casein, prion protein, and Nef protein, as well as cCP in vitro digests of 32 peptides 10–56 AAs in length for training our cCP predictive model. Twenty-five of the 32 peptide digest datasets were extracted from cCP in vitro studies (Rivett, 1985; Dick L.R. et al., 1994; Leibovitz et al., 1994; Niedermann et al., 1995; 1996; 1997; Ossendorp et al., 1996; Shimbara et al., 1997; Dick T.P. et al., 1998; Alvarez et al., 2001; Cascio et al., 2001; Peters et al., 2002; Sun et al., 2002; Vigneron et al., 2004; Warren et al., 2006; Chapiro et al., 2006; Goldobin and Zaikin, 2009; Ma et al., 2011). The 7 remaining samples were downloaded from Tenzer-Suppl-Table1.xls at http://70.167.3.42/supplement/ (Jan. 11, 2006). The final cCP training set comprised 650 cleavage sites and 1 184 internal non-cleavage sites. Secondly, we trained iCP predictive models. The iCP degradation training data consisted of results of three whole proteins (Toes et al., 2001; Lucchiari-Hartz et al., 2003; Tenzer et al., 2004): yeast enolase, prion protein and Nef protein, and iCP in vitro digests of 27 peptides 18–56 AAs in length, including 21 peptide products from iCP in vitro digestion studies (Ehring et al., 1996; Beekman et al., 2000; Morel et al., 2000; Alvarez et al., 2001; Mommaas et al., 2002; Peters et al., 2002; Schultz et al., 2002; Sun et al., 2002; Chapiro et al., 2006; Ma et al., 2011) and 6 peptide products from Tenzer-Suppl-Table1.xls at http://70.167.3.42/supplement/ (Jan. 11, 2006). This gave a total of 537 internal cleavage sites and 961 internal non-cleavage sites.
2.1.2. CTL epitope data
The third method was trained with CTL epitope data. Proteasomes generate most MHC I peptides, but other proteases have also been found to generate some MHC I peptides (Cascio et al., 2001). The use of MHC I peptides for training proteasomal cleavage models is based on the hypothesis that MHC I peptides are all generated by proteasomes. A total of 6 277 naturally processed MHC I epitopes were extracted from the Antigen database (Blythe et al., 2002). Because the C-terminus of ligands presented by human leukocyte antigen (HLA)-A*03 is very unlikely to be generated by the proteasome (Tenzer et al., 2005), HLA-A3 restricted epitopes were excluded from the training set. The C-termini of these epitopes were assigned as cleavage samples, and the internal positions within epitopes were assigned as non-cleavage samples. Non-cleavage samples that showed strong cleavage characteristics (e.g., those duplicated as cleavage samples) were discarded. The final CTL epitope training data contained 6 872 non-cleavage and 1 362 cleavage sites. The training set was referred to as ‘Antigen-human’. In these cleavage training sets, peptides duplicated or included in the test set of Saxová et al. (2003) were discarded.
2.1.3. MHC I binding affinity data
Datasets for binding affinity predictive models consisted of a training set and a test set extracted from the MHCBN database (Bhasin et al., 2003). Three hundred and sixteen peptides with measured half maximal inhibitory concentration (IC50) constituted the training set, which is termed ‘MHCBN-train’. Binding affinity in practical calculation is defined as −lg 50 000 (IC50 in nmol/L) (Nielsen et al., 2007). Nine hundred and seventy-five peptides that had interacted with HLA-A*0201 constituted the test set. Every peptide could be mapped back to either a human protein or a protein from a human pathogen in the SwissProt database (Bairoch and Apweiler, 2000). Peptides that did not have an exact matched source protein or a duplicate in the proteasomal in vitro digestion data and the Antigen-human dataset were discarded. The resulting test set contained 754 peptides, including 621 binders and 133 non-binders. The test set was referred to as ‘MHCBN-test’.
2.2. Sequence encoding
A cleavage within the peptide was mapped back to its source protein to flank the cleavage site region (PL … P2 P1 | P1′ … P2′ PL′). The cleavage site is signified by ‘|’ and the N-terminus is on the left. Following the suggestions of Keşmir et al. (2003) and Tenzer et al. (2005), a window size of 8 was used for training the three cleavage models. Through the AA descriptors, peptides can be represented by numerical vectors. For the MHC I binding affinity models, we used 5z-scale descriptors (Sandberg et al., 1998); and for the cleavage prediction models, we used sparse binary descriptors (Nussbaum et al., 2001). Every peptide corresponds to an attribute value. An MHC I epitope has a binding affinity of [0, 1], and in vitro cleavage samples and CTL epitope cleavage samples have a cleavage value which is either 1 (positive sample) or −1 (negative sample). So every training set and test set can be denoted by , where is the peptide-sequence encoding vector, and yi is its attribute value.
2.3. Kernel-function stabilized matrix method
The key idea of SMM is to seek a vector w, which can reflect the corresponding relationship of vector xi and value yi. Given an input x, the model can predict the reasonable output , which best approximates the unknown y. The corresponding minimized energy function is
, | (1) |
where b is a constant offset, and λ is the penalty parameter controlling the trade-off between the margin maximization and the degree of misclassification. If the pair coefficient quantifies the effect of interactions between an AA a at position i and an AA a′ at position i′ on the binding or cleavage, then the minimized energy function for the combined SMM+pair coefficient algorithm (Peters et al., 2003) is
. | (2) |
We then set out to introduce KSMM by incorporation of kernel functions into SMM. For different problems, the most suitable kernel functions are different. We compared three kernel functions: linear, polynomial, and radial-based (Eqs. (3)–(5)). KSMM with a linear kernel is the same as SMM.
Linear kernel function:
. | (3) |
Polynomial kernel function:
. | (4) |
Radial-based kernel function:
. | (5) |
Thus, the minimized energy function (Eq. (1)) is rewritten as
, | (6) |
where c 1 is a constant that scales the predictive attribute range [−1, 1]. The optimal value for λ is determined by minimizing the distance defined by the first term in Eq. (6).
It is impossible to estimate all pairs of AAs in the samples with limited data. For 8-mer peptides, the number of different pair coefficients reaches 20×20×28=11 200. We follow the proposal of Peters et al. (2003) that defines N min as the minimum number of peptides in the training set that we can determine reliably, and discard the pair coefficients which have less than N min peptides. N min was set at 40, 12, and 12 for the Antigen-human, cCP, and iCP training data respectively, and pair coefficients 682, 340, and 212, respectively needed to be identified. The presence or absence of a chosen pair of AAs in the samples was represented by 1/0 in a sparse binary encoding vector . Using the optimal value for the vector w determined from Eq. (6), we drew the coefficient vector w′ of peptide position pairs from the systematic difference between the predicted values y pred and the measured values y meas. The optimal value for the pair coefficients vector w′ was searched by minimizing the energy function equation:
, | (7) |
where c 2 is a scale parameter. The optimal value for λ′ was determined by minimizing the distance defined by the first term in Eq. (7). The same kernel function was used for K 1 and K 2 in Eq. (7). Table 1 lists the ranges and best values (BVs) of different parameters in KSMM.
Table 1.
Function type | Kernel function parameter |
Scale parameter |
Penalty parameter |
|||||
Range | BV of K 1 | BV of K 2 | c 1 | c 2 | Range | BV of λ | BV of λ′ | |
Polynomial | 2–6 | 4.00 | 4.00 | 2.00 | 1.00 | 10−2−106 | 10.00 | 100.00 |
Radial-based | 10−4–104 | 2.00 | 2.00 | 4.00 | 0.80 | 10−2–106 | 10.00 | 10.00 |
Linear | 1.00 | 1.00 | 10−2–106 | 1.00 | 5.00 |
Monte Carlo (MC) method (Metropolis and Ulam, 1949) was implemented for optimizing components of vectors w and w′. Our algorithm has two stages: firstly, to effectively escape from local minima, the MC calculations are repeated 200 times with different initial configurations. For each run, 20 000 MC moves are carried out and the final energy and weight matrix are recorded. In each move, the components of the vector are assigned random values. Secondly, the average of the top 10 records in stage 1 is treated as the initial weight of the vector. In each MC move, two components of the vector are selected at random, and the weights on these two components are updated, keeping the sum of the weights unchanged.
The probability of accepting a move in our algorithm is
, | (8) |
where dE is the difference in energy between the end and start weights, and T is a scalar. Eq. (8) shows that moves that decrease E will always be accepted, since P=1 (dE>0). On the other hand, moves that increase E will be accepted with P=exp(dE/T) (dE<0). T is lowered during the calculation in order to reduce the probability of accepting unfavorable moves. Finally, the classification function of the prediction is
, | (9) |
where w * and are the weight vectors of the optimal plane in feature space, and b * is the threshold value of classification.
2.4. Performance evaluation measures
When using in vitro digestion data and MHC binding data, it is easy to make a clear assignment of positive and negative samples. However, for Antigen-human, definite negative samples do not exist. Goldberg et al. (2002) demonstrated that internal positions within the epitopes are also cleaved. To obtain the most cleavage characteristics, we adopted Saxová’s schema which assumes that positions within an epitope are less likely to be cleaved than C-terminus. Under this requirement, classification using Antigen-human is as follows:
TP (true positive): PC>Td,
FN (false negative): PC<Td,
TN (true negative): for every internal position i within the epitope, Pi<PC or Pi<Td,
FP (false positive): at least one internal position i within the epitope, Pi>PC and Pi>Td,
where PC is the predictive cleavage score at the C-terminus, Pi is the predictive cleavage score at the position i, and Td is the threshold value for sorting predictive cleavage scores into cleavage and non-cleavage sites. The following measures are introduced, namely sensitivity (SE), specificity (SP), accuracy (AC), and Matthews correlation coefficient (MCC) (Saxová et al., 2003):
, | (10) |
, | (11) |
, | (12) |
. | (13) |
The receiver operator characteristics (ROC) curve is a 2D curve in which the true positive rate is plotted on the Y axis and the false positive rate is plotted on the X axis over a continuous range of cut-off values from high to low. The area under the ROC curve (AUC) reflects the ability of a model to distinguish a randomly chosen positive instance from a randomly chosen negative one (Swets, 1988). A random model has an AUC of about 0.5, but reasonable models should have AUC values higher than 0.7. In this study, ROC analysis was used to measure the ability of different models to identify the CTL epitopes. For a robust comparison of different methods, we generated 10 sets by randomly drawing n samples from the test set with a constant ratio of positives to negatives, where n is the size of MHCBN-test. We then performed ROC analyses 10 times. A method is considered significantly better than another if the distribution of its AUC values in a paired two-tailed t-test is significantly higher (P≤0.05).
3. Results
3.1. Comparison of KSMM and SVM in proteasomal cleavage benchmarking
We have developed a kernel-based prediction method called KSMM. A 5-fold cross-validation technique using three types of proteasome data was used to evaluate the performance of classifiers with different kernels. The results are summarized in Table 2. To identify the quality of KSMM, the SVM method was also implemented using the freely downloadable software LIBSVM (Chang and Lin, 2011). SVM is a popular machine learning tool. Due to its superiority in bioinformatics classification problems, SVM has been used in a wide range of protein-peptide interaction computational predictions (Bhasin and Raghava, 2005; Liu et al., 2009). Polynomial KSMM performed the best when cCP in vitro data were used (Table 2), achieving an accuracy of 75% and an MCC of 0.482. Using iCP digested data, radial-based SVM had the highest accuracy (72.4%) and MCC (0.461). For the MHC I ligand data, radial-based KSMM was able to recognize the cleavage sites and non-cleavage sites with >83% accuracy. Taken together, KSMM is a bioinformatics tool that is comparable to the SVM method. The incorporation of kernels into SMM method gave a significant improvement: the accuracy of KSMM with nonlinear kernels was 3% higher than that of SMM (linear kernel).
Table 2.
Classifier | cCP in vitro data |
iCP in vitro data |
MHC I ligand data |
|||||||||
SE | SP | AC | MCC | SE | SP | AC | MCC | SE | SP | AC | MCC | |
KSMM | ||||||||||||
Linear | 68.5 | 68.2 | 68.3 | 0.385 | 67.8 | 70.3 | 69.4 | 0.370 | 80.0 | 79.8 | 79.9 | 0.603 |
Radial-based | 74.5 | 74.3 | 74.3 | 0.471 | 72.9 | 72.0 | 72.3 | 0.436 | 82.9 | 83.2 | 83.1 | 0.661 |
Polynomial | 74.8 | 75.1 | 75.0 | 0.482 | 71.1 | 72.0 | 71.7 | 0.419 | 81.6 | 82.5 | 82.1 | 0.640 |
SVM | ||||||||||||
Radial-based | 82.1 | 69.7 | 74.0 | 0.476 | 79.5 | 68.3 | 72.4 | 0.461 | 83.2 | 81.7 | 82.5 | 0.649 |
Polynomial | 80.3 | 70.1 | 73.6 | 0.457 | 77.0 | 67.4 | 70.9 | 0.428 | 81.5 | 81.4 | 81.5 | 0.630 |
SE: sensitivity; SP: specificity; AC: accuracy; MCC: Matthews correlation coefficient
Using the MHC I ligand test set and the cCP in vitro test set of Saxová et al. (2003), we tested the quality of three proteasomal classifiers: polynomial KSMM trained with cCP in vitro data (KSMM-c), radial-based KSMM trained with iCP in vitro data (KSMM-i), and radial-based KSMM trained with MHC I ligands (KSMM-ld). The comparison shows that our classifiers gave the most satisfying performance using both in vitro and MHC I data (Table 3). The results of PRProC (Nussbaum et al., 2001), FragPredict (Holzhütter and Kloetzel, 2000), and NetChop (Keşmir et al., 2002) were given by Saxová et al. (2003). An unexpected result is that KSMM-i outperforms KSMM-c using the cCP digestion data.
Table 3.
Method | cCP in vitro |
MHC I ligand |
||||
SE | SP | MCC | SE | SP | MCC | |
PAProC | 45.6 | 30.0 | −0.25 | 46.4 | 64.7 | 0.11 |
FragPredict | 83.5 | 16.5 | 0.00 | 72.1 | 41.4 | 0.12 |
NetChop 1.0 | 39.8 | 16.3 | −0.14 | 34.4 | 91.4 | 0.31 |
NetChop 2.0 | 73.6 | 42.4 | 0.16 | 57.4 | 76.4 | 0.32 |
KSMM-c | 73.8 | 77.7 | 0.46 | 61.5 | 56.3 | 0.18 |
KSMM-i | 73.8 | 79.4 | 0.49 | 73.6 | 60.6 | 0.35 |
KSMM-ld | 68.9 | 61.7 | 0.35 | 75.8 | 72.7 | 0.49 |
SE: sensitivity; SP: specificity; MCC: Matthews correlation coefficient
3.2. Proteasomal models benchmarked on MHC I epitope prediction
Using the MHCBN-test described in Section 2, we measured the suitability of different proteasome models for MHC I ligand identification (Fig. 1). Before evaluating the combination of proteasome and MHC I binding predictions, we used the thresholds when the sensitivity of cleavage prediction was 0.90. The highest corresponding specificities (true negative rates) of KSMM-c, KSMM-i, and KSMM-ld were 51%, 46%, and 64%, respectively. A sensitivity of 0.90 could significantly diminish false positives at the cost of a slight increase in false negatives. We filtered the MHCBN test by discarding ligands that had a prediction lower than the threshold. Then, we made a binding affinity prediction based on the reduced test set and generated an ROC graph. Comparing proteasome models alone (Fig. 1), KSMM-ld trained with MHC I ligands performed the best, with an AUC value of 0.78. KSMM-i again clearly gave a better prediction than KSMM-c (0.72 vs. 0.66). The MHC binding affinity predictive method was developed by training radial-based KSMM with MHCBN-train, which achieved an AUC value of 0.82. The joint combination of MHC and the cleavage predictions significantly enhanced the AUC values (~0.90). When combined with MHC, the difference between KSMM-i and KSMM-c was statistically significant, whereas the difference between KSMM-c and KSMM-ld was not. The increased AUC values of integral predictions resulted from a decrease in false positives. Compared to MHC I affinity prediction alone, over the entire range of true positive rates in the ROC calculation, the reduction in the false positive rate was 50% when combining KSMM-c, and 55% when combining KSMM-ld. The integration with KSMM-i led to an even larger reduction in the false positive rate (71%). The basis of this reduction is that the proteasome digestion of source protein into MHC I ligands has some additional degree of selectivity.
3.3. Weight coefficients of the linear iCP and cCP models
Besides those AAs at P1/P1′, AAs nearby also have a great influence on the choice of proteasomal cleavage (Toes et al., 2001). This explains why cleavage happens for certain P1/P1′ compositions, while other identical P1/P1′ compositions are ignored. Each score of the optimal plane w * in Eq. (9) corresponds to a kind of AA in a sequence position, and the sum of these scores is the predicted usage of the cleavage site. Hence, the weight coefficient w * can be seen as the impact of each AA in a sequence position on the cleavage site usage. For the 8-residue window of cCP and iCP in vitro data, we calculated w * with a linear function in Eq. (6). The values of w * are shown in Fig. 2.
4. Discussion
The 20S proteasome is shaped like a hollow barrel containing 4 layers of rings, each composed of seven subunits. The 2 inner rings each have 7 β subunits. Among them, the β 1, β 2, and β 5 subunits are proteolytically active, and each has an NH2-terminus Thr1 playing a critical role in the hydrolysis of peptide bonds. Protein substrate is degraded not only by single active subunits, but also by cooperation between two active subunits (Wenzel et al., 1994). The protein substrate within the proteasome is fully unfolded (Sorokin et al., 2009). According to the ‘molecular ruler’ of Wenzel et al. (1994), the length of the degradation products is related to the distance between active sites. We computed the distances between the active sites of mouse 20S cCP and iCP (Table 4). Distances between active sites of iCP were a little longer (0.04–0.26 nm) than those between active sites of cCP. Toes et al. (2001) reported that cCP and iCP generated peptides with an average lengths of 7.4 and 8.6 AAs, respectively. The average length of iCP production is about 1 AA longer than that of cCP production. On the basis of the correspondence between the length of unfolded protein sequence and the number of AAs in the sequence, as proposed by Coux et al. (1996), a length of 0.26 nm of the cleavage product is close to 1 AA. Therefore, the different average fragment lengths generated by cCP and iCP production may relate to the gap between the active sites of cCP and iCP.
Table 4.
cCP PDB id 3UNE | Distance between active sites of mouse 20S cCP (nm) |
||||
(Thr1) β 1c | (Thr1) β 2c | (Thr1) β 5c | |||
β ring | (Thr1) β 1c | 2.75 | 6.11 | ||
(Thr1) β 2c | 2.75 | 6.17 | |||
(Thr1) β 5c | 6.11 | 6.17 | |||
β * ring | (Thr1) β * 1c | 2.77 | 4.87 | 5.66 | |
(Thr1) β * 2c | 4.87 | 6.37 | 3.93 | ||
(Thr1) β * 5c | 5.66 | 3.93 | 4.76 | ||
| |||||
| |||||
iCP PDB id 3UNH | Distance between active sites of mouse iCP (nm) |
||||
(Thr1) β 1i | (Thr1) β 2i | (Thr1) β 5i | |||
| |||||
β ring | (Thr1) β 1i | 2.79 | 6.26 | ||
(Thr1) β 2i | 2.79 | 6.33 | |||
(Thr1) β 5i | 6.26 | 6.33 | |||
β * ring | (Thr1) β * 1i | 3.03 | 5.05 | 5.86 | |
(Thr1) β * 2i | 5.05 | 6.54 | 4.09 | ||
(Thr1) β * 5i | 5.86 | 4.09 | 4.99 |
The KSMM+pair coefficient method achieved accuracies of 75.0%, 72.3%, and 83.1% using cCP, iCP, and MHC I ligand data, respectively (Table 2). These results are comparable to those of SVM. This demonstrates that the KSMM+pair coefficient method could serve as a protein-protein interaction predictive algorithm. Using Saxová’s test set as a benchmark, we found that the predictive performance of our optimal iCP and cCP and MHC C-terminus cleavage models was superior to that of the reference models (Holzhütter and Kloetzel, 2000; Nussbaum et al., 2001; Keşmir et al., 2002) (Table 3). Our cleavage models also helped improve HLA-A*0201 peptide identification when integrated with MHC I binding predictions (Fig. 1), i.e., the AUC values increased from 0.82 to 0.91. The improved AUC values indicate that the 8-residue, 4 residues on each side of the cleavage/non-cleavage site of immuno-, constitutive, or MHC I restricted C-terminus samples, could precisely reflect the information of cleavage fragments generated by 20S cCP and iCP. They also demonstrate that both MHC I ligands and proteasomal in vitro degradation bear a close resemblance to naturally processed digestion.
KSMM-i performed better than KSMM-c in cCP cleavage site identification (Table 2) and in single or combined HLA-A*0201 peptide identification (Fig. 1). This suggests that, compared with cCP, iCP is more specific in cleavage site selection and generates MHC I ligands more efficiently. As for KSMM-ld, it can be regarded as another type of cCP cleavage model. Most of the MHC I epitopes are processed and eluted in an environment without inducing the expression of iCPs. Differences in the performance of KSMM-c, KSMM-i, and KSMM-ld in combined HLA-A*0201 peptide identification were small (Fig. 1). This may result from the fact that the selectivity of the MHC I molecule may reduce the gap in MHC I ligand generation efficiency between iCP and cCP.
iCP and cCP in vitro digestion data could more accurately reflect the pure information of proteasomal cleavage fragments with respect to the C-terminus of MHC I restricted samples. This may be because MHC I restricted samples are likely to contain not only the information of proteasomal cleavage fragments, but also the information of peptides binding to MHC I and TAP transport (Liu et al., 2009). KSMM-ld is markedly better than KSMM-c/i in MHC I epitope identification (0.78 vs. 0.66 and 0.72, respectively; Fig. 1), but KSMM-ld is no better than KSMM-c or KSMM-i when combined with MHC-binding prediction. The MHC I and TAP binding information in the Antigen-human is responsible for the excellent performance of KSMM-ld in single MHC I epitope identification. In Fig. 2, the height of each AA symbol is proportional to its absolute w * value, which is the contribution of the AA in a sequence position to the usage of potential cleavage sites. The height of all the AA symbols stacked at each position along P4–P4′ is proportional to the sum of the corresponding absolute scores for the 20 possible AAs at the positions which are the contribution to possible cleavage sites. The AAs on adjacent positions of the cleavage site, i.e., P1, P3′ positions, have a distinct effect on the potential cleavage site, demonstrating that both cCP and iCP species are selective in the hydrolysis of peptide bonds.
For the purpose of interpreting the AA characteristics in cCP and iCP cleavage-site flanking positions (Fig. 2), the crystal structures of mouse 20S iCP and cCP (PDB id: 3UNE and 3UNH) were used to find AAs which make nonbonding contact with residue Thr1 of active subunits (i.e., where there is at least one AA atom at ≤0.5 nm distance from any other atom of Thr1) (Holzhütter and Kloetzel, 2000). Table 5 lists all the AAs making nonbonding contact with the Thr1 of active subunits. The AAs surrounding the active sites of 20S iCP and cCP differ in β 1 and β 5: the electropositive and basic AA Arg45 in β 1c is substituted by the neutral AA Leu45 in β 1i, and the neutral AA Ala46 in β 5c is substituted by the neutral AA Ser46 in β 5i. It may simply be that, compared to β 1c, the AAs surrounding the active site of the β 1i subunit are less electropositive and tend to be more neutral on the whole, preferring to cleave after the neutral AAs. The substitution of neutral Ala46 in β 5c by neutral Ser46 in β 5i makes little difference to the physicochemical characteristics surrounding the β 5 active sites and does not alter the hydrolysis preference of β 5i. Relative to cCP, iCP preferentially cleaves peptides after nonpolar AAs (Seifert et al., 2010). So the cleavage difference between cCP and iCP may relate to the change in the physicochemical characteristics surrounding β 1-subunit active sites.
Table 5.
AA No. | AAs making nonbonding contact with their own Thr1 |
|||||
β 1c | β 1i | β 2c | β 2i | β 5c | β 5i | |
1 | Thr2 | Thr2 | Thr2 | Thr2 | Thr2 | Thr2 |
2 | Ile3 | Ile3 | Ile3 | Ile3 | Thr3 | Thr3 |
3 | Asp17 | Asp17 | Asp17 | Asp17 | Asp17 | Asp17 |
4 | Arg19 | Arg19 | Arg19 | Arg19 | Arg19 | Arg19 |
5 | Lys33 | Lys33 | Lys33 | Lys33 | Lys33 | Lys33 |
6 | Arg45 | Leu45 | Gly45 | Gly45 | Met45 | Met45 |
7 | Ser46 | Ser46 | Ala46 | Ala46 | Ala46 | Ser46 |
8 | Gly47 | Gly47 | Gly47 | Gly47 | Gly47 | Gly47 |
PDB id: 3UNE for cCP and 3UNH for iCP. Neutral AAs are colored black, electropositive and basic AAs are colored blue, and electronegative and acidic AAs are colored red (Note: for interpretation of the references to color in this figure legend, the reader is referred to the web version of this article)
The AA specificities of the P1 position correspond to the integral binding characteristics of the S1 pockets of three active subunits (Fig. 2). A binding mechanism is common to all active sites of cCP and iCP (Huber et al., 2012). Its selectivity depends, apart from the reactive warhead, solely on the interactions with S1 pockets. According to this mechanism, besides the P1 position, the proteasomal selective specificities in the P3′ position also relate to the S1 pockets of active subunits. The length of the unfolded peptide between the cleavage site and the P3′ position is 3 AAs, which is about 1.2–1.6 nm. Distances between any pair of pocket centers of the six active subunits of two β rings have been calculated (Table 6). The Ser130 (Gly130) Cα atoms of active subunits, which are close to the S1 pocket centers, are used as the S1 pocket centers. Of all 12 pairs of distances for cCP and iCP, the distances between the S1 pockets centers of β 1 and β 1 * are 1.48 and 1.56 nm, resepctively, and fall within the range 1.2–1.6 nm; the remaining 11 distances both for cCP and iCP are much longer than 1.6 nm. The AA specificities of the P3′ position are associated with the preference of the S1 pocket of β 1 *, and the AA specificities of the P1 position most probably reflect the characteristics of the S1 pocket of the β 1 subunit. Together, this demonstrates the selectivity of the cCP and iCP for certain cleavage sites, and that this selectivity may be related to the hydrolysis characteristics of their β 1 subunits.
Table 6.
cCP PDB id 3UNE | Distance between S1 pocket centers of active subunits of mouse 20S cCP (nm) |
|||
(Ser130) β 1c | (Gly130) β 2c | (Ser130) β 5c | ||
β ring | (Ser130) β 1c | 2.83 | 5.79 | |
(Gly130) β 2c | 2.83 | 6.29 | ||
(Ser130) β 5c | 5.79 | 6.29 | ||
β * ring | (Ser130) β * 1c | 1.56 | 3.79 | 5.66 |
(Gly130) β * 2c | 3.79 | 6.04 | 4.43 | |
(Ser130) β * 5c | 5.66 | 4.43 | 3.71 | |
| ||||
| ||||
iCP PDB id 3UNH | Distance between S1 pocket centers of active subunits of mouse iCP (nm) |
|||
(Ser130) β 1i | (Gly130) β 2i | (Ser130) β 5i | ||
| ||||
β ring | (Ser130) β 1i | 2.94 | 10.7 | |
(Gly130) β 2i | 2.94 | 11.0 | ||
(Ser130) β 5i | 10.7 | 11.0 | ||
β * ring | (Ser130) β * 1i | 1.48 | 4.11 | 9.67 |
(Gly130) β * 2i | 4.11 | 11.5 | 10.9 | |
(Ser130) β * 5i | 9.67 | 10.9 | 3.71 |
Ser130 (Gly130) Cα atoms of active subunits are used as the S1 pocket centers
Nevertheless, this leads to a contradiction. The in vitro cleavage samples are a collection of peptide fragments generated by three types of active subunits (‘caspase-like’ β 1, ‘trypsin-like’ β 2, and ‘chymotrypsin-like’ β 5); and the AA specificity information reflected by Fig. 2 is also a comprehensive reflection of different hydrolysis characteristics of active subunits. This resulted in low predictive accuracies of the cCP and iCP models (Table 2). If more digestion data for the β 1i/β 1c-, β 2i/β 2c-, and β 5i/β 5c-subunits were available, a predictive model specific for a single active subunit could be established; this could improve the proteasomal predictive accuracy.
The following results may explain the apparent contradiction. Muchamuel et al. (2009) showed that β 1i/β 1c- and β 2i/β 2c-subunits have much higher activities than β 5i/β 5c-subunits; β 2i/β 2c-subunits harbor very spacious S1 pockets and therefore do not show marked specificities (Huber et al., 2012); however, functional data indicate that incorporation of i-subunits enhances the activity of β 1i (Nussbaum et al., 1998), thereby resulting in the stronger preference of iCPs for hydrophobic AAs at P1. All these results suggest that the β 1 subunit is both specific and active, relative to the β 2 and β 5 subunits. These results are consistent with our conclusion.
Using the absolute amounts of AAs found in positions close to the cleavage sites generated by 20S proteasomes (6.135×10−9 mol cCP and 6.370×10−9 mol iCP cleavage sites in Toes et al. (2001), and 15.964×10−9 mol cCP and 9.164×10−9 mol iCP cleavage sites in Tenzer et al. (2004)), we calculated the log-odds ratios of all 20 AAs at P1 and P3′ (log-odds ratio, lg(fi/pi), where fi is the frequency of occurrence of AA i at a certain position, and pi is the background frequency of AA i in the Swiss-Prot database (Bairoch and Apweiler, 2000)). The sign of the log-odds ratio could reflect the preference of the position for the AA i, i.e., a positive sign ‘+’ implies that the AA i at this position is beneficial to the usage of the cleavage site, whereas a negative sign ‘−’ implies that the AA at this position is not. The analysis of yeast enolase degradation data (Toes et al., 2001) is shown in Table 7. The comparison of all 20 signs at P1 and P3′ shows that there are 16 identical signs at P1 and P3′ for cCP and 14 for iCP. This demonstrates that the preference for AAs at P1 and P3′ is the same for AAs at cCP and iCP (P<0.001 and P<0.05). This may be related to the selectivities of the two β 1 subunits. The analysis of prion degradation data shows that there are 13 identical signs at P1 and P3′ for cCP and 10 for iCP (Tenzer et al., 2004). This result, to some extent, reflects a similar preference of cCP and iCP for AAs at P1 and P3′ (P>0.1 and P≥0.5). This may be due to the highly biased composition of AAs in the prion sequence, i.e., about 20% Gly and many repeat regions (Tenzer et al., 2004).
Table 7.
AA | cCP |
iCP |
||
P1 | P3′ | P1 | P3′ | |
Ala | + | + | + | + |
Cys | − | − | − | − |
Asp | + | + | − | − |
Glu | + | + | − | − |
Phe | − | − | + | − |
Gly | − | + | − | + |
His | − | − | − | − |
Ile | + | − | + | − |
Lys | − | + | − | − |
Leu | + | + | + | + |
Met | − | − | − | − |
Asn | − | − | − | + |
Arg | − | − | − | − |
Pro | − | − | − | − |
Arg | − | − | − | − |
Ser | − | − | − | − |
Thr | − | − | − | − |
Val | + | + | − | + |
Trp | − | − | − | − |
Tyr | + | − | + | − |
Signs of log-odds ratios reflect the preference of the position for the amino acid (AA) i; i.e., ‘+’ implies that AA i at this position is beneficial to the usage of the cleavage site, whereas ‘−’ implies it is not
5. Conclusions
The KSMM+pair coefficient method could serve as a competitive bioinformatic tool. The three proteasomal cleavage predictive models in this paper are reasonable when the window size is 8 AAs. The cCP and iCP in vitro cleavage data show additional specificities in the MHC I-peptide processing and presentation pathway. The cleavage specificities of cCP and iCP towards substrate proteins may be related to the physicochemical characteristics of their β 1-subunit active sites.
Footnotes
Project (No.11271059) supported by the National Natural Science Foundation of China
Compliance with ethics guidelines: Yu-feng LU, Hao SHENG, Yi ZHANG, and Zhi-yang LI declare that they have no conflict of interest.
This article does not contain any studies with human or animal subjects performed by any of the authors.
References
- 1.Alvarez I, Sesma L, Marcilla M, Ramos M, Marti M, Camafeita E, de Castro JA. Identification of novel HLA-B27 ligands derived from polymorphic regions of its own or other class I molecules based on direct generation by 20S proteasome. J Biol Chem. 2001;276(35):32729–32737. doi: 10.1074/jbc.M104663200. [DOI] [PubMed] [Google Scholar]
- 2.Bairoch A, Apweiler R. The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000. Nucleic Acids Res. 2000;28(1):45–48. doi: 10.1093/bib/3.3.275. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Beekman NJ, van Veelen PA, van Hall T, Neisig A, Sijts A, Camps M, Kloetzel PM, Neefjes JJ, Melief CJ, Ossendorp F. Abrogation of CTL epitope processing by single amino acid substitution flanking the C-terminal proteasome cleavage site. J Immunol. 2000;164(4):1898–1905. doi: 10.4049/jimmunol.164.4.1898. [DOI] [PubMed] [Google Scholar]
- 4.Bhasin M, Raghava GPS. Pcleavage: an SVM based method for prediction of constitutive proteasome and immunoproteasome cleavage sites in antigenic sequences. Nucleic Acids. 2005;33(s2):W202–W207. doi: 10.1093/nar/gki587. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Bhasin M, Singh H, Raghava GPS. MHCBN: a comprehensive database of MHC binding and non-binding peptides. Bioinformatics. 2003;19(5):665–666. doi: 10.1093/bioinformatics/btg055. [DOI] [PubMed] [Google Scholar]
- 6.Blythe MJ, Doytchinova IA, Flower DR. JenPep: a database of quantitative functional peptide data for immunology. Bioinformatics. 2002;18(3):434–439. doi: 10.1093/bioinformatics/18.3.434. [DOI] [PubMed] [Google Scholar]
- 7.Cascio P, Hilton C, Kisselev AF, Rock KL, Goldberg AL. 26S proteasomes and immunoproteasomes produce mainly N-extended versions of an antigenic peptide. EMBO J. 2001;20(10):2357–2366. doi: 10.1093/emboj/20.10.2357. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Chang CC, Lin CJ. LIBSVM: a library for support vector machines. ACM Trans Intell Syst Technol. 2011;2(3):27. doi: 10.1145/1961189.1961199. [DOI] [Google Scholar]
- 9.Chapiro J, Claverol S, Piette F, Ma W, Stroobant V, Guillaume B, Gairin JE, Morel S, Burlet-Schiltz O, Monsarrat B, et al. Destructive cleavage of antigenic peptides either by the immunoproteasome or by the standard proteasome results in differential antigen presentation. J Immunol. 2006;176(2):1053–1061. doi: 10.1002/ijc.25911. [DOI] [PubMed] [Google Scholar]
- 10.Coux O, Tanaka K, Goldberg AL. Structure and functions of the 20S and 26S proteasomes. Ann Rev Biochem. 1996;65:801–847. doi: 10.1146/annurev.bi.65.070196.004101. [DOI] [PubMed] [Google Scholar]
- 11.Dick LR, Aldrich C, Jameson SC, Moomaw CR, Pramanik BC, Doyle CK, DeMartino GN, Bevan MJ, Forman JM, Slaughter CA. Proteolytic processing of ovalbumin and beta-galactosidase by the proteasome to a yield antigenic peptides. J Immunol. 1994;152(8):3884–3894. [PMC free article] [PubMed] [Google Scholar]
- 12.Dick TP, Nussbaum AK, Deeg M, Heinemeyer W, Groll M, Schirle M, Keilholz W, Stevanovic S, Wolf DH, Huber R, et al. Contribution of proteasomal beta-subunits to the cleavage of peptide substrates analyzed with yeast mutants. J Biol Chem. 1998;273(40):25637–25646. doi: 10.1074/jbc.273.40.25637. [DOI] [PubMed] [Google Scholar]
- 13.Diez-Rivero CM, Lafuente EM, Reche PA. Computational analysis and modeling of cleavage by the immunoproteasome and the constitutive proteasome. BMC Bioinf. 2010;11(1):479. doi: 10.1186/1471-2105-11-479. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Ehring B, Meyer TH, Eckerskorn C, Lottspeich F, Tampé R. Effects of major-histocompatibility-complex-encoded subunits on the peptidase and proteolytic activities of human 20S proteasomes. Cleavage of proteins and antigenic peptides. Eur J Biochem. 1996;235(1-2):404–415. doi: 10.1111/j.1432-1033.1996.00404.x. [DOI] [PubMed] [Google Scholar]
- 15.Emmerich NPN, Nussbaum AK, Stevanovic S, Priemer M, Toes REM, Rammensee HG. The human 26S and 20S proteasomes generate overlapping but different sets of peptide fragments from a model protein substrate. J Biol Chem. 2000;275(28):21140–21148. doi: 10.1074/jbc.M000740200. [DOI] [PubMed] [Google Scholar]
- 16.Ginodi I, Vider-Shalit T, Tsaban L, Louzoun Y. Precise score for the prediction of peptides cleaved by the proteasome. Bioinformatics. 2008;24(4):477–483. doi: 10.1093/bioinformatics/btm616. [DOI] [PubMed] [Google Scholar]
- 17.Goldberg AL, Cascio P, Saric T, Rock KL. The importance of the proteasome and subsequent proteolytic steps in the generation of antigenic peptides. Mol Immunol. 2002;39:147–164. doi: 10.1371/journal.pbio.0040267. [DOI] [PubMed] [Google Scholar]
- 18.Goldobin DS, Zaikin A. Towards quantitative prediction of proteasomal digestion patterns of proteins. J Stat Mech. 2009;2009:P01009. doi: 10.1088/1742-5468/2009/01/P01009. [DOI] [Google Scholar]
- 19.Heinemeyer W, Ramos PC, Dohmen RJ. The ultimate nanoscale mincer: assembly, structure and active sites of the 20S proteasome core. Cell Mol Life Sci. 2004;61(13):1562–1578. doi: 10.1007/s00018-004-4130-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Holzhütter HG, Kloetzel PM. A kinetic model of vertebrate 20S proteasome accounting for the generation of major proteolytic fragments from oligomeric peptide substrates. Biophys J. 2000;79(3):1196–1205. doi: 10.1016/S0006-3495(00)76374-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Huber EM, Basler M, Schwab R, Heinemeyer W, Kirk CJ, Groettrup M, Groll M. Immuno- and constitutive proteasome crystal structures reveal differences in substrate and inhibitor specificity. Cell. 2012;148(4):727–738. doi: 10.1016/j.cell.2011.12.030. [DOI] [PubMed] [Google Scholar]
- 22.Jacob L, Vert JP. Efficient peptide-MHC-I binding prediction for alleles with few known binders. Bioinformatics. 2008;24(3):358–366. doi: 10.1186/1471-2105-9-363. [DOI] [PubMed] [Google Scholar]
- 23.Keşmir C, Nussbaum AK, Schild H, Detours V, Brunak S. Prediction of proteasome cleavage motifs by neural networks. Prot Eng. 2002;15(4):287–296. doi: 10.1093/protein/15.4.287. [DOI] [PubMed] [Google Scholar]
- 24.Keşmir C, van Noort V, de Boer RJ, Hogeweg P. Bioinformatic analysis of functional differences between the immunoproteasome and the constitutive proteasome. Immunogenetics. 2003;55(7):437–449. doi: 10.1007/s00251-003-0585-6. [DOI] [PubMed] [Google Scholar]
- 25.Kosmrlj A, Read EL, Qi Y, Allen TM, Altfeld M, Deeks SG, Pereyra F, Carrington M, Walker BD, Chakraborty AK. Effects of thymic selection of the T-cell repertoire on HLA class I-associated control of HIV infection. Nature. 2010;465(7296):350–354. doi: 10.1038/nature08997. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Leibovitz D, Koch Y, Pitzer F, Fridkin M, Dantes A, Baumeister W, Amsterdam A. Sequential degradation of the neuropeptide gonadotropin-releasing hormone by the 20 S granulosa cell proteasomes. FEBS Lett. 1994;346(2-3):203–206. doi: 10.1016/0014-5793(94)00472-2. [DOI] [PubMed] [Google Scholar]
- 27.Liu T, Liu W, Song Z, Jiao CB, Zhu MH, Wang XG. Computational prediction of the specificities of proteasome interaction with antigen protein. Cell Mol Immunol. 2009;6(2):135–142. doi: 10.1038/cmi.2009.19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Lucchiari-Hartz M, Lindo V, Hitziger N, Gaedicke S, Saveanu L, Endert PM. Differential proteasomal processing of hydrophobic and hydrophilic protein regions: contribution to cytotoxic T lymphocyte epitope clustering in HIV-1-Nef. PNAS. 2003;100(13):7755–7760. doi: 10.1073/pnas.1232228100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Ma WB, Vigneron N, Chapiro J, Stroobant V, Germeau C, Boon T, Coulie PG, van den Eynde BJ. A MAGE-C2 antigenic peptide processed by the immunoproteasome is recognized by cytolytic T cells isolated from a melanoma patient after successful immunotherapy. Int J Cancer. 2011;129(10):2427–2434. doi: 10.1002/ijc.25911. [DOI] [PubMed] [Google Scholar]
- 30.Metropolis N, Ulam S. The Monte Carlo method. J Am Stat Assoc. 1949;44(247):335–341. doi: 10.2307/2280232. [DOI] [PubMed] [Google Scholar]
- 31.Mommaas B, Kamp J, Drijfhout JW, Beekman N, Ossendorp F, van Veelen P, den Haan J, Goulmy E, Mutis T. Identification of a novel HLA-B60-restricted T cell epitope of the minor histocompatibility antigen HA-1 locus. J Immunol. 2002;169(6):3131–3136. doi: 10.4049/jimmunol.169.6.3131. [DOI] [PubMed] [Google Scholar]
- 32.Morel S, Lévy F, Burlet-Schiltz O, Brasseur F, Probst-Kepper M, Peitrequin AL, Monsarrat B, van Velthoven R, Cerottini JC, Boon T, et al. Processing of some antigens by the standard proteasome but not by the immunoproteasome results in poor presentation by dendritic cells. Immunity. 2000;12(1):107–117. doi: 10.1016/S1074-7613(00)80163-6. [DOI] [PubMed] [Google Scholar]
- 33.Muchamuel T, Basler M, Aujay MA, Suzuki E, Kalim KW, Lauer C, Sylvain C, Ring ER, Shields J, Jiang J, et al. A selective inhibitor of the immunoproteasome subunit LMP7 blocks cytokine production and attenuates progression of experimental arthritis. Nat Med. 2009;15(7):781–787. doi: 10.1038/nm.1978. [DOI] [PubMed] [Google Scholar]
- 34.Niedermann G, Butz S, Ihlenfeldt HG, Grimm R, Lucchiari M, Hoschützky H, Jung G, Maier B, Eichmann K. Contribution of proteasome-mediated proteolysis to the hierarchy of epitopes presented by major histocompatibility complex class I molecules. Immunity. 1995;2(3):289–299. doi: 10.1016/1074-7613(95)90053-5. [DOI] [PubMed] [Google Scholar]
- 35.Niedermann G, King G, Butz S, Birsner U, Grimm R, Shabanowitz J, Hunt DF, Eichmann K. The proteolytic fragments generated by vertebrate proteasomes: structural relationships to major histocompatibility complex class I binding peptides. PNAS. 1996;93(16):8572–8577. doi: 10.1073/pnas.93.16.8572. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Niedermann G, Grimm R, Geier E, Maurer M, Realini C, Gartmann C, Soll J, Omura S, Rechsteiner MC, Baumeister W, et al. Potential immunocompetence of proteolytic fragments produced by proteasomes before evolution of the vertebrate immune system. J Exp Med. 1997;186(2):209–220. doi: 10.1084/jem.186.2.209. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Nielsen M, Lundegaard C, Lund O. Prediction of MHC class II binding affinity using SMM-align, a novel stabilization matrix alignment method. BMC Bioinf. 2007;8(1):238. doi: 10.1186/1471-2105-8-238. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Nussbaum AK, Dick TP, Keilholz W. Cleavage motifs of the yeast 20S proteasome b subunits deduced from digests of enolase 1. PNAS. 1998;95(21):12504–12509. doi: 10.1073/pnas.95.21.12504. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Nussbaum AK, Kuttler C, Hadeler KP, Rammensee HG, Schild H. PAProC: a prediction algorithm for proteasomal cleavages available on the WWW. Immunogenetics. 2001;53(2):87–94. doi: 10.1007/s002510100300. [DOI] [PubMed] [Google Scholar]
- 40.Ossendorp F, Neisig A, Ruppert T, Groettrup M, Sijts A, Mengedë E, Kloetzel PM, Neefjes J, Koszinowski U, Melief C. A single residue exchange within a viral ctl epitope alters proteasome-mediated degradation resulting in lack of antigen presentation. Immunity. 1996;5(2):115–124. doi: 10.1016/S1074-7613(00)80488-4. [DOI] [PubMed] [Google Scholar]
- 41.Peters B, Sette A. Generating quantitative models describing the sequence specificity of biological processes with the stabilized matrix method. BMC Bioinf. 2005;6(1):132. doi: 10.1186/1471-2105-6-132. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Peters B, Janek K, Kuckelkorn U, Holzhütter HG. Assessment of proteasomal cleavage probabilities from kinetic analysis of time-dependent product formation. J Mol Biol. 2002;318(3):847–862. doi: 10.1016/S0022-2836(02)00167-5. [DOI] [PubMed] [Google Scholar]
- 43.Peters B, Tong W, Sidney J, Sette A, Weng Z. Examining the independent binding assumption for binding of peptide epitopes to MHC-I molecules. Bioinformatics. 2003;19(14):1765–1772. doi: 10.1093/bioinformatics/btg247. [DOI] [PubMed] [Google Scholar]
- 44.Rivett AJ. Purification of a liver alkaline protease which degrades oxidatively modified glutamine synthetase. Characterization as a high molecular weight cysteine proteinase. J Biol Chem. 1985;260(23):12600–12606. [PubMed] [Google Scholar]
- 45.Sandberg M, Eriksson L, Jonsson J, Sjöström M, Wold S. New chemical descriptors relevant for the design of biologically active peptides. A multivariate characterization of 87 amino acids. J Med Chem. 1998;41(14):2481–2491. doi: 10.1021/jm9700575. [DOI] [PubMed] [Google Scholar]
- 46.Saxová P, Buus S, Brunak S, Keşmir C. Predicting proteasomal cleavage sites: a comparison of available methods. Int Immunol. 2003;15(7):781–787. doi: 10.1093/intimm/dxg084. [DOI] [PubMed] [Google Scholar]
- 47.Schultz ES, Chapiro J, Lurquin C, Claverol S, Burlet-Schiltz O, Warnier G, Russo V, Morel S, Levy F, Boon T, et al. The production of a new MAGE-3 peptide presented to cytolytic T lymphocytes by HLA-B40 requires the immunoproteasome. J Exp Med. 2002;195(4):391–399. doi: 10.1084/jem.20011974. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Seifert U, Bialy LP, Ebstein F, Bech-Otschir D, Voigt A, Schröter F, Prozorovski T, Lange N, Steffen J, Rieger M, et al. Immunoproteasomes preserve protein homeostasis upon interferon-induced oxidative stress. Cell. 2010;142(4):613–624. doi: 10.1016/j.cell.2010.07.036. [DOI] [PubMed] [Google Scholar]
- 49.Shimbara N, Nakajima H, Tanahashi N, Ogawa K, Niwa S, Uenaka A, Nakayama E, Tanaka K. Double-cleavage production of the CTL epitope by proteasomes and PA28: role of the flanking region. Genes Cells. 1997;2(12):785–800. doi: 10.1046/j.1365-2443.1997.1610359.x. [DOI] [PubMed] [Google Scholar]
- 50.Sorokin AV, Kim ER, Ovchinnikov LP. Proteasome system of protein degradation and processing. Biochemistry. 2009;74(13):1411–1442. doi: 10.1134/S000629790913001X. [DOI] [PubMed] [Google Scholar]
- 51.Sun Y, Sijts AJ, Song M, Janek K, Nussbaum AK, Kral S, Schirle M, Stevanovic S, Paschen A, Schild H, et al. Expression of the proteasome activator PA28 rescues the presentation of a cytotoxic T lymphocyte epitope on melanoma cells. Cancer Res. 2002;62(10):2875–2882. [PubMed] [Google Scholar]
- 52.Swets JA. Measuring the accuracy of diagnostic systems. Science. 1988;240(4857):1285–1293. doi: 10.1126/science.3287615. [DOI] [PubMed] [Google Scholar]
- 53.Tenzer S, Stoltze L, Schonfisch B, Dengjel J, Muller M, Stevanovic S. Quantitative analysis of prion protein degradation by constitutive and immuno-20S proteasomes indicates differences correlated with disease susceptibility. J Immunol. 2004;172(2):1083–1091. doi: 10.4049/jimmunol.172.2.1083. [DOI] [PubMed] [Google Scholar]
- 54.Tenzer S, Peters B, Bulik S, Schoor O, Lemmel C, Schatz MM, Kloetzel PM, Rammensee HG, Schild H, Holzhütter HG. Modeling the MHC class I pathway by combining predictions of proteasomal cleavage, TAP transport and MHC class I binding. Cell Mol Life Sci. 2005;62(9):1025–1037. doi: 10.1007/s00018-005-4528-2. [DOI] [PubMed] [Google Scholar]
- 55.Toes RE, Nussbaum AK, Degermann S, Schirle M, Emmerich NP. Discrete cleavage motifs of constitutive and immunoproteasomes revealed by quantitative analysis of cleavage products. J Exp Med. 2001;194(1):1–12. doi: 10.1084/jem.194.1.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Vigneron N, Stroobant V, Chapiro J, Ooms A, Degiovanni G, Morel S, van der Bruggen P, Boon T, van den Eynde BJ. An antigenic peptide produced by peptide splicing in the proteasome. Science. 2004;304(5670):587–590. doi: 10.1126/science.1095522. [DOI] [PubMed] [Google Scholar]
- 57.Warren EH, Vigneron NJ, Gavin MA, Coulie PG, Stroobant V, Dalet A, Tykodi SS, Xuereb SM, Mito JK, Riddell SR, et al. An antigen produced by splicing of noncontiguous peptides in the reverse order. Science. 2006;313(5792):1444–1447. doi: 10.1126/science.1130660. [DOI] [PubMed] [Google Scholar]
- 58.Wenzel T, Eckerskorn C, Lottspeich F, Baumeister W. Existence of a molecular ruler in proteasomes suggested by analysis of degradation products. FEBS Lett. 1994;349(2):205–209. doi: 10.1016/0014-5793(94)00665-2. [DOI] [PubMed] [Google Scholar]