Abstract
Simple Summary
Colorectal cancer (CRC) is currently the third leading cause of cancer death worldwide. Early diagnosis of CRC is important for increasing the opportunity for treatment and receiving a good prognosis. The aim of our study was to develop a detection method that combined wheat germ agglutinin (WGA) chromatography with mass spectrometry (MS) for early detection of CRC. Further, machine learning algorithms and logistic regression were applied to combine multiple biomarkers we discovered. We validated in a population of 286 plasma samples the diagnostic performance of peptides corresponding to WGA-captured protein and its combination, which received a sensitivity of 84.5% and a specificity of 97.5% in the diagnoses of CRC. Proteomic biomarkers combined with algorithms can provide a powerful tool for discriminating patients with CRC and health controls (HCs). Measurements of WGA-captured PF4, ITIH4, and APOE with MS are then useful for early detection of CRC. Additionally, our study revealed the potential of applying lectin chromatography with MS for disease diagnosis.
Abstract
Colorectal cancer (CRC) is currently the third leading cause of cancer-related mortality in the world. U.S. Food and Drug Administration-approved circulating tumor markers, including carcinoembryonic antigen, carbohydrate antigen (CA) 19-9 and CA125 were used as prognostic biomarkers of CRC that attributed to low sensitivity in diagnosis of CRC. Therefore, our purpose is to develop a novel strategy for novel clinical biomarkers for early CRC diagnosis. We used mass spectrometry (MS) methods such as nanoLC-MS/MS, targeted LC-MS/MS, and stable isotope-labeled multiple reaction monitoring (MRM) MS coupled to test machine learning algorithms and logistic regression to analyze plasma samples from patients with early-stage CRC, late-stage CRC, and healthy controls (HCs). On the basis of our methods, 356 peptides were identified, 6 differential expressed peptides were verified, and finally three peptides corresponding wheat germ agglutinin (WGA)-captured proteins were semi-quantitated in 286 plasma samples (80 HCs and 206 CRCs). The novel peptide biomarkers combination of PF454–62, ITIH4429–438, and APOE198–207 achieved sensitivity 84.5%, specificity 97.5% and an AUC of 0.96 in CRC diagnosis. In conclusion, our study demonstrated that WGA-captured plasma PF454–62, ITIH4429–438, and APOE198–207 levels in combination may serve as highly effective early diagnostic biomarkers for patients with CRC.
Keywords: colorectal cancer, multiple reaction monitoring, tandem mass spectrometer, wheat germ agglutinin, plasma, machine learning algorithm
1. Introduction
Colorectal cancer (CRC) is currently the third leading cause of cancer-related mortality worldwide and the most common cancer in Taiwan [1,2]. The global incidence by the year 2030 is estimated to be 2.2 million new cases, with the number of CRC-related deaths for that period to be 1.1 million [3]. Currently, the guaiac fecal occult blood test (gFOBT) is the most common non-invasive screening method, which is based on detecting the activity of hemoglobin peroxidase. Despite the FOBT being a rapid and low-cost method for screening CRC, poor selectivity and sensitivity by FOBT result in high rates of false positives and false negatives [4]. Various circulating protein biomarkers, including carbohydrate antigen 125 (CA125), CA19-9, carcinoembryonic (CEA), alpha fetoprotein (AFP), and ferritin, are applied for monitoring and diagnosing CRC in the clinic; however, these biomarkers only achieved 10.39–46.59% sensitivity and 80~95% specificity in diagnosing CRC [5,6]. Therefore, an alternative rapid, high throughput and accurate screening procedure is urgently needed for early diagnosis of CRC.
Liquid chromatography coupled with tandem mass spectrometry (LC-MS/MS) has been widely used for proteomic studies [7]. For example, Saleem S. et al. revealed that caveolin-1 and matrix metalloproteinase-9 are stage dependent in CRC in proteomic analysis [8]. Furthermore, it has been applied for biomarker discovery in qualitative and quantitative analyses [9]. An elegant study from Shiromizu T. et al. identified and validated 22 biomarker candidates for CRC with LC-MS/MS [10]. Moreover, Beretov J. et al. identified biomarker candidates with label-free LC-MS/MS [11]. Label-free LC-MS/MS can provide global characterization of proteomic features to identify biomarker candidates; however, the findings required further verification and validation [9]. Therefore, verification and validation of findings from label-free LC-MS/MS usually use isotope-label multiple reaction monitoring (MRM) which provides more-sensitive and more-reliable quantitative results [12]. In an MRM-based assay, analyzed by LC-MS/MS, targeted transitions of precursors selected in Q1 and fragment ions selected in Q3 are monitored by a triple-quadrupole MS instrument that generates signals for qualification and quantitation [12]. MRM-based assays have been widely applied to protein quantitation in various fluids, including plasma [13] and serum [14].
Glycan can affect tumor progression in different ways, including metastasis, invasion, and proliferation [15,16]. The heterogeneity of glycosylation sites or changes in glycan structures in body fluids were shown to be correlated with the development and progression of certain cancer states [17,18]. Further, alteration of glycoprotein levels was reported in CRC [19], breast cancer [20], and prostate cancer [21]. Therefore, glycoproteins can be considered an ideal source for the early detection of cancers [22]. For instance, many cancer biomarkers in the clinic are glycoproteins, including CEA in CRC [6], carbohydrate antigen 19-9 (CA19-9) in gastrointestinal cancer [23], and prostate-specific antigen (PSA) in prostate cancer [24]. Lectin chromatography has been widely used to capture glycoproteins for MS analyses [25,26]. Lectins are a group of proteins that have unique affinities to carbohydrates; they can reversibly and specifically interact with certain glycan structural motifs [27]. For example, wheat germ agglutinin (WGA) can bind with N-acetylglucosamine (GlcNAc) on glycoproteins and interact with sialic acid-containing oligosaccharides [28]. Studies showed that the addition or removal of O-linked N-acetylglucosamines on proteins very likely plays key roles in tumor pathogenesis [29]. However, only a few studies have applied lectin affinity chromatography with LC-MS/MS in biomarker discovery and validation [30,31,32].
Hence, in this study we performed WGA chromatography and nanoLC-MS/MS to discover biomarker candidates in patients with CRC and utilized ultra-performance (UP) LC-MS/MS to perform a targeted method to verify our biomarker candidates. We then synthesized the verified biomarker candidates and optimized the LC-MS/MS parameters. To further examine the biomarker performance, a stable isotope-labeled MRM assay with machine learning algorithms was used to validate the performance of the diagnostic biomarkers. In short, after biomarker discovery and analytical method development, 80 plasma samples from healthy controls (HCs) and 206 plasma samples from patients with CRC were analyzed; a set of biomarkers consisting of platelet factor 4 (PF454–62), apolipoprotein E (APOE198–207) and inter-alpha-trypsin inhibitor heavy chain H4 (ITIH4429–438) showed significant differences among HCs, early-stage CRC patients, and late-stage CRC patients. Moreover, machine learning algorithms and logistic regression were incorporated to combine the diagnostic performances of the biomarkers.
2. Materials and Methods
2.1. Materials
The synthetic peptides and their stable isotope-labeled counterparts were obtained from Yao-Hong Corp. (New Taipei City, Taiwan). Sequencing-grade trypsin was purchased from Promega (Madison, WI, USA). Chicken serum was purchased from MyBioSource (San Diego, CA, USA). Agarose-bound wheat germ agglutinin (WGA) was obtained from Vector Laboratories (Burlingame, CA, USA). N-Acetylglucoseamine and formic acid were purchased from Tokyo Chemical Industry (Tokyo, Japan). Acetonitrile, methanol, dithiothreitol (DTT), and iodoacetamide (IAM) were obtained from Sigma-Aldrich (St. Louis, MO, USA).
2.2. Patients and Specimens
Plasma samples from 286 patients with CRC and 120 healthy volunteers (healthy controls (HCs)) were all purchased from the Joint Biobank of Taipei Medical University. This study was approved by the Taipei Medical University-Joint Institutional Review Board (nos. 201308022 and N202007061). As for the sample collection procedure in biobank, the whole blood samples were withdrawn in EDTA tubes and centrifuged at 3500 rpm for 5 min. After centrifugation, the plasma samples were stored at −80 °C until analyzed. Information on patient demographics and clinical features are summarized in Table 1. CRC plasma samples were from patients with stage I, II, III, and IV tumors. The pooled samples of CRC were generated into CRC stage I, CRC stage II, CRC stage III, and CRC stage IV from 80 CRC plasma samples. As for the pooled samples of the healthy control, they were generated into HCs from 40 HC plasma samples. The following validation set of 286 samples (early-stage CRC (stage I/II tumors, 100 samples), late-stage CRC (stage III/IV tumors, 106 samples), and healthy controls (80 samples)) were sampled from independent plasma samples of CRC patients and HCs.
Table 1.
Varible | Discovery Set | Validation Set | ||||||||
---|---|---|---|---|---|---|---|---|---|---|
group | HC | early-stage CRC | late-stage CRC | HC | early-stage CRC | late-stage CRC | ||||
number of samples | n = 40 | n = 40 | n = 40 | n = 80 | n = 100 | n = 106 | ||||
stage I | stage II | stage III | stage IV | stage I | stage II | stage III | stage IV | |||
n =20 | n =20 | n =20 | n =20 | n = 47 | n =53 | n =50 | n =56 | |||
male: female ratio | 24:16 | 24:16 | 24:16 | 46:34 | 71:29 | 66:40 | ||||
mean age years ± SD | 57.65 ± 3.48 | 52.5 ± 6.87 | 51.25 ± 4.85 | 39.43 ± 11.14 | 70.84 ± 10.7 | 67.33 ± 10.06 |
2.3. Wheat Germ Agglutinin Chromatograhpy and Sample Preparation
20 μL of plasma sample was added and mixed with agarose bound WGA for 1 h. WGA-bound proteins were washed with phosphate-buffered saline (PBS) three times and eluted with elution buffer (0.5 M N-acetylglucosamine dissolved in 1 mM acetic acid). The Bradford assay was conducted to determine protein concentrations after elution. Proteins (20 μg) were lyophilized in a SpeedVac system. Extended isotope-labeled peptides or BSA were spiked into the sample after the protein pellet had been reconstituted in 30 μL of double-distilled (dd) H2O. DTT (550 mM, 1 μL) was added to the sample and incubated at 56 °C for 45 min. After 2 μL of IAM (450 mM) was added to the sample and incubated in the dark for 45 min, the sample was digested with trypsin (0.5 μg) for 16 h at 37 °C. Digestion was quenched by adjusting the final concentration in 0.1% formic acid.
2.4. Nano-LC-MS/MS Analyses
The nano-LC-MS/MS analysis was performed on a nanoAcquity system (Waters, Milford, MA, USA) connected to an Orbitrap Elite hybrid mass spectrometer (Thermo Electron, Bremen, Germany) equipped with a PicoView nanospray interface (New Objective, Woburn, MA, USA). Peptide mixtures were loaded onto a 75-μm inner diameter, 25-cm-long C18 BEH column (Waters) packed with 1.7-μm particles with a pore width of 130 Å and were separated using a segmented gradient in 60 min from 5% to 35% solvent B (acetonitrile with 0.1% formic acid) at a flow rate of 300 nl/min and a column temperature of 35 °C. Solvent A was 0.1% formic acid in water. The mass spectrometer was operated in the data-dependent mode. Briefly, surveyed full-scan MS spectra were acquired in the orbitrap (m/z 350~1600) with resolution set to 120 K at m/z 400 and an automatic gain control (AGC) target of 106. The 20 most intense ions were sequentially isolated for collision induced dissociation MS/MS fragmentation and detection in a linear ion trap (AGC target of 104) with previously selected ions dynamically excluded for 60 s. Ions with a single and unrecognized charge state were also excluded. PEAKS 7 software (Bioinformatics Solutions, Waterloo, ON, Canada) was used to sequence WGA-captured proteins from acquired MS/MS spectra against the Universal Protein Knowledgebase, a human protein database (UniProt; http://www.uniprot.org/, 18 January 2020) containing 168,088 protein entities (UniProt, January 2020) with MS tolerance set to 10 ppm, MS/MS tolerance set to 0.6 Da, and with a false discovery rate (FDR) of 1% at PSM level. The total ion current (TIC) normalization and label-free quantification was performed by Peaks Q module in PEAKS 7. The Peaks PTM module of PEAKS 7 software was used to identify sequences of glycosylation and methylation. Carbamidomethylation (C)/+57.0215 Da was set as the fixed, whereas oxidation (M)/+15.9949 Da and the following glycosylation were specified as variables: hexose modified CRKTW (+162.0528 Da), fucose modified TS (+146.0579 Da), O-GlcNac modified STN (+203.195 Da), Hex1HexNAc1 modified N (+511.1901 Da), Hex1HexNAc1NeuAc1 modified NTS (+656.2276 Da), and Hex1HexNAc1NeuAc2 modified NTS (+947.3231 Da). Also, the following methylation were specified as variables: methylation modified CDEHIKLNQRST (+14.0156 Da), dimethylation modified KNR (+28.0313 Da), K (+32.0564 Da), K (+34.0631 Da), and trimethylation modified KAR (+43.0058 Da). The MS/MS data are available through ProteomeXchange Consortium via the PRIDE partner repository with the dataset identifier PXD024997 [33].
2.5. MRM Method
The MRM method was performed on a 1260 Infinity II Quaternary Pump LC system (Agilent, Santa Clara, CA, USA) connected to an Agilent 6470 triple quadrupole mass spectrometer (Agilent, Santa Clara, CA, USA) in the dynamic multiple-reaction monitoring (dMRM) mode. Tryptic-digested samples were loaded onto a 50-mm-long C18 column (Phenomenex, Torrance, CA, USA) packed with 2.6-μm particles with a pore size of 100 Å and were separated using an optimized gradient in 15 min, from 5% to 15% solvent B (acetonitrile with 0.1% formic acid) at a flow rate of 0.4 mL/min and a column temperature of 40 °C. Solvent A was 0.1% formic acid in water. The mass spectrometer was operated in the dMRM mode. The samples in each batch were randomly analyzed. dMRM data was processed with Skyline 20.1.0.76 (MacCoss Lab Software, Seattle, WA, USA) and normalized by internal standards. Details of method validation are provided in “Supplementary Information”.
2.6. Statistical Anaylses
In targeted LC-MS/MS, the significance levels of PF4, FIBA, ITIH4, AACT, APOE, and CFAH were determined using the Student’s t-test. A one-way analysis of variance (ANOVA) was used to test levels among early-stage CRC, late-stage CRC, and HCs. Scheffe’s post-hoc test was applied to evaluate differences in the mean between any two groups; in addition, a post-hoc test using the Bonferroni method was applied with a 0.0167-adjusted significance level in three groups. In semi-quantification of peptides, the significance levels of PF4, ITIH4, and APOE were determined using the Student’s t-test. ANOVA was used to test levels among early-stage CRC, late-stage CRC, and HCs. Scheffe’s post-hoc test was applied to evaluate differences in the mean between any two groups; in addition, a post-hoc test using the Bonferroni method was applied with a 0.0167-adjusted significance level in three groups’ comparison and 0.0083-adjusted significance level in four groups comparison. We used GraphPad Prism (vers. 5.0; GraphPad Software, San Diego, CA, USA) to evaluate differences among groups and generated receiver operating characteristic (ROC) curves to evaluate the diagnostic performance of the biomarkers. The cutoff value for an ROC curve was determined by the Youden index, which represents the sum of sensitivity and [1 − specificity], and the maximum value of the Youden index is a suitable cutoff point for that curve. Pair-wise comparisons of ROC curves were assessed using MedCalc Statistical Software (vers. 15.4; MedCalc Software, Ostend, Belgium). The one-way ANOVA and power were determined using SAS (vers. 9.3; SAS Institute, Cary, NC, USA), and power estimations were calculated according to the ROC analysis. The area under the ROC curve (AUC), sensitivity, and specificity were calculated at a 95% confidence level. The significance level of all statistical tests was set to p < 0.05. To combine the diagnostic performance from multiple biomarkers, we incorporated four different algorithms, including logistic regression (LR), decision trees (DT), random forests (RF) and support vector machine (SVM) with 10-fold cross validation in scikit-learn (vers. 0.21.3). Parameter tuning was performed for each training and validation set on the basis of the 10-fold cross-validation. Further, the tuning process was based on the value of AUC. For RF, we used the initial tree value number of 100, which increased by 100 until reaching 500. The kernel of the model was set to gini or entropy. As for DT, we used the initial value of tree depth, which was set to 1–10 with a step of 1. The kernel of the model was set to gini or entropy. For SVM, the initial value of gamma was set to 1 × 10−6–1 × 10−10 with a step of 1e-1. The initial value of C was set to 1 × 10−4–1 × 10−7 with a 10-fold step. The kernel of the model was set to RBF. Lastly, we used the default setting in LR. To assess the predictive performance, we applied a confusion matrix to calculate the accuracy, sensitivity, specificity, and AUC.
3. Results
3.1. Discovery MS
In this study, biomarker candidates were identified using pooled samples of 20 plasma samples corresponding to CRC stage I, CRC stage II, CRC stage III, CRC stage IV, all-stage CRC, and HC samples as the discovery set (Table 1). Pooled samples were individually purified by WGA chromatography. WGA-captured plasma samples were trypsin-digested and analyzed in an LTQ-Orbitrap-Elite instrument with two replicates. Peptides corresponding to WGA-captured proteins were identified and label-free quantification was performed with PEAKS 7 software. Differentially expressed unique peptides derived from WGA-captured proteins among the early-stage CRC (stage I and stage II), late-stage CRC (stage III and stage IV), and HC groups are shown in Table S1. In this study, 55 proteins and 356 peptides were identified differentially with a false discovery rate (FDR) of 1%. The peptides were considered significantly changed if −10lgP < 13 (p-value < 0.05); of these, 269 peptides were increased by >1.5-fold in WGA-captured plasma from CRC groups compared to the HC group, and 4 peptides were decreased by <0.8-fold in the CRC group (Table S1). To ensure that peptide characteristics were suitable for analysis, we excluded peptides with the following signatures: (1) mis-cleavage peptides, which may lack the reproducibility in each analysis and (2) peptides containing more than 10 amino acids, which may increase the difficulty of synthesizing [34,35]. In total, 79 peptides were selected for further analysis in the pooled samples from early-stage CRC, late-stage CRC, and HC samples in the discovery set (Table 1). These selected peptides corresponding to differentially expressed WGA-captured plasma proteins were then analyzed using an Agilent 6470 instrument to examine the quality of signals on a triple quadrupole. We discovered that only two methylated peptides and four unmodified peptides satisfied a signal-to noise ratio (S/N) of >5, including ADLS*GITGAR (AACT341–350), HITSLEVIK (PF454–62), LALDNGGLAR (ITIH4429–438), LGPLVEQGR (APOE198–207), QLEQVIAK (FIBA202–210), and SLGN*VIMVCR (CFAH58–67) (* indicates a methylated site, Table S2). Information on targeted transitions, optimized collisions, and fragmentors is summarized in Table S2.
3.2. Targeted LC-MS/MS
To examine the discriminative ability of these biomarker candidates in early-stage CRC, late-stage CRC, and HC groups, we performed targeted LC-MS/MS to analyze another randomly selected 20 paired early-stage CRC, late-stage CRC, and HC plasma samples individually from the discovery set (Table 1). Plasma samples were analyzed with UPLC-MS/MS after purification with WGA chromatography and trypsin digestion. Results showed that four of six peptides corresponding to WGA-captured proteins changed in early-stage CRC compared to the HC group. The statistically significant p value was set to 0.0167 in a one-way analysis of variance (ANOVA). Levels of FIBA202–210 and PF454–62 (p = 0.0134) increased in late-stage CRC groups compared to the HC group (Figure 1). Levels of ITIH4429–438 (p = 0.036), APOE198–207 (p < 0.0001), and CFAH58–67 decreased in early-stage CRC groups compared to the HC group. Among these results, PF4429–438 expression levels increased the most in early-stage and late-stage CRC, while APOE198–207 and ITIH4429–438 expression levels decreased the most in early-stage and late-stage CRC (Figure 1). Therefore, three peptides (PF454–62, ITIH4429–438, and APOE198–207) from among the WGA-captured proteins were selected as biomarker candidates for further validation and semi-quantification with the stable isotope-labeled MRM assay.
3.3. Analytical Method Development
In order to ensure that our MRM assay was acceptable for use, extended stable isotope-labeled peptides and extended peptides corresponding to the three selected WGA-captured proteins (PF454–62, ITIH4429–438, APOE198–207) were synthesized for semi-quantification. Details of extended peptides and extended stable isotope-labeled peptides are shown in Table S3. To validate the analytical method, the following parameters were evaluated: calibration curve, analytical specificity (selectivity), analytical sensitivity, and carryover. Details of all procedures are summarized in “Supplementary Information”. Analytical measurement ranges, including the lower limit of quantification (LLOQ) and upper limit of quantification (ULOQ), in chicken serum were 3.90 ~ 1000 ng/mL for PF454–62, 1.95~250 ng/mL for ITIH4429–438, 1.95–250 ng/mL for APOE192-207, and were linear (R2 > 0.99) for all peptides (Table S4). The analytical specificity and sensitivity satisfied the criteria when calculated with respect to the LLOQ sample (Tables S5 and S6). The absence of carryover was confirmed by analyzing ULOQ samples, followed by a blank sample (Table S7). All of these results proved the reliability of the MRM method we developed.
3.4. Semi-Quantification of Peptides in Large Samples
On the basis of results from targeted LC-MS/MS and analytical performance development, three peptides corresponding to PF4, ITIH4, and APOE were measured in 100 early-stage CRC, 106 late-stage CRC, and 80 HC samples (Table 1). The extracted ion chromatograms, standard curves, and group comparisons of concentrations are presented in Figure 2. The statistically significant p value was set to 0.0083 in a one-way analysis of variance (ANOVA). The HITSLEVIK peptide corresponding to PF4 in patients with early-stage CRC (2.65-fold, p < 0.0001) and late-stage CRC (2.90-fold, p < 0.0001) was significantly higher than that of the HC group (Figure 2A). The LALDNGGLAR peptide corresponding to ITIH4 in patients with early-stage CRC (0.68-fold, p < 0.0001) and late-stage CRC (0.69-fold, p < 0.0001) was significantly lower than that of the HC group (Figure 2B). The LGPLVEQGR peptide corresponding to APOE in patients with early-stage CRC (0.58-fold, p < 0.0001) and late-stage CRC (0.68-fold, p < 0.0001) was significantly lower than that of the HC group (Figure 2C).
3.5. Diagnostic Performance
We first evaluated the diagnostic performances of individual peptide biomarkers of PF454–62, ITIH4429–438, APOE198–207 and determined values of the area under the receiver operating characteristics (ROC) curve (AUC) of 0.67, 0.80, and 0.82 in early-stage CRC; 0.63, 0.72, and 0.70 in late-stage CRC; and 0.66, 0.77, and 0.79 in all-stage CRC, respectively. The power estimations in this study were all above 0.885. To gain further insights into the utility of the three biomarkers, we performed several predictive models that were evaluated by the AUC, sensitivity, specificity, and accuracy. The predictive performances of decision trees, random forests, support vector machines, and logistic regressions based on combinations of the three peptide biomarkers are summarized in Table S8. Of all the models we built, the random forest (RF) and logistic regression (LR) models achieved the best predictive performances. With the combination of two peptide biomarkers in early-stage CRC detection, we observed that PF454–62 + APOE198–207 with RF increased the AUC to 0.88 (p < 0.0001), the sensitivity was 81.5%, and the specificity was 82.5%. As for the combination of three peptide biomarkers in early-stage CRC detection, PF454–62 + APOE198–207 + ITIH4429-439 with LR increased the AUC to 0.90 (p < 0.00001), the sensitivity was 87.4%, and the specificity was 75.9% (Figure 3A,B). Further, with the combination of two peptide biomarkers in late-stage CRC detection, we observed that PF454–62 + APOE198–207 with LR increased the AUC to 0.84 (p < 0.0001), the sensitivity was 78.3%, and the specificity was 73.8%. As for the combination of three peptide biomarkers in late-stage CRC detection, PF454–62 + APOE198–207 + ITIH4429-439 with RF increased the AUC to 0.88 (p < 0.00001), the sensitivity was 76.4%, and the specificity was 80.4% (Figure 3A,B). Moreover, in a combination of two peptide biomarkers in all-stage CRC detection, ITIH4429–438 + APOE198–207 combined with RF increased the AUC to 0.94 (p < 0.0001), the sensitivity was 80.6%, and the specificity was 96.3%. In a combination of three peptide biomarkers in all-stage CRC detection, PF454–62 + APOE198–207 + ITIH4429–438 combined with RF increased the AUC to 0.96 (p < 0.00001), the sensitivity was 84.5%, and the specificity was 97.5% (Figure 3A,B). The combination of all three peptide biomarkers proved to be highly discriminatory for early-stage and all-stage CRC.
4. Discussion
To the best of our knowledge, this is the first study to combine WGA chromatography with an MRM assay and applied them in developing diagnostic biomarkers for CRC. Abundant proteins in plasma samples result in ion suppression and a matrix effect. In a previous study, WGA was used to remove abundant proteins such as albumin and enriched glycoproteins [28]. A pooling strategy by equalizing samples proved to be useful in biomarker discovery [36]. A pooling strategy can help overcome resource constraints while many individuals are analyzed; further, the variation in biological samples should be reduced and should provide increased power for detecting differences [37]. Herein, we subjected pooled CRC plasma samples and pooled HCs to WGA chromatography. After that, an LTQ elite Orbitrap mass spectrometer instrument was used to analyze the samples. In the discovery set, 55 plasma proteins and 356 plasma peptides in total were identified. We selected the peptides that received -10lgP <13 (p-value < 0.05) and were increased by >1.5-fold or decreased by <0.8-fold in the CRC groups for further validation. In addition to O-GlcNacylation, several PTMs that are related with cancer were also searched with Peaks PTM module, including glycosylation and methylation [38,39]. However, we found that the differentially expressed peptides were mostly unmodified and methylated. We speculated that glycosylation on peptides such as O-GlcNacylation may alter the ionization efficiency of peptides [40]. A study from Phueaouan et al. suggested O-GlcNacylation is enhanced in primary colorectal cancer tissues [41]. On a contradictory note, Krzeslak et al. showed that O-GlcNacylated protein levels were decreased in thyroid tumors [42]. Furthermore, few studies have been done to examine associated levels of glycosylated protein in blood from CRC. Thus, there is insufficient evidence to make a claim about the effect of glycosylation in cancer.
To further verify the results from the discovery set, the pooled plasma samples were purified, and differentially expressed peptides were analyzed on an Agilent 6470 triple-quadrupole mass spectrometer instrument to optimize the parameters in the LC and MS system. Also, the sensitivity (S/N > 5) and specificity (retention time within 0.1 in three transitions) were evaluated in this experiment to exclude invalid biomarker candidates. Finally, we received 6 peptides that satisfied the criteria. To further evaluate the discriminant ability of the diagnostic biomarker candidates, targeted LC-MS/MS was applied to confirm the differentially expressed peptides in the discovery set. A reference protein normalization (RFN) technique was applied in targeted LC-MS/MS. Zauber H. et al. suggested that a different species protein can be used as the reference protein to normalize and quantify without interfering with the co-analyzed sample peptides; the RFN technique is based on the addition of a protein of known concentration for normalization of sample peptide intensities [43]. Therefore, we added Bovine serum albumin (BSA) as a reference protein to 20 paired WGA-purified plasma samples to normalize the impacts from the matrix of samples. We discovered that unmodified peptides corresponding to WGA-captured PF4, ITIH4, and APOE were significantly different between CRC and HCs. Next, we attempted to develop a method to measure the glycoprotein level by targeting its unmodified peptides. The standard peptides and internal standard peptides with cleavage sites can provide insight into the cleavage process; for instance, extended stable isotope-labeled peptides were spiked into samples before tryptic digestion can compensate the digestion variability [44]. Further, studies showed that extended stable isotope-labeled peptides can be used as internal standards to account for sample processing and can provide precise and accurate results during LC-MS/MS assays [44,45]. Thus, extended peptides (as reference standards) and isotope-labeled extended peptides (as internal standards) corresponding to PF4, ITIH4, and APOE were synthesized to correct for the digestion efficiency, matrix effect, and instrument deviation. To simulate the matrix during analytical method development, an elegant study performed by Chen et al. suggested that chicken serum can be used as an alternative matrix when developing analytical methods [46]. To evaluate the performance of the analytical method, chicken serum samples were purified with WGA chromatography and spiked with exogenous peptides. Parameters including the calibration curve, sensitivity, selectivity, and carryover were evaluated in this study and proved to be acceptable for measuring by following the guideline from the U.S. Food and Drug Administration (FDA) [47]. Therefore, reliable semi-quantification of peptides corresponding to WGA-captured proteins was achieved. Finally, significantly altered peptides were semi-quantitated and evaluated in 400 plasma samples with the isotope-labeled MRM assay.
The in vitro diagnostic multivariate index assay (IVDMIA) was approved by the U.S. FDA in 2007. The score from the IVDMIA is calculated from a number of measurement values using algorithms [48]. Recently, machine learning has been combined with multiplex technologies to develop an IVDMIA; for example, Zang et. al. performed UPLC-MS/MS and machine learning methods to develop a metabolite-based IVDMIA to predict prostate cancer [49]. Hyun et al. used five biomarkers and applied algorithms, including random forest, a support vector machine, and logistic regression, to a validation cohort to predict non-small cell lung cancer [50]. Different algorithms may affect the diagnostic performance due to their characteristics. For example, LR is a traditional statistical model which is a linear model while RF is an ensemble learning method. In this study, PF454–62, ITIH4429–438, and APOE198–207 were incorporated with four machine learning algorithms to predict early-stage and all-stage CRC. In early-stage prediction, we received an AUC of 0.90, a sensitivity of 87.4%, and a specificity of 75.9% with LR (Figure 3A,B). In all-stage CRC, we received an AUC of 0.96, a sensitivity of 84.5% and a specificity of 97.5% with RF (Figure 3A,B). Further, many studies have applied MS with machine learning algorithms to develop diagnostic tools in CRC diagnosis. Marin-Vincente et al. utilized THBS1 and APOC3 with a decision tree classifier and achieved an AUC of 0.83 (with a sensitivity of 90% and a specificity of 65%) in CRC diagnoses [51]. Another study carried out by Xie et al. used CELA1, CEL2A, CTRL, and TRY2 with a logistic regression and achieved an AUC of 0.90 (with a sensitivity of 86.7% and a specificity of 83.3%) in CRC diagnoses [52]. Bhardwaj et al. utilized AREG, MASP1, OPN, PON3, and TR with the least absolute shrinkage and selection operator (LASSO) and achieved an AUC of 0.86 (with a sensitivity of 83% and a specificity of 80%) in diagnosing early-stage CRC [53]. In another study from Bhardwaj et al., they combined HP, LRG1, and PON3 with LASSO and achieved an AUC of 0.83 (with a sensitivity of 67% and a specificity of 80%) in diagnosing early-stage CRC; they also utilized eight biomarkers with LASSO and achieved an AUC of 0.96 (with a sensitivity of 93% and a specificity of 80%) in diagnosing late-stage CRC [54]. In this study, PF454–62, ITIH4429–438, and APOE198–207 were incorporated with four machine learning algorithms to predict early-stage and late-stage CRC. Compared to studies mentioned above, our results can provide higher sensitivity and higher specificity in diagnosing early-stage CRC and all-stage CRC with only three biomarkers. To further calculate the required sensitivity and specificity for our method to be of sufficient clinical performance for screening, Lord S. J. et al. suggested that the performance of an old test should be used as a reference to evaluate the new test [55]. Therefore, we compared our method with FOBT which is the gold standard for CRC screening. A meta-analysis from Ramdzan A. R. et al. reported that the sensitivity and specificity of FOBT are 31% and 87%, respectively [56]. In our study, PF454–62, ITIH4429–438, and APOE198–207 combined by RF can provide the sensitivity 84.5 % and specificity 97.5% on diagnosis all-stage CRC, which is better than FOBT.
The potential biomarkers we discovered in this study are related to highly abundant plasma proteins; for these, it is easy to detect the alteration in plasma. However, our method is to measure the unmodified peptides derived from glycosylated protein which still provide the specificity for diagnostic test. PF4 is known as an endocrine factor and is preserved in α-granules of megakaryocytes and mature platelets [57]. Although PF4 is not a glycoprotein, studies indicated that PF4 may bind to glycoprotein [58,59]. Furthermore, Muramatsu T. et al. suggested a glycoprotein binding protein can be captured by affinity chromatography, such as lectin chromatography [60]. We identified PF4 in WGA-binding proteins from CRC patients and HCs in a discovery study and further validated PF4 in the large samples. A study by Pucci et al. suggested that higher levels of PF4 in tumors can cause a poor survival rate in patients; additionally, they found that overproduction of PF4 can accelerate de novo adenocarcinogenesis and suggested that platelets can modulate the tumor microenvironment by releasing PF4 [61]. Zhang et al. discovered that PF4 can induce CRC recurrence in patients who received chemotherapy by suppressing antitumor immunity [62]. A study by Peterson et al. found that PF4 in platelets from CRC patients was significantly higher compared to levels in HCs; using a logistic regression, they achieved a sensitivity of 82.8% and a specificity of 79.4% in diagnosing CRC [63]. In our study, we observed that PF4 was significantly increased in early-stage CRC (2.65-fold) and late-stage CRC (2.90-fold) compared to healthy controls, which is consistent with previous studies. Altogether, our results and the literature indicate that PF4 is associated with CRC progression.
ITIH4 is secreted by the liver into the circulation and is related to inflammation. Further, ITIH4 was found to mediate tumorigenesis, invasion, and metastasis of solid tumors [64,65]. Huang et al. discovered that a decreasing level of ITIH4 promoted the invasion and metastasis of ovarian cancer cells; moreover, they also found that ITIH4 was downregulated in tissues from patients with ovarian cancer compared to HCs [66]. An elegant study of liver cancer by Lee et al. suggested that increasing levels of ITIH4 can suppress tumor invasion, whereas decreasing levels of ITIH4 can promote tumor metastasis [67]. A retrospective study by Hamm et al. showed that the gene expression level of ITIH4 decreased in tissues derived from patients with CRC [68]. Further, detection of ITIH4 by an enzyme-linked immunosorbent assay was applied to patients with CRC and achieved a sensitivity of 78.2% and a specificity of 76.3% [69]. In our study, a decreasing level of ITIH4 was observed in early-stage CRC (0.68-fold) and late-stage CRC (0.69-fold), which is consistent with previous studies.
APOE, a glycoprotein that is associates with triglyceride-rich lipoproteins, were predominantly synthesized in the liver; further, APOE mediates the clearance of triglyceride and lipoprotein remnants [70]. While it is unclear exactly how APOE is involved in cancer development, it may affect the metabolism of cancer cells by regulating lipid homeostasis [71]. Further, Pencheva et al. suggested that APOE secreted from melanoma cells can bind to APOE receptors on melanoma cells and endothelial cells to inhibit invasiveness and clustering [72]. A study by Buss et al. showed that APOE−/− breast cancer tumors grew faster than controls in mouse experiments, which indicates that APOE may inhibit cancer development [73]. Moreover, EI-Bahrawy et al. suggested that APOE plays an important role in metabolism in the colon; the absence of APOE can stimulate the expression of cyclooxygenase (COX)-2 by increasing oxidized low-density lipoprotein (Ox-LDL) and tumor necrosis factor (TNF)-α, which induces inflammation and results in colon disease [74]. Also, APOE was observed to have decreased in lung cancer tissues [75]. Decreases in APOE in patients with early-stage CRC (0.58-fold) and late-stage CRC (0.68-fold) in this study indicated that APOE may play roles in suppressing tumor activity, which is consistent with previous studies.
5. Conclusions
In the present study, we discovered that PF454–62 was greatly increased while ITIH4429–438 and APOE198–207 were significantly decreased in CRC patients. Moreover, we combined PF454–62, APOE198–207, and ITIH4429–438 with RF to predict all-stage CRC and received an AUC of 0.96, a sensitivity of 84.5% and a specificity of 97.5%. In conclusion, our study demonstrated that WGA-captured plasma PF454–62, ITIH4429–438, and APOE198–207 levels in combination may serve as highly effective early diagnostic biomarkers for patients with CRC.
Acknowledgments
LTQ-Orbitrap data were acquired at Academia Sinica Common Mass Spectrometry Facilities for Proteomics and Protein Modification Analysis located at the Institute of Biological Chemistry, Academia Sinica (Taipei, Taiwan), supported by the Academia Sinica Core Facility and Innovative Instrument Project (AS-CFII-108-107).
Supplementary Materials
The following are available online at https://www.mdpi.com/article/10.3390/cancers13092190/s1, Table S1. Differentially expressed peptides derived from wheat germ agglutinin (WGA)-captured proteins with nanoLC-MS/MS from healthy control groups compared to patients at various colorectal cancer (CRC) stages, Table S2. Multiple-reaction monitoring (MRM) transitions and mass parameters of selected peptides, Table S3. Information of the synthetic extended peptides and their internal standards, Table S4. Results of the calibration curve analysis, Table S5. Results of the specificity analysis. Table S6. Results of the sensitivity analysis, Table S7. Results of the carryover analysis, Table S8. Predictive performance and power analysis using peptide factor 4 (PF4), inter-alpha-trypsin inhibitor heavy chain H4 (ITIH4), apolipoprotein E (APOE), and combinations of the three biomarkers, File S1: The supplementary information for analytical method development.
Author Contributions
Conceptualization, C.-Y.L., E.C.-Y.S. and I.-L.T.; methodology, I.-L.T.; statistical analysis E.C.-Y.S. and I.-J.T.; experiment execution I.-J.T.; writing—original draft preparation, I.-J.T.; writing—review and editing, C.-Y.L.; supervision, C.-Y.L. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Institutional Review Board Statement
The study was conducted according to the guidelines of the Declaration of Helsinki and approved by the Taipei Medical University-Joint Institutional Review Board (nos. 201308022 and N202007061).
Informed Consent Statement
Informed consent was obtained from all subjects involved in the study.
Data Availability Statement
The mass spectrometry proteomics data have been deposited to ProteomeXchange Consortium via the PRIDE [33] partner repository with the dataset identifier PXD024997.
Conflicts of Interest
The authors declare no conflict of interest.
Footnotes
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Kuo C.-N., Liao Y.-M., Kuo L.-N., Tsai H.-J., Chang W.-C., Yen Y. Cancers in Taiwan: Practical insight from epidemiology, treatments, biomarkers, and cost. J. Formos. Med. Assoc. 2020;119:1731–1741. doi: 10.1016/j.jfma.2019.08.023. [DOI] [PubMed] [Google Scholar]
- 2.Bhandari A., Woodhouse M., Gupta S. Colorectal cancer is a leading cause of cancer incidence and mortality among adults younger than 50 years in the USA: A SEER-based analysis with comparison to other young-onset cancers. J. Investig. Med. 2017;65:311–315. doi: 10.1136/jim-2016-000229. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Douaiher J., Ravipati A., Grams B., Chowdhury S., Alatise O., Are C. Colorectal cancer-global burden, trends, and geographical variations. J. Surg. Oncol. 2017;115:619–630. doi: 10.1002/jso.24578. [DOI] [PubMed] [Google Scholar]
- 4.Ang C.S., Baker M.S., Nice E.C. Mass Spectrometry-Based Analysis for the Discovery and Validation of Potential Colorectal Cancer Stool Biomarkers. Methods Enzymol. 2017;586:247–274. doi: 10.1016/bs.mie.2016.10.019. [DOI] [PubMed] [Google Scholar]
- 5.Huo Y.R., Huang Y., Liauw W., Zhao J., Morris D.L. Prognostic Value of Carcinoembryonic Antigen (CEA), AFP, CA19-9 and CA125 for Patients with Colorectal Cancer with Peritoneal Carcinomatosis Treated by Cytoreductive Surgery and Intraperitoneal Chemotherapy. Anticancer Res. 2016;36:1041–1049. [PubMed] [Google Scholar]
- 6.Gao Y., Wang J., Zhou Y., Sheng S., Qian S.Y., Huo X. Evaluation of Serum CEA, CA19-9, CA72-4, CA125 and Ferritin as Diagnostic Markers and Factors of Clinical Parameters for Colorectal Cancer. Sci. Rep. 2018;8:2732. doi: 10.1038/s41598-018-21048-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Han X., Aslanian A., Yates J.R., 3rd Mass spectrometry for proteomics. Curr. Opin. Chem. Biol. 2008;12:483–490. doi: 10.1016/j.cbpa.2008.07.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Saleem S., Tariq S., Aleem I., Sadr-Ul S., Tahseen M., Atiq A., Hassan S., Abu Bakar M., Khattak S., Syed A.A., et al. Proteomics analysis of colon cancer progression. Clin. Proteom. 2019;16:44. doi: 10.1186/s12014-019-9264-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Simpson K.L., Whetton A.D., Dive C. Quantitative mass spectrometry-based techniques for clinical use: Biomarker identification and quantification. J. Chromatogr. B Anal. Technol. Biomed. Life Sci. 2009;877:1240–1249. doi: 10.1016/j.jchromb.2008.11.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Shiromizu T., Kume H., Ishida M., Adachi J., Kano M., Matsubara H., Tomonaga T. Quantitation of putative colorectal cancer biomarker candidates in serum extracellular vesicles by targeted proteomics. Sci. Rep. 2017;7:12782. doi: 10.1038/s41598-017-13092-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Beretov J., Wasinger V.C., Millar E.K.A., Schwartz P., Graham P.H., Li Y. Proteomic Analysis of Urine to Identify Breast Cancer Biomarker Candidates Using a Label-Free LC-MS/MS Approach. PLoS ONE. 2015;10:e0141876. doi: 10.1371/journal.pone.0141876. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Kumar V., Ray S., Ghantasala S., Srivastava S. An Integrated Quantitative Proteomics Workflow for Cancer Biomarker Discovery and Validation in Plasma. Front. Oncol. 2020;10 doi: 10.3389/fonc.2020.543997. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Dardé V.M., Barderas M.G., Vivanco F. Multiple reaction monitoring (MRM) of plasma proteins in cardiovascular proteomics. Methods Mol. Biol. 2013;1000:191–199. doi: 10.1007/978-1-62703-405-0_14. [DOI] [PubMed] [Google Scholar]
- 14.Arora A., Patil V., Kundu P., Kondaiah P., Hegde A.S., Arivazhagan A., Santosh V., Pal D., Somasundaram K. Serum biomarkers identification by iTRAQ and verification by MRM: S100A8/S100A9 levels predict tumor-stroma involvement and prognosis in Glioblastoma. Sci. Rep. 2019;9:2749. doi: 10.1038/s41598-019-39067-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Kobata A., Amano J. Altered glycosylation of proteins produced by malignant cells, and application for the diagnosis and immunotherapy of tumours. Immunol. Cell Biol. 2005;83:429–439. doi: 10.1111/j.1440-1711.2005.01351.x. [DOI] [PubMed] [Google Scholar]
- 16.Taniguchi N., Kizuka Y. Glycans and cancer: Role of N-glycans in cancer biomarker, progression and metastasis, and therapeutics. Adv. Cancer Res. 2015;126:11–51. doi: 10.1016/bs.acr.2014.11.001. [DOI] [PubMed] [Google Scholar]
- 17.Durand G., Seta N. Protein glycosylation and diseases: Blood and urinary oligosaccharides as markers for diagnosis and therapeutic monitoring. Clin. Chem. 2000;46:795–805. doi: 10.1093/clinchem/46.6.795. [DOI] [PubMed] [Google Scholar]
- 18.Zhao J., Qiu W., Simeone D.M., Lubman D.M. N-linked glycosylation profiling of pancreatic cancer serum using capillary liquid phase separation coupled with mass spectrometric analysis. J. Proteome Res. 2007;6:1126–1138. doi: 10.1021/pr0604458. [DOI] [PubMed] [Google Scholar]
- 19.Wei H., Cheng Z., Ouyang C., Zhang Y., Hu Y., Chen S., Wang C., Lu F., Zhang J., Wang Y., et al. Glycoprotein screening in colorectal cancer based on differentially expressed Tn antigen. Oncol. Rep. 2016;36:1313–1324. doi: 10.3892/or.2016.4937. [DOI] [PubMed] [Google Scholar]
- 20.Ju L., Wang Y., Xie Q., Xu X., Li Y., Chen Z., Li Y. Elevated level of serum glycoprotein bifucosylation and prognostic value in Chinese breast cancer. Glycobiology. 2016;26:460–471. doi: 10.1093/glycob/cwv117. [DOI] [PubMed] [Google Scholar]
- 21.Duskova K., Vesely S. Prostate specific antigen. Current clinical application and future prospects. Biomed. Pap. Med. Fac. Univ. Palacky Olomouc Czech Repub. 2015;159:18–26. doi: 10.5507/bp.2014.046. [DOI] [PubMed] [Google Scholar]
- 22.Kailemia M.J., Park D., Lebrilla C.B. Glycans and glycoproteins as specific biomarkers for cancer. Anal. Bioanal. Chem. 2017;409:395–410. doi: 10.1007/s00216-016-9880-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Poruk K.E., Gay D.Z., Brown K., Mulvihill J.D., Boucher K.M., Scaife C.L., Firpo M.A., Mulvihill S.J. The clinical utility of CA 19-9 in pancreatic adenocarcinoma: Diagnostic and prognostic updates. Curr. Mol. Med. 2013;13:340–351. doi: 10.2174/1566524011313030003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Ideo H., Kondo J., Nomura T., Nonomura N., Inoue M., Amano J. Study of glycosylation of prostate-specific antigen secreted by cancer tissue-originated spheroids reveals new candidates for prostate cancer detection. Sci. Rep. 2020;10:2708. doi: 10.1038/s41598-020-59622-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Aimjongjun S., Reamtong O., Janvilisri T. Lectin affinity chromatography and quantitative proteomic analysis reveal that galectin-3 is associated with metastasis in nasopharyngeal carcinoma. Sci. Rep. 2020;10:16462. doi: 10.1038/s41598-020-73498-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Totten S.M., Adusumilli R., Kullolli M., Tanimoto C., Brooks J.D., Mallick P., Pitteri S.J. Multi-lectin Affinity Chromatography and Quantitative Proteomic Analysis Reveal Differential Glycoform Levels between Prostate Cancer and Benign Prostatic Hyperplasia Sera. Sci. Rep. 2018;8:6509. doi: 10.1038/s41598-018-24270-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Zhao J., Patwa T.H., Lubman D.M., Simeone D.M. Protein biomarkers in cancer: Natural glycoprotein microarray approaches. Curr. Opin. Mol. 2008;10:602–610. [PMC free article] [PubMed] [Google Scholar]
- 28.Lee A., Nakano M., Hincapie M., Kolarich D., Baker M.S., Hancock W.S., Packer N.H. The lectin riddle: Glycoproteins fractionated from complex mixtures have similar glycomic profiles. Omics. 2010;14:487–499. doi: 10.1089/omi.2010.0075. [DOI] [PubMed] [Google Scholar]
- 29.Chou T.Y., Hart G.W. O-linked N-acetylglucosamine and cancer: Messages from the glycosylation of c-Myc. Adv. Exp. Med. Biol. 2001;491:413–418. doi: 10.1007/978-1-4615-1267-7_26. [DOI] [PubMed] [Google Scholar]
- 30.Song E., Zhu R., Hammoud Z.T., Mechref Y. LC–MS/MS Quantitation of Esophagus Disease Blood Serum Glycoproteins by Enrichment with Hydrazide Chemistry and Lectin Affinity Chromatography. J. Proteome Res. 2014;13:4808–4820. doi: 10.1021/pr500570m. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Drake P.M., Schilling B., Niles R.K., Prakobphol A., Li B., Jung K., Cho W., Braten M., Inerowicz H.D., Williams K., et al. Lectin Chromatography/Mass Spectrometry Discovery Workflow Identifies Putative Biomarkers of Aggressive Breast Cancers. J. Proteome Res. 2012;11:2508–2520. doi: 10.1021/pr201206w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Ahn Y.H., Shin P.M., Oh N.R., Park G.W., Kim H., Yoo J.S. A lectin-coupled, targeted proteomic mass spectrometry (MRM MS) platform for identification of multiple liver cancer biomarkers in human plasma. J. Proteom. 2012;75:5507–5515. doi: 10.1016/j.jprot.2012.06.027. [DOI] [PubMed] [Google Scholar]
- 33.Deutsch E.W., Csordas A., Sun Z., Jarnuczak A., Perez-Riverol Y., Ternent T., Campbell D.S., Bernal-Llinares M., Okuda S., Kawano S., et al. The ProteomeXchange consortium in 2017: Supporting the cultural change in proteomics public data deposition. Nucleic Acids Res. 2017;45:D1100–D1106. doi: 10.1093/nar/gkw936. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Rauniyar N. Parallel Reaction Monitoring: A Targeted Experiment Performed Using High Resolution and High Mass Accuracy Mass Spectrometry. Int. J. Mol. Sci. 2015;16:28566–28581. doi: 10.3390/ijms161226120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Yuan L., Zhu M. Quantitative Bioanalysis of Proteins by Mass Spectrometry. Mater. Methods. 2015;5 doi: 10.13070/mm.en.5.1332. [DOI] [Google Scholar]
- 36.Tsai K.-L., Chang C.-C., Chang Y.-S., Lu Y.-Y., Tsai I.J., Chen J.-H., Lin S.-H., Tai C.-C., Lin Y.-F., Chang H.-W., et al. Isotypes of autoantibodies against novel differential 4-hydroxy-2-nonenal-modified peptide adducts in serum is associated with rheumatoid arthritis in Taiwanese women. BMC Med. Inform. Decis. Mak. 2021;21:49. doi: 10.1186/s12911-020-01380-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Diz A.P., Truebano M., Skibinski D.O.F. The consequences of sample pooling in proteomics: An empirical study. Electrophoresis. 2009;30:2967–2975. doi: 10.1002/elps.200900210. [DOI] [PubMed] [Google Scholar]
- 38.Rodríguez-Paredes M., Lyko F. The importance of non-histone protein methylation in cancer therapy. Nat. Rev. Mol. Cell Biol. 2019;20:569–570. doi: 10.1038/s41580-019-0147-x. [DOI] [PubMed] [Google Scholar]
- 39.Stowell S.R., Ju T., Cummings R.D. Protein glycosylation in cancer. Annu. Rev. Pathol. 2015;10:473–510. doi: 10.1146/annurev-pathol-012414-040438. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Illiano A., Pinto G., Melchiorre C., Carpentieri A., Faraco V., Amoresano A. Protein Glycosylation Investigated by Mass Spectrometry: An Overview. Cells. 2020;9:1986. doi: 10.3390/cells9091986. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Phueaouan T., Chaiyawat P., Netsirisawan P., Chokchaichamnankit D., Punyarit P., Srisomsap C., Svasti J., Champattanachai V. Aberrant O-GlcNAc-modified proteins expressed in primary colorectal cancer. Oncol. Rep. 2013;30:2929–2936. doi: 10.3892/or.2013.2794. [DOI] [PubMed] [Google Scholar]
- 42.Krześlak A., Jóźwiak P., Lipińska A. Down-regulation of β-N-acetyl-D-glucosaminidase increases Akt1 activity in thyroid anaplastic cancer cells. Oncol. Rep. 2011;26:743–749. doi: 10.3892/or.2011.1333. [DOI] [PubMed] [Google Scholar]
- 43.Zauber H., Schüler V., Schulze W. Systematic evaluation of reference protein normalization in proteomic experiments. Front. Plant Sci. 2013;4:25. doi: 10.3389/fpls.2013.00025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Faria M., Halquist M.S., Yuan M., Mylott W., Jr., Jenkins R.G., Karnes H.T. An extended stable isotope-labeled signature peptide internal standard for tracking immunocapture of human plasma osteopontin for LC-MS/MS quantification. Biomed. Chromatogr. BMC. 2015;29:1780–1782. doi: 10.1002/bmc.3471. [DOI] [PubMed] [Google Scholar]
- 45.Faria M., Halquist M.S. Internal Standards for Absolute Quantification of Large Molecules (Proteins) from Biological Matrices by LC-MS/MS. Calibration Valid. Anal. Methods A Sampl. Curr. Approaches. 2018;61 doi: 10.5772/intechopen.75569. [DOI] [Google Scholar]
- 46.Chen H.F., Shiao C.Y., Wu M.Y., Lin Y.C., Chen H.H., Chang W.C., Wu M.S., Kao C.C., Tsai I.L. Quantitative determination of human IgA subclasses and their Fc-glycosylation patterns in plasma by using a peptide analogue internal standard and ultra-high-performance liquid chromatography/triple quadrupole mass spectrometry. Rapid Commun. Mass Spectrom. 2020;34:e8606. doi: 10.1002/rcm.8606. [DOI] [PubMed] [Google Scholar]
- 47.Kim H., Sohn A., Yeo I., Yu S.J., Yoon J.H., Kim Y. Clinical Assay for AFP-L3 by Using Multiple Reaction Monitoring-Mass Spectrometry for Diagnosing Hepatocellular Carcinoma. Clin. Chem. 2018;64:1230–1238. doi: 10.1373/clinchem.2018.289702. [DOI] [PubMed] [Google Scholar]
- 48.Kato K. Algorithm for in vitro diagnostic multivariate index assay. Breast Cancer. 2009;16:248–251. doi: 10.1007/s12282-009-0141-9. [DOI] [PubMed] [Google Scholar]
- 49.Zang X., Jones C.M., Long T.Q., Monge M.E., Zhou M., Walker L.D., Mezencev R., Gray A., McDonald J.F., Fernández F.M. Feasibility of detecting prostate cancer by ultraperformance liquid chromatography-mass spectrometry serum metabolomics. J. Proteome Res. 2014;13:3444–3454. doi: 10.1021/pr500409q. [DOI] [PubMed] [Google Scholar]
- 50.Lee H.J., Kim Y.T., Park P.J., Shin Y.S., Kang K.N., Kim Y., Kim C.W. A novel detection method of non-small cell lung cancer using multiplexed bead-based serum biomarker profiling. J. Thorac. Cardiovasc. Surg. 2012;143:421–427. doi: 10.1016/j.jtcvs.2011.10.046. [DOI] [PubMed] [Google Scholar]
- 51.Marín-Vicente C., Mendes M., de los Ríos V., Fernández-Aceñero M.J., Casal J.I. Identification and Validation of Stage-Associated Serum Biomarkers in Colorectal Cancer Using MS-Based Procedures. Proteom. Clin. Appl. 2020;14:1900052. doi: 10.1002/prca.201900052. [DOI] [PubMed] [Google Scholar]
- 52.Xie Y., Chen L., Lv X., Hou G., Wang Y., Jiang C., Zhu H., Xu N., Wu L., Lou X., et al. The levels of serine proteases in colon tissue interstitial fluid and serum serve as an indicator of colorectal cancer progression. Oncotarget. 2016;7:32592–32606. doi: 10.18632/oncotarget.8693. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Bhardwaj M., Gies A., Weigl K., Tikk K., Benner A., Schrotz-King P., Borchers C.H., Brenner H. Evaluation and Validation of Plasma Proteins Using Two Different Protein Detection Methods for Early Detection of Colorectal Cancer. Cancers. 2019;11:1426. doi: 10.3390/cancers11101426. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Bhardwaj M., Weigl K., Tikk K., Holland-Letz T., Schrotz-King P., Borchers C.H., Brenner H. Multiplex quantitation of 270 plasma protein markers to identify a signature for early detection of colorectal cancer. Eur. J. Cancer. 2020;127:30–40. doi: 10.1016/j.ejca.2019.11.021. [DOI] [PubMed] [Google Scholar]
- 55.Lord S.J., Irwig L., Simes R.J. When is measuring sensitivity and specificity sufficient to evaluate a diagnostic test, and when do we need randomized trials? Ann. Intern. Med. 2006;144:850–855. doi: 10.7326/0003-4819-144-11-200606060-00011. [DOI] [PubMed] [Google Scholar]
- 56.Ramdzan A.R., Abd Rahim M.A., Mohamad Zaki A., Zaidun Z., Mohammed Nawi A. Diagnostic Accuracy of FOBT and Colorectal Cancer Genetic Testing: A Systematic Review & Meta-Analysis. Ann. Glob. Health. 2019;85 doi: 10.5334/aogh.2466. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Ruytinx P., Proost P., Struyf S. CXCL4 and CXCL4L1 in cancer. Cytokine. 2018;109:65–71. doi: 10.1016/j.cyto.2018.02.022. [DOI] [PubMed] [Google Scholar]
- 58.Lord M.S., Cheng B., Farrugia B.L., McCarthy S., Whitelock J.M. Platelet Factor 4 Binds to Vascular Proteoglycans and Controls Both Growth Factor Activities and Platelet Activation. J. Biol. Chem. 2017;292:4054–4063. doi: 10.1074/jbc.M116.760660. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Sikara M.P., Routsias J.G., Samiotaki M., Panayotou G., Moutsopoulos H.M., Vlachoyiannopoulos P.G. β2 Glycoprotein I (β2GPI) binds platelet factor 4 (PF4): Implications for the pathogenesis of antiphospholipid syndrome. Blood. 2010;115:713–723. doi: 10.1182/blood-2009-03-206367. [DOI] [PubMed] [Google Scholar]
- 60.Muramatsu T., Muramatsu H., Kojima T. Identification of proteoglycan-binding proteins. Methods Enzymol. 2006;416:263–278. doi: 10.1016/s0076-6879(06)16017-6. [DOI] [PubMed] [Google Scholar]
- 61.Pucci F., Rickelt S., Newton A.P., Garris C., Nunes E., Evavold C., Pfirschke C., Engblom C., Mino-Kenudson M., Hynes R.O., et al. PF4 Promotes Platelet Production and Lung Cancer Growth. Cell Rep. 2016;17:1764–1772. doi: 10.1016/j.celrep.2016.10.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Zhang Y., Gao J., Wang X., Deng S., Ye H., Guan W., Wu M., Zhu S., Yu Y., Han W. CXCL4 mediates tumor regrowth after chemotherapy by suppression of antitumor immunity. Cancer Biol. 2015;16:1775–1783. doi: 10.1080/15384047.2015.1095404. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Peterson J.E., Zurakowski D., Italiano J.E., Jr., Michel L.V., Connors S., Oenick M., D’Amato R.J., Klement G.L., Folkman J. VEGF, PF4 and PDGF are elevated in platelets of colorectal cancer patients. Angiogenesis. 2012;15:265–273. doi: 10.1007/s10456-012-9259-z. [DOI] [PubMed] [Google Scholar]
- 64.Bhanumathy C.D., Tang Y., Monga S.P., Katuri V., Cox J.A., Mishra B., Mishra L. Itih-4, a serine protease inhibitor regulated in interleukin-6-dependent liver formation: Role in liver development and regeneration. Dev. Dyn. 2002;223:59–69. doi: 10.1002/dvdy.1235. [DOI] [PubMed] [Google Scholar]
- 65.Tang Y., Kitisin K., Jogunoori W., Li C., Deng C.X., Mueller S.C., Ressom H.W., Rashid A., He A.R., Mendelson J.S., et al. Progenitor/stem cells give rise to liver cancer due to aberrant TGF-beta and IL-6 signaling. Proc. Natl. Acad. Sci. USA. 2008;105:2445–2450. doi: 10.1073/pnas.0705395105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Huang M., Zhang W., Zhao B., Li L. Relationship between inter-α-trypsin inhibitor heavy chain 4 and ovarian cancer. Chin. J. Cancer Res. 2019;31:955–964. doi: 10.21147/j.issn.1000-9604.2019.06.11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Lee E.J., Yang S.H., Kim K.J., Cha H., Lee S.J., Kim J.H., Song J., Chun K.H., Seong J. Inter-alpha Inhibitor H4 as a Potential Biomarker Predicting the Treatment Outcomes in Patients with Hepatocellular Carcinoma. Cancer Res. Treat. 2018;50:646–657. doi: 10.4143/crt.2016.550. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Hamm A., Veeck J., Bektas N., Wild P.J., Hartmann A., Heindrichs U., Kristiansen G., Werbowetski-Ogilvie T., Del Maestro R., Knuechel R., et al. Frequent expression loss of Inter-alpha-trypsin inhibitor heavy chain (ITIH) genes in multiple human solid tumors: A systematic expression analysis. BMC Cancer. 2008;8:25. doi: 10.1186/1471-2407-8-25. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Jiang X., Bai X.Y., Li B., Li Y., Xia K., Wang M., Li S., Wu H. Plasma Inter-Alpha-Trypsin Inhibitor Heavy Chains H3 and H4 Serve as Novel Diagnostic Biomarkers in Human Colorectal Cancer. Dis. Markers. 2019;2019:5069614. doi: 10.1155/2019/5069614. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Marais A.D. Apolipoprotein E in lipoprotein metabolism, health and cardiovascular disease. Pathology. 2019;51:165–176. doi: 10.1016/j.pathol.2018.11.002. [DOI] [PubMed] [Google Scholar]
- 71.Long J., Zhang C.-J., Zhu N., Du K., Yin Y.-F., Tan X., Liao D.-F., Qin L. Lipid metabolism and carcinogenesis, cancer development. Am. J. Cancer Res. 2018;8:778–791. [PMC free article] [PubMed] [Google Scholar]
- 72.Pencheva N., Tran H., Buss C., Huh D., Drobnjak M., Busam K., Tavazoie S.F. Convergent multi-miRNA targeting of ApoE drives LRP1/LRP8-dependent melanoma metastasis and angiogenesis. Cell. 2012;151:1068–1082. doi: 10.1016/j.cell.2012.10.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Buss L.A., Mandani A., Phillips E., Scott N.J.A., Currie M.J., Dachs G.U. Characterisation of a Mouse Model of Breast Cancer with Metabolic Syndrome. In Vivo. 2018;32:1071–1080. doi: 10.21873/invivo.11348. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.El-Bahrawy A., Tarhuni A., Kim H., Subramaniam V., Benslimane I., Abd Elmageed Z.Y., Okpechi S., Ghonim M., Hemeida R., Abo-Yousef A., et al. ApoEdeficiency promotes colon inflammation and enhances inflammatory potential oxidized-LDL and TNF-α in colon epithelial cells. Biosci. Rep. 2016;36 doi: 10.1042/bsr20160195. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.An H.J., Koh H.M., Song D.H. Apolipoprotein E is a predictive marker for assessing non-small cell lung cancer patients with lymph node metastasis. Pathol. Res. Pract. 2019;215:152607. doi: 10.1016/j.prp.2019.152607. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The mass spectrometry proteomics data have been deposited to ProteomeXchange Consortium via the PRIDE [33] partner repository with the dataset identifier PXD024997.