Abstract
Pancreatic cancer (PC) is difficult to detect in the early stages; thus, identifying specific and sensitive biomarkers for PC diagnosis is crucial, especially in the case of early-stage tumors. Circulating microRNAs are promising non-invasive biomarkers. Therefore, we aimed to identify non-invasive miRNA biomarkers and build a model for PC diagnosis. For the training model, blood serum samples from 63 PC patients and 63 control subjects were used. We selected 39 miRNA markers using a smoothly clipped absolute deviation-based penalized support vector machine and built a PC diagnosis model. From the double cross-validation, the average test AUC was 0.98. We validated the diagnosis model using independent samples from 25 PC patients and 81 patients with intrahepatic cholangiocarcinoma (ICC) and compared the results with those obtained from the diagnosis using carbohydrate antigen 19-9. For the markers miR-155-5p, miR-4284, miR-346, miR-7145-5p, miR-5100, miR-661, miR-22-3p, miR-4486, let-7b-5p, and miR-4703-5p, we conducted quantitative reverse transcription PCR using samples from 17 independent PC patients, 8 ICC patients, and 8 healthy individuals. Differential expression was observed in samples from PC patients. The diagnosis model based on the identified markers showed high sensitivity and specificity for PC detection and is potentially useful for early PC diagnosis.
Keywords: microRNA, pancreatic cancer, diagnosis, sensitivity, specificity, biomarker
1. Introduction
Pancreatic cancer (PC) is one of the leading causes of cancer-related mortality, as the symptoms of PC seldom appear in the early stages of the disease, and the cancer is mostly detected after it has metastasized to other organs. According to cancer statistics in 2020, the five-year survival rate of patients with PC is 9%, although that of patients with localized PC is higher than 37%, based on patients diagnosed with pancreatic cancer between 2009 and 2015 [1].
The most effective strategy for reducing PC-related mortality is early diagnosis and treatment. However, the lack of reliable markers for PC detection reduces the efficacy of screening strategies in at-risk populations, such as those with chronic pancreatitis [2]. Carbohydrate antigen 19-9 (CA19-9) and carcinoembryonic antigen (CEA) are the most commonly used serological biomarkers; however, they lack sufficient sensitivity and specificity for the detection of PC [3]. To improve the prognosis of patients with this form of cancer, it is important to identify diagnostic biomarkers for PC.
Recently, microRNAs (miRNAs), which are small non-coding RNA molecules, have been reported to play important roles in post-transcriptional regulation in cancer [4]. Increasing evidence has shown that miRNAs are essential for the development, diagnosis, and prognosis of cancer, suggesting that these RNAs have potential for use as diagnostic markers in cancer [5]. To date, nearly 100 miRNAs have been identified to be associated with PC using tissue samples [2]. However, it is difficult to perform tissue biopsies in every patient suspected of having PC. Therefore, the optimal biomarkers would be non-invasive and derived from blood, such as circulating miRNAs, which may be readily collected from the patient. Another reason for using circulating miRNAs as biomarkers is their remarkable stability in plasma and serum. They are protected from RNAse degradation as they can be packaged in microparticles (e.g., exosomes) or bound to Argonaut proteins or high-density lipoproteins [6,7,8,9,10].
Currently, highly sensitive and specific invasive biomarkers are not available for the detection of PC. Therefore, the primary objective of this study was to identify non-invasive miRNA biomarkers and to build a prediction model for the diagnosis of PC. In this study, 63 PC patients and 63 control subjects were used for the identification of miRNA biomarkers, and an additional 25 PC samples and 81 intrahepatic cholangiocarcinoma (ICC) samples were used for the validation of our proposed prediction model. For comparison, we also obtained diagnosis results based on serum levels of CA19-9 in the same blood samples. For additional validation, quantitative reverse transcription PCR (qRT-PCR) was conducted using additional RNA samples from 17 patients with PC, 8 patients with ICC, and 8 healthy individuals.
2. Materials and Methods
2.1. Study Design
The present study included 105 patients with PC, 109 patients with ICC, 7 patients with stomach cancer (SC), 5 patients with colorectal cancer (CRC), 2 patients with gastrointestinal stromal tumor (GIST), 10 patients with cholelithiasis (Ch), and 27 healthy subjects who had been clinically classified at the time of participation. A case-control study was designed to identify differentially expressed miRNAs (DEmiRNAs) between the case-control groups and to build a diagnostic model for PC. For cases, 63 PC patients were used, and for controls, two types of control groups were used.
The first type of control group consisted of 19 healthy subjects and 10 Ch patients. The second type of control group, the non-PC group, included samples from patients with other cancers as well as those from non-cancer subjects. In particular, we included 20 ICC patients, 7 SC patients, 5 CRC patients, and 2 GIST patients. We set aside 25 PC and 81 ICC samples for the validation study. The clinical characteristics of the samples in the microarray experiments and the grouping details are presented in Table 1. qRT-PCR was conducted using samples from 17 PC patients, 8 ICC patients, and 8 healthy individuals. The purpose of our study was not only to identify biomarkers for PC but also to build a prediction model. Therefore, although the age of the subjects was significantly different between the case and control groups, we decided to use the model without the covariate, as the model with the covariate had a similar prediction performance to the model without the covariate. The study protocol conformed to the ethical guidelines of the 1975 Declaration of Helsinki, and the Ethical Committee and Institutional Review Board of Yonsei University College of Medicine approved the protocol of serum acquisition from the patients’ specimens. Written informed consent was obtained from all participating patients and healthy controls (IRB approval code 4-2012-0528, 20 September 2012).
Table 1.
Variables | Training Cohort | Validation Cohort | |||||||
---|---|---|---|---|---|---|---|---|---|
PC | ICC | CRC | SC | GIST | Ch | N | PC | ICC | |
Count | 63 | 20 | 5 | 7 | 2 | 10 | 19 | 25 | 81 |
Age | 63.0 ± 9.6 | 66.4 ± 10.5 | 66.2 ± 5.2 | 60.0 ± 14.4 | 54.5 ± 10.6 | 60.2 ± 12.1 | 46.8 ± 9.8 | 66.0 ± 8.1 | 64.6 ± 7.1 |
Female | 19 (30.2) | 8 (40.0) | 2 (40.0) | 1 (14.3) | 1 (50.0) | 6 (54.5) | 6 (30.0) | 7 (28.0) | 6 (33.3) |
Stage * | |||||||||
I | 4 | 1 | 1 | 1 | 2 | 1 | 7 | ||
II | 12 | 7 | 2 | 1 | 4 | 18 | |||
III | 13 | - | - | 1 | - | 8 | 8 | ||
IV | 34 | - | 2 | 4 | - | 12 | 48 | ||
CA19-9, U/mL | |||||||||
Median level | 336.5 ± 6055.7 | 181.5 ± 4441.1 | 20.0 ± 28.8 | 62.1 ± 355.6 | 8.91 ± 9.89 | 10.1 ± 5.5 | 196.9 ± 5985.5 | 45.8 ± 5287.2 | |
≤37 | 16 (25.4) | 6 (30.0) | 4 (80.0) | 3 (42.9) | 2 (100) | 19 (100) | 7 (28.0) | 37 (46.8) | |
>37 | 47 (74.6) | 14 (70.0) | 1 (20.0) | 4 (57.1) | 0 (0) | 0 (0) | 18 (72.0) | 42 (53.2) | |
Overall Survival, months | 15.4 | 19.6 | 32.8 | 62.4 | 25.9 | - | 12 | 21.9 |
* Tumor stages were based on the staging classification of the 7th edition of the American Joint Committee on Cancer. Variables are expressed as mean ± standard deviation, median ± standard deviation, or n (%). PC, pancreatic cancer; ICC, intrahepatic cholangiocarcinoma; SC, stomach cancer; CRC, colorectal cancer; GIST, gastrointestinal stromal tumors; Ch, cholelithiasis; N, normal; SD, standard deviation; CA19-9, carbohydrate antigen 19-1.
2.2. Sample Preparation
Patient samples were prospectively obtained from consenting individuals who underwent a detailed clinical examination and were diagnosed at the Severance Hospital, Yonsei University College of Medicine. Serum samples from 63 patients with PC, 63 non-PC control subjects, and another 25 patients with PC and 81 patients with ICC were collected in 10-mL BD serum tubes. Samples were centrifuged at 4 °C for 20 min at 3000× g. The supernatant serum was then aliquoted and stored at −80 °C until further use.
2.3. MicroRNA Extraction
Total RNA containing miRNA was extracted from the serum samples using a serum miRNA purification kit (Genolution, Seoul, Korea) according to the manufacturer’s instructions, and the RNA was resuspended in 12 μL of RNase-free water and stored at −80 °C until microarray or qRT-PCR analysis.
2.4. MicroRNA Microarray Experiments
For quality control, the purity and integrity of the RNA were evaluated based on the OD260/280 ratio and analyzed using the Agilent 2100 Bioanalyzer (Agilent Technologies, Palo Alto, CA, USA). Analysis using the Affymetrix GeneChip miRNA 4.0 array (Affymetrix, Santa Clara, CA, USA) was performed according to the manufacturer’s protocol. RNA samples (130 ng) were labeled using the FlashTag Biotin RNA Labeling Kit (Genisphere, Hatfield, PA, USA). The labeled RNA was quantified, fractionated, and hybridized to the miRNA microarray according to the standard procedures provided by the manufacturer.
Next, the labeled RNA was heated to 99 °C for 5 min and then to 45 °C for 5 min. RNA-array hybridization was performed with agitation at 60 rotations per minute for 16 h at 48 °C on an Affymetrix 450 Fluidics Station. The chips were washed and stained using a GeneChip Fluidics Station 450 (Affymetrix). The chips were then scanned using an Affymetrix GCS 3000 scanner; 232 CEL files were analyzed and normalized using the Expression Console software. The Affymetrix GeneChip Micro 4.0 Array provides 100% miRBase v20 coverage (www.mirbase.org) using a one-color approach. This chip contains 6658 human probe sets, which includes pre-mature miRNAs (n = 2025) and other small RNAs (n = 1996), including internal and negative controls. For further analysis, we extracted 2578 mature human miRNAs, from all probe sets.
2.5. Principal Component Analysis Based on Differentially Expressed Genes
Log2-transformed and normalized intensities for the 2578 human mature miRNAs were analyzed for the difference in expression levels between the cases and controls. To identify DEmiRNAs, we used a logistic regression analysis. Statistical significance was determined using the false discovery rate (FDR) method; FDR < 0.05 was considered significant in this analysis.
To examine the difference in miRNA profiles between the cases and controls, we conducted a principal component analysis (PCA). The principal components of the two groups were computed based on different sets of miRNAs: (i) all miRNAs and (ii) FDR < 0.05. Based on this PCA model, we also predicted the principal components of the validation samples (25 PC samples and 81 ICC samples). To visualize the pattern of each group, we added 95% confidence ellipses of principal components in a PCA plot based on the multivariate t distribution.
2.6. Biomarker Selection for Diagnosis
For diagnosis of PC, miRNA biomarkers were selected from the 2578 human mature miRNAs, using the following procedure:
- Step 1 (training/test data assigning):
-
○Whole data were randomly divided into 5 approximately equal-sized subsets (folds).
-
○Each of the five folds were considered test data, and the remaining folds were designated as training data (5-fold cross-validation).
-
○
- Step 2 (candidate variable selection):
-
○Using the individualized assigned training data, logistic regression analysis was conducted, and p-values and adjusted p-values (FDR) were computed for each miRNA.
-
○First candidate miRNAs were selected (FDR < 0.05).
-
○By applying a smoothly clipped absolute deviation (SCAD) penalty to the first candidate miRNAs, second candidate miRNAs with non-zero coefficients were selected.
-
○
- Step 3 (repetition):
-
○Steps 1 and 2 were repeated 200 times with random seed.
-
○1000 (5-fold CV × 200 repetitions) sets of candidate miRNAs were obtained.
-
○
- Step 4 (final variable selection by voting):
-
○From the 1000 sets of candidate miRNAs, the frequency of each candidate miRNA was computed.
-
○The candidate miRNAs were sorted by frequency.
-
○
- Step 5 (prediction model building):
-
○From the top K ranked miRNAs, the radial basis function (RBF)-kernel SVM model was built (K = 2, …, 50).
-
○From each model (K = 2, …, 50), the optimal hyperparameters and performance were calculated using double cross-validation.
-
○Optimal K was determined based on the performance of each model.
-
○As a final model, RBF-kernel SVM with K top-ranked miRNAs was applied using the whole training dataset.
-
○
2.7. Double Cross-Validation
For the parametrization and validation of our diagnostic model, we used double cross-validation [11,12], which consists of inner and outer cross validation. We conducted the outer 5-fold cross validation to determine the optimal K and the inner 5-fold cross validation for the hyperparameter assignment of SVM. In the inner 5-fold cross validation, for the grid search of kernel hyperparameters, we assigned gamma values in the range of −2−10 to 210 (−2−10, 2−9, …, 29, 210) and cost values in the range of −2−7 to 27 (2−7, 2−6, …, 26, 27). In the outer 5-fold cross validation, the diagnostic models with K top-ranked miRNAs were applied to the test data, and the area under the curve (AUC), sensitivity, and specificity were calculated for each fold. We calculated these performances for several K values (K = 2, …, 50). This double cross-validation was repeated 20 times in random seeds. The performance metrics were then averaged for 5 folds and 20 repetitions. Based on this performance, we determined the final number of biomarkers (=K).
2.8. Smoothly Clipped Absolute Deviation (SCAD) Penalty
SCAD is a non-concave penalty function introduced by Fan and Li [13], and Zhang et al. [14] considered the sparse SVM with SCAD for feature selection. The SCAD-penalized term for each coefficient tj has the following form Equation (1) [14]:
(1) |
In our analysis, Fan and Li’s suggested value for = 3.7 was used. The parameter λ was assigned by minimizing the approximate generalized cross-validation statistics. Among the various penalized methods for feature selection, we chose SCAD because it has several desirable properties. For example, SCAD produces nearly unbiased estimates for large coefficients, and the set of features selected using SCAD are asymptotically equivalent to the set of true signal features; that is, SCAD satisfies the oracle property. We conducted a penalized SVM with the SCAD penalty for multiple miRNA selection in the double cross-validation in our study.
2.9. Quantitative RT-PCR
Reverse transcription and qRT-PCR were performed using a TaqMan Advanced miRNA cDNA Synthesis Kit (Applied Biosystems, Foster City, CA, USA), TaqMan Advanced miRNA Assays (Applied Biosystems), and TaqMan Fast Advanced Master Mix (Applied Biosystems), according to the manufacturer’s protocols. qRT-PCR was performed using an ABI Prism 7300 Sequence Detection System (Applied Biosystems), and primers for the mature miRNAs were purchased from Applied Biosystems. PCR amplification consisted of an initiation step at 95 °C for 10 min, followed by 55 cycles at 95 °C for 30 s, 56 °C for 30 s, and 72 °C for 15 s. All qRT-PCR assays were performed in triplicate using total RNA samples from 17 patients with PC, 8 patients with ICC, and 8 healthy individuals. Statistical analyses were analyzed using GraphPad 5 (GraphPad Software). The miRNA expression between groups were calculated by a one-way ANOVA and Bonferroni post-tests.
3. Results
3.1. Comparison between Case and Control by PCA
Upon comparing the 63 PC samples with the 29 non-cancer samples in the DEmiRNA analysis, we identified 103 miRNAs that showed significant differences in expression between the two groups (FDR < 0.05) (Table S1, Figure S1). When 103 miRNAs were used in the PCA, the 63 PC samples (green dots) and 29 non-cancer samples (red dots) were well-distinguished compared to when all miRNAs were used. Furthermore, 25 validation-PC samples (purple dots) had similar patterns to the 63 PC samples, as shown in Figure 1a,b. However, some of the 81 validation-ICC samples (blue points) had overlapping patterns with the PC case samples. Thus, if we used only non-cancer samples as controls, the biomarkers led to many false positives (for example, the biomarker could diagnose some ICC patients as PC patients) and were not appropriate for the PC-specific diagnostic model.
Upon comparing the 63 PC samples with the 63 non-PC samples, we identified 149 miRNAs that showed significant differential expression between the two groups (FDR < 0.05) (Table S2, Figure S2). When we used all the miRNA data in the PCA, the PC and non-PC samples exhibited overlapping patterns of principal components (Figure 1c). When the 149 differentially expressed miRNAs were used in the PCA, the clustering patterns of the 63 PC samples and 63 non-PC samples were nearly distinguished, and the validation samples (25 PC and 81 ICC) had similar patterns to those of the training case-control samples, as shown in Figure 1d.
3.2. Building a Diagnostic Model Based on the Selected miRNA Markers
To build the diagnostic model, we decided to use the results of the comparison between the PC samples and non-PC samples, including samples from patients with other cancers as controls to obtain PC-specific diagnostic markers. For the selection of diagnostic markers, we used 5-fold cross validation with 200 repetitions. In each fold of the cross-validation, we conducted a logistic regression analysis without a covariate and selected a set of candidate miRNA markers whose FDR was less than 0.05. Then, through the use of the SVM with the SCAD penalty function, the candidate markers were narrowed down to the markers with non-zero coefficients.
We ranked the markers according to the selection frequency. Based on these frequencies, K top-ranked miRNAs were used to build the RBF kernel SVM model. To determine the value of K, through double cross-validation, we estimated the diagnostic performance of the model with the K top-ranked miRNAs by varying K (K = 1, …, 50). As shown in Figure 2, the performance measures increased as K increased and began to saturate at an AUC of 0.98 and an accuracy of 0.93 when K was 39. Therefore, we decided to select the top 39 miRNAs as diagnostic biomarkers for PC among the candidate miRNAs (Table 2).
Table 2.
miRNA | Selection Freq | AUC | Reference |
---|---|---|---|
hsa-miR-548ay-5p * | 950 | 0.734 | |
hsa-miR-155-5p * | 842 | 0.808 | [15,16,17,18] |
hsa-miR-4284 | 836 | 0.708 | [19] |
hsa-miR-939-5p * | 810 | 0.734 | [20] |
hsa-miR-642b-3p * | 805 | 0.759 | [21,22] |
hsa-miR-346 * | 736 | 0.749 | [23] |
hsa-miR-4690-5p * | 690 | 0.716 | |
hsa-miR-7154-5p | 675 | 0.698 | |
hsa-miR-3064-5p * | 625 | 0.785 | [24] |
hsa-miR-1269b * | 607 | 0.854 | |
hsa-miR-4708-3p * | 580 | 0.82 | |
hsa-miR-5100 * | 580 | 0.792 | [25,26] |
hsa-miR-548aq-3p * | 580 | 0.76 | |
hsa-miR-661 | 523 | 0.701 | [27] |
hsa-miR-4701-3p * | 509 | 0.699 | |
hsa-miR-1272 * | 455 | 0.771 | |
hsa-miR-455-3p * | 454 | 0.732 | [28,29] |
hsa-miR-26a-5p | 422 | 0.711 | [30,31,32] |
hsa-miR-22-3p * | 388 | 0.758 | [21,33] |
hsa-miR-6894-3p | 384 | 0.697 | |
hsa-miR-3620-3p * | 377 | 0.679 | |
hsa-miR-4775 * | 377 | 0.759 | |
hsa-miR-4745-5p * | 371 | 0.688 | [29] |
hsa-miR-6737-3p | 371 | 0.673 | |
hsa-miR-5189-3p * | 357 | 0.693 | |
hsa-miR-4647 | 356 | 0.741 | |
hsa-miR-4486 * | 349 | 0.788 | |
hsa-miR-6865-5p * | 345 | 0.764 | |
hsa-miR-200b-5p * | 344 | 0.726 | [34,35] |
hsa-miR-548ac * | 313 | 0.852 | |
hsa-let-7b-5p | 298 | 0.676 | |
hsa-miR-2278 * | 291 | 0.741 | [36] |
hsa-miR-4703-5p * | 262 | 0.603 | |
hsa-miR-1226-5p | 257 | 0.653 | |
hsa-miR-640 * | 245 | 0.725 | |
hsa-miR-1277-3p | 240 | 0.672 | |
hsa-miR-218-5p | 232 | 0.624 | [19,37,38,39] |
hsa-miR-512-3p * | 218 | 0.664 | |
hsa-miR-16-2-3p * | 213 | 0.8 |
* The markers that were also differentially expressed between PC and non-cancer samples; other markers were only differentially expressed between PC and non-PC samples.
At K = 39, the mean sensitivity and mean specificity of the diagnostic model were 0.93 and 0.93, respectively, given an optimal decision threshold. The optimal threshold of diagnosis probability was determined to be 0.55 by comparing the performance results based on thresholds (0.5, 0.55, 0.6, 0.65, and 0.7). Among the 39 miRNAs, 28 miRNAs were also differentially expressed between the PC samples and non-cancer samples (FDR < 0.05); 11 miRNAs were differentially expressed between the PC and non-PC samples (FDR < 0.05) (Figure 3).
For validation, we next applied our PC-specific diagnostic model to a different set of 25 PC and 81 ICC samples. When the PC-diagnosis probability from the diagnostic model was >0.55, we diagnosed the patient as having PC. We also applied CA19-9 diagnosis to the same samples for comparison. When the CA19-9 value was >37, we diagnosed the patient as having PC. As shown in Figure 4, the AUC of the proposed diagnostic model was 1.5 times higher, the sensitivity was 1.3 times higher, and the specificity was 2 times higher than that of the CA19-9 diagnosis model (the AUC, sensitivity, and specificity are presented in Figure 4).
We also validated 10 miRNAs of the 39 diagnostic markers using qRT-PCR. For qRT-PCR, blood samples from another 17 patients with PC, 8 patients with ICC, and 8 healthy individuals were used (Table S3). The expression levels of miR-155-5p, miR-4284, let-346, miR-7154-5p, miR-5100, miR-661, miR-22-3p, miR-4486, let-7b-5p, and miR-4703-5p were analyzed using primers for mature miRNAs. The findings indicated differential expression in PC versus ICC and healthy individuals. Decreased expression of miR-155-5p, miR-7154-5p, miR-661, and miR-4703-5p and elevated expression of miR-5100, miR-22-3p, miR-4486, and let-7b-5p were observed in PC patients. miR-4284 was only detected in cancer groups and miR-346 was absent in patients with PC (Figure 5).
4. Discussion
Despite multiple clinical trials and continued efforts, PC remains the most difficult cancer to cure as it is difficult to diagnose at the early stages. In this study, we aimed to identify circulating miRNA biomarkers for the detection of PC and to develop a diagnostic model based on these markers. For the identification of diagnostic markers, we used two types of control group. The first control group consisted of 29 non-cancer samples and the second consisted of 63 non-PC samples, including those from patients with other cancers. DEmiRNAs selected from the PC vs. non-cancer study successfully enabled discrimination between the training-case samples and the training-control samples but could not distinguish the validation-case samples from the validation-control samples, possibly because the validation-control samples consisted of samples from patients with ICC. For validation control, patients with ICC were used instead of healthy patients to verify the specificity of PC diagnosis. PC and ICC are known to have overlapping immunohistochemical profiles [40]. DEmiRNAs selected from the PC vs. non-PC study differentiated the cases from the controls well, both in the training samples and in the validation samples. As a result, we found that the PC vs. non-PC grouping was more acceptable for identification of PC-specific diagnostic markers than the PC vs. non-cancer grouping. Based on this grouping, we tried to identify PC-specific diagnosis markers from 2578 miRNAs. In order to consider joint effects from multiple core miRNAs and filter the negative effects caused by irrelevant miRNAs, we conducted a penalized SVM with SCAD penalty. As a result, we identified 39 PC-specific diagnostic markers using SCAD-based penalized SVM with a double cross-validation technique.
Among the 39 PC-specific markers, 15 miRNAs have been reported to serve as PC markers in previous studies. Some of these (miR-155-5p, miR-939-5p, miR-346, miR-3064-5p, miR-661, miR-26a-5p, miR-200b-5p, and miR-218-5p) were found to be differentially expressed in PC tissues [15,16,17,18,19,20,23,24,27,30,31,32,34,35,37,38,39]. miR-939-5p, miR-200b-5p, and miR-218-5p were also differentially expressed in PC cell lines. miR-642b-3p and miR-22-3p, which have previously been reported as early diagnostic markers for PC, showed altered levels in the blood of patients with PC [21,22,33]. miR-4284 and miR-5100 are underexpressed in cyst fluid and saliva samples, respectively, in patients with PC [19,25,26]. miR-455-3p and miR-4745-5p have been found to be related to resistance to gemcitabine treatment in patients with PC [28,29]. Among the 39 markers obtained, several of them have also been reported to act as markers of other cancers, including colon, ovarian, breast, liver, lung, and prostate cancer. These markers, individually, are not specific to PC; however, we believe that our joint algorithm based on all 39 markers enables highly specific diagnosis of PC.
Among the present selected diagnostic markers, 10 markers were analyzed by qRT-PCR and differential expression was observed between cancer groups and healthy individuals. Previously reported PC markers including miR-155-5p, miR-4284, miR-5100, miR-346, miR-661, and miR-22-3p, as well as the novel markers including miR-7154-5p, miR-miR-4486, let-7b-5p, and miR-4703-5p, consistently showed differential expression in the PC samples, in both the microarray and qRT-PCR.
The markers identified in the present study have potential for use in the early diagnosis of PC and are expected to serve as a major platform for developing commercial models for the timely diagnosis of PC.
5. Conclusions
In this study, we identified 39 circulating miRNAs as PC-specific diagnostic markers using penalized methods. They include several novel biomarkers that have not yet been reported for PC diagnosis. For inner validation, we estimated the sensitivity and specificity of our diagnostic model through double cross-validation and obtained a mean sensitivity of 0.93 and mean specificity of 0.93. We also validated the specificity using 25 independent PC and 81 ICC samples with a PCA analysis and conducted qRT-PCR validation on several diagnostic markers using independent samples from 17 PC, 8 ICC, and 8 healthy control patients. qRT-PCR analysis indicated that miR-155-5p, miR-4284, miR-346, miR-7145-5p, miR-5100, miR-661, miR-22-3p, miR-4486, let-7b-5p, and miR-4703-5p were differentially expressed in samples from patients with PC. Overall, while we are convinced that our identified miRNA biomarkers based on the PC-specific diagnosis model improve the detection rate for PC, further validation studies will be needed in the future.
Supplementary Materials
The following are available online at https://www.mdpi.com/1422-0067/22/3/1007/s1, Figure S1: The heatmap of 103 miRNAs for distinguishing PC and non-cancer samples, Figure S2: The heatmap of 149 miRNAs for distinguishing PC and non-PC samples, Table S1: The differentially expressed 103 miRNAs for distinguishing PC and non-cancer samples, Table S2: The differentially expressed 149 miRNAs for distinguishing PC and non-PC samples, Table S3: Clinical characteristic of the samples used for qRT-PCR.
Author Contributions
Conceptualization, S.Y.S.; Methodology, J.L., D.E.J. and S.Y.S.; Formal analysis, J.L., H.S.L., S.B.P., C.K. and K.K.; Writing—original draft preparation, J.L. and D.E.J.; Writing—review and editing, J.L., D.E.J., and S.Y.S.; Visualization, J.L. and D.E.J.; Supervision, S.Y.S. All authors have read and agreed to the published version of the manuscript.
Funding
This work was supported by the Post-Genome Technology Development Program (No. 10040174, Multiple biomarker development through validation of useful markers generated by next generation bio-data based genome research) funded by the Ministry of Trade, Industry and Energy (MOTIE, Korea). This research was also supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education, Science and Technology (NRF-2019R1I1A1A01061764).
Institutional Review Board Statement
The study was conducted according to the guidelines of the Declaration of Helsinki, and the Ethical Committee for the Clinical Research of the Institutional Review Board of Severance Hospital, Yonsei University College of Medicine, Seoul, Korea, approved this study protocol (IRB approval code 4-2012-0528, 20 September 2012).
Informed Consent Statement
Informed consent was obtained from all subjects involved in the study.
Data Availability Statement
The data in this study are openly available in the Gene Expression Omnibus accession number GSE85589 (www.ncbi.nlm.nih.gov/geo).
Conflicts of Interest
The authors declare no conflict of interest.
Footnotes
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.American Cancer Society . Cancer Facts & Figures 2020. American Cancer Society; Atlanta, GA, USA: 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Cote G.A., Gore A.J., McElyea S.D., Heathers L.E., Xu H., Sherman S., Korc M. A pilot study to develop a diagnostic test for pancreatic ductal adenocarcinoma based on differential expression of select miRNA in plasma and bile. Am. J. Gastroenterol. 2014;109:1942–1952. doi: 10.1038/ajg.2014.331. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Liu R., Chen X., Du Y., Yao W., Shen L., Wang C., Hu Z., Zhuang R., Ning G., Zhang C., et al. Serum microRNA expression profile as a biomarker in the diagnosis and prognosis of pancreatic cancer. Clin. Chem. 2012;58:610–618. doi: 10.1373/clinchem.2011.172767. [DOI] [PubMed] [Google Scholar]
- 4.Bartel D.P. MicroRNAs: Genomics, biogenesis, mechanism, and function. Cell. 2004;116:281–297. doi: 10.1016/S0092-8674(04)00045-5. [DOI] [PubMed] [Google Scholar]
- 5.Iorio M.V., Croce C.M. MicroRNAs in cancer: Small molecules with a huge impact. J. Clin. Oncol. 2009;27:5848–5856. doi: 10.1200/JCO.2009.24.0317. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Arroyo J.D., Chevillet J.R., Kroh E.M., Ruf I.K., Pritchard C.C., Gibson D.F., Mitchell P.S., Bennett C.F., Pogosova-Agadjanyan E.L., Stirewalt D.L., et al. Argonaute2 complexes carry a population of circulating microRNAs independent of vesicles in human plasma. Proc. Natl. Acad. Sci. USA. 2011;108:5003–5008. doi: 10.1073/pnas.1019055108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Creemers E.E., Tijsen A.J., Pinto Y.M. Circulating microRNAs: Novel biomarkers and extracellular communicators in cardiovascular disease? Circ. Res. 2012;110:483–495. doi: 10.1161/CIRCRESAHA.111.247452. [DOI] [PubMed] [Google Scholar]
- 8.Kodahl A.R., Lyng M.B., Binder H., Cold S., Gravgaard K., Knoop A.S., Ditzel H.J. Novel circulating microRNA signature as a potential non-invasive multi-marker test in ER-positive early-stage breast cancer: A case control study. Mol. Oncol. 2014;8:874–883. doi: 10.1016/j.molonc.2014.03.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Valadi H., Ekstrom K., Bossios A., Sjostrand M., Lee J.J., Lotvall J.O. Exosome-mediated transfer of mRNAs and microRNAs is a novel mechanism of genetic exchange between cells. Nat. Cell. Biol. 2007;9:654–659. doi: 10.1038/ncb1596. [DOI] [PubMed] [Google Scholar]
- 10.Vickers K.C., Palmisano B.T., Shoucri B.M., Shamburek R.D., Remaley A.T. MicroRNAs are transported in plasma and delivered to recipient cells by high-density lipoproteins. Nat. Cell. Biol. 2011;13:423–433. doi: 10.1038/ncb2210. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Mosteller F., Tukey J. Data Analysis, Including Statistics. In: Lindzey G., Aronson E., editors. The Handbook of Social Psychology. Addison-Wesley; Boston, MA, USA: 1968. [Google Scholar]
- 12.Stone M. Cross-validatory choice and assessment of statistical predictions. J. R. Stat. Soc. Ser. B Methodol. 1974;36:111–147. doi: 10.1111/j.2517-6161.1974.tb00994.x. [DOI] [Google Scholar]
- 13.Fan J., Li R. Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties. J. Am. Stat. Assoc. 2001;96:1348–1360. doi: 10.1198/016214501753382273. [DOI] [Google Scholar]
- 14.Zhang H.H., Ahn J., Lin X., Park C. Gene selection using support vector machines with non-convex penalty. Bioinformatics. 2006;22:88–95. doi: 10.1093/bioinformatics/bti736. [DOI] [PubMed] [Google Scholar]
- 15.Habbe N., Koorstra J.B., Mendell J.T., Offerhaus G.J., Ryu J.K., Feldmann G., Mullendore M.E., Goggins M.G., Hong S.M., Maitra A. MicroRNA miR-155 is a biomarker of early pancreatic neoplasia. Cancer Biol. Ther. 2009;8:340–346. doi: 10.4161/cbt.8.4.7338. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Huang C., Li H., Wu W., Jiang T., Qiu Z. Regulation of miR-155 affects pancreatic cancer cell invasiveness and migration by modulating the STAT3 signaling pathway through SOCS1. Oncol. Rep. 2013;30:1223–1230. doi: 10.3892/or.2013.2576. [DOI] [PubMed] [Google Scholar]
- 17.Liu J., Gao J., Du Y., Li Z., Ren Y., Gu J., Wang X., Gong Y., Wang W., Kong X. Combination of plasma microRNAs with serum CA19-9 for early detection of pancreatic cancer. Int. J. Cancer. 2012;131:683–691. doi: 10.1002/ijc.26422. [DOI] [PubMed] [Google Scholar]
- 18.Rawat M., Kadian K., Gupta Y., Kumar A., Chain P.S.G., Kovbasnjuk O., Kumar S., Parashe G. MicroRNA in Pancreatic Cancer: From Biology to Therapeutic Potential. Genes. 2019;10:752. doi: 10.3390/genes10100752. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Wang J., Paris P.L., Chen J., Ngo V., Yao H., Frazier M.L., Killary A.M., Liu C.G., Liang H., Mathy C., et al. Next generation sequencing of pancreatic cyst fluid microRNAs from low grade-benign and high grade-invasive lesions. Cancer Lett. 2015;356:404–409. doi: 10.1016/j.canlet.2014.09.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Shen Y., Chen G., Gao H., Li Y., Zhuang L., Meng Z., Liu L. miR-939-5p Contributes to the Migration and Invasion of Pancreatic Cancer by Targeting ARHGAP4. Oncol. Targets Ther. 2020;13:389–399. doi: 10.2147/OTT.S227644. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Ganepola G.A., Rutledge J.R., Suman P., Yiengpruksawan A., Chang D.H. Novel blood-based microRNA biomarker panel for early diagnosis of pancreatic cancer. World J. Gastrointest. Oncol. 2014;6:22–33. doi: 10.4251/wjgo.v6.i1.22. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Hussein N., Kholy Z., Anwar M.M., Ahmad M.A., Ahmad S.M. Plasma miR-22-3p, miR-642b-3p and miR-885-5p as diagnostic biomarkers for pancreatic cancer. J. Cancer Res. Clin. Oncol. 2017;143:83–93. doi: 10.1007/s00432-016-2248-7. [DOI] [PubMed] [Google Scholar]
- 23.Ma X., Tao R., Li L., Chen H., Liu Z., Bai J., Shuai X., Wu C., Tao K. Identification of a 5-microRNA signature and hub miRNA-mRNA interactions associated with pancreatic cancer. Oncol. Rep. 2019;41:292–300. doi: 10.3892/or.2018.6820. [DOI] [PubMed] [Google Scholar]
- 24.Yan J., Jia Y., Chen H., Chen W., Zhou X. Long non-coding RNA PXN-AS1 suppresses pancreatic cancer progression by acting as a competing endogenous RNA of miR-3064 to upregulate PIP4K2B expression. J. Exp. Clin. Cancer Res. 2019;38:390. doi: 10.1186/s13046-019-1379-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Chijiiwa Y., Moriyama T., Ohuchida K., Nabae T., Ohtsuka T., Miyasaka Y., Fujita H., Maeyama R., Manabe T., Abe A., et al. Overexpression of microRNA-5100 decreases the aggressive phenotype of pancreatic cancer cells by targeting PODXL. Int. J. Oncol. 2016;48:1688–1700. doi: 10.3892/ijo.2016.3389. [DOI] [PubMed] [Google Scholar]
- 26.Xie Z., Yin X., Gong B., Nie W., Wu B., Zhang X., Huang J., Zhang P., Zhou Z., Li Z. Salivary microRNAs show potential as a noninvasive biomarker for detecting resectable pancreatic cancer. Cancer Prev. Res. 2015;8:165–173. doi: 10.1158/1940-6207.CAPR-14-0192. [DOI] [PubMed] [Google Scholar]
- 27.Lv F., Zheng K., Yu J., Huang Z. MicroRNA-661 expression is upregulated in pancreatic ductal adenocarcinoma and promotes cell proliferation. Oncol. Lett. 2018;16:6293–6298. doi: 10.3892/ol.2018.9454. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Bera A., VenkataSubbaRao K., Manoharan M.S., Hill P., Freeman J.W. A miRNA signature of chemoresistant mesenchymal phenotype identifies novel molecular targets associated with advanced pancreatic cancer. PLoS ONE. 2014;9:e106343. doi: 10.1371/journal.pone.0106343. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Dhayat S.A., Mardin W.A., Seggewiss J., Strose A.J., Matuszcak C., Hummel R., Senninger N., Mees S.T., Haier J. MicroRNA Profiling Implies New Markers of Gemcitabine Chemoresistance in Mutant p53 Pancreatic Ductal Adenocarcinoma. PLoS ONE. 2015;10:e0143755. doi: 10.1371/journal.pone.0143755. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Deng J., He M., Chen L., Chen C., Zheng J., Cai Z. The loss of miR-26a-mediated post-transcriptional regulation of cyclin E2 in pancreatic cancer cell proliferation and decreased patient survival. PLoS ONE. 2013;8:e76450. doi: 10.1371/journal.pone.0076450. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Fu X., Jin L., Wang X., Luo A., Hu J., Zheng X., Tsark W.M., Riggs A.D., Ku H.T., Huang W. MicroRNA-26a targets ten eleven translocation enzymes and is regulated during pancreatic cell differentiation. Proc. Natl. Acad. Sci. USA. 2013;110:17892–17897. doi: 10.1073/pnas.1317397110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Zhong Y.Y.G., Zhao G., Zhang D. miR-26a suppresses the growth and metastasis via targeting matrix metalloproteinase 14 in pancreatic ductal adenocarcinoma. Int. J. Clin. Exp. Pathol. 2016;9:4803–4809. [Google Scholar]
- 33.Wang X., Zhao J., Huang J., Tang H., Yu S., Chen Y. The regulatory roles of miRNA and methylation on oncogene and tumor suppressor gene expression in pancreatic cancer cells. Biochem. Biophys. Res. Commun. 2012;425:51–57. doi: 10.1016/j.bbrc.2012.07.047. [DOI] [PubMed] [Google Scholar]
- 34.Zhang Y., Li M., Wang H., Fisher W.E., Lin P.H., Yao Q., Chen C. Profiling of 95 microRNAs in pancreatic cancer cell lines and surgical specimens by real-time PCR analysis. World J. Surg. 2009;33:698–709. doi: 10.1007/s00268-008-9833-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Gregory P.A., Bert A.G., Paterson E.L., Barry S.C., Tsykin A., Farshid G., Vadas M.A., Khew-Goodall Y., Goodall G.J. The miR-200 family and miR-205 regulate epithelial to mesenchymal transition by targeting ZEB1 and SIP1. Nat. Cell. Biol. 2008;10:593–601. doi: 10.1038/ncb1722. [DOI] [PubMed] [Google Scholar]
- 36.Georgikou C., Buglioni L., Bremerich M., Roubicek N., Yin L., Gross W., Sticht C., Bolm C., Herr I. Novel Broccoli Sulforaphane-Based Analogues Inhibit the Progression of Pancreatic Cancer without Side Effects. Biomolecules. 2020;10:769. doi: 10.3390/biom10050769. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Liu Z., Du R., Long J., Guo K., Ge C., Bi S., Xu Y. microRNA-218 promotes gemcitabine sensitivity in human pancreatic cancer cells by regulating HMGB1 expression. Chin. J. Cancer Res. 2015;27:267–278. doi: 10.3978/j.issn.1000-9604.2015.04.06. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Liu Z., Xu Y., Long J., Guo K., Ge C., Du R. microRNA-218 suppresses the proliferation, invasion and promotes apoptosis of pancreatic cancer cells by targeting HMGB1. Chin. J. Cancer Res. 2015;27:247–257. doi: 10.3978/j.issn.1000-9604.2015.04.07. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Zhu Z., Xu Y., Du J., Tan J., Jiao H. Expression of microRNA-218 in human pancreatic ductal adenocarcinoma and its correlation with tumor progression and patient survival. J. Surg. Oncol. 2014;109:89–94. doi: 10.1002/jso.23475. [DOI] [PubMed] [Google Scholar]
- 40.Collins A.L., Wojcik S., Liu J., Frankel W.L., Alder H., Yu L., Schmittgen T.D., Croce C.M., Bloomston M. A differential microRNA profile distinguishes cholangiocarcinoma from pancreatic adenocarcinoma. Ann. Surg. Oncol. 2014;21:133–138. doi: 10.1245/s10434-013-3240-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The data in this study are openly available in the Gene Expression Omnibus accession number GSE85589 (www.ncbi.nlm.nih.gov/geo).