Abstract
Purpose
To develop a support vector machine (SVM) classifier using CT texture-based analysis in differentiating focal-type autoimmune pancreatitis (AIP) and pancreatic duct carcinoma (PD), and to assess the radiologists’ diagnostic performance with or without SVM.
Materials and methods
This retrospective study included 50 patients (20 patients with focal-type AIP and 30 patients with PD) who underwent dynamic contrast-enhanced CT. Sixty-two CT texture-based features were extracted from 2D images of the arterial and portal phase CTs. We conducted data compression and feature selections using principal component analysis (PCA) and produced the SVM classifier. Four readers participated in this observer performance study and the statistical significance of differences with and without the SVM was assessed by receiver operating characteristic (ROC) analysis.
Results
The SVM performance indicated a high performance in differentiating focal-type AIP and PD (AUC = 0.920). The AUC for all 4 readers increased significantly from 0.827 to 0.911 when using the SVM outputs (p = 0.010). The AUC for inexperienced readers increased significantly from 0.781 to 0.905 when using the SVM outputs (p = 0.310). The AUC for experienced readers increased from 0.875 to 0.912 when using the SVM outputs, however, there was no significant difference (p = 0.018).
Conclusion
The use of SVM classifier using CT texture-based features improved the diagnostic performance for differentiating focal-type AIP and PD on CT.
Keywords: Machine learning, Autoimmune pancreatitis, Pancreatic duct carcinoma, Texture analysis, CT
Introduction
Autoimmune pancreatitis (AIP) is a rare chronic pancreatitis marked by pancreatic enlargement, irregular pancreatic duct stenosis, and elevated serum immunoglobulin 4 (IgG4) levels, mediated by autoimmune mechanisms [1]. AIP doesn't have any characteristic clinical manifestations and is often misdiagnosed as pancreatic duct carcinoma (PD), and therefore, some patients undergo unnecessary surgery as a result [2]. On the other hand, the imaging findings associated with serological examinations (IgG4 and Ca19-9) plays an important role in the distinction between these entities. Since, 7–10% of pancreatic cancer patients show high serum IgG4 levels, and some AIP patients may exhibit equivocal serum IgG4 levels and elevated levels of CA19-9, the utility of the serological examinations may be limited [3–6]. Therefore, the diagnostic imaging is the key in differentiating between AIP and PD.
AIP is categorized into Type 1 and Type 2. Type 1 is common, and is regarded as a prototype of IgG4-related disease, with high serum levels of IgG4 (> 140 mg/dl), IgG4-positive plasma cell infiltration, and sclerosis. Type 2 is considered granulocytic epithelial lesions. Both types can present various morphological changes of the pancreas, which include diffuse, focal/mass-forming, or multifocal disease [7–9]. Focal-type AIP is accounting for 33–41% of the cases of AIP [10, 11]. Differentiation between focal-type AIP and PD by conventional imaging methods can be difficult.
Multiple imaging techniques, including CT, MRI, and 18F-fluorodeoxyglucose positron-emission tomography/computerized tomography (18F-FDG PET/CT), have been used for solving this problem [12–14].
Recent advances and application developments of radiomics have helped the improvement of disease prediction and classification accuracy in differentiating tumors. Many researchers have used texture-based analysis in an attempt to differentiate tumors using machine learning [15–17]. However, to our knowledge, only a few studies have investigated CT texture-based analysis to differentiate focal-type AIP and PD via a machine learning approach [17–19]. In addition, the impact of machine learning model on observer performance has not been reported in previous studies.
The purpose of this study was to develop a CT texture-based support vector machine (SVM) classifier in differentiating focal-type AIP and PD via a machine learning approach, and to compare the diagnostic performance with and without the SVM classifier.
Materials and methods
Subjects
Our institutional review board approved this retrospective study and informed consent from patients was waived. We retrospectively reviewed the medical records of all patients who had undergone abdominal dynamic contrast-enhanced CT at our hospital between March 2005 and August 2019, and selected for this analysis the patients who met the following criteria: (a) fulfilled with the International Consensus Diagnostic Criteria (ICDC) for AIP [20] or Revised Japanese Pancreas Society criteria of AIP [21]; or (b) histopathologically diagnosed with PD after surgical resection at our institution. Finally, 20 patients with focal-type AIP (11 men, 9 women; mean age, 65.3 years; range, 46–81 years) and 30 patients with PD (18 men, 12 women; mean age, 66.9 years; range, 52–82 years) were included for this study. In the AIP group, 14 patients were diagnosed on the basis of ICDC, all of whom had type 1 AIP (definite, 12; probable, 2). All patients were diagnosed on the basis of the Japanese diagnostic criteria (definite, 12; probable, 4; possible, 4). In the PD group, the number of patients with clinical T stages 1a, 1b, 1c, 2, 3 and 4 were 0, 0, 8, 22, 0 and 0, respectively. All patients with focal-type AIP and PD underwent CT examinations before receiving therapy. Intergroup comparisons between focal-type AIP and PD were performed using Chi-square test for categorical variables and the Student’s t test for numeric variables.
Image acquisition
In the AIP group, the number of patients scanned on a 16-slice MDCT scanner (Aquilion®, Toshiba Medical Systems, Tokyo, Japan), a 32-slice MDCT scanner (Aquilion®, Toshiba Medical Systems, Tokyo, Japan), a 64-slice MDCT scanner (Aquilion®, Toshiba Medical Systems, Tokyo, Japan), and a 320-slice MDCT scanner (Aquilion ONE®, Toshiba Medical Systems, Tokyo, Japan) were 4, 1, 12, and 3, respectively. In the PD group, the number of patients scanned on the 64-slice MDCT scanner (Aquilion®, Toshiba Medical Systems, Tokyo, Japan) and the 320-slice MDCT scanner (Aquilion ONE®, Toshiba Medical Systems, Tokyo, Japan) were 21 and 9, respectively.
CT data of the 16-slice MDCT scanner were acquired using the following parameters: tube voltage, 120 kV; tube current, 200 mA without automatic exposure control; gantry rotation speed, 0.5 s; collimation, 16 × 2 mm; and beam pitch, 0.938. CT data of the 32-slice MDCT scanner were acquired using the following parameters: tube voltage, 120 kV; tube current, 270 mA without automatic exposure control; gantry rotation speed, 0.5 s; collimation, 32 × 1 mm; and beam pitch, 0.828. CT data of the 64-slice MDCT scanner were acquired using the following parameters: tube voltage, 120 kV; tube current, automatic exposure control with a fixed noise index (SD 10 at 3 mm thickness) gantry rotation speed, 0.5 s; collimation, 32 × 1 mm; and beam pitch, 0.828. CT data of the 320-slice MDCT scanner were acquired using the following parameters: tube voltage, 120 kV; tube current, automatic exposure control with a fixed noise index (SD 10 at 3 mm thickness); gantry rotation speed, 0.5 s; collimation, 80 × 0.5 mm; and beam pitch, 0.813.
After unenhanced images had been acquired, non-ionic contrast material (550 mg I/kg body weight) was injected through the peripheral venous line within 30 s. Arterial phase imaging was performed with fixed delay or bolus triggering, usually between 35 and 40 s post-injection, and portal phase imaging was performed at 60–70 s.
The helical data of the 16-slice MDCT scanner were reconstructed using FBP with our standard reconstruction kernel (FC10). The helical data of the 32- and 64-slice MDCT scanner were reconstructed using FBP with our standard reconstruction kernel (FC14). The helical data of the 320-slice MDCT scanner were reconstructed using AIDR3D (weak or mild setting) with our standard reconstruction kernel (FC14). The image reconstruction was performed in a 32 to 45 cm display field of view depending on the patient's physique. CT image analysis was performed using arterial and portal phase images with a 3 mm section thickness at 3-mm intervals.
Texture feature extraction
The overall flowchart explaining the process in this study is shown in Fig. 1. We chose an axial image slice of arterial and portal phase on the basis of the maximum diameter of the lesion. Texture parameter calculation was performed with the software LIFEx ([20]; version 4.00, available at https://www.lifexsoft.org/). Two-dimensional segmentation was performed by manually drawing a region of interest (ROI) around the lesion outline as big as possible by one radiologist (K.A) (Fig. 2), and verified and confirmed by another radiologist (T.H); both were blinded to the patients’ clinical outcomes. First, we calculated 37 texture features defined in LIFEx version 4.00 software. The texture features of each ROI were then obtained, we excluded the data of six texture features (i.e., GLRLM_LGRE, GLRLM_LRLGE, GLRLM_SRLGE, GLZLM_LGZE, GLZLM_LZLGE and GLZLM_SZLGE) since they showed zero values in all of the ROIs from pancreatic lesions. Therefore, we used the 31 textural feature data set for this study. The 31 texture features used in this study are shown in Table 1. Finally, we used a total of 62 texture features from arterial and portal phases.
Table 1.
Feature name | Symbol/abbreviation |
---|---|
Geometry based and histogram based features | |
Skewness | – |
Kurtosis | – |
Entropy log10 | – |
Entropy log2 | – |
Energy | – |
Gray-level co-occurrence matrix (GLCM) | |
Homogeneity | – |
Energy | – |
Contrast | – |
Correlation | – |
Entropy log10 | – |
Entropy log2 | – |
Dissimilarity | – |
Neighborhood gray-level different matrix (NGLDM) | |
Contrast | – |
Coarseness | – |
Busyness | – |
Grey level run length matrix GLRLM) | – |
Short-Run Emphasis | SRE |
Long-Run Emphasis | LRE |
Low Gray-level Run Emphasis | LGRE |
High Gray-level Run Emphasis | HGRE |
Short-Run High Gray-level Emphasis | SRHGE |
Long-Run Low Gray-level Emphasis | LRLGE |
Long-Run High Gray-level Emphasis | LRHGE |
Gray-Level Non-Uniformity for run | GLNU |
Run Length Non-Uniformity | RLNU |
Run Percentage | RP |
Grey level zone length matrix (GLZLM) | |
Short-Zone Emphasis | SZE |
Long-Zone Emphasis | LZE |
High Gray-level Zone Emphasis | HGZE |
Short-Zone High Gray-level Emphasis | SZHGE |
Long-Zone Low Gray-level Emphasis | LZLGE |
Long-Zone High Gray-level Emphasis | LZHGE |
Gray-Level Non-Uniformity for zone | GLNU |
Zone Length Non-Uniformity | ZLNU |
Zone Length Non-Uniformity Zone Percentage | ZP |
Data compression and feature selection
To keep the interpretability as well as the high classification performance of the classifier, we conducted data compression and feature selections by using principal component analysis (PCA). Since we could not make any prior selection of features based on their mathematical definitions or previous studies, we compressed all of the 62 features by extracting original features as well as new features with PCA. After compressing the data, we selected the principal components (PC) which were able to explain at least 80% of the total variance and plotted their scores. We then selected the best two components with scores that clearly separated between the AIP group and PD group subjectively. We also performed Wilcoxon–Mann–Whitney tests to assess objectively whether the score differences were significant between the two groups. p < 0.025 was considered statistically significant.
SVM classifier construction and evaluation
We constructed an SVM classifier to predict the disease type (AIP or PD) using the two principal components selected in the previous section. SVM is a discriminant function that uses multiple features to classify into two classes. When the e1071 package (https://cran.r-project.org/web/packages/e1071/index.html) is used, the SVM model function outputs a decision value for each region. Each region is classified according to whether the output value of the discriminant function exceeds 0 or not. When the output value is greater than zero, unknown data is classified as AIP. When the output value is less than zero, unknown data is classified as PD. In the training phase, we constructed many classifiers with various hyper parameter combinations. Then we chose the best model that was able to classify the regions with the smallest classification error (i.e. the highest accuracy). The tunable hyper-parameters in this work were the cost of misclassification (C) and the inverse of the standard deviation of the RBF kernel (γ). Coarse grid searches were used to tune these parameters and threefold cross-validation was conducted to find the average performance of the classifier. Sensitivity, specificity, accuracy, positive predictive value (PPV) and negative predictive value (NPV) were calculated using this classification result. On the other hand, receiver operating characteristic (ROC) analysis was performed using the output data of the discriminant function, and the area under the curve (AUC) was calculated. For observer performance study, a model was trained using the 2 folds as training data. The resulting model was validated on the remaining part of the data (i.e. a test data) and extracted output. The data analyses were conducted by using the R software program (version 3.4.1; www.R-project.org).
Observer performance study
We used an independent test method for diagnostic performance evaluation, and one radiologist lined up 20 patients with focal-type AIP and 30 patients with PD in random order. Four radiologists with 5, 6, 24 and 30 years of experience participated in the observer performance study. The performance of the SVM classifier and clinical data were not informed. Assessment of each radiologist’s performance was determined by receiver operating characteristic (ROC) analysis with a continuous rating scale with a 0–100 scale. First, the readers reviewed an axial image slice of the arterial and portal phase of the lesion without SVM outputs. Axial images were chosen for the pancreas masses avoiding other key findings (such as lymph node (LN) swellings, vascular invasion or pancreatic duct dilatations). Subsequently, SVM outputs (the percentage of probability for each category) were presented to the observers via the bar graph, and a second observer performance study with SVM outputs was performed. The time interval between the first and second review was more than one month. The performance of the 4 readers was evaluated in terms of sensitivity, specificity, accuracy, PPV and NPV, and the AUC of the ROC. We divided readers into two groups, experienced and inexperienced, and the statistical significance of the differences between AUC values for radiologists with and without SVM outputs was evaluated by DeLong's test. For the both classifier performance and the observer performance, the data analyses were conducted by using the R software program (version 3.4.1; www.R-project.org). The sensitivities, specificities, PPV, NPV, and AUC of the SVM classifier were calculated automatically by the R software.
Results
The clinical characteristics of 20 focal-type AIP and 30 PD patients are summarized in Table 2. No significant differences were found in age, sex or lesion size between the two groups.
Table 2.
Focal-type AIP n = 20 (± SD) |
PD n = 30 (± SD) |
p | |
---|---|---|---|
Age (y) | 65.25 ± 9.40 | 66.87 ± 8.45 | 0.539 |
Male/Female | 11/9 | 18/12 | 0.953 |
Lesion size (mm) | 33.54 ± 13.62 | 26.96 ± 7.78 | 0.061 |
IgG4 > 280 (mg/dl) | 8 | ||
280≧IgG4≧135 (mg/dl) | 7 | ||
Narrowing of main pancratic duct on ERP/MRCP | 18 | ||
Pathological findings | 9 | ||
Other organ involvement | 6 | ||
Effectiveness of steroid therapy | 15 | ||
cT Stagea | |||
T1a | 0 | ||
T1b | 0 | ||
T1c | 8 | ||
T2 | 22 | ||
T3 | 0 | ||
T4 | 0 |
ERP endoscopic retrograde pancreatography, MRCP magnetic resonance cholangiopancreatography
aAccording to the 8th edition of the TNM classification of malignant tumors
We chose the best combination of principal components that could separate AIP group and PD group by plotting their scores (Fig. 3). As showing on Fig. 3, the combination of the second principal component (PC2) and the third principal component (PC3) was chosen because they could clearly separate two groups subjectively. The scores of PC2 and PC3 showed significant differences in focal-type AIP and PD (both p < 0.001) (Table 3). The top three texture features in PC2 loadings were arterial phase Skewness, portal phase NGLDM Coarseness and portal phase Skewness. The bottom three texture features in PC2 loadings were arterial phase GLRLM HGRE, arterial phase GLZLM HGZE and arterial phase GLRLM SRHGE (Table 4). The top three texture features in PC3 loadings were portal phase GLRLM RLNU, portal phase GLZLM GLNU and portal phase GLRLM GLNU. The bottom three texture features in PC3 loadings were arterial phase NGLDM Coarseness, arterial phase NGLDM Contrast and arterial phase GLRLM SRE (Table 5). ROC curves were adopted to determine the diagnostic performance of SVM classifiers in differentiating focal-type AIP and PD (Fig. 4). The parameters of SVM classifier including AUC, sensitivity (%), specificity (%), accuracy (%), PPV (%), and NPV (%) were 0.920, 100.0, 75.0, 90.0, 85.7 and 100.0%, respectively.
Table 3.
Focal-type AIP (n = 20) | PD (n = 30) | p | |
---|---|---|---|
PC2 score | −2.26 ± 2.03 | 1.50 ± 3.42 | < .001* |
PC3 score | 1.59 ± 2.58 | −1.06 ± 2.75 | < .001* |
*A significant difference (p < 0.025)
Table 4.
Texture feature | PC2 loading |
---|---|
ap Skewness | 0.1995 |
pp NGLDM Coarseness | 0.1721 |
pp Skewness | 0.1617 |
ap Kurtosis | 0.1182 |
pp NGLDM Contrast | 0.0855 |
・・・ | ・・・ |
ap GLCM Entropy log2 | −0.2055 |
ap GLCM Entropy log10 | −0.2056 |
ap GLRLM SRHGE | −0.2157 |
ap GLZLM HGZE | −0.2356 |
ap GLRLM HGRE | −0.2377 |
ap arterial phase, pp portal phase
Table 5.
Texture feature | PC3 loading |
---|---|
pp GLRLM RLNU | 0.2366 |
pp GLZLM GLNU | 0.2332 |
pp GLRLM GLNU | 0.2296 |
ap GLRLM RLNU | 0.2249 |
ap Kurtosis | 0.2172 |
・・・ | ・・・ |
pp NGLDM Coarseness | −0.1814 |
ap GLZLM HGZE | −0.1819 |
ap GLRLM SRE | −0.1839 |
ap NGLDM Contrast | −0.1871 |
ap NGLDM Coarseness | −0.2567 |
ap arterial phase, pp portal phase
ROC curves were adopted to determine the diagnostic performance of all readers in differentiating focal-type AIP and PD (Fig. 5). The AUC without and with SVM outputs for each radiologist are shown in the Table 6. The AUC of all radiologists without SVM outputs were 0.827 and those with were 0.911 (p = 0.010). Divided into two groups, the AUC of the experienced group without SVM outputs was 0.875 and that with was 0.912 (p = 0.310). The AUC of the inexperienced group with SVM outputs was 0.781 and that with was 0.905 (p = 0.018).
Table 6.
Observer | No.1 | No.2 | No.3 | No.4 | All | Experienced group (No.1 and 2) | Inexperienced group (No.3 and 4) |
---|---|---|---|---|---|---|---|
Exp. years | 30 | 24 | 6 | 5 | |||
AUC without SVM | 0.852 | 0.900 | 0.789 | 0.783 | 0.827* | 0.875 | 0.781* |
AUC with SVM | 0.912 | 0.918 | 0.913 | 0.898 | 0.911* | 0.912 | 0.905* |
AUC the area under ROC curve
*A significant difference (p < 0.05)
Discussion
We developed an SVM classifier to differentiate focal-type AIP and PD and evaluated the diagnostic performance of readers with and without SVM outputs. To our knowledge, there is no prior study dealing with observer performance study for CT texture-based analysis in differentiating focal-type AIP and PD using machine learning approaches. Although the SVM outputs improved the diagnostic performance for all four readers including the experienced radiologists, an observer category with less experience benefited from the SVM outputs more. The AUC for inexperienced radiologists with the SVM outputs (0.905) was higher than that for experienced radiologists without the SVM outputs (0.875). These results may suggest that the SVM classifier may be more useful for inexperienced radiologists and the radiologist’s performance in differentiating focal-type AIP and PD depends on their experience.
In machine learning, SVMs are learning algorithms for analyzing data used for classification. The SVM can build a model that classifies new data into different categories from a set of training examples, each of them belonging to one of the categories. Relative to the other machine learning methods, SVMs are powerful methods at recognizing subtle patterns in complex datasets [23]. For diagnostic classification, an SVM classification scheme of CT/MRI texture analysis was used for differentiating glioma vs. primary central nervous system (CNS) lymphoma [16], renal cell carcinoma (RCC) vs. angiomyolipoma (AML) [24] and LN metastasis [25]. It has been reported that SVM based on MRI textural features was noninferior to expert human evaluation in the differentiation of CNS lymphoma and glioma [16]. You et al. reported SVM based on CT textural features (SVM on TF) of renal masses could accurately differentiate AML without visible fat from RCC [24]. Yang et al. reported that in predicting occult LN metastasis of lung cancer before surgery, the performance of the SVM on TF was higher than that of the model based on clinical-histopathologic features (i.e., age, sex, tumor location, diameter, and histology)[25]. Considering these classification abilities, we developed texture-based SVM classifier in differentiating focal-type AIP and PD in this study.
CT is one of the most commonly used imaging modalities for the diagnosis of pancreatic lesions. Although known to be difficult in the diagnosis of focal-type AIP based on imaging, previous studies reported that some imaging findings of CT were useful in distinguishing focal-type AIP from PD. These imaging findings contained; delayed homogeneous enhancement on dynamic CT, a hypoattenuating capsule-like rim, the presence of the “duct-penetrating” sign (mass penetrated by an unobstructed pancreatic duct), the absence of significant upstream main pancreatic duct (MPD) dilatation (> 5 mm), the absence of atrophic changes in the body and the tail of the pancreas, and enhanced duct sign (wall enhancement of MPD in the lesion) on multiphase contrast-enhanced CT [6, 26–29]. Furuhashi et al. reported that homogeneous enhancement during the portal phase, dotted enhancement during the pancreatic phase, duct-penetrating sign, enhanced duct sign and capsule-like rim were more frequently observed in focal-type AIP. On the other hand, ring-like enhancement during the delayed phase and peripancreatic strands with a length of at least 10 mm were more frequently observed in PD. Focal-type AIP was identified with 82% sensitivity and 98% specificity by using any four of these seven findings [26]. Although AIP was more likely to show a dotted enhancement in the pancreatic phase and a homogeneous enhancement in the portal phase, PD was more likely to show a heterogeneously decreased enhancement in the portal phase. In our study, we were able to objectively assess intralesional heterogeneity by using CT texture-based analysis to differentiate focal-type AIP and PD via a machine learning approach. Our results of CT texture analysis may support the findings of previous subjective studies.
In MRI, Choi et al. reported that the homogeneous enhancement (p = 0.001), duct penetrating sign (p < 0.001), and an ADC value less than 0.9407 × 10–3 mm2/s (p < 0.001) were significant for differentiating focal-type AIP from PD in multivariate analysis. When two of these three criteria were satisfied, 80% (12/15) of focal-type AIPs were identified with specificity of 98.7% [12]. 18F-FDG PET/CT findings have also been reported to differentiate AIP from PD (p < 0.05) with the AUCs of 0.700 (early SUV max) and 0.687 (delayed SUV max) in the previous study [14]. Although we developed SVM classifier based on only CT texture, a combination of the imaging features of these modalities may be more practical in clinical settings.
In our study, AUC, sensitivity (%), specificity (%), accuracy (%), PPV (%), and NPV (%) of our SVM classifier were 0.920, 100.0, 75.0, 90.0, 85.7, and 100.0%, respectively. Differentiation between focal AIP and PD should ideally be based on a combination of CT, MRI, and PET imaging findings, however, there are cases that are difficult to differentiate even with all these imaging findings. We believe that CT texture analysis can be a new quantitative tool in the differential diagnosis of pancreatic lesions as well as ADC values or SUV max. However, the performance of our learning model was lower than that of prior published work in differentiating AIP from PD with CT radiomics features [17]. In the previous study AIP patients included not only focal-type but also nearly diffuse type and diffuse type. Our study was limited to only focal type AIP and PD with surgical indication. This may be the cause of relatively low AUC values compared to previous studies.
Our SVM outputs could significantly improve reader’s performance including that of the expert readers. There were five cases in which all radiologists diagnosed correctly and the SVM failed to do so, and only one case in which all radiologists failed and the SVM diagnosed correctly. It is interesting to analyze the discrepancies between the SVM output and the radiologist’s interpretation, particularly for those cases in which the radiologists diagnosed correctly and the SVM failed to do so. Figure 6 shows the representative case of the SVM failure. In this case, the tumor had an absence of atrophic changes in the body and the tail of the pancreas and an absence of significant upstream MPD dilatation. This case was more typical of AIP, however, the SVM classified it as a PD. One cause of disagreement between radiologists and the SVM classifier may be that radiologists take into account the changes in the surroundings of lesions, such as atrophic changes in the body and the tail of the pancreas, upstream MPD dilatation, and a hypoattenuating capsule-like rim. Radiomics represents a method for the quantitative description of medical images. It is, however, important to note that radiomics should only be viewed as an additional tool and not as a standalone diagnostic algorithm [30]. Furthermore, we believe that ideally, machine learning should be used by radiologists applying it with other key image findings who have the knowledge of the limitation of radiomics.
Our study has several limitations. First, our study was a retrospective study with only a small number of patients. Second, the CT scanners used in our study were not uniform between patients and an inter-scanner difference may have affected the texture analysis results. Further studies with a larger study population and uniform CT scanner are needed to confirm the results of this study. Third, the readers reviewed only an axial image slice of the arterial and portal phase of the lesion in the observer performance study. This process might be influenced in readers performance. Fourth, in the present study, we constructed the SVM classifier to differentiate focal-type AIP and PD, though etiologies other than those diseases can also appear as pancreatic mass. Finally, lesion segmentation in our study was conducted manually by only one radiologist confirmed by the other radiologist because prior studies of CT texture analysis have shown good to excellent interobserver agreement [31–35]. However, a risk of subjective tendency or bias must be considered.
In conclusion, CT texture analysis via a machine learning approach can help clinicians differentiate focal-type AIP and PD. Where radiologists effectively incorporate machine learning methods into the clinical practices their diagnostic abilities will be extremely important.
Declarations
Conflict of interest
We declare that we have no conflicts of interest with any organization or institute.
Ethical approval
All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards.
Informed consent
The research protocols were approved by the institution review boards, which granted a waiver of informed consent from study participants due to the retrospective nature of the study.
Footnotes
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Change history
7/9/2022
A Correction to this paper has been published: 10.1007/s11604-022-01313-x
Contributor Information
Kenta Anai, Email: k-anai@med.uoeh-u.ac.jp.
Yoshiko Hayashida, Email: yhlinda@med.uoeh-u.ac.jp.
Issei Ueda, Email: i-ueda@med.uoeh-u.ac.jp.
Eri Hozuki, Email: e.hoduki@hotmail.com.
Yuuta Yoshimatsu, Email: y-yoshimatsu@med.uoeh-u.ac.jp.
Jun Tsukamoto, Email: jun-053-ashimo@med.uoeh-u.ac.jp.
Toshihiko Hamamura, Email: t-hamamura@med.uoeh-u.ac.jp.
Norihiro Onari, Email: ohnari@med.uoeh-u.ac.jp.
Takatoshi Aoki, Email: a-taka@med.uoeh-u.ac.jp.
Yukunori Korogi, Email: yuk_55korogi@mojih.johas.go.jp.
References
- 1.Finkelberg DL, Sahani D, Deshpande V, Brugge WR. Autoimmune pancreatitis. N Engl J Med. 2006;355:2670–2676. doi: 10.1056/NEJMra061200. [DOI] [PubMed] [Google Scholar]
- 2.Yadav D, Notahara K, Smyrk TC, Clain JE, Pearson RK, Farnell MB, et al. Idiopathic tumefactive chronic pancreatitis: clinical profile, histology, and natural history after resection. Clin Gastroenterol Hepatol. 2003;1:129–135. doi: 10.1053/cgh.2003.50016. [DOI] [PubMed] [Google Scholar]
- 3.Raina A, Krasinskas AM, Greer JB, Lamb J, Fink E, Moser AJ, et al. Serum immunoglobulin G fraction 4 levels in pancreatic cancer: elevations not associated with autoimmune pancreatitis. Arch Pathol Lab Med. 2008;132:48–53. doi: 10.5858/2008-132-48-SIGFLI. [DOI] [PubMed] [Google Scholar]
- 4.Kim H-R, Lee C-H, Kim YW, Han SK, Shim Y-S, Yim J-J. Increased CA 19–9 level in patients without malignant disease. Clin Chem Lab Med. 2009;47:750–754. doi: 10.1515/CCLM.2009.152. [DOI] [PubMed] [Google Scholar]
- 5.La Greca G, Sofia M, Lombardo R, Latteri S, Ricotta A, Puleo S, et al. Adjusting CA19-9 values to predict malignancy in obstructive jaundice: influence of bilirubin and C-reactive protein. World J Gastroenterol. 2012;18:4150–4155. doi: 10.3748/wjg.v18.i31.4150. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Lopes Vendrami C, Shin JS, Hammond NA, Kothari K, Mittal PK, Miller FH. Differentiation of focal autoimmune pancreatitis from pancreatic ductal adenocarcinoma. Abdom Radiol (NY) 2020;45:1371–1386. doi: 10.1007/s00261-019-02210-0. [DOI] [PubMed] [Google Scholar]
- 7.Hoshimoto S, Aiura K, Tanaka M, Shito M, Kakefuda T, Sugiura H. Mass-forming type 1 autoimmune pancreatitis mimicking pancreatic cancer. J Dig Dis. 2016;17:202–209. doi: 10.1111/1751-2980.12316. [DOI] [PubMed] [Google Scholar]
- 8.de Pretis N, Amodio A, Frulloni L. Updates in the field of autoimmune pancreatitis: a clinical guide. Expert Rev Gastroenterol Hepatol. 2018;12:705–709. doi: 10.1080/17474124.2018.1489230. [DOI] [PubMed] [Google Scholar]
- 9.Madhani K, Farrell JJ. Autoimmune pancreatitis: an update on diagnosis and management. Gastroenterol Clin North Am. 2016;45:29–43. doi: 10.1016/j.gtc.2015.10.005. [DOI] [PubMed] [Google Scholar]
- 10.Irie H, Honda H, Baba S, Kuroiwa T, Yoshimitsu K, Tajima T, et al. Autoimmune pancreatitis: CT and MR characteristics. AJR Am J Roentgenol. 1998;170:1323–1327. doi: 10.2214/ajr.170.5.9574610. [DOI] [PubMed] [Google Scholar]
- 11.Wakabayashi T, Kawaura Y, Satomura Y, Watanabe H, Motoo Y, Okai T, et al. Clinical and imaging features of autoimmune pancreatitis with focal pancreatic swelling or mass formation: comparison with so-called tumor-forming pancreatitis and pancreatic carcinoma. Am J Gastroenterol. 2003;98:2679–2687. doi: 10.1111/j.1572-0241.2003.08727.x. [DOI] [PubMed] [Google Scholar]
- 12.Choi S-Y, Kim SH, Kang TW, Song KD, Park HJ, Choi Y-H. Differentiating mass-forming autoimmune pancreatitis from pancreatic ductal adenocarcinoma on the basis of contrast-enhanced MRI and DWI findings. AJR Am J Roentgenol. 2016;206:291–300. doi: 10.2214/AJR.15.14974. [DOI] [PubMed] [Google Scholar]
- 13.Ruan Z, Jiao J, Min D, Qu J, Li J, Chen J, et al. Multi-modality imaging features distinguish pancreatic carcinoma from mass-forming chronic pancreatitis of the pancreatic head. Oncol Lett. 2018;15:9735–9744. doi: 10.3892/ol.2018.8545. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Zhang J, Jia G, Zuo C, Jia N, Wang H. 18F- FDG PET/CT helps differentiate autoimmune pancreatitis from pancreatic cancer. BMC Cancer. 2017;17:695. doi: 10.1186/s12885-017-3665-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Yang R, Wu J, Sun L, Lai S, Xu Y, Liu X, et al. Radiomics of small renal masses on multiphasic CT: accuracy of machine learning-based classification models for the differentiation of renal cell carcinoma and angiomyolipoma without visible fat. Eur Radiol. 2019;30:1254–1263. doi: 10.1007/s00330-019-06384-5. [DOI] [PubMed] [Google Scholar]
- 16.Alcaide-Leon P, Dufort P, Geraldo AF, Alshafai L, Maralani PJ, Spears J, et al. Differentiation of enhancing glioma and primary central nervous system lymphoma by texture-based machine learning. AJNR Am J Neuroradiol. 2017;38:1145–1150. doi: 10.3174/ajnr.A5173. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Park S, Chu LC, Hruban RH, Vogelstein B, Kinzler KW, Yuille AL, et al. Differentiating autoimmune pancreatitis from pancreatic ductal adenocarcinoma with CT radiomics features. Diagn Interv Imaging. 2020;101:555–564. doi: 10.1016/j.diii.2020.03.002. [DOI] [PubMed] [Google Scholar]
- 18.Ziegelmayer S, Kaissis G, Harder F, Jungmann F, Müller T, Makowski M, et al. Deep convolutional neural network-assisted feature extraction for diagnostic discrimination and feature visualization in pancreatic ductal adenocarcinoma (PDAC) versus autoimmune pancreatitis (AIP) J Clin Med. 2020;9:4013. doi: 10.3390/jcm9124013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Li J, Liu F, Fang X, Cao K, Meng Y, Zhang H, et al. CT radiomics features in differentiation of focal-type autoimmune pancreatitis from pancreatic ductal adenocarcinoma: a propensity score analysis. Acad Radiol. 2021;29:358–366. doi: 10.1016/j.acra.2021.04.014. [DOI] [PubMed] [Google Scholar]
- 20.Shimosegawa T, Chari ST, Frulloni L, Kamisawa T, Kawa S, Mino-Kenudson M, et al. International consensus diagnostic criteria for autoimmune pancreatitis: Guidelines of the International Association of Pancreatology. Pancreas. 2011;40:352–358. doi: 10.1097/MPA.0b013e3182142fd2. [DOI] [PubMed] [Google Scholar]
- 21.The Japan Pancreas Society. The Min. Clinical diagnostic criteria for autoimmune pancreatitis proposal. Suizo. 2011;2012(27):17–25. [Google Scholar]
- 22.Orlhac, Nioche, Buvat. Local Image Features.
- 23.Aruna S, Rajagopalan SP. A novel SVM based CSSFFS feature selection algorithm for detecting breast cancer. Int J Comput Appl. 2011;31:14–20. [Google Scholar]
- 24.You MW, Kim N, Choi HJ. The value of quantitative CT texture analysis in differentiation of angiomyolipoma without visible fat from clear cell renal cell carcinoma on four-phase contrast-enhanced CT images. Clin Radiol. 2019;74:547–554. doi: 10.1016/j.crad.2019.02.018. [DOI] [PubMed] [Google Scholar]
- 25.Zhong Y, Yuan M, Zhang T, Zhang Y-D, Li H, Yu T-F. Radiomics approach to prediction of occult mediastinal lymph node metastasis of lung adenocarcinoma. AJR Am J Roentgenol. 2018;211:109–113. doi: 10.2214/AJR.17.19074. [DOI] [PubMed] [Google Scholar]
- 26.Furuhashi N, Suzuki K, Sakurai Y, Ikeda M, Kawai Y, Naganawa S. Differentiation of focal-type autoimmune pancreatitis from pancreatic carcinoma: assessment by multiphase contrast-enhanced CT. Eur Radiol. 2015;25:1366–1374. doi: 10.1007/s00330-014-3512-3. [DOI] [PubMed] [Google Scholar]
- 27.Lee-Felker SA, Felker ER, Kadell B, Farrell J, Raman SS, Sayre J, et al. Use of MDCT to differentiate autoimmune pancreatitis from ductal adenocarcinoma and interstitial pancreatitis. AJR Am J Roentgenol. 2015;205:2–9. doi: 10.2214/AJR.14.14059. [DOI] [PubMed] [Google Scholar]
- 28.Chang WI, Kim BJ, Lee JK, Kang P, Lee KH, Lee KT, et al. The clinical and radiological characteristics of focal mass-forming autoimmune pancreatitis: comparison with chronic pancreatitis and pancreatic cancer. Pancreas. 2009;38:401–408. doi: 10.1097/MPA.0b013e31818d92c0. [DOI] [PubMed] [Google Scholar]
- 29.Kamisawa T, Imai M, Yui Chen P, Tu Y, Egawa N, Tsuruta K, et al. Strategy for differentiating autoimmune pancreatitis from pancreatic cancer. Pancreas. 2008;37:e62–e67. doi: 10.1097/MPA.0b013e318175e3a0. [DOI] [PubMed] [Google Scholar]
- 30.van Timmeren JE, Cester D, Tanadini-Lang S, Alkadhi H, Baessler B. Radiomics in medical imaging-"how-to" guide and critical reflection. Insights Imaging. 2020;11:91. doi: 10.1186/s13244-020-00887-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Hayano K, Tian F, Kambadakone AR, Yoon SS, Duda DG, Ganeshan B, et al. Texture analysis of non-contrast-enhanced computed tomography for assessing angiogenesis and survival of soft tissue sarcoma. J Comput Assist Tomogr. 2015;39:607–612. doi: 10.1097/RCT.0000000000000239. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Smith AD, Gray MR, del Campo SM, Shlapak D, Ganeshan B, Zhang X, et al. Predicting overall survival in patients with metastatic melanoma on antiangiogenic therapy and RECIST stable disease on initial posttherapy images using CT texture analysis. AJR Am J Roentgenol. 2015;205:W283–W293. doi: 10.2214/AJR.15.14315. [DOI] [PubMed] [Google Scholar]
- 33.Hodgdon T, McInnes MDF, Schieda N, Flood TA, Lamb L, Thornhill RE. Can quantitative CT texture analysis be used to differentiate fat-poor renal angiomyolipoma from renal cell carcinoma on unenhanced CT images? Radiology. 2015;276:787–796. doi: 10.1148/radiol.2015142215. [DOI] [PubMed] [Google Scholar]
- 34.Schieda N, Thornhill RE, Al-Subhi M, McInnes MDF, Shabana WM, van der Pol CB, et al. Diagnosis of sarcomatoid renal cell carcinoma with CT: evaluation by qualitative imaging features and texture analysis. AJR Am J Roentgenol. 2015;204:1013–1023. doi: 10.2214/AJR.14.13279. [DOI] [PubMed] [Google Scholar]
- 35.Takahashi N, Leng S, Kitajima K, Gomez-Cardona D, Thapa P, Carter RE, et al. Small (< 4 cm) renal masses: differentiation of angiomyolipoma without visible fat from renal cell carcinoma using unenhanced and contrast-enhanced CT. AJR Am J Roentgenol. 2015;205:1194–1202. doi: 10.2214/AJR.14.14183. [DOI] [PubMed] [Google Scholar]