Abstract
Objective:
To evaluate the diagnostic performance of computed tomography (CT) radiomics analysis for differentiating pleomorphic adenoma (PA), Warthin tumor (WT), and basal cell adenoma (BCA).
Methods:
A total of 189 patients with PA (n = 112), WT (n = 53) and BCA (n = 24) were divided into a training set (n = 133) and a test set (n = 56). The radiomics features were extracted from plain CT and contrast-enhanced CT images. After dimensionality reduction, plain CT, multiphase-enhanced CT, integrated radiomics signature models and radiomics score (Rad-score) were established and calculated. The receiver operating characteristic (ROC) curve analysis was taken for the assessment of the model performance, and then comparison was conducted among these models. Decision curve analysis (DCA) was adopted to assess the clinical benefits of the models. Diagnostic performances including sensitivity, specificity, and accuracy of the radiologists were evaluated.
Results:
Seven, nine, fourteen, and fourteen optimal features were used to constructed plain scan, arterial phase, venous phase, and integrated radiomics signature models, respectively. ROC analysis showed these four models were able to differentiate PA from BCA and WT, with the area under the ROC curve (AUC) values of 0.79, 0.90, 0.87, and 0.94 in the training set, and 0.79, 0.89, 0.86, and 0.94 in the test set, respectively. The integrated model had better diagnostic performance than single-phase radiomics model, but it had similar diagnostic performance to that of the radiomics model based on the arterial phase (p > 0.05). The sensitivity, specificity, and accuracy in the diagnosis of PA were 0.86, 0.46, and 0.70 for the non-subspecialized radiologist and 0.88, 0.77, and 0.84 for the subspecialized radiologist, respectively. Six venous phase parameters were finally selected in differentiating WT from BCA. The predictive effect of the model was favorable, with AUC value of 0.95, sensitivity of 0.96, specificity of 0.83, and accuracy of 0.92. The sensitivity, specificity, and accuracy in the diagnosis between WT and BCA were 0.26, 0.87, and 0.45 for the non-subspecialized radiologist and 0.85, 0.58, and 0.77 for the subspecialized radiologist, respectively.
Conclusion:
The CT-based radiomics analysis showed favorable predictive performance for differentiating PA, WT, and BCA, thus may be helpful in the clinical decision-making.
Keywords: computed tomography, pleomorphic adenoma, Warthin tumor, basal cell adenoma, radiomics
Introduction
Parotid gland (PG) tumors are not common neoplasms, accounting for about 2 to 3% of all head and neck tumors, among which some 75–80% are benign. 1,2 Benign parotid tumors mainly originate from epithelial and non-epithelial tissues. The most common benign tumors of epithelial origin are pleomorphic adenoma (PA), Warthin tumor (WT) and basal cell adenoma (BCA), which usually require different treatment because of different biological characteristics. For instance, approximately 5 to 10% of PAs carry a risk of malignant transformation if there is any delay for the excision and can recur after surgery. 3,4 Thus, as for those patients with PA, partial superficial parotidectomy is recommended. The malignant transformation of WT rarely occurs, less than 1% of cases, and in the clinical practice, enucleation or conservative observation is recommended. 5 BCA can be classified into four histologic subtypes according to cellular growth patterns, including solid (most frequent form), tubular, trabecular, and membranous. The membranous type of BCA may have a comparatively high tendency to recur, while malignant transformation can occur in all four types. For the primary treatment of BCA in the cases where parotid affectation exists, surgical excision via a suprafacial or total parotidectomy is conducted. 6 Accurate preoperative diagnosis of PA, WT, and BCA would be significant in the consideration of an optimal individualized surgical planning and provides useful information for prognosis counseling.
Imaging tools, including magnetic resonance imaging (MRI), and computed tomography (CT), are commonly applied to examine the parotid region. MRI provides extremely good soft tissue resolution with extra information on the cellularity, organization, vascularity, neoangiogenesis, and microbleeds. Recently, there has been an increased interest in using the advanced MRI techniques including dynamic contrast-enhanced (DCE) MRI, diffusion-weighted imaging (DWI), and susceptibility-weighted imaging (SWI), to improve the distinction of parotid tumors. 7,8 Contrast-enhanced CT may increase the incidence of contrast media-related adverse reactions, but it can also help clinicians to distinguish between benign and malignant parotid tumors and assess tumor boundaries, especially in the deep lobe. Different enhancement patterns of PA and WT can help improve the diagnostic efficiency, particularly gradual enhancement for PA and washout pattern enhancement for WT. 9 A previous study showed that energy spectrum CT had better discrimination ability compared to conventional CT in differentiating benign PG tumors. 10 However, parotid gland benign tumors often exhibit overlapping imaging characteristics, in addition, the conclusions mainly depend on the subjective judgment of radiologists, which is of low efficiency and brings difficultly for the radiologists with low levels of experience. Radiomics, as a medical tool can extract quantitative features from medical images via a variety of computational methods. It can reflect tumor heterogeneity by way of extracting, analyzing, and interpreting the quantitative imaging parameters and provide useful information for tumor classification, treatment efficacy, and evaluation of prognosis. 11,12 Recently, several studies have reported successful applications of radiomics in head and neck tumors, such as the classification of primary oropharyngeal squamous cell carcinomas in accordance with human papillomavirus status based on CT texture analysis. 13,14 In head and neck cancer radiotherapy, radiomics can predict the risk of severe toxicities, especially xerostomia, in addition to the response to radiochemotherapy or induction chemotherapy. 15,16 Furthermore, it has been reported in several studies that the applications of radiomics analysis might be helpful in differentiating benign from malignant parotid tumors and PA from WT. 17–19 A recent study 20 used the CT-based radiomics nomogram for differentiating the lympho-associated benign and malignant PG tumors and demonstrated favorable discriminative ability.
However, there is a paucity of literature on the use of radiomics for precise classification of parotid PA, WT, and BCA. General consensus suggests that conventional multiphase contrast-enhanced CT imaging takes a crucial part in differentiating these three classes of benign PG tumors. In this study, we used conventional plain CT and multiphase-enhanced images to extract radiomics features and build a classification model for differentiation of PA, WT, and BCA. The aim of this study was to investigate the diagnostic performance of the prediction models based on plain CT and multiphase-enhanced CT, and to investigate whether the diagnostic performance could be improved by the integrated model in comparison with single-phase model and the radiologists.
Methods and materials
Patients
Study and review on the collected data for 357 patients who underwent CT for presurgical assessment of PG tumors from January 1, 2016, to December 30, 2020. One hundred eighty-nine patients diagnosed with PA (n = 112, 48 men and 64 women; mean age, 44 ± 14.8 years), WT (n = 53, 51 men and 2 women; mean age, 58.5 ± 12.1 years), and BCA (n = 24, 9 men and 15 women; mean age, 56.9 ± 9.7 years) were recorded in this study. The inclusion criteria were: (1) patients with pathologically confirmed PA, WT or BCA; (2) patients with complete clinic data; (3) patients who underwent non-contrast CT and contrast-enhanced CT of the head and neck within 7 days prior to the operation. The exclusion criteria were: (1) patients with a maximum tumor diameter < 5 mm; (2) images were not eligible for analysis because obvious artifacts, such as metallic or motion artifacts; (3) fine needle aspiration was carried out prior to imaging of the parotid tumor.
We combined BCA cohort with WT cohort, and performed radiomics analysis with PA cohort. All patients were randomly assigned to the training set and the test set in a ratio of 7:3, thereby 133 patients (79 PAs, 16 BCAs, 38 WTs) were allocated to the training set and 56 patients (33 PAs, 8 BCAs, 15 WTs) to the test set. Then we separately analyzed the texture characteristics of WT and BCA.
The approval for this retrospective study was obtained from the institutional review board of our hospital, and informed consent from the patients was not required in this study. The clinical and image data of patients were acquired from the hospital’s medical record system and the picture archiving and communication systems (PACS).
CT image acquisition
CT examinations were conducted on the 64-slice CT scanners (Discovery 750, GE Healthcare; optima 680, GE Healthcare) with the following parameters: 120 kVp tube voltage, 180–300 mAs tube current, matrix of 512 × 512, slice thickness of 2.5 mm, and section interval of 2.5 mm. The scanning ranged from zygomatic arch to mandible. The contrast-enhanced CT scan was made after intravenous injection of 70–80 ml of nonionic iodinated iodophosphol with 320 mg iodine per ml at 2.5 ml s−1. Arterial and venous phase images were acquired at 25 sec and 70 sec after contrast agent injection.
Clinical and CT feature evaluation
Patients’ clinical parameters, including gender and age, were derived from admission records. Two radiologists (radiologist 1 with 8 years of experience, and radiologist 2 with 12 years of experience) who were unknown to the clinical data, including tumor location, shape, maximum diameter, distribution, calcification, cystic lesion, arterial enhancement, and enhancement pattern, independently reviewed and assessed CT feature analysis. For quantitative analysis of the enhancing characteristics, CT attenuation values (in HU) on arterial (25 s) and venous (70 s) CT scans were measured by placing the largest possible circular region of interest (3–120 mm2) within the solid portion of the lesion, but taking care to avoid the cystic area. The following diagnostic parameters in each tumor were calculated: A0 was attenuation value on plain CT scan, A1 was attenuation value on arterial-phase CT scan, and A2 was attenuation value on venous-phase CT scan. The degree of arterial enhancement (Ad) as Ad = A1 - A0. We defined severe enhancement as Ad > 40 Hu, moderate enhancement as 20 Hu < Ad< 40 Hu, mild enhancement as 10 Hu < Ad < 20 Hu. Then the following diagnostic parameters were calculated: washout attenuation (Aw) as Aw = A1 - A2; and relative percentage enhancement washout ratio (RPEWR) as RPEWR (%) = (AW / A1) × 100. We defined the washout enhancement pattern as A1 > A2, and RPEWR > 10%; the platform type as A1 > A2, and RPEWR < 10%. When A1 < A2, we defined it as the gradual enhancement pattern.
Two other independent radiologists (radiologist 3: with 8 years of experience, but not subspecialized in head and neck radiology; radiologist 4: with 7 years of experience in head and neck radiology), unknown to the purpose of the study and the pathological outcomes, were requested to give their diagnostic impression among PA, WT or BCA.
Image segmentation and radiomics feature extraction
Two independent radiologists (radiologist 5 and radiologist 6 with 8 years of experience, trained in head and neck radiology) who were unknown to the clinical information and pathological outcomes selected the phase with the clearest lesion boundary after negotiation and then performed three-dimensional (3-D) segmentation of tumor region of interest (ROI) via ITK-SNAP software. All ROIs were then copied to the other phase images. The position, size, and shape of the tumor were manually adjusted. The contours of tumor were drawn slice-by-slice on axial CT slices meanwhile avoiding adjacent blood vessels and normal bone tissue. The imported image format was DICOM; and the exported format was NII. The image window width and window level were normalized before drawing the ROI (the window width was set to 350 Hu, and the window level to 40 Hu). An example of a manual segmentation was shown in Figure 1. Z-score normalization was conducted as preprocessing step for data to reduce the influence of different devices on the stability of imaging features. All ROIs from the plain CT, arterial, and venous phases images were uploaded into AK analysis software (Analysis Kit, V3.2.0.R, GE Healthcare) for feature extraction.
Figure 1.
Manual three-dimensional segmentation of a basal cell adenoma in a 65-year-old female. (a) segmentation on the axial slice. (b) three-dimensional volumetric reconstruction. (c) Case 1: pleomorphic adenoma of the left parotid gland in a 66-year-old female (white arrow). The mass shows mild contrast enhancement at the arterial phase, with CT value of 27–44 HU. (d) Case 2: Warthin tumor of the right parotid gland in a 68-year-old male (white arrow). The mass shows marked contrast enhancement at the arterial phase, with CT value of 90–110 HU. (e) Case 3: basal cell adenoma of the right parotid gland in a 65-year-old female (white arrow). The mass shows marked contrast enhancement at the arterial phase, with CT value of 90–140 HU.
Radiomics feature selection and development of the radiomics signature
Firstly, we evaluated the performance of CT radiomics analysis for differentiating PA from WT and BCA. We used three feature selection methods to eliminate the redundant and irrelevant features, i.e., Mann-Whitney U test, minimal redundancy maximal relevance (mRMR), and least absolute shrinkage and selection operator (LASSO). We used 10-fold cross-validation to tune the regularization parameter (λ) with minimum criteria in the training set. The average of the 10 CVs was used to evaluate if the model was overfitting, and the model of the one with the lowest validation loss was kept for the final model. Eventually, the plain CT, multiphase-enhanced CT, and integrated radiomics signature models were established. A radiomics score (Rad-score) was calculated for each patient. The Rad-score values of PA and the integrated cohorts in the training and test sets were compared. In the texture analysis of BCA and WT, the following four feature selection methods were applied, i.e., Mann-Whiney U-test, univariate logistic regression, mRMR, and multivariable logistic regression. After filtering the independently discriminative features, we constructed the final model and calculated the Rad-score.
Thirty randomly chosen CT images (10 PAs, 10 BCAs, and 10 WTs) were analyzed to assess the intraobserver and interobserver repeatability and reproducibility of radiomics features extraction. Radiologist 5 and radiologist 6 independently performed ROI segmentation on CT images during the same period in order to evaluate the features’ reproducibility between observers. To assess the repeatability within the observer, Radiologist 5 performed the same procedure two weeks later. The intraobserver and interobserver agreements were assessed using the intraclass correlation coefficient (ICC), with an ICC > 0.75 indicating good agreement. If there was a strong agreement, radiologist 5 performed image segmentation on the remaining samples.
Statistical analysis
R statistical software (version 3.5.1; https://www.r-project.org) was adopted for the radiomics statistical analysis. Wilcoxon statistical method was used to compare the Rad-score values of PA and integrated cohorts in the training and test sets. The receiver operating characteristic (ROC) curve was applied to evaluate the diagnostic performance of the models and to calculate the models’ specificity, sensitivity, accuracy, positive predictive value, and negative predictive value based on the Youden index. The Delong test was applied to compare the differences between the various area under the curve (AUC) values. Decision curve analysis (DCA) plots were generated with the “risk model decision analysis (rmda[R])” package.
SPSS version 23.0 was applied to analyze the clinical data and imaging features of all patients. Normally distributed quantitative data were analyzed by t-test whereas for non-normally distributed quantitative data the Mann-Whitney test was used. Chi-square or Fisher’s exact test was performed for qualitative data. A two-tailed p-value < 0.05 was considered to be statistically significant.
Results
Clinical and radiological features
In the PA and integrated cohorts, all data were divided into a training set and a test set. The clinical and radiological characteristics of the patients in the training and test sets were shown in Table 1. There was a statistically significant difference only in patients’ gender between the two groups. And the patients’ age, gender, tumor distribution, arterial enhancement, and enhancement pattern were significantly different between PA and integrated cohorts in both training and test sets (Table 2). Moreover, there were significant differences in the gender, tumor distribution, cystic degeneration, and enhancement pattern between the BCA and WT cohorts (Table 3).
Table 1.
Clinical and radiological characteristics of patients in the training and test sets
| Variable | Training set (n = 133) | Test set (n = 56) | P-value | 
|---|---|---|---|
| Age (year) | 48.5 ± 15.1 | 52.9 ± 14.9 | 0.069 | 
| Gender (male/female) | 69/64 | 39/17 | 0.024 | 
| Tumor Distribution (single/multiple) | 120/13 | 52/4 | 0.564 | 
| Location (shallow/deep/across) | 90/0/43 | 44/0/12 | 0.187 | 
| Maximum diameter (mm) | 22.8 ± 7.5 | 23 ± 8.5 | 0.907 | 
| Calcification (yes/no) | 2/131 | 1/55 | 1.000 | 
| Shape (round/non-rounded) | 127/6 | 54/2 | 1.000 | 
| Cystic degeneration (yes/no) | 65/68 | 27/29 | 0.934 | 
| Arterial Enhancement (mild/moderate/severe) | 59/28/46 | 24/10/22 | 0.794 | 
| Enhancement curve (inflow/outflow/platform) | 96/34/3 | 36/19/1 | 0.467 | 
Table 2.
Clinical and radiological characteristics of pleomorphic adenoma (PA), basal cell adenoma (BCA), and Warthin tumor (WT) in the training and test sets
| Variable | Training set (n = 133) | Test set (n = 56) | ||||
|---|---|---|---|---|---|---|
| PA | BCA&WT | P-value | PA | BCA&WT | P-value | |
| Age (year) | 42.6 ± 15 | 56.7 ± 10.9 | <0.001 | 47.3 ± 14.2 | 61.0 ± 12.1 | <0.001 | 
| Gender (male/female) | 29/50 | 40/14 | <0.001 | 19/14 | 20/3 | 0.019 | 
| Distribution (single/multiple) | 79/0 | 41/13 | <0.001 | 33/0 | 19/4 | 0.024 | 
| Location (shallow/deep/across) | 51/0/28 | 39/0/15 | 0.353 | 25/0/8 | 19/0/4 | 0.743 | 
| Maximum diameter (mm) | 22.6 ± 7.4 | 23.3 ± 7.7 | 0.811 | 21.3 ± 8.1 | 25.5 ± 8.5 | 0.053 | 
| Calcification (yes/no) | 1/78 | 1/53 | 1.000 | 0/33 | 1/22 | 0.411 | 
| Shape (round/non-rounded) | 74/5 | 53/1 | 0.400 | 31/2 | 23/0 | 0.507 | 
| Cystic degeneration (yes/no) | 34/45 | 31/23 | 0.104 | 13/20 | 14/9 | 0.114 | 
| Arterial Enhancement (mild/moderate/severe) | 48/20/11 | 11/8/35 | <0.001 | 21/5/7 | 3/5/15 | 0.001 | 
| Enhancement pattern (inflow/outflow/platform) | 76/3/0 | 20/31/3 | <0.001 | 30/3/0 | 6/16/1 | <0.001 | 
Table 3.
Clinical and radiological characteristics of basal cell adenoma (BCA) and Warthin tumor (WT)
| Variable | WT (n = 53) | BCA (n = 24) | P-value | 
|---|---|---|---|
| Age (year) | 58.5 ± 12.1 | 56.9 ± 9.7 | 0.562 | 
| Gender (male/female) | 51/2 | 9/15 | <0.001 | 
| Distribution (single/multiple) | 36/17 | 24/0 | 0.002 | 
| Location (shallow/deep/across) | 41/0/12 | 17/0/7 | 0.538 | 
| Maximum diameter (mm) | 24.4 ± 7.9 | 22.8 ± 8.1 | 0.254 | 
| Calcification (yes/no) | 1/52 | 1/23 | 0.529 | 
| Shape (round/non-rounded) | 52/1 | 24/0 | 1.000 | 
| Cystic degeneration (yes/no) | 36/17 | 9/15 | 0.012 | 
| Arterial Enhancement (mild/moderate/severe) | 10/10/33 | 4/3/17 | 0.824 | 
| Enhancement curve (inflow/outflow/platform) | 12/41/0 | 14/6/4 | <0.001 | 
Radiomics feature extraction, selection, and radiomics signature construction
PA vs. BCA and WT
A total of 718 features showed good inter- and intraobserver agreement, with ICCs > 0.75. Radiologist five performed the feature extraction on the remaining samples because of strong agreement. First-, second-, and higher order texture features were extracted, including shape, gray histogram features, gray-level co-occurrence matrix (GLCM), gray-level run-length matrix (GLRLM), gray-level dependence matrix (GLDM), gray-level size zone matrix (GLSZM), Gauss transform and wavelet transform. After feature dimensionality reduction, seven, nine, fourteen, and fourteen optimal features were used to construct plain scan, arterial phase, venous phase and integrated radiomics signature models, respectively (Figure 2). Rad-score was then calculated by these selected features weighted by their coefficients, and the Rad-scores were quite different between PA and integrated cohorts in both training and test sets (Supplementary Material 1). Moreover, the ROC curves of the four models demonstrated good predictive performance for both the training and test sets (Table 4 and Figure 3). ROC analysis showed that plain CT, arterial phase, venous phase, and integrated models were able to differentiate PA from BCA and WT, with the AUC values of 0.79, 0.90, 0.87 and 0.94 in the training set, and 0.79, 0.89, 0.86 and 0.94 in the test set, respectively. In the training set, Delong test showed that the AUC value of the integrated model was remarkably different from that of the plain CT and venous phase models (p < 0.001 and p = 0.036, respectively). In the test set, the AUC value of the integrated model was only remarkably different from that of the plain CT model (p = 0.04). The DCAs demonstrated that the integrated model had a higher overall net benefit than the other three models (Figure 4).
Figure 2.
Radiomics feature selection using the least absolute shrinkage and selection operator (LASSO) regression model. (a) plain scan model. (b) arterial phase model. (c) venous phase model. (d) integrated model.
Table 4.
Diagnostic efficacy of the four radiological models
| Group/Model | Se | Sp | PPV | NPV | Accuracy | AUC | 95% CI | P [a] | 
|---|---|---|---|---|---|---|---|---|
| Training set | ||||||||
| Plain CT | 0.76 | 0.71 | 0.64 | 0.81 | 0.73 | 0.79 | 0.72–0.87 | < 0.001 | 
| Arterial phase | 0.93 | 0.73 | 0.70 | 0.93 | 0.81 | 0.90 | 0.85–0.95 | 0.114 | 
| Venous phase | 0.80 | 0.81 | 0.74 | 0.85 | 0.80 | 0.87 | 0.81–0.93 | 0.036 | 
| Integrated group | 0.94 | 0.77 | 0.74 | 0.95 | 0.84 | 0.94 | 0.91–0.98 | - | 
| Test set | ||||||||
| Plain CT | 0.70 | 0.70 | 0.62 | 0.77 | 0.70 | 0.79 | 0.67–0.92 | 0.040 | 
| Arterial phase | 0.87 | 0.76 | 0.71 | 0.89 | 0.80 | 0.89 | 0.80–0.98 | 0.400 | 
| Venous phase | 0.65 | 0.90 | 0.83 | 0.79 | 0.80 | 0.86 | 0.75–0.97 | 0.188 | 
| Integrated group | 1.00 | 0.76 | 0.74 | 1.00 | 0.86 | 0.94 | 0.88–1.00 | - | 
The differences between AUC values in the integrated model were compared with that in the plain scan, arterial phase, and venous phase models by Delong’s test. Se, sensitivity; Sp, specificity; AUC, area under the curve; PPV, positive predictive value; NPV, negative predictive value; CI, confidence interval.
Figure 3.
The receiver operating characteristic (ROC) curves of the four models in the training (a) and test (b) sets, respectively.
Figure 4.
Decision curve analysis (DCA) for four models. The y-axis indicates the net benefit; the x-axis indicates threshold probability. The black line, blue line, yellow line, and green line represent the net benefit of the plain scan, artery phase, venous phase, and integrated models, respectively. The combined model had a higher overall net benefit in differentiating pleomorphic adenoma, basal cell adenoma, and Warthin tumor than the other three models.
The sensitivity, specificity and accuracy in the diagnosis of PA were 0.86, 0.46 and 0.70 for the non-subspecialized radiologist, and 0.88, 0.77 and 0.84 for the subspecialized radiologist, respectively.
BCA vs. WT
Texture analysis based on the plain CT and multiphase-enhanced CT showed that totally 2154 radiomics features were selected by Mann-Whitney U-test, and 415 features were retained by univariate logistic regression. Next, mRMR method was used to filter the redundancy features and obtain eight features (Figure 5). Finally, six most predictive features were retained through backward stepwise selection with the likelihood ratio test. Radiomics signature was eventually constructed by multivariable logistic regression. The predictive effect of the model was favorable, with an AUC value of 0.95, specificity of 0.83, sensitivity of 0.96, and accuracy of 0.92 (Figure 6).
Figure 5.
Identification of parotid basal cell adenoma and Warthin tumor by multiple parameters of CT texture analysis with ROC curves.
Figure 6.
The ROC curve of the radiomics signature in differentiating the basal cell adenoma and Warthin tumor.
The rad-score was calculated as follows: Rad-score = 2.21203943475944×(Intercept) + (4.89166638004698 × original_glcm_Id) + (1.35617355250434 × wavelet_HLL_gldm_Dependence Non-Uniformity Normalized) + (4.68288775352499 × original_glcm_Joint Energy) + (−3.30766515475012 × original_glrlm_Long Run Emphasis) + (3.53064532937874 × wavelet_LLL_glcm_Joint Entropy) + (−1.08608003217607 × log_sigma_3_0_mm_3D_firstorder_Skewness).
The sensitivity, specificity and accuracy in the diagnosis of WT were 0.26, 0.87 and 0.45 for the non-subspecialized radiologist, and 0.85, 0.58 and 0.77 for the subspecialized radiologist, respectively.
Discussion
Our results indicated that the multiphase combined model had a better diagnostic performance than single-phase radiomics models in differentiating PA from WT and BCA, but it had similar diagnostic performance to the arterial phase radiomics signature model. Moreover, our results also showed that texture analysis based on venous phase parameters might be helpful to distinguish WT from BCA.
Although there is a potential risk associated with CT radiation dose, multiphase contrast-enhanced CT is still used as the preoperative method for the evaluation of PG tumors. Because parotid gland lesions have a variable enhancement pattern. In this study, PA frequently showed mild to moderate enhancement in the arterial phase (94/112, 84%) and gradual enhancement pattern (106/112, 95%); the majority of WT showed rapidly significant enhancement (33/53, 62%) and a washout enhancement pattern (41/53, 77%). Previous studies have also shown that WT may present delayed enhancement, while PA a quick expurgation, 21 which was consistent with our data. In addition, BCA not only showed the characteristics of rapidly significant enhancement in arterial (17/24, 71%), but also showed both gradual, platform and washout enhancement patterns. 22 The mechanisms behind these features have not been fully understood. We presumed that the different histologic constituents, cellularity, extracellular space, vascular density, and leaky blood vessels might partly account for the various patterns of enhancement. Thus, diagnosis of these tumors may be challenging considering that PA, BCA, and WT present similar imaging features.
Radiomics, a recently technique, for characterizing intratumoral heterogeneity by extracting high-throughput features from medical imaging. 23 Zhang et al 24 reported that texture analysis based on arterial-phase images could be used for the preoperative, non-invasive diagnosis of PA and WT. In this study, the 3D whole tumor radiomics analysis based on both plain CT and multiphase-enhanced CT images was conducted to differentiate PA, WT, and BCA. The heterogeneous area of the tumor, including cystic, necrotic, and hemorrhagic areas, may help in the categorization and differentiation of tumors. One point should be noted was that the clinical features of the training and test sets were preferable to maintain balance. However, gender distribution between training and test sets was different in this study. All patients were randomly divided into the training and test sets in a ratio of 7:3 in our cohort. The randomization process may be resulted in an unequal sex distribution. But we suggested that it might be more important to control for consistency in the distribution of imaging features compared to gender distribution in this study. We selected 7, 9, and 14 effective features from plain CT, arterial phase, and venous phase by 10-fold cross-validation LASSO regression, respectively. At the same time, we also selected 14 optimal features in the integrated model, including three plain CT features (Mean Absolute Deviation, Skewness, GLCM_IMC1), eight arterial phase features (10th percentile, Skewness, 90th percentile, GLRLM_Short Run Emphasis, GLDM_Small Dependence Emphasis, GLCM_Inverse Variance, GLCM_Correlation), and three venous phase features (GLDM_Dependence Variance, GLCM_Inverse Variance, GLDM_Dependence Non-Uniformity Normalized). Mean absolute deviation refers to the mean distance of all intensity values from the mean value of the image array. Short run emphasis measures the distribution of short run lengths, with a greater value indicative of more fine textural textures. GLDM features also reflect heterogeneity and homogeneity, with a greater value of small dependence emphasis indicating less homogeneous textures, and a lower value of dependence non-uniformity normalized indicating more homogeneous textures. GLCM features that depict the spatial relationship between two neighboring pixels or voxels with predefined gray-level intensities, can show intratumor heterogeneity and may be associated with tumor malignancy. 19 Zhang et al 24 reported that texture features, mainly comprised by GLCM, were able to distinguish benign from malignant PG tumors. Gabelloni et al 25 reported that GLCM could not only distinguish between benign and malignant PG tumors, but also PAs from malignant tumors or WTs. Therefore, the intratumor heterogeneity may be used to explain the results of this study. PA contains varying histologic components of epithelial, myxoid, and fibrous elements, while WT consists of epithelia, monomorphic, oncocytic components, and lymphoid stroma. 26 Moreover, BCA is a benign tumor comprised of a single layer of basal cell-like epithelial cells with a clear basement membrane-like structure, lacking the mesenchymal or cartilage-like matrix components which are often found in PA. 27 WT has high vascularity and capillary-like vessel networks, while solid type BCA has numerous endothelial-line vascular channels with prominent small capillaries and venules. 28 In addition, features including skewness, 10 percentile, and 90 percentile also showed good diagnostic performance. The differences of the 10th and 90th percentile also reflect the heterogeneity of tumor tissue composition or distribution. Skewness measures the asymmetry of the distribution of values about the Mean value. Higher skewness values have been related to tumor angiogenesis. 29–31 WT has higher vascularity and microvessel permeability, while BCA has a rich capillary networks and venules. Therefore, the skewness of arterial and venous phases may have higher diagnostic efficiency.
Our study showed that the radiomics models based on plain CT and multiphase-enhanced CT texture features had favorable predictive value in distinguishing PA from WT and BCA, and the integrated model presented the best efficacy, with AUC value of 0.94. Compared with the other three models, the AUC values of the integrated models were only statistically significant with that of the plain CT models in both training and test sets, but not with that of the arterial phase models. This may be related to the obvious early enhancement of WT and BCA in the arterial phase, while mild enhancement of PA was observed. The characteristics of arterial hemodynamics had an important role in distinguishing PA from WT and BCA. The texture features were consistent with the routine findings. Moreover, when we compared our results to those in recent publications evaluating MR imaging for the distinguishment of PA and WT, our study showed better diagnostic performance. 17
In addition, texture analysis based on the venous phase parameters had a higher value in differentiating WT from BCA. The reason may be that some BCAs show continuous enhancement in the venous phase, while WTs often demonstrate a washout pattern of enhancement. Inverse difference (ID) based on the vein phase texture parameter showed the best diagnostic performance. It is a measure of the uniformity of the local grayscale distribution of an image. The more uniform gray levels would result in a higher ID value. Nevertheless, the lower of the ID value, the more irregular texture are.
The predictive radiomics models in our study revealed that the arterial phase model and the integrated model resulted in higher sensitivity for the diagnosis of PA in comparison with the diagnosis made by both non-subspecialized and subspecialized head and neck radiologists, and higher than 0.46 specificity and 0.70 accuracy by the non-subspecialized radiologist, but similar to 0.77 specificity and 0.84 accuracy by subspecialized radiologist. Vernuccio et al 32 reported that radiomics model based on combination of all MRI sequences had higher sensitivity to distinguish PA from WT in comparison with the diagnosis made by both non-subspecialized and subspecialized radiologists, but without a benefit from the aspect of specificity. There were some consistencies with our findings. Furthermore, the model based on venous phase parameters showed higher sensitivity and accuracy in differentiating WT from BCA in comparison with the diagnosis made by both non-subspecialized and subspecialized radiologists. Interestingly, the specificity of the model was higher than subspecialized radiologists, but similar to non-subspecialized radiologists. Therefore, if our results are validated in an external large-cohort study, we believe that the enhanced CT-based radiomics analysis can improve the differentiation of PA, WT, and BCA, and help the clinicians in decision making.
Radiomics analysis may be affected by several factors, including random variations in scanner and acquisition protocol, reconstruction settings, and ROI segmentation. 33,34 If the data were from different manufacturers with different resolution and scan parameters, even different models of scanners from the same manufacturer, batch effect would exist and affect the value of the radiomics features, so data preprocessing such as z-score, combat and functional normalization and so on should be used to eliminate the batch effect first. However, these methods need further validation, because important textural information may be lost in this process. Manual segmentation has high interobserver variability, which can lead to the derivation of unstable radiomic features. Forde et al 35 demonstrated that 7 and 10% of features were regarded to be unstable for the auto-segmentation and manual delineations respectively, with the larger sample size of 40 participants. The intraclass correlation coefficient (ICC) was used to evaluate intraobserver and interobserver agreement. In this study, we only observed interobserver and intraobserver variability between the two radiologists, with an ICC > 0.75 indicating good agreement. We did not use higher ICC cutoff values or 95% confidence intervals for verification of the reproducibility of these results. This was a limitation of the present study. The solution is to increase the number of observers and expand the sample size, or to use an accurate automatic method in order to obtain more robust features. Overfitting or underfitting is an important concern for radiomics. In this study, we selected optimal features using LASSO regression analysis, and used 10-fold cross-validation procedures, which have been shown to reduce the overfitting to a certain extent. Underfitting is a scenario in data science when a model is unable to classify data correctly and generate a high error rate in both the training and the validation sets. 36 The consistency of AUC values on the training and test sets may indicate underfitting, and a good model should produce a small gap. In the present study, the training and test sets showed identical AUC values in the plain CT and integrated radiomics signature models, but with different confidence intervals. In addition, there were 7 and 14 optimal features in these two models, respectively, to reduce the underfitting caused by too few features, and the models had relatively high accuracy. Of course, such results remain to be further externally validated.
This study still has some limitations. First of all, it was a retrospective study that included a single institution and had a relatively small sample size, all of which may lead to bias. Secondly, the agreement analysis was conducted with about one-six of the study population, not all cases. Therefore, further multicenter studies with larger patient population are needed to validate the radiomics model. Thirdly, the present study used manual segmentation which was time consuming, and the results may be compromised due to interobserver variability. So, the generalizability of our results to other institutions and users is unclear. Fourthly, there was a lack of understanding on the potential relationship between specific texture features and histopathologic features. Finally, only three benign parotid tumors were analyzed, without expansion of analysis to other benign or malignant PG tumors. In order to increase the accuracy of diagnosis, it is planned to perform further studies to evaluate the radiomics analysis in other parotid gland lesions and try to apply other machine learning algorithms for data analysis in the near future.
Conclusion
In conclusion, our study demonstrated that CT-based radiomics analysis could be applied in differentiating PA, WT, and BCA. The integrated model had similar diagnostic performance to that of the arterial-phase radiomics model in differentiating PA from WT and BCA. Moreover, texture analysis based on the venous phase parameters had a higher value in differentiating WT from BCA. Therefore, the radiomics model may be taken as a non-invasive and valid tool to potentially complement conventional clinical imaging in clinical decision-making, although further validation is required before it can be widely used in clinical application.
Footnotes
Competing interests: The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding: The author(s) received no financial support for the research, authorship, and/or publication of this article.
Contributor Information
Fangfang Chen, Email: 728867965@qq.com.
Yaqiong Ge, Email: Yaqiong.Ge@ge.com.
Shuang Li, Email: 547602089@qq.com.
Mengqiu Liu, Email: 290534306@qq.com.
Jiaoyan Wu, Email: 825715848@qq.com.
Ying Liu, Email: canle6862@sina.com.
REFERENCES
- 1. Stenner M, Klussmann JP. Current update on established and novel biomarkers in salivary gland carcinoma pathology and the molecular pathways involved. Eur Arch Otorhinolaryngol 2009; 266: 333–41. doi: 10.1007/s00405-008-0882-7 [DOI] [PubMed] [Google Scholar]
- 2. Bussu F, Parrilla C, Rizzo D, Almadori G, Paludetti G, Galli J. Clinical approach and treatment of benign and malignant parotid masses, personal experience. Acta Otorhinolaryngol Ital 2011; 31: 135–43. [PMC free article] [PubMed] [Google Scholar]
- 3. Papadogeorgakis N, Skouteris CA, Mylonas AI, Angelopoulos AP. Superficial parotidectomy: technical modifications based on tumour characteristics. J Craniomaxillofac Surg 2004; 32: 350–53. doi: 10.1016/j.jcms.2004.05.004 [DOI] [PubMed] [Google Scholar]
- 4. Gnepp DR. Malignant mixed tumors of the salivary glands: a review. Pathol Annu 1993; 28 Pt 1: 279–328. [PubMed] [Google Scholar]
- 5. Lim YC, Lee SY, Kim K, Lee JS, Koo BS, Shin HA, et al. Conservative parotidectomy for the treatment of parotid cancers. Oral Oncol 2005; 41: 1021–27. doi: 10.1016/j.oraloncology.2005.06.004 [DOI] [PubMed] [Google Scholar]
- 6. Hellquist H, Paiva-Correia A, Vander Poorten V, Quer M, Hernandez-Prera JC, Andreasen S, et al. Analysis of the clinical relevance of histological classification of benign epithelial salivary gland tumours. Adv Ther 2019; 36: 1950–74. doi: 10.1007/s12325-019-01007-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Sun Q, Ma C, Dong M, Jiang M, Tao X. Effects of region of interest sizes on apparent diffusion coefficient measurements of pleomorphic adenoma, Warthin tumor, and normal parotid parenchyma. Quant Imaging Med Surg 2019; 9: 681–90. doi: 10.21037/qims.2019.04.11 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Xu Z, Chen M, Zheng S, Chen S, Xiao J, Hu Z, et al. Differential diagnosis of parotid gland tumours: application of SWI combined with DWI and DCE-MRI. Eur J Radiol 2022; 146: 110094: S0720-048X(21)00575-1. doi: 10.1016/j.ejrad.2021.110094 [DOI] [PubMed] [Google Scholar]
- 9. Choi DS, Na DG, Byun HS, Ko YH, Kim CK, Cho JM, et al. Salivary gland tumors: evaluation with two-phase helical CT. Radiology 2000; 214: 231–36. doi: 10.1148/radiology.214.1.r00ja05231 [DOI] [PubMed] [Google Scholar]
- 10. Al Ajmi E, Forghani B, Reinhold C, Bayat M, Forghani R. Spectral multi-energy CT texture analysis with machine learning for tissue classification: an investigation using classification of benign parotid tumours as a testing paradigm. Eur Radiol 2018; 28: 2604–11. doi: 10.1007/s00330-017-5214-0 [DOI] [PubMed] [Google Scholar]
- 11. Lambin P, Rios-Velazquez E, Leijenaar R, Carvalho S, van Stiphout RGPM, Granton P, et al. Radiomics: extracting more information from medical images using advanced feature analysis. Eur J Cancer 2012; 48: 441–46. doi: 10.1016/j.ejca.2011.11.036 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Gillies RJ, Kinahan PE, Hricak H. Radiomics: images are more than pictures, they are data. Radiology 2016; 278: 563–77. doi: 10.1148/radiol.2015151169 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Jethanandani A, Lin TA, Volpe S, Elhalawani H, Mohamed ASR, Yang P, et al. Exploring applications of radiomics in magnetic resonance imaging of head and neck cancer: a systematic review. Front Oncol 2018; 8: 131. doi: 10.3389/fonc.2018.00131 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Buch K, Fujita A, Li B, Kawashima Y, Qureshi MM, Sakai O. Using texture analysis to determine human papillomavirus status of oropharyngeal squamous cell carcinomas on CT. AJNR Am J Neuroradiol 2015; 36: 1343–48. doi: 10.3174/ajnr.A4285 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Riesterer O, Pruschy M, Bender S, Sharma A, Bogowicz M, Tanadini-Lang S, et al. Consolidation cetuximab after concurrent triplet radiochemotherapy+cetuximab in patients with advanced head and neck cancer: a randomized phase II study. Radiother Oncol 2020; 150: 62–69: S0167-8140(20)30334-0. doi: 10.1016/j.radonc.2020.06.011 [DOI] [PubMed] [Google Scholar]
- 16. Sheikh K, Lee SH, Cheng Z, Lakshminarayanan P, Peng L, Han P, et al. Predicting acute radiation induced xerostomia in head and neck cancer using Mr and CT radiomics of parotid and submandibular glands. Radiat Oncol 2019; 14: 131. doi: 10.1186/s13014-019-1339-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Liu Y, Zheng J, Lu X, Wang Y, Meng F, Zhao J, et al. Radiomics-based comparison of MRI and CT for differentiating pleomorphic adenomas and Warthin tumors of the parotid gland: a retrospective study. Oral Surg Oral Med Oral Pathol Oral Radiol 2021; 131: 591–99: S2212-4403(21)00042-0. doi: 10.1016/j.oooo.2021.01.014 [DOI] [PubMed] [Google Scholar]
- 18. Xu Y, Shu Z, Song G, Liu Y, Pang P, Wen X, et al. The role of preoperative computed tomography radiomics in distinguishing benign and malignant tumors of the parotid gland. Front Oncol 2021; 11: 634452. doi: 10.3389/fonc.2021.634452 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Shao S, Zheng N, Mao N, Xue X, Cui J, Gao P, et al. A triple-classification radiomics model for the differentiation of pleomorphic adenoma, Warthin tumour, and malignant salivary gland tumours on the basis of diffusion-weighted imaging. Clin Radiol 2021; 76: 472. S0009-9260(21)00090-8. doi: 10.1016/j.crad.2020.10.019 [DOI] [PubMed] [Google Scholar]
- 20. Zheng Y-M, Xu W-J, Hao D-P, Liu X-J, Gao C-P, Tang G-Z, et al. A CT-based radiomics nomogram for differentiation of lympho-associated benign and malignant lesions of the parotid gland. Eur Radiol 2021; 31: 2886–95. doi: 10.1007/s00330-020-07421-4 [DOI] [PubMed] [Google Scholar]
- 21. Xu Z-F, Yong F, Yu T, Chen Y-Y, Gao Q, Zhou T, et al. Different histological subtypes of parotid gland tumors: CT findings and diagnostic strategy. World J Radiol 2013; 5: 313–20. doi: 10.4329/wjr.v5.i8.313 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Jang M, Park D, Lee SR, Hahm CK, Kim Y, Kim Y, et al. Basal cell adenoma in the parotid gland: CT and Mr findings. AJNR Am J Neuroradiol 2004; 25: 631–35. [PMC free article] [PubMed] [Google Scholar]
- 23. Wei W, Liu Z, Rong Y, Zhou B, Bai Y, Wei W, et al. A computed tomography-based radiomic prognostic marker of advanced high-grade serous ovarian cancer recurrence: a multicenter study. Front Oncol 2019; 9: 255. doi: 10.3389/fonc.2019.00255 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Zhang D, Li X, Lv L, Yu J, Yang C, Xiong H, et al. Improving the diagnosis of common parotid tumors via the combination of CT image biomarkers and clinical parameters. BMC Med Imaging 2020; 20(1): 38. doi: 10.1186/s12880-020-00442-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Gabelloni M, Faggioni L, Attanasio S, Vani V, Goddi A, Colantonio S, et al. Can magnetic resonance radiomics analysis discriminate parotid gland tumors? A pilot study. Diagnostics (Basel) 2020; 10: 900: E900. doi: 10.3390/diagnostics10110900 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Woo SH, Choi D-S, Kim J, Park JJ, Joo YH, Chung P-S, et al. Two-Phase computed tomography study of Warthin tumor of parotid gland: differentiation from other parotid gland tumors and its pathologic explanation. J Comput Assist Tomogr 2013; 37: 518–24. doi: 10.1097/RCT.0b013e31828aede8 [DOI] [PubMed] [Google Scholar]
- 27. Takeshita T, Tanaka H, Harasawa A, Kaminaga T, Imamura T, Furui S. Ct and Mr findings of basal cell adenoma of the parotid gland. Radiat Med 2004; 22: 260–64. [PubMed] [Google Scholar]
- 28. Okahara M, Kiyosue H, Matsumoto S, Hori Y, Tanoue S, Uchida D, et al. Basal cell adenoma of the parotid gland: MR imaging findings with pathologic correlation. AJNR Am J Neuroradiol 2006; 27: 700–704. [PMC free article] [PubMed] [Google Scholar]
- 29. Association between tumor heterogeneity and progression-free survival in non-small cell lung cancer patients with EGFR mutations undergoing tyrosine kinase inhibitors therapy. Annu Int Conf IEEE Eng Med Biol Soc 2016; 2016: 1268–71. doi: 10.1109/EMBC.2016.7590937 [DOI] [PubMed] [Google Scholar]
- 30. Ganeshan B, Panayiotou E, Burnand K, Dizdarevic S, Miles K. Tumour heterogeneity in non-small cell lung carcinoma assessed by CT texture analysis: a potential marker of survival. Eur Radiol 2012; 22: 796–802. doi: 10.1007/s00330-011-2319-8 [DOI] [PubMed] [Google Scholar]
- 31. Makanyanga J, Ganeshan B, Rodriguez-Justo M, Bhatnagar G, Groves A, Halligan S, et al. Mri texture analysis (MRTA) of T2-weighted images in Crohn’s disease may provide information on histological and MRI disease activity in patients undergoing ileal resection. Eur Radiol 2017; 27: 589–97. doi: 10.1007/s00330-016-4324-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Vernuccio F, Arnone F, Cannella R, Verro B, Comelli A, Agnello F, et al. Diagnostic performance of qualitative and radiomics approach to parotid gland tumors: which is the added benefit of texture analysis? Br J Radiol 2021; 94(1128): 20210340. doi: 10.1259/bjr.20210340 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Meyer M, Ronald J, Vernuccio F, Nelson RC, Ramirez-Giraldo JC, Solomon J, et al. Reproducibility of CT radiomic features within the same patient: influence of radiation dose and CT reconstruction settings. Radiology 2019; 293: 583–91. doi: 10.1148/radiol.2019190928 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Berenguer R, Pastor-Juan MDR, Canales-Vázquez J, Castro-García M, Villas MV, Mansilla Legorburo F, et al. Radiomics of CT features may be nonreproducible and redundant: influence of CT acquisition parameters. Radiology 2018; 288: 407–15. doi: 10.1148/radiol.2018172361 [DOI] [PubMed] [Google Scholar]
- 35. Forde E, Leech M, Robert C, Herron E, Marignol L. Influence of inter-observer delineation variability on radiomic features of the parotid gland. Phys Med 2021; 82: 240–48: S1120-1797(21)00088-0. doi: 10.1016/j.ejmp.2021.01.084 [DOI] [PubMed] [Google Scholar]
- 36. Mayerhoefer ME, Materka A, Langs G, Häggström I, Szczypiński P, Gibbs P, et al. Introduction to radiomics. J Nucl Med 2020; 61: 488–95. doi: 10.2967/jnumed.118.222893 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.






