Ultrasound-Based Deep Learning Radiomics to Predict Cervical Lymph Node Metastasis in Major Salivary Gland Carcinomas

Huan-Zhong Su; Long-Cheng Hong; Zhi-Yong Li; Qiao-Min Fu; Yu-Hui Wu; Shao-Feng Wu; Zuo-Bing Zhang; Dao-Hui Yang; Xiao-Dong Zhang

doi:10.1016/j.identj.2025.103895

. 2025 Sep 12;75(6):103895. doi: 10.1016/j.identj.2025.103895

Ultrasound-Based Deep Learning Radiomics to Predict Cervical Lymph Node Metastasis in Major Salivary Gland Carcinomas

Huan-Zhong Su ^a,^†, Long-Cheng Hong ^a,^†, Zhi-Yong Li ^b, Qiao-Min Fu ^c, Yu-Hui Wu ^a, Shao-Feng Wu ^a, Zuo-Bing Zhang ^a, Dao-Hui Yang ^d,^⁎, Xiao-Dong Zhang ^a,^∗∗

PMCID: PMC12790077 PMID: 40945314

Abstract

Introduction and Aims

Cervical lymph node metastasis (CLNM) critically impacts surgery approaches, prognosis, and recurrence in patients with major salivary gland carcinomas (MSGCs). We aimed to develop and validate an ultrasound (US)-based deep learning (DL) radiomics model for noninvasive prediction of CLNM in MSGCs.

Methods

A total of 214 patients with MSGCs from 4 medical centers were divided into training (Centers 1-2, n = 144) and validation (Centers 3-4, n = 70) cohorts. Radiomics and DL features were extracted from preoperative US images. Following feature selection, radiomics score and DL score were constructed respectively. Subsequently, the least absolute shrinkage and selection operator (LASSO) regression was used to identify optimal features, which were then employed to develop predictive models using logistic regression (LR) and 8 machine learning algorithms. Model performance was evaluated using multiple metrics, with particular focus on the area under the receiver operating characteristic curve (AUC).

Results

Radiomics and DL scores showed robust performance in predicting CLNM in MSGCs, with AUCs of 0.819 and 0.836 in the validation cohort, respectively. After LASSO regression, 6 key features (patient age, tumor edge, calcification, US reported CLN-positive, radiomics score, and DL score) were selected to construct 9 predictive models. In the validation cohort, the models' AUCs ranged from 0.770 to 0.962. The LR model achieved the best performance, with an AUC of 0.962, accuracy of 0.886, precision of 0.762, recall of 0.842, and an F1 score of 0.8.

Conclusion

The composite model integrating clinical, US, radiomics, and DL features accurately noninvasively predicts CLNM preoperatively in MSGCs.

Clinical Relevance

CLNM in MSGCs is critical for treatment planning, but noninvasive prediction is limited. This study developed an US-based DL radiomics model to enable noninvasive CLNM prediction, supporting personalized surgery and reducing unnecessary interventions.

Keywords: Major salivary gland carcinomas, Ultrasound, Cervical lymph node metastasis, Radiomics, Deep learning

Introduction

Major salivary gland carcinomas (MSGCs) are rare head and neck malignancies, comprising 3-11% of all head and neck cancers.¹ With the rapid advancement of medical imaging technologies in recent years, the incidence of MSGCs has significantly increased due to the detection of more occult lesions. Surgical resection remains the cornerstone of treatment for MSGCs;² however, optimizing treatment strategies, particularly the selection of neck dissection techniques, remains challenging. The lack of uniform clinical standards and expert consensus has led to substantial variability in treatment protocols across institutions, compromising patient outcomes and complicating clinical decision-making.³

Cervical lymph node metastasis (CLNM) is a critical prognostic factor for MSGCs, with reported incidence rates ranging from 15% to 38%.⁴^,⁵ Beyond its impact on survival, CLNM is associated with a higher risk of local recurrence and severe complications (e.g., facial nerve palsy), significantly deteriorating patients' quality of life.⁶^,⁷ Existing evidence indicates that, regardless of histologic type, high-grade tumor histology and advanced T stage are independent predictors of CLNM.8, 9, 10 Clinically, the 5-year survival rate drops from 75% in patients without CLNM to 50% in those with CLNM,¹¹ underscoring the urgent need for preoperatively identifying high-risk patients to tailor surgical strategies and optimize the benefit of lymph node dissection.

Despite this urgency, current clinical diagnostic approaches for CLNM in MSGCs exhibit significant limitations. Clinical indicators such as physical examination and palpation are highly operator-dependent, often failing to detect minute CLNM. Traditional pathological metrics such as TNM staging and tumor histologic grade serve as robust prognostic indicators of CLNM, informing risk stratification and treatment decision-making in clinical practice.¹²^,¹³ However, their complete and precise assessment can only be performed postoperatively, which limits their utility in preoperative planning. Although fine-needle aspiration cytology (FNAC) is often utilized as a minimally invasive and safe method for evaluating lymph node status, cytological diagnoses are not always definitive due to sampling errors and heterogeneous CLN involvement.¹⁴ In contrast, imaging examinations, while more objective, also have limitations: computed tomography (CT) involves radiation exposure, and magnetic resonance imaging (MRI) is associated with higher costs.¹⁵ Ultrasound (US), as a cost-effective, radiation-free, and widely accessible imaging modality, serves as the preferred initial tool for evaluating salivary gland lesions and CLN pathologies.¹⁶^,¹⁷ However, its interpretation is highly dependent on the experience of radiologists, leading to significant inter-observer variability in identifying subtle CLN abnormalities, and there remains a marked paucity of research on CLNM prediction in MSGCs based on US features.

These limitations underscore the need for more reliable tools, and machine learning (ML)-based decision support systems are demonstrating great potential in this regard. As a core component of artificial intelligence (AI), ML algorithms provide precise predictive insights for individual patients by identifying complex patterns in existing data, thereby guiding optimal treatment strategies.¹⁸ Currently, AI technologies, including ML, are revolutionizing the field of dentistry and the diagnosis and management of head and neck tumors through the implementation of precise risk stratification and personalized treatment plans, highlighting their significant potential to overcome the limitations of traditional diagnostic methods.18, 19, 20, 21, 22

Radiomics, an emerging imaging analysis technique, enables high-throughput extraction of quantitative texture and morphological features from medical images, transforming qualitative imaging data into high-dimensional datasets that capture tumor heterogeneity imperceptible to the human eye.²³ Currently, radiomics has demonstrated promise in the diagnosis, staging, and prognosis prediction of head and neck cancers.24, 25, 26 Deep learning (DL), leveraging neural network architectures, automates the extraction of hierarchical image features and has shown preliminary efficacy in CLNM prediction.27, 28, 29 Deep transfer learning further addresses the challenge of small clinical datasets by fine-tuning pre-trained networks for new tasks,³⁰ offering a viable solution for MSGCs research with limited data. Notably, no consensus has been reached on whether DL inherently outperforms radiomics in prediction, suggesting that integrating radiomic and DL features may synergistically enhance predictive performance.

Despite these advancements, no prior study has integrated clinical data, US features, radiomic features, and DL features to develop a comprehensive risk prediction model for CLNM in MSGCs. This study aims to develop and validate an US-based deep learning radiomics composite model for noninvasive prediction of CLNM risk in MSGCs.

Materials and methods

Patients

A comprehensive search of the electronic databases from 4 medical centers (Center 1: The First Affiliated Hospital of Xiamen University; Center 2: Fujian Medical University Union Hospital; Center 3: The Affiliated Zhongshan Hospital of Xiamen University; Center 4: Xiamen Branch of Zhongshan Hospital, Fudan University) was conducted. Medical records of patients from January 2018 to August 2024 were retrospectively reviewed, and 1,068 patients diagnosed with MSGCs based on surgical and pathological records were identified. The exclusion criteria for this study were as follows: (1) undetermined histological type or primary tumor site (to maintain cohort homogeneity, as these affect metastatic potential and could confound models); (2) prior treatment before surgical resection (preoperative interventions alter both primary lesion and lymph node characteristics, biasing US-based feature analysis); (3) CLN status unconfirmed by pathological examination via excisional biopsy or neck dissection (pathology is the gold standard, ensuring reference standard accuracy for training); (4) lymph node evaluation based solely on FNAC (insufficient for micrometastases, compromising ground truth reliability); (5) distant metastases or undocumented metastatic status (to focus on cervical involvement, avoiding heterogeneity from distant disease or unclear status); (6) recurrent lesions (exhibit distinct biological and imaging characteristics, which may confound model generalization to de novo disease); (7) unavailability or poor quality of pretreatment US images (high-quality images are essential for reliable feature extraction and robust model development).

After applying these criteria, 214 patients with MSGCs were ultimately included, consisting of 128 males (mean age 52.16 ± 17.16 years; range 18-88 years) and 86 females (mean age 48.26 ± 17.26 years; range 22-85 years). Initial clinical and pathological information, such as age, sex, location, smoking history, pain symptoms, facial nerve function, clinical staging (T1-T2/T3-T4), and histological type, was retrieved from the medical records for each patient. Facial nerve function was evaluated using the House-Brackmann grading system.³¹ Subsequently, patients from Center 1 (89 cases) and Center 2 (55 cases) were included in the training cohort (144 cases), comprising 94 pathologically lymph node-negative (pN0) cases and 50 pathologically lymph node-positive (pN+) cases; while patients from Center 3 (46 cases) and Center 4 (24 cases) were assigned to the validation cohort (70 cases), including 51 pN0 cases and 19 pN+ cases. The patient recruitment process is illustrated in Figure 1A, and the study design and workflow are depicted in Figure 1B.

Fig 1 — Patient enrollment process and study design workflow. (A) Patient enrollment process of this study. (B) Study design and workflow. Note: MSGCs, major salivary gland carcinomas; pN0, pathologically lymph node-negative; pN+, pathologically lymph node-positive; ICC, intraclass correlation coefficient; mRMR, maximum relevance minimum redundancy; LASSO, the least absolute shrinkage and selection operator; ROC, receiver operating characteristic; DCA, decision curve analysis.

US assessment

All US examinations were performed by radiologists with 5 to 20 years of experience in superficial US evaluation, including the assessment of salivary glands, thyroid gland, and other relevant areas. Advanced diagnostic US devices were utilized, such as the Philips EPIQ 7 (Philips Ultrasound, Bothell, WA, USA), GE Logiq E9 (GE Healthcare, Wauwatosa, WI, USA), Philips IU22 (Philips Ultrasound, Bothell, WA, USA), Aixplorer ultrasound scanner (Supersonic Imagine, Aix-en-Provence, France), and Siemens Acuson Sequoia (Siemens Medical Solutions, Inc., Malvern, PA, USA), among others. These devices were equipped with high-resolution linear array probes with a frequency range of 6-18 MHz, ensuring optimal imaging clarity. Patients were positioned supine with their necks fully exposed and heads tilted to the opposite side, enabling a comprehensive scan of all sections within the parotid and submandibular gland regions. Subsequently, the clearest and most comprehensive ultrasound images were obtained and recorded. These images were stored digitally in Digital Imaging and Communications in Medicine (DICOM) format for subsequent evaluation.

The US assessment of lesions covered multiple aspects, including the number, maximum diameter, margin, edge, shape, echogenicity, heterogeneity, presence of cystic areas, calcification, internal vascularity (grade 0: no visible tumor vessels; grade 1: 1-2 isolated vessels; grade 2: 3-5 independently visible vessels; grade 3: >5 identifiable vessels),³² and vascular distributions (central, vascular signals were predominantly detected within the tumor mass; peripheral, vascular signals were primarily observed along the outer margins of the tumors; mixed, both peripheral and central vascular flows were present concurrently, indicating a combination of the 2 aforementioned patterns).³³ Additionally, the status of CLNs was evaluated. Suspicious CLNs typically appeared round, with a long-axis to short-axis ratio of less than 2, absence of a fatty hilum, increased echogenicity, calcification, ill-defined margins, and peripheral or mixed vascular patterns.³⁴ If any of these criteria were met, the US report would classify the CLN as positive. All US images were independently reviewed by Radiologist 1, who had 5 years of experience, and then verified by Radiologist 2, who had 10 years of experience, aiming to minimize potential biases. In cases where the 2 reviewers disagreed, a consensus was reached through collaborative discussions.

Interobserver agreement assessment of US features

The US features of all participants in this study were independently evaluated by 2 radiologists in a double-blind fashion. In order to precisely quantify the diagnostic consistency, an interobserver reliability analysis was conducted using Cohen's κ coefficient.³⁵ The levels of agreement are categorized as follows: a κ value ranging from 0.01 to 0.20 signifies slight agreement; a range of 0.21-0.40 indicates fair agreement; 0.41-0.60 represents moderate agreement; 0.61-0.80 implies substantial agreement; and 0.81-0.99 denotes almost perfect agreement. A κ value ≥0.60 is generally considered acceptable, indicating substantial or higher consistency between observers and ensuring the reliability of the evaluated features.

Image segmentation and feature extraction

All images were imported into ITK-SNAP software (Version 3.8; http://www.itksnap.org). Radiologist 1 manually delineated the region of interest (ROI) on the section with the maximum diameter of each lesion to define the lesion boundaries. One month later, 30 patients were randomly selected, and Radiologist 1 and Radiologist 2 redelineated the ROIs. The intraclass correlation coefficient (ICC) was calculated to assess the consistency of lesion delineation.

Radiomics features were extracted using the Pyradiomics package (https://github.com/Radiomics/pyradiomics) based on Python 3.7, following the guidelines of the Imaging Biomarker Standardization Initiative.³⁶ First, the images were resampled to a pixel size of 1mm×1mm×1mm via B-spline interpolation. Second, grayscale discretization of the histograms was performed with a fixed bin width of 25. Subsequently, original and filter-based features were extracted from each patient's images, encompassing 7 categories: logarithmic, exponential, gradient, square, square root, local binary pattern, and wavelet features. In total, 1239 quantitative radiomic features were extracted and grouped into 7 sets: first-order statistical features, shape features, gray-level co-occurrence matrix (GLCM) features, gray-level run-length matrix (GLRLM) features, gray-level size-zone matrix (GLSZM) features, gray-level dependence matrix (GLDM) features, and neighborhood gray-tone difference matrix (NGTDM) features.

DL features were extracted using the ResNet 50 network pre-trained on the ImageNet database (http://www.image-net.org), with all convolutional layers frozen to preserve pre-learned visual features. During model training, the training and validation sets were kept consistent, employing an initial learning rate of 0.01, 50 epochs, and a batch size of 32, with model parameters fine-tuned using the stochastic gradient descent optimizer. From the output of the average pooling layer of the adjusted ResNet 50 model, 2048 DL features were obtained from the ROI of each patient's US image. ResNet-50 was selected over shallower alternatives (ResNet-18/34) for its superior capacity to capture subtle pathological features in US images. Notably, its pre-training on ImageNet's 1.2 million diverse images was critical for mitigating overfitting risks—an essential consideration given our small dataset size. The batch size of 32, determined after testing smaller sizes (8, 16), balanced statistical representation (stabilizing gradients to avoid overfitting to US noise) and computational feasibility; smaller batches caused training instability with fluctuating validation metrics due to insufficient sample diversity, compromising extraction of subtle pathology-relevant features.

Fusion strategy

We adopted an early fusion strategy to integrate radiomics and DL features prior to final modeling, chosen for 3 key reasons: (1) it better suits our small dataset than end-to-end DL models (which demand large samples) by combining domain-specific radiomics features with data-driven DL features; (2) it preserves inter-feature correlations critical for CLNM detection, whereas late fusion (which merges outputs from separate models) risks losing such synergies; (3) it maintains interpretability through traceable feature contributions, a critical advantage for clinical translation.

Feature selection and construction of radiomics and DL scores

Consistent with the early fusion design, radiomics and DL features were processed separately to preserve their distinct characteristics while enabling subsequent integration. The feature selection process comprised 3 sequential steps: ICC analysis, application of the maximum relevance minimum redundancy (mRMR) algorithm, and implementation of the least absolute shrinkage and selection operator (LASSO) regression algorithm. First, features with an ICC >0.75 were kept and Z-score normalized. Then, the mRMR algorithm refined the selection on the training cohort. Next, LASSO regression with 10-fold cross-validation determined the most predictive features by identifying those with non-zero coefficients. Pearson correlation analysis was conducted to assess relationships both among radiomics features and among DL features. Finally, radiomics scores and DL scores for each patient were computed using a linear combination of the selected features, where each feature was weighted according to its corresponding coefficient obtained from the LASSO model.

Model construction and evaluation

To construct different predictive models, LASSO regression analysis was first conducted on the clinical features, US features, radiomics score, and DL score in the training cohort. This analysis aimed to identify the most relevant parameters for predicting CLNM in MSGCs. Subsequently, the effective parameters obtained from the screening process were utilized to construct 9 distinct predictive models, employing logistic regression (LR) algorithm along with 8 different ML algorithms: decision tree (DT), random forest (RF), adaptive boosting (AdaBoost), extreme gradient boosting (XGBoost), artificial neural network (ANN), support vector machine (SVM), Naive Bayes, and K-nearest neighbors (KNN). The hyperparameter settings for these various ML algorithms are detailed in Supplementary Table 1.

The performance of the models was evaluated by plotting the receiver operating characteristic (ROC) curve. The DeLong test was employed to compare the area under the curve (AUC) values of different models. In addition, metrics such as accuracy, precision, recall, F1 score, and the confusion matrix were utilized to further assess the predictive performance of the models on both the training and the validation cohorts. Through the comparison of predictive efficacy, the model with the best performance was identified. Meanwhile, the LR model was visualized as a nomogram. Finally, decision curve analysis (DCA) and clinical impact curve (CIC) were plotted to determine the clinical effectiveness of the optimal model.

Statistical analysis

Statistical analyses were conducted using IBM SPSS (Version 22.0; https://www.ibm.com/spss), R (Version 4.4.1; https://www.r-project.org), and Python (Version 3.7.0; https://www.python.org). All quantitative variables were first tested for normality using the Shapiro-Wilk test and for homogeneity of variances using Levene's test. For variables following a normal distribution, descriptive statistics were presented as mean ± standard deviation, whereas non-normally distributed variables were reported as median with range. Qualitative variables were analyzed using the Chi-square test and Fisher's exact test. In the case of quantitative variables, 1-way analysis of variance (ANOVA) was used for comparisons among 3 or more independent groups with normally distributed data and homogeneous variances, the Kruskal-Wallis H test was utilized for comparisons among 3 or more independent groups with non-normally distributed data, the independent samples t-test was used for comparisons between 2 independent groups with normally distributed data and homogeneous variances, and the Mann-Whitney U test was employed for comparisons between 2 independent groups with non-normally distributed data. A 2-sided P value < .05 was considered statistically significant. All necessary packages in the R environment could be downloaded from https://cran.r-project.org/web/packages/.

Results

Histological types of patients with MSGCs

The histologic types of MSGCs patients are shown in Supplementary Table 2. Among 214 MSGCs patients, there were 54 mucoepidermoid carcinoma, 39 adenoid cystic carcinoma, 24 lymphoepithelial carcinoma, 21 salivary duct carcinoma, 18 acinic cell carcinoma, 18 carcinoma ex pleomorphic adenoma, 16 adenocarcinoma not otherwise specified, and 24 others. A total of 69 patients (69/214, 32.24%) exhibited pN+. The highest proportion of pN+ was observed in adenocarcinoma not otherwise specified (11/16, 68.8%), followed by salivary duct carcinoma (12/21, 57.1%), lymphoepithelial carcinoma (9/24, 37.5%), and mucoepidermoid carcinoma (18/54, 33.3%). In contrast, adenoid cystic carcinoma and acinic cell carcinoma showed relatively lower pN+ proportions, at 8/39 (20.5%) and 2/18 (11.1%), respectively.

Clinical and US characteristics

Supplementary Table 3 summarizes the clinical and US characteristics of all patients in the study. Assessment of all US features demonstrated substantial interobserver agreement (κ values ranging from 0.748 to 0.867), which meets the acceptable level of consistency (κ ≥ 0.60). No significant differences were observed in any characteristics across the 4 medical centers (all P > .05). Similarly, no significant feature differences were noted between the training and validation cohorts (all P > .05; Supplementary Table 4). As shown in Table 1, significant differences existed between pN0 and pN+ groups in both the training and validation cohorts regarding patient age, tumor maximum diameter, facial nerve dysfunction, margin, edge, shape, calcification, vascularity, and US reported CLN-positive. Additionally, in the training cohort, smoking history, pain symptoms, clinical staging, tumor number, heterogeneity, and cystic areas also showed significant intergroup differences. For other characteristics, no statistical significance was detected (all P > .05).

Table 1.

Comparison of clinical and US characteristics between the pN0 and pN+ groups in the training and validation cohorts.

Characteristics	Training cohort			Validation cohort
Characteristics	pN0 (n = 94)	pN+ (n = 50)	P value	pN0 (n = 51)	pN+ (n = 19)	P value
Age, years	47.21 ± 15.54	59.52 ± 16.55	< .001^*	45.45 ± 18.25	57.58 ± 13.66	.011^*
sex (male/female)	50/44	34/16	.086^†	30/21	14/5	.253^†
Location (parotid/submandibular)	74/20	33/17	.096^†	41/10	15/4	.870^†
Smoking history (no/yes)	70/24	21/29	< .001^†	39/12	11/8	.126^†
Pain symptoms (no/yes)	82/12	26/24	< .001^†	40/11	11/8	.086^†
Facial nerve dysfunction (no/yes)	92/2	40/10	< .001^‡	49/2	15/4	.032^‡
Clinical staging (T1-T2/T3-T4)	88/6	26/24	< .001^†	45/6	15/4	.546^‡
Skin ulceration (no/yes)	94/0	49/1	.347^‡	51/0	18/1	.083^‡
Number (single/multiple)	89/5	39/11	.002^†	48/3	18/1	1.000^‡
Maximum diameter, cm	2.28 ± 0.89	3.81 ± 1.42	< .001^†	2.46 ± 1.18	3.44 ± 1.29	.004^†
Margin (clear/unclear)	72/22	15/35	< .001^†	39/12	7/12	.002^†
Edge (smooth/rough)	65/29	8/42	< .001^†	36/15	7/12	.010^†
Shape (regular/irregular)	50/44	8/42	< .001^†	23/28	3/16	.025^†
Echogenicity (hypoechoic/isoechoic or hyperechoic)	92/2	48/2	.906^‡	50/1	19/0	1.000^‡
Heterogeneity (homogeneous/heterogeneous)	26/68	5/45	.014^†	9/42	2/17	.720^‡
Cystic areas (no/yes)	52/42	39/11	.007^†	36/15	12/7	.552^†
Calcification (no/yes)	84/10	31/19	< .001^†	47/4	13/6	.011^‡
Vascularity (grade 0-1/grade 2-3)	49/45	10/40	< .001^†	32/19	5/14	.007^†
Vascular distributions (peripheral/central/mixed)	43/13/38	13/7/30	.052^†	21/6/24	8/3/8	.910^‡
US reported CLN-positive (no/yes)	85/9	30/20	< .001^†	44/7	10/9	.003^†

Open in a new tab

US, ultrasound; pN0, pathological lymph node-negative; pN+, pathological lymph node-positive; CLN, cervical lymph node.

^⁎

Independent samples t-test test.

^†

Chi-square test.

^‡

Fisher’s exact test. P values with statistically significant differences (P < 0.05) are highlighted in bold.

Feature extraction and selection

A total of 1239 radiomics features and 2048 DL features were extracted from the US images of each patient. After excluding features with poor reproducibility (ICC < 0.75), 953 radiomics features and 1618 DL features were retained. Subsequently, the mRMR algorithm was applied to further select 50 radiomics features and 50 DL features, respectively. Finally, 11 radiomics features and 39 DL features were identified via LASSO regression analysis for constructing the radiomics score and DL score (Supplementary Figure 1). The Pearson correlation results showed that the correlation coefficients among the radiomics features and those among the DL features were each below 0.5 (P > .05; Supplementary Figure 2).

Radiomics and DL scores construction

The features selected by LASSO regression and their coefficients are presented in Figures 2A, 2B and Supplementary Tables 5 and 6. The radiomics and DL scores were calculated using the following formulas, where $c o e f f i c i e n t_{i}$ denotes the risk coefficient of each feature derived from the LASSO model and $F e a t u r e_{i}$ represents the expression value of each feature (11 selected radiomics features and 39 selected DL features, respectively): (1) radiomics score: $- 0.919 + \sum_{i = 1}^{n} (c o e f f i c i e n t_{i} \times F e a t u r e_{i})$ ; (2) DL score: $- 1.792 + \sum_{i = 1}^{n} (c o e f f i c i e n t_{i} \times F e a t u r e_{i})$ . Significant differences in both scores were observed between pN0 and pN+ groups in the training and validation cohorts (P < .001; Figures 2C and 2D). ROC curve analysis revealed AUCs of 0.895 and 0.819 for the radiomics score in the training and validation cohorts (Figure 3A), while the DL score achieved AUCs of 0.937 and 0.836 in the training and validation cohorts, respectively (Figure 3B).

Fig 2 — After dimension reduction by mRMR and LASSO regression, the distribution of radiomics and deep learning features and coefficients, as well as the construction of radiomics and deep learning scores. (A, B) Distribution of radiomics (A) and deep learning (B) features and coefficients. (C, D) Distribution of radiomics scores (C) and deep learning scores (D) in the training and validation cohorts. Note: mRMR, maximum relevance minimum redundancy; LASSO, the least absolute shrinkage and selection operator. ^⁎⁎⁎Statistically significant at P < 0.001 as determined by the Mann-Whitney U test.

Fig 3 — ROC curves for radiomics and deep learning scores. (A) ROC curves for radiomics scores in the training and validation cohorts. (B) ROC curves for deep learning scores in the training and validation cohorts. Note: ROC, receiver operating characteristic; AUC, area under the curve.

Model construction and evaluation

LASSO regression analysis of clinical and US features, radiomics score, and DL score in the training cohort identified 6 optimal predictors (Figures 4A and 4B): patient age, tumor edge, calcification, US reported CLN-positive, radiomics score, and DL score. Leveraging these features, predictive models were constructed using LR and 8 ML algorithms. In the validation cohort, the AUC values of different models ranged from 0.770 to 0.962. The LR model exhibited the optimal predictive performance, with an AUC of 0.962, accuracy of 0.886, precision of 0.762, recall of 0.842, and an F1 score of 0.8 (Figures 4C and 4D). A comparison of AUC values across models in the validation cohort is shown in Supplementary Figure 3, while predictive performance metrics for all models in both training and validation cohorts are illustrated in Figure 5. Corresponding confusion matrices for each model in the validation cohort are presented in Supplementary Figure 4.

Fig 4 — Feature variable selection and ROC curves of different models. (A, B) 6 features with non-zero coefficients were selected through LASSO regression; (C) ROC curves for the training cohort; (D) ROC curves for the validation cohort. Note: LASSO, least absolute shrinkage and selection operator; ROC, receiver operating characteristic; AUC, area under the curve; LR, logistic regression; DT, decision tree; RF, random forest; AdaBoost, adaptive boosting; XGBoost, extreme gradient boosting; ANN, artificial neural network; SVM, support vector machine; KNN, K-nearest neighbors.

Fig 5 — Predictive performance of different models. (A-D) Predictive performance of different models in the training cohort. (E-H) Predictive performance of different models in the validation cohort. Note: LR, logistic regression; DT, decision tree; RF, random forest; AdaBoost, adaptive boosting; XGBoost, extreme gradient boosting; ANN, artificial neural network; SVM, support vector machine; KNN, K-nearest neighbors.

The multivariate regression results of the LR model are shown in Figure 6A, and the LR model was visualized as a nomogram (Figure 6B). The nomogram assigns a score to each relevant feature according to its specific value, with higher scores indicating a greater contribution to the risk of CLNM. By summing these individual feature scores, a total score is generated, which directly corresponds to the probability of CLNM. DCA demonstrated that within most reasonable threshold probability ranges in both the training and validation cohorts, using the nomogram to predict CLNM in MSGCs provided greater clinical benefit to patients compared with strategies of “considering all patients as pN0” or “considering all patients as pN+” (Supplementary Figure 5A). The results of the CIC further confirmed that across most threshold probability intervals, the nomogram effectively stratified patients into high-risk groups (blue curve) and matched the actual risk stratification of the population (light blue curve), demonstrating high clinical utility (Supplementary Figures 5B and 5C).

Fig 6 — Results of multivariate regression analysis and construction of nomogram. (A) The multivariate regression results of the LR model. (B) Nomogram for the LR model. Note: logistic regression.

Discussion

The presence of CLNM significantly impacts the prognosis of patients with MSGCs by influencing surgical approach selection, recurrence risk, and ultimately patient survival rates.⁶^,⁷^,¹¹ Therefore, the ability to accurately predict CLNM is critical for tailoring personalized treatment strategies and potentially improving patient outcomes. However, current routine imaging techniques struggle to reliably characterize CLNM, with diagnostic accuracy heavily dependent on radiologists' experience. This underscores the need for novel, more precise methods to identify CLNM, such as radiomics and DL-based approaches. Notably, no prior studies have explored the use of radiomics or DL features for predicting CLNM in MSGCs. This study innovatively developed and validated a composite model integrating clinical and US data, radiomics, and DL features for noninvasive prediction of CLNM in MSGCs. Our findings demonstrate that the LR model incorporating clinical, US, radiomics, and DL features exhibited superior and optimal predictive performance in the validation cohort, achieving an AUC of 0.962—outperforming other ML algorithms. Additionally, DCA and CIC further validated the clinical utility of the model, demonstrating favorable net benefits across a broad range of threshold probabilities.

Through a analysis of clinical characteristics, our study identified that CLNM in MSGCs is significantly associated with several key factors: older patient age, a history of smoking, the presence of pain symptoms, facial nerve dysfunction, and clinical T3/T4 tumor staging. These findings are consistent with the majority of previous reports,8, 9, 10^,³⁷^,³⁸ reinforcing established clinical risk factors. Although previous studies have indicated that male patients are associated with a higher risk of CLNM,¹⁹ this correlation was not statistically significant in our cohort, potentially due to sample size limitations or distribution bias. Additionally, our pathological analysis revealed that among the 69 patients with pN+ (32.2%), mucoepidermoid carcinoma (18/69, 26.1%), salivary duct carcinoma (12/69, 17.4%), and adenocarcinoma not otherwise specified (11/69, 15.9%) were the most frequently encountered metastatic subtypes. However, assessment of metastatic propensity within specific subtypes identified adenocarcinoma not otherwise specified (11/16, 68.8%), salivary duct carcinoma (12/21, 57.1%), and lymphoepithelial carcinoma (9/24, 37.5%) as having the highest rates of CLNM, findings consistent with prior studies highlighting their inherent aggressive potential for nodal spread.³⁹^,⁴⁰ Thus, these findings suggest that selective neck dissection may be beneficial for patients with these histopathological subtypes.

In our analysis of US characteristics, MSGCs patients exhibiting multiple lesions, larger maximum tumor diameter, unclear margins, rough edges, irregular shape, heterogeneous internal echoes, absence of cystic areas, presence of calcification, grade 2-3 vascularity, and US reported CLN-positive were found to be more likely to develop CLNM. Notably, these US features demonstrated excellent interobserver consistency, collectively reflecting the tumor's aggressive biological behavior. Multiple lesions and a larger maximum tumor diameter indicate extensive tumor spread and prolonged growth duration, increasing the likelihood of lymphatic vessel invasion.⁴¹ Unclear margins, rough edges, and irregular shapes suggest an invasive growth pattern, enabling tumor cells to penetrate surrounding tissues and lymphatic vessels.⁴² Heterogeneous internal echoes and the absence of cystic areas reflect complex tumor composition and high cellular density, often indicating a higher histological grade.⁴³ Calcification may signal abnormal tumor metabolism or necrosis, while abundant vascularity (grade 2-3) reflects robust tumor angiogenesis, which not only provides nutrients for tumor growth but also pathways for metastasis.⁴⁴

Recent breakthroughs in radiomics and DL have unlocked novel pathways for radiologists, facilitating non-invasive assessment of hidden features and achieving more accurate tumor profiling.²³^,²⁷ Our study demonstrated the efficacy of these techniques in predicting CLNM in MSGCs, with the radiomics score achieving an AUC of 0.895 in the training cohort and 0.819 in the validation cohort. Similarly, the DL score reached AUC values of 0.937 and 0.836 in the training and validation cohorts, respectively.

While various individual features have been associated with CLNM in MSGCs, the current trend towards precision medicine underscores the necessity of integrating multi-dimensional parameters for comprehensive tumor evaluation. Leveraging the LASSO regression, we identified 6 optimal features, including 1 clinical characteristic (patient age), 3 US features (tumor edge, calcification, and US reported CLN-positive), the radiomics score, and the DL score. LASSO regression-based feature selection reduces model complexity, mitigates overfitting, and enhances interpretability by identifying key predictors.⁴⁵ ML is capable of automatically identifying complex patterns and relationships within datasets, thereby enabling efficient model construction and optimization.⁴⁶ In this study, we developed predictive models using LR and 8 ML algorithms. Significantly, the LR model outperformed other ML models in the validation cohort, achieving an AUC of 0.962, accuracy of 0.886, precision of 0.762, recall of 0.842, and an F1 score of 0.8. The optimal performance of the LR model might be attributed to its simplicity, which reduces the risk of overfitting in this dataset, combined with the strong linear separability of the input features.⁴⁷ Meanwhile, we visualized the LR model as a nomogram, which enhances the interpretability and clinical applicability of the prediction model for CLNM risk assessment. Subsequent clinical utility analyses further validated the LR model's practical value. DCA demonstrated that the LR model provided greater net benefits across a wide range of threshold probabilities in both the training and validation sets, outperforming strategies of "treat all" and "treat none." Additionally, CIC analysis confirmed significant net clinical benefits at realistic risk thresholds, underscoring the model's potential for clinical translation and patient management.

This study has certain limitations. First, due to the retrospective research design and the rarity of MSGCs, the sample size is small, which may lead to selection bias. Second, this retrospective study only used conventional US data because the data of multimodal imaging (such as US elastography, contrast-enhanced US, CT, MRI and PET) are incomplete. We believe that further research combining multimodal imaging techniques can show higher predictive performance. Third, defining the boundaries of the ROI may introduce the subjectivity of researchers. In the future, we plan to use DL technology to automatically identify and delineate the ROI and intend to carry out prospective, multicenter studies to further verify the model we proposed. Fourth, the validation dataset is imbalanced in the distribution of pN0 vs. pN+ cases, which could bias the model toward the majority class and compromise performance metrics such as precision and recall. No mitigation techniques (e.g., resampling, class weighting, or use of balanced accuracy) were applied, and this limitation requires further addressing in future research. Additionally, although the proposed model performs well in predicting CLNM of MSGCs, its current implementation requires a multi-step computational workflow (see the methods section for details). To better facilitate clinical translation, the integration of dedicated software is crucial for streamlining the model's accessibility in routine clinical use.

Conclusion

In conclusion, our study demonstrates that the comprehensive model integrating clinical and US data, radiomics, and DL features holds significant clinical potential in predicting CLNM of MSGCs. This AI-driven approach provides clinicians with a non-invasive preoperative diagnostic tool, enabling more accurate prediction of CLNM, thus promoting personalized surgical planning and is expected to significantly optimize treatment strategies.

Authorship contribution statement

Huan-Zhong Su and Long-Cheng Hong contributed to design of the study, acquisition, analysis, interpretation of data, drafting of the article, and review of submitted version of the manuscript and supervision. Zhi-Yong Li and Qiao-Min Fu contributed to acquisition, analysis, interpretation of data. Yu-Hui Wu and Shao-Feng Wu contributed to acquisition and interpretation of data. Zuo-Bing Zhang, Dao-Hui Yang and Xiao-Dong Zhang contributed to acquisition, analysis, interpretation of data, and review of submitted version of the manuscript.

Ethical approval

All procedures involving human participants were conducted in accordance with the ethical standards of the institutional and/or national research committee as well as the 1964 Helsinki Declaration and its subsequent amendments or comparable ethical guidelines, and this study was approved by the Ethics Committees of the First Affiliated Hospital of Xiamen University, Xiamen, Fujian, China (approval number: 2025-003).

Informed consent

The need for informed consent was waived because of the retrospective nature of this study.

Funding

This study was financially supported by the Fujian Provincial Natural Science Foundation of China (No. 2023J011617, No. 2022J011366), the Natural Science Foundation of Xiamen, China (No. 3502Z202372078), and the Xiamen Medical and Health Guidance Project (No. 3502Z20214ZD1010, No. 3502Z20244ZD1018). The funders did not influence the study design, data collection, analysis, decision to publish or preparation of the manuscript.

Conflict of interest

None disclosed.

Footnotes

Supplementary material associated with this article can be found in the online version at doi:10.1016/j.identj.2025.103895.

Contributor Information

Dao-Hui Yang, Email: yang.daohui@zsxmhospital.com.

Xiao-Dong Zhang, Email: zxdon11@163.com.

Appendix. Supplementary materials

mmc1.docx^{(2.7MB, docx)}

References

1.Boukheris H., Curtis R.E., Land C.E., Dores GM. Incidence of carcinoma of the major salivary glands according to the WHO classification, 1992 to 2006: a population-based study in the United States. Cancer Epidemiol Biomarkers Prev. 2009;18:2899–2906. doi: 10.1158/1055-9965.EPI-09-0638. [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Kwon M., Lee J.I., Roh J.L., et al. Second cancer incidence and risk factors in patients with salivary gland cancers. JAMA Otolaryngol Head Neck Surg. 2014;140:118–123. doi: 10.1001/jamaoto.2013.6149. [DOI] [PubMed] [Google Scholar]
3.Nobis C.P., Rohleder N.H., Wolff K.D., Wagenpfeil S., Scherer E.Q., Kesting MR. Head and neck salivary gland carcinomas–elective neck dissection, yes or no? J Oral Maxillofac Surg. 2014;72:205–210. doi: 10.1016/j.joms.2013.05.024. [DOI] [PubMed] [Google Scholar]
4.Shi X., Liu X.K., An C.M., et al. Anatomic extent of lymph node metastases as an independent prognosticator in node-positive major salivary gland carcinoma: a study of the US SEER database and a Chinese multicenter cohort. Eur J Surg Oncol. 2019;45:2143–2150. doi: 10.1016/j.ejso.2019.06.029. [DOI] [PubMed] [Google Scholar]
5.Ettl T., Gosau M., Brockhoff G., et al. Predictors of cervical lymph node metastasis in salivary gland cancer. Head Neck. 2014;36:517–523. doi: 10.1002/hed.23332. [DOI] [PubMed] [Google Scholar]
6.Fang Q., Zhang X., Dai L., Luo R., Yuan J. Association between factor of parotid lymph node and prognosis in parotid cancer. Eur J Surg Oncol. 2023;49:1405–1410. doi: 10.1016/j.ejso.2023.04.002. [DOI] [PubMed] [Google Scholar]
7.Han X., Yang C., Tan X., Li Y. The 8th AJCC classification is inferior to a new neck stage based on intraparotid lymph node in parotid gland cancer. BMC Oral Health. 2024;24:590. doi: 10.1186/s12903-024-04346-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Wang Y.L., Li D.S., Gan H.L., et al. Predictive index for lymph node management of major salivary gland cancer. Laryngoscope. 2012;122:1497–1506. doi: 10.1002/lary.23227. [DOI] [PubMed] [Google Scholar]
9.Jinnin T., Kawata R., Higashino M., Nishikawa S., Terada T., Haginomori SI. Patterns of lymph node metastasis and the management of neck dissection for parotid carcinomas: a single-institute experience. Int J Clin Oncol. 2019;24:624–631. doi: 10.1007/s10147-019-01411-3. [DOI] [PubMed] [Google Scholar]
10.Asarkar A., Chang B.A., Nathan C.O. When should elective neck dissection be performed for parotid gland malignancy? Laryngoscope. 2021;131:2164–2165. doi: 10.1002/lary.29263. [DOI] [PubMed] [Google Scholar]
11.Xiao C.C., Zhan K.Y., White-Gilbertson S.J., Day TA. Predictors of nodal metastasis in parotid malignancies: a national cancer data base study of 22,653 patients. Otolaryngol Head Neck Surg. 2016;154:121–130. doi: 10.1177/0194599815607449. [DOI] [PubMed] [Google Scholar]
12.Qian K., Sun W., Guo K., et al. The number and ratio of positive lymph nodes are independent prognostic factors for patients with major salivary gland cancer: results from the surveillance, epidemiology, and end results dataset. Eur J Surg Oncol. 2019;45:1025–1032. doi: 10.1016/j.ejso.2018.11.008. [DOI] [PubMed] [Google Scholar]
13.Wang W., Yi S., An P.G., Zhang Z.Q., Lyu X.M., Zhang J. Patterns of lymph node metastasis and treatment outcomes of parotid gland malignancies. BMC Oral Health. 2025;25:286. doi: 10.1186/s12903-025-05561-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Ren Y., Shi Y., Zhou Z., et al. Ultrasonographic and cytological characterization of ultrasound-guided fine-needle aspiration cytology of cervical lymph nodes for false-negative and false-positive diagnosis. Eur Arch Otorhinolaryngol. 2023;280:2463–2470. doi: 10.1007/s00405-022-07802-z. [DOI] [PubMed] [Google Scholar]
15.Junn J.C., Soderlund K.A., Glastonbury CM. Imaging of head and neck cancer with CT, MRI, and US. Semin Nucl Med. 2021;51:3–12. doi: 10.1053/j.semnuclmed.2020.07.005. [DOI] [PubMed] [Google Scholar]
16.Johnson F., Bozzato A., Mansour N., et al. Sonography of salivary gland tumors and disorders. Ultraschall Med. 2025;46:318–344. doi: 10.1055/a-2481-7248. [DOI] [PubMed] [Google Scholar]
17.Deng Z., Liu X., Wu R., et al. Ultrasound-based radiomics machine learning models for diagnosing cervical lymph node metastasis in patients with non-small cell lung cancer: a multicentre study. BMC Cancer. 2024;24:536. doi: 10.1186/s12885-024-12306-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Samaranayake L., Tuygunov N., Schwendicke F., et al. The transformative role of artificial intelligence in dentistry: a comprehensive overview. part 1: fundamentals of AI, and its contemporary applications in dentistry. Int Dent J. 2025;75:383–396. doi: 10.1016/j.identj.2025.02.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Costantino A., Canali L., Festa B.M., Kim S.H., Spriano G., De Virgilio A. Development of machine learning models to predict lymph node metastases in major salivary gland cancers. Eur J Surg Oncol. 2023;49 doi: 10.1016/j.ejso.2023.06.017. [DOI] [PubMed] [Google Scholar]
20.Mäkitie A.A., Alabi R.O., Ng S.P., et al. Artificial intelligence in head and neck cancer: a systematic review of systematic reviews. Adv Ther. 2023;40:3360–3380. doi: 10.1007/s12325-023-02527-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Mahmood H., Shaban M., Rajpoot N., Khurram SA. Artificial Intelligence-based methods in head and neck cancer diagnosis: an overview. Br J Cancer. 2021;124:1934–1940. doi: 10.1038/s41416-021-01386-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Alabi R.O., Elmusrati M., Leivo I., Almangush A., Mäkitie AA. Artificial intelligence-driven radiomics in head and neck cancer: current status and future prospects. Int J Med Inform. 2024;188 doi: 10.1016/j.ijmedinf.2024.105464. [DOI] [PubMed] [Google Scholar]
23.Qi Y.J., Su G.H., You C., et al. Radiomics in breast cancer: Current advances and future directions. Cell Rep Med. 2024;5 doi: 10.1016/j.xcrm.2024.101719. [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Li Q., Jiang T., Zhang C., et al. A nomogram based on clinical information, conventional ultrasound and radiomics improves prediction of malignant parotid gland lesions. Cancer Lett. 2022;527:107–114. doi: 10.1016/j.canlet.2021.12.015. [DOI] [PubMed] [Google Scholar]
25.Wang Z., Fang M., Zhang J., et al. Radiomics and deep learning in nasopharyngeal carcinoma: a review. IEEE Rev Biomed Eng. 2024;17:118–135. doi: 10.1109/RBME.2023.3269776. [DOI] [PubMed] [Google Scholar]
26.Lin C.H., Yan J.L., Yap W.K., et al. Prognostic value of interim CT-based peritumoral and intratumoral radiomics in laryngeal and hypopharyngeal cancer patients undergoing definitive radiotherapy. Radiother Oncol. 2023;189 doi: 10.1016/j.radonc.2023.109938. [DOI] [PubMed] [Google Scholar]
27.Wang W., Liang H., Zhang Z., et al. Comparing three-dimensional and two-dimensional deep-learning, radiomics, and fusion models for predicting occult lymph node metastasis in laryngeal squamous cell carcinoma based on CT imaging: a multicentre, retrospective, diagnostic study. EClinicalMedicine. 2024;67 doi: 10.1016/j.eclinm.2023.102385. [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Lan T., Kuang S., Liang P., et al. MRI-based deep learning and radiomics for prediction of occult cervical lymph node metastasis and prognosis in early-stage oral and oropharyngeal squamous cell carcinoma: a diagnostic study. Int J Surg. 2024;110:4648–4659. doi: 10.1097/JS9.0000000000001578. [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Zhou L.Q., Zeng S.E., Xu J.W., et al. Deep learning predicts cervical lymph node metastasis in clinically node-negative papillary thyroid carcinoma. Insights Imaging. 2023;14:222. doi: 10.1186/s13244-023-01550-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
30.Toseef M., Olayemi Petinrin O., Wang F., et al. Deep transfer learning for clinical decision-making based on high-throughput data: comprehensive survey with benchmark results. Brief Bioinform. 2023;24 doi: 10.1093/bib/bbad254. [DOI] [PubMed] [Google Scholar]
31.House J.W., Brackmann DE. Facial nerve grading system. Otolaryngol Head Neck Surg. 1985;93:146–147. doi: 10.1177/019459988509300202. [DOI] [PubMed] [Google Scholar]
32.Adler D.D., Carson P.L., Rubin J.M., Quinn-Reid D. Doppler ultrasound color flow imaging in the study of breast cancer: preliminary findings. Ultrasound Med Biol. 1990;16:553–559. doi: 10.1016/0301-5629(90)90020-d. [DOI] [PubMed] [Google Scholar]
33.Zhou J., Yin L., Wei X., et al. 2020 Chinese guidelines for ultrasound malignancy risk stratification of thyroid nodules: the C-TIRADS. Endocrine. 2020;70:256–279. doi: 10.1007/s12020-020-02441-y. [DOI] [PubMed] [Google Scholar]
34.de Koekkoek-Doll P.K., Roberti S., van den Brekel M.W., et al. Value of assessing peripheral vascularization with micro-flow imaging, resistive index and absent hilum sign as predictor for malignancy in lymph nodes in head and neck squamous cell carcinoma. Cancers (Basel) 2021;13 doi: 10.3390/cancers13205071. [DOI] [PMC free article] [PubMed] [Google Scholar]
35.Vanacore A., Pellegrino MS. Robustness of κ -type coefficients for clinical agreement. Stat Med. 2022;41:1986–2004. doi: 10.1002/sim.9341. [DOI] [PMC free article] [PubMed] [Google Scholar]
36.Zwanenburg A., Vallières M., Abdalah M.A., et al. The image biomarker standardization initiative: standardized quantitative Radiomics for high-throughput image-based phenotyping. Radiology. 2020;295:328–338. doi: 10.1148/radiol.2020191145. [DOI] [PMC free article] [PubMed] [Google Scholar]
37.Stodulski D., Mikaszewski B., Majewska H., Wiśniewski P., Stankiewicz C. Probability and pattern of occult cervical lymph node metastases in primary parotid carcinoma. Eur Arch Otorhinolaryngol. 2017;274:1659–1664. doi: 10.1007/s00405-016-4407-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
38.Voora R.S., Panuganti B., Califano J., Coffey C., Guo T. Patterns of lymph node metastasis in parotid cancer and implications for extent of neck dissection. Otolaryngol Head Neck Surg. 2023;168:1067–1078. doi: 10.1002/ohn.167. [DOI] [PubMed] [Google Scholar]
39.Vedula S., Shah Y.S., Barinsky G.L., Baredes S., Park RCW. Utility of elective neck dissection in clinically node-negative parotid malignancy. Otolaryngol Head Neck Surg. 2023;169:917–927. doi: 10.1002/ohn.264. [DOI] [PubMed] [Google Scholar]
40.Kucharska E., Rzepakowska A., Żurek M., et al. Oncologic outcomes of the most prevalent major salivary gland cancers: retrospective cohort study from single center. Eur Arch Otorhinolaryngol. 2024;281:4305–4313. doi: 10.1007/s00405-024-08650-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
41.Choi S.Y., Lee E., Kim E., et al. Clinical outcomes of bulky parotid gland cancers: need for self-examination and screening program for early diagnosis of parotid tumors. BMC Cancer. 2021;21:178. doi: 10.1186/s12885-021-07902-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
42.Bae M.S., Shin S.U., Song S.E., Ryu H.S., Han W., Moon WK. Association between US features of primary tumor and axillary lymph node metastasis in patients with clinical T1-T2N0 breast cancer. Acta Radiol. 2018;59:402–408. doi: 10.1177/0284185117723039. [DOI] [PubMed] [Google Scholar]
43.Brandwein M.S., Ivanov K., Wallace D.I., et al. Mucoepidermoid carcinoma: a clinicopathologic study of 80 patients with special reference to histological grading. Am J Surg Pathol. 2001;25:835–845. doi: 10.1097/00000478-200107000-00001. [DOI] [PubMed] [Google Scholar]
44.Gao P., Ren G., Liang J., Liu J. STAT6 upregulates NRP1 expression in endothelial cells and promotes angiogenesis. Front Oncol. 2022;12 doi: 10.3389/fonc.2022.823377. [DOI] [PMC free article] [PubMed] [Google Scholar]
45.Muthukrishnan R., Rohini R. LASSO: A feature selection technique in predictive modeling for machine learning. 2016 IEEE International Conference on Advances in Computer Applications (ICACA) 2016:18–20. [Google Scholar]
46.Fountzilas E., Pearce T., Baysal M.A., Chakraborty A., Tsimberidou AM. Convergence of evolving artificial intelligence and machine learning techniques in precision oncology. NPJ Digit Med. 2025;8:75. doi: 10.1038/s41746-025-01471-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
47.Shipe M.E., Deppen S.A., Farjah F., Grogan EL. Developing prediction models for clinical use using logistic regression: an overview. J Thorac Dis. 2019;11:S574–s584. doi: 10.21037/jtd.2019.01.25. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

mmc1.docx^{(2.7MB, docx)}

[bib0001] 1.Boukheris H., Curtis R.E., Land C.E., Dores GM. Incidence of carcinoma of the major salivary glands according to the WHO classification, 1992 to 2006: a population-based study in the United States. Cancer Epidemiol Biomarkers Prev. 2009;18:2899–2906. doi: 10.1158/1055-9965.EPI-09-0638. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0002] 2.Kwon M., Lee J.I., Roh J.L., et al. Second cancer incidence and risk factors in patients with salivary gland cancers. JAMA Otolaryngol Head Neck Surg. 2014;140:118–123. doi: 10.1001/jamaoto.2013.6149. [DOI] [PubMed] [Google Scholar]

[bib0003] 3.Nobis C.P., Rohleder N.H., Wolff K.D., Wagenpfeil S., Scherer E.Q., Kesting MR. Head and neck salivary gland carcinomas–elective neck dissection, yes or no? J Oral Maxillofac Surg. 2014;72:205–210. doi: 10.1016/j.joms.2013.05.024. [DOI] [PubMed] [Google Scholar]

[bib0004] 4.Shi X., Liu X.K., An C.M., et al. Anatomic extent of lymph node metastases as an independent prognosticator in node-positive major salivary gland carcinoma: a study of the US SEER database and a Chinese multicenter cohort. Eur J Surg Oncol. 2019;45:2143–2150. doi: 10.1016/j.ejso.2019.06.029. [DOI] [PubMed] [Google Scholar]

[bib0005] 5.Ettl T., Gosau M., Brockhoff G., et al. Predictors of cervical lymph node metastasis in salivary gland cancer. Head Neck. 2014;36:517–523. doi: 10.1002/hed.23332. [DOI] [PubMed] [Google Scholar]

[bib0006] 6.Fang Q., Zhang X., Dai L., Luo R., Yuan J. Association between factor of parotid lymph node and prognosis in parotid cancer. Eur J Surg Oncol. 2023;49:1405–1410. doi: 10.1016/j.ejso.2023.04.002. [DOI] [PubMed] [Google Scholar]

[bib0007] 7.Han X., Yang C., Tan X., Li Y. The 8th AJCC classification is inferior to a new neck stage based on intraparotid lymph node in parotid gland cancer. BMC Oral Health. 2024;24:590. doi: 10.1186/s12903-024-04346-y. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0008] 8.Wang Y.L., Li D.S., Gan H.L., et al. Predictive index for lymph node management of major salivary gland cancer. Laryngoscope. 2012;122:1497–1506. doi: 10.1002/lary.23227. [DOI] [PubMed] [Google Scholar]

[bib0009] 9.Jinnin T., Kawata R., Higashino M., Nishikawa S., Terada T., Haginomori SI. Patterns of lymph node metastasis and the management of neck dissection for parotid carcinomas: a single-institute experience. Int J Clin Oncol. 2019;24:624–631. doi: 10.1007/s10147-019-01411-3. [DOI] [PubMed] [Google Scholar]

[bib0010] 10.Asarkar A., Chang B.A., Nathan C.O. When should elective neck dissection be performed for parotid gland malignancy? Laryngoscope. 2021;131:2164–2165. doi: 10.1002/lary.29263. [DOI] [PubMed] [Google Scholar]

[bib0011] 11.Xiao C.C., Zhan K.Y., White-Gilbertson S.J., Day TA. Predictors of nodal metastasis in parotid malignancies: a national cancer data base study of 22,653 patients. Otolaryngol Head Neck Surg. 2016;154:121–130. doi: 10.1177/0194599815607449. [DOI] [PubMed] [Google Scholar]

[bib0012] 12.Qian K., Sun W., Guo K., et al. The number and ratio of positive lymph nodes are independent prognostic factors for patients with major salivary gland cancer: results from the surveillance, epidemiology, and end results dataset. Eur J Surg Oncol. 2019;45:1025–1032. doi: 10.1016/j.ejso.2018.11.008. [DOI] [PubMed] [Google Scholar]

[bib0013] 13.Wang W., Yi S., An P.G., Zhang Z.Q., Lyu X.M., Zhang J. Patterns of lymph node metastasis and treatment outcomes of parotid gland malignancies. BMC Oral Health. 2025;25:286. doi: 10.1186/s12903-025-05561-x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0014] 14.Ren Y., Shi Y., Zhou Z., et al. Ultrasonographic and cytological characterization of ultrasound-guided fine-needle aspiration cytology of cervical lymph nodes for false-negative and false-positive diagnosis. Eur Arch Otorhinolaryngol. 2023;280:2463–2470. doi: 10.1007/s00405-022-07802-z. [DOI] [PubMed] [Google Scholar]

[bib0015] 15.Junn J.C., Soderlund K.A., Glastonbury CM. Imaging of head and neck cancer with CT, MRI, and US. Semin Nucl Med. 2021;51:3–12. doi: 10.1053/j.semnuclmed.2020.07.005. [DOI] [PubMed] [Google Scholar]

[bib0016] 16.Johnson F., Bozzato A., Mansour N., et al. Sonography of salivary gland tumors and disorders. Ultraschall Med. 2025;46:318–344. doi: 10.1055/a-2481-7248. [DOI] [PubMed] [Google Scholar]

[bib0017] 17.Deng Z., Liu X., Wu R., et al. Ultrasound-based radiomics machine learning models for diagnosing cervical lymph node metastasis in patients with non-small cell lung cancer: a multicentre study. BMC Cancer. 2024;24:536. doi: 10.1186/s12885-024-12306-6. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0018] 18.Samaranayake L., Tuygunov N., Schwendicke F., et al. The transformative role of artificial intelligence in dentistry: a comprehensive overview. part 1: fundamentals of AI, and its contemporary applications in dentistry. Int Dent J. 2025;75:383–396. doi: 10.1016/j.identj.2025.02.005. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0019] 19.Costantino A., Canali L., Festa B.M., Kim S.H., Spriano G., De Virgilio A. Development of machine learning models to predict lymph node metastases in major salivary gland cancers. Eur J Surg Oncol. 2023;49 doi: 10.1016/j.ejso.2023.06.017. [DOI] [PubMed] [Google Scholar]

[bib0020] 20.Mäkitie A.A., Alabi R.O., Ng S.P., et al. Artificial intelligence in head and neck cancer: a systematic review of systematic reviews. Adv Ther. 2023;40:3360–3380. doi: 10.1007/s12325-023-02527-9. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0021] 21.Mahmood H., Shaban M., Rajpoot N., Khurram SA. Artificial Intelligence-based methods in head and neck cancer diagnosis: an overview. Br J Cancer. 2021;124:1934–1940. doi: 10.1038/s41416-021-01386-x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0022] 22.Alabi R.O., Elmusrati M., Leivo I., Almangush A., Mäkitie AA. Artificial intelligence-driven radiomics in head and neck cancer: current status and future prospects. Int J Med Inform. 2024;188 doi: 10.1016/j.ijmedinf.2024.105464. [DOI] [PubMed] [Google Scholar]

[bib0023] 23.Qi Y.J., Su G.H., You C., et al. Radiomics in breast cancer: Current advances and future directions. Cell Rep Med. 2024;5 doi: 10.1016/j.xcrm.2024.101719. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0024] 24.Li Q., Jiang T., Zhang C., et al. A nomogram based on clinical information, conventional ultrasound and radiomics improves prediction of malignant parotid gland lesions. Cancer Lett. 2022;527:107–114. doi: 10.1016/j.canlet.2021.12.015. [DOI] [PubMed] [Google Scholar]

[bib0025] 25.Wang Z., Fang M., Zhang J., et al. Radiomics and deep learning in nasopharyngeal carcinoma: a review. IEEE Rev Biomed Eng. 2024;17:118–135. doi: 10.1109/RBME.2023.3269776. [DOI] [PubMed] [Google Scholar]

[bib0026] 26.Lin C.H., Yan J.L., Yap W.K., et al. Prognostic value of interim CT-based peritumoral and intratumoral radiomics in laryngeal and hypopharyngeal cancer patients undergoing definitive radiotherapy. Radiother Oncol. 2023;189 doi: 10.1016/j.radonc.2023.109938. [DOI] [PubMed] [Google Scholar]

[bib0027] 27.Wang W., Liang H., Zhang Z., et al. Comparing three-dimensional and two-dimensional deep-learning, radiomics, and fusion models for predicting occult lymph node metastasis in laryngeal squamous cell carcinoma based on CT imaging: a multicentre, retrospective, diagnostic study. EClinicalMedicine. 2024;67 doi: 10.1016/j.eclinm.2023.102385. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0028] 28.Lan T., Kuang S., Liang P., et al. MRI-based deep learning and radiomics for prediction of occult cervical lymph node metastasis and prognosis in early-stage oral and oropharyngeal squamous cell carcinoma: a diagnostic study. Int J Surg. 2024;110:4648–4659. doi: 10.1097/JS9.0000000000001578. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0029] 29.Zhou L.Q., Zeng S.E., Xu J.W., et al. Deep learning predicts cervical lymph node metastasis in clinically node-negative papillary thyroid carcinoma. Insights Imaging. 2023;14:222. doi: 10.1186/s13244-023-01550-2. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0030] 30.Toseef M., Olayemi Petinrin O., Wang F., et al. Deep transfer learning for clinical decision-making based on high-throughput data: comprehensive survey with benchmark results. Brief Bioinform. 2023;24 doi: 10.1093/bib/bbad254. [DOI] [PubMed] [Google Scholar]

[bib0031] 31.House J.W., Brackmann DE. Facial nerve grading system. Otolaryngol Head Neck Surg. 1985;93:146–147. doi: 10.1177/019459988509300202. [DOI] [PubMed] [Google Scholar]

[bib0032] 32.Adler D.D., Carson P.L., Rubin J.M., Quinn-Reid D. Doppler ultrasound color flow imaging in the study of breast cancer: preliminary findings. Ultrasound Med Biol. 1990;16:553–559. doi: 10.1016/0301-5629(90)90020-d. [DOI] [PubMed] [Google Scholar]

[bib0033] 33.Zhou J., Yin L., Wei X., et al. 2020 Chinese guidelines for ultrasound malignancy risk stratification of thyroid nodules: the C-TIRADS. Endocrine. 2020;70:256–279. doi: 10.1007/s12020-020-02441-y. [DOI] [PubMed] [Google Scholar]

[bib0034] 34.de Koekkoek-Doll P.K., Roberti S., van den Brekel M.W., et al. Value of assessing peripheral vascularization with micro-flow imaging, resistive index and absent hilum sign as predictor for malignancy in lymph nodes in head and neck squamous cell carcinoma. Cancers (Basel) 2021;13 doi: 10.3390/cancers13205071. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0035] 35.Vanacore A., Pellegrino MS. Robustness of κ -type coefficients for clinical agreement. Stat Med. 2022;41:1986–2004. doi: 10.1002/sim.9341. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0036] 36.Zwanenburg A., Vallières M., Abdalah M.A., et al. The image biomarker standardization initiative: standardized quantitative Radiomics for high-throughput image-based phenotyping. Radiology. 2020;295:328–338. doi: 10.1148/radiol.2020191145. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0037] 37.Stodulski D., Mikaszewski B., Majewska H., Wiśniewski P., Stankiewicz C. Probability and pattern of occult cervical lymph node metastases in primary parotid carcinoma. Eur Arch Otorhinolaryngol. 2017;274:1659–1664. doi: 10.1007/s00405-016-4407-5. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0038] 38.Voora R.S., Panuganti B., Califano J., Coffey C., Guo T. Patterns of lymph node metastasis in parotid cancer and implications for extent of neck dissection. Otolaryngol Head Neck Surg. 2023;168:1067–1078. doi: 10.1002/ohn.167. [DOI] [PubMed] [Google Scholar]

[bib0039] 39.Vedula S., Shah Y.S., Barinsky G.L., Baredes S., Park RCW. Utility of elective neck dissection in clinically node-negative parotid malignancy. Otolaryngol Head Neck Surg. 2023;169:917–927. doi: 10.1002/ohn.264. [DOI] [PubMed] [Google Scholar]

[bib0040] 40.Kucharska E., Rzepakowska A., Żurek M., et al. Oncologic outcomes of the most prevalent major salivary gland cancers: retrospective cohort study from single center. Eur Arch Otorhinolaryngol. 2024;281:4305–4313. doi: 10.1007/s00405-024-08650-9. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0041] 41.Choi S.Y., Lee E., Kim E., et al. Clinical outcomes of bulky parotid gland cancers: need for self-examination and screening program for early diagnosis of parotid tumors. BMC Cancer. 2021;21:178. doi: 10.1186/s12885-021-07902-9. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0042] 42.Bae M.S., Shin S.U., Song S.E., Ryu H.S., Han W., Moon WK. Association between US features of primary tumor and axillary lymph node metastasis in patients with clinical T1-T2N0 breast cancer. Acta Radiol. 2018;59:402–408. doi: 10.1177/0284185117723039. [DOI] [PubMed] [Google Scholar]

[bib0043] 43.Brandwein M.S., Ivanov K., Wallace D.I., et al. Mucoepidermoid carcinoma: a clinicopathologic study of 80 patients with special reference to histological grading. Am J Surg Pathol. 2001;25:835–845. doi: 10.1097/00000478-200107000-00001. [DOI] [PubMed] [Google Scholar]

[bib0044] 44.Gao P., Ren G., Liang J., Liu J. STAT6 upregulates NRP1 expression in endothelial cells and promotes angiogenesis. Front Oncol. 2022;12 doi: 10.3389/fonc.2022.823377. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0045] 45.Muthukrishnan R., Rohini R. LASSO: A feature selection technique in predictive modeling for machine learning. 2016 IEEE International Conference on Advances in Computer Applications (ICACA) 2016:18–20. [Google Scholar]

[bib0046] 46.Fountzilas E., Pearce T., Baysal M.A., Chakraborty A., Tsimberidou AM. Convergence of evolving artificial intelligence and machine learning techniques in precision oncology. NPJ Digit Med. 2025;8:75. doi: 10.1038/s41746-025-01471-y. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0047] 47.Shipe M.E., Deppen S.A., Farjah F., Grogan EL. Developing prediction models for clinical use using logistic regression: an overview. J Thorac Dis. 2019;11:S574–s584. doi: 10.21037/jtd.2019.01.25. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Ultrasound-Based Deep Learning Radiomics to Predict Cervical Lymph Node Metastasis in Major Salivary Gland Carcinomas

Huan-Zhong Su

Long-Cheng Hong

Zhi-Yong Li

Qiao-Min Fu

Yu-Hui Wu

Shao-Feng Wu

Zuo-Bing Zhang

Dao-Hui Yang

Xiao-Dong Zhang

Abstract

Introduction and Aims

Methods

Results

Conclusion

Clinical Relevance

Introduction

Materials and methods

Patients

Fig. 1.

US assessment

Interobserver agreement assessment of US features

Image segmentation and feature extraction

Fusion strategy

Feature selection and construction of radiomics and DL scores

Model construction and evaluation

Statistical analysis

Results

Histological types of patients with MSGCs

Clinical and US characteristics

Table 1.

Feature extraction and selection

Radiomics and DL scores construction

Fig. 2.

Fig. 3.

Model construction and evaluation

Fig. 4.

Fig. 5.

Fig. 6.

Discussion

Conclusion

Authorship contribution statement

Ethical approval

Informed consent

Funding

Conflict of interest

Footnotes

Contributor Information

Appendix. Supplementary materials

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases