Abstract
Occult nodal metastasis (ONM) plays a significant role in comprehensive treatments of non-small cell lung cancer (NSCLC). This study aims to develop a deep learning signature based on positron emission tomography/computed tomography to predict ONM of clinical stage N0 NSCLC. An internal cohort (n = 1911) is included to construct the deep learning nodal metastasis signature (DLNMS). Subsequently, an external cohort (n = 355) and a prospective cohort (n = 999) are utilized to fully validate the predictive performances of the DLNMS. Here, we show areas under the receiver operating characteristic curve of the DLNMS for occult N1 prediction are 0.958, 0.879 and 0.914 in the validation set, external cohort and prospective cohort, respectively, and for occult N2 prediction are 0.942, 0.875 and 0.919, respectively, which are significantly better than the single-modal deep learning models, clinical model and physicians. This study demonstrates that the DLNMS harbors the potential to predict ONM of clinical stage N0 NSCLC.
Subject terms: Predictive markers, Risk factors
Occult node metastasis is a key staging component of non-small cell lung cancer. Here, the authors use deep learning to improve diagnosis of lymph node metastasis from PET and CT radiomics.
Introduction
In the era of molecular imaging, positron emission tomography/computed tomography (PET/CT), which concurrently characterizes metabolic and anatomic representations about lesions, has emerged as the most dependable non-invasive modality for clinical N staging of non-small cell lung cancer (NSCLC)1. However, despite the tremendous advances in staging modality, there are still 12.9%–39.3%2–4 of lymph nodal metastasis that are not identified by this state-of-the-art procedure and instead are unexpectedly recognized during surgery, which is defined as occult nodal metastasis (ONM).
Lymph node staging including N1 and N2 status plays a crucial role throughout the whole process of management for NSCLC. Hence, accurately recognizing ONM is critical in determining the optimal therapeutic strategies for patients with NSCLC. In a presurgical setting, nodal biopsy remains the gold-standard reference for defining the N stage of NSCLC. The routine adoption of this procedure, however, increases the risk of overdiagnosis, which is attributable to its invasive nature, and potentially leads to missed diagnosis considering the diagnostic pitfalls for N1 stations5–7. Accordingly, it is necessary to obtain the pretest probability of ONM to equipoise the superiority and inferiority of this dual-nature procedure.
In terms of surgical decisions, substantial evidence has emerged that sublobectomy and limited nodal dissection (LND), which preserves more of the lung parenchyma, could deliver comparable oncological efficacy to conventional lobectomy and systematic nodal dissection (SND) in early-stage NSCLC. However, tumors with nodal metastasis harbor a more aggressive behavior and greater malignancy burden, making that sublobectomy and LND insufficient. Therefore, lobectomy and SND should be conducted to ensure the adequacy of surgical margins and radicality of nodal removals8–10.
In a postsurgical setting, the benefits of adjuvant therapy in early-stage NSCLC has been passionately debated11,12. The occurrence of nodal involvement heralds a more guarded prognosis13 and thereby calls for more aggressive treatments10. For NSCLC with nodal metastasis, surgery alone cannot confer sufficient oncological efficacy, and adjuvant therapy, capable of eradicating the residual mircometastasis, has been demonstrated to provide additional survival benefits14–19. Therefore, it is of paramount importance to develop a robust instrument for ONM prediction to recognize candidates for nodal biopsy, lobectomy, SND and adjuvant therapy in clinical stage N0 NSCLC.
The deep learning technology which allows the high-dimensional quantification of radiological images and greater extraction of detailed characterizations than the human vision, has been proposed as a revolutionary approach for disease diagnoses, prognosis evaluations, and therapeutic decisions20–22. PET/CT, which is capable of capturing the anatomic and metabolic representations of tumors23, has been leveraged as a dependable imaging modality to characterize malignancy grade and metastasis burden24,25. Its multimodal nature, on the one hand increases the feature dimensions and information abundance, but on the other hand poses a higher requirement for the deep learning algorithm.
With the development of multimodal algorithms, the current deep learning technology has evolved to be an effective method for PET/CT image analyzing26,27, which harbors the capability of taking full advantages of the complementary information of PET and CT modalities. It has been demonstrated that multimodal deep learning algorithms shown potentials in cancer identification28, tumor segmentation29,30, and risk quantification31 based on PET/CT imaging. Despite these tremendous breakthroughs, the application of PET/CT based deep learning for ONM prediction of lung cancer is limited. We hypothesize that cross-modal dominance complementation based on PET and CT imaging is capable of quantifying ONM probability to support the comprehensive treatments of clinical N0 NSCLC, and the captured ONM risks would be associated with histologic, genetic, and microenvironment behaviors.
Therefore, this study aims to combine PET and CT radiomics to construct a deep learning nodal metastasis signature (DLNMS) to predict ONM and personalize comprehensive treatments of clinical N0 NSCLC, and tentatively explore the underlying biologic basis of DLNMS, based on a large multicenter population.
Results
Study design and baseline information
The study design is described in Fig. 1. The baseline characteristics of the internal cohort, external cohort and prospective cohort are detailed in Table 1. The mean age of the entire cohort was 60.00 years and 48.61% (n = 1587) of the population were male. There were 2776 (85.02%) adenocarcinomas and 340 (10.41%) squamous cell carcinomas. The maximum standard uptake value (SUVmax), metabolic tumor volume (MTV), total lesion glycolysis (TLG) of the primary tumors were 5.43, 10.13 and 37.74, respectively. With respect to N status, 11.64% (n = 380) and 8.42% (n = 275) of patients were diagnosed as occult N1 and N2 diseases. In addition, compared to the internal cohort, patients in the external cohort were associated with significantly and older age (61.78 years versus 59.42 years, p < 0.001) and patients in the prospective cohort yielded an older age (60.46 years versus 59.42 years, p = 0.005), higher SUVmax of primary tumor (5.67 versus 5.25, p = 0.022) and larger tumor size (2.64 cm versus 2.53 cm, p = 0.030).
Table 1.
Characteristics | Entire | Internal cohort | External cohort | Prospective cohort | p1 value | p2 value |
---|---|---|---|---|---|---|
(n = 3265) | (n = 1911) | (n = 355) | (n = 999) | |||
Age (years) | ||||||
>65, n (%) | 991 (30.35) | 524 (27.42) | 133 (37.46) | 334 (33.43) | <0.001 | 0.001 |
≤65, n (%) | 2274 (69.65) | 1387 (72.58) | 222 (62.54) | 665 (66.57) | ||
Mean ± SD | 60.00 ± 9.31 | 59.42 ± 9.31 | 61.78 ± 8.71 | 60.46 ± 9.42 | <0.001 | 0.005 |
Sex, n (%) | 0.949 | 0.985 | ||||
Male | 1587 (48.61) | 917 (47.99) | 171 (48.17) | 479 (47.95) | ||
Female | 1678 (51.39) | 994 (52.01) | 184 (51.83) | 520 (52.05) | ||
Smoking, n (%) | 0.986 | 0.858 | ||||
Ever | 491 (15.04) | 286 (14.97) | 53 (14.93) | 152 (15.22) | ||
Never | 2774 (84.96) | 1625 (85.03) | 302 (85.07) | 847 (84.78) | ||
Radiologic type, n (%) | 0.166 | 0.071 | ||||
Pure solid | 1860 (56.97) | 1060 (55.47) | 211 (59.44) | 589 (58.96) | ||
Subsolid | 1405 (43.03) | 851 (44.53) | 144 (40.56) | 410 (41.04) | ||
PET parameters | ||||||
SUVmax mean ± SD | 5.43 ± 4.74 | 5.25 ± 4.69 | 5.72 ± 5.21 | 5.67 ± 4.66 | 0.086 | 0.022 |
MTV, mean ± SD | 10.13 ± 15.62 | 9.84 ± 15.40 | 10.53 ± 12.36 | 10.53 ± 17.03 | 0.427 | 0.270 |
TLG, mean ± SD | 37.74 ± 160.35 | 37.07 ± 153.29 | 37.73 ± 91.45 | 42.30 ± 190.10 | 0.938 | 0.422 |
Surgery procedure, n (%) | 0.993 | 0.852 | ||||
Sublobectomy | 184 (5.64) | 105 (5.49) | 19 (5.35) | 60 (6.01) | ||
Lobectomy | 3045 (93.26) | 1785 (93.41) | 332 (93.52) | 928 (92.89) | ||
Pneumonectomy | 36 (1.10) | 21 (1.10) | 4 (1.13) | 11 (1.10) | ||
Location, n (%) | ||||||
Left | 1426 (43.68) | 803 (42.02) | 166 (46.76) | 457 (45.7) | 0.097 | 0.054 |
Right | 1839 (56.32) | 1108 (57.98) | 189 (53.24) | 542 (54.3) | ||
Central | 566 (17.34) | 346 (18.10) | 54 (15.20) | 166 (16.62) | 0.189 | 0.317 |
Peripheral | 2699 (82.66) | 1565 (81.90) | 301 (84.80) | 833 (83.38) | ||
Radiological size (cm), mean ± SD | 2.58 ± 1.26 | 2.53 ± 1.22 | 2.66 ± 1.39 | 2.64 ± 1.29 | 0.068 | 0.030 |
N1 involvement, n (%) | 0.807 | 0.930 | ||||
Yes | 380 (11.64) | 224 (11.72) | 40 (11.27) | 116 (11.61) | ||
No | 2885 (88.36) | 1687 (88.28) | 315 (88.73) | 883 (83.39) | ||
N2 involvement, n (%) | 0.291 | 0.819 | ||||
Yes | 275 (8.42) | 156 (8.20) | 35 (9.90) | 84 (8.41) | ||
No | 2990 (91.58) | 1755 (91.80) | 320 (90.10) | 915 (91.59) | ||
Pathological type, n (%) | 0.764 | 0.566 | ||||
Adenocarcinoma | 2776 (85.02) | 1633 (85.45) | 302 (85.07) | 841 (84.18) | ||
Squamous cell carcinoma | 340 (10.41) | 197 (10.31) | 35 (9.86) | 108 (10.81) | ||
Others | 149 (4.56) | 81 (4.24) | 18 (5.07) | 50 (5.01) |
PET, positron emission tomography; SUV, standard uptake value; MTV, metabolic tumor volume; TLG, total lesion glycolysis; SD, standard deviation; p1 value for comparing the internal cohort with the external cohort; p2 value for comparing the internal cohort with the prospective cohort; categorical variables were analyzed by Pearson χ2 test and Fisher exact test, continuous variables were compared by Student t-test and Mann-Whitney U test.
Variables associated with ONM
As displayed in Table 2, in the training set, a younger age (odds ratio [OR]: 0.967, 95% confidence interval [CI]: [0.951, 0.984], adjusted p < 0.001), pure solid type (OR: 2.525, 95% CI: [1.638, 3.891], adjusted p < 0.001), left location (OR: 1.512, 95% CI: [1.088, 2.100], adjusted p = 0.023), and central location (OR: 1.743, 95% CI: [1.202, 2.530], adjusted p = 0.007) were identified as independent predictors for occult N1 metastasis, and the pure solid type (OR: 3.389, 95% CI: [1.999, 5.745], adjusted p < 0.001) was independently related to occult N2 involvement. Most variables remained predictive for patients in the validation set, external cohort and prospective cohort (Supplementary Table 1). In addition, after incorporation of the DLNMS into analyses (Supplementary Table 2 & 3), the DLNMS was revealed as independent predictors for both occult N1 and N2 involvements.
Table 2.
Variables | Occult N1 | Occult N2 | |||||||
---|---|---|---|---|---|---|---|---|---|
Univariable | Multivariable | Univariable | Multivariable | ||||||
OR (95% CI) | p value | OR (95% CI) | adjusted p value | OR (95% CI) | p value | OR (95% CI) | adjusted p value | ||
Age | 0.979 (0.963-0.994) | 0.008 | 0.967 (0.951-0.984) | <0.001 | 0.992 (0.973-1.012) | 0.445 | |||
Sex (Male) | 1.929 (1.404-2.652) | <0.001 | 1.403 (1.001-1.990) | 0.066 | 1.463 (1.008-2.125) | 0.046 | 0.996 (0.670-1.482) | 0.985 | |
Smoking history (Ever) | 1.152 (0.824-1.470) | 0.855 | 1.301 (0.878-1.353) | 0.765 | |||||
Radiological type (Solid) | 4.005 (2.718-5.903) | <0.001 | 2.525 (1.638-3.891) | <0.001 | 4.231 (2.614-6.848) | <0.001 | 3.389 (1.999-5.745) | <0.001 | |
Location (Left) | 1.429 (1.048-1.949) | 0.024 | 1.512 (1.088-2.100) | 0.023 | 1.249 (0.862-1.810) | 0.241 | |||
Location (Central) | 2.998 (2.146-4.188) | <0.001 | 1.743 (1.202-2.530) | 0.007 | 1.936 (1.282-2.924) | 0.002 | 1.240 (0.794-1.935) | 0.430 | |
Radiological size | 1.385 (1.239-1.548) | <0.001 | 1.146 (1.001-1.313) | 0.057 | 1.269 (1.124-1.433) | <0.001 | 1.144 (1.084-1.330) | 0.100 | |
SUVmax | 1.127 (1.095-1.160) | <0.001 | 1.045 (0.805-1.356) | 0.741 | 1.094 (1.060-1.129) | <0.001 | 0.862 (0.657-1.131) | 0.473 | |
MTV | 1.001 (0.993-1.010) | 0.746 | 0.999 (0.986-1.011) | 0.845 | |||||
TLG | 1.001 (1.000-1.002) | 0.105 | 1.000 (1.000-1.001) | 0.242 |
DLNMS, deep learning nodal metastasis signature; SUV, standard uptake value; MTV, metabolic tumor volume; TLG, total lesion glycolysis; HR, hazard ratio: CI, confidence interval; p values of multivariable analyses were corrected by the Benjamini and Hochberg method.
Predictive performance of DLNMS
With an increase of DLNMS scores, more cases with occult N1 and N2 tumors were observed in the validation set (Supplementary Fig. 1A &B), external cohort (Supplementary Fig. 1C & D) and prospective cohort (Supplementary Fig. 1E & F). In addition, the DLMNS was represented by conventional PET and CT texture features in ONM prediction, implying the significant correlations between the DLNMS and PET/CT texture features (Fig. 2).
As illustrated in Fig. 3A and B, Table 3 and Supplementary Fig. 2, in the validation set, the abilities of the DLNMS to predict occult N1 and N2 diseases were shown to have areas under the receive operating characteristic curve (AUROCs) of 0.958 (95% CI: [0.923, 0.992]) and 0.942 (95% CI: [0.911, 0.973]), respectively, which were significantly better than 0.873 (95% CI: [0.835, 0.911]) and 0.761 (95% CI: [0.680, 0.842]) of the PET model, 0.913 (95% CI: [0.875, 0.952]) and 0.887 (95% CI: [0.823, 0.952]) of the CT model, 0.752 (95% CI: [0.685, 0.819]) and 0.690 (95% CI: [0.603, 0.776]) of the clinical model, 0.612 (95% CI: [0.536, 0.689]) and 0.672 (95% CI: [0.574, 0.771]) of the senior physicians, and 0.616 (95% CI: [0.544, 0.687]) and 0.556 (95% CI: [0.465, 0.647]) of the junior physicians (DeLong’s test: all p < 0.05). The areas under the precision-recall curve (AUPRC), sensitivity, specificity, positive predictive value (PPV), positive predictive value (NPV) and accuracy of the DLNMS for predicting occult N1 and N2 metastasis were 0.882, 0.898, 0.928, 0.647, 0.984 and 0.924, and 0.876, 0.897, 0.842, 0.317, 0.990, and 0.846, respectively.
Table 3.
Models | Occult N1 prediction | Occult N2 prediction | |||||
---|---|---|---|---|---|---|---|
Validation set | External cohort | Prospective cohort | Validation set | External cohort | Prospective cohort | Biopsy cohort | |
DLNMS | 0.882 | 0.853 | 0.871 | 0.876 | 0.849 | 0.863 | 0.857 |
PET model | 0.756 | 0.731 | 0.748 | 0.753 | 0.710 | 0.741 | 0.752 |
CT model | 0.779 | 0.751 | 0.764 | 0.765 | 0.746 | 0.755 | 0.761 |
Clinical model | 0.656 | 0.612 | 0.627 | 0.694 | 0.635 | 0.648 | 0.695 |
Senior physicians | 0.504 | 0.562 | 0.538 | 0.563 | 0.583 | 0.569 | 0.550 |
Junior physicians | 0.582 | 0.590 | 0.599 | 0.514 | 0.555 | 0.534 | 0.514 |
DLNMS deep learning nodal metastasis signature; PET positron emission tomography; CT computed tomography.
In the external cohort (Fig. 3C, D), the DLNMS achieved AUROCs of 0.879 (95% CI: [0.813, 0.946]) and 0.875 (95% CI: [0.820, 0.930]) in predicting occult N1 and N2 metastasis, respectively, and were significantly superior than the PET model (0.790, 95% CI: [0.733, 0.847] and 0.727, 95% CI: [0.649, 0.805]), the CT model (0.826, 95% CI: [0.747, 0.905] and 0.817, 95% CI: [0.748, 0.887]), the clinical model (0.722, 95% CI: [0.642, 0.802] and 0.723, 95% CI: [0.648, 0.797]), the senior physicians (0.676, 95% CI: [0.590, 0.763] and 0.645, 95% CI: [0.554, 0.735]), and the junior physicians (0.633, 95% CI: [0.548, 0.719] and 0.594, 95% CI: [0.503, 0.685]) (DeLong’s test: all p < 0.05). In addition, the AUPRC, sensitivity, specificity, PPV, NPV and accuracy of the DLNMS for predicting occult N1 and N2 metastasis were 0.853, 0.700, 0.905 0.483, 0.960 and 0.882, and 0.849, 0.857, 0.813, 0.333, 0.981, and 0.817, respectively.
In the prospective cohort (Fig. 3E, F), the DLNMS achieved AUROCs of 0.914 (95% CI: [0.877, 0.949]) and 0.919 (95% CI: [0.886, 0.942]) in discriminating occult N1 and N2 involvements, and were evidently better than the PET model (0.796, 95% CI: [0.751, 0.841] and 0.712, 95% CI: [0.656, 0.768]), the CT model (0.828, 95% CI: [0.777, 0.879] and 0.835, 95% CI: [0.779, 0.891]), the clinical model (0.749, 95% CI: [0.708, 0.791] and 0.675, 95% CI: [0.629, 0.721]), the senior physicians (0.672, 95% CI: [0.623, 0.722] and 0.670, 95% CI: [0.613, 0.723]), and the junior physicians (0.645, 95% CI: [0.596, 0.693] and 0.635, 95% CI: [0.580, 0.691]) (DeLong’s test: all p < 0.05). Additionally, the AUPRC, sensitivity, specificity, PPV, NPV and accuracy of the DLNMS for occult N1 and N2 prediction were 0.871, 0.793, 0.926 0.586, 0.971 and 0.911, and 0.863, 0.833, 0.828, 0.308, 0.982, and 0.829, respectively.
In subgroup analyses regarding pathological types for patients in the validation set, external cohort and prospective cohort, the DLNMS achieved AUROCs of 0.916 (95% CI: [0.885, 0.947]) and 0.934 (95% CI: [0.915, 0.953]) in adenocarcinoma population for occult N1 and N2 prediction, respectively. Additionally, for squamous cell carcinoma population, the DLNMS yielded AUROCs of 0.904 (95% CI: [0.842, 0.966]) and 0.858 (95% CI: [0.779, 0.937]) for occult N1 and N2 prediction, respectively (Fig. 3G, H).
For patients in the validation set, external cohort and prospective cohort, the DLNMS could correct 38.30% occult N1, 73.11% benign N1, 78.13% occult N2, and 53.04% benign N2 diseases in those incorrectly diagnosed by the PET model (Supplementary Fig. 3A & B). Similarly, for those incorrectly predicted by the CT model, the DLNMS could correct 35.42% occult N1, 67.06% benign N1, 93.80% occult N2, and 41.18% benign N2 diseases (Supplementary Fig. 3C, D).
The calibration curves revealed that the DLNMS yielded good performances (Supplementary Fig. 4). Furthermore, we evaluated the clinical usefulness of the DLNMS compared to single-modal models for ONM detection via decision curve analyses, indicating that the DLNMS achieved better net benefits than other models no matter for occult N1 or N2 prediction (Supplementary Fig. 5). As summarized in Supplementary Table 4, the positive values of integrated discrimination improvements (all adjusted p < 0.05) and net reclassification index (all adjusted p < 0.05) for occult N1 and N2 predictions could be achieved when comparing the DLNMS to single-modal models.
Decision support for nodal biopsy
For 366 patients receiving nodal biopsy (Supplementary Table 5), the DLNMS yielded an AUROC of 0.853 (95% CI: [0.812, 0.895]) for predicting occult N2 diseases, which was significantly better than the PET model (0.644, 95% CI: [0.573, 0.715]), the CT model (0.780, 95% CI: [0.718, 0.841]), the clinical model (0.543, 95% CI: [0.471, 0.715]), the senior physicians (0.621, 95% CI: [0.554, 0.688]), and the junior physicians (0.525, 95% CI: [0.457, 0.594]). The AUPRC, sensitivity, specificity, PPV, NPV and accuracy of the DLNMS were 0.857, 0.919, 0.699, 0.436, 0.971 and 0.743, respectively (Fig. 4A & Table 3). In addition, with an increase in the DLNMS scores, more patients with occult N2 tumors were observed in the nodal biopsy cohort (Fig. 4B). Moreover, the DLNMS could correct 79.13% occult N2 and 56.41% benign N2 diseases in patients incorrectly diagnosed by the PET model (Fig. 4C). Similarly, for those incorrectly predicted by the CT model, the DLNMS could correct 100% occult N2 and 41.50% benign N2 diseases (Fig. 4D).
Decision support for surgical treatment
Survival analyses revealed that both N1 and N2 cutoff values could significantly stratify the prognosis of patients in the validation set and external cohort (Supplementary Fig. 6). In addition, patients with clinical stage I NSCLC (including patients receiving LND) were divided into low-risk (N1 score <0.362 and N2 score <0.356) and high-risk (N1 score > 0.362 or N2 score > 0.356) groups. The baseline characteristics of 654 clinical stage I patients receiving LND are provided in Supplementary Table 6. As illustrated in Fig. 5, for the low-risk population (Fig. 5A-D), sublobectomy did not compromise oncological results to lobectomy (3-year overall survival [OS]: 98.1% versus 97.4%, p = 0.458; 3-year recurrence-free survival [RFS]: 90.0% versus 90.6%, p = 0.749), and LND could achieve similar survival outcomes to SND (3-year OS: 98.1% versus 97.3%, p = 0.428; 3-year RFS: 90.4% versus 93.0%, p = 0.965). In contrast, for the high-risk population (Fig. 5E–H), patients receiving lobectomy yielded improved prognosis compared to those with sublobectomy (3-year OS: 90.9% versus 80.9%, p = 0.011; 3-year RFS: 79.0% versus 59.0%, p < 0.001) and SND conferred superior prognosis to LND (3-year OS: 91.7% versus 81.7%, p = 0.008; 3-year RFS: 79.2% versus 62.8%, p = 0.001).
Decision support for adjuvant therapy
As illustrated in Fig. 6, for patients diagnosed as pathological stage I NSCLC (including patients receiving LND), those without postoperative adjuvant therapy achieved comparable prognosis to those with postoperative adjuvant therapy in the low-risk group (3-year OS: 98.0% versus 97.5%, p = 0.581; 3-year RFS: 91.3% versus 89.3%, p = 0.323) (Fig. 6A & B). Conversely, in the high-risk group (Fig. 6C & D), patients receiving postoperative adjuvant therapy conferred significantly superior oncological results than those without postoperative adjuvant therapy (3-year OS: 95.9% versus 86.2%, p = 0.034; 3-year RFS: 90.5% versus 76.1%, p = 0.012).
Biologic basis of DLNMS
Both higher N1 and N2 scores were significantly related to the presence of aggressive histologic patterns including lymphovascular invasion (LVI), visceral pleural invasion (VPI), tumor spread through air space (STAS), micropapillary component, and solid component (all p < 0.001) (Fig. 7A, B). In addition, among patients with available data for common gene alternations, patients with high N1 scores were significantly relevant to the higher frequency of BRAF mutation (p < 0.001) and larger proportion of AKL mutation (p = 0.004) (Fig. 7C). Patients with high N2 scores yielded a significantly lower mutation rate of EGFR (p < 0.001) (Fig. 7D). In the gene set enrichment analysis (GSEA) and single sample gene set enrichment analysis (ssGSEA) analysis (Fig. 7E–G), pathways related to tumors proliferation such as signaling by GPCR, NTRKs and WNT in cancer were significantly unregulated in patients with high N1 and N2 scores. Finally, in the analyses of the tumor microenvironments, tumors with high N1 scores showed more infiltrations of central memory CD4 T cells, mast cells and plasmacytoid dendritic cells. High N2 scores were significantly associated with greater proportions of central memory CD4 T cells and central memory CD8 T cells (Fig. 7H).
Discussion
Preoperative nodal staging is a critical determinant for individualized treatments of patients with NSCLC10. For clinical stage N0 NSCLC, the occurrence of ONM would reduce the theoretical benefits of the initial treatments, therefore inadvertently excluding patients from optimal therapeutic strategies. In this regard, obtaining an accurate pretest probability of ONM prior to treatments is of paramount importance. The current study managed to develop a cross-modal deep learning signature based on PET/CT images. The proposed DLNMS achieved AUROCs of 0.958, 0.879 and 0.914 for occult N1 prediction, and 0.942, 0.875 and 0.919 for occult N2 prediction, in the validation set, external cohort and prospective cohort, respectively. Moreover, high-risk patients defined by the DLNMS could benefit from nodal biopsy, lobectomy, SND and adjuvant therapy.
In clinical practice, clinical physicians mainly rely on certain clinical characteristics especially imaging features to capture the ONM risks of clinical stage N0 NSCLC. Evidences have emerged that metabolic and morphologic parameters on PET/CT, such as tumor size, central location, consolidation ratio, and metabolic value might provide efficient clues for ONM diseases32–35. Nevertheless, this subjective evaluation yields low AUROCs of 0.525-0.676 due to heterogenous experiences among physicians, and is incapable of comprehensively estimating the probability of ONM, so as to convey a direct implication to the management strategy for a given patient. The triumph of individually quantifying ONM risks based on predictive models represented a crucial step. Predictive rules integrating clinical variables could calculate the probability of ONM involvement in clinical N0 NSCLC. However, in spite of their higher accuracies than clinical physicians, these clinical models were far from meeting clinical requirements, resulting in AUROCs of 0.700-0.75636–38, which was also observed by the current study, our clinical model only yielded AUROCs of 0.675-0.794 for ONM identification. As such, more valuable radiographic features for predicting ONM should be investigated to achieve clinical utility.
Radiomics, which allows quantitative extraction of high-dimensional radiological features, has provided a promising approach for more accurate evaluation of the lymph node status of lung cancer. Several studies have been successful in recognizing ONM in early-stage NSCLC utilizing radiomics phenotypes, which yielded AUROC values of 0.808 to 0.82039–41. Despite such inspiring success, the above radiomics studies were limited in the CT modal, and the added value of PET radiomics features for ONM prediction of NSCLC are still ambiguous. With the development of multimodal algorithms, the deep learning approach has been applied to analyze PET/CT imaging26–31. Based on the main advancements of deep learning technology, multimodal fusion primarily involved three strategies: input-level concatenation42,43, feature-level combination44, and output-level average45. Our preliminary experiments investigated multiple deep learning architectures and fusion strategies, revealing feature-level fusion based on the ResNet 1846 backbone yield better efficiencies and was finally utilized to generate our DLNMS. The current study demonstrated that the cross-modal DLNMS incorporating PET and CT radiomics features achieved AUROCs of 0.875-0.958, make it superior to single-modal models based on PET or CT alone for ONM prediction.
In the domain of machine learning, one issue worth mentioning is the method for performance evaluation. On an imbalanced dataset with a low proportion of positive classifications, the PR curve might be more effective than the ROC curve in quantifying positive discriminative ability47,48. However, what needs to be emphasized is that the PR curve only focuses on the efficiency to identify diseased cases but ignores those correctly predicted healthy cases49. Different from conventional classification tasks, ONM recognition would pose a direct impact on treatment decisions, which emphasizes model’s discriminative abilities for both positive and negative subjects. If a patient diagnosed as healthy actually is ONM disease (false negative), this patient would directly lose the opportunity of receiving optimal treatments. The AUPRC is a summary indicator comprehensively quantifying the positive and negative predictive capabilities50, we therefore chose the Youden Index based on ROC curves to determine the cutoff values of DLNMS.
Whether a radiomics signature can be introduced into the clinical workflow to optimize the treatment decision is the benchmark for demonstrating its clinical utility. Distinguished from other radiomics studies that are limited in model constructions and efficiency evaluations, the current study took a further step to elucidate the potential application scenarios of the proposed DLNMS. In a presurgical setting, nodal biopsy serves as the gold-standard reference for N2 staging, but concurrently suffers from its invasive nature, thus emphasizing the necessity of equipoising the superiority and inferiority of this dual-nature procedure to individualize the N2 staging of NSCLC5–7. The DLNMS maintained efficiencies in the nodal biopsy population, therefore sparing patients with low N2 scores from this invasive procedure and ensuring that patients with high N2 scores receive nodal biopsy for adequate N2 staging. Additionally, for surgical decisions, sublobectomy and LND, with more lung preserves than conventional lobectomy and SND, have been increasingly adopted in the surgical treatment of clinical stage I NSCLC. However, if ONM occurs, lobectomy and SND are more appropriate choices8–10. Our results demonstrated that sublobectomy and LND were effective for patients with low ONM risks, while lobectomy and SND were mandatory in patients with high ONM risks to achieve the oncological radicality. Finally, in a postsurgical setting, adjuvant therapy eradicates the residual micrometastastic disease, but simultaneously has significant side effects, thus calling for appropriate patient selection to identify candidates for this double-edged sword11,12. Based on our results, patients with low risks would not benefit from adjuvant therapy. In contrast, adjuvant therapy conferred survival superiority in patients with high risks.
Several limitations of this study should be acknowledged. Firstly, as a retrospective study, selection bias was inevitable, despite the inclusion of a prospective cohort for validation, and whether our findings are applicable to other territories remains unknown. To be confirmed, an international clinical trial is required. Secondly, the main histology of included cases were adenocarcinomas, and different histologies are represented by discrepant radiological phenotypes and tumor aggressiveness, contributing to their heterogeneity in the metastasis nature. Thus, a future study with adequate sample sizes in histologic subgroups should be conducted to validate the efficiency of the DLNMS. Thirdly, high-resolution CT findings is necessary to analyze the subtle images, however, not all PET/CT equipment harbor the capability of outputting such high-quality images, which might reduce the clinical applicability of the DLNMS in certain institutions. Finally, the main limitation of the current deep learning technique regarding medical imaging analyses is that, its black-box setting has the problem of interpretability. Despite our exploration of the biologic basis of the DLNMS, its working rationale was ambiguous and the predictive features were nameless. Therefore, studies deciphering the opaqueness of deep learning features in future is warranted.
In conclusion, the developed DLNMS is reliable in predicting ONM of clinical stage N0 NSCLC. Furthermore, the DLNMS has potentials for guiding individualized decisions for nodal biopsy, surgery and adjuvant therapy in clinical stage N0 NSCLC.
Methods
Study design and participants
This study was implemented under the approval of the Institutional Review Board of Shanghai Pulmonary Hospital, The First Affiliated Hospital of Nanchang University, Affiliated Hospital of Zunyi Medical College and Ningbo HwaMei Hospital. Written informed consent was waived for the internal and external cohorts and acquired for the prospective cohort. The study design is described in Fig. 1. The DLNMS was developed using an internal cohort (entire: n = 1911, occult N1 proportion = 11.64%, occult N2 proportion = 8.42%; training: n = 1528, occult N1 proportion = 11.45%, occult N2 proportion = 8.31%; validation: n = 383, occult N1 proportion = 12.79%, occult N2 proportion = 7.57%). Subsequently, a multicenter external cohort (n = 355, occult N1 proportion = 11.27%, occult N2 proportion = 9.90%) and a multicenter prospective cohort (n = 999, occult N1 proportion = 11.64%, occult N2 proportion = 8.41%; ClinicalTrials.gov, NCT05425134) were adopted to fully validate the predictive efficiencies of the DLNMS by benchmarking the single-modal deep learning model, clinical model and physicians. Moreover, the values of the DLNMS for guiding nodal biopsy, surgery and adjuvant therapy decision-makings were explored via efficiency evaluations in a nodal biopsy cohort (n = 366) and survival stratifications on different risk groups. Finally, the biologic basis of DLNMS was investigated by comparing histologic patterns, common genetic alternations, genetic pathways, and infiltrations of immune cells in microenvironments between patients with low and high scores. Patient selection details are provided in Supplementary Method 1 and Supplementary Fig. 7.
Data acquisition and deep learning algorithm
Clinical information was retrieved from medical records, and follow-up data were acquired from outpatient visits and telephone interviews. The pathologic nodal status in the internal cohort, external cohort and prospective cohort was defined based on surgically resected specimens and that in the nodal biopsy cohort was defined based on nodal biopsy specimens. SND was defined as dissected N2 stations ≥ 3 with complete N1 dissection according to National Comprehensive Cancer Network guidelines10. Follow-up protocol details are described in Supplementary Method 2. The region of interest of the primary tumor was annotated by a junior thoracic radiologist (T.W., with 5 years of experiences) and confirmed by an expert thoracic radiologist (J.S., with 25 years of experiences). Details regarding the parameters of PET/CT scanners and tumor annotation are summarized in Supplementary Method 3 & 4. The structure of the DLNMS was illustrated in Supplementary Fig. 8. Two ResNet18 backbone networks46 with the same structure were used to extract features from PET and CT images separately. Then, the PET and CT features were fused using the concat operation and input into a fully connected layer for classifications of ONM. The DLNMS consisted of two separate models predicting occult N1 and N2, respectively. For occult N1 prediction, data were divided into N1 metastasis and non-N1 metastasis. Similarly, in N2 prediction, data were divided into N2 metastasis and non-N2 metastasis. Details of image preprocessing and model construction procedures are provided in Supplementary Method 5-8. All computer codes for preprocessing and training are summarized at https://github.com/zhongthoracic/DLNMS.
Cutoff calculation
Based on the maximum Youden index in the training set, the cutoff values of all models were determined to calculate the performance metrics and define the risk groups. The cutoff values of the DLNMS for occult N1 and N2 were calculated as 0.362 and 0.356, respectively. Therefore, patients with N1 scores > 0.362 and <0.362 were considered to have high and low occult N1 probabilities, respectively, and those with N2 scores > 0.356 and <0.356 were considered to have high and low occult N2 probabilities, respectively. Finally, by combining the N1 and N2 scores, patients were divided into high-risk (N1 scores > 0.362 or N2 scores > 0.356) and low-risk (N1 scores < 0.362 and N2 scores < 0.356) groups.
Benchmarking
The predictive efficiency of the DLNMS was compared to the PET model, CT model, clinical model, senior physicians and junior physicians. The PET model and CT model were developed by the deep leaning algorithm based on the PET modality and CT modality, respectively. The clinical model was constructed by logistic analyses on the training set. For physicians, 3 senior radiologists and 3 junior radiologists blinded to pathological information were required to classify the ONM status based on imaging data. Benchmarking details are summarized in Supplementary Method 9.
Comprehensive treatments support
For nodal biopsy decisions, the predictive efficiency and performance metrics of the DLNMS in the nodal biopsy cohort were evaluated. For surgery decisions of clinical stage I NSCLC, ONM risks for patients receiving LND were predicted by the generated DLNMS and included into analyses (Supplementary Method 10). The prognosis of patients receiving lobectomy versus sublobectomy and SND versus LND was compared between the DLNMS defined low-risk and high-risk groups, respectively. For adjuvant therapy decisions of pathological stage I NSCLC, the oncological results of patients receiving adjuvant therapy versus not receiving adjuvant therapy were compared between the low-risk and high-risk groups.
Biologic basis exploration
According to the cutoff values, distributions of patients with N1 scores < 0.362 versus N1 scores > 0.362 and N2 scores < 0.356 versus N2 scores > 0.356 in aggressive histologic patterns (LVI, STAS, VPI, micropapillary component, and solid component) and common genetic alternations (EGFR, KRAS, BRAF, ALK, and ROS1) were compared, respectively. Additionally, based on patients with NSCLC in the radiogenomics dataset (a public dataset comprising paired PET/CT and RNA sequencing data, https://wiki.cancerimagingarchive.net), the GSEA and ssGSEA were implemented to reveal heterogeneity in genetic pathways and infiltration of immune cells in tumor microenvironment between patients with different ONM scores. GSEA and ssGSEA procedures are detailed in Supplementary Method 11 & 12.
Statistical analysis
Categorical variables were analyzed by Pearson χ2 test and Fisher exact test, continuous variables were compared by Student t-test and Mann-Whitney U test. The clinical model was generated based on the logistic regression analyses using a p-value level of 0·1. Survival data were assessed using the Kaplan-Meier method, log-rank test and Cox regression analyses. Predictive efficiency was evaluated by the AUROC and AUPRC. AUROCs among models were compared using the Delong’s test. Performance metrics containing sensitivity, specificity, accuracy, PPV, and NPV were generated based on cutoff values determined by the maximum Youden index in the training set. CIs were computed by 10, 000 bootstrap replicates. The Benjamini and Hochberg method was utilized to correct p values from multiple comparisons. Analyses mentioned above was conducted using SPSS (version 25.0, IBM SPSS Statistics) and R program (version 4.1.3, http://www.Rproject.org). A p < 0·05 was regarded as having statistical significance.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Supplementary information
Source data
Acknowledgements
This study was supported by National Natural Science Foundation of China (92259205, 82102126, 82272943); National Key Research and Development Program of China (2022YFC2407401); Science and Technology Commission of Shanghai Municipality (21YF1438200); Clinical Research Foundation of Shanghai Pulmonary Hospital (SKPY2021008); Investigator-Initiated Trial of Shanghai Pulmonary Hospital (2021LY1144, 2023LY0310); Ningbo Top Medical and Health Research Program (2022030208); and Medicine and Public Health Scientific Projects in Zhejiang Province (2020KY270). In addition, we would like to thank all members in the MultiomIcs claSSIfier for pulmOnary Nodules (MISSION) Collaborative Group for their supports and efforts.
Author contributions
Y.Z., C.C., T.C., D.X., Y.S., and C.C. designed this study and wrote the paper. C.C. and H.G. built the deep learning models. J.D., T.W., X.S., J.S., and Y.C. processed and analyzed the data. M.Y., B.Y., and Y.S. collected the clinical dataset and performed data preprocessing. D.X., Y.S., and C.C. conceived the project and edited the paper. All authors reviewed and approved the final manuscript for submission. We ensured that all authors had access to all the raw datasets. Y.Z., C.C., and T.C. have verified the data and are independent of any company or investor. D.X., Y.S., and C.C. had full access to all the data in the study and had final responsibility for the decision to submit for publication.
Peer review
Peer review information
Nature Communications thanks Alexandr Kalinin and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Data availability
The PET/CT imaging data in the current study are not publicly available for patient privacy purposes. However, if researchers wish to access our data solely for scientific research purposes and are willing to sign a data transfer agreement, the corresponding author can share the relevant data. Source data are provided with this paper.
Code availability
Are provided at GitHub (https://github.com/zhongthoracic/DLNMS).
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Yifan Zhong, Chuang Cai, Tao Chen.
Contributor Information
Dong Xie, Email: xiedong@tongji.edu.cn.
Chang Chen, Email: changchenc@tongji.edu.cn.
Yunlang She, Email: langthoracic@tongji.edu.cn.
Supplementary information
The online version contains supplementary material available at 10.1038/s41467-023-42811-4.
References
- 1.Whitson BA, Groth SS, Maddaus MA. Recommendations for optimal use of imaging studies to clinically stage mediastinal lymph nodes in non-small-cell lung cancer patients. Lung Cancer (Amst., Neth.) 2008;61:177–185. doi: 10.1016/j.lungcan.2007.12.019. [DOI] [PubMed] [Google Scholar]
- 2.Darling GE, et al. Positron emission tomography-computed tomography compared with invasive mediastinal staging in non-small cell lung cancer: results of mediastinal staging in the early lung positron emission tomography trial. J. Thorac. Oncol. 2011;6:1367–1372. doi: 10.1097/JTO.0b013e318220c912. [DOI] [PubMed] [Google Scholar]
- 3.Gómez-Caro A, et al. False-negative rate after positron emission tomography/computer tomography scan for mediastinal staging in cI stage non-small-cell lung cancer. Eur. J. Cardiothorac. Surg. 2012;42:93–100. doi: 10.1093/ejcts/ezr272. [DOI] [PubMed] [Google Scholar]
- 4.Beyaz F, Verhoeven RLJ, Schuurbiers OCJ, Verhagen A, van der Heijden E. Occult lymph node metastases in clinical N0/N1 NSCLC; A single center in-depth analysis. Lung cancer (Amst., Neth.) 2020;150:186–194. doi: 10.1016/j.lungcan.2020.10.022. [DOI] [PubMed] [Google Scholar]
- 5.Kramer H, Groen HJ. Current concepts in the mediastinal lymph node staging of nonsmall cell lung cancer. Ann. Surg. 2003;238:180–188. doi: 10.1097/01.SLA.0000081086.37779.1a. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Detterbeck FC, Jantz MA, Wallace M, Vansteenkiste J, Silvestri GA. Invasive mediastinal staging of lung cancer: ACCP evidence-based clinical practice guidelines (2nd edition) Chest. 2007;132:202s–220s. doi: 10.1378/chest.07-1362. [DOI] [PubMed] [Google Scholar]
- 7.Czarnecka-Kujawa K, Yasufuku K. The role of endobronchial ultrasound versus mediastinoscopy for non-small cell lung cancer. J. Thorac. Dis. 2017;9:S83–s97. doi: 10.21037/jtd.2017.03.102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Varlotto JM, et al. Identification of stage I non-small cell lung cancer patients at high risk for local recurrence following sublobar resection. Chest. 2013;143:1365–1377. doi: 10.1378/chest.12-0710. [DOI] [PubMed] [Google Scholar]
- 9.Zhao ZR, et al. Comparison of Segmentectomy and Lobectomy in Stage IA Adenocarcinomas. J. Thorac. Oncol. 2017;12:890–896. doi: 10.1016/j.jtho.2017.01.012. [DOI] [PubMed] [Google Scholar]
- 10.National Comprehensive Cancer Network. (NCCN) Clinical Practice Guidelines in Oncology. Non-Small Cell Lung Cancer, Version 1. 2022. Available at: https://www.nccn.org/professionals/physician_gls/default.aspx.
- 11.Strauss GM, et al. Adjuvant paclitaxel plus carboplatin compared with observation in stage IB non-small-cell lung cancer: CALGB 9633 with the Cancer and Leukemia Group B, Radiation Therapy Oncology Group, and North Central Cancer Treatment Group Study Groups. J. Clin. Oncol. 2008;26:5043–5051. doi: 10.1200/jco.2008.16.4855. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Butts CA, et al. Randomized phase III trial of vinorelbine plus cisplatin compared with observation in completely resected stage IB and II non-small-cell lung cancer: updated survival analysis of JBR-10. J. Clin. Oncol. 2010;28:29–34. doi: 10.1200/jco.2009.24.0333. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Goldstraw P, et al. The IASLC Lung Cancer Staging Project: Proposals for Revision of the TNM Stage Groupings in the Forthcoming (Eighth) Edition of the TNM Classification for Lung Cancer. J. Thorac. Oncol. 2016;11:39–51. doi: 10.1016/j.jtho.2015.09.009. [DOI] [PubMed] [Google Scholar]
- 14.Preoperative chemotherapy for non-small-cell lung cancer: a systematic review and meta-analysis of individual participant data. Lancet (London, England)383, 1561–1571, 10.1016/s0140-6736(13)62159-5 (2014). [DOI] [PMC free article] [PubMed]
- 15.Arriagada R, et al. Cisplatin-based adjuvant chemotherapy in patients with completely resected non-small-cell lung cancer. N. Engl. J. Med. 2004;350:351–360. doi: 10.1056/NEJMoa031644. [DOI] [PubMed] [Google Scholar]
- 16.Pignon JP, et al. Lung adjuvant cisplatin evaluation: a pooled analysis by the LACE Collaborative Group. J. Clin. Oncol. 2008;26:3552–3559. doi: 10.1200/jco.2007.13.9030. [DOI] [PubMed] [Google Scholar]
- 17.Scagliotti GV, et al. Randomized phase III study of surgery alone or surgery plus preoperative cisplatin and gemcitabine in stages IB to IIIA non-small-cell lung cancer. J. Clin. Oncol. 2012;30:172–178. doi: 10.1200/jco.2010.33.7089. [DOI] [PubMed] [Google Scholar]
- 18.Song WA, et al. Survival benefit of neoadjuvant chemotherapy in non-small cell lung cancer: an updated meta-analysis of 13 randomized control trials. J. Thorac. Oncol. 2010;5:510–516. doi: 10.1097/JTO.0b013e3181cd3345. [DOI] [PubMed] [Google Scholar]
- 19.Uy KL, et al. Improved results of induction chemoradiation before surgical intervention for selected patients with stage IIIA-N2 non-small cell lung cancer. J. Thorac. Cardiovasc. Surg. 2007;134:188–193. doi: 10.1016/j.jtcvs.2007.01.078. [DOI] [PubMed] [Google Scholar]
- 20.Hosny A, Parmar C, Quackenbush J, Schwartz LH, Aerts H. Artificial intelligence in radiology. Nat. Rev. Cancer. 2018;18:500–510. doi: 10.1038/s41568-018-0016-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Aerts HJ. The potential of radiomic-based phenotyping in precision medicine: a review. JAMA Oncol. 2016;2:1636–1642,. doi: 10.1001/jamaoncol.2016.2631. [DOI] [PubMed] [Google Scholar]
- 22.Aerts HJ, et al. Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach. Nat. Commun. 2014;5:4006. doi: 10.1038/ncomms5006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Chen HHW, Chiu N-T, Su W-C, Guo H-R, Lee B-F. Prognostic value of whole-body total lesion glycolysis at pretreatment FDG PET/CT in non–small cell lung cancer. Radiology. 2012;264:559–566. doi: 10.1148/radiol.12111148. [DOI] [PubMed] [Google Scholar]
- 24.Berghmans T, et al. Primary Tumor Standardized Uptake Value (SUVmax) Measured on Fluorodeoxyglucose Positron Emission Tomography (FDG-PET) is of Prognostic Value for Survival in Non-small Cell Lung Cancer (NSCLC): A Systematic Review and Meta-Analysis (MA) by the European Lung Cancer Working Party for the IASLC Lung Cancer Staging Project. J. Thorac. Oncol. 2008;3:6–12. doi: 10.1097/JTO.0b013e31815e6d6b. [DOI] [PubMed] [Google Scholar]
- 25.Nair VS, Barnett PG, Ananth L, Gould MK. PET scan 18F-fluorodeoxyglucose uptake and prognosis in patients with resected clinical stage IA non-small cell lung cancer. Chest. 2010;137:1150–1156. doi: 10.1378/chest.09-2356. [DOI] [PubMed] [Google Scholar]
- 26.Veziroglu EM, et al. Role of Artificial Intelligence in PET/CT Imaging for Management of Lymphoma. Semin Nucl. Med. 2023;53:426–448. doi: 10.1053/j.semnuclmed.2022.11.003. [DOI] [PubMed] [Google Scholar]
- 27.Domingues I, et al. Using deep learning techniques in medical imaging: a systematic review of applications on CT and PET. Artif. Intell. Rev. 2020;53:4093–4160. doi: 10.1007/s10462-019-09788-3. [DOI] [Google Scholar]
- 28.Wallis D, et al. An [18F]FDG-PET/CT deep learning method for fully automated detection of pathological mediastinal lymph nodes in lung cancer patients. Eur. J. Nucl. Med. Mol. imaging. 2022;49:881–888. doi: 10.1007/s00259-021-05513-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Li L, Zhao X, Lu W, Tan S. Deep learning for variational multimodality tumor segmentation in PET/CT. Neurocomputing. 2020;392:277–295. doi: 10.1016/j.neucom.2018.10.099. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Fu X, Bi L, Kumar A, Fulham M, Kim J. Multimodal spatial attention module for targeting multimodal PET-CT lung tumor segmentation. IEEE J. Biomed. Health Inf. 2021;25:3507–3516. doi: 10.1109/jbhi.2021.3059453. [DOI] [PubMed] [Google Scholar]
- 31.Yuan C, et al. Multimodal deep learning model on interim [(18)F]FDG PET/CT for predicting primary treatment failure in diffuse large B-cell lymphoma. Eur. Radiol. 2023;33:77–88. doi: 10.1007/s00330-022-09031-8. [DOI] [PubMed] [Google Scholar]
- 32.Pieterman RM, et al. Preoperative staging of non-small-cell lung cancer with positron-emission tomography. N. Engl. J. Med. 2000;343:254–261. doi: 10.1056/nejm200007273430404. [DOI] [PubMed] [Google Scholar]
- 33.Kim DH, et al. Metabolic parameters using 18F-FDG PET/CT correlate with occult lymph node metastasis in squamous cell lung carcinoma. Eur. J. Nucl. Med. Mol. imaging. 2014;41:2051–2057. doi: 10.1007/s00259-014-2831-6. [DOI] [PubMed] [Google Scholar]
- 34.Ouyang ML, et al. Prediction of occult lymph node metastasis using tumor-to-blood standardized uptake ratio and metabolic parameters in clinical N0 lung adenocarcinoma. Clin. Nucl. Med. 2018;43:715–720. doi: 10.1097/rlu.0000000000002229. [DOI] [PubMed] [Google Scholar]
- 35.Shin, S. H. et al. Which definition of a central tumour is more predictive of occult mediastinal metastasis in nonsmall cell lung cancer patients with radiological N0 disease? Eur. Respir. J.53, 10.1183/13993003.01508-2018 (2019). [DOI] [PMC free article] [PubMed]
- 36.Zhang Y, et al. A prediction model for N2 disease in T1 non-small cell lung cancer. J. Thorac. cardiovascular Surg. 2012;144:1360–1364. doi: 10.1016/j.jtcvs.2012.06.050. [DOI] [PubMed] [Google Scholar]
- 37.Chen K, Yang F, Jiang G, Li J, Wang J. Development and validation of a clinical prediction model for N2 lymph node metastasis in non-small cell lung cancer. Ann. Thorac. Surg. 2013;96:1761–1768. doi: 10.1016/j.athoracsur.2013.06.038. [DOI] [PubMed] [Google Scholar]
- 38.Farjah F, Lou F, Sima C, Rusch VW, Rizk NP. A prediction model for pathologic N2 disease in lung cancer patients with a negative mediastinum by positron emission tomography. J. Thorac. Oncol. 2013;8:1170–1180. doi: 10.1097/JTO.0b013e3182992421. [DOI] [PubMed] [Google Scholar]
- 39.Yang M, et al. CT-based radiomics signature for the stratification of N2 disease risk in clinical stage I lung adenocarcinoma. Transl. Lung Cancer Res. 2019;8:876–885. doi: 10.21037/tlcr.2019.11.18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Gu Y, et al. A Texture Analysis-Based Prediction Model for Lymph Node Metastasis in Stage IA Lung Adenocarcinoma. Ann. Thorac. Surg. 2018;106:214–220. doi: 10.1016/j.athoracsur.2018.02.026. [DOI] [PubMed] [Google Scholar]
- 41.Zhong Y, et al. Deep Learning for Prediction of N2 Metastasis and Survival for Clinical Stage I Non-Small Cell Lung Cancer. Radiology. 2022;302:200–211. doi: 10.1148/radiol.2021210902. [DOI] [PubMed] [Google Scholar]
- 42.Jin C, et al. Predicting treatment response from longitudinal images using multi-task deep learning. Nat. Commun. 2021;12:1851. doi: 10.1038/s41467-021-22188-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Li K, Zhang R, Cai W. Deep learning convolutional neural network (DLCNN): unleashing the potential of (18)F-FDG PET/CT in lymphoma. Am. J. Nucl. Med Mol. Imaging. 2021;11:327–331. [PMC free article] [PubMed] [Google Scholar]
- 44.Kumar, A., Fulham, M., Feng, D. & Kim, J. Co-Learning Feature Fusion Maps from PET-CT Images of Lung Cancer. IEEE Trans Med Imaging, 10.1109/tmi.2019.2923601 (2019). [DOI] [PubMed]
- 45.Donahue J, et al. Long-term recurrent convolutional networks for visual recognition and description. IEEE Trans. Pattern Anal. Mach. Intell. 2017;39:677–691. doi: 10.1109/tpami.2016.2599174. [DOI] [PubMed] [Google Scholar]
- 46.He, K., Zhang, X., Ren, S. & Sun, J. Deep residual learning for image recognition. Proceedings of the IEEE conference on computer vision and pattern recognition, 770-778 (2016).
- 47.Liu Z, Bondell HD. Binormal precision–recall curves for optimal classification of imbalanced data. Stat. Biosci. 2019;11:141–161. doi: 10.1007/s12561-019-09231-9. [DOI] [Google Scholar]
- 48.Saito T, Rehmsmeier M. The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets. PloS one. 2015;10:e0118432. doi: 10.1371/journal.pone.0118432. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Ozenne B, Subtil F, Maucort-Boulch D. The precision–recall curve overcame the optimism of the receiver operating characteristic curve in rare diseases. J. Clin. Epidemiol. 2015;68:855–859. doi: 10.1016/j.jclinepi.2015.02.010. [DOI] [PubMed] [Google Scholar]
- 50.Fawcett T. An introduction to ROC analysis. Pattern Recognit. Lett. 2006;27:861–874. doi: 10.1016/j.patrec.2005.10.010. [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The PET/CT imaging data in the current study are not publicly available for patient privacy purposes. However, if researchers wish to access our data solely for scientific research purposes and are willing to sign a data transfer agreement, the corresponding author can share the relevant data. Source data are provided with this paper.
Are provided at GitHub (https://github.com/zhongthoracic/DLNMS).