Peritumoral and intratumoral radiomic features predict survival outcomes among patients diagnosed in lung cancer screening

Jaileene Pérez-Morales; Ilke Tunali; Olya Stringfield; Steven A Eschrich; Yoganand Balagurunathan; Robert J Gillies; Matthew B Schabath

doi:10.1038/s41598-020-67378-8

. 2020 Jun 29;10:10528. doi: 10.1038/s41598-020-67378-8

Peritumoral and intratumoral radiomic features predict survival outcomes among patients diagnosed in lung cancer screening

Jaileene Pérez-Morales ¹, Ilke Tunali ^2,³, Olya Stringfield ², Steven A Eschrich ⁴, Yoganand Balagurunathan ⁴, Robert J Gillies ², Matthew B Schabath ^1,^5,^✉

PMCID: PMC7324394 PMID: 32601340

Abstract

The National Lung Screening Trial (NLST) demonstrated that screening with low-dose computed tomography (LDCT) is associated with a 20% reduction in lung cancer mortality. One potential limitation of LDCT screening is overdiagnosis of slow growing and indolent cancers. In this study, peritumoral and intratumoral radiomics was used to identify a vulnerable subset of lung patients associated with poor survival outcomes. Incident lung cancer patients from the NLST were split into training and test cohorts and an external cohort of non-screen detected adenocarcinomas was used for further validation. After removing redundant and non-reproducible radiomics features, backward elimination analyses identified a single model which was subjected to Classification and Regression Tree to stratify patients into three risk-groups based on two radiomics features (NGTDM Busyness and Statistical Root Mean Square [RMS]). The final model was validated in the test cohort and the cohort of non-screen detected adenocarcinomas. Using a radio-genomics dataset, Statistical RMS was significantly associated with FOXF2 gene by both correlation and two-group analyses. Our rigorous approach generated a novel radiomics model that identified a vulnerable high-risk group of early stage patients associated with poor outcomes. These patients may require aggressive follow-up and/or adjuvant therapy to mitigate their poor outcomes.

Subject terms: Lung cancer, Biomarkers

Introduction

The National Lung Screening Trial (NLST) demonstrated that annual screening with low-dose helical computed tomography (LDCT) compared to chest radiography is associated with a 20% relative reduction in lung cancer mortality among high-risk individuals¹. However, LDCT screening can lead to overdiagnosis and overtreatment of slow growing, indolent cancers that may pose no threat if left untreated^2,3. Prior post-hoc analyses of the NLST have estimated that 18–22.5% of screen-detected cancers would not become symptomatic in a patient’s lifetime and would remain as indolent lung cancer⁴. At present there is limited data regarding the potential harmful impact of overdiagnosis on lung cancer outcomes; however, studies have suggested overdiagnosis is associated with increased operative mortality, severe disability among survivors, and reduction in longer term disease-free survival attributed to loss of pulmonary reserve⁵. Though clinical guidelines provide recommendations for the management of screen-detected nodules, there are limited tools to discriminate between indolent and aggressive lung cancers diagnosed in the lung cancer screening setting^6–9. As such, biomarkers that can classify behavior of screen-detected lung cancers is an unmet clinical need since prior studies have suggested that 10 to 27% of lung cancers are over-diagnosed in lung cancer screening^10–13.

Quantitative image features, also known as radiomics¹⁴, are non-invasive biomarkers that are generated from medical imaging and reflect the underlying tumor pathophysiology and heterogeneity. Radiomics have many advantages over circulating and tissue-based biomarkers as they are rapidly calculated from standard-of-care imaging and they reflect the entire tumor burden and not just a sample of the tumor in the case of tissue-based biomarkers. Our group^8,15–17 and others^18–20 have utilized radiomics in the lung cancer screening setting to improve risk prediction and diagnostic discrimination. To date, there have been limited efforts to use radiomics to predict tumor behavior and patient outcomes in the lung cancer screening setting.

Using publicly available data and LDCT images from the NLST, we generated radiomic features from screen-detected, incidentally-diagnosed lung cancers. Radiomic features describing size, shape, volume, and textural characteristics were calculated from the intratumoral region (area within the tumor) and from the peritumoral region (area surrounding the tumor parenchyma). The goal of this study was to utilize these peritumoral and intratumoral radiomics to identify a reproducible parsimonious model that predicts survival outcomes among lung cancer patients diagnosed in the lung cancer screening setting.

Results

Patient characteristics

There were no statistically significant differences between training and test cohorts for age, sex, smoking status, number of pack-years smoked, family history (FH) of lung cancer, histology, treatment, stage, and baseline screening result (Table 1). Self-reported chronic obstructive pulmonary disease (COPD) was significantly higher among patients in the test cohort versus the training cohort (16% vs. 7%, P = 0.02). Using Student t test, we found no statistically significant difference in mean age between training and test cohort when stratifying then among male (P = 0.99) and among females (P = 0.73).

Table 1.

Patient characteristics in the training and test cohorts.

Characteristics	Total (N = 234)	Training Cohort (N = 161)	Test Cohort (N = 73)	P value^1,2,3	Validation Cohort (N = 62)
Age, mean (sd)	63.8 (5.1)	63.9(5.1)	63.5 (5.3)	0.63	67.2 (9.47)
By sex
Female	36.0 (5.2)	63.2 (5.2)	62.8 (5.2)		67.73 (8.92)
Male	64.3 (5.0)	64.3 (4.9)	64.3 (5.3)		66.58 (10.10)
Sex, N(%)				0.06
Female	101 (43%)	63 (39%)	38 (52%)		30 (48.4%)
Male	133 (57%)	98 (61%)	35 (48%)		31 (50%)
Missing					1 (1.6%)
Smoking status, N(%)				0.82
Former	100 (43%)	68 (42%)	32 (44%)		59 (95.2%)
Current	134 (57%)	93 (58%)	41 (56%)		0(0%)
Never					2 (3.2%)
Missing					1 (1.6%)
No. pack-years, mean (SD)	64.7 (23.9)	64.5 (23.6)	65.1 (24.9)	0.86	n/a
FH of lung cancer				0.74
No	170 (73%)	118 (73%)	52 (71%)		n/a
Yes	64 (27%)	43 (27%)	21 (29%)		n/a
Self-reported history of COPD				0.02
No	211 (90%)	150 (93%)	61 (84%)		n/a
Yes	23 (10%)	11 (7%)	12 (16%)		n/a
Histology				0.23
Adenocarcinoma-BAC⁴	130 (56%)	96 (60%)	34 (47%)		55 (88.7%)
Squamous	44 (19%)	28 (17%)	16 (22%)		0 (0)
Small Cell	10 (4%)	5 (3%)	5 (7%)		0 (0)
Other NOS	50 (21%)	32 (20%)	18 (25%)		0 (0)
Missing					7 (11.3%)
Treatment				0.22
Surgical resection	182 (78%)	129 (80%)	53 (73%)		n/a
Chemotherapy/Other	21 (9%)	11 (7%)	10 (14%)		n/a
Radiation therapy	31 (13%)	21 (13%)	10 (14%)		n/a
Staging				0.70
I and II	179 (76%)	122 (76%)	57 (78%)		45 (72.6%)
III and IV	55 (24%)	39 (24%)	16 (22%)		17 (27.4%)
Baseline Screening				0.19
Positive (T0+)	158 (68%)	113 (70%)	45 (62%)		n/a
Negative (T0−)	76 (32%)	48 (30%)	28 (38%)		n/a
5-year overall survival rate (%)	64%	62.60%	56.20%	0.44	64%

Open in a new tab

Abbreviations: SD = standard deviation; FH = family history; Pack-years = packs smoked/day x years smoked; COPD = chronic obstructive pulmonary disease; NOS = not otherwise specified;

¹ P value obtained from Chi-squared for categorical variables.

² P value obtained from T-test for continuous variables.

³ P value obtained from Log-rank for survival variables.

⁴ BAC and adenocarcinoma were combined into one group.

Radiomic analyses

A total of 109 peritumoral features were extracted, of which 56 were found to be both stable and reproducible, and a total of 155 intratumoral features were extracted, of which 35 were stable and reproducible. Therefore, a total of 91 stable and reproducible radiomics features (peritumoral and intratumoral) were subjected to univariable analysis. In univariable analyses, 40 of the 91 radiomic features (26 peritumoral and 14 intratumoral) were significantly associated with OS in the training cohort (Supplemental Table 1) and 30 of the 40 features were eliminated because they were correlated. The 10 remaining features were reduced to four highly informative features using backward elimination (Supplemental Table 2). Among the four features, three were peritumoral (average co-occurrence joint entropy, NGTDM busyness, and average co-occurrence angular second moment) and one was intratumoral (statistical root mean square).

Classification and regression tree (CART) analysis

The four remaining radiomic features were subjected to Classification And Regression Tree (CART) analysis in the training cohort and based on 2 radiomic features (NGTDM Busyness and Statistical Root Mean Square). CART analysis classified patients into four risk groups: low-risk, intermediate-risk-1, intermediate-risk-2, and high-risk (Supplemental Fig. 1). The four risk groups were reduced to three risk groups by combining the two intermediate-risk groups (Fig. 1B) and 3 risk groups from the CART model was replicated in the test cohort.

Identification of risk groups based on peritumoral and intratumoral features (A) Statistical analysis pipeline for radiomics feature selection. (B) The tree structure of the classification and regression tree analysis (CART) which identified four risk groups based on two radiomics features, in which we combined the two intermediate risk groups.

In training cohort, the high-risk group (Fig. 2A) was associated with extremely poor Overall Survival (OS) (Hazard Ratio (HR) = 14.67; 10% 2.5-year OS and 0% 5-year OS, log-rank P < 0.0001) versus the intermediate (HR = 3.25; 63% 2.5-year OS and 41% 5-year OS) and low-risk group (HR = 1.00; 89% 2.5-year OS and 78% 5-year OS). In the test cohort, the high-risk group was associated with extremely poor OS (HR = 3.35; 50% 2.5-year OS and 0% 5-year OS, log-rank P = 0.043) versus the low-risk group (HR = 1.00, 68% 2.5 year OS and 51% 5-year OS) (Fig. 2A). Similar findings were observed for Progression Free Survival (PFS) (Fig. 2B).

Risk-groups associated with overall survival for training and test cohorts and among early stage patients. Across the training and test cohort as well as in early-stage the high-risk group had a significantly worse outcome in OS (A) and PFS (B).

Patient characteristics of the three risk groups in the total data set (N = 234)

There were no statistically significant differences between the three risk groups by age, smoking status, number of pack-years smoked, self-reported COPD, and family history of lung cancer, histological subtypes, and treatment (Table 2). However, there were statistically significant differences across the risk groups for sex (P = 0.04) and stage of disease (P = 0.001). Specifically, 92% of the patients in the high-risk group in were male vs. 54% in the low-risk group (P = 0.04) (Table 2). In term of lung cancer stage, 33% of the patients in the high-risk group had lung cancer early-stage vs. 80% in the low-risk group (P = 0.001).

Table 2.

Patient characteristics of the three risk groups in the total data set (N = 234).

Characteristics	Low risk group (N = 153)	Intermediate risk group (N = 69)	High risk (N = 12)	P value^1,2,3
Age, mean (sd)	63.1 (4.9)	65.2 (5.4)	63.7 (3.4)	0.07
Female	63.1 (4.9)	64.7 (5.6)	64
Male	63.8 (4.9)	65.6 (5.4)	63.6 (3.6)
Sex, N (%)				0.04
Female	71 (46%)	29 (42%)	1 (8%)
Male	82 (54%)	40 (58%)	11 (92%)
Smoking, N (%)				0.43
No	66 (43%)	31 (45%)	3 (25%)
Yes	87 (57%)	38 (55%)	9 (75%)
Pack-years, mean (sd)	63.2 (23.8)	66.5 (23.9)	74.0 (26.7)	0.95
FH of lung cancer				0.49
No	115 (75%)	47 (68%)	8 (67%)
Yes	38 (25%)	22 (32%)	4 (33%)
Self-reported history of COPD				0.19
No	140 (92%)	59 (86%)	12 (100%)
Yes	13 (8%)	10 (14%)	0 (0%)
Histology				0.27
Adenocarcinoma-BAC⁴	91 (59%)	36 (52%)	3 (25%)
Squamous	26 (17%)	13 (19%)	5 (42%)
Small cell	5 (3%)	4 (6%)	1 (8%)
Other NOS	31 (20%)	16 (23%)	3 (25%)
Treatment				0.13
Surgical	125 (82%)	50 (72%)	7 (58%)
Chemotherapy/Other	13 (8%)	7 (10%)	1 (8%)
Radiation therapy	15 (10%)	12 (17%)	4 (33%)
Staging				0.001
Early stage (I and II)	123 (80%)	52 (75%)	4 (33%)
Late stage (III and IV)	30 (20%)	17 (25%)	8 (67%)
Baseline screening				0.06
Positive (T0+)	111 (73%)	41 (59%)	6 (50%)
Negative (T0−)	42 (27%)	28 (41%)	6 (50%)
2.5-year overall survival rate, %				< 0.001
Training	89%	63%	10%
Test Cohort	68%	78%	50%
5-year overall survival rate, %				< 0.001
Training	77%	41%	n/a
Test Cohort	51%	73%	n/a

Open in a new tab

Abbreviations: sd = standard deviation; FH = family history; Pack-years = packs smoked/day x years smoked; COPD = chronic obstructive pulmonary disease; NOS = not otherwise specified;

¹ P-value obtained from Chi-squared for categorical variables.

² P-value obtained from Anova for continuous variables.

³ P-value obtained from Log-rank for survival variables.

⁴ BAC and adenocarcinoma were combined into one group.

Survival analyses among early stage patients

Among all early-stage patients, the high-risk group was associated with a significantly decreased OS (HR = 9.91; 25% 2.5-year and 0% 5-year OS, log-rank P < 0.0001) versus the low-risk group (HR = 1.00; 93% 2.5-year and 78% 5-year OS) (Fig. 2A). Similar results were found for PFS (Fig. 2B).

Validation dataset

Using non-screen detected we attempted to replicate the risk groups obtained from the CART model. Among early-stage adenocarcinoma lung cancers (Fig. 3A), the high-risk group was associated with worse OS (HR = 2.63; 56% 2.5-year and 42% 5-year OS, log-rank P = 0.112) compared to the low-risk group (HR = 1.00; 75% 2.5-year and 75% 5-year OS). Among late-stage patients (Fig. 3B), the risk groups were not associated with survival (log-rank P = 0.432).

Overall survival for the risk patient risk groups among non-screen detected adenocarcinoma lung cancers (A) and for early-stage (B) for late-stage patients.

Multivariable analyses

Multivariable Cox regression models were used to adjust for potential confounding factors including sex, treatment, and stage. In the training cohort, the high-risk group was associated with an elevated hazard ratio (OS: HR = 9.71; 95% Confidence Interval: [3.85, 24.48] and PFS: HR = 5.68; 95% Confidence Interval: [2.32, 13.93] ) when compared to intermediate and low-risk groups (Tables 3 and 4). In the test cohort, the high-risk group yielded an increased hazard ratio in PFS when compared to intermediate and low-risk groups (PFS: HR = 2.02; 95% Confidence Interval: [0.34, 11.99]). Among all patients, the high-risk group was associated with a significantly elevated hazard ratio (OS: HR = 5.16; 95% Confidence Interval: [2.34, 11.37] and PFS: HR = 4.02; 95% Confidence Interval: [1.89, 8.56]) when compared to the intermediate and low-risk group. For OS and PFS analysis, smoking status, sex, nor baseline screening were associated with our model.

Table 3.

Multivariable Cox proportional hazards models for overall survival in the training and test cohorts.

Characteristic	All Patients² HR (95% CI)	Training Cohort HR (95% CI)	Test Cohort HR (95% CI)
Risk group
Low	1.00 (Reference)	1.00 (Reference)	1.00 (Reference)
Intermediate	1.55 (0.95, 2.53)	2.41 (1.28, 4.56)	0.46 (0.15, 1.43)
High	5.16 (2.34, 11.37)	9.71 (3.85, 24.48)	0.93 (0.14, 6.33)
Sex
Female	1.00 (Reference)	1.00 (Reference)	1.00 (Reference)
Male	1.46 (0.88, 2.41)	2.41 (1.18, 4.94)	1.13 (0.46, 2.74)
Smoking status
Former	1.00 (Reference)	1.00 (Reference)	1.00 (Reference)
Current	1.13 (0.72, 1.80)	0.96 (0.53, 1.75)	1.71 (0.68, 4.34)
Baseline screening (T0)
Positive (T0+)	1.00 (Reference)	1.00 (Reference)	1.00 (Reference)
Negative (T0−)	1.19 (0.72, 1.97)	0.84 (0.44, 1.61)	2.55 (0.94, 6.96)
Stage
I and II	1.00 (Reference)	1.00 (Reference)	1.00 (Reference)
III and IV	2.84 (1.58, 5.13)	4.42 (1.99, 9.82)	1.68 (0.55, 5.11)
Histology
Adenocarcinoma-BAC¹	1.00 (Reference)	1.00 (Reference)	1.00 (Reference)
Squamous	1.31 (0.70, 2.46)	1.33 (0.85, 3.03)	1.19 (0.37, 3.84)
Small Cell	1.80 (0.79, 4.11)	2.21 (0.69, 7.08)	2.32 (0.62, 8.64)
Other NOS	1.18 (0.67, 2.06)	1.49 (0.71, 3.10)	2.16 (0.70, 6.73)
Treatment
Surgical	1.00 (Reference)	1.00 (Reference)	1.00 (Reference)
Chemotherapy/Other	4.36 (2.17, 8.75)	3.37 (1.24, 9.15)	5.31 (1.57, 17.91)
Radiation	3.29 (1.72, 6.29)	1.55 (0.64, 3.76)	13.52 (3.72, 49.12)
Harrell's C index	0.79	0.83	0.81

Open in a new tab

Abbreviations: HR: Hazard Ratios; NOS: Not otherwise specified;

Data in parentheses are 95% CIs.

¹BAC and adenocarcinoma were combined into one group.

² “All patients” combines the training and test sets into a single cohort.

Table 4.

Multivariable Cox proportional hazards models for progression free survival in the training and test cohorts.

Characteristic	All Patients HR (95% CI)	Training Cohort HR (95% Cl)	Test Cohort HR (95% CI)
Risk Group
Low	1.00 (Reference)	1.00 (Reference)	1.00 (Reference)
Intermediate	1.42 (0.92, 2.22)	2.04 (1.16, 3.59)	0.34 (0.12, 0.98)
High	4.02 (1.89, 8.56)	5.68 (2.32, 13.93)	2.02 (0.34, 11.99)
Sex
Female	1.00 (Reference)	1.00 (Reference)	1.00 (Reference)
Male	0.92 (0.60, 1.42)	1.29 (0.72, 2.31)	0.60 (0.27, 1.32)
Smoking Status
Former	1.00 (Reference)	1.00 (Reference)	1.00 (Reference)
Current	1.21 (0.79, 1.85)	0.98 (0.57, 1.69)	1.77 (0.74, 4.23)
Baseline Screening
Positive (T0+)	1.00 (Reference)	1.00 (Reference)	1.00 (Reference)
Negative (T0−)	1.47 (0.94, 2.31)	1.02 (0.57, 1.83)	2.60 (1.15, 5.90)
Stage
I and II	1.00 (Reference)	1.00 (Reference)	1.00 (Reference)
III and IV	3.54 (2.07, 6.04)	4.93 (2.45, 9.91)	1.42 (0.55, 3.69)
Histology
Adenocarcinoma-BAC¹	1.00 (Reference)	1.00 (Reference)	1.00 (Reference)
Squamous	1.03 (0.60, 1.87)	1.06 (0.50, 2.26)	1.12 (0.36, 3.49)
Small Cell	1.49 (0.68, 3.25)	2.31 (0.75, 7.13)	1.49 (0.41, 5.37)
Other NOS	1.05 (0.63, 1.74)	0.95 (0.48, 1.89)	2.31 (0.86, 6.21)
Treatment
Surgical	1.00 (Reference)	1.00 (Reference)	1.00 (Reference)
Chemotherapy/Other	3.48 (1.86, 6.51)	3.13 (1.25, 7.87)	5.09 (1.75, 14.80)
Radiation	2.37 (1.29, 4.35)	1.60 (0.69, 3.69)	8.56 (2.51, 29.13)
Harrel’s C statistics	0.79	0.81	0.80

Open in a new tab

Performance metrics: Harrell’s c index and areas under the curve (AUROC)

The discrimination performance of the multivariable model was estimated using the Harrell’s C index. The multivariable model showed a better discrimination capability with a higher C indices in the training and test cohort when analyzing OS (C-index: 0.83 and 0.81, training and test cohort respectively) compared to PFS (C-index: 0.81 and 0.80, training and test cohort respectively). Using the multivariable models, time-dependent areas under the curve (AUROC) for OS were generated and produced an AUROC of 0.878 at 2-years and 0.800 for 4-years. Among early-stage patients, the AUROC was 0.702 for 2-years and 0.669 and 4-years (Fig. 4). Similar results were found for PFS (Supplemental Fig. 3).

Time-dependent AUC plot of the multivariable model for overall survival for (A) all patients and (B) early-stage patients.

Radiogenomics analysis

From the radiogenomic analyses, FOXF2 and LOC285043 were found to be significantly associated with RMS feature in both the pair wise analysis and correlation analysis (Supplementary Table 3). Stage was not significantly associated with the two most informative radiomics features (P = 0.6828 for RMS and P = 0.7905 for NGTDM Busyness). From the correlation analyses, four genes were positively significantly correlated with the RMS radiomic feature FOXF2, TBX4, LOC285043 and TM4SF18. Among these four genes, FOXF2 had the highest correlation (r = 0.45) with RMS while the other genes were correlated at r = 0.44 (Fig. 5A). In the pair-wises analyses where RMS was dichotomized at the median value, six genes were significantly different the RMS high and RMS low. The mean value of FOXF2 was significantly higher for RMS high vs. RMS low (mean = 7.1050 [SD = 1.0664] vs. 7.9798 [0.7054], p < 0.001) (Fig. 5B). When we generated the risk groups in the radiogenomics dataset, FOXF2 expression was significantly lower for the intermediate-risk group vs. low-risk group (mean = 7.2408 [SD = 1.0755] vs. 8.0287 [0.7524], p < 0.001). Among the three risk groups, the intermediate-risk had the lower FOXF2 expression when compared to the high-risk and low-risk group. Although FOXF2 expression was lower for the high-risk group vs. low-risk group (mean = 7.6819 [SD = 0.5879] vs. 8.0287[0.7521]), the difference was not statistically significant (p = 0.539).

The association between radiomics gene expression. (A) Correlation between intratumoral RMS radiomic feature and *FOXF2*. (B) *FOXF2* expression by dichotomizing RMS at the median and FOXF2 expression by the three patient risk groups. (C) *RABGAP1L* expression by dichotomizing NGTDM busyness at the median and NGTDM busyness expression by the three patient risk groups.

No genes were significantly correlated with NGTDM busyness. In the pairwise analyses where NGTDM busyness was dichotomized at the median, three genes were significantly different between NGTDM busyness high vs. NGTDM busyness low: RABGAP1L, LOC101928674, and LDLRAD4-AS1. RABGAP1L expression was significantly higher NGTDM busyness low vs NGTDM busyness high (mean = 6.7612 [SD = 0.8313] vs. 6.1066 [0.8200], P < 0.001). For the risk groups, RABGAP1L expression was higher for the intermediate-risk group vs. low-risk group (mean = 6.5938 [SD = 0.8866] vs. 6.2120 [0.8981], P = 0.100). RABGAP1 expression was higher for the high-risk group vs. low-risk group (mean = 6.2534 [SD = 0.6919] vs. 6.2120 [0.8981]), the difference was not statistically significant (P = 0.990). Stage was not significantly associated with the two most informative radiomics features (P = 0.6828 for RMS and P = 0.7905 for NGTDM Busyness) nor with the two most informative genes (P = 0.7767 for FOXF2 and P = 0.7928 for RABGAP1L).

Discussion

Predictive biomarkers that identify aggressive cancers from those that are either indolent, or at lower-risk of poor survival outcomes, are a critical unmet need in the lung cancer screening setting. In this study, we utilized peritumoral and intratumoral radiomic features to generate a model that is able to detect vulnerable group of screen-detected early stage lung cancer patients that have high-risk of experiencing poor survival outcomes. Specifically, we identified a model that contained two radiomics features, one peritumoral and one intratumoral, which stratified patients into three risk-groups: low-risk, intermediate-risk, and high-risk. The model identified a vulnerable group early-stage patients with worse OS (HR = 9.91; 25% 2.5-year and 0% 5-year OS) versus the low-risk group (HR = 1.00; 93% 2.5-year and 78% 5-year OS). The final model was validated in the test cohort and further replicated in a cohort of non-screen detected adenocarcinoma patients. Because disease stage was significantly different across the risk groups, we stratified the model by stage and found compelling results among early stage patients, which typically have very good survival outcomes. Among early stage patients, the high-risk group was associated with a worse OS (HR = 2.63; 56% 2.5-year and 42% 5-year OS) compared to the low-risk group (HR = 1.00; 75% 2.5-year and 75% 5-year OS).

Radiomics is a non-invasive approach that utilizes standard-of-care imaging to generate quantitative image features that can be used for risk prediction, diagnostic discrimination, prognostication, and to predict treatment response^{14,15,21–23}. Prior studies have shown that peritumoral features, extracted from the area surrounding the tumor parenchyma, and intratumoral features, extracted from the area within the tumor, have prognostic and predictive utility in cancers such as lung, breast, brain, gastric, and head and neck^1,24–29. For example, Dong et al. (29) developed an individualized nomogram using radiomic features from primary tumor and from the peritoneum to identify occult peritoneal metastasis among patients with advanced gastric cancer²⁹. Using peritumoral features from contrast-enhanced magnetic resonance imaging, Braman et al. found that CoL1AGe entropy was associated with pathological complete response among breast cancer patients who are Her2 negative²⁶. In, Xu et al. identified a radiomic score using features from the peritumoral region of hepatocellular carcinoma tumors that predicts microvascular involvement³⁰. Cumulative, the evidence of our study and others have demonstrate the utility of using peritumoral features alone or in combination with intratumoral features.

In this study, we identified a highly informative peritumoral feature (NGTDM busyness) and a highly informative intratumoral feature (statistical RMS). NGTDM, a texture feature, captures intensity values of a neighborhood of pixels to characterize the difference between a center voxel within the neighborhood^31,32. NGTDM parameters are coarseness, contrast, and busyness. Coarseness describes the granularity of an image, contrast relates to the dynamic range of intensity, and busyness relate to the rate of intensity change within an image³². NGTDM busyness has a high predictive power in differentiating between glioblastoma and primary central nervous system lymphoma³³. Studies found that NGTDM busyness extracted from positron emission tomography (PET) is useful for discriminating benign from malignant solid pulmonary nodules³⁴. Intensity-based features are derived from image histograms which represent the intensity distribution of the image. Parmar et al. 2015 showed a correlation between intensity feature and patient survival in lung and head and neck cancer. Statistical RMS, an intensity feature, is a first-order statistic that calculates the root mean square of the voxel’s intensity value^31,35. A previous study showed that statistical RMS was able to predict pathological response after chemoradiation in non-small cell lung cancer (NSCLC), by identifying gross residual response³⁵. Statistical RMS combined with other intensity statistical features was able to distinguish bladder tumor tissue from other normal tissues in fluorodeoxyglucose-positron emission tomography (FDG-PET) scan³⁶.

The radiogenomics analyses revealed that the most informative intratumoral radiomic feature, RMS, was significantly associated with expression of FOXF2 and LOC285043 which is an uncharacterized gene³⁷. We observed a trend for lower expression of FOXF2 in intermediate and high-risk groups versus the low-risk group. However, this trend in high and low-risk group was not statistically significant. FOXF2 is expressed in the lung and functions as an activator or inhibitor of gene transcription³⁸ and up-regulation of FOXF2 expression induces EMT, migration, invasion and metastasis in breast cancer³⁹. To support our findings that low expression of FOXF2 is a negative prognostic factor, a prior study demonstrated that patients with stage I NSCLC who had low FOXF2 expression had significantly shorter overall survival compared to patients with high FOXF2 expression⁴⁰. RABGAP1L was found to be significantly associated with NGTDM Busyness in the pairwise analyses and we revealed that the intermediate and high-risk groups had higher expression of RABGAP1L versus the low-risk group. At present, there is no known role for RABGAP1L in lung cancer. However, RABGAP1L has been shown to deregulate the tyrosine-kinase signaling pathway in acute myeloid leukemia⁴¹ and regulates the activity of GTPases which is essential to transport cell adhesion proteins and migrating cells⁴².

We acknowledge some limitations of this study. First, the sample size is somewhat modest because we utilized lung cancer cases with specific inclusion/exclusion criteria from the NLST. Then when we split the available number of cases into training and test cohorts based on a 70:30 ratio; the resulting sample sizes likely attributed to the poorly calibrated model based on its ability to predict 5-years survival outcomes in the training and test cohorts. However, we applied a rigorous feature reduction approach to eliminate correlated and non-reproducible features and utilized a backward reduction approach to identify a parsimonious model containing the most important features to reduce false positive findings. Although the overall population of lung cancer patients were treated heterogeneously; however, among the early stage patients, 92.74% of the patients had surgery as their only treatment. We also recognize that there were limited numbers of patients in the high-risk groups for these cohorts. Finally, our validation cohort was limited to patients with lung adenocarcinoma. Additional research is needed to validate the biological underpinnings of these features.

The results from our analyses produced a parsimonious radiomic model that identified a vulnerable subset of screen-detected lung cancers that are associated with poor outcome. These findings could support more aggressive treatment and follow-up for such high-risk patients. Nonetheless, additional research will be needed to inform the potential translational implications of these findings, to fully elucidate the biology these high-risk screen-detected tumors, to assess whether these findings are consistent across screening trials and cohorts, and how best to personalize cancer management in these vulnerable patients.

Methods

NLST study population

Deidentified data and LDCT images were accessed from the National Cancer Institute (NCI) Cancer Data Access System (CDAS)⁴³. The NLST study design and main findings have been described previously^1,28. NLST eligibility criteria included current and former smokers aged 55‐74 years with a minimum 30 pack‐years smoking history and former smokers had to have quit within the past 15 years.

Based on the schema described in Schabath et al.⁴⁴, this analysis considered 321 NLST participants who had a negative or positive baseline screening (T0) result and were diagnosed with a screen-detected, incidental lung cancer on follow-up screening intervals 12 (T1) or 24 (T2) months after T0. Positive screens were defined as abnormalities on baseline screens or at follow-up screens that were new, stable or evolved. Negative screens were defined as a computerized tomography (CT) scan with no abnormalities, minor abnormalities, or significant abnormalities not suspicious for lung cancer. Among the 321 lung cancer patients, 196 had a baseline positive screen that was not diagnosed as lung cancer but evolved and diagnosed as lung cancer at T1 or T2 screening intervals. The remaining 125 lung cancer patients had a baseline negative screen result but developed a nodule that was diagnosed as lung cancer at either T1 or T2. Lung cancer patients who had multiple nodules at time of their diagnosis were excluded (N = 58) since we are unable to verify which nodule(s) were cancer. Complications with segmentation, such as a nodule attached to lung wall, led to 29 patients being excluded. The final dataset of 234 screen-detected lung cancers were randomly split into a training cohort (N = 161) and a test cohort (N = 73).

Non-screen detected lung cancer validation dataset

The radiomics data were further validated for OS in a prior published dataset was comprised of 62 adenocarcinoma patients who underwent surgical resection as first course therapy at the Moffitt Cancer Center and had pre-surgery CTs within 2 months prior surgery⁴⁵.

Radiogenomics dataset

A previously described dataset⁴⁶ of surgically resected adenocarcinoma lung cancers (N = 103) who had pre-surgery CTs and gene expression data was used to identify potential biological underpinnings of the final two informative radiomic features (RMS and NGTDM Busyness). The gene expression data were IRON-normalized and batch-corrected for RNA quality Pathway and Gene Ontology Enrichment using Clarivate Analytics MetaCore⁴⁶.

Radiomics

Nodule identification and tumor segmentation is described elsewhere¹⁵. The tumor mask images (i.e., tumor delineations) were imported into in-house radiomic feature extraction toolboxes created in MATLAB® 2015b (The Mathworks Inc., Natick, Massachusetts) and C+ + (https://isocpp.org). Using cubic interpolation, the images were resampled to a single voxel spacing of 1 mm × 1 mm × 1 mm to standardize spacing across all images. Hounsfield units (HU) in all images were resampled into fixed bin sizes of 25 HUs discretized from –1,000 to 1,000 HU.

Using standardized radiomic algorithms from the Image Biomarker Standardization Initiative (IBSI) v5⁴⁷, a total of 264 radiomic features were extracted from semi-automatic segmented intratumoral region (n = 155) and from the peritumoral region (n = 109) 3 mm outside of tumor boundary. The peritumoral regions were generated as an extension of the tumor segmentations using morphological image processing operations as previously mentioned⁴⁸. Peritumoral regions were bounded by a lung parenchyma mask to exclude the region of interest (ROI) outside of the lung parenchyma. Shape and size based peritumoral features were excluded as they explicitly describe (i.e., correlate) the intratumoral ROI. Only the most stable and reproducible intratumoral and peritumoral radiomic features that were previously found on another study of our group ⁴⁸ were utilized for analyses to create repeatable models. Further details of the feature selection process are presented in the statistical analysis section below (Fig. 1A).

Statistical analysis

Statistical analyses were performed using Stata/MP 14.2 (StataCorp LP, College Station TX), R Project for Statistical Computing (version 3.5.2), and R Studio (version 1.1.463). Fisher’s exact test was used to test the difference between the training and test cohorts for categorical variables and the Student’s t-test was used to test the difference between the training and test cohorts for continuous variables.

Overall survival (OS) and progression-free survival (PFS) were the main end-points for the analysis and were assessed from date of lung cancer diagnosis to the date of an event or last follow up. For OS, an event was defined as death and for PFS an event was established as death or progression of cancer. All survival data were right censored at 5-years. To generate a parsimonious radiomics model, we first performed univariable analyses using Cox Proportional Hazard regression and retained features with a P value < 0.10. Among the remaining features after univariable analyses, we removed features that were correlated based on Pearson’s correlation coefficient > 0.80. If two or more features were correlated based on an absolute Pearson’s correlation coefficient > 0.80, the feature with the smaller p-value from the univariable analyses was retained. The remaining radiomic features were subjected to backward elimination approach using a pre-specified more stringent P value < 0.01 for inclusion. Among the remaining covariates, Classification And Regression Tree analysis (CART) was used to stratify patients into risk groups. CART is a nonparametric data-mining tool that can identify hierarchical interactions and segment covariates into novel and meaningful terminal subgroups (i.e., nodes). The hazard ratios from the risk groups were generated using Cox Proportional Hazard regression. The risk groups were also analyzed by Kaplan Meier curves and log-rank tests. The risk groups based on the most informative radiomic features in the training cohort were validated in the test cohort and further replicated in the adenocarcinoma cohort. The Harrell’s concordance index (C-index) was used to evaluate the multivariable model. Time-dependent area under the receding operating curve (AUROC) analyses was used to assess accuracy of the Cox regression models at different time-points using R packages survival⁴⁹, survminer⁵⁰, and survivalROC⁵¹.

Using the radiogenomics dataset, analyses were conducted to determine if the two final radiomic features (RMS and NGTDM Busyness) were associated with the gene probesets using two different approaches: correlation and two-group analysis. For the correlation analysis, gene probesets were filtered and determined as statistically significant using the following criteria: Pearson’s correlation with a threshold |R|> 0.4, an expression filter with max expression of gene > 5, and an inter-quartile filter (IQR > log2 (1.2 FC)). For the two-group analyses, gene probesets were filtered and determined as significant using the following criteria based on a Student’s t test p < 0.001 and mean log fold-change between high and low prognostic radiomic feature oflfc > log2 (1.4 FC). The significant probesets from the two-group analyses were intersected yielding a final list of probesets significantly associated with the most informative radiomic feature. ANOVA and Tukey pairwise mean comparison was performed to analyzed gene expression across the risk groups.

Supplementary information

Supplementary file1 (DOCX 129 kb)^{(129.5KB, docx)}

Acknowledgements

Funding support from U54-CA163068, P20 CA202920, U01-CA143062, and U01-CA200464. None of the authors are affiliated with the National Cancer Institute. The authors thank the NCI for access to NCI’s data collected by the National Lung Screening Trial. The statements contained herein are solely those of the authors and do not represent or imply concurrence or endorsement by the NCI.

Author contributions

All authors (J.P.M., I.T., O.S., S.E., Y.B., R.J.G., and M.B.S.) drafted the manuscript, discussed the results and contributed to the final manuscript. J.P.M., I.T., and M.B.S performed the statistical analysis and participated in its design. O.S. performed the radiological evaluation. S.E. performed the radiogenomics analysis. Y.B., R.J.G., and M.B.S. helped and verified the analytical methods.

Availability of data and materials

The datasets used and analyzed during the current study are available from the corresponding author on reasonable request.

Competing interests

R.J.G. Consultant in HealthMyne Imaging Systems. J.P.M., I.T., O.S., S.E., Y.B., and M.B.S. declare no competing interests.

Footnotes

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

is available for this paper at 10.1038/s41598-020-67378-8.

References

1.Aberle DR, et al. Reduced lung-cancer mortality with low-dose computed tomographic screening. New Engl. J. Med. 2011;365:395–409. doi: 10.1056/NEJMoa1102873. [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Clay R, et al. Computer Aided Nodule Analysis and Risk Yield (CANARY) characterization of adenocarcinoma: radiologic biopsy, risk stratification and future directions. Transl. Lung Cancer Res. 2018;7:313–326. doi: 10.21037/tlcr.2018.05.11. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Welch HG, Black WC. Overdiagnosis in cancer. J. Natl. Cancer Inst. 2010;102:605–613. doi: 10.1093/jnci/djq099. [DOI] [PubMed] [Google Scholar]
4.Patz EF, Jr, et al. Overdiagnosis in low-dose computed tomography screening for lung cancer. JAMA Intern. Med. 2014;174:269–274. doi: 10.1001/jamainternmed.2013.12738. [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Reich JM, Kim JS. Inequivalence of non-aggressiveness in clinically diagnosed lung cancers and overdiagnosis in lung cancer screening trials. J. Thorac. Dis. 2018;10:1230–1232. doi: 10.21037/jtd.2018.01.164. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Detterbeck FC, Gibson CJ. Turning gray: the natural history of lung cancer over time. J. Thorac. Oncol. 2008;3:781–792. doi: 10.1097/JTO.0b013e31817c9230. [DOI] [PubMed] [Google Scholar]
7.Wood, D. E. et al. Lung cancer screening, version 1.2015: featured updates to the NCCN guidelines. J. Natl. Comp. Cancer Netw. 13, 23–34; quiz 34 (2015). [DOI] [PubMed]
8.Li Q, et al. Comparison between radiological semantic features and lung-RADS in predicting malignancy of screen-detected lung nodules in the National Lung Screening Trial. Clin. Lung Cancer. 2018;19:148–156.e143. doi: 10.1016/j.cllc.2017.10.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Kazerooni EA, et al. ACR-STR practice parameter for the performance and reporting of lung cancer screening thoracic computed tomography (CT): 2014 (Resolution 4) J. Thorac. Imaging. 2014;29:310–316. doi: 10.1097/rti.0000000000000097. [DOI] [PubMed] [Google Scholar]
10.Patz EF, Jr, Pinsky P, Kramer BS. Estimating overdiagnosis in lung cancer screening—Reply letters. JAMA Intern. Med. 2014;174:1198–1199. doi: 10.1001/jamainternmed.2014.1525. [DOI] [PubMed] [Google Scholar]
11.Lindell RM, et al. Five-year lung cancer screening experience: CT appearance, growth rate, location, and histologic features of 61 lung cancers. Radiology. 2007;242:555–562. doi: 10.1148/radiol.2422052090. [DOI] [PubMed] [Google Scholar]
12.Sone S, et al. Long-term follow-up study of a population-based 1996–1998 mass screening programme for lung cancer using mobile low-dose spiral computed tomography. Lung Cancer (Amsterdam, Netherlands) 2007;58:329–341. doi: 10.1016/j.lungcan.2007.06.022. [DOI] [PubMed] [Google Scholar]
13.Ten Haaf K, de Koning HJ. Overdiagnosis in lung cancer screening: why modelling is essential. J. Epidemiol. Community Health. 2015;69:1035–1039. doi: 10.1136/jech-2014-204079. [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Lambin P, et al. Radiomics: extracting more information from medical images using advanced feature analysis. Eur. J. Cancer (Oxford, England: 1990) 2012;48:441–446. doi: 10.1016/j.ejca.2011.11.036. [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Hawkins S, et al. Predicting malignant nodules from screening CT scans. J. Thorac. Oncol. 2016;11:2120–2128. doi: 10.1016/j.jtho.2016.07.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Liu Y, et al. Radiologic features of small pulmonary nodules and lung cancer risk in the National Lung Screening Trial: a nested case-control study. Radiology. 2018;286:298–306. doi: 10.1148/radiol.2017161458. [DOI] [PMC free article] [PubMed] [Google Scholar]
17.Cherezov D, et al. Delta radiomic features improve prediction for lung cancer incidence: a nested case-control analysis of the National Lung Screening Trial. Cancer Med. 2018;7:6340–6356. doi: 10.1002/cam4.1852. [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Mao L, et al. Quantitative radiomic model for predicting malignancy of small solid pulmonary nodules detected by low-dose CT screening. Quant. Imaging Med. Surg. 2019;9:263–272. doi: 10.21037/qims.2019.02.02. [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Ten Haaf K, et al. Risk prediction models for selection of lung cancer screening candidates: a retrospective validation study. PLoS Med. 2017;14:e1002277. doi: 10.1371/journal.pmed.1002277. [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Fave X, et al. Delta-radiomics features for the prediction of patient outcomes in non-small cell lung cancer. Sci. Rep. 2017;7:588. doi: 10.1038/s41598-017-00665-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Kumar V, et al. Radiomics: the process and the challenges. Magn. Reson. Imaging. 2012;30:1234–1248. doi: 10.1016/j.mri.2012.06.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Parmar C, et al. Radiomic feature clusters and prognostic signatures specific for lung and head & neck cancer. Sci. Rep. 2015;5:11044. doi: 10.1038/srep11044. [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Wu W, et al. Exploratory study to identify radiomics classifiers for lung cancer histology. Front Oncol. 2016;6:71. doi: 10.3389/fonc.2016.00071. [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Tunali I, et al. Radial gradient and radial deviation radiomic features from pre-surgical CT scans are associated with survival among lung adenocarcinoma patients. Oncotarget. 2017;8:96013–96026. doi: 10.18632/oncotarget.21629. [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Sun R, et al. A radiomics approach to assess tumour-infiltrating CD8 cells and response to anti-PD-1 or anti-PD-L1 immunotherapy: an imaging biomarker, retrospective multicohort study. Lancet Oncol. 2018;19:1180–1191. doi: 10.1016/s1470-2045(18)30413-3. [DOI] [PubMed] [Google Scholar]
26.Braman NM, et al. Intratumoral and peritumoral radiomics for the pretreatment prediction of pathological complete response to neoadjuvant chemotherapy based on breast DCE-MRI. Breast Cancer Res. 2017;19:57. doi: 10.1186/s13058-017-0846-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Prasanna P, Patel J, Partovi S, Madabhushi A, Tiwari P. Radiomic features from the peritumoral brain parenchyma on treatment-naive multi-parametric MR imaging predict long versus short-term survival in glioblastoma multiforme: preliminary findings. Eur. Radiol. 2017;27:4188–4197. doi: 10.1007/s00330-016-4637-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Aberle DR, et al. The National Lung Screening Trial: overview and study design. Radiology. 2011;258:243–253. doi: 10.1148/radiol.10091808. [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Dong D, et al. Development and validation of an individualized nomogram to identify occult peritoneal metastasis in patients with advanced gastric cancer. Ann. Oncol. 2019;30:431–438. doi: 10.1093/annonc/mdz001. [DOI] [PMC free article] [PubMed] [Google Scholar]
30.Xu X, et al. Radiomic analysis of contrast-enhanced CT predicts microvascular invasion and outcome in hepatocellular carcinoma. J. Hepatol. 2019;70:1133–1144. doi: 10.1016/j.jhep.2019.02.023. [DOI] [PubMed] [Google Scholar]
31.Cook GJ, et al. Are pretreatment 18F-FDG PET tumor textural features in non-small cell lung cancer associated with response and survival after chemoradiotherapy? J. Nucl. Med. 2013;54:19–26. doi: 10.2967/jnumed.112.107375. [DOI] [PubMed] [Google Scholar]
32.Amadsun, M. K. R. Textural features corresponding to textural properties. IEEE J. Mag.https://ieeexplore.ieee.org/document/44046 (1989).
33.Xiao DD, Yan PF, Wang YX, Osman MS, Zhao HY. Glioblastoma and primary central nervous system lymphoma: preoperative differentiation by using MRI-based 3D texture analysis. Clin. Neurol. Neurosurg. 2018;173:84–90. doi: 10.1016/j.clineuro.2018.08.004. [DOI] [PubMed] [Google Scholar]
34.Chen S, et al. Diagnostic classification of solitary pulmonary nodules using dual time (18)F-FDG PET/CT image texture features in granuloma-endemic regions. Sci. Rep. 2017;7:9370. doi: 10.1038/s41598-017-08764-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
35.Coroller TP, et al. Radiomic phenotype features predict pathological response in non-small cell lung cancer. Radiotherapy Oncol. 2016;119:480–486. doi: 10.1016/j.radonc.2016.04.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
36.Hsu CY, et al. Radiomics features differentiate between normal and tumoral high-Fdg uptake. Sci. Rep. 2018;8:3913. doi: 10.1038/s41598-018-22319-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
37.Yu H, et al. Identification and validation of long noncoding RNA biomarkers in human non-small-cell lung carcinomas. J. Thorac. Oncol. 2015;10:645–654. doi: 10.1097/JTO.0000000000000470. [DOI] [PubMed] [Google Scholar]
38.Katoh M, Katoh M. Human FOX gene family (review) Int. J. Oncol. 2004;25:1495–1500. [PubMed] [Google Scholar]
39.Meyer-Schaller N, Heck C, Tiede S, Yilmaz M, Christofori G. Foxf2 plays a dual role during transforming growth factor beta-induced epithelial to mesenchymal transition by promoting apoptosis yet enabling cell junction dissolution and migration. Breast Cancer Res. 2018;20:118–118. doi: 10.1186/s13058-018-1043-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
40.Kong P-Z, Li G-M, Tian Y, Song B, Shi R. Decreased expression of FOXF2 as new predictor of poor prognosis in stage I non-small cell lung cancer. Oncotarget. 2016;7:55601–55610. doi: 10.18632/oncotarget.10876. [DOI] [PMC free article] [PubMed] [Google Scholar]
41.Roberti MC, et al. RABGAP1L gene rearrangement resulting from a der(Y)t(Y;1)(q12;q25) in acute myeloid leukemia arising in a child with Klinefelter syndrome. Virchows Arch. 2009;454:311–316. doi: 10.1007/s00428-009-0732-z. [DOI] [PubMed] [Google Scholar]
42.Qu F, et al. Ankyrin-B is a PI3P effector that promotes polarized α5β1-integrin recycling via recruiting RabGAP1L to early endosomes. Elife. 2016;5:e20417. doi: 10.7554/eLife.20417. [DOI] [PMC free article] [PubMed] [Google Scholar]
43.Gierada DS, et al. Projected outcomes using different nodule sizes to define a positive CT lung cancer screening examination. J. Natl. Cancer Inst. 2014 doi: 10.1093/jnci/dju284. [DOI] [PMC free article] [PubMed] [Google Scholar]
44.Schabath MB, et al. Differences in patient outcomes of prevalence, interval, and screen-detected lung cancers in the CT arm of the National Lung Screening Trial. PLoS ONE. 2016;11:e0159880. doi: 10.1371/journal.pone.0159880. [DOI] [PMC free article] [PubMed] [Google Scholar]
45.Grove O, et al. Quantitative computed tomographic descriptors associate tumor shape complexity and intratumor heterogeneity with prognosis in lung adenocarcinoma. PLoS ONE. 2015;10:e0118261. doi: 10.1371/journal.pone.0118261. [DOI] [PMC free article] [PubMed] [Google Scholar]
46.Schabath MB, et al. Differential association of STK11 and TP53 with KRAS mutation-associated gene expression, proliferation and immune surveillance in lung adenocarcinoma. Oncogene. 2016;35:3209–3216. doi: 10.1038/onc.2015.375. [DOI] [PMC free article] [PubMed] [Google Scholar]
47.Zwanenburg, A., Leger, S., Vallières, M. & Löck, S. Image biomarker standardisation initiative. https://arxiv.org/abs/1612.07003 (2018).
48.Tunali I, et al. Stability and reproducibility of computed tomography radiomic features extracted from peritumoral regions of lung cancer lesions. Med. Phys. 2019;46:5075–5085. doi: 10.1002/mp.13808. [DOI] [PMC free article] [PubMed] [Google Scholar]
49.Therneau, T. Survival analysis. CRAN2.44–1.1 (2019).
50.Kassambara, A., Kosinski, M., Biecek, P. & Fabian, S. survminer. CRAN0.4.3 (2018).
51.Heagerty, P. & Saha-Chaudhuri, P. survivalROC. CRAN1.0.3 (2013).

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary file1 (DOCX 129 kb)^{(129.5KB, docx)}

Data Availability Statement

The datasets used and analyzed during the current study are available from the corresponding author on reasonable request.

[CR1] 1.Aberle DR, et al. Reduced lung-cancer mortality with low-dose computed tomographic screening. New Engl. J. Med. 2011;365:395–409. doi: 10.1056/NEJMoa1102873. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR2] 2.Clay R, et al. Computer Aided Nodule Analysis and Risk Yield (CANARY) characterization of adenocarcinoma: radiologic biopsy, risk stratification and future directions. Transl. Lung Cancer Res. 2018;7:313–326. doi: 10.21037/tlcr.2018.05.11. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR3] 3.Welch HG, Black WC. Overdiagnosis in cancer. J. Natl. Cancer Inst. 2010;102:605–613. doi: 10.1093/jnci/djq099. [DOI] [PubMed] [Google Scholar]

[CR4] 4.Patz EF, Jr, et al. Overdiagnosis in low-dose computed tomography screening for lung cancer. JAMA Intern. Med. 2014;174:269–274. doi: 10.1001/jamainternmed.2013.12738. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR5] 5.Reich JM, Kim JS. Inequivalence of non-aggressiveness in clinically diagnosed lung cancers and overdiagnosis in lung cancer screening trials. J. Thorac. Dis. 2018;10:1230–1232. doi: 10.21037/jtd.2018.01.164. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR6] 6.Detterbeck FC, Gibson CJ. Turning gray: the natural history of lung cancer over time. J. Thorac. Oncol. 2008;3:781–792. doi: 10.1097/JTO.0b013e31817c9230. [DOI] [PubMed] [Google Scholar]

[CR7] 7.Wood, D. E. et al. Lung cancer screening, version 1.2015: featured updates to the NCCN guidelines. J. Natl. Comp. Cancer Netw. 13, 23–34; quiz 34 (2015). [DOI] [PubMed]

[CR8] 8.Li Q, et al. Comparison between radiological semantic features and lung-RADS in predicting malignancy of screen-detected lung nodules in the National Lung Screening Trial. Clin. Lung Cancer. 2018;19:148–156.e143. doi: 10.1016/j.cllc.2017.10.002. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR9] 9.Kazerooni EA, et al. ACR-STR practice parameter for the performance and reporting of lung cancer screening thoracic computed tomography (CT): 2014 (Resolution 4) J. Thorac. Imaging. 2014;29:310–316. doi: 10.1097/rti.0000000000000097. [DOI] [PubMed] [Google Scholar]

[CR10] 10.Patz EF, Jr, Pinsky P, Kramer BS. Estimating overdiagnosis in lung cancer screening—Reply letters. JAMA Intern. Med. 2014;174:1198–1199. doi: 10.1001/jamainternmed.2014.1525. [DOI] [PubMed] [Google Scholar]

[CR11] 11.Lindell RM, et al. Five-year lung cancer screening experience: CT appearance, growth rate, location, and histologic features of 61 lung cancers. Radiology. 2007;242:555–562. doi: 10.1148/radiol.2422052090. [DOI] [PubMed] [Google Scholar]

[CR12] 12.Sone S, et al. Long-term follow-up study of a population-based 1996–1998 mass screening programme for lung cancer using mobile low-dose spiral computed tomography. Lung Cancer (Amsterdam, Netherlands) 2007;58:329–341. doi: 10.1016/j.lungcan.2007.06.022. [DOI] [PubMed] [Google Scholar]

[CR13] 13.Ten Haaf K, de Koning HJ. Overdiagnosis in lung cancer screening: why modelling is essential. J. Epidemiol. Community Health. 2015;69:1035–1039. doi: 10.1136/jech-2014-204079. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR14] 14.Lambin P, et al. Radiomics: extracting more information from medical images using advanced feature analysis. Eur. J. Cancer (Oxford, England: 1990) 2012;48:441–446. doi: 10.1016/j.ejca.2011.11.036. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR15] 15.Hawkins S, et al. Predicting malignant nodules from screening CT scans. J. Thorac. Oncol. 2016;11:2120–2128. doi: 10.1016/j.jtho.2016.07.002. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR16] 16.Liu Y, et al. Radiologic features of small pulmonary nodules and lung cancer risk in the National Lung Screening Trial: a nested case-control study. Radiology. 2018;286:298–306. doi: 10.1148/radiol.2017161458. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR17] 17.Cherezov D, et al. Delta radiomic features improve prediction for lung cancer incidence: a nested case-control analysis of the National Lung Screening Trial. Cancer Med. 2018;7:6340–6356. doi: 10.1002/cam4.1852. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR18] 18.Mao L, et al. Quantitative radiomic model for predicting malignancy of small solid pulmonary nodules detected by low-dose CT screening. Quant. Imaging Med. Surg. 2019;9:263–272. doi: 10.21037/qims.2019.02.02. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR19] 19.Ten Haaf K, et al. Risk prediction models for selection of lung cancer screening candidates: a retrospective validation study. PLoS Med. 2017;14:e1002277. doi: 10.1371/journal.pmed.1002277. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR20] 20.Fave X, et al. Delta-radiomics features for the prediction of patient outcomes in non-small cell lung cancer. Sci. Rep. 2017;7:588. doi: 10.1038/s41598-017-00665-z. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR21] 21.Kumar V, et al. Radiomics: the process and the challenges. Magn. Reson. Imaging. 2012;30:1234–1248. doi: 10.1016/j.mri.2012.06.010. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR22] 22.Parmar C, et al. Radiomic feature clusters and prognostic signatures specific for lung and head & neck cancer. Sci. Rep. 2015;5:11044. doi: 10.1038/srep11044. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR23] 23.Wu W, et al. Exploratory study to identify radiomics classifiers for lung cancer histology. Front Oncol. 2016;6:71. doi: 10.3389/fonc.2016.00071. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR24] 24.Tunali I, et al. Radial gradient and radial deviation radiomic features from pre-surgical CT scans are associated with survival among lung adenocarcinoma patients. Oncotarget. 2017;8:96013–96026. doi: 10.18632/oncotarget.21629. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR25] 25.Sun R, et al. A radiomics approach to assess tumour-infiltrating CD8 cells and response to anti-PD-1 or anti-PD-L1 immunotherapy: an imaging biomarker, retrospective multicohort study. Lancet Oncol. 2018;19:1180–1191. doi: 10.1016/s1470-2045(18)30413-3. [DOI] [PubMed] [Google Scholar]

[CR26] 26.Braman NM, et al. Intratumoral and peritumoral radiomics for the pretreatment prediction of pathological complete response to neoadjuvant chemotherapy based on breast DCE-MRI. Breast Cancer Res. 2017;19:57. doi: 10.1186/s13058-017-0846-1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR27] 27.Prasanna P, Patel J, Partovi S, Madabhushi A, Tiwari P. Radiomic features from the peritumoral brain parenchyma on treatment-naive multi-parametric MR imaging predict long versus short-term survival in glioblastoma multiforme: preliminary findings. Eur. Radiol. 2017;27:4188–4197. doi: 10.1007/s00330-016-4637-3. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR28] 28.Aberle DR, et al. The National Lung Screening Trial: overview and study design. Radiology. 2011;258:243–253. doi: 10.1148/radiol.10091808. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR29] 29.Dong D, et al. Development and validation of an individualized nomogram to identify occult peritoneal metastasis in patients with advanced gastric cancer. Ann. Oncol. 2019;30:431–438. doi: 10.1093/annonc/mdz001. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR30] 30.Xu X, et al. Radiomic analysis of contrast-enhanced CT predicts microvascular invasion and outcome in hepatocellular carcinoma. J. Hepatol. 2019;70:1133–1144. doi: 10.1016/j.jhep.2019.02.023. [DOI] [PubMed] [Google Scholar]

[CR31] 31.Cook GJ, et al. Are pretreatment 18F-FDG PET tumor textural features in non-small cell lung cancer associated with response and survival after chemoradiotherapy? J. Nucl. Med. 2013;54:19–26. doi: 10.2967/jnumed.112.107375. [DOI] [PubMed] [Google Scholar]

[CR32] 32.Amadsun, M. K. R. Textural features corresponding to textural properties. IEEE J. Mag.https://ieeexplore.ieee.org/document/44046 (1989).

[CR33] 33.Xiao DD, Yan PF, Wang YX, Osman MS, Zhao HY. Glioblastoma and primary central nervous system lymphoma: preoperative differentiation by using MRI-based 3D texture analysis. Clin. Neurol. Neurosurg. 2018;173:84–90. doi: 10.1016/j.clineuro.2018.08.004. [DOI] [PubMed] [Google Scholar]

[CR34] 34.Chen S, et al. Diagnostic classification of solitary pulmonary nodules using dual time (18)F-FDG PET/CT image texture features in granuloma-endemic regions. Sci. Rep. 2017;7:9370. doi: 10.1038/s41598-017-08764-7. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR35] 35.Coroller TP, et al. Radiomic phenotype features predict pathological response in non-small cell lung cancer. Radiotherapy Oncol. 2016;119:480–486. doi: 10.1016/j.radonc.2016.04.004. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR36] 36.Hsu CY, et al. Radiomics features differentiate between normal and tumoral high-Fdg uptake. Sci. Rep. 2018;8:3913. doi: 10.1038/s41598-018-22319-4. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR37] 37.Yu H, et al. Identification and validation of long noncoding RNA biomarkers in human non-small-cell lung carcinomas. J. Thorac. Oncol. 2015;10:645–654. doi: 10.1097/JTO.0000000000000470. [DOI] [PubMed] [Google Scholar]

[CR38] 38.Katoh M, Katoh M. Human FOX gene family (review) Int. J. Oncol. 2004;25:1495–1500. [PubMed] [Google Scholar]

[CR39] 39.Meyer-Schaller N, Heck C, Tiede S, Yilmaz M, Christofori G. Foxf2 plays a dual role during transforming growth factor beta-induced epithelial to mesenchymal transition by promoting apoptosis yet enabling cell junction dissolution and migration. Breast Cancer Res. 2018;20:118–118. doi: 10.1186/s13058-018-1043-6. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR40] 40.Kong P-Z, Li G-M, Tian Y, Song B, Shi R. Decreased expression of FOXF2 as new predictor of poor prognosis in stage I non-small cell lung cancer. Oncotarget. 2016;7:55601–55610. doi: 10.18632/oncotarget.10876. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR41] 41.Roberti MC, et al. RABGAP1L gene rearrangement resulting from a der(Y)t(Y;1)(q12;q25) in acute myeloid leukemia arising in a child with Klinefelter syndrome. Virchows Arch. 2009;454:311–316. doi: 10.1007/s00428-009-0732-z. [DOI] [PubMed] [Google Scholar]

[CR42] 42.Qu F, et al. Ankyrin-B is a PI3P effector that promotes polarized α5β1-integrin recycling via recruiting RabGAP1L to early endosomes. Elife. 2016;5:e20417. doi: 10.7554/eLife.20417. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR43] 43.Gierada DS, et al. Projected outcomes using different nodule sizes to define a positive CT lung cancer screening examination. J. Natl. Cancer Inst. 2014 doi: 10.1093/jnci/dju284. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR44] 44.Schabath MB, et al. Differences in patient outcomes of prevalence, interval, and screen-detected lung cancers in the CT arm of the National Lung Screening Trial. PLoS ONE. 2016;11:e0159880. doi: 10.1371/journal.pone.0159880. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR45] 45.Grove O, et al. Quantitative computed tomographic descriptors associate tumor shape complexity and intratumor heterogeneity with prognosis in lung adenocarcinoma. PLoS ONE. 2015;10:e0118261. doi: 10.1371/journal.pone.0118261. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR46] 46.Schabath MB, et al. Differential association of STK11 and TP53 with KRAS mutation-associated gene expression, proliferation and immune surveillance in lung adenocarcinoma. Oncogene. 2016;35:3209–3216. doi: 10.1038/onc.2015.375. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR47] 47.Zwanenburg, A., Leger, S., Vallières, M. & Löck, S. Image biomarker standardisation initiative. https://arxiv.org/abs/1612.07003 (2018).

[CR48] 48.Tunali I, et al. Stability and reproducibility of computed tomography radiomic features extracted from peritumoral regions of lung cancer lesions. Med. Phys. 2019;46:5075–5085. doi: 10.1002/mp.13808. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR49] 49.Therneau, T. Survival analysis. CRAN2.44–1.1 (2019).

[CR50] 50.Kassambara, A., Kosinski, M., Biecek, P. & Fabian, S. survminer. CRAN0.4.3 (2018).

[CR51] 51.Heagerty, P. & Saha-Chaudhuri, P. survivalROC. CRAN1.0.3 (2013).

PERMALINK

Peritumoral and intratumoral radiomic features predict survival outcomes among patients diagnosed in lung cancer screening

Jaileene Pérez-Morales

Ilke Tunali

Olya Stringfield

Steven A Eschrich

Yoganand Balagurunathan

Robert J Gillies

Matthew B Schabath

Abstract

Introduction

Results

Patient characteristics

Table 1.

Radiomic analyses

Classification and regression tree (CART) analysis

Figure 1.

Figure 2.

Patient characteristics of the three risk groups in the total data set (N = 234)

Table 2.

Survival analyses among early stage patients

Validation dataset

Figure 3.

Multivariable analyses

Table 3.

Table 4.

Performance metrics: Harrell’s c index and areas under the curve (AUROC)

Figure 4.

Radiogenomics analysis

Figure 5.

Discussion

Methods

NLST study population

Non-screen detected lung cancer validation dataset

Radiogenomics dataset

Radiomics

Statistical analysis

Supplementary information

Acknowledgements

Author contributions

Availability of data and materials

Competing interests

Footnotes

Supplementary information

References

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases