Abstract
Simple Summary
In this study, we strive to identify clinically relevant image feature (IF) changes during chemoradiation in patients with non-small-cell lung cancer (NSCLC) to be able to predict tumor responses in an early stage of treatment. All patients underwent static (3D) and respiratory-gated 4D PET/CT scans before treatment and a 3D scan during or after treatment. Our proposed method rejects IF changes due to intrinsic variability such as noise, resolution and movement through breathing. The IF variability observed across 4D PET is employed as a patient individualized normalization factor to emphasize statistically relevant IF changes during treatment.
Abstract
The aim of this study is to identify clinically relevant image feature (IF) changes during chemoradiation and evaluate their efficacy in predicting treatment response. Patients with non-small-cell lung cancer (NSCLC) were enrolled in two prospective trials (STRIPE, PET-Plan). We evaluated 48 patients who underwent static (3D) and retrospectively-respiratory-gated 4D PET/CT scans before treatment and a 3D scan during or after treatment. Our proposed method rejects IF changes due to intrinsic variability. The IF variability observed across 4D PET is employed as a patient individualized normalization factor to emphasize statistically relevant IF changes during treatment. Predictions of overall survival (OS), local recurrence (LR) and distant metastasis (DM) were evaluated. From 135 IFs, only 17 satisfied the required criteria of being normally distributed across 4D PET and robust between 3D and 4D images. Changes during treatment in the area-under-the-curve of the cumulative standard-uptake-value histogram (δAUCCSH) within primary tumor discriminated (AUC = 0.87, Specificity = 0.78) patients with and without LR. The resulted prognostic model was validated with a different segmentation method (AUC = 0.83) and in a different patient cohort (AUC = 0.63). The quantification of tumor FDG heterogeneity by δAUCCSH during chemoradiation correlated with the incidence of local recurrence and might be recommended for monitoring treatment response in patients with NSCLC.
Keywords: lung cancer, PET radiomics, FDG monitoring and retrospectively gated 4D PET/CT
1. Introduction
In non-small cell lung cancer, [18F]fluoro-2-deoxy-D-glucose (FDG) PET is a valuable tool for tumor detection and staging [1,2,3,4]. Furthermore, the use of FDG-PET/CT is the standard of care for the definition of the target volume in radiotherapy treatment planning as well as for treatment monitoring [5,6,7,8]. In some studies, relative changes of standardized uptake value (SUV) in lung cancer patients correlated with the treatment outcome [9,10]. Additionally, the feasibility of response assessment during the first weeks of radiotherapy treatment would allow for an adaptation of the treatment strategy, which might lead to better local control rates [11].
Radiomics is the extraction and analysis of a large number of quantitative image features (IF) including first-order (histogram and shape parameters) and second- or higher-order statistics (texture features), which provide spatial and voxel intensity information. Radiomics might help to develop descriptive or predictive models for improving diagnosis or treatment selection [12]. In order to maximize the generalization ability of the resulted model, radiomics requires a high level of IF robustness [13,14,15]. The low spatial resolution and poor statistics of positron emission tomography (PET) images have a negative impact on the IF variability compared to other imaging modalities such as computed tomography (CT). In addition, PET IF variability has been proved to be very sensitive to image reconstruction settings, tumor segmentation methods, SUV resampling methods and texture feature matrix definitions [16,17]. Furthermore, the evaluation of lung cancer lesions using PET/CT imaging presents additional challenges due to respiratory movement. Tumor motion due to breathing cycles during PET scan acquisition results in inaccurate quantification of radioactivity concentration and erroneous estimation of the shape and volume of the lesion. All these degrading factors have an impact on the PET IF variability, from now on referred to as “intrinsic variability”. The effect of each of these degrading factors on IF quality depends on the considered IF. The intrinsic variability does not correlate with clinical changes, that is, physiological processes and if not taken into account leads to a misinterpretation of the results while analyzing clinical data. Thus, there is a need to identify IF variations large enough (clinical) in the quantification of lung lesions changes based on FDG-PET radiomics at different treatment time points to be considered relevant for treatment monitoring [18].
Retrospectively respiratory gated 4D PET/CT has been proposed to minimize respiratory motion degradation in PET/CT systems [19]. Our group previously investigated the IF variability for 3D and 4D PET imaging protocols with experimental heterogeneous phantoms [20] and lung cancer patients [21]. Results showed that although IF variability across the breathing phases depends on the considered IF, all IF followed a normal distribution for most lesions. In addition, and in concordance to previous publications [22], a large number of IF was robust to the motion compensation implied in 4D PET/CT images. The main purpose of this work was to identify relevant changes during chemoradiation of NSCLC lesions based on FDG-PET radiomics and to evaluate the efficacy of radiomics in predicting treatment outcome. Additionally, we aimed to evaluate a novel method, which rejects IF variations originated by the noise, resolution and effects of breathing motion inherent in PET images. The IF variability (standard deviation: σIF) observed across the pre-treatment 4D PET breathing phases was employed as an individualized patient normalization factor to emphasize statistically relevant IF changes during treatment. The prediction of the response was used as a clinical outcome to assess the feasibility of the method. The resultant prognostic model was additionally validated with a different segmentation method and an additional patient cohort.
2. Materials and Methods
2.1. Patient Cohorts
The entire study cohort consisted of patients with pulmonary lesions derived from two prospective trials (PET Plan and STRIPE). All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional or national research committee and with the 1964 Helsinki Declaration and its later amendments or comparable ethical standards. The randomized controlled PET-Plan trial (ARO 2009-09, ClinicalTrials.gov Identifier: NCT00697333, Deutsche Krebshilfe, German Cancer Aid Organisation, Nr 108237) was conducted in 24 centers in Germany, Austria and Switzerland in accordance with the Helsinki Declaration. Study design, procedures and main outcome results have been published elsewhere [23]. Briefly, patients with histologically proven inoperable locally advanced NSCLC suitable for chemoradiotherapy were randomly assigned (1:1) to target volume delineation performed on 18F-FDG-PET and CT (including CT-positive but 18F-FDG-negative nodes) plus elective nodal irradiation and tumor-associated atelectasis, if applicable (conventional target group), or to target volumes defined by PET alone (18F-FDG PET-based target group). Patients in the phase II STRIPE trial (Deutsche Krebshilfe, German Cancer Aid Organisation, Nr 108472) with small primary or metastatic lung tumors who refused surgery or whose tumors were inoperable due to comorbidities were treated with SBRT. Study design, procedures and main outcome results have been published elsewhere [24]. The 48 patients in our analyses (37 from PET Plan and 11 from STRIPE) were prospectively recruited in our department.
The main selection criterion for these patients was to have both static 3D PET/CT and retrospectively respiratory-gated 4D PET/CT scans performed before the start of the curatively intended chemoradiotherapy. The study population was divided into three different datasets (Cohorts 1 to 3) based on the number and characteristics of the PET/CT acquisitions. Clinical characteristics of the cohorts used for the development of the prognostic model are summarized in Table 1. Twenty-eight patients (11 from Cohort 1 and 17 from Cohort 2) underwent PET/CT acquisitions during treatment and 20 patients (4 from Cohort 1, 5 from Cohort 2 and 11 from Cohort 3) after treatment (Figure 1). Treatment response was evaluated in terms of overall survival (OS), local recurrence (LR) and distant metastasis (DM) for cohorts 1 and 2.
Table 1.
Clinical Characteristics | Cohort 1 (n = 15) |
Cohort 2 (n = 22) |
Cohort 3 (n = 11) |
---|---|---|---|
Clinical Trial Register | NCT00697333 | NCT00697333 | DRKS00003658 |
PET-Plan | PET-Plan | STRIPE | |
Age (years, mean ± SD, range) | (67 ± 9, 51–78) | (65 ± 10, 47–83) | (74 ± 7, 60–83) |
Sex | - | - | - |
Female | 9(60%) | 6(27%) | 3(27%) |
Male | 6(40%) | 16(73%) | 8(73%) |
Tumor Localization | - | - | - |
Peripheral | 3(20%) | 7(32%) | 7(73%) |
Central | 12(80%) | 15(68%) | 3(27%) |
Stage | - | - | - |
Ib | (0%) | (0%) | 6(55%) |
IIb | (0%) | 2(9%) | 1(9%) |
IIIa | 8(53%) | 5(23%) | (0%) |
IIIb | 5(33%) | 9(41%) | (0%) |
IIIc | 1(7%) | 5(23%) | (0%) |
IV | (0%) | 1(5%) | 2(18%) |
Metastases | (0%) | (0%) | 2(18%) |
Chemotherapy | - | - | - |
Concurrent | 15(100%) | 22(100%) | (0%) |
No | (0%) | (0%) | 11(100%) |
Radiotherapy | (65 ± 11, 30–74) | (66 ± 5, 60–74) | (36 ± 1, 35–38) * |
(Gy, mean ± SD, range) | - | - | - |
Overall Survival | (47 ± 36, 1–105) | (47 ± 21,4–77) | (60 ± 31, 11–105) |
(months, mean ± SD, range) | - | - | - |
Local Recurrence (yes) | 6(40%) | 9(41%) | 2(18%) |
Distant Metastasis (yes) | 6(40%) | 15(68%) | 6(55%) |
SD: standard deviation. * Prescribed dose in 60% Isodose.
2.2. PET/CT Acquisition
Scans were performed on two different PET/CT systems from Philips (Eindhoven, The Netherlands): GEMINI TF TOF 64 (TF64) and GEMINI TF 16 Big Bore (BB). The scanners fulfilled the requirements indicated in the European Association of Nuclear Medicine (EANM) imaging guidelines (http://earl.eanm.org/ (accessed on 5 February 2021)) and obtained EANM Research Ltd. (EARL) accreditation. The transverse spatial resolution at 1 cm from the central axis of the scanner is 4.8 mm for both scanners. PET data were corrected for randomness, scattering and attenuation based on the corresponding CT dataset. The reconstruction method was a LOR-based ordered-subset iterative time-of-flight algorithm using spherical coordinates (BLOB-OS-TF) with three iterations, 33 subsets and a relaxation parameter of 0.35 for smoothing. Images were normalized to decay-corrected injected activity per kg body weight standardized-uptake-value (SUV) [g/mL]. A 3D PET scan was planned 60 min after the injection of 350 MBq [18F]FDG. A 4D PET was acquired after a 3D PET approximately 90 min post injection. The scanning parameters involved in each cohort are summarized in Table 2.
Table 2.
Scannig Parameters | Cohort 1 (n = 15) |
Cohort 2 (n = 22) |
Cohort 3 (n = 11) |
---|---|---|---|
4D PET/CT before RT | yes | yes | yes |
PET/CT System | - | - | - |
TF-64 | 15 | 4 | 11 |
BB | 0 | 18 | 0 |
Voxel Dimension (mm3) | 4 × 4 × 4 | 4 × 4 × 4 | 4 × 4 × 4 |
3DPET/CT before RT | yes | yes | yes |
PET/CT System | - | - | - |
TF-64 | 15 | 4 | 11 |
BB | 0 | 18 | 0 |
Voxel Dimension (mm3) | 4 × 4 × 4 | 2 × 2 × 2 | 4 × 4 × 4 |
3DPET/CT during RT | yes | yes | yes |
PET/CT System | - | - | - |
TF-64 | 11 | 4 | 11 |
BB | 0 | 13 | 0 |
Voxel Dimension (mm3) | 4 × 4 × 4 | 2 × 2 × 2 | 4 × 4 × 4 |
3DPET/CT after RT | yes | yes | yes |
PET/CT System | - | - | - |
TF-64 | 4 | 0 | 11 |
BB | 0 | 5 | 0 |
Voxel Dimension (mm3) | 4 × 4 × 4 | 4 × 4 × 4 | 4 × 4 × 4 |
Time interval between 3D scans (days, mean ± SD, range) |
(68 ± 92, 14–343) | (196 ± 340, 13–1123) | (140 ± 67, 42–271) |
2.3. Tumor Segmentation
Two different methods were applied to delineate the primary tumor lesion: (i) a manual contour by consensus of two radiation oncologists and (ii) the semi-automatic segmentation method Contrast-oriented-algorithm (COA) approved by a radiation oncologist. COA was previously validated with heterogeneous experimental phantoms and lung cancer patients [20].
2.4. Image Features Extraction
A total of 135 IF were computed with an open-source code [25] based on MATLAB® (The MathWorks Inc., Natick, MA, USA) for all PET images and all segmentations presented in Figure 1. The radiomics IF used in this study are listed in the Supplementary Materials Table S1. As recommended by previous investigations [21,26], SUV values of the voxels within the contour were discretized with a fixed bin width (W = 0.01) for texture feature computation. Texture features were derived from five matrices: the 3D version of the gray-level co-occurrence matrix (GLCM), the gray-level run length matrix (GLRLM), the gray-level size zone matrix (GLSZM) and the neighborhood gray tone difference matrix (NGTDM). In addition, on the voxel intensities within the contour we applied: (i) a Wavelet band-pass filtering (WF) with a weight ratio of 1:2 between band-pass sub-bands and other sub-bands and (ii) an equal-probability quantization algorithm (Q) by using the function histeq of MATLAB®.
2.5. Statistical Analysis
Statistical analysis was performed using in-house software based on Wolfram Mathematica v 11.2. Normality was evaluated by the Shapiro–Wilk test. The Wilcoxon signed rank (WSR) test was used when comparing two data samples and positive findings were confirmed by the 95% confidence interval (CI) of the Bland–Altman percentage plot analysis [27]. For the analysis of overall survival, Kaplan–Meier curves were estimated and the comparison between groups was evaluated with the log-rank test. Multivariate Cox regression was used for estimation of hazard ratios (HR) with 95% CI. In the analysis of the binary outputs (distant metastasis and local recurrence), the Mann–Whitney U test was used for non-pairwise comparison between groups. For positive findings, an open-source multivariate binary logistic regression analysis [25] was additionally performed that involved imbalance-adjusted bootstrap resampling in prediction performance estimation and the computation of model coefficients. To correct for multiple test comparisons, the p values were adjusted for multiple testing by controlling the false discovery rate using Benjamin and Hochberg’s method [28]. A p < 0.05 was considered to be statistically significant.
2.6. Proposed Method
Our aim was to identify statistically relevant IF variations during chemoradiotherapy under the assumption that they would have a higher probability of being clinically relevant when ruling out those emerging from the intrinsic variability of the PET/CT images. The proposed method is based on two main findings from previous investigations: (i) our group observed that most IF were normally distributed across the breathing phases for 28 heterogeneous phantoms following 16 respiratory patters and for 31 lung cancer patients; (ii) there were IF robust to the motion compensation implied by 4D PET/CT.
The method consists of three main steps:
Identification of the IF following a normal distribution across the pre-treatment 4D breathing phases. The objective was to ensure that the 4D protocol and the robustness of the IF were good enough to reproduce FDG-distribution quantization across the respiratory phases. For each patient, a primary lesion was segmented on each breathing phase and values for the 135 IF were computed, Figure 2. We considered that IF satisfied the selection criteria when their values across the 4D frames followed a normal distribution (Shapiro–Wilk test) in more than 70% of the patients.
Identification of IF robustness throughout 3D and 4D PET images. The objective was to ensure that the IFs were robust enough to be reproducible with and without a motion-compensation reconstruction protocol.
Quantification of relative IF variations during treatment ∆rIF weighted according to variability across 4D frames:
(1) |
where this normalization factor permitted the emphasis of statistically relevant IF changes during treatment (δIF).
Consequently, the normalized relative deviation (δ) of the IF, which satisfied both selection criteria, was evaluated for the predictive accuracy of the treatment response.
3. Results
3.1. Method Development
For the proposed method, the IF that simultaneously satisfied both preconditions—to be normal-distributed along 4D breathing phases and to be robust throughout 3D and 4D PET images—were identified. The results using the Shapiro–Wilk test in the study of normality and the Wilcoxon–Rank test in the study of comparability are summarized in Table 3. The names of the IF are listed in the Supplementary Material in Table S2. Overall, 17 IF were simultaneously normal-distributed along 4D breathing phases and robust throughout 3D and 4D PET images (definitions in Table S1): AUCCSH, VarianceCM, WF-VarianceCM, WF-SRE,WF-LRE, WF-RLN, WF-RP, Q-SZE, Q-LZE, Q-ZSN, Q-ZP, Q-SRE, Q-LRE, Q-RLN, Q-RP, Q-ContrastNG, Q-Busyness.
Table 3.
Segmentation | Cohort 1 Manual |
Cohort 1 COA |
Cohort 3 Manual |
All |
---|---|---|---|---|
Normal Distributed across 4D | 65 | 61 | 50 | 31 |
Comparable (4D vs. 3D) | 83 | 69 | 131 | 62 |
3.2. Prognostic Model
Cohort 1 and cohort 2 were used for the development of the prognostic model. Both prospective cohorts presented similar average values for overall survival (OS): 47 ± 36 months for cohort 1 and 47 ± 21 months for cohort 2. Local recurrence (LR) was observed in 6 out of 15 patients (40%) from cohort 1 and in 9 out of 22 (41%) in cohort 2. Distant metastases (DM) were observed in 6 out of 15 patients (40%) in cohort 1 and in 15 out of 22 (68%) in cohort 2.
We evaluated the prediction of response for the normalized relative deviation of the 17 IF derived from the evaluation in the above-mentioned method. The analysis was initially carried out for cohort 1 using a manual segmentation and sequentially validated in cohort 1 with COA segmentation and in cohort 2 using manual segmentation. Statistically significant correlations between δIF and treatment outcome were considered false positives, that is, casually linked with the outcome if they were not significant for all (training and validation) cohorts. The only IF that showed statistically significant correlation with the treatment outcome for all cohorts was the area-under-the curve of the cumulative histogram (AUCCSH). The normalized relative deviation of AUCCSH (δAUCCSH) could differentiate patients with LR from patients without LR in patients with locally advanced NSCLC treated with chemoradiation.
In Figure 3a results for the manual segmentation was applied for the 11 patients of cohort 1 with 3D PET during radiotherapy (RT) are shown. The time interval between the 3D PET images (before and during RT) was 19 ± 10 days. The resulted prognostic model had an AUC of 0.87 and a specificity of 0.78. δAUCCSH resulted in a median = 5, mean ± standard deviation = 7.2 ± 4.5 and range of (1.3, 12.3) for patients with LR, and in median = 0.3, mean ± standard deviation = 0.8 ± 5.9 and range of (−16, 2.6) for patients without LR. Therefore, patients with increasing homogeneity in the primary tumor during chemoradiation had a higher probability of local recurrence. Results were confirmed when all 15 patients of cohort 1 were considered, including those with FDG-PET after the end of RT. For these 15 patients the average time interval between treatment and response FDG PET was 68 ± 92 days. The accuracy in RL discrimination showed an AUC = 0.80 and specificity = 0.74. Similar accuracy results (AUC = 0.83 and specificity = 0.75) were obtained when COA was employed instead of the manual segmentation (Figure 3b). Lower model accuracy (AUC = 0.63 and specificity = 0.61) was obtained for the 22 additional patients from cohort 2 with manual segmentation. An example of 3D PET images with and without LR is presented in Figure 4.
4. Discussion
Our novel method based on FDG-PET radiomics allows identifying clinically relevant radiomic changes during chemoradiation in locally advanced inoperable NSCLC. We could confirm that 17 IF satisfied simultaneously the two criteria required for the feasibility of the method: they were normal-distributed across 4D breathing frames, and they were consistent throughout 3D and 4D images. Consequently, for these 17 IFs we could justify the use of the variability across the pre-treatment 4D PET breathing phases as a patient individualized normalization factor to emphasize statistically relevant IF changes during treatment. Furthermore, we tested the implementation of the method in the prediction of treatment outcome in 37 lung cancer patients and preliminary validation results showed that the normalized relative deviation of the AUC of the cumulative histogram, δAUCCSH, differentiates patients with local recurrence from patients without local recurrence.
In the development of the method, different segmentation approaches and patient cohorts were used. When comparing the same cohort but with different segmentation approaches, manual segmentation resulted in a larger number of IF satisfying the two criteria (65 normal-distributed and 83 comparable) than for COA (61 normal-distributed and 69 comparable). The decreasing number of robust IF for COA could be justified by the fact that, in comparison with manual segmentation, automatic segmentations like COA are more sensitive to image noise, heterogeneity and signal blurring due to the lesion motion [20]. When comparing two different patient cohorts, more IF were robust between 3D and 4D in cohort 3 (131 IF) than for the same segmentation in cohort 1 (83 IF). It could not be explained by the range of lesion sizes or locations involved in the cohorts because decreased robustness would be expected for smaller lesions and for more significant lesion movement. However, in cohort 3 the lesions were smaller (median volume = 12 ± 9 mL) and 7/11 had a peripheral location compared to cohort 1 where lesions were larger (median volume = 86 ± 66 mL) and 3/15 were peripheral. A possible explanation for the unexpected decreased number of robust IF observed for cohort 1 could be a more irregular respiratory patterns for these central lesions as the respiratory pattern is a very significant degrading factor for IF robustness between 4D and 3D [29]. Overall, IF variability was sensitive to the segmentation and to the localization and volume of the treated lesions. We could therefore emphasize the importance of involving different cohorts in model development in order to increase the ability to generalize the resulting method.
In addition, it should be remarked that the method proposed would permit the identification of clinically relevant changes during treatment for a given image feature. In previous publications, two approaches were employed to define PET IF stability: In first method, IF were considered not stable if their relative deviations were larger than the relative deviations observed for volume or for SUV (maximum and mean) [30]. The second method was to consider IF as stable if they had a relative deviation lower than 15% [31] Although these methods were considered reasonable, it has been demonstrated that for phantoms [29] and for patients [21] the intrinsic variability of an IF and the impact of the degrading factor on an IF [16,32] varies between the respective features. Consequently, it should be recommended that a stability criterion individually adapted to the respective IF be used and the use of general rules be avoided. In addition, the criteria to identify clinically relevant changes during treatment using the proposed method would be also patient-dependent. It is well known that PET noise and resolution are degrading factors of IF variability and their impact on IF variability can be expected to be more significant for smaller lesions and lower uptake concentrations. Therefore, a stability criterion individually adapted to the patient, that is, adapted to the SUV and volume of the lesion, would be preferable to a common stability criterion applied to all patients independent of lesions characteristics.
Once we identified the IF eligible for their use in the proposed method we tested the implementation of the method in the outcome prediction for lung cancer patients, who were separated into training cohorts and validation cohorts. The training cohort was selected in terms of homogeneity: all response 3D PETs were acquired in the second week after the start of the treatment, and all PETs had the same resolution. Interestingly, accuracy obtained for the model of LR prediction in the training cohort (AUC = 0.87, Specificity = 0.78) was also high (AUC = 0.80, specificity = 0.74) when 4 additional patients, for whom response 3D PET was acquired after treatment, were included. For the validation cohorts, the model accuracy was independent on whether 3D PET was done during or after treatment. These results would not be enough to confirm the robustness of the method with respect to the time of the 3D PET acquisition for response monitoring. However, based on these results, the robustness of the method with respect to the time could not be rejected, which would imply an important advantage for the application of the method in clinical routine. In addition, LR prediction was equally accurate for lesions segmented manually or by COA. From these results, the replacement of manual segmentation by COA segmentation could be recommended. It would not only reduce inter- and intra-observer variability, but it would also reduce the time invested, which is requisite for the application of the method in clinical routine. Although previous phantom experiments had already reported the robustness of AUCCSH with respect to different PET/CT systems, CT metal artefacts and reconstruction voxel size [32], the accuracy of the LR prediction model was also evaluated for a second validation cohort of 22 new patients, mixing 2 and 4 mm PET images. For cohort 2, accuracy results were poorer but still comparable to findings reported in previous publications [33].
Many publications focused on radiomics in lung cancer [34,35,36], but only part of these publications focused on FDG-PET images. From them, a small number evaluated IF variations during or after treatment (delta radiomics) [33]. To our knowledge, we present the first prediction model based on IF variations adjusted to account for the intrinsic variability of IF. Our results showed that δAUCCSH could differentiate patients with local recurrence from patients without local recurrence. AUCCSH is a first-order statistics, that is, it does not take into account spatial information, which reflects tumor heterogeneity. The cumulative SUV-volume histogram (CSH) is an analogue to the dose-volume histogram employed in RT-planning. In CSH, the percent of tumor volume with an SUV (instead of dose) above a certain threshold is plotted against that threshold, which varied from 0 to 100% of the maximum SUV (SUVmax). The area under the curve of these plots, AUCCSH, is a quantitative index of heterogeneity, where AUCCSH increases with homogeneity. For patients with local recurrence δAUCCSH resulted in (median = 5, mean ± standard deviation = 7.2 ± 4.5, ranging from 1.3 to 12.3) and for patients without local recurrence in (median = 0.3, mean ± standard deviation = 0.8 ± 5.9, ranging from −16 to 2.6) (See example in Figure 2). We could, therefore, conclude that patients whose primary tumor homogeneity increased during therapy had a higher probability of local recurrence. Previous publications have also reported significant correlations between AUCCSH and RT outcome for lung cancer patients: prediction of recurrence based on pre-treatment AUCCSH quantification [37] and prediction of OS based on changes in AUCCSH during treatment [38]. The same correlations could be observed in our data sample, but they were not statistically significant. Although it is known that intratumoral heterogeneity in FDG distribution correlates with many factors at the cellular level (such as intracellular hypoxia, necrotic infiltration, vascularization and tumor cell proliferation [39]), the underlying physiological processes, which might explain why an increased FDG homogeneity in terms of AUC would be derived in local recurrence are still not clear. For a better understanding of our results on a cellular or immunological level (e.g., by an increase in the T-cell infiltration), it would be of interest to evaluate our findings with a mid-treatment histological examination. From our knowledge, no publication concerning other cancer sites has demonstrated a statistically significant prediction of treatment response by AUCCSH. It could probably be due to the fact that AUCCSH is not as frequently evaluated as other histogram parameters [16], such as SUVmean, Skewness, Kurtosis or coefficient of variance. Nevertheless, patients with an increase in primary tumor homogeneity during therapy (e.g., diagnosed using a mid-treatment 1⁸F-FDG PET) might benefit from concepts such as dose escalation or an additional boost of the primary tumor. In order to avoid treatment prolongation and subsequently avoiding the effect of an accelerated repopulation of tumor clonogens [40,41], either hyperfractionation [42] or hypofractionation [11] can be used. Such concepts have been tested previously, using metabolic tumor volumes according to 1⁸F-FDG-avidity on mid-treatment PET scans with conflicting results [42,43,44]. From the results reported in this study, it might be of interest to evaluate these treatments on the basis of an increase of 1⁸F-FDG homogeneity in terms of AUC.
The main limitation of our study is the small sample size. A wide patient population is required to increase the statistical significance of our results. Current work is therefore focused on the recruitment of additional patients from other German institutions involved in STRIPE and PET-Plan trials. This wider, more heterogeneous, patient population will allow us to evaluate the performance of our model in an external validation cohort. Consequently, the LR prediction model observed for the patient population involved in the current analysis should be understood as a proof of concept for the feasibility of the use of the proposed method to identify clinical relevant changes during the treatment of lung cancer patients based FDG-PET radiomics. The confirmation of this LR prediction in a larger patient cohort is therefore required. However, from our preliminary results, and in agreement with previous publications, the quantification of tumor FDG heterogeneity by δAUCCSH could be implemented when monitoring treatment response of NSCLC patients treated with chemoradiation and assist with mid-treatment adaption and dose intensification concepts such as isotopic dose escalation.
5. Conclusions
The presented novel method based on FDG-PET radiomics identifies clinically relevant changes during chemoradiation in patients with NSCLC. For the selected 17 IF, which were normally distributed across 4D breathing frames and robust throughout 3D and 4D images, we could justify the use of the variability across the pre-treatment 4D PET breathing phases as a patient-individualized normalization factor to emphasize statistically relevant IF changes during treatment. In addition, we reported the first prediction model based on IF variations which were adjusted to account for the intrinsic variability of IF. Preliminary results showed that patients for which primary tumor homogeneity, quantified by AUCCSH, increased during therapy had a higher probability for local recurrence. These patients could profit the most from mid-treatment adaptions of the target volume as well as from isotoxically escalated concepts such as in the PET-Plan trial.
Supplementary Materials
The following are available online at https://www.mdpi.com/2072-6694/13/4/814/s1. Table S1: Image features (radiomics) and Table S2: Image features robustness required for the proposed method. Black box means positive result for the analysis described on the first row and represents the property of interest. In last column, IF robustness was defined by simultaneously satisfying both criteria (normal-distributed (fifth column) and comparable (ninth column)).
Author Contributions
Conceptualization, M.C.; methodology, M.C., M.M.; software, M.C., T.F.; validation, G.R., E.G., M.C.; formal analysis, M.C.; investigation, G.R., S.A., T.F., T.S.-J., T.S.-J., C.Z., E.G., N.H.N.; resources, U.N., A.L.G.; data curation, G.R., T.F., T.S.-J., M.C., E.G.; writing—original draft preparation, M.C.; writing—review and editing, all authors; visualization, M.C., E.G.; supervision, U.N., L.M.-B., M.M., E.G., D.B., M.C.; project administration E.G.,U.N.; funding acquisition, U.N., A.L.G. All authors have read and agreed to the published version of the manuscript.
Funding
Montserrat Carles was partially funded in 2019 and 2020 by Era PerMed and funded from July 2020 on by the Conselleria de Sanitat Universal i Salut Pública from the Comunitat Valenciana. Sonja Adebahr is funded by a German consortium of translational cancer research (DKTK). The PET Plan trial was supported by the German Cancer Aid grant NR 108237. The STRIPE trial was supported from the German Cancer Aid grant NR Nr 108472. The funding sources had no involvement in the writing of the manuscript or in the decision to submit the article for publication.
Institutional Review Board Statement
The study was conducted according to the guidelines of the Declaration of Helsinki, and approved by the Ethics Committee of Albert-Ludwigs-Universität Freiburg (Protocol Code: 276/08, Date: 7 January 2009) for PET-Plan and (Protocol Code: 366/08, Date: 18 April 2012) for STRIPE.
Informed Consent Statement
Written informed consent was obtained from all the patients.
Data Availability Statement
Data sharing not applicable.
Conflicts of Interest
The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.
Footnotes
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Townsend D.W. Positron emission tomography/computed tomography. Semin. Nucl. Med. 2008;38:152–166. doi: 10.1053/j.semnuclmed.2008.01.003. [DOI] [PubMed] [Google Scholar]
- 2.MacManus M., Nestle U., Rosenzweig K.E., Carrio I., Messa C., Belohlavek O., Danna M., Inoue T., Deniaud-Alexandre E., Schipani S., et al. Use of pet and pet/ct for radiation therapy planning: Iaea expert report 2006–2007. Radiother. Oncol. 2009;91:85–94. doi: 10.1016/j.radonc.2008.11.008. [DOI] [PubMed] [Google Scholar]
- 3.UyBico S.J., Wu C.C., Suh R.D., Le N.H., Brown K., Krishnam M.S. Lung cancer staging essentials: The new tnm staging system and potential imaging pitfalls. Radiographics. 2010;30:1163–1181. doi: 10.1148/rg.305095166. [DOI] [PubMed] [Google Scholar]
- 4.Postmus P.E., Kerr K.M., Oudkerk M., Senan S., Waller D.A., Vansteenkiste J., Escriu C., Peters S., Committee E.G. Early and locally advanced non-small-cell lung cancer (nsclc): Esmo clinical practice guidelines for diagnosis, treatment and follow-up. Ann. Oncol. 2017;28:iv1–iv21. doi: 10.1093/annonc/mdx222. [DOI] [PubMed] [Google Scholar]
- 5.Stroobants S.G., D’Hoore I., Dooms C., De Leyn P.R., Dupont P.J., De Wever W., De Groot T., Verschakelen J.A., Mortelmans L.A., Vansteenkiste J.F. Additional value of whole-body fluorodeoxyglucose positron emission tomography in the detection of distant metastases of non-small-cell lung cancer. Clin. Lung Cancer. 2003;4:242–247. doi: 10.3816/CLC.2003.n.005. [DOI] [PubMed] [Google Scholar]
- 6.van Baardwijk A., Dooms C., van Suylen R.J., Verbeken E., Hochstenbag M., Dehing-Oberije C., Rupa D., Pastorekova S., Stroobants S., Buell U., et al. The maximum uptake of (18)f-deoxyglucose on positron emission tomography scan correlates with survival, hypoxia inducible factor-1alpha and glut-1 in non-small cell lung cancer. Eur. J. Cancer. 2007;43:1392–1398. doi: 10.1016/j.ejca.2007.03.027. [DOI] [PubMed] [Google Scholar]
- 7.Vaidya M., Creach K.M., Frye J., Dehdashti F., Bradley J.D., El Naqa I. Combined pet/ct image characteristics for radiotherapy tumor response in lung cancer. Radiother. Oncol. 2012;102:239–245. doi: 10.1016/j.radonc.2011.10.014. [DOI] [PubMed] [Google Scholar]
- 8.Nestle U., De Ruysscher D., Ricardi U., Geets X., Belderbos J., Pöttgen C., Dziadiuszko R., Peeters S., Lievens Y., Hurkmans C., et al. Estro acrop guidelines for target volume definition in the treatment of locally advanced non-small cell lung cancer. Radiother. Oncol. 2018;127:1–5. doi: 10.1016/j.radonc.2018.02.023. [DOI] [PubMed] [Google Scholar]
- 9.Nestle U., Kremp S., Grosu A.L. Practical integration of [18F]-FDG-PET and PET/CT in the planning of radiotherapy for non-small cell lung cancer (nsclc): The technical basis, icru-target volumes, problems, perspectives. Radiother. Oncol. 2006;81:209–225. doi: 10.1016/j.radonc.2006.09.011. [DOI] [PubMed] [Google Scholar]
- 10.van Elmpt W., Ollers M., Dingemans A.-M.C., Lambin P., De Ruysscher D. Response assessment using 18f-fdg pet early in the course of radiotherapy correlates with survival in advanced-stage non-small cell lung cancer. J. Nucl. Med. 2012;53:1514–1520. doi: 10.2967/jnumed.111.102566. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Kong F.M., Ten Haken R.K., Schipper M., Frey K.A., Hayman J., Gross M., Ramnath N., Hassan K.A., Matuszak M., Ritter T., et al. Effect of midtreatment pet/ct-adapted radiation therapy with concurrent chemotherapy in patients with locally advanced non-small-cell lung cancer: A phase 2 clinical trial. JAMA Oncol. 2017;3:1358–1365. doi: 10.1001/jamaoncol.2017.0982. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Scrivener M., de Jong E.E.C., van Timmeren J.E., Pieters T., Ghaye B., Geets X. Radiomics applied to lung cancer: A review. Transl. Cancer Res. 2016;5:398–409. doi: 10.21037/tcr.2016.06.18. [DOI] [Google Scholar]
- 13.Kumar V., Gu Y., Basu S., Berglund A., Eschrich S.A., Schabath M.B., Forster K., Aerts H.J., Dekker A., Fenstermacher D., et al. Radiomics: The process and the challenges. Magn. Reson. Imaging. 2012;30:1234–1248. doi: 10.1016/j.mri.2012.06.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Avanzo M., Stancanello J., El Naqa I. Beyond imaging: The promise of radiomics. Phys. Med. 2017;38:122–139. doi: 10.1016/j.ejmp.2017.05.071. [DOI] [PubMed] [Google Scholar]
- 15.European Institute for Biomedical Imaging R. Strategic research agenda for biomedical imaging. Insights Imaging. 2019;10:7. doi: 10.1186/s13244-019-0684-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Reuzé S., Schernberg A., Orlhac F., Sun R., Chargari C., Dercle L., Deutsch E., Buvat I., Robert C. Radiomics in nuclear medicine applied to radiation therapy: Methods, pitfalls, and challenges. Int. J. Radiat. Oncol. Biol. Phys. 2018;102:1117–1142. doi: 10.1016/j.ijrobp.2018.05.022. [DOI] [PubMed] [Google Scholar]
- 17.Zwanenburg A. Radiomics in nuclear medicine: Robustness, reproducibility, standardization, and how to avoid data analysis traps and replication crisis. Eur. J. Nucl. Med. Mol. Imaging. 2019;46:2638–2655. doi: 10.1007/s00259-019-04391-8. [DOI] [PubMed] [Google Scholar]
- 18.Leijenaar R.T., Carvalho S., Velazquez E.R., van Elmpt W.J., Parmar C., Hoekstra O.S., Hoekstra C.J., Boellaard R., Dekker A.L., Gillies R.J., et al. Stability of fdg-pet radiomics features: An integrated analysis of test-retest and inter-observer variability. Acta Oncol. 2013;52:1391–1397. doi: 10.3109/0284186X.2013.812798. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Bowen S.R., Nyflot M.J., Gensheimer M., Hendrickson K.R.G., Kinahan P.E., Sandison G.A., Patel S.A. Challenges and opportunities in patient-specific, motion-managed and pet/ct-guided radiation therapy of lung cancer: Review and perspective. Clin. Transl. Med. 2012;1:18. doi: 10.1186/2001-1326-1-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Carles M., Fechter T., Nemer U., Nanko N., Mix M., Nestle U., Schaefer A. Feasibility of a semi-automated contrast-oriented algorithm for tumor segmentation in retrospectively gated pet images: Phantom and clinical validation. Phys. Med. Biol. 2015;60:9227–9251. doi: 10.1088/0031-9155/60/24/9227. [DOI] [PubMed] [Google Scholar]
- 21.Carles M., Bach T., Torres-Espallardo I., Baltas D., Nestle U., Martí-Bonmatí L. Significance of the impact of motion compensation on the variability of pet image features. Phys. Med. Biol. 2018;63:065013. doi: 10.1088/1361-6560/aab180. [DOI] [PubMed] [Google Scholar]
- 22.Grootjans W., Tixier F., van der Vos C.S., Vriens D., Le Rest C.C., Bussink J., Oyen W.J., de Geus-Oei L.F., Visvikis D., Visser E.P. The impact of optimal respiratory gating and image noise on evaluation of intratumor heterogeneity on 18F-FDG pet imaging of lung cancer. J. Nucl. Med. 2016;57:1692–1698. doi: 10.2967/jnumed.116.173112. [DOI] [PubMed] [Google Scholar]
- 23.Nestle U., Schimek-Jasch T., Kremp S., Schaefer-Schuler A., Mix M., Kusters A., Tosch M., Hehr T., Eschmann S.M., Bultel Y.P., et al. Imaging-based target volume reduction in chemoradiotherapy for locally advanced non-small-cell lung cancer (pet-plan): A multicentre, open-label, randomised, controlled trial. Lancet Oncol. 2020;21:581–592. doi: 10.1016/S1470-2045(20)30013-9. [DOI] [PubMed] [Google Scholar]
- 24.Nestle U., Adebahr S., Kaier K., Gkika E., Schimek-Jasch T., Hechtner M., Momm F., Gaertner J., Becker G., Grosu A.L. Quality of life after pulmonary stereotactic fractionated radiotherapy (sbrt): Results of the phase ii stripe trial. Radiother. Oncol. 2020;148:82–88. doi: 10.1016/j.radonc.2020.03.018. [DOI] [PubMed] [Google Scholar]
- 25.Vallières M., Freeman C.R., Skamene S.R., El Naqa I. A radiomics model from joint FDG-PET and MRI texture features for the prediction of lung metastases in soft-tissue sarcomas of the extremities. Phys. Med. Biol. 2015;60:5471–5496. doi: 10.1088/0031-9155/60/14/5471. [DOI] [PubMed] [Google Scholar]
- 26.Leijenaar R.T., Nalbantov G., Carvalho S., van Elmpt W.J., Troost E.G., Boellaard R., Aerts H.J., Gillies R.J., Lambin P. The effect of suv discretization in quantitative fdg-pet radiomics: The need for standardized methodology in tumor texture analysis. Sci. Rep. 2015;5:11075. doi: 10.1038/srep11075. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Giavarina D. Understanding bland altman analysis. Biochem. Med. 2015;25:141–151. doi: 10.11613/BM.2015.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Benjamini Y., Drai D., Elmer G., Kafkafi N., Golani I. Controlling the false discovery rate in behavior genetics research. Behav. Brain Res. 2001;125:279–284. doi: 10.1016/S0166-4328(01)00297-2. [DOI] [PubMed] [Google Scholar]
- 29.Carles M., Torres-Espallardo I., Alberich-Bayarri A., Olivas C., Bello P., Nestle U., Martí-Bonmatí L. Evaluation of PET texture features with heterogeneous phantoms: Complementarity and effect of motion and segmentation method. Phys. Med. Biol. 2016;62:652–9251. doi: 10.1088/1361-6560/62/2/652. [DOI] [PubMed] [Google Scholar]
- 30.Orlhac F., Soussan M., Maisonobe J.A., Garcia C.A., Vanderlinden B., Buvat I. Tumor texture analysis in 18F-FDG PET: Relationships between texture parameters, histogram indices, standardized uptake values, metabolic volumes, and total lesion glycolysis. J. Nucl. Med. 2014;55:414–422. doi: 10.2967/jnumed.113.129858. [DOI] [PubMed] [Google Scholar]
- 31.Nyflot M.J., Yang F., Byrd D., Bowen S.R., Sandison G.A., Kinahan P.E. Quantitative radiomics: Impact of stochastic effects on textural feature analysis implies the need for standards. J. Med. Imaging. 2015;2:041002. doi: 10.1117/1.JMI.2.4.041002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Carles M., Baltas D., Mix M. PET image features for their use in radiomics: Selection criteria based on intrinsic complementarity with respect to volume and robustness with respect to system, metal artefacts and voxel size in image reconstruction. Nuklearmedizin. 2019;58:P76. [Google Scholar]
- 33.Shi L., He Y., Yuan Z., Benedict S., Valicenti R., Qiu J., Rong Y. Radiomics for response and outcome assessment for non-small cell lung cancer. Technol. Cancer Res. Treat. 2018;17:1533033818782788. doi: 10.1177/1533033818782788. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Sollini M., Cozzi L., Antunovic L., Chiti A., Kirienko M. Pet radiomics in nsclc: State of the art and a proposal for harmonization of methodology. Sci. Rep. 2017;7:358. doi: 10.1038/s41598-017-00426-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Thawani R., McLane M., Beig N., Ghose S., Prasanna P., Velcheti V., Madabhushi A. Radiomics and radiogenomics in lung cancer: A review for the clinician. Lung Cancer. 2018;115:34–41. doi: 10.1016/j.lungcan.2017.10.015. [DOI] [PubMed] [Google Scholar]
- 36.Gkika E., Benndorf M., Oerther B., Mohammad F., Beitinger S., Adebahr S., Carles M., Schimek-Jasch T., Zamboglou C., Frye B.C., et al. Immunohistochemistry and radiomic features for survival prediction in small cell lung cancer. Front. Oncol. 2020;10:1161. doi: 10.3389/fonc.2020.01161. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Kang S.R., Song H.C., Byun B.H., Oh J.R., Kim H.S., Hong S.P., Kwon S.Y., Chong A., Kim J., Cho S.G., et al. Intratumoral metabolic heterogeneity for prediction of disease progression after concurrent chemoradiotherapy in patients with inoperable stage iii non-small-cell lung cancer. Nucl. Med. Mol. Imaging. 2014;48:16–25. doi: 10.1007/s13139-013-0231-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Dong X., Sun X., Sun L., Maxim P.G., Xing L., Huang Y., Li W., Wan H., Zhao X., Xing L., et al. Early change in metabolic tumor heterogeneity during chemoradiotherapy and its prognostic value for patients with locally advanced non-small cell lung cancer. PLoS ONE. 2016;11:e0157836. doi: 10.1371/journal.pone.0157836. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Thorwarth D., Geets X., Paiusco M. Physical radiotherapy treatment planning based on functional pet/ct data. Radiother. Oncol. 2010;96:317–324. doi: 10.1016/j.radonc.2010.07.012. [DOI] [PubMed] [Google Scholar]
- 40.Bese N.S., Hendry J., Jeremic B. Effects of prolongation of overall treatment time due to unplanned interruptions during radiotherapy of different tumor sites and practical methods for compensation. Int. J. Radiat. Oncol. Biol. Phys. 2007;68:654–661. doi: 10.1016/j.ijrobp.2007.03.010. [DOI] [PubMed] [Google Scholar]
- 41.McMillan M.T., Ojerholm E., Verma V., Higgins K.A., Singhal S., Predina J.D., Berman A.T., Grover S., Robinson C.G., Simone C.B., 2nd Radiation treatment time and overall survival in locally advanced non-small cell lung cancer. Int. J. Radiat. Oncol. Biol. Phys. 2017;98:1142–1152. doi: 10.1016/j.ijrobp.2017.04.004. [DOI] [PubMed] [Google Scholar]
- 42.Eberhardt W.E., Pottgen C., Gauler T.C., Friedel G., Veit S., Heinrich V., Welter S., Budach W., Spengler W., Kimmich M., et al. Phase iii study of surgery versus definitive concurrent chemoradiotherapy boost in patients with resectable stage iiia(n2) and selected iiib non-small-cell lung cancer after induction chemotherapy and concurrent chemoradiotherapy (espatue) J. Clin. Oncol. 2015;33:4194–4201. doi: 10.1200/JCO.2015.62.6812. [DOI] [PubMed] [Google Scholar]
- 43.Kong F.-M., Li L., Wang W., Campbell J., Waller J.L., Piert M., Gross M., Cheng M., Owen D., Stenmark M., et al. Greater reduction in mid-treatment fdg-pet volume may be associated with worse survival in non-small cell lung cancer. Radiother. Oncol. 2019;132:241–249. doi: 10.1016/j.radonc.2018.10.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Even A.J.G., van der Stoep J., Zegers C.M.L., Reymen B., Troost E.G.C., Lambin P., van Elmpt W. Pet-based dose painting in non-small cell lung cancer: Comparing uniform dose escalation with boosting hypoxic and metabolically active sub-volumes. Radiother. Oncol. 2015;116:281–286. doi: 10.1016/j.radonc.2015.07.013. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Data sharing not applicable.