Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2020 Aug 15.
Published in final edited form as: Clin Cancer Res. 2019 May 29;25(16):5027–5037. doi: 10.1158/1078-0432.CCR-18-3908

Voxel Forecast for Precision Oncology: predicting spatially variant and multiscale cancer therapy response on longitudinal quantitative molecular imaging

Stephen R Bowen 1,2,*, Daniel S Hippe 1,*, W Art Chaovalitwongse 3, Chunyan Duan 3,4, Phawis Thammasorn 3, Xiao Liu 3, Robert S Miyaoka 1, Hubert J Vesselle 1, Paul E Kinahan 1, Ramesh Rengan 2, Jing Zeng 2
PMCID: PMC6697586  NIHMSID: NIHMS1530245  PMID: 31142507

Abstract

Purpose

Prediction of spatially variant response to cancer therapies can inform risk-adaptive management within precision oncology. We developed the “Voxel Forecast” multiscale regression framework for predicting spatially variant tumor response to chemoradiotherapy on FDG-PET/CT imaging.

Experimental Design

Twenty-five patients with locally advanced non-small cell lung cancer, enrolled on the FLARE-RT phase II trial (NCT02773238), underwent FDG-PET/CT imaging prior to (PETpre) and during week 3 (PETmid) of concurrent chemoradiotherapy. Voxel Forecast was designed to predict tumor voxel SUV on PETmid from baseline patient-level and voxel-level covariates using a custom generalized least squares (GLS) algorithm. Matérn covariance matrices were fit to patient-specific empirical variograms of distance-dependent inter-voxel correlation. Regression coefficients from variogram-based weights and corresponding standard errors were estimated using the jack-knife technique. The framework was validated using statistical simulations of known spatially variant tumor response. Mean absolute prediction error (MAE) of Voxel Forecast models were calculated under leave-one-patient-out cross-validation.

Results

Patient-level forecasts resulted in tumor voxel SUV MAE on PETmid of 1.5 g/mL while combined patient- and voxel-level forecasts achieved lower MAE of 1.0 g/mL (p<0.0001). PETpre voxel SUV was the most important predictor of PETmid voxel SUV. Patients with greater percentage of under-responding tumor voxels were classified as PETmid non-responders (p=0.030) with worse overall survival prognosis (p<0.001).

Conclusions

Voxel Forecast multiscale regression provides a statistical framework to predict voxelwise response patterns during therapy. Voxel Forecast can be extended to predict spatially variant response on multimodal quantitative imaging and may eventually guide optimized spatial-temporal dose distributions for precision cancer therapy.

Introduction

Precision oncology is predicated on delivering the right set of treatments at the right time to each patient (1, 2). Tissue and peripheral biomarkers establish baseline tumor pan-”omic” profiles for selection of therapeutic regimens, which continue to expand with the emergence of next generation molecularly targeted agents (3), immune checkpoint inhibitors (410), and cancer vaccines (1114). Quantitative imaging is gaining prominence as a non-invasive biomarker of early response to cancer therapies, providing rich high dimensional radiomic data (15) for tumor phenotyping (16, 17) in precision oncology (18). Among a variety of imaging applications to cancer therapy assessment, the quantitative evaluation of early tumor response on fluorodexoyglucose (FDG) positron emission tomography / computed tomography (PET/CT) stands out as a success story for predicting outcomes in lung cancer patients and enabling more precise patient risk stratification (1925). Unlike conventional tissue or peripheral biomarkers, FDG PET/CT as an imaging biomarker contains spatial information over the entire disease burden that can be correlated with lung cancer recurrence patterns (2629).

Mounting clinical evidence spurred the launch of several FDG PET/CT imaging biomarker-guided clinical trials in lung cancer (3033), which seek to deliver risk-adapted radiation therapies in combination with systemic therapies to improve efficacy. However, current trial designs limit their assessments to global patient or tumor region-of-interest (ROI) response that ignore intra-tumoral spatial heterogeneity in response. Analogous to tumor genomic heterogeneity driving clonal evolution and treatment resistance (34), tumor spatial radiomic heterogeneity captures both inter-patient and intra-tumoral variation in response (35). There are potential advantages to examining tumor response on multiple spatial scales of imaging, spanning individual imaging volume elements (voxels) to voxel clusters, lesions within the same patient, and lesions between patients. Multiscale response prediction can incorporate information at all spatial scales to better capture tumor radiomic heterogeneity, providing clinical decision support based on differentiation of response between patients, between tumors, and within tumors.

Multiscale prediction of tumor voxel response poses technical challenges. Conventional statistical methods such as ordinary least squares (OLS) regression to predict tumor voxel response have been investigated. Prior exploratory studies applied linear and linear-mixture regression models to characterize preliminary spatial relationships between longitudinal PET images, including identification of spatially coherent radioresistent tumor voxel subpopulations (36). Some investigations employed logistic regression to predict FDG PET tumor metabolic control probabilities (37); others compared linear regression, logistic regression, and classification and regression trees (CART) to predict voxel locations of tumor recurrence (38). These initial studies did not fully account for the spatially variant correlation between all voxels within the same tumor relative to independent data between patients. Underestimation of statistical variability by OLS regression can inflate false discoveries and relative importance of predictor variables. Furthermore, OLS regression does not leverage the spatial structure of voxel data to efficiently power prediction modeling.

Predicting spatially variant response to cancer therapy on longitudinal imaging with a novel statistical framework can inform individualized risk-adaptive treatment strategies. The purpose of this investigation was to develop a multiscale regression framework for predicting spatially variant tumor response at the voxel scale while accounting for inter-voxel correlation, described henceforth as “Voxel Forecast”. We first conducted simulations with known ground truth voxel response and spatial correlation structure to evaluate the statistical accuracy and precision of Voxel Forecast. We then applied Voxel Forecast to a cohort of locally advanced non-small cell lung cancer patients enrolled on the FLARE-RT protocol (NCT02773238) to predict Week 3 mid-treatment FDG-PET/CT tumor voxel response. We focused on mid-treatment PET/CT response prediction rather than post-treatment PET/CT response as this application is conducive to timely alterations in patient management strategies. Lastly, we explored the clinical implications of Voxel Forecast as a decision support system by correlating prediction residuals with clinical factors, treatment factors, and patient outcomes.

Methods

Patient Population and Clinical Trial Protocol

The phase II FLARE-RT clinical trial (NCT02773238) eligibility criteria included patients with locally advanced, inoperable stage IIB-IIIB non-small cell lung cancer (AJCC v7) and performance status ECOG 0–1, as well pulmonary function test, liver function test, and renal function test adequacy. All patients received 6 weeks of concurrent chemotherapy and radiation therapy (RT) with functional lung avoidance planning strategies, while a select subgroup of patients (n = 9) received adaptive and concomitant dose escalation based on mid-treatment PET imaging response assessment. Pre-RT FDG PET/CT imaging occurred at a median time of 10 days (range 1–33 days) prior to treatment start. Twenty-eight consecutive patients were enrolled on this study following obtained written consent, approval by the University of Washington / Fred Hutchinson Cancer Consortium institutional review board (CC9599), and compliance with the Declaration of Helsinki ethical guidelines. After excluding patients with metastatic disease on repeat baseline FDG PET/CT (n = 2) or those who withdrew from the protocol (n = 1), a total of 25 patients underwent mid-RT FDG PET/CT imaging at median prescribed dose of 24 Gy in 12 fractions and range of 20–26 Gy in 10–13 fractions during the third week of radiation therapy. These patients formed the clinical dataset for Voxel Forecast analysis.

Image Acquisition and Processing

All baseline and mid-treatment FDG PET/CT imaging was acquired with patients immobilized in radiotherapy position and adhered to standard imaging protocols on the same or cross-calibrated PET/CT scanners. Between imaging time points, the majority of patients (n = 21) were scanned on the Discovery STE™ (GE Healthcare, Waukesha, WI), while the remainder (n = 4) were scanned on the Discovery MI™ (GE Healthcare, Waukesha, WI). Following a 12-hour fasting protocol, patients were injected with median 10.0 mCi (IQR 9.6–10.6 mCi) [18F]FDG following pre- and post-injection assays. Baseline PET scans were started at a median time post-injection of 60 min (IQR 60–63 min), and week 3 mid-treatment PET scan start delays were matched to baseline scan start delays for each patient with a median time difference of 0 min (IQR 0–2 min). Patients scanned on the GE DSTE PET/CT underwent acquisitions of 5 min per bed position (15 cm axial field-of-view [aFOV]), spanning from base-of-skull to mid-thigh (6–7 aFOV, 30–35 min); patients scanned on the GE DMI PET/CT underwent acquisitions of 2.5 min per bed position (25 cm aFOV), spanning from base-of-skull to mid-thigh (4–5 aFOV, 10–13 min). All images were reconstructed using an ordered subset expectation maximization (OSEM) iterative algorithm without time-of-flight or resolution recovery corrections to improve harmonization of patient data between DSTE and DMI PET/CT scanners. Standard quantitative PET data corrections, including CT-based attenuation correction, were applied.

The primary metabolic tumor volumes (MTV) were semi-automatically defined by the maximum gradient descent algorithm PET Edge™ (MIM Software, Cleveland OH). Inter-observer variability was minimized by independent generation of MTV by two observers followed by group review and consensus from a board-certified radiation oncologist. Localized rigid alignment of primary MTV between the planning respiratory-correlated (4D) CT, pre-RT PET/CT, mid-RT PET/CT, and RT dose DICOM objects was performed for all patients using a built-in mutual information algorithm (MIM Software, Cleveland, OH). Co-registered PET voxel grids were resampled to match the RT dose 2 mm isotropic voxel grids, which guaranteed isomorphic voxel mapping between baseline and mid-treatment time points. We did not utilize deformable image registration of bulky locally advanced NSCLC tumors in order to minimize changes in the native FDG PET uptake spatial distribution, as well as mitigate uncertainties secondary to the image deformation algorithm. Example patient images with variable mid-treatment FDG PET response patterns are displayed in Figure 1.

Figure 1.

Figure 1.

Baseline (Pre-RT, upper row) and Week 3 response (Mid-RT, lower row) FDG PET/CT imaging during chemoradiotherapy in 2 example FLARE-RT protocol patients. FDG-PET imaging standardized uptake value (SUV) hot-metal colorscale is overlaid on treatment planning CT imaging with rainbow colorscale planned isodose lines between the Pre-RT and Mid-RT time points. Pre-RT metabolic tumor volume (MTV) is displayed as a khaki contour. Patient (A) presents with large spatially heterogeneous response, while patient (B) presents with modest spatially homogeneous response that preserves the baseline ring uptake pattern.

Voxel Forecast of Simulated Data

A schematic of the Voxel Forecast workflow for predicting spatially variant tumor response using customized multiscale regression is shown in Figure 2. Full details of the Voxel Forecast modeling framework, the Matérn variogram estimation to account for inter-voxel spatial dependence, and the simulation scenarios with ground truth for benchmarking prediction algorithm performance can be found in the supplement. At each iteration with a random sample of 10–75 simulated patient tumors characterized by a known spatially variant voxel response model, regression coefficients β of multiscale patient-level and voxel-level predictors were estimated with the following: (1) standard OLS algorithm for independent data (naïve method), (2) the pure generalized least squares approach of the spatial modeling framework in which all parameters are weighted using the population-averaged variogram (GLS), and (3) the custom OLS-GLS hybrid approach of the spatial modeling framework in Figure 2, where only voxel-level parameters are weighted using the population-averaged variogram. The primary metrics to measure the performance of these three algorithms (naïve OLS, GLS, OLS-GLS hybrid) were the bias, relative to the true values of β specified when simulating the data; the standard error of β, estimated empirically as the standard deviation of the distribution across the 250 simulations for each sample size; and the 95% confidence interval coverage, estimated empirically as the percentage of times the confidence intervals for β included the true values specified when simulating the data.

Figure 2.

Figure 2.

Schematic of Voxel Forecast variogram-weighted multiscale regression framework to predict tumor voxel response. Multiscale patient- and voxel-level predictors generate ordinary least squares (OLS) residuals to initialize iterative OLS-GLS (generalized least squares) hybrid regression with Matérn variogram to account for spatial auto-correlation of voxels under leave-1-out cross validation. Regression coefficient standard errors (SE) are estimated using the jack-knife technique.

Voxel Forecast of Patient Data

Univariate voxel forecast models of Week 3 mid-RT PET SUV within the MTV were generated using each patient-level and voxel-level predictors. Patient-level predictors included standard clinical characteristics and treatment factors, as well as quantitative PET standardized uptake value (SUV) features that have demonstrated prognostic value for treatment response and clinical outcome stratification in locally advanced non-small cell lung cancer patient series (22, 39). Voxel-level predictors included pre-RT voxel SUV, voxel RT dose, and voxel distance from tumor centroid. High-dimensional engineered radiomic features from PET or CT were not included in this preliminary application in order to mitigate the risk of false discoveries while promoting transparency in the setting of modest patient samples. Two primary multivariate voxel forecast models were then constructed, one using only patient-level predictors (Model P) and another using both patient-level and voxel-level predictors (Model P+V). Right-skewed variables were log-transformed prior to inclusion in the models. Confidence intervals and Wald test p-values for each regression coefficient were calculated using the jack-knife as described in the supplement (40). To limit the number of patient-level predictors included in each model, only the patient-level predictors associated with a Wald test p-value <0.1 in a univariate model were included in the multivariate models.

Pairwise interaction terms were tested individually using Wald tests of the corresponding regression coefficients with Model P+V as the base model. The interactions considered were the 3 patient-voxel combinations of stage IIIB and the voxel-level predictors, as well as the 3 pairwise voxel-voxel combinations of the voxel-level predictors. Prediction error of the models was measured per patient as the mean absolute error (MAE) in mid-RT SUV across all MTV voxels. Prediction error was calculated using leave-one-patient-out cross-validation. The Wilcoxon signed-rank test was used as an approximate test for any difference in prediction error between multivariate Model P and Model P+V.

A residue analysis of Voxel Forecast (multivariate Model P+V) was conducted to test for associations between under-responding voxels, relative to prediction, and baseline clinical / treatment / imaging characteristics. The percentage of under-responding voxels in each patient tumor was calculated to stratify baseline characteristics into tertiles (i.e. patients with <25% under-responding voxels, 25%−75%, >75%). Differences between the tertiles of under-response in the same baseline factors considered during the univariate analysis and mid-RT PET clinical response status, defined prospectively under the FLARE-RT protocol by quantitative and multidisciplinary diagnostic assessment of metabolic changes, were tested using the Kruskal-Wallis test or Fisher’s exact test, as appropriate. Overall survival (OS) was summarized using the Kaplan-Meier estimator in patient subgroups that were significantly associated with under-response relative to Voxel Forecast predictions. Events were defined as death from any cause, and patients were censored at the time of last follow-up. Differences in OS between subgroups were evaluated by the log-rank test. All modeling and statistical calculations were performed using the R statistical computing language version 3.1.1 and 3.4.1 (R Foundation for Statistical Computing, Vienna, Austria) and the gstat package version 1.1 (41, 42).

Results

Across all simulated scenarios (see supplement), estimates of the regression coefficients using the naïve-OLS, GLS, and OLS-GLS hybrid methods had little to no bias. Even with 10 simulated patients, the estimates of bias were <10% of the standard error across all coefficients. The precision and 95% CI coverage are summarized in Figure 3. For patient-level variables (the intercept and G, see supplement for descriptions of the variables), the naïve and hybrid methods had similar standard errors while GLS consistently had higher standard errors. However, both GLS and the hybrid methods produced 95% CIs with approximately nominal coverage across the sample sizes considered while the naïve approach resulted in CI coverage < 20%. For voxel-level variables (X1 and X4), the naïve method had standard errors 5–60 times higher than both the GLS and hybrid methods, which themselves were very similar. For X1, which had extensive spatial correlation, the naïve method had CI coverage <70% while CI coverage was close to the nominal level for X4. CI coverage from the GLS and hybrid methods was close to the nominal level, though slightly below it, across all variables (87–97%). CI coverage from the GLS and hybrid methods was 88% at N=35 for X1, which was within the simulation margin of error of ~5% of the CI coverage estimated at nearby sample sizes (~92%).

Figure 3.

Figure 3.

Standard errors (top row) and coverage of 95% confidence intervals (CIs) (bottom row) for regression coefficients across different sample sizes and methods (naïve OLS, GLS, OLS-GLS hybrid) using simulated data. The horizontal dashed line in the low panels indicates the nominal 95% coverage. The margin-of-error for differences between different points on the 95% CI coverage curves for the naïve and GLS/hybrid methods are approximately ±5% (for both) for the intercept, ±5% and ±4% for G, ±8% and ±5% for X1, and ±4% and ±5% for X4, respectively.

The custom OLS-GLS hybrid algorithm was selected to apply to the FLARE-RT clinical trial cohort (n = 25) due to its unbiased and high precision performance for predicting simulated spatial response patterns. The cross-validated estimates of patient variograms are displayed for an example patient (Figure 4A) and the patient cohort (Figure 4B), demonstrating robustness of the Matérn function for characterizing inter-voxel spatial dependence. Matérn function parameters included a population-averaged range of 1.97 cm and smoothness of 3.5, along with patient-specific sill estimates to preserve the native dynamic range of SUV (see supplement).

Figure 4.

Figure 4.

Estimated variogram based on Matérn function for an example FLARE-RT patient (A, black curve) and patient population (B, black curve). The sill represents the dynamic range of spatial variance, normalized to 1 for the population estimate. The range represents the inter-voxel distance over which data remain correlated. Gray curves show leave-one-patient-out cross-validation (LOOCV) estimates. Each patient has an individualized variogram with common range and smoothness but patient-specific sill.

Clinical and tumor characteristics are summarized in Table 1. Of the 25 patients, 15 (60%) were female and age ranged from 50 to 75 years (median: 62 years). Eight patients (32%) were stage IIIB. Most tumors (76%) were adenocarcinoma and the rest were squamous cell carcinoma. Pre-RT SUVmax ranged from 3 to 32 g/cm3 (median: 11 g/cm3) and SUVmean ranged from 2 to 16 g/cm3 (median: 6 g/cm3). Over an average of 10891 voxels per tumor, MTV voxel SUV decreased from 7 to 4 g/cm3 in response to 25 Gy on average, with large baseline inter-patient (standard deviation [SD]: 4 g/cm3) and intra-tumor SUV variability (SD: 3 g/cm3).

Table 1.

Patient and tumor characteristics.

Patient/tumor Variable (n = 25) No. (%) or median (range)
 Female sex 15 (60.0)
 Age, years 62 (50 – 75)
 Adenocarcinoma (vs. squamous cell carcinoma) 19 (76.0)
 Stage IIIB (vs. other) 8 (32.0)
 Proton Beam Therapy (vs. X-ray Radiation Therapy) 12 (48.0)
 Chemotherapy modality
  Carboplatin + paclitaxel 16 (64.0)
  Cisplatin + etoposide 9 (36.0)
 Primary metabolic tumor volume, cm3 27 (2 – 396)
 SUVmax, g/cm3 11 (3 – 32)
 SUVmean, g/cm3 6 (2 – 16)
 TLG, g 247 (5 – 4408)
 Ring/concave uptake pattern (vs. foci/convex) 12 (48.0)
 Time from pre-RT PET to mid-RT PET, days 26 (16 – 51)
 Time from pre-RT PET to RT start, days 10 (1 – 33)

Univariate results for predicting the mid-RT SUV per voxel are shown in Table 2. At the patient-level, stage IIIB patients tended to have a 33% higher mid-RT SUV on average compared to stage IIB-IIIA patients (p=0.073). Each 50% increment in SUVmax and SUVmean was associated with an 18% (p=0.004) and 20% higher (p=0.003) mid-RT SUV, respectively. At the voxel-level, only pre-RT SUV was significantly associated with mid-RT SUV, with a 35% increase in mid-RT SUV per 50% increase in pre-RT SUV (p<0.001). Voxel-level RT dose was not significantly associated with mid-RT SUV (p=0.56), due in part to the spatial homogeneity of a uniform tumor dose prescription between pre-RT PET and mid-RT PET time points.

Table 2.

Univariate analysis of patient-level and voxel-level forecasting of mid-RT PET SUV.

Univariate Model
Patient/tumor Variable β* (95% CI) P-value
 Female sex 1.2 (−26.5, 39.5) 0.94
 Age, per 5-year increase 4.3 (−9.6, 20.5) 0.55
 Adenocarcinoma (vs. squamous cell carcinoma) −6.0 (−40.8, 49.2) 0.78
 Stage IIIB (vs. other) 33.4 (−2.9, 83.2) 0.073
 Proton Beam Therapy (vs. X-ray Radiation Therapy) −10.8 (−35.2, 22.8) 0.47
 Chemotherapy modality
  Carboplatin + paclitaxel (ref)
  Cisplatin + etoposide 1.2 (−27.2, 40.9) 0.94
 Primary metabolic tumor volume (MTV), per 50% increase −0.4 (−4.8, 4.1) 0.85
 Primary SUVmax, per 50% increase 18.1 (6.0, 31.7) 0.004
 Primary SUVmean, per 50% increase 20.0 (6.9, 34.8) 0.003
 Primary Total Lesion Glycolysis (TLG), per 50% increase 1.3 (−2.5, 5.3) 0.48
 Ring/concave uptake pattern (vs. foci/convex) −1.9 (−28.5, 34.8) 0.90
 Time from pre-RT PET to mid-RT PET, per 1-week increase −4.2 (−14.6, 7.5) 0.45
 Time from pre-RT PET to RT start, per 1-week increase −4.3 (−14.8, 7.5) 0.44
 No. of RT fractions between pre-RT and mid-RT PET 1.6 (−14.5, 20.7) 0.85
 Mean dose between pre-RT and mid-RT PET, per 1-Gy increase 0.7 (−6.4, 8.3) 0.84
Voxel Variable

 Voxel pre-RT SUV, per 50% increase 34.6 (27.1, 42.6) <0.001
 Voxel RT dose, per 1-Gy increase −2.7 (−11.3, 6.9) 0.56
 Voxel distance from tumor centroid, per 1-cm increase −7.4 (−23.5, 12.0) 0.41
*

Percent change in average SUV at the mid-RT PET scan

The predictor was log-transformed prior to inclusion in the model to reduce right-skewness.

Multivariate voxel-scale prediction models for mid-RT SUV were constructed using patient-level predictors only (Model P) and using patient- and voxel-level predictors together (Model P+V, Table 3). Pre-RT SUVmean was included in the models without SUVmax due to high cross-correlation (r = 0.94). In Model P, only pre-RT SUVmean was significantly and independently associated with mid-RT voxel SUV (21.5% increase per 50% increase, p=0.001). Similarly, only pre-RT voxel SUV was significantly and independently associated with mid-RT voxel SUV (35% per 50% increase, p<0.001) in Model P+V. After including both patient- and voxel-level pre-RT SUV predictors for Model P+V, pre-RT SUVmean was no longer significantly associated with mid-RT voxel SUV (p=0.069). After patient-scale and voxel-scale multivariate adjustment for pre-RT PET predictors, a relatively stronger association between higher mean tumor radiation dose and lower mid-RT voxel SUV was observed (univariate p=0.84, Model P p=0.13, Model P+V p=0.17, 4–5% voxel SUV decrease per 1-Gy increase). Interactions between Stage IIIB, a patient-level factor, and the three voxel-level predictors were tested, though none were statistically significant when added to Model P+V (p>0.54 for all). Similarly, the three pairwise interactions between the voxel-level predictors were tested and none were statistically significant (p>0.13 for all).

Table 3.

Multivariate Voxel Forecast models with patient-level and voxel-level predictors for mid-RT PET SUV.

Patient-Level Only Model P
Patient- and Voxel-Level Model P+V
Patient/tumor Variable β (95% CI) P-value β (95% CI) P-value
 Stage IIIB (vs. other) 22.9 (−9.0, 65.8) 0.17 23.6 (−7.6, 65.6) 0.15
 Primary SUVmean, per 50% increase 21.5 (9.6, 34.6) 0.001 −9.8 (−19.4, 0.9) 0.069
 Mean dose between pre-RT and mid-RT PET, per 1-Gy −4.6 (−10.3, 1.5) 0.13 −4.0 (−9.5, 1.8) 0.17
Voxel Variable

 Voxel pre-RT SUV, per 50% increase 34.8 (29.2, 40.7) <0.001
 Voxel RT dose, per 1-Gy increase −0.3 (−0.8, 0.2) 0.28
 Voxel distance from tumor centroid, per 1-cm increase −1.1 (−3.3, 1.1) 0.30

The predictor was log-transformed prior to inclusion in the model to reduce right-skewness.

The median mid-RT PET SUV MAE estimated using leave-one-out cross-validation was 1.5 g/cm3 for Model P and 1.0 g/cm3 for Model P+V (p<0.001 for the difference) (Figure 5). Predicted voxelwise mid-RT SUV from Model P+V as a function of pre-RT SUV is shown for two patients in Figure 6. Voxel Forecast output in each patient was blinded to model training, highlighting an example case with typical MAE (1.1 g/cm3, Figure 6A) and another with high MAE (3.3 g/cm3, Figure 6B). The largest prediction errors in these patients tended to occur in voxels with high pre-RT SUV, where voxel forecasted response was either underestimated (Figure 6A) or overestimated (Figure 6B). Tumor voxel response forecasts were mapped back to the native PET imaging spatial coordinates and representative image slices (Figure 7A, 7D) are displayed for comparison to the observed tumor voxel response image slices (Figure 7B, 7E) in the same two example patients (Figure 7AC, 7DF). While the patient with typical MAE displays smaller regions that over-responded relative to prediction (Figure 7C, ΔSUV > 0), the patient with high MAE displays large contiguous regions that under-responded relative to prediction (Figure 7F, ΔSUV < 0).

Figure 5.

Figure 5.

Voxel Forecast model prediction errors. Leave-one-patient-out cross-validation (LOOCV) mean absolute error from OLS-GLS hybrid model of patient-level predictors only compared to OLS-GLS hybrid model of patient- and voxel-level predictors. Median (line), quartiles (box), and 1.5 IQR (whisker) are shown. Non-parametric pairwise comparisons were performed by Wilcoxon signed-rank testing.

Figure 6.

Figure 6.

Comparison of Voxel Forecast predicted (black markers) vs. observed (gray markers) Week 3 mid-treatment FDG PET voxel SUV in two example FLARE-RT protocol patients blinded to model training (A, B). Patient A presents with homogeneous response over moderate SUV range, resulting in mean absolute prediction error of 1.1 SUV [g/cm3], with most voxels over-responding to conventionally fractionated RT dose relative to prediction. Patient B presents with low response over large SUV range, resulting in biased prediction at high SUV and high mean absolute prediction error of 3.3 SUV [g/cm3]. SUV voxels in Patient B that under-responded to conventionally fractionated RT doses compared to prediction may define high-risk regions.

Figure 7.

Figure 7.

Comparison of Voxel Forecast predicted (A, D) vs. observed (B, E) Week 3 mid-treatment FDG PET tumor images in the same blinded FLARE-RT protocol patients as Figure 6. The first patient (top row) shows regions of over-response (C, ΔSUV = SUVpredicted – SUVobserved > 0) with low systematic bias in spatially variant prediction. The second patient (bottom row) shows systematic under-response (F, ΔSUV = SUVpredicted – SUVobserved < 0) throughout the tumor to conventionally fractionated RT doses, which may define high-risk regions.

A residue analysis of Voxel Forecast-defined under-responding tumor voxels, relative to prediction, was performed. Table 4 summarizes the percentage of under-responding voxels stratified into tertiles, which produced balanced subgroups for comparisons of baseline characteristics. Patients receiving proton pencil beam scanning therapy tended to have primary tumors with lower percentage of under-responding voxels compared to patients receiving conventional intensity modulated x-ray radiation therapy (p=0.077). Specifically, 6/12 (50%) patients receiving proton therapy had fewer than 25% under-responding primary tumor voxels, compared to 1/13 (8%) patients receiving x-ray therapy. More importantly, patients whose tumors contained higher percentages of under-responding voxels relative to prediction were more likely to be classified as mid-treatment PET non-responders according to the FLARE-RT protocol (p=0.030), and in turn these non-responders had a significantly worse overall survival prognosis than responders (Figure 8, p<0.001).

Table 4.

Patient/tumor characteristics stratified by tertiles of under-responding voxel percentages relative to prediction.

% of Under-Responding Voxels
Variable <25% (N=7) 25–75% (N=9) >75% (N=9) P-value
 Female sex 2 (28.6) 7 (77.8) 6 (66.7) 0.20
 Age, years 60 (50 – 70) 66 (59 – 75) 58 (51 – 72) 0.22
 Adenocarcinoma (vs. squamous cell carcinoma) 5 (71.4) 7 (77.8) 7 (77.8) >0.99
 Stage IIIB (vs. other) 3 (42.9) 1 (11.1) 4 (44.4) 0.31
 Proton Beam Therapy (vs. X-ray Radiation Therapy) 6 (85.7) 3 (33.3) 3 (33.3) 0.077
 Chemotherapy modality >0.99
  Carboplatin + paclitaxel 4 (57.1) 6 (66.7) 6 (66.7)
  Cisplatin + etoposide 3 (42.9) 3 (33.3) 3 (33.3)
 Primary metabolic tumor volume, cm3 27 (3 – 305) 32 (16 – 396) 25 (2 – 351) 0.56
 SUVmax, g/cm3 11 (6 – 32) 10 (4 – 23) 11 (3 – 23) 0.95
 SUVmean, g/cm3 5 (2 – 16) 5 (2 – 12) 6 (2 – 11) 0.97
 TLG, g 204 (12 – 1566) 257 (38 – 4408) 247 (5 – 2847) 0.63
 Ring/concave uptake pattern (vs. foci/convex) 3 (42.9) 5 (55.6) 4 (44.4) >0.99
 Time from pre-RT PET to mid-RT PET, days 30 (23 – 44) 26 (21 – 51) 21 (16 – 44) 0.23
 Time from pre-RT PET to RT start, days 14 (6 – 28) 10 (5 – 33) 5 (1 – 28) 0.21
 Number of fractions 12 (11 – 13) 12 (11 – 13) 12 (10 – 13) 0.81
 Mean dose, Gy 24 (23 – 28) 25 (23 – 28) 25 (19 – 27) 0.69
 Mid-RT PET Non-responder 0 (0.0) 4 (44.4) 6 (66.7) 0.030

Values are no. (%) or median (range) unless otherwise specified.

Figure 8.

Figure 8.

Kaplan-Meier estimated overall survival stratified by mid-treatment PET response status according to the FLARE-RT protocol (log rank p<0.001).

Discussion

We have proposed a voxel forecast modeling framework that uses variogram weighting to account for the spatial dependence between voxels within the same image, conducted a simulation study to measure the performance of the framework under known conditions, and applied the framework to predict Week 3 mid-RT tumor voxel SUV in a cohort of patients with locally advanced NSCLC enrolled on the FLARE-RT phase II clinical trial. The overall mean absolute prediction SUV error of Voxel Forecast improved from 1.5 g/cm3 when only patient-level predictors were used to 1.0 g/cm3 when voxel-level predictors were included in the model. Pre-RT voxel SUV was the strongest individual predictor of Week 3 mid-RT voxel SUV in response to uniformly prescribed tumor radiation dose.

Spatially heterogeneous prediction errors in mid-RT PET SUV across voxels and contiguous regions were influenced by patient-level and voxel-level factors. Some patients had tumors with low systematic bias in voxel forecasted FDG PET response, while other patients had tumors with large bias in voxel forecasted FDG PET response. These Voxel Forecast residuals have the potential to provide unique and powerful decision support for spatially variant response assessment in the context of adaptive cancer therapy, by defining tumor voxels that either under-respond or over-respond relative to prediction. Residue analysis revealed that primary tumor voxels treated with proton pencil beam scanning therapy may be less likely to under-respond compared to voxels treated with conventional x-ray intensity modulated radiation therapy. Reports of improved clinical outcomes in locally advanced NSCLC patients receiving proton beam therapy (43) motivate further investigation of spatially variant response patterns in this subpopulation. Patients with a higher percentage of under-responding primary tumor voxels were more likely to be classified as protocol-defined mid-RT PET non-responders, which in turn was a significant detriment to overall survival. Under-responding tumor voxels on mid-treatment PET may define high-risk regions for altering patient management in clinical oncology, from selecting radiation therapy modalities to adaptively intensifying radiation dose (33). Further planned accrual of patients onto the open FLARE-RT phase II trial and outcome data maturation will support a future treatment failure pattern analysis in relation to Voxel Forecast response to refine high-risk region definition.

The Voxel Forecast multiscale regression framework can potentially support extensions to several clinical applications. Beyond early response imaging, post-treatment imaging patterns of response can be predicted. Specific to the FLARE-RT trial, post-treatment PET voxel response can be predicted to include pre-treatment PET, mid-treatment PET, and RT dose voxel covariates from both spatially uniform and spatially adapted non-uniform prescriptions. Spatially variant post-treatment response prediction can consist of continuous voxel-level SUV changes as well as probabilities of voxel-level cancer recurrence from dichotimized PET SUV data. Voxel Forecast of temporal response dynamics across imaging time points, ranging from daily / weekly quantitative CT / MR imaging during therapy to quarterly / semi-annual surveillance CT after therapy, can be explored. In the broader context of precision oncology, Voxel Forecast can be adapted to predict multiscale response from an array of molecularly targeted and immune checkpoint blockade cancer therapies. These agents can be modeled as multiscale variables, both as global spatially uniform interventions and pharmacodynamically informed spatial biodistributions. In all cases, the spatial correlation structure of the therapeutic variable and its interaction with other patient-level and voxel-level input variables would help drive imaging-based response predictions. This approach has the potential to better differentiate treatment effects, so-called pseudo-progression, from true disease progression by leveraging spatial information from quantitative longitudinal imaging. Lastly, Voxel Forecast can operate on functional imaging of normal tissue, including but not limited to pulmonary perfusion/ventilation, hepatocyte activity, and bone marrow proliferation, to predict spatially variant organ function response to cancer therapies.

Other investigations have proposed technical methods for predicting spatial, temporal, or combined spatio-temporal data. Multiple linear regression was used to predict voxel-level PET response to radiation therapy and included covariates of voxel-neighborhood spatial correlation (38). Time series predictions of voxel neighborhood-averaged signals on functional MRI were built on Bayesian frameworks that leveraged prior knowledge to predict future time-intensity curve states and kinetic parameters at the regional scale (4446) or voxel scale (47). Outside of oncology and medical imaging research, spatio-temporal Kriging models of geological and meteorological data have combined physical equations with empirical spatial statistics for forecasting (4850). These methods rely on strong mathematical modeling assumptions regarding the spatio-temporal structure, which remains a challenge in predictive oncology when forecasting individual patient tumor response to cancer therapies (51). In contrast, Voxel Forecast makes assumptions about the spatial structure to gain statistical efficiency but does not rely on those assumptions to produce correct inferences. Random Forest ensemble learning models also do not require many mathematical assumptions of the underlying structure of the data; in fact, studies of densely sampled spatio-temporal data were used to train such models that closely matched the performance of Kriging spatio-temporal prediction models without strict requirements for data pre-processing and model parameterization (52). In contrast to Voxel Forecast, Random Forest models can automatically discover non-linear relationships and interactions between variables. However, such models require substantially more data to train and are less transparent for model interpretation. Predictor variable importance can be ranked, but the quantitative effect size of each variable on the predicted outcome, as well as any interactions between variables, is difficult to estimate. In clinical oncology where therapies are rapidly evolving, independent patient sample sizes remain modest and therefore limit high dimensional parametric spatio-temporal modeling.

The Voxel Forecast algorithm has several notable strengths. The algorithm is agnostic to quantitative imaging modality, therapeutic intervention, spatial scale, and correlation structure when making response predictions. While our application focused on predicting mid-treatment FDG PET imaging-based tumor response to concurrent chemotherapy and radiation therapy at the voxel scale, Voxel Forecast could have been applied to quantitative CT and MRI to predict multiscale response to immune checkpoint inhibitors in combination with radiation therapy. Furthermore, Voxel Forecast does not impose restrictions on the spatial scale of predictor variables, which can flexibly range from individual voxels to individual lesions and global patient parameters. This implies that Voxel Forecast could be trained to differentiate response to cancer therapies between patients, between lesions, between voxel clusters within lesions, and between lesion voxels as part of a comprehensive characterization of response heterogeneity. For example, region-level and voxel-level information on mediastinal and hilar lymph nodes would enable Voxel Forecast to predict spatially variant response patterns beyond the primary tumor. Statistical inference for the regression coefficients, including confidence intervals and p-values from hypothesis tests, is based on a non-parametric jack-knife approach that was validated for small patient sample sizes. This allows inferences to be robust to misspecification of the distribution of the outcome variable and the assumed voxel correlation structure, which could be more complex than a stationary Matérn variogram with parameter values shared across patients.

The current Voxel Forecast algorithm also has several limitations, some of which may be addressed by future algorithmic development. The core of the algorithm is a linear and additive structure which does not automatically capture many complex phenomena. However, this allows the model to achieve transparency and interpretability, and can serve as a useful approximation. This limitation can be addressed to some degree with inclusion of non-linear terms and interactions, particularly those derived from engineered or learned imaging features. An additional feature selection or clustering routine for benchmarking a subsequent voxel radiomics response prediction model would be necessary. Extensions to allow non-parametric fitting of non-linear terms such with splines or kernel smoothers may also be possible (53, 54). At present, only a single time point can be predicted, which avoided the requirement for complex modeling of time and space at the same time. Approaches for this have been proposed, though with strong underlying mathematical assumptions (47, 55). We are exploring ways to extend the algorithm to handle multiple time points while maintaining the current semi-parametric approach that minimizes structural assumptions. Voxel Forecast was originally developed for continuous response prediction. The estimating equations described in the supplement can in principle be extended to binary response classification using generalized linear models and generalized estimating equation frameworks (56), which have been previously modified for prediction of spatially correlated discrete and categorical data in 2D applications with a relatively small number of samples per patient (57, 58). However, Voxel Forecast performance may not be the same as for continuous outcomes, so further study and simulations are needed. Despite these present limitations, the statistical transparency and multiscale prediction flexibility of Voxel Forecast may complement techniques such as spatio-temporal deep learning (59), voxel-level optical flow combined with encoder-decoder convolutional neural networks (CNN) (60), 3D CNN for voxel-to-voxel prediction (61), and generative adversarial networks (GAN) for voxel image synthesis (62). Future investigations on spatially variant tumor voxel response prediction should seek to leverage the combined strengths of multiple statistical and artificial intelligence approaches.

In conclusion, the Voxel Forecast statistical framework consists of multiscale variogram-weighted regression to predict tumor voxel response patterns during and after cancer therapy. We have characterized the accuracy and precision of Voxel Forecast with simulations of response prediction, demonstrated that the framework remains robust in the setting of small sample sizes, and successfully generated a preliminary PET tumor voxel response prediction model in patients with locally advanced non-small cell lung cancer enrolled on the FLARE-RT clinical trial. Further training and validation of Voxel Forecast in larger patient cohorts for predicting quantitative imaging biomarker spatio-temporal response to cancer therapies may provide powerful clinical decision support and inform personalized dosing strategies.

Supplementary Material

1

Translational Relevance.

Advanced prediction algorithms based on imaging biomarkers, as complements to tissue and peripheral biomarkers, enhance clinical decision support for patient risk stratification and response assessment. We demonstrate that multiscale patient factors and imaging volume element (voxel) variables on fluorodeoxyglucose PET combine to predict spatially variant response during the third week of concurrent chemoradiotherapy in patients with locally advanced non-small cell lung cancer. Our novel Voxel Forecast tool can highlight tumor voxels that are responding well and those that are not, empowering cancer medicine providers to initiate personalized treatment adaptations. Forecasting spatially variant cancer response dynamics in individual patients provides powerful imaging biomarker-based guidance to the growing therapeutic arsenal in precision oncology.

Acknowledgments

This work was funded by NIH / NCI R01CA204301. We thank Hannah Thomas, Balu Sasidharan and Anna Reyes for their coordination of clinical and imaging data collection. We acknowledge the efforts of nuclear medicine and radiation oncology technologists during PET/CT acquisitions, radiation therapy planning, and image-guided radiation therapy delivery.

Financial Support: NIH/NCI R01CA204301

Conflict of Interest: DSH declares grants from Philips Healthcare, GE Healthcare, Toshiba America Medical Systems, and Siemens Medical Solutions USA, outside the submitted work. RSM and HJV are consultants for MIM Software and declare grants from Philips Healthcare, outside the submitted work. PEK is co-founder of PET/X, LLC, and declares grants from GE Healthcare, outside the submitted work.

References

  • 1.Mirnezami R, Nicholson J, Darzi A. Preparing for precision medicine. N Engl J Med 2012;366:489–491. [DOI] [PubMed] [Google Scholar]
  • 2.Chae YK, Pan AP, Davis AA, et al. Path toward Precision Oncology: Review of Targeted Therapy Studies and Tools to Aid in Defining “Actionability” of a Molecular Lesion and Patient Management Support. Mol Cancer Ther 2017;16:2645–2655. [DOI] [PubMed] [Google Scholar]
  • 3.Morash M, Mitchell H, Beltran H, et al. The Role of Next-Generation Sequencing in Precision Medicine: A Review of Outcomes in Oncology. J Pers Med 2018;8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Godwin JL, Zibelman M, Plimack ER, et al. Immune checkpoint blockade as a novel immunotherapeutic strategy for renal cell carcinoma: a review of clinical trials. Discov Med 2014;18:341–350. [PubMed] [Google Scholar]
  • 5.Su MY, Fisher DE. Immunotherapy in the Precision Medicine Era: Melanoma and Beyond. PLoS Med 2016;13:e1002196. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Heppt MV, Steeb T, Schlager JG, et al. Immune checkpoint blockade for unresectable or metastatic uveal melanoma: A systematic review. Cancer Treat Rev 2017;60:44–52. [DOI] [PubMed] [Google Scholar]
  • 7.Pianko MJ, Liu Y, Bagchi S, et al. Immune checkpoint blockade for hematologic malignancies: a review. Stem Cell Investig 2017;4:32. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Borcherding N, Kolb R, Gullicksrud J, et al. Keeping Tumors in Check: A Mechanistic Review of Clinical Response and Resistance to Immune Checkpoint Blockade in Cancer. J Mol Biol 2018;430:2014–2029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Harding JJ. Immune checkpoint blockade in advanced hepatocellular carcinoma: an update and critical review of ongoing clinical trials. Future Oncol 2018;14:2293–2302. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Emens LA, Ascierto PA, Darcy PK, et al. Cancer immunotherapy: Opportunities and challenges in the rapidly evolving clinical landscape. Eur J Cancer 2017;81:116–129. [DOI] [PubMed] [Google Scholar]
  • 11.Xia W, Wang J, Xu Y, et al. L-BLP25 as a peptide vaccine therapy in non-small cell lung cancer: a review. J Thorac Dis 2014;6:1513–1520. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Freeman-Keller M, Goldman J, Gray J. Vaccine immunotherapy in lung cancer: Clinical experience and future directions. Pharmacol Ther 2015;153:1–9. [DOI] [PubMed] [Google Scholar]
  • 13.Signorelli C, Odone A, Ciorba V, et al. Human papillomavirus 9-valent vaccine for cancer prevention: a systematic review of the available evidence. Epidemiol Infect 2017;145:1962–1982. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Chamani R, Ranji P, Hadji M, et al. Application of E75 peptide vaccine in breast cancer patients: A systematic review and meta-analysis. Eur J Pharmacol 2018;831:87–93. [DOI] [PubMed] [Google Scholar]
  • 15.Gillies RJ, Kinahan PE, Hricak H. Radiomics: Images Are More than Pictures, They Are Data. Radiology 2016;278:563–577. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Aerts HJ, Velazquez ER, Leijenaar RT, et al. Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach. Nat Commun 2014;5:4006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.van Griethuysen JJM, Fedorov A, Parmar C, et al. Computational Radiomics System to Decode the Radiographic Phenotype. Cancer Res 2017;77:e104–e107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Mankoff DA, Farwell MD, Clark AS, et al. Making Molecular Imaging a Clinical Tool for Precision Oncology: A Review. JAMA Oncol 2017;3:695–701. [DOI] [PubMed] [Google Scholar]
  • 19.Huang W, Zhou T, Ma L, et al. Standard uptake value and metabolic tumor volume of (1)(8)F-FDG PET/CT predict short-term outcome early in the course of chemoradiotherapy in advanced non-small cell lung cancer. Eur J Nucl Med Mol Imaging 2011;38:1628–1635. [DOI] [PubMed] [Google Scholar]
  • 20.van Elmpt W, Ollers M, Dingemans AM, et al. Response assessment using 18F-FDG PET early in the course of radiotherapy correlates with survival in advanced-stage non-small cell lung cancer. J Nucl Med 2012;53:1514–1520. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Yossi S, Krhili S, Muratet JP, et al. Early assessment of metabolic response by 18F-FDG PET during concomitant radiochemotherapy of non-small cell lung carcinoma is associated with survival: a retrospective single-center study. Clin Nucl Med 2015;40:e215–221. [DOI] [PubMed] [Google Scholar]
  • 22.Gensheimer MF, Hong JC, Chang-Halpenny C, et al. Mid-radiotherapy PET/CT for prognostication and detection of early progression in patients with stage III non-small cell lung cancer. Radiother Oncol 2017;125:338–343. [DOI] [PubMed] [Google Scholar]
  • 23.Chin AL, Kumar KA, Guo HH, et al. Prognostic Value of Pretreatment FDG-PET Parameters in High-dose Image-guided Radiotherapy for Oligometastatic Non-Small-cell Lung Cancer. Clin Lung Cancer 2018;19:e581–e588. [DOI] [PubMed] [Google Scholar]
  • 24.Kong F, Wang J, Wong K, et al. Inter-method Comparison and Optimization of [18F] FDG PET Metabolic Response Assessment in Non-Small Cell Lung Cancer. Pract Radiat Oncol 2013;3:S23. [DOI] [PubMed] [Google Scholar]
  • 25.Wang J, Wong KK, Piert M, et al. Metabolic response assessment with (18)F-FDG PET/CT: inter-method comparison and prognostic significance for patients with non-small cell lung cancer. J Radiat Oncol 2015;4:249–256. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Aerts HJ, van Baardwijk AA, Petit SF, et al. Identification of residual metabolic-active areas within individual NSCLC tumours using a pre-radiotherapy (18)Fluorodeoxyglucose-PET-CT scan. Radiother Oncol 2009;91:386–392. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Aerts HJ, Bussink J, Oyen WJ, et al. Identification of residual metabolic-active areas within NSCLC tumours using a pre-radiotherapy FDG-PET-CT scan: a prospective validation. Lung Cancer 2012;75:73–76. [DOI] [PubMed] [Google Scholar]
  • 28.Ohri N, Bodner WR, Halmos B, et al. (18)F-Fluorodeoxyglucose/Positron Emission Tomography Predicts Patterns of Failure After Definitive Chemoradiation Therapy for Locally Advanced Non-Small Cell Lung Cancer. Int J Radiat Oncol Biol Phys 2017;97:372–380. [DOI] [PubMed] [Google Scholar]
  • 29.Zhu SH, Zhang Y, Yu YH, et al. FDG PET-CT in non-small cell lung cancer: relationship between primary tumor FDG uptake and extensional or metastatic potential. Asian Pac J Cancer Prev 2013;14:2925–2929. [DOI] [PubMed] [Google Scholar]
  • 30.Ohri N, Bodner WR, Kabarriti R, et al. Positron Emission Tomography-Adjusted Intensity Modulated Radiation Therapy for Locally Advanced Non-Small Cell Lung Cancer. Int J Radiat Oncol Biol Phys 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.van Diessen J, De Ruysscher D, Sonke JJ, et al. The acute and late toxicity results of a randomized phase II dose-escalation trial in non-small cell lung cancer (PET-boost trial). Radiother Oncol 2018. [DOI] [PubMed] [Google Scholar]
  • 32.van Elmpt W, De Ruysscher D, van der Salm A, et al. The PET-boost randomised phase II dose-escalation trial in non-small cell lung cancer. Radiother Oncol 2012;104:67–71. [DOI] [PubMed] [Google Scholar]
  • 33.Lee E, Zeng J, Miyaoka RS, et al. Functional lung avoidance and response-adaptive escalation (FLARE) RT: Multimodality plan dosimetry of a precision radiation oncology strategy. Med Phys 2017;44:3418–3429. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Gupta RG, Somer RA. Intratumor Heterogeneity: Novel Approaches for Resolving Genomic Architecture and Clonal Evolution. Mol Cancer Res 2017;15:1127–1137. [DOI] [PubMed] [Google Scholar]
  • 35.Bowen SR, Yuh WTC, Hippe DS, et al. Tumor radiomic heterogeneity: Multiparametric functional imaging to characterize variability and predict response following cervical cancer radiation therapy. J Magn Reson Imaging 2018;47:1388–1396. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Bowen SR, Chappell RJ, Bentzen SM, et al. Spatially resolved regression analysis of pre-treatment FDG, FLT and Cu-ATSM PET from post-treatment FDG PET: an exploratory study. Radiother Oncol 2012;105:41–48. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Petit SF, Aerts HJ, van Loon JG, et al. Metabolic control probability in tumour subvolumes or how to guide tumour dose redistribution in non-small cell lung cancer (NSCLC): an exploratory clinical study. Radiother Oncol 2009;91:393–398. [DOI] [PubMed] [Google Scholar]
  • 38.Bradshaw T, Fu R, Bowen S, et al. Predicting location of recurrence using FDG, FLT, and Cu-ATSM PET in canine sinonasal tumors treated with radiotherapy. Phys Med Biol 2015;60:5211–5224. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Bissonnette JP, Yap ML, Clarke K, et al. Serial 4DCT/4DPET imaging to predict and monitor response for locally-advanced non-small cell lung cancer chemo-radiotherapy. Radiother Oncol 2018;126:347–354. [DOI] [PubMed] [Google Scholar]
  • 40.Lipsitz SR, Laird NM, Harrington DP. Using the Jackknife to Estimate the Variance of Regression-Estimators from Repeated Measures Studies. Communications in Statistics-Theory and Methods 1990;19:821–845. [Google Scholar]
  • 41.Graler B, Pebesma E, Heuvelink G. Spatio-Temporal Interpolation using gstat. R Journal 2016;8:204–218. [Google Scholar]
  • 42.Pebesma EJ. Multivariable geostatistics in S: the gstat package. Computers & Geosciences 2004;30:683–691. [Google Scholar]
  • 43.Vyfhuis MAL, Onyeuku N, Diwanji T, et al. Advances in proton therapy in lung cancer. Ther Adv Respir Dis 2018;12:1753466618783878. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Schmid VJ, Whitcher B, Padhani AR, et al. A Bayesian hierarchical model for the analysis of a longitudinal dynamic contrast-enhanced MRI oncology study. Magn Reson Med 2009;61:163–174. [DOI] [PubMed] [Google Scholar]
  • 45.Sanyal N, Ferreira MA. Bayesian hierarchical multi-subject multiscale analysis of functional MRI data. Neuroimage 2012;63:1519–1531. [DOI] [PubMed] [Google Scholar]
  • 46.Tietze A, Nielsen A, Klaerke Mikkelsen I, et al. Bayesian modeling of Dynamic Contrast Enhanced MRI data in cerebral glioma patients improves the diagnostic quality of hemodynamic parameter maps. PLoS One 2018;13:e0202906. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Katanoda K, Matsuda Y, Sugishita M. A spatio-temporal regression model for the analysis of functional MRI data. Neuroimage 2002;17:1415–1428. [DOI] [PubMed] [Google Scholar]
  • 48.Adam-Poupart A, Brand A, Fournier M, et al. Spatiotemporal modeling of ozone levels in Quebec (Canada): a comparison of kriging, land-use regression (LUR), and combined Bayesian maximum entropy-LUR approaches. Environ Health Perspect 2014;122:970–976. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Liang D, Kumar N. Time-space Kriging to address the spatiotemporal misalignment in the large datasets. Atmos Environ (1994) 2013;72:60–69. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Zhan Y, Luo Y, Deng X, et al. Satellite-Based Estimates of Daily NO2 Exposure in China Using Hybrid Random Forest and Spatiotemporal Kriging Model. Environ Sci Technol 2018;52:4180–4189. [DOI] [PubMed] [Google Scholar]
  • 51.Yankeelov TE, Quaranta V, Evans KJ, et al. Toward a science of tumor forecasting for clinical oncology. Cancer Res 2015;75:918–923. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Hengl T, Nussbaum M, Wright MN, et al. Random forest as a generic framework for predictive modeling of spatial and spatio-temporal variables. PeerJ 2018;6:e5518. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Racine J, Li Q. Nonparametric estimation of regression functions with both categorical and continuous data. Journal of Econometrics 2004;119:99–130. [Google Scholar]
  • 54.Wood SN, Augustin NH. GAMs with integrated model selection using penalized regression splines and applications to environmental modelling. Ecological Modelling 2002;157:157–177. [Google Scholar]
  • 55.George B, Aban I. Selecting a separable parametric spatiotemporal covariance structure for longitudinal imaging data. Statistics in Medicine 2015;34:145–161. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Liang KY, Zeger SL. Longitudinal Data-Analysis Using Generalized Linear-Models. Biometrika 1986;73:13–22. [Google Scholar]
  • 57.Albert PS, McShane LM. A generalized estimating equations approach for spatially correlated binary data: Applications to the analysis of neuroimaging data. Biometrics 1995;51:627–638. [PubMed] [Google Scholar]
  • 58.Gotway CA, Stroup WW. A Generalized Linear Model Approach to Spatial Data Analysis and Prediction. Journal of Agricultural, Biological, and Environmental Statistics 1997;2:157–178. [Google Scholar]
  • 59.Ke JT, Zheng HY, Yang H, et al. Short-term forecasting of passenger demand under on-demand ride services: A spatio-temporal deep learning approach. Transportation Research Part C-Emerging Technologies 2017;85:591–608. [Google Scholar]
  • 60.Liu Z, Yeh R, Tang X, et al. Video Frame Synthesis using Deep Voxel Flow. International Conference on Computer Vision Venice, Italy: IEEE Computer Society; 2017. [Google Scholar]
  • 61.Moon G, Chang JY, Lee KM. V2V-PoseNet: Voxel-to-Voxel Prediction Network for Accurate 3D Hand and Human Pose Estimation from a Single Depth Map Computer Vision and Pattern Recognition. Salt Lake City, UT: IEEE Computer Society; 2018. [Google Scholar]
  • 62.Nie D, Trullo R, Lian J, et al. Medical Image Synthesis with Context-Aware Generative Adversarial Networks. Med Image Comput Comput Assist Interv 2017;10435:417–425. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1

RESOURCES