Abstract
Tissue perfusion plays a critical role in oncology. Growth and migration of cancerous cells requires proliferation of networks of new blood vessels through the process of tumor angiogenesis. Many imaging technologies developed recently attempt to measure characteristics pertaining to the passage of fluid through blood vessels, thereby providing a noninvasive means for cancer detection, as well as treatment prognostication, prediction, and monitoring. However, because these techniques require a sequence of successive imaging scans under administration of intravenous imaging tracers, the quality of the resulting perfusion data depends on the acquisition protocol. In this paper, we explain how to infer stability for stochastic curve estimation. The topic is motivated by two recent attempts to determine stable acquisition durations for acquiring perfusion characteristics using dynamic computed tomography, wherein inference used inappropriate statistical methods. Notably, when appropriate statistical techniques are used, the resulting conclusions deviate substantially from those previously reported in the literature.
Keywords: dynamic computed tomography, equivalence testing, semiparametric regression, perfusion imaging, penalized splines
Introduction
Many imaging technologies developed recently attempt to measure characteristics pertaining to the passage of fluid through blood vessels, thereby providing a noninvasive means to quantify vascular features.1 Perfusion is of particular interest in oncologic imaging, where tissue, and in particular tumor perfusion, plays a critical role. The growth and migration of cancerous cells requires proliferation of networks of new blood vessels through the process of tumor angiogenesis, triggering modifications to the vasculature of the surrounding host tissue. In principle, measurements obtained from perfusion imaging provide physiological correlates for neovascularization induced by tumor angiogenesis.2
Thus, many investigators in cancer biology and oncology are attempting to use these features to better understand the pathophysiological processes at play in the tumor microenvironment. Ultimately, these efforts aim to identify biomarkers based on perfusion phenotypes that could be uti-lized for cancer detection, disease prognostication, as well as prediction and monitoring of therapeutic response to intervention.3–6
Perfusion computed tomography (CTp) is one such functional imaging technology that enables noninvasive observation and quantification of perfusion characteristics. Physiological models have been developed to quantify a variety of perfusion characteristics (such as tumor blood volume, capillary permeability) that derive from measuring temporal changes in contrast enhancement obtained from CT images acquired over a period of time during intravenous administration of a contrast medium.7 Consequently, CTp provides a quantitative basis for evaluating vasculature heterogeneity. The functional imaging technology has been utilized in a number of organs and tumors, including the prostate, colorectal, head and neck, lung, liver, and normal tissue.
Because such techniques require a sequence of successive scans under intravenous administration of a contrast medium, the quality of the resulting perfusion data depends on the manner in which the data is acquired. When specifying an acquisition protocol, investigators must determine several factors that could affect the quality of the resultant perfusion measurements. For example, one important factor involves the delineation of the preenhancement setpoint, or time/image at which the arterial up-slope is considered to first occur.8,9 In order to avoid excessive radiation exposure, patients are not scanned continuously, but rather at regular intervals over the course of the acquisition. Thus, investigators must determine the interscan subsampling interval to use in the acquisition. In a recent study,10 the length of the subsampling intervals was shown to significantly impact the resulting perfusion characteristics. In addition, investigators must determine an acquisition duration, or the extent of time for which the patient must undergo repeated scanning, that yields stable quantification of the perfusion characteristics. To limit radiation exposure, the acquisition duration should be minimized. Moreover, because the tissue type may be unknown before the diagnosis, any proposed duration of acquisition must ensure stable quantification of CTp characteristics for both malignant and healthy tissues before CTp can be used for detection and prognostication.
In the cases of two recent attempts to determine stable acquisition duration for acquiring perfusion characteristics in body tumors using dynamic CT,11,12 recommendations were put forth that were inferred using statistical methods that are inappropriate for addressing this objective. In this context, the investigators implemented t-tests between CTp values obtained at discrete acquisition durations using a traditional hypothesis-testing framework. Information pertaining to neighboring scans was ignored in the inference, and stability was concluded in the absence of significant differences for tests between successive scans. In the case of one study,12 conclusions were also based on measures of linear dependence between pairs of intrapatient observations at successive scans.
It is well known that the traditional formulation of the hypothesis-testing problem considers equality of effects under the null hypothesis, with the alternative hypothesis character-izing inequality. The corresponding P-value provides a measure of evidence against the null hypothesis, not for it. Because the roles assumed by the null and alternative statements are logically asymmetric, equivalence should not be inferred from the absence of a significant difference, since, intuitively, any underpowered study would inevitably reach this conclusion. Moreover, it has been well described that measures of linear dependence are misleading and inappropriate for evaluating equivalence or “agreement”.13 A proper analysis requires an equivalence-testing framework that measures the evidence against nonequivalence in relation to a prespecified equivalence region.
In addition, the pairwise approaches to inference utilized in both studies ignore temporal trends in the data, masking stabilization as a function of acquisition time. Notably, when appropriate statistical techniques are applied, conclusions deviate substantially from those provided by the aforementioned authors.14 Heretofore, an appropriate method for inferring stable acquisition durations for acquiring imaging biomarkers from dynamic imaging modalities has yet to be explained. Nor has the concept of “equivalence testing”15 been appreciated by the oncologic imaging community. In this paper, we explain how statistical modeling can be used to infer stable domains for stochastic curve estimation.
The ideas in this paper are presented in the following sequence. First we present the general method. Thereafter, we demonstrate the method by evaluating acquisition durations for a perfusion biomarker acquired in metastatic sites to the liver as well as healthy liver tissue using semiparametric model inference. We provide concluding remarks in the last section.
Inferring Stable Acquisition Durations from Stochastic Curves
This section presents a formal definition for stability as well as a general approach to inference based on equivalence testing.
Stability criterion
Let t > 0 denote the acquisition duration, and let f (t) characterize the nonstochastic mapping of a perfusion-based biomarker as a function of t. Let denote the derivative or velocity of the function at acquisition time t. f’(t) characterizes the infinitesimal rate of change in f (t) with respect to the change in acquisition time t. Mapping f (t) is considered to be δ-stable, δ > 0, at acquisition time point t0 if
(1) |
The function has attained stability at time t if its velocity is bounded within a neighborhood of zero for all subsequent time points. Thus, stability condition (1) is satisfied for all δ > 0 if f’(t) reaches a steady state (or is time invariant) beyond t0: f’(t*) = 0, for all t* > t0. Therefore, we can evaluate acquisition durations for time invariance by fitting smooth curves to the observed data and conducting inference on the corresponding derivatives to assess their relative proximity to zero as a function of time.
Stability inference
Let yt denote a stochastic response variable associated with a perfusion biomarker acquired for one patient region. A general nonparametric additive model applies local regression to a low-dimensional projection of the data. For example, we may assume that a one-to-one transformation of y, g(y), varies symmetrically about mean f(t) with random error ε and constant error variance ,
(2) |
Mapping f (t) represents an arbitrary function of time, which can be estimated using smoothing splines or lowess.16
The traditional approach to statistical analysis through hypothesis testing is valid when the aim of an experiment is to evaluate the evidence for differences among experimental conditions. However, the condition of “stability” is actually a statement of equivalence. A proper analysis requires an equivalence-testing framework that measures the evidence against nonequivalence in relation to a prespecified “equivalence” region.
Let τ > 0 denote the maximum observation period, 0 < t < τ, and let Lα(t) and Uα(t) denote the lower and upper bounds of the 100(1 − α)% simultaneous confidence band (CB1−α) for f’(t) over the interval (t, τ). Statistically, one should infer that f (t) is stable at acquisition duration t0 at significance level α if the corresponding CB1−α encompassing all subsequent acquisition durations are contained within a sufficiently small neighborhood of zero (−λ, λ), that is
The approach is analogous to testing null hypotheses of nonequivalence with equivalence region (−λ, λ). The boundary parameter λ represents the minimal magnitude of deviation that is meaningful in the context of the analysis. This may be specified as a scaled multiple of the estimated residual error standard deviation.
Case Study in CT Perfusion
In this section, we demonstrate the method for stability inference presented in the previous section using semiparameteric regression with implementation to the perfusion characteristic most commonly utilized in oncology, namely blood flow (BF). Specifically, spline regression is used to avoid prespecification of a parametric form for the underlying functional relationships, which are often unknown. As demonstrated in Ref. 14, deconvolution modeling of dynamic CT requires acquisition durations of sufficient length in order to achieve accurate quantification of a patient’s perfusion characteristics. Before attaining steady states, these models yield biomarkers that are characterized by periods of noisy fluctuation. The dynamic periods are explained in part by the initial absorption of contrast. Ensuring stable quantification for the various perfusion scanning applications in oncology requires the implementation of acquisition protocols that use acquisition durations that yield relative time-invariant mappings. We will use the statistical model to flexibly estimate the mean velocity in the presence of stochastic curves. The stability criterion will be used to infer a minimum stabilization time for blood flow when acquired in metastatic sites in liver as well as healthy liver.
CT perfusion data
The study collected data on 16 patients with neuroendocrine liver metastases in whom CTp had been undertaken on a target lesion in the liver. CT perfusion images were obtained from a dual-phase protocol spanning a duration of 590 seconds. BF was acquired using a deconvolution analysis with the distributed parameter physiological model.7,17,18 BF is the rate measured as milliliters per minute per 100 grams of liver tissue (mL/min per 100 g). The dataset analyzed here consisted of 59 eight-slice cine images temporally sampled at 0.5 seconds from the phase 1 acquisition, together with 8 anatomically matched images from the phase 2 acquisition. A final BF value was obtained for each region of interest (ROI) by averaging across each of the eight CT slice images. There were 25 separate ROIs where BF was obtained in liver metastases and 27 separate ROIs where BF was obtained in normal liver tissue. The observed BF values were transformed to the log scale for the purpose of adjusting for conditionally asymmetric residual error at a given acquisition time and to mitigate heteroskedasticity as a function of acquisition time. Figure 1 provides the scatterplots of the observed log BF as a function of acquisition time for both types of tissue. Solid lines connect observations acquired from the same ROI, while dots characterize the observed scan times. The figure suggests that BF tends to be both elevated and more heterogenous in tumor sites when compared to normal liver.
Semiparametric model
We model the CTp curves using penalized splines due to their smoothness properties and the fact that a unified framework for computing simultaneous confidence bands has recently been established.19 Our case study analysis uses a truncated polynomial following the mixed model framework established by Ruppert et al.20, thereby enabling direct estimation and inference on the derivatives of the CTp curves as a function of acquisition duration.
Penalized spline regression
A spline basis is in essence a linear combination of piecewise polynomials. Let s denote a K × 1 vector of knot locations. At time t, a truncated polynomial spline basis of degree D is defined as
(3) |
where for the indicator function 1(). Note that, when restricted to each region of the time domain, the function is a Dth degree polynomial. f(t) is smooth for all t for D > 0. Unlike polynomials that possess all derivatives over the entire domain, splines possess all derivatives only at points that are not knots. The extent to which the spline is differentiable at a knot depends on the degree of the spline. For example, statistical inference with D = 1 approximates the derivative f’(t), with a step function over the time axis partition failing to yield continuity. In general, a Dth degree spline has no more than D − 1 continuous derivatives.
Penalized estimation uses constrained optimization to attempt to strike a balance between smoothness and a close fit to the data.20 In the mixed model representation, the spline coefficients u1,…,uK, are modeled as independent and identically distributed (i.i.d.) random effects with variance , and thus can be estimated using restricted maximum likelihood estimation (REML). Importantly, asymptotic simultaneous confidence bands can be obtained using Hotelling’s volume-of-tube formula.19
Let yi denote an mi × 1 vector of BF measurements acquired at times ti for the ith ROI. In addition, let y denote the column vector obtained from stacking each yi, i = 1,…, n. In our study, mi varies between 15 and 17. Adopting the mixed model notation provided by Ruppert et al.20, the semiparametric model is defined using design matrices X and Z corresponding to fixed and random components of the additive model, respectively,
4 |
where , , cov(ε, u) = 0, X = (1, t, t2,…,tD), , and u is the K × 1 vector of spline coefficients defined in (3). Let θ = (β, u) and define matrices
After estimating the variance components with REML, we can obtain the estimated best linear unbiased predictor (BLUP) of f (t), which follows as where and . As ϕ increases, becomes large relative to , and therefore the fit attributes more total variation to an increasingly complex relationship. Conversely, as ϕ decreases, the fit becomes more smooth, as more of the total variation is attributed to random fluctuation.
In addition, the residual sum-of-squared (RSS) errors can be used for cross validation, , or to conduct model criticism for comparing knot locations or among spline bases of varying degrees using Akaike’s information criterion: , where .
Derivative inference
Point and interval estimators can be derived for the rate of change as a function of acquisition time. Let g denote a G × 1 vector of grid points or acquisition times of interest on interval [0, τ], and let Cg = (Xg, Zg) denote the corresponding design matrix such that Xg and Zg are the column design matrices that result from application of the derivative operation to f(g):
The estimated BLUP for the derivative , at each point in g, follows as
5 |
Using the mixed model formulation, Ruppert et al.20 demonstrated that the corresponding large sample covariance of follows as
6 |
Thus, approximate 100(1 − α)% interval estimators can be computed by selecting an appropriate asymptotically justified critical value ,
7 |
A 100(1 − α)% pointwise confidence interval results from fixing at the (1 − α/2)th quantile of a Student’s t-distribution with n − DFfit degrees of freedom. Krivobokova et al.19 use Hotelling’s volume-of-tube formula to derive an approximate simultaneous CB1−α, whereby asymptotically the critical value is selected to satisfy
where ε = y − Cθ and m(g) accounts for bias due to shrinkage, m(g) = H(ϕ)TCθ − Cθ.
Results
Acquisition durations for BF from the liver perfusion study were inferred for CTp curves obtained in metastatic sites as well as regions of healthy liver (from the left or right lobes). For each tissue type, penalized spline regression analysis was implemented using truncated polynomial spline bases of D = 1, 2, 3. Our analysis used the package AdaptFitOS in statistical software R to implement REML and to select for the resulting simultaneous confidence bands.
Knots were placed at evenly spaced quantiles of the observed acquisition time points. While, in principle penalized spline estimators are robust to knot selection, because the sample sizes are rather small, the total number of knots were selected using the “corrected” version of AIC provided by Hurvich and Simonoff,21
AICc was also used to compare goodness of fit among splines of varying degree. Table 1 provides the AIC “optimal” numbers of knots for each spline degree and analysis in tumor and normal sites. In addition, the resulting AICc and RSS are provided for each model. Figure 2 plots the point and interval estimates for the BF maps obtained in tumor (top) and normal liver (bottom), as functions of acquisition time. The third-degree truncated polynomial basis clearly resulted in a smooth fit when compared to the piecewise linear model (first degree). An intermediate degree of smoothness is evident for the fit corresponding to the second-degree spline. As evident in Table 1, the first-degree spline yielded the best tradeoff between goodness of fit and model complexity as defined by AICc in tumor. For sites in normal liver, the extent of enhanced smoothness provided by the third-degree polynomial yielded the best tradeoff.
Table 1.
TISSUE TYPE | NUMBER KNOTS | AICC | RESIDUAL SS | ||||||
---|---|---|---|---|---|---|---|---|---|
SPLINE DEGREE | SPLINE DEGREE | SPLINE DEGREE | |||||||
1 | 2 | 3 | 1 | 2 | 3 | 1 | 2 | 3 | |
Tumor | 13 | 10 | 8 | 4.896 | 4.899 | 4.898 | 129.53 | 130.00 | 129.80 |
Normal | 11 | 9 | 8 | 4.387 | 4.387 | 4.385 | 77.88 | 78.05 | 77.86 |
Figure 3 provides the resultant estimated BLUP for the derivatives and the corresponding CB0.95, with an equivalence region (red) defined by ±0.5. The residual error standard deviation was estimated to be approximately 0.55 in tumor and 0.42 in normal tissue. Therefore, λ = 0.5 was chosen so that stability was measured in relation to the evidence that the mean curve varied less than approximately 1 standard deviation of random error. Using the approach described in the previous section, we concluded that CTp provides sufficiently stable characterization of BF when acquired for at least 220 seconds. Stabilization was evident sooner in normal liver, where a duration of 131 seconds yielded stable acquisition. This is not surprising, since, as noted in Figure 1, tissue perfusion tends to be more heterogeneous in regions undergoing tumor angiogenesis.
Discussion
In this paper, we described a statistically justified model-based approach for inferring stability for estimation of stochastic curves that eventually attain steady states. The effort was motivated in the oncologic imaging setting in the context of evaluating acquisition protocols for functional modalities that depend on a sequence of scans to acquire biomarkers that char-acterize biological processes associated with tumor angiogenesis. The approach was used to select acquisition durations that yield stable characterizations of a perfusion biomarker when acquired in metastatic sites in liver as well as normal liver tissue. It is important that the oncologic community recognize and use appropriate methods of inference when evaluating acquisition protocols for functional imaging modalities so that these promising technologies realize their full potential as tools for constructing biomarkers for guiding cancer detection, prognostication, and treatment selection.
Footnotes
ACADEMIC EDITOR: JT Efird, Editor in Chief
FUNDING: Brian Hobbs was supported in part by the University of Texas MD Anderson’s Cancer Center Support Grant NIH P30 CA016672. Chaan Ng was fully funded by the MD Anderson internal funds. The authors confirm that the funder had no influence over the study design, content of the article, or selection of this journal.
COMPETING INTERESTS: CSN reports grants received from GE Healthcare, outside the work presented here. BPH discloses no potential conflicts of interest.
Paper subject to independent expert blind peer review by minimum of two reviewers. All editorial decisions made by independent academic editor. Upon submission manuscript was subject to anti-plagiarism scanning. Prior to publication all authors have given signed confirmation of agreement to article publication and compliance with all applicable ethical and legal requirements, including the accuracy of author and contributor information, disclosure of competing interests and funding sources, compliance with ethical requirements relating to human and animal study participants, and compliance with any copyright requirements of third parties. This journal is a member of the Committee on Publication Ethics (COPE).
Author Contributions
Conceived and designed the experiments: BH, CN. Ana-lyzed the data: BH. Wrote the first draft of the manuscript: BH. Contributed to the writing of the manuscript: CN. Agree with manuscript results and conclusions: BH, CN. Jointly developed the structure and arguments for the paper: BH, CN. Made critical revisions and approved final version: BH, CN. Both authors reviewed and approved of the final manuscript.
REFERENCES
- 1.Dixon A, Gilbert F. Standardising measurement of tumour vascularity by imaging: recommendations for ultrasound, computed tomography, magnetic resonance imaging and positron emission tomography. Eur Radiol. 2012;22:1427–9. doi: 10.1007/s00330-012-2444-z. [DOI] [PubMed] [Google Scholar]
- 2.Miles K. Functional computed tomography in oncology. Eur J Cancer. 2002;38:2079–84. doi: 10.1016/s0959-8049(02)00386-6. [DOI] [PubMed] [Google Scholar]
- 3.Kambadakone A, Sahani D. Body perfusion CT: technique, clinical applications, and advances. Radiol Clin North Am. 2009;47:161–78. doi: 10.1016/j.rcl.2008.11.003. [DOI] [PubMed] [Google Scholar]
- 4.Goh V, Ng Q, Miles K. Computed tomography perfusion imaging for therapeutic assessment: has it come of age as a biomarker in oncology? Invest Radiol. 2012;47:2–4. doi: 10.1097/RLI.0b013e318229ff3e. [DOI] [PubMed] [Google Scholar]
- 5.Betz M, Kopp H, Spira D, Claussen C, Horger M. The benefit of using CT-perfusion imaging for reliable response monitoring in patients with gastrointestinal stromal tumor (GIST) undergoing treatment with novel targeted agents. Acta Radiol. 2013;54:711–21. doi: 10.1177/0284185113484642. [DOI] [PubMed] [Google Scholar]
- 6.García-Figueiras R, Goh VJ, Padhani AR, et al. CT perfusion in oncologic imaging: a useful tool? AJR Am J Roentgenol. 2013;200:8–19. doi: 10.2214/AJR.11.8476. [DOI] [PubMed] [Google Scholar]
- 7.Lee T, Functional CT. physiological models. Trends Biotechnol. 2002;20(8 suppl):S3–10. [Google Scholar]
- 8.Sanelli P, Lev M, Eastwood J, Gonzalez R, Lee T. The effect of varying user-selected input parameters on quantitative values in CT perfusion maps. Acad Radiol. 2004;11:1085–92. doi: 10.1016/j.acra.2004.07.002. [DOI] [PubMed] [Google Scholar]
- 9.Ng C, Chandler A, Yao J, et al. Effect of pre-enhancement set-point on CT perfusion values in normal liver and metastases to the liver from neuroendocrine tumors. J Comput Assist Tomogr. 2014;38:526–34. doi: 10.1097/RCT.0000000000000053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Ng CS, Hobbs BP, Wei W, et al. Effect of sampling frequency on perfusion values in CT perfusion of liver tumors and normal liver. J Comput Assist Tomogr. 2015 doi: 10.1097/RCT.0000000000000212. [In press] [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Goh V, Halligan S, Hugill J, Gartner L, Bartram C. Quantitative colorectal cancer perfusion measurement using dynamic contrast-enhanced multidetector-row computed tomography: effect of acquisition time and implications for protocols. J Comput Assist Tomogr. 2005;29:59–63. doi: 10.1097/01.rct.0000152847.00257.d7. [DOI] [PubMed] [Google Scholar]
- 12.Kambadakone A, Sharma A, Catalano O, Hahn P, Sahani D. Protocol modifications for CT perfusion (CTp) examinations of abdomen-pelvic tumors: impact on radiation dose and data processing time. Eur Radiol. 2011;21:1293–300. doi: 10.1007/s00330-010-2048-4. [DOI] [PubMed] [Google Scholar]
- 13.Bland J, Altman D. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet. 1986;327:307–10. [PubMed] [Google Scholar]
- 14.Ng C, Hobbs B, Chandler A, et al. Metastases to the liver from neuroendocrine tumors: effect of duration of scan acquisition on CT perfusion values. Radiology. 2013;269:758–67. doi: 10.1148/radiol.13122708. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Walker E, Nowacki A. Understanding equivalence and noninferiority testing. J Gen Intern Med. 2010;26:192–6. doi: 10.1007/s11606-010-1513-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Cleveland W. LOWESS: a program for smoothing scatterplots by robust locally weighted regression. Am Stat. 1981;35:54. [Google Scholar]
- 17.Miles K, Charnsangavej C, Lee F, Fishman E, Horton K, Lee T. Application of CT in the investigation of angiogenesis in oncology. Acad Radiol. 2000;7:840–50. doi: 10.1016/s1076-6332(00)80632-7. [DOI] [PubMed] [Google Scholar]
- 18.Stewart EE, Chen X, Hadway J, Lee T. Hepatic perfusion in a tumor model using DCE-CT: an accuracy and precision study. Phys Med Biol. 2008;53:4249–67. doi: 10.1088/0031-9155/53/16/003. [DOI] [PubMed] [Google Scholar]
- 19.Krivobokova T, Kneib T, Claeskens G. Simultaneous confidence bands for penalized spline estimators. J Am Stat Assoc. 2010;105:852–63. [Google Scholar]
- 20.Ruppert D, Wand MP, Carroll RJ. Semiparametric Regression. New York: Cambridge University Press; 2003. [Google Scholar]
- 21.Hurvich C, Simonoff J. Smoothing parameter selection in nonparametric regression using an improved Akaike information criterion. J R Stat Soc Series B Stat Methodol. 1998;60:271–93. [Google Scholar]