Abstract
Purpose
The purpose of this study was to compare the repeatabilities of the linear and nonlinear Tofts and reference region models (RRM) for Dynamic Contrast-Enhanced MRI (DCE-MRI).
Materials and Methods
Simulated and experimental DCE-MRI data from 12 rats with a flank tumor of C6 glioma acquired over three consecutive days were analyzed using four quantitative and semi-quantitative DCE-MRI metrics. The quantitative methods used were: 1) Linear Tofts model (LTM), 2) Nonlinear Tofts model (NTM), 3) Linear RRM (LRRM), and 4) Non-linear RRM (NRRM). The following semi-quantitative metrics were used: 1) Maximum enhancement ratio (MER), 2) time to peak (TTP), 3) initial area under the curve (iauc64), and 4) slope. LTM and NTM were used to estimate K, while LRRM and NRRM were used to estimate K relative to muscle (R). Repeatability was assessed by calculating the within-subject coefficient of variation (wSCV) and the percent intra-subject variation (iSV) determined with the Gage R&R analysis.
Results
The iSV for using LRRM was two-fold lower compared to NRRM at all simulated and experimental conditions. A similar trend was observed for the Tofts model, where LTM was at least 50% more repeatable than the NTM under all experimental and simulated conditions. The semi-quantitative metrics iauc64 and MER were as equally repeatable as K and R estimated by LTM and LRRM respectively. The iSV for iauc64 and MER were significantly lower than the iSV for slope and TTP.
Conclusion
In simulations and experimental results, linearization improves the repeatability of quantitative DCE-MRI by at least 30%, making it as repeatable as semi-quantitative metrics.
Keywords: Dynamic contrast-enhanced MRI, Repeatability, Reference Region Model, Tofts Model, Pharmacokinetics, Linear models
1. INTRODUCTION
Dynamic contrast-enhanced MRI (DCE-MRI) involves the serial acquisition of T1-weighted images before, during, and after the injection of a contrast agent (CA) that shortens the T1 relaxation time of water, resulting in an increase of the MRI signal in tissues or voxels where the agent accumulates [1]. After application of a proper pharmacokinetic (PK) model, parameters related to tissue perfusion [2], blood flow [3], capillary leakage [4], and transit time of the CA can be derived from the dynamic MRI signal in a voxel or a tissue of interest [5].
The two PK parameters most commonly estimated from DCE-MRI data are the rate of CA transfer from blood to tissue (K) and the rate of CA transfer from tissue to blood (kep) [1]. Several studies have shown evidence that K can be used to differentiate tumors from normal tissue [6, 7], and to monitor anti-cancer treatment in fibrosarcoma [8], breast [9, 10], and brain neoplasms [11, 12]. Unfortunately, these results are inconsistent with other studies, which showed that K offers little to no utility in monitoring anti-cancer treatment in breast and brain cancers [13, 14]. Because of these limitations, quantitative DCE-MRI descriptors are not part of the standard of care for clinical DCE-MRI.
These contradictory results may be due to the inherent variability of DCE-MRI that is a consequence of a low signal-to-noise ratio (SNR) [15], slow temporal resolution [16, 17], variability in the arterial input function (AIF) needed for PK modeling [18, 19], and/or the model assumed during data analysis [20, 21]. Some of these limitations have been addressed by the introduction of the non-linear reference region model (NRRM) [22, 23] that does not require AIF determination, and the linear reference region model (LRRM) that also does not require the AIF and gives more accurate parameter estimates than the NRRM under low SNR and slow temporal resolution [24, 25]. The standard Tofts model for DCE-MRI has also been linearized, and it was recently demonstrated that such linearization improves its performance under low SNR and low temporal resolution [26]. We recently demonstrated that the relative K (R) estimated by LRRM is a better predictor of response to neoadjuvant chemotherapy in breast cancer than the R estimated using NRRM [24]. An analoguous behavior was observed for K and kep estimated using the linear (LTM) and non-linear (NTM) Tofts models [27]. Based on these results, we hypothesized that linearization should improve the repeatability of quantitative DCE-MRI. We performed a retrospective study to compare the repeatability of R and kep estimated by NRRM and LRRM. We also compared the repeatability of quantitative NRRM and LRRM descriptors with semi-quantitative descriptors and quantitative NTM and LTM descriptors of DCE-MRI.
2. METHODS
All of the experimental data was downloaded from DataVerse [21], while the code used to simulate and analyze all data are available at https://github.com/JCardenasRdz/Gage-repeatability-DCE-MRI.
2.1 Theory of the quantitative analysis of DCE-MRI and its models
The generalized model for DCE-MRI establishes that the differential equation that describes the PK behavior of a CA within a voxel is:
(1) |
CTOI is the concentration of the CA in the tissue of interest (TOI) as a function of time. Cp(t) is the concentration of the CA in plasma as a function of time (also known as the AIF), K is the rate transfer constant from the plasma into the TOI and kep is the transfer constant from the TOI to plasma. This equation assumes that the fractional plasma volume is equal to zero (vp=0). The quantitative analysis of DCE-MRI data requires three steps to estimate K and kep for any TOI: 1) solve Eq. 1, 2) transform the observed changes in the MRI signal to changes in concentration of the CA, and 3) fit the concentration curves of step 2 to the solution obtained in step 1. The first and most common solution to Eq. [1] was developed by Tofts, et al. [2]:
(2) |
Equation (2) is non-linear in the parameters, and requires a non-linear fitting routine to estimate K and kep. We have named this method the Non-linear Tofts Model (NTM). Non-linear fitting methods are very sensitive to low SNR, while linear fitting methods are more robust and significantly faster. Murase addressed these issues by developing a linear approximation of the NTM, and we have named this method the Linear Tofts Model (LTM; Eq. [3]) [26].
(3) |
Experimentally measuring Cp(t) is challenging, which is a major limitation for Eq. 2 and Eq. 3. The reference region model (RRM) was introduced to remove the need to measure Cp(t), and instead uses a reference region (RR) as surrogate for Cp(t) (Eq. [4]) [22].
(4a) |
(4b) |
CTOI(t) and CRR(t) are the concentrations of the CA at time t in the TOI and RR respectively. K and K are the transfer constants between plasma and the extravascular extracellular space (EES) of the TOI and the RR respectively. R = K/K, kep,RR and kep,TOI are the transfer rates (min) from the TOI and RR back to the plasma. Estimating R, kep,RR and kep,TOI using Eq. 4 requires a nonlinear fitting method. Thus, we have named Eq. 4 the Non-linear RRM (NRRM). We obtained a linear solution to the RRM, and demonstrated that the Linear RRM (LRRM) is more robust than NRRM to low SNR and low temporal resolution (Eq. 5) [24].
(5) |
The same definitions used for Eq. 4 apply to Eq. 5. ve,RR is the fractional volume of the extravascular extracellular space. The goal of our study was to determine how the model used in DCE-MRI analysis affects the repeatability of DCE-MRI.
2.2 Simulations
Thirty simulated tumor enhancement curves were created using K and ve values that were randomly selected from a normal distribution (Figure 1). The mean K was set to 0.25 min with a standard deviation of 0.1, and the mean ve was set to 0.4 with a standard deviation of 0.1. A single muscle reference region enhancement curve was created for all subsequent analyses using a K of 0.1 min and a ve of 0.1. These values represented reasonable values for tumor and muscle tissues from previous reports [30]. All curves were simulated using Eq. 2, and Cp(t) was simulated using (Eq. 6):
(6) |
where A = 30 mM/min, C = 4.0 min, D = 0.65 mM, E = 5.0 min, and F = 0.04 min. This set of parameters simulated an AIF with an injection speed of 0.005 mL/sec, which is similar to a previously reported AIF [31].
To simulate potential changes in enhancement curves under the experimental conditions used in this study, white Gaussian noise was added to each of the 30 simulated enhancement curves 3 separate times at the same SNR. White Gaussian noise was also added to a simulated muscle reference region enhancement curve. SNR was defined as the ratio of the signal power over the noise power in decibels.
The R value for each of the 30 curves was estimated using LRRM without a non-negative constraint; LRRM with a non-negative constraint; and NRRM with a non-negative constraint. For NRRM, initial guesses for R were obtained from a gamma distribution with coefficient a = 1.40 and b = 0.56, which corresponds to reasonable values from previous reports. We also estimated values with NRRM using a non-negative constraint and initial guesses for R, kep,TOI, and kep,RR taken from the estimates from LRRM with a non-negative constraint, which we labeled as NRRM*. Additionally, we determined the rate of transfer from tissue to blood in the tissue of interest (kep,TOI), and the rate of transfer from tissue to blood in the reference region (kep,RR).
Gage Repeatability and Reproducibility (R&R) analysis (described below) was performed to measure the repeatability of the R values determined using LRRM with and without a non-negative constraint, NRRM and NRRM* by calculating the percent intra-group variances due to the fitting method. These three values were stored and the process starting from the addition of white Gaussian noise was repeated 1000 times. The median Gage R&R percent variance values from the 1000 values generated for each of the three fitting methods were taken as the true Gage R&R percent variance values for that SNR. The process was repeated for SNR values ranging from 5 to 40. Quantitative LTM and NTM analyses and semi-quantitative analyses were also performed in the same manner.
2.3 In vivo study
2.3.1 Animal model and DCE-MRI acquisition
Twelve rats were injected subcutaneously with rat glioma cells and subsequently prepared for imaging [21]. DCE-MRI acquisition was performed on 3 consecutive days. The basics of the DCE-MRI acquisition protocol were as follows: 8 axial images were acquired using a 10 msec repetition time (TR), 1.7 msec echo time (TE), 15 excitation pulse, 16 mm slab thickness (8 slices each 2 mm thick), 469 x 625 μm in-plane resolution, 128 x 80 matrix size, 60 x 50 mm field of view, 50 repetitions, temporal resolution = 6.4 sec, and 1 average. A more detailed description of the MRI protocol is provided in reference 21. Two of the twelve rats had technical scanning failures on 1 of the 3 days of imaging. Data from these 2 rats were excluded from analysis.
2.3.2 Image analysis
A quantitative T1 map was not obtained in this study. Therefore, we used the signal enhancement ratio (SER, Eq. 7) to replace contrast agent concentration for quantitative DCE-MRI analysis.
(7) |
SER(t) is the SER at time t, St is the MR signal at time t, and So is the signal before injection of the CA (t=0).
The following semi-quantitative metrics were used in this work: a) maximum enhancement ratio (MER), b) time to peak (TTP), c) initial area under the curve (iauc64), and d) slope (Fig. 2). The MER was defined as the maximum of each SER(t) curve. The TTP was determined by subtracting the time at the final baseline time point (10 image) from the time of the MER. iacu64 was determined from the area under the curve from the first post-baseline time point (11 image) to the time point acquired 64 seconds post-baseline (20 image). The slope was determined by dividing MER by TTP.
As mentioned earlier, the NTM and NRRM require a non-linear fitting algorithm and an initial guess to estimate their respective PK parameters. An initial guess of K = 0.5 and kep,TOI = 5.0 were used for the NTM, while the following initial guesses were used for the NRRM: R= 2.0, kep,TOI = 5.0, and kep,RR = 5.0. The MATLAB function lsqcurvefit was used for all non-linear fittings, while constraining all possible solutions between 0 and 10. The function tolerance was set to 1 x 10 and the maximum number of iterations was set to 100,000. Estimates for the linear methods were obtained with and without a constraint of greater than or equal to zero. This was accomplished using non-negative least squares as implemented in the MATLAB function lsqnonneg. Finally, we studied the effect of using the parameters estimated with the LTM as the initial guesses for the NTM (NTM*), and the parameters estimated by the LRRM as the initial guesses for the NRRM (NRRM*).
Our quantitative analysis of experimental DCE-MRI data using the NTM (Eq. 2) and LTM (Eq. 3) assumed a population-based averaged AIF, which was derived from the AIFs for individual rats in reference 21 (Eq. 8). The fitting for the NRRM and LRRM used muscle as a surrogate for the AIF.
(8) |
To analyze the average signal of the tumor, a region of interest (ROI) for each tumor and muscle were drawn by a single observer (KMJ) on slices 4, 5, and 6 of the 8 slices imaged for each rat. The three slices chosen showed the largest tumor volume. The same ROIs for any given rat were used for LTM, NTM, LRRM and NRRM analyses. For each rat, the average intensity of the whole tumor ROI and muscle reference region ROI from all time points were used to generate the enhancement curve that was fit with LRRM and NRRM analyses. The average signal intensity of the population AIF was used for the LTM and NTM analyses.
For pixelwise analyses, each pixel within the ROI chosen for the tumor region was used to compute R, kep,TOI, and kep,RR. A single enhancement curve for the muscle reference region, generated by taking the average signal value of the whole muscle reference region ROI from all time points, was used for all tumor pixelwise analyses. Based on the SNR of these data sets, pixels within the tumor ROI that showed less than 10% enhancement were excluded from analysis. This exclusion criterion was also used in a previous study that used this data [21]. Additionally, pixels that showed poor fits based on the R value were excluded from analysis. A range of R values from 0 – 0.9 were used as the cutoff point to ensure that comparisons between LRRM and NRRM were not affected by the selected R cutoff point. After removing the pixels with poor fits, the median value of the remaining pixels was determined for R, kep,TOI, and kep,RR.
2.4 Statistical Analysis
A summary of the parameter values generated from quantitative and semi-quantitative DCE-MRI analyses were provided in the form of mean, range, and within-subject coefficient of variation (wSCV). The values were taken on a global scale meaning that values from all rats over each of the three time points were included in the calculation. The wSCV was calculated as follows: 1) the base-10 logarithm was applied to estimated quantitative and semi-quantitative descriptors, 2) the within-subject variance (variance for each row) was calculated, 3) the mean within-subject standard deviation (wSD) was calculated by taking the square root of the within-subject variance, 4) wSCV= antilog(wSD)-1 [32, 33]. The medians of the estimates of the parameters R and kep,TOI, as determined by the methods LRRM and NRRM, were compared using a t-test, with the significance set to the 95% confidence level.
The Gage R&R methodology was initially developed to determine the sources of variation in a manufacturing system [34]. Gage R&R analysis uses ANOVA to determine the percent of the observed variation in a system that is due to the parts (process), measurement protocol (repeatability), and the operator (reproducibility). This analysis method is often performed to test variations between multiple operators who are measuring a characteristic of multiple parts with one measurement protocol. In this study, we compared different measurement protocols rather than comparing different operators. One researcher (one operator) analyzed DCE-MRI data from multiple rats (multiple parts) to determine the repeatability of fitting algorithms used for the analysis of DCE-MRI data (multiple measuring protocols). Each fitting algorithm was assigned a unique measurement protocol identification and each rat was assigned a unique part identification for Gage R&R analysis. The measured percent variance (iSV) represented the variance in the fitting method compared to the total variation. This analysis was performed for both the simulations and the experimental data. The Gage R&R methodology was implemented using the gagerr function in Matlab R2015a [28].
3. RESULTS
3.1 Simulations
A total of 3.15 million simulated enhancement curves were generated. These curves were subsequently analyzed with quantitative and semi-quantitative methods, and then with Gage R&R analysis. The use of 30 enhancement curves was sufficient for convergence as evidenced by the tight confidence intervals in the Gage R&R plots (Fig. 3). The median percent variance due to repeatability was obtained from the 1000 Gage R&R analyses performed at each SNR. The median percent variance value and its corresponding 95% confidence interval were evaluated over the range of SNRs tested from each DCE-MRI fitting method. Comparisons between LRRM with and without non-negative constraints showed no difference in percent variance over the range of SNRs tested (data not shown). Thus, LRRM with a non-negative constraint was compared with NRRM, which also had a non-negative constraint, to avoid erroneous negative R, kep,TOI, and kep,RR values from being generated.
LRRM showed a significantly lower Gage R&R percent variance compared to NRRM at all SNRs tested (Fig. 3a). A significant difference was defined as non-overlapping 95% confidence intervals. Interestingly, when repeating the analysis with NRRM* (NRRM initialized using LRRM-derived coefficients as the initial guess), the Gage R&R percent variance values were similar between LRRM and NRRM*. This result emphasizes that the repeatability of R estimated via the NRRM is highly dependent on a proper initial guess. LTM showed a significantly lower Gage R&R percent variance compared to NTM and LRRM at low SNRs (Fig. 3b). The significant difference between LTM and NTM at low SNRs further emphasizes that linearizing a model improves repeatability. Additionally, when repeating analysis with NTM* (NTM initialized using LTM-derived coefficients as initial guesses), the Gage R&R percent variance values were similar between LTM and NTM, which was observed with LRRM and NRRM as well. The analysis of semi-quantitative descriptors showed the best repeatability measurements with iauc64 and MER (Fig. 3c). The variability of these two descriptors was significantly lower than the variability of the slope and TTP at all SNRs, and similar to the variability of R values estimated via the LRRM and NRRM*. The Gage R&R value of the slope was significantly lower than TTP at all SNRs and similar to the variability of NRRM with a random initial guess at mid-range SNRs.
3.2 In vivo Results
Fitting in vivo results with the LRRM correctly described the PK of the agent in the tumor, as demonstrated by an excellent goodness-of-fit between the fit and the experimental results (R ≥ 0.94; Fig. 4). The semi-quantitative measurements, LTM, and LRRM measurements showed lower wSCVs than the NRRM measurements (Table 1). Notably, the wSCVs of R measurements were lower with LRRM vs. NRRM in pixelwise analysis. The wSCVs of R measurements were also lower with NRRM* vs. NRRM in pixelwise analysis. The R values from all rats at all time points were significantly higher with LRRM compared to NRRM in pixelwise analysis (p<0.01) and ROI analyses (p<0.01). Also, kep,TOI values were significantly lower with LRRM compared to NRRM in both pixelwise analysis (p<0.01) and ROI analysis (p < 0.01). Similarly, the R values were significantly higher with NRRM* compared to NRRM in pixelwise analysis (p = 0.04) and the kep,TOI values were significantly lower with NRRM* compared to NRRM in pixelwise analysis (p < 0.01).
Table 1.
Model | Parameter | ROI | Pixel | ||||
---|---|---|---|---|---|---|---|
| |||||||
Mean | IQR | wSCV | Mean | IQR | wSCV | ||
Quantitative Parameters (Reference Region) | |||||||
| |||||||
LRRM | RKtrans | 2.57 | 1.86–3.23 | 0.41 | 2.40 | 1.67–3.16 | 0.40 |
kep,TOI | 0.11 | 0.01–0.73 | 0.74 | 0.03 | 0.0–0.04 | 0.68 | |
NRRM | RKtrans | 1.52 | 1.02–2.61 | 0.97 | 1.99 | 1.13–2.97 | 0.92 |
kep,TOI | 1.03 | 0.62–1.47 | 0.91 | 1.33 | 0.81–1.55 | 0.73 | |
NRRM* | RKtrans | 1.24 | 0.73–1.68 | 0.74 | 1.99 | 1.13–2.97 | 0.63 |
kep,TOI | 0.45 | 0.12–0.71 | 0.61 | 0.30 | 0.07–0.43 | 0.53 | |
| |||||||
Quantitative Parameters (Tofts Model) | |||||||
| |||||||
LTM | Ktrans | 0.25 | 0.03–0.34 | 0.43 | 0.22 | 0.09–0.24 | 0.66 |
kep,TOI | 0.85 | 0.64–1.10 | 0.37 | 0.90 | 0.73–1.05 | 0.21 | |
NTM | Ktrans | 0.16 | 0.09–0.24 | 0.57 | 0.18 | 0.17–0.21 | 0.20 |
kep,TOI | 7.64 | 6.89–8.04 | 0.19 | 8.20 | 8.67–9.48 | 0.45 | |
NTM* | Ktrans | 0.08 | 0.05–0.11 | 0.43 | 0.05 | 0.04–0.05 | 0.27 |
kep,TOI | 1.87 | 1.04–2.32 | 0.26 | 2.21 | 1.59–2.83 | 0.29 | |
| |||||||
Semi-Quantitative Parameters | |||||||
| |||||||
MER | 5.93 | 4.56 – 8.22 | 0.44 | 7.46 | 6.22 – 8.81 | 0.25 | |
TTP | 3.79 | 3.20 – 4.05 | 0.15 | 3.09 | 2.99 – 3.20 | 0.07 | |
iauc64 | 0.21 | 0.15 – 0.29 | 0.48 | 0.22 | 0.18 – 0.26 | 0.29 | |
slope | 1.56 | 1.31 – 2.60 | 0.41 | 2.46 | 2.13 – 2.82 | 0.27 |
LRRM = linear reference region model
NRRM = nonlinear reference region model with set initial guess
NRRM* = nonlinear reference region model with initial guess from LRRM estimates
LTM = linear tofts model
NTM = nonlinear tofts model with set initial guess
NTM* = nonlinear tofts model with initial guess from LTM estimates
MER = mean enhancement ratio, TTP = time to peak, iauc64 = initial area under the curve
IQR: inter-quartile range
wSCV: within subject coefficient of variation. Bounds: 95% lower and upper confidence intervals
Interestingly, the R and the kep,TOI values from analyzing ROIs were similar when NRRM* and NRRM were used. This was also seen in the K and the kep,TOI values when analyzing ROIs with NTM* and NTM. Therefore data with high SNR produced the same estimates for the quantitative metrics regardless of the initial guess. Conversely, analysis of pixelwise data showed different R and kep,TOI values when NRRM* and NRRM were used, while the use of NRRM* and LRRM estimated similar values. This similarity is based on median values produced by LRRM and NRRM*. This result suggests that data with low SNR requires a good initial guess to obtain a good analysis with the NRRM method.
For Gage R&R analysis of the experimental DCE-MRI study, a single percent variance value was generated for the dataset for each of the quantitative and semi-quantitative methods and these values were compared to each other (Table 2). A student’s t-test could not be performed to test for statistical significance because only a single value could be measured with Gage R&R analysis for the quantitative and semi-quantitative methods.
Table 2.
Pixelwise Analysis | ROI Analysis | |
---|---|---|
Reference Region Model | ||
LRRM | 45.1 | 65.7 |
NRRM | 78.3 | 93.5 |
NRRM* | 53.3 | 86.7 |
| ||
Tofts Model | ||
| ||
LTM | 59.5 | 62.4 |
NTM | 86.9 | 84.3 |
NTM* | 62.7 | 71.8 |
| ||
Semi-quantitative | ||
| ||
MER | 46.1 | 65.5 |
Slope | 50.9 | 48.2 |
TTP | 88.7 | 100 |
Iauc64 | 46.5 | 66.4 |
For analyses of in vivo results, the median Gage R&R percent variance value was lower with LRRM compared to NRRM and lower with LTM compared to NTM in both pixelwise and ROI analysis. This difference held true for all thresholds set for pixel inclusion based on the R of the fit (Figs. 5a and 5b). An R value of 0.9 was not chosen as a threshold because of the high number of pixels that were discarded when doing so. The median Gage R&R percent variance value was lower with NRRM* compared to NRRM and NTM* compared to NTM for pixelwise analysis but similar for ROI analysis, which matched with wSCV results. The Gage R&R percent variance value for R with LRRM was similar to percent variance values for LTM, MER, slope, and iauc64 in both pixelwise and ROI analysis (Table 2). The Gage R&R percent variance values for R with NRRM, NTM, and TTP were similar, and were also higher than all other Gage R&R percent variance values in both pixelwise and ROI analysis. We attribute the lower repeatability of TTP to the stronger dependence of TTP on image noise relative to the dependence of MER and iauc64 on image noise (Fig. 2b).
R pixelwise maps from a representative rat over the three days of DCE-MRI showed the distribution of R values with LRRM, NRRM, and NRRM* (Fig. 6). The median R values from the maps for days 1, 2, and 3 were 3.44, 3.08, and 3.16 min respectively for LRRM; 3.52, 1.26, and 3.17 min respectively for NRRM; and 3.51, 2.89, and 2.97 min respectively for NRRM*. These results indicated a larger variability with NRRM as compared to LRRM and NRRM* in measuring R over multiple days. Additionally, the pixelwise distributions for NRRM had larger standard deviations and were more highly skewed than the pixelwise distributions with LRRM. For the rat shown in Fig. 6, the standard deviations of the pixels for days 1, 2, and 3 were 2.0, 2.16, and 2.63 respectively for LRRM; 2.50, 0.84, and 2.97 respectively for NRRM; and 2.22, 1.87, and 2.31 respectively for NRRM*.
4. DISCUSSION
The results of this study support our hypothesis that linearization can be used to improve the repeatability of the reference region model for DCE-MRI, making quantitative DCE-MRI as repeatable as LTM and semi-quantitative DCE-MRI. Our Gage R&R analyses of simulations showed that a lower percentage of the variability in the measurement system is due to the algorithm when the LRRM is used instead of the NRRM, regardless of the SNR of the DCE-MRI data. This result was consistent with previous reports that concluded that linearization improves the accuracy of the Tofts and reference region models for DCE-MRI [24, 26]. In addition, our experimental results showed lower wSCV and iSV values for R with LRRM compared to NRRM, for pixelwise analyses. This improved repeatability was also evident in the pixelwise parametric maps of R. The improvement in the repeatability of the pixelwise analyses with LRRM indicated that LRRM is especially useful under conditions of lower SNR. We believe our work is a useful contribution to the understanding of the repeatability and reproducibility of quantitative DCE-MRI with variations of the Tofts model and the reference region model [32, 35–38].
The accuracy and precision of the algorithms most commonly used for non-linear curve fitting of DCE data are highly dependent on the initial guess of the true values of K and kep (R for the RRM) [24, 26, 36]. This long-standing issue in quantitative DCE-MRI is often ignored, causing the results of quantitative DCE-MRI to be highly variable if the wrong initial guess is used. As evidence, we demonstrated that the repeatability of NRRM and NTM approached the repeatability of LRRM and LTM if the results of LRRM and LTM were used as the initial guess for NRRM (NRRM*) and NTM (NTM*). Also, the median value and interquartile range of R determined using NRRM* was similar to values determined using LRRM for pixelwise analyses. This result indicated that an “ideal” initial guess can overcome the variability of NRRM analysis induced by image noise. However, it is fundamentally impossible to know the “ideal” initial guess for each pixel in practice, and therefore the quality of NRRM analysis is fundamentally limited under practical conditions. To address this fundamental problem of the “ideal” initial guess, each voxel could be fit with a non-linear algorithm multiple times using different initial guesses [36]. However, hundreds-to-thousands of initial guesses may be needed to properly evaluate this issue of the “ideal” initial guess. Therefore this approach is computationally impractical for pixelwise analyses, and the improvement in the analysis does not scale linearly with the number of initial guesses. Our LRRM completely avoids this fundamental problem because our methods use a linear regression that does not require an initial guess [24, 26].
A recently published study concluded that the linear compartmental tissue uptake model (similar to the Tofts model) is less robust to low SNR than its nonlinear equivalent, which would seem to contradict our study [39]. However, this previous study used the initial values used to simulate data as the initial guesses for the non-linear fitting routines. An “ideal” initial guess is only available during simulation studies such as this recently published study, and an “ideal” initial guess is fundamentally impossible to obtain during practical experiments. Thus, the LRRM is a superior approach for analyzing DCE-MRI data as compared to NRRM.
The repeatability of R estimated with the LRRM was comparable to the repeatability of LTM, MER and iauc64 measurements, as shown by Gage R&R analyses of simulated and experimental data. A previous study showed that MER and iauc64 measurements have good reproducibility [35], which suggests that LRRM also has good reproducibility (where repeatability tests measurements under the same conditions, and reproducibility tests measurements under different conditions). TTP measurements, R estimated with the NRRM, and K estimated with the NTM showed the lowest repeatability, which was attributed to the stronger sensitivity to noise for these measurements.
The wSCV for R determined by the LRRM in our study is in good agreement with the wSCV (~0.40) for R reported in previous repeatability studies of the reference region model [35, 38]. The data analyzed in our study were acquired at lower temporal resolution than the previous repeatability studies. The wSCV for R determined by the NRRM showed that the NRRM was only half as repeatable relative to results from these previous reports. Therefore, linearization improves the repeatability of the RRM under conditions of slower temporal resolution, which are often encountered in clinical studies [38, 40]. This result regarding temporal resolution complements previous studies that have shown how LRRM can estimate accurate R values at temporal sampling rates as slow as 60 seconds [24], while NRRM requires temporal sampling rates less than 32.0 seconds to estimate accurate R values [39], and the Tofts model requires a temporal sampling rate of 5.0 seconds or faster [15, 16]. Furthermore, these previous studies showed that NRRM underestimates R and overestimates kep,TOI especially with low SNR, which matched the results of our study. In these previous studies, the calculation speed of LRRM was shown to be 1350–8200 times faster than NRRM (depending on the SNR) [24].
Our results also demonstrate the benefits of using Gage R&R analysis as a means to compare repeatability between different MRI analysis methods. Gage R&R analysis calculates the percentage of variation due to the measurement source compared to the total variation, and thus is not subject to the scale of the DCE-MRI parameter being measured. For comparison, wSCV is subject to the scale of the DCE-MRI parameter, and thus wSCV is inherently smaller for DCE-MRI parameters that have a small absolute value like TTP, compared to larger DCE-MRI parameters like MER. In our experimental results, wSCV values for TTP compared to MER were lower, while Gage R&R analyses clearly showed better repeatability measurements for MER compared to TTP, for both pixelwise and ROI analysis, during simulations and experimental analyses.
Despite the promising results presented in this work, three limitations still remain. First, the Gage R&R method assumes that the biologic variability of the tumors were zero over the three time points. However, the tumors inevitably changed in size and possibly in perfusion characteristics over the three time points. Therefore, the fitting method variability and the day-today variability could be confounded in the estimation of iSV. Unfortunately, performing repeat scans 24 hours apart was the shortest timeframe allowed by the IACUC. Second, we used the change in MR signal to represent the change in concentration in the tumor and reference region during our DCE-MRI analyses. However, the MR signal has a non-linear relationship with CA concentration, which could lead to erroneous estimations of the absolute values of PK constants. Our study focused on the repeatability of measuring the estimated PK constants, which pertains to precision rather than accuracy, thus this potential pitfall should not significantly affect the conclusions of our study. Also, the relationship between MR signal and concentration becomes more linear at lower, clinically-relevant magnetic fields. To test this potential pitfall, a future DCE-MRI clinical study can be performed at ≤ 3T magnetic field strength with the acquisition of T1 parametric maps before injection of the Gd-based CA.
5. CONCLUSION
In conclusion, this report introduces the Gage R&R analysis as a convenient method to determine the repeatability of DCE-MRI, while also demonstrating that linearization increase the repeatability of the Tofts and reference region models for DCE-MRI by approximately 40%.
Acknowledgments
This study was supported by the National Cancer Institute of the National Institutes of Health under award numbers R01CA167183, R01CA197029, and P30CA023074. K.M.J. is supported by a fellowship from NIH grants T32HL007955 and T32HL066988.
Footnotes
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
References
- 1.Tofts PS, Brix G, Buckley DL, Evelhoch JL, Henderson E, Knopp MV, Larsson HB, Lee TY, Mayr NA, Parker GJ, Port RE, Taylor J, Weisskoff RM. Estimating kinetic parameters from dynamic contrast-enhanced T1-weighted MRI of a diffusable tracer: standardized quantities and symbols. J Magn Reson Imaging. 1999;10(3):223–32. doi: 10.1002/(sici)1522-2586(199909)10:3<223::aid-jmri2>3.0.co;2-s. [DOI] [PubMed] [Google Scholar]
- 2.Tofts PS, Kermode AG. Measurement of the blood-brain barrier permeability and leakage space using dynamic MR imaging. 1. Fundamental concepts. Magn Reson Med. 1991;17(2):357–67. doi: 10.1002/mrm.1910170208. [DOI] [PubMed] [Google Scholar]
- 3.De Langen AJ, Van Den Boogaart VE, Marcus JT, Lubberink M. Use of H215O-PET and DCE-MRI to measure tumor blood flow. Oncologist. 2008;13(6):631–44. doi: 10.1634/theoncologist.2007-0235. [DOI] [PubMed] [Google Scholar]
- 4.Larsson HB, Stubgaard M, Frederiksen JL, Jensen M, Henriksen O, Paulson OB. Quantitation of blood brain barrier defect by magetic resonance imaging and gadolinium DTPA in patients with multiple sclerosis and brain tumors. Magn Reson Med. 1990;16(1):117–31. doi: 10.1002/mrm.1910160111. [DOI] [PubMed] [Google Scholar]
- 5.McDonald DM, Choyke PL. Imaging of angiogenesis: from microscope to clinic. Nature Med. 2003;9(6):713–25. doi: 10.1038/nm0603-713. [DOI] [PubMed] [Google Scholar]
- 6.Su MY, Wang Z, Carpenter PM, Lao X, Mühler A, Nalcioglu O. Characterization of N-ethyl-N-nitrosourea-induced malignant and benign breast tumors in rats by using three MR contrast agents. J Magn Reson Imaging. 1999;9(2):177–86. doi: 10.1002/(sici)1522-2586(199902)9:2<177::aid-jmri5>3.0.co;2-8. [DOI] [PubMed] [Google Scholar]
- 7.Langer DL, van der Kwast TH, Evans AJ, Trachtenberg J, Wilson BC, Haider MA. Prostate cancer detection with multi-parametric MRI: Logistic regression analysis of quantitative T2, diffusion-weighted imaging, and dynamic contrast-enhanced MRI. J Magn Imaging. 2009;30(2):327–34. doi: 10.1002/jmri.21824. [DOI] [PubMed] [Google Scholar]
- 8.Jiang F, Albert DH, Luo Y, Tapang P, Zhang K, Davidsen SK, Fox GB, Lesniewski R, McKeegan EM. ABT-869, a multitargeted receptor tyrosine kinase inhibitor, reduces tumor microvascularity and improves vascular wall integrity in preclinical tumor models. J Pharmacol Exp Ther. 2011;338:134–42. doi: 10.1124/jpet.110.178061. [DOI] [PubMed] [Google Scholar]
- 9.Thukral A, Thomasson DM, Chow CK, Eulate R, Wedam SB, Gupta SN, Wise BJ, Steinberg SM, Liewehr DJ, Choyke PL, Swain SM. Inflammatory breast cancer: dynamic contrast-enhanced MR in patients receiving bevacizumab—initial experience. Radiology. 2007;244:727–35. doi: 10.1148/radiol.2443060926. [DOI] [PubMed] [Google Scholar]
- 10.Buadu LD, Murakami J, Murayama S, Hasjiguchi N, Sakai S, Masuda K, Toyoshima S, Kuroki S, Ohno S. Breast lesions: correlation of contrast medium enhancement patterns on MR images with histopathologic findings and tumor angiogenesis. Radiology. 1996;200:639–49. doi: 10.1148/radiology.200.3.8756909. [DOI] [PubMed] [Google Scholar]
- 11.Kamoun WS, Ley CD, Farrar CT, Duyverman AM, Lahdenranta J, Lacorre DA, Batchelor TT, di Tomaso E, Duda DG, Munn LL, Fukumura D, Sorensen AB, Jaon RK. Edema control by cediranib, a vascular endothelial growth factor receptor-targeted kinase inhibitor, prolongs survival despite persistent brain tumor growth in mice. J Clin Oncol. 2009;27:2542–52. doi: 10.1200/JCO.2008.19.9356. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Batchelor TT, Sorensen AG, di Tomaso E, Zhang WT, Duda DG, Cohen KS, Kozak KR, Cahill DP, Chen PJ, Zhu M, Ancukiewicz M, Mrugala MM, Plotkin S, Drappatz J, Louis DN, Ivy P, Scadden DT, Benner T, Loeffler JS, Wen PY, Jain RK. AZD2171, a pan-VEGF receptor tyrosine kinase inhibitor, normalizes tumor vasculature and alleviates edema in glioblastoma patients. Cancer Cell. 2007;11:83–95. doi: 10.1016/j.ccr.2006.11.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Etxano J, Insausti LP, Elizalde A, Lopez Vega JM, Plazaola A, Martinez P. Analysis of the changes induced by bevacizumab using a high temporal resolution DCE-MRI as prognostic factors for response to further neoadjuvant chemotherapy. Acta Radiol. 2015;56:1300–7. doi: 10.1177/0284185114556098. [DOI] [PubMed] [Google Scholar]
- 14.Boxerman JL, Ellingson BM. Response assessment and magnetic resonance imaging issues for clinical trials involving high-grade gliomas. Top Magn Reson Imaging. 2015;24:127–36. doi: 10.1097/RMR.0000000000000054. [DOI] [PubMed] [Google Scholar]
- 15.Evelhoch JL. Key factors in the acquisition of contrast kinetic data for oncology. J Magn Reson Imaging. 1999;10(3):254–9. doi: 10.1002/(sici)1522-2586(199909)10:3<254::aid-jmri5>3.0.co;2-9. [DOI] [PubMed] [Google Scholar]
- 16.Henderson E, Rutt BK, Lee TY. Temporal sampling requirements for the tracer kinetics modeling of breast disease. Magn Reson Imaging. 1998;16(9):1057–73. doi: 10.1016/s0730-725x(98)00130-1. [DOI] [PubMed] [Google Scholar]
- 17.Heisen M, Fan X, Buurman J, van Riel NA, Karczmar GS, ter Haar Romeny BM. The influence of temporal resolution in determining pharmacokinetic parameters from DCE-MRI data. Magn Reson Med. 2010;63(3):811–6. doi: 10.1002/mrm.22171. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Li X, Cai Y, Moloney B, Chen Y, huang W, Woods M, Coakley FV, Rooney WD, Garzotto MG, Springer CS., Jr Relative sensitivities of DCE-MRI pharmacokinetic parameters to arterial input function (AIF) scaling. J Magn Reson. 2016;269:104–12. doi: 10.1016/j.jmr.2016.05.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Huang W, Chen Y, Fedorov A, Li X, Jajamovich GH, Malyarenko DI, Aryal MP, LaViolette PS, Oborski MJ, O’Sullivan F, Abramson RG, Jafari-Khouzani K, Afzal A, Tudorica A, Moloney B, Gupta SN, Besa C, kalpathy-Cramer J, Mountz JM, Laymon CM, Muzi M, Schmainda K, Cao Y, Chenevert TL, Taouli B, Yankeelov TE, Fennessy F, Li X. The impact of arterial input function determination variations on prostate dynamic contrast-enhanced magnetic resonance imaging pharmacokinetic modeling: A multicenter data analysis challenge. Tomography. 2016;2(1):56–66. doi: 10.18383/j.tom.2015.00184. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Huang W, Li X, Chen Y, Li X, Chang MC, Oborski MJ, Malyarenko DI, Muzi M, Jajamovich GH, Federov A, Tudorica A, Gupta SN, Laymon CM, Marro KI, Dyvorne HA, Miller JV, Barbodiak DP, Chenevert TL, Yankeelov TE, Mountz JM, Kinahan PE, Kikinis R, Taouli B, Fennessy F, kalpathy-Cramer J. Variations of dynamic contrast-enhanced magnetic resonance imaging in evaluation of breast cancer therapy response: a multicenter data analysis challenge. Transl Oncol. 2014;7(1):153–66. doi: 10.1593/tlo.13838. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Ng CS, Wei W, Bankson JA, Ravoori MK, Hoan L, Brammer DW, Klumpp S, Waterton JC, Jackson EF. Dependence of DCE-MRI biomarker values on analysis algorithm. PloS One. 2015;10(7):e0130168. doi: 10.1371/journal.pone.0130168. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Yankeelov TE, Luci JJ, Lepage M, Li R, Debusk L, Lin PC, Price RR, Gore JC. Quantitative pharmacokinetic analysis of DCE-MRI data without an arterial input function: a reference region model. Magn Reson Imaging. 2005;23(4):519–29. doi: 10.1016/j.mri.2005.02.013. [DOI] [PubMed] [Google Scholar]
- 23.Kovar DA, Lewis M, Karczmar GS. A new method for imaging perfusion and contrast extraction fraction: input functions derived from reference tissues. J Magn Reson Imaging. 1998;8:1126–34. doi: 10.1002/jmri.1880080519. [DOI] [PubMed] [Google Scholar]
- 24.Cárdenas-Rodríguez J, Howison CM, Pagel MD. A linear algorithm of the reference region model for DCE-MRI is robust and relaxes requirements for temporal resolution. Magn Reson Imaging. 2013;31(4):497–507. doi: 10.1016/j.mri.2012.10.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Lee J, Cárdenas-Rodríguez J, Pagel MD, Platt S, Kent M, Zhao Q. Comparison of analytical and numerical analysis of the reference region model for DCE-MRI. Magn Reson Imaging. 2014;32(7):845–53. doi: 10.1016/j.mri.2014.04.007. [DOI] [PubMed] [Google Scholar]
- 26.Murase K. Efficient method for calculating kinetic parameters using T1-weighted dynamic contrast-enhanced magnetic resonance imaging. Magn Reson Med. 2004;51:858–62. doi: 10.1002/mrm.20022. [DOI] [PubMed] [Google Scholar]
- 27.DeGrandchamp JB, Whisenant JG, Arlinghaus LR, Abramson VG, Yankeelov TE, Cárdenas-Rodríguez J. Predicting response before initiation of neoadjuvant chemotherapy in breast cancer using new methods for the analysis of dynamic contrast enhanced MRI (DCE MRI) data. SPIE Medical Imaging. 2016;9788:978801–10. [Google Scholar]
- 28.http://www.mathworks.com/help/stats/gagerr.html
- 29.Yang C, Karczmar GS, Medved M, Stadler WM. Multiple reference tissue method for contrast agent arterial input function estimation. Magn Reson Med. 2007;58(6):1266–75. doi: 10.1002/mrm.21311. [DOI] [PubMed] [Google Scholar]
- 30.Donahue KM, Weisskoff RM, Parmelee DJ, Callahan RJ, Wilkinson RA, Mandeville JB, Rosen BR. Dynamic Gd-DTPA enhanced MRI measurement of tissue cell volume fraction. Magn Reson Med. 1995;34:423–32. doi: 10.1002/mrm.1910340320. [DOI] [PubMed] [Google Scholar]
- 31.Simpson NE, He Z, Evelhoch JL. Deuterium NMR tissue perfusion measurements using the tracer uptake approach: I. Optimization of methods Magn Reson Med. 1999;42:42–52. doi: 10.1002/(sici)1522-2594(199907)42:1<42::aid-mrm8>3.0.co;2-z. [DOI] [PubMed] [Google Scholar]
- 32.Padhani AR, Hayes C, Landau S, Leach MO. Reproducibility of quantitative dynamic MRI of normal human tissues. NMR Biomed. 2002;15:143–53. doi: 10.1002/nbm.732. [DOI] [PubMed] [Google Scholar]
- 33.Bland MJ, Altman DG. Statistics notes: measurement error proportional to the mean. Br Med J. 1996;313:106. doi: 10.1136/bmj.313.7049.106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Burdick RK, Borror CM, Montgomery DC. SIAM Series on Statistics and Applied Probability. Philadelphia: 2005. Design and Analysis of Gauge R&R Studies: Making Decisions with Confidence Intervals in Random and Mixed ANOVA Models. [Google Scholar]
- 35.Galbraith SM, Lodge MA, Taylor NJ, Rustin GJ, Bentzen S, Stirling JJ, Padhani AR. Reproducibility of dynamic contrast-enhanced MRI in human muscle and tumours: comparison of quantitative and semi-quantitative analysis. NMR Biomed. 2002;15(2):132–42. doi: 10.1002/nbm.731. [DOI] [PubMed] [Google Scholar]
- 36.Ahearn TS, Staff RT, Redpath TW, Semple SI. The use of the Levenberg–Marquardt curve-fitting algorithm in pharmacokinetic modelling of DCE-MRI data. Phys Med Biol. 2005;50(9):N85. doi: 10.1088/0031-9155/50/9/N02. [DOI] [PubMed] [Google Scholar]
- 37.Aryal MP, Nagaraja TN, Brown SL, Lu M, Bagher-Ebadian H, Ding G, Panda S, Keenan K, Cabral G, Mikkelsen T, Ewing JR. Intratumor distribution and test-retest comparisons of physiological parameters quantified by dynamic contrast enhanced MRI in rat U251 glioma. NMR Biomed. 2014;27(10):1230–8. doi: 10.1002/nbm.3178. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Yang C, Karczmar GS, Medved M, Oto A, Zamora M, Stadler WM. Reproducibility assessment of a multiple reference tissue method for quantitative dynamic contrast enhanced–MRI analysis. Magn Reson Med. 2009;61(4):851–9. doi: 10.1002/mrm.21912. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Kallehauge JF, Sourbron S, Irving B, Tanderup K, Schnabel JA, Chappell MA. Comparison of linear and nonlinear implementation of the compartmental tissue uptake model for dynamic contrast-enhanced MRI. Magn Reson Med. 2017;77(6):2414–23. doi: 10.1002/mrm.26324. [DOI] [PMC free article] [PubMed] [Google Scholar]