Abstract
Purpose
Characterize system-specific bias across common magnetic resonance imaging (MRI) platforms for quantitative diffusion measurements in multicenter trials.
Methods
Diffusion weighted imaging (DWI) was performed on an ice-water phantom along the superior-inferior (SI) and right-left (RL) orientations spanning ±150 mm. The same scanning protocol was implemented on 14 MRI systems at seven imaging centers. The bias was estimated as a deviation of measured from known apparent diffusion coefficient (ADC) along individual DWI directions. The relative contributions of gradient nonlinearity, shim errors, imaging gradients and eddy currents were assessed independently. The observed bias errors were compared to numerical models.
Results
The measured systematic ADC errors scaled quadratically with offset from isocenter, and ranged between −55% (SI) and 25% (RL). Nonlinearity bias was dependent on system design and diffusion gradient direction. Consistent with numerical models, minor ADC errors (±5%) due to shim, imaging and eddy currents were mitigated by double echo DWI and image co-registration of individual gradient directions.
Conclusion
The analysis confirms gradient nonlinearity as a major source of spatial DW bias and variability in off-center ADC measurements across MRI platforms, with minor contributions from shim, imaging gradients and eddy currents. The developed protocol enables empiric description of systematic bias in multicenter quantitative DWI studies.
Keywords: quantitative diffusion MRI, multicenter trials, gradient nonlinearity, ADC mapping
Introduction
Diffusion weighted imaging (DWI) is an inherently quantitative magnetic resonance imaging (MRI) technique widely utilized in clinical applications and for research evaluating efficacy of tumor treatment (1–4). The most common summary measure of quantitative DWI is the apparent diffusion coefficient (ADC) that reports on the mean diffusivity of water which is modulated by the cellularity of the imaged tissue (5,6). The ADC is derived from two or more DWI measurements assuming mono-exponential signal decay with increasing diffusion weighting, b-value. The clinical utility of ADC as a radiological marker for tissue response to treatment (2,4,7) and/or lesion characterization (8–11) is being assessed through multicenter oncology trials (12–14). For such studies it is imperative that the sources of technical variability are characterized (and controlled) (15–17) so that measured changes due to biology are interpretable relative to systematic error (18–20). Eliminating bias in ADC measurements can reduce requisite subject numbers (and associated trial costs), and help avoid inconclusive findings from underpowered studies (18–20).
Previous multi-scanner quality control studies have demonstrated excellent reproducibility of ADC measurements acquired at magnet isocenter (21,22) using a temperature-controlled (ice-water) phantom (variability < 3%). Away from the isocenter, large ADC errors (> 20%) due to gradient nonlinearity (GNL) bias were detected on some clinical systems (21–23). Although both systematic bias and random errors are known to contribute to measurement variability (15,24,25), the detected platform-specific spatial bias in diffusion weighting (i.e., deviation from the specified b-value) (26,27) can exceed random measurement noise many-fold (21–23), thereby confounding ADC values obtained by different scanner platforms in multicenter studies (14,20,28). Characterization and removal of the detected spatial DW bias is a prerequisite for establishing confidence levels of the derived ADC metric in clinical oncology applications (12,14).
Systematic deviations from nominal diffusion weighting (26,27) are not routinely corrected on clinical MRI systems, even though platform-dependent gradient characteristics are utilized by vendor engineers to spatially (un)warp images so that distance and geometry are faithfully represented (29,30). Evaluation of emerging DW bias correction methods developed by the research community (23,26,27,31) requires a uniform protocol for independent characterization of systematic errors across the relevant clinical MRI platforms. Additionally, the relative contributions to the actual (incorrect) diffusion-weighting by multiple sources (including GNL (26), imaging gradients (32), imperfect shim, concomitant fields (33), and eddy currents (34)) need to be assessed to develop and implement efficient platform-specific correction strategies. For quantitative assessment of these contributions, a comprehensive model for b-value bias combining multiple sources of gradient imperfection is required.
Conventional approaches for quantitative 3D characterization of individual gradient systems often rely on a set of platform-specific (and application-specific) phantoms and tedious procedures (29,35) that are not suited for uniform multicenter cross-platform implementation. In contrast, the diffusion phantom approach is well adapted for the multi-platform setting typical of clinical oncology trials (21,22). The equivalence of GNL characteristics obtained via ADC bias measurement on an ice-water phantom (31) to those measured using a geometric distortion phantom (29) or predicted by spherical harmonics model based on gradient design (26,27) has been demonstrated for select systems (21,23,31). This approach enables direct quantitative investigation of systematic GNL bias impact on inter-platform ADC variability for multicenter trials. Furthermore, the diffusion phantom method allows for comprehensive empiric evaluation of both GNL and non-GNL sources of b-value bias for direct comparison with numerical models.
The characterization of platform-dependent bias in diffusion weighting leading to systematic ADC errors that deteriorate inter-system reproducibility was addressed in this study by the Data Acquisition Working Group of the NCI-sponsored Quantitative Imaging Network (QIN) (13,36), established to improve the role of quantitative imaging for oncology trails. This Working Group developed a uniform diffusion phantom test protocol for data acquisition and analysis to objectively characterize the systematic DW bias across diverse and commonly used clinical MRI platforms. The proposed empirical procedure was implemented on representative scanners from three major vendors at seven QIN member institutions. Empirical measurements were corroborated by numerical models of b-value bias. Results of the analysis and their implications for multisite clinical trials utilizing quantitative diffusion imaging are described in this manuscript.
Methods
A single phantom with support stage (Supporting Material: Figures S1–S5) and scan instructions was shipped to seven QIN centers, where it was prepared and scanned onsite. The resulting DWI data acquired on 14 MRI scanners were uploaded for analysis by a single center. All data were in Digital Image Communication in Medicine (DICOM) format (37) and data analysis was automated using routines developed in Matlab 7 (Mathworks, Natick MA).
(1) Diffusion Phantom and DWI acquisition
The ice-water phantom consisted of a 260 mm long, thin-walled, capped polyethylene tube (diameter: 29 mm) filled with distilled water (0.17L) and immersed in another container filled with an ice-water bath (3.8L Nalgene jar, Sup. Mat.: Fig. S1). The ~20-fold larger volume of ice-water bath facilitated rapid thermal equilibration of the inner tube, and sustained temperature over the full scanning session on a given system (21). Maintaining the phantom at a controlled temperature of (0.0±0.5)°C provided the universal ADC standard with a known (published) diffusion coefficient of (1.100±0.011)×10−3 mm2/s (38). A plastic stage was built to control phantom anterior-posterior (AP) elevation in the magnet bore (Sup. Mat.: Figures S3, S4). The phantom was scanned using head first, supine positioning. Initial AP elevation of the measurement tube was assessed on the first locator series, acquired with no superior-inferior (SI) and right-left (RL) offsets from isocenter (Sup. Mat.: Schema I). The vertical height of the phantom stage was then adjusted to set the AP location of the center of the measurement tube to within ±10 mm of AP = 0 for all subsequent series. Extended off-center measurements were performed by repositioning the phantom on the stage to +70 mm, 0mm and −70mm offsets for both RL and SI phantom orientations (Sup. Figs. S3,S5). Extended AP offset measurements were not performed due to redundancy with RL orientations expected from cylindrical symmetry of the studied horizontal-bore gradient systems (26,29,31). Axial (AX) multislice DWI scans with nominal b = 0, 750 and 1500 s/mm2 were acquired for all phantom positions. Additional sagittal (SAG) acquisitions were performed on several systems when the phantom was oriented in the RL direction (Table 1). To characterize individual gradient channels (here defined as, GX – RL gradient, GY – AP gradient, and GZ – SI gradient), the phantom was scanned using diffusion gradients applied along the primary magnet axes (X, Y, Z). The recommended key scanning parameters are listed in Supporting Table S1 (Sup. Mat.).
Table 1.
Log# | Vendor | Model | Field | L/d (m/m) | Seq.Var. | RLb | SIb | AP(at RL= 0/120mm)c | TE (ms) RL/SI |
---|---|---|---|---|---|---|---|---|---|
1 | Philips | Ingenia | 3T | 1.64/0.7 | SSE | AX,SAG | AX | 5/5 | 115 |
2 | Siemens | Verio | 3T | 1.73/0.7 | DSE | AX,SAG | AX | −2/−20 | 134 |
3 | GEa | HDxt-W | 1.5T | 1.72/0.6 | SSE | AX,SAG | AX | 35/37 | 110 |
4 | GEa | HDxt-Z | 1.5T | 1.72/0.6 | SSE | AX,SAG | AX | 7/3 | 75 |
5 | Siemens | Trio | 3T | 2.13/0.6 | DSE | AX,SAG | AX | −5/−25 | 156 |
6 | Siemens | Trio | 3T | 2.13/0.6 | DSE | AX | AX(p5,6)d | −5/−30 | 103 |
7 | Siemens | Espree | 1.5T | 1.25/0.7 | DSE | AX,SAG | AX | −5/−20 | 135 |
8 | Philips | Achieva | 3T | 1.6/0.6 | SSE | AX | AX(p6)d | 15/12 | 130 |
9 | GE | Explorer | 3T | 1.7/0.6 | SSE | AX | AX | 13/5 | 99 |
10 | Philips | Ingenia | 3T | 1.64/0.7 | SSE | AX | AX | 10/10 | 116/125 |
11 | GE | Excite | 1.5T | 1.72/0.6 | S/DSE | AX,SAG | AX | 10/5 | 96/110 |
12 | Siemens | Skyra | 3T | 1.73/0.7 | SSE | AX | AX | −15/−30 | 158 |
13 | Siemens | Espree | 1.5T | 1.25/0.7 | DSE | AX | AX | −35/−57 | 169 |
14 | Siemens | Skyra | 3T | 1.73/0.7 | DSE | AX | AX | −15/−33 | 175/161 |
“SSE”: single spin-echo; “DSE”: double spin-echo; “AX”: axial; “SAG”: sagittal; “L/d”: gantry bore length (m) / bore diameter (m)
same system with two gradient settings (labeled “W”hole and “Z”oom).
image acquisition plane (AX/SAG) for specific phantom alignment direction in the magnet (RL/SI);
apparent AP elevation of the phantom at RL offset 0 mm and 120 mm;
missing phantom SI-scans (“positions”, see Sup. Mat., Table S1);
DWI data were acquired on 14 MRI scanners (ten different models) from three major vendors (Siemens (Erlangen, Germany), Philips (Best, Netherlands), GE (Waukesha, WI)) at two fields (five 1.5T and nine 3.0T systems). The system information and scanning parameters (derived from DWI DICOM header) are summarized in Table 1. Half of the systems had a bore diameter of 0.7 m and the other half – 0.6 m. Bore length varied over a broader range from 1.25m to 2.13m, with the majority of systems (10) falling between 1.55m and 1.75m. A GE Signa HDxt scanner that provided two gradient settings (labeled “W”hole and “Z”oom) is listed as two separate systems (#3 and #4 in Table 1). Four system models (out of ten different platforms) were duplicated by different sites. Seven systems provided AX and SAG scans for the RL phantom orientation. Four systems exceeded recommended TE (≤130ms, Sup. Table S1), while all systems used the recommended TR ≥10s. The applied spin echo sequence variant (single spin-echo (SSE) versus double spin-echo (DSE), Table 1) was confirmed by individual sites with their vendor application scientists. Seven systems used SSE, six systems confirmed DSE, and one system (#11) ran both sequence variants. Additionally, SSE versus DSE was run on a scanner of the same model (#12 and #15) by two different sites, respectively. Two systems did not perform select SI offset measurements (“positions”, Sup. Mat., Fig. S5), and one system (not listed in Table 1) did not provide images for each individual diffusion direction, and thus were excluded from corresponding direction or offset specific analysis.
(2) ADC analysis
Centralized data analysis was performed by a single site to derive the ADC from native DWI images using a mono-exponential fit between pairs of acquired b = 0 and b > 0 s/mm2 values. The ADC was calculated for the three separate DWI directions to characterize each individual gradient channel. The applied diffusion gradient direction in magnet coordinates was derived from the DICOM header (to assign to specific gradient channel). Only one vendor utilized “public” DICOM tags (37) for DW direction [0018, 9089] and b-value [0018, 9087] conditions. Diffusion direction and b-value information for the other two vendors had to be extracted from “private” DICOM tags, with one vendor using special tags for X, Y, Z (versus oblique) DWI gradients, as described in the corresponding DICOM conformance statement. The information on DWI image post-processing (e.g., spatial filtering) was not available from the DICOM header, but was confirmed by individual sites to exclude optional filtering (as required by the study protocol, Sup. Mat.).
To correct for channel-specific b > 0 s/mm2 image distortion induced by eddy currents (34), two-dimensional full-affine co-registration to the b = 0 s/mm2 image was performed for DWI data that exhibited substantially different (in excess of 5 mm) phantom tube displacements and/or misshaping for different gradient channels (DWI directions). For systems with significant initial distortions, the co-registration efficiency was assessed by visual inspection of consistent phantom tube position and shape for all DWI directions of each given image slice prior to ADC map generation. Co-registration of major distortions improved resulting ADC map uniformity (reduced histogram width) for systems with high eddy current distortions on select gradient channels.
ADC was calculated on a pixel-by-pixel basis, and the mean value was measured from 10mm-diameter circular ROIs (~90 pixels) manually placed in the middle of the ice-water tube on the b=0 image, avoiding areas with susceptibility artifacts (e.g., near air bubble or near edges) and non-uniform signal (e.g., severe geometric distortions). Typically, 20–30 offsets (ROIs) were defined for each phantom position to cover a spatial extent of ±150 mm with a total of approximately 50 non-overlapping ADC measurements along the SI and RL orientations for individual gradient channels. Measurement uncertainty was estimated as 2×SD (95% confidence level (18)) of the ADC pixel noise within an ROI. Uncertainty analysis across systems provided a baseline sensitivity threshold for detection of systematic bias.
(3) Systematic bias evaluation
For all measured offsets, the fractional bias error for ADC was estimated as a normalized deviation from the published (“pub”) value of 1.1 × 10−3 mm2/s (38), using the equation (ADC-ADCpub)/ADCpub. For the mono-exponential phantom medium (ice-water), the measured fractional ADC bias was equivalent to bias in b-value (i.e., DW bias). Bias statistics across systems were characterized by mean, standard deviation (SD), median and ranges. ADC bias dependence on offset, x, was fit to a 4th order polynomial in Matlab: c0+c1x+c2x2+c3x3+c3x4. For a majority of the studied platforms, a 2nd order fit was indistinguishable from a 4th order for ADC bias along SI. However, at remote RL offsets, ADC measurements for five systems exhibited higher order features, therefore, a 4th order fit was used for all systems for consistent analysis. The goodness of fit was characterized by the norm of residuals (NR). For uniform comparison of relative bias curvature, asymmetry and offset contributions, the fit coefficients were scaled (to common ADC units of 10−3 mm2/s) by: 10−3 (mm) for c1 (original units of mm/s), and 10−6 (mm2) for c2 (original units of 1/s). A representative offset of ±120 mm from isocenter was chosen to estimate bias magnitude across systems to ensure (by inspection) that bias for all systems exceeded their corresponding ROI noise at least two-fold both for RL and SI orientations.
The relative gradient nonlinearity (GNL) bias contribution was assessed from c2, and its magnitude (for cross-platform analysis), using the average fractional bias measured at ±120 mm for the available data from 14 systems with RL and 13 systems with SI measurements, Table 1. GNL AP characteristics were implicitly inferred from the cylindrical symmetry of gradient channels (26,29,31) for the studied horizontal-bore systems: GX(AP) = GY(RL), GY(AP) = GX(RL), GZ(AP) = GZ(RL). The similarity of GNL designs for the four duplicate system models (Table 1) was quantified by mean distance between measured average fractional bias of corresponding channels and orientations: . The relative bias asymmetry was estimated from c1, and its magnitude, by the mean difference between measured fractional biases at ±120 mm provided for 14 systems (RL) and 12 systems (SI), Table 1. The c0-coefficient was used to evaluate mean ADC shift (from the known ice-water value) close to the isocenter. The Pearson correlation coefficient, R, across systems was used to find the dependent variables among measured/fit quantities and acquisition parameters. The assignment of dominant physical bias sources was performed on the basis of the highest correlation with the measured bias characteristics (nonlinearity, asymmetry and shift). Third and fourth order fit coefficients, c3 and c4, were not correlated to physically meaningful parameters, and corresponding analysis was omitted. For a qualitative description of observed bias trends, numerical models for DW-bias (i.e., non-uniformity of the b-value) were built for SSE and DSE sequence variants using established theory (Appendix) to include GNL (30) as well as the second-order gradient imperfections (32–34). Note that these models for b-value bias are distinct from geometric distortions (not analyzed here) of DW images by the same sources.
The magnitudes of the second-order gradient imperfections were assessed from independent empiric measurements. Imperfect shim (causing background gradients, GBG) was evaluated from the apparent (measured) versus actual (isocenter) region-of-interest (ROI) elevation (AP) as a function of (and normalized by) the offset (Table 1). ROI location accuracy was 2–3 mm (approximately 1–2 pixels). Cross-terms between the diffusion gradients, GDW, and the imaging gradients, GIM, were measured from the difference in ADC bias near isocenter for the DWI gradient along the slice direction with respect to other DWI directions. Eddy current contribution was analyzed by comparison of ROI noise (histogram width) from ADC maps generated for individual DW gradient directions with and without the b > 0 s/mm2 images co-registered to the b = 0 s/mm2 image. A proof-of-concept, empiric GNL bias correction was performed as described in (31). Channel-specific SI and RL nonlinearity “scalars” for diagonal elements of system nonlinearity tensor, Lii, were measured directly from ADC/ADCpub~5% (31). Trace-ADC correctors for the isotropic DWI phantom were approximated by the channel-average GNL as a function of the offset, . Ten individual GNL correctors were built for the non-duplicate system models (Table 1). Same corrector was used for the duplicate models. The numeric correctors were applied to remove the measured offset-dependent b-value bias as, ADCc=ADC(x)/Cav(x).
Results
Offset-dependent ADC measurements for the ice-water phantom along RL and SI orientations, performed for the individual gradient channels (GX, GY, GZ) and the trace DWI of all systems, provided independent estimates of the ROI noise, the mean ADC value at isocenter (constant fit coefficient, c0), the dependence of ADC asymmetry on offset direction (linear fit coefficient, c1), and the ADC bias curvature (quadratic fit coefficient, c2). Typical ADC measurements are illustrated for a representative system in Figure 1a. Nearly quadratic offset dependence is observed for all gradient channels, with positive deviations from known ADC value away from isocenter along RL, and negative deviations along SI, qualitatively consistent with GNL. Direction average (trace ADC) fit dependence on the offset is illustrated by the solid black curves. Since typical ADC measurements are performed by combining three orthogonal DWI directions, the deviation of trace ADC from the known value for ice-water directly quantifies the bias error of a given system. Deviation from symmetry at positive versus negative offsets from the isocenter is detected for the specific gradient directions (e.g., GX (blue) along SI, Fig.1a), reflecting the contribution of non-GNL bias sources. For offsets less than ±40mm from the isocenter, the mean measured ADC values for all gradient directions of this system are within ROI measurement uncertainty (±2%, error-bars) from the known ADC of ice-water (solid horizontal line).
To establish the level of significance for different contributors to ADC bias with respect to ROI noise, the measurement error statistics was analyzed for all studied systems. The analysis of the ROI noise (2×SD measurement uncertainties) averaged over all offsets is shown in Figs.1b,c for individual gradient directions along with the norm of residuals (NR) for the trace (channel-average) ADC fit. The trace-ADC uncertainties (not shown) are reduced by averaging of three channel-specific measurements , and thus would always be below the noise level of the individual channels (Fig.1b,c). As is evident from Fig.1b,c, NR provides a more conservative error envelope for the direction-dependent ROI noise. The observed measurement uncertainty was relatively independent of the sequence variant and b-value. The RL measurement errors were negatively correlated to TE and field strength (R = −0.54, −0.64; P = 0.036, 0.01, respectively), and positively correlated (R = 0.55, P = 0.034) to NR for the fit. Three outliers (ROI noise > 0.1 × 10−3 mm2/s) were observed for GZ -channels of the systems with higher eddy current contribution or with large areas where ice had melted (thermal equilibrium was not assured).
The significant channel-specific DWI distortions due to eddy currents (Methods) were observed to correlate both with broader ROI histograms and the jitter in the mean ADC measurements for adjacent ROIs at some offsets (e.g., Fig.1a, GX (SI >100mm)), leading to an increase in apparent NR. Co-registration to the b = 0 s/mm2 image for systems with high eddy current contribution (Methods) effectively reduced ROI noise (pixel histogram width) by half and alleviated mean jitter between the adjacent ROIs. The observed average ROI noise (Fig.1b,c) was higher at 1.5T than at 3.0T and along RL versus SI (0.06 versus 0.04) × 10−3 mm2/s, and was tighter for the DSE compared to the SSE sequence variant (SD: 0.02 versus 0.03) × 10−3 mm2/s. Suppression of eddy-current artifacts in DWI images by DSE acquisition was further confirmed empirically for the system that ran both SSE and DSE variants (Table 1). Average noise for individual gradient channels fell within trace-fit NR (Figs.1b,c, last column), indicating that the main source of fit error was measurement uncertainty. Mean ROI noise across systems was 0.05 (±0.03) × 10−3 mm2/s (dotted line in Figs.1b,c).
Figure 2 illustrates two physical sources of the mean ADC bias (deviation from ice-water value) at the isocenter (zero offset): (i) phantom AP-elevation (Figs.2a,b) and (ii) imaging gradients on the slice-select channel (Figs.2c,d). For all systems, the detected AP ROI elevation near isocenter ranged between (−35 and +35) ±3 mm (median of 5 mm) (Table 1). Physical AP elevation of the phantom correlated with the concerted shift of ADC curves for multiple gradient channels (e.g., GX and GY, Fig.2a,b). The relative shift of an ADC curve for the slice-select channel with respect to the other gradient directions (Fig.2c,d), observed for SAG versus AX acquisition (Table 1), supplied direct empirical measurement of imaging gradient contribution to ADC bias (e.g., +2% shift between GX and GZ curves in Fig.2c,d). For all systems that performed SAG versus AX scans for RL phantom orientation (Table 1), this measurement confirmed that imaging gradients contributed a small (< 3%) error for the diffusion gradient along slice direction. Note that the combined (minor) bias contribution from phantom positioning error and imaging gradients fell within mean ROI noise (5%, Fig.1b,c) across the studied scanner systems at isocenter.
Figures 3a,b summarize the offset-dependent bias (as fractional deviation from published value) for the trace-ADC fit of all studied systems (Table 1: 14 for RL orientation, and 13 for SI) when using the b = 1500 s/mm2. (No significant differences were observed for fits that used the b = 750 s/mm2; the data is not shown.) Note that the change of presentation scale (for this and following figures) to fractional bias units makes interpretation of the observed systematic ADC errors equivalent to the DW bias in b-value (Methods). The detected bias for all scanners exceeded ROI measurement error (Fig.1b,c) for offsets from isocenter greater than 40 mm. For individual systems, the largest (negative) bias (up to −55%) was observed along SI (Fig.3b), with RL showing comparatively less bias (up to +25%, Fig.3a). At the 120 mm offset, mean SI bias across systems was −20% (±20%, 2×SD), while RL was +7% (±15%, 2×SD), indicating relatively high variability (2×SD exceeding the mean bias) among systems for the measurements away from isocenter.
The relative contributions of bias curvature, asymmetry and constant shift from ice-water value are illustrated in Figure 3c,d for c0, and scaled c1 and c2 fit coefficients with respect to NR (that provides ROI noise envelope, Fig. 1c,d). The constant shift coefficient (c0=0.02±0.01) was less than a half of NR, independent of field and sequence variant, indicating good agreement among ADC measurements across systems close to isocenter. This coefficient was also significantly correlated (R = 0.83, P = 10−4) to the measured all-channel ADC shift (due to phantom AP elevation, Table 1, c, Fig.2a,b), confirming that the major source of ADC error at isocenter was due to differences in the AP location of the phantom. The next largest bias effect, which exceeded NR (and ROI noise, Fig.1b,c), was explained by the linear fit coefficient, 1, (Fig.3c,d) that was significantly anti-correlated (R = −0.8, P = 3·10−4) to RL bias asymmetry at ±120 mm (Methods). The average (absolute) asymmetry among the scanners was about 3-fold higher and changed sign for SI orientation compared to RL. This fit coefficient was nominally independent of field, but spread over a wider range among the SSE (compared to DSE) scanners (Fig. 3d). Finally, the major source of offset-dependent bias among the systems was explained by the c2 coefficient (note ×20 scale in Fig. 3c,d). This coefficient was highly correlated (R > 0.9, P < 10−5) to average fractional bias along both SI (negative bias, rhombi) and RL (positive bias, circles), and exhibited higher values for 1.5T systems (Fig.3c).
To directly quantify the relative magnitude of the two dominant bias contributors for individual gradient directions (nonlinearity and asymmetry), the corresponding correlates of c2 and c1 fit coefficients (average fractional bias and bias asymmetry at ±120 mm) were plotted in Figure 4 and analyzed as a function of field and sequence variant. For off-center measurements, the GNL bias accounted for the bulk of the observed ADC error (compared to constant shift and asymmetry) independent of sequence variant, but dependent on gradient channel and field (Fig.4a,b). The highest nonlinearity (>|0.2|) was observed for GX (SI and RL) and GY (SI) of the 1.5T scanners. For an individual gradient channel, the bias along SI offset (rhombi) was twice the RL bias (circles). The channel-average nonlinearity along RL orientation was significantly anti-correlated (R = −0.73, P = 0.002) to the system bore length (Table 1). Nearly zero RL nonlinearity bias was observed for all channels of the two long-bore 3T systems of the same model (Fig.4). A high correlation (|R| > 0.7, P < 0.005) was also observed between the system aspect ratio (bore-length to bore-diameter, Table 1) and the GZ-channel nonlinearity along RL (R < 0) and SI (R > 0). Duplicate system models (Table 1) exhibited nominally indistinguishable GNL characteristics for all channels, with the (channel and orientation) average percent-difference in GNL bias falling within the corresponding measurement SD: (0.6±1.8)% (#1,10); (1.6±1.7)% (#5,6); (1.4±3.3)% (#7,13); and (1.3±3.3)% (#12,14). The most linear channel among the scanners was GZ, and the channel-specific GNL bias was more consistent among 3T scanners (Fig. 4a). Overall these results provide the strong evidence that GNL contribution to diffusion weighting bias is predictable from gradient system design.
The average bias asymmetry error among the channels (Fig.4c,d) was within ±0.03 (2×SD) and nominally independent of the field or offset direction. The asymmetry errors among systems were reduced for DSE, most effectively (~1/2) for GZ and GY (Fig.4d). Independent measurement of ROI drift (AP) at 120 mm (RL) with respect to isocenter directly reported on shim accuracy and was significant (−15 to −20 mm) for six systems by the same vendor, while generally less than 5 mm for all other scanners (Table 1). Remarkably, the systems with poor shim were almost exclusively using DSE (0.85 correlation) that apparently lessened their observed bias asymmetry. Different asymmetry was observed for duplicate gradient models, reflecting independence of shim errors from general gradient channel design (GNL errors).
Numerical simulations of different contributions to bias in diffusion weighting (b-value: Appendix, Figure 5a,b) predicted that minor bias errors (due to shim and imaging gradients) would be mitigated by DSE, while major GNL bias persisted. Figs.5c,d show the ROI-offset (stars), quadratic GY bias (blue) and SI asymmetry (green) measured for two systems (from different QIN centers) with the same gradient model using SSE and DSE (Table 1). The extent of ROI AP-drift (stars) confirmed that both systems had similar significant shim imperfections. These resulted in a noticeable DW bias asymmetry for SSE, which was corrected by DSE. Another system showing significant bias contribution from the imaging gradients for SSE (not shown) was also corrected by DSE, consistent with the prediction from the numerical model (Fig. 5a,b). No significant contribution to b-value bias was predicted (or detected) from eddy-currents and concomitant gradients. The GNL model of a representative system (supplied by a vendor) was consistent with the measured channel-specific DW bias and showed that the expected b-value offsets due to observed AP positioning errors among systems (±35mm, Table 1) would be within 3%. Feasibility of empiric GNL (b-value) correction for the phantom ADC data, presented in Fig.3(a,b), is demonstrated in Sup. Mat. Fig. S6. This empiric correction removed 80–90% of the total original ADC error and effectively reduced both cross-platform variability and individual system bias to ±3% for RL and SI offsets within ~120 mm. The residual error, amplified for a few systems at the remote offsets, |x|>120mm, confirmed contribution of non-GNL bias sources.
Discussion
Consistent with the previous findings (21,22), high ADC agreement with published value (within 3%) was achieved close to isocenter (offset < 40mm) by all systems, with bias not exceeding the measurement error (mean ROI noise of 5%). The dominant sources of amplified random ROI noise were eddy-currents, lower field and longer TE. In the small offset range, the minor (systematic) ADC bias contributions (<3%) were mainly due to phantom positioning errors (AP elevation) and/or cross-terms with imaging gradients on slice-select channel (32). At larger offsets (> 40 mm) bias scaled nearly quadratically with the offset and exceeded the ROI noise. This offset-dependent bias led to both spatially dependent error of the measured ADC for a single system and increased variability among the systems due to different gradient characteristics. The major contribution to the DW bias was from the systematic nonlinearity in the gradients. ADC measurements at the same location (e.g., SI = 120mm) by different systems showed variations of 15–20% (2×SD), while ADC of the same (phantom) anatomy at ±120 mm deviated from the value at isocenter (and published value) by an average bias error of −20% (SI) to +7% (RL). The empiric correction of the major GNL bias in the phantom ADC has reduced both cross-platform variability and bias error range down to ±3% (below ROI noise) for the RL and SI offsets of up to ±120 mm.
Minor second-order bias contributions due to shim, imaging and eddy currents were virtually independent of gradient design. As expected from the physical origin of the effect, the shim-related bias was most pronounced along the main magnetic field direction (SI), while the imaging gradients contributed bias terms for the slice-select channel exclusively. Consistent with the prediction of a numerical model (Appendix), application of the DSE acquisition effectively suppressed bias due to shim and imaging gradients. For three systems, the errors due to eddy-currents on GZ-channels along RL or SI exceeded (otherwise small) GNL error. With the exception of the outliers, the absolute measurement uncertainty was small enough (2×SD < 7%) to allow confident characterization of ADC bias away from isocenter. Consistent with the analytical model (Appendix), the numerically predicted eddy-current contribution to b-value bias was negligible (even for the systems with significant eddy-current artifacts). DSE acquisition or co-registration of diffusion images from individual DWI directions to the b = 0 s/mm2 image for SSE acquisition alleviated the eddy current effects and reduced ROI noise (histogram width).
The proposed measurement protocol allowed quantitative description of the major nonlinearity bias for individual gradient channels across diverse scanner platforms in a multicenter setting. GNL was virtually independent of sequence variant and had the dominant contribution to systematic ADC bias, not correctable by acquisition or co-registration. Among the studied systems, the widest inconsistencies (20% spread from 20% mean bias) for off-center trace-ADC measurements were observed due to GNL for 1.5T magnets. This finding may be related to the older gradient design of the sampled 1.5T systems (rather than lower field). Direction-specific nonlinearity was dominated by GX-channel contribution for all systems and was higher at 1.5T. The most linear channel across all systems was the GZ gradient. For the typical ADC measurements of isotropic media, systematic error would be an average of the direction-specific bias.
GNL characteristics were highly correlated to bore geometry and nearly identical (within measurement error) for systems of the same model supporting the premise that related DW imperfections could be corrected using model-specific gradient design parameters (23,26,27). When these parameters are unknown, empiric off-line correction of GNL bias, based on (presently) measured system-specific nonlinearity characteristics, is also possible (31). However this would require diffusion direction and nominal b-value information, currently not uniformly stored in “public” DICOM tags (37). Use of non-public tags for DWI acquisition parameters should be discouraged in view of multicenter studies that need to characterize and correct for GNL bias. The described measurements were performed along linear offsets, RL (SI, AP = 0) and SI (RL, AP = 0), and the results should not be extrapolated to arbitrary locations within the magnet bore. There may be spatial regions for the systems where the ADC deviations are larger than detected. The orthogonal (X, Y, Z) DWI gradients were used to characterize individual channel performance along primary gradient directions, which does not provide information on cross-channel gradients (Li≠j, Appendix). These spurious gradients can be substantial, depending on the gradient design (26,29,30).
As a result of the findings presented here, a set of guidelines has been proposed and tested for multisite evaluation of scanners and minimizing ADC errors to correct for gradient design imperfections on individual platforms by vendors. For multisite clinical trials, the proposed practical QC protocol should be implemented to characterize the scanners and develop on-site correction strategies, independently (31) or with the vendor assistance (23,27). For large FOV scans (typically used for body applications) or off-center anatomies (e.g., breast, kidney) in multicenter trials, platform-specific GNL correction practices are strongly recommended. The measured channel-specific bias characteristics can be utilized to benchmark correction efficiency across the studied systems using tools developed by the research community (26,27,31). This study has demonstrated the feasibility of empiric DW-bias correction, based on an approximation for diagonal elements of the nonlinearity tensor (31), which is readily available for research purposes. Ultimately, for clinical trials, the GNL bias in diffusion weighting is best corrected “on-line” by manufacturers, based on known (comprehensive) gradient design characteristics (23).
In conclusion, the implemented protocol for evaluating DW bias allows independent characterization of gradient channels and determination of the dominant sources of ADC error both for individual gradient systems and across a diverse set of imaging platforms for multicenter trials. The results confirm gradient nonlinearity as a significant source of bias in ADC maps. Additional (minor) sources of bias are due to shim, eddy currents and imaging cross-terms. Importantly, these sources of DW bias are not currently corrected on clinical scanners regardless of vendor platform. The presence of systematic bias will not affect variability for the same anatomy scanned by the same platform at the same location relative to isocenter, other than that the actual measured ADC value will be biased (but consistent). However, without bias correction, ADC variability in multicenter studies is expected to be significant due to both different system designs and scan locations. Minor bias in b-value is mitigated by twice-refocused spin-echo acquisition, while dominant nonlinearity bias (same origin as geometric distortions) can be corrected using platform-specific gradient design information.
Supplementary Material
Acknowledgments
This research was supported by National Institutes of Health Grants: U01-CA166104, U01-CA151235, U01-CA154602, U01-CA154601, U01-CA140204, U01-CA142565, U01-CA172320. Authors are grateful to vendor applications scientists for confirming system specific scan variant and providing GNL design information for select systems.
Appendix
The diffusion weighting b-value was obtained as an Euclidian norm of b-matrix, bk = ‖bk, calculated by numerical time-integration of kth-direction gradient waveform, Gk(τ) = (GkX,Gky,GkZ)T, whose coordinates indicate active gradient channel elements at time τ, according to (32,39): . Here γ is the gyromagnetic ratio and TE is the echo time for the spin echo DWI sequence. In addition to idealized combination (32), GID = GDW + GIM, of imaging, GIM, and diffusion gradients, GDW = ‖GDW‖ u, where uk = (ukX,uky,ukZ)T, ‖u‖ = 1, uTu = 1 defines DW direction, the gradient waveforms were expanded to include spatially-dependent, r = (x,y,z)T, contributions from gradient nonlinearity tensor (26), , Gio=Gi(r0); background gradients due to imperfect shim, GBG = ∇Bo; concomitant gradients (33), ; Gj/2 ≡ Gj/2; and eddy current distortions (34), . These resulted in the combined gradient waveform: Gk(τ,r) = L(r)GID + GBG(r) + GEC(r) + GCF(r). In this model the nonlinearity tensor, defined by gradient channel design, operates only on the GID term, since other terms do not directly generate currents in the gradient channels.
An example of the combined gradient waveforms generated for DSE and SSE sequence variants on the GY channel at ±150 mm (SI) is shown in Figure A1. The waveform polarity is reversed after TE/2 (32) to implicitly account for the effect of the spin echo RF pulses (39). The relative contribution of the individual gradient imperfections into b-value bias was evaluated by percent deviation of simulated value from ideal (input b = 1000 s/mm2) in the presence of a specific (additive) bias source. Based, on the average values observed for the studied systems, the simulation parameters were set to: TE=100ms, B0=3T, τ1=0.5ms, τ2=2ms, a1 = a2 = 1, GBG=±0.3mT/m, GIM=18.8mT/m. With these characteristics, the contributions of GCF and GEC were negligible, accounting for less than 0.02% and 0.5% of the b-value bias, respectively, while GIM and GBG produced minor detectible errors of 3–5%, and GNL stood out as the major contributing bias factor of up to 20% at remote offsets from isocenter.
References
- 1.DeVries AF, Kremser C, Hein PA, et al. Tumor microcirculation and diffusion predict therapy outcome for primary rectal carcinoma. Int J Rad Oncology, biology, physics. 2003;56(4):958–965. doi: 10.1016/s0360-3016(03)00208-6. [DOI] [PubMed] [Google Scholar]
- 2.Foltz WD, Wu A, Chung P, et al. Changes in apparent diffusion coefficient and T2 relaxation during radiotherapy for prostate cancer. J Magn Reson Imaging. 2013;37(4):909–916. doi: 10.1002/jmri.23885. [DOI] [PubMed] [Google Scholar]
- 3.Mardor Y, Pfeffer R, Spiegelmann R, et al. Early detection of response to radiation therapy in patients with brain malignancies using conventional and high b-value diffusion-weighted magnetic resonance imaging. J Clin Oncology. 2003;21(6):1094–1100. doi: 10.1200/JCO.2003.05.069. [DOI] [PubMed] [Google Scholar]
- 4.Nilsen L, Fangberget A, Geier O, Olsen DR, Seierstad T. Diffusion-weighted magnetic resonance imaging for pretreatment prediction and monitoring of treatment response of patients with locally advanced breast cancer undergoing neoadjuvant chemotherapy. Acta Oncologica. 2010;49(3):354–360. doi: 10.3109/02841861003610184. [DOI] [PubMed] [Google Scholar]
- 5.Manenti G, Di Roma M, Mancino S, et al. Malignant renal neoplasms: correlation between ADC values and cellularity in diffusion weighted magnetic resonance imaging at 3 T. La Radiologia Medica. 2008;113(2):199–213. doi: 10.1007/s11547-008-0246-9. [DOI] [PubMed] [Google Scholar]
- 6.Squillaci E, Manenti G, Cova M, et al. Correlation of diffusion-weighted MR imaging with cellularity of renal tumours. Anticancer Res. 2004;24(6):4175–4179. [PubMed] [Google Scholar]
- 7.Hamstra DA, Rehemtulla A, Ross BD. Diffusion magnetic resonance imaging: a biomarker for treatment response in oncology. J Clin Oncology. 2007;25(26):4104–4109. doi: 10.1200/JCO.2007.11.9610. [DOI] [PubMed] [Google Scholar]
- 8.Galea N, Cantisani V, Taouli B. Liver lesion detection and characterization: role of diffusion-weighted imaging. J Magn Reson Imaging. 2013;37(6):1260–1276. doi: 10.1002/jmri.23947. [DOI] [PubMed] [Google Scholar]
- 9.Guo Y, Cai YQ, Cai ZL, et al. Differentiation of clinically benign and malignant breast lesions using diffusion-weighted imaging. J Magn Reson Imaging. 2002;16(2):172–178. doi: 10.1002/jmri.10140. [DOI] [PubMed] [Google Scholar]
- 10.Kallehauge JF, Tanderup K, Haack S, et al. Apparent Diffusion Coefficient (ADC) as a quantitative parameter in diffusion weighted MR imaging in gynecologic cancer: Dependence on b-values used. Acta Oncologica. 2010;49(7):1017–1022. doi: 10.3109/0284186X.2010.500305. [DOI] [PubMed] [Google Scholar]
- 11.Watanabe Y, Terai A, Araki T, et al. Detection and localization of prostate cancer with the targeted biopsy strategy based on ADC map: a prospective large-scale cohort study. J Magn Reson Imaging. 2012;35(6):1414–1421. doi: 10.1002/jmri.23587. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Koh DM, Collins DJ. Diffusion-weighted MRI in the body: applications and challenges in oncology. Amer J Roentgenology. 2007;188(6):1622–1635. doi: 10.2214/AJR.06.1403. [DOI] [PubMed] [Google Scholar]
- 13.Kurland BF, Gerstner ER, Mountz JM, et al. Promise and pitfalls of quantitative imaging in oncology clinical trials. Magn Reson Imaging. 2012;30(9):1301–1312. doi: 10.1016/j.mri.2012.06.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Padhani AR, Liu G, Koh DM, et al. Diffusion-weighted magnetic resonance imaging as a cancer biomarker: consensus and recommendations. Neoplasia. 2009;11(2):102–125. doi: 10.1593/neo.81328. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Delakis I, Moore EM, Leach MO, De Wilde JP. Developing a quality control protocol for diffusion imaging on a clinical MRI system. Physics Med Biology. 2004;49(8):1409–1422. doi: 10.1088/0031-9155/49/8/003. [DOI] [PubMed] [Google Scholar]
- 16.Ogura A, Hayakawa K, Miyati T, Maeda F. Imaging parameter effects in apparent diffusion coefficient determination of magnetic resonance imaging. Eur J Radiology. 2011;77(1):185–188. doi: 10.1016/j.ejrad.2009.06.031. [DOI] [PubMed] [Google Scholar]
- 17.Saritas EU, Lee JH, Nishimura DG. SNR dependence of optimal parameters for apparent diffusion coefficient measurements. IEEE Trans Med Imaging. 2011;30(2):424–437. doi: 10.1109/TMI.2010.2084583. [DOI] [PubMed] [Google Scholar]
- 18.Barnhart HX, Barboriak DP. Applications of the repeatability of quantitative imaging biomarkers: a review of statistical analysis of repeat data sets. Trans Onc. 2009;2(4):231–235. doi: 10.1593/tlo.09268. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Raunig DL, McShane LM, Pennello G, et al. Quantitative imaging biomarkers: A review of statistical methods for technical performance assessment. Stat Meth Med Res. doi: 10.1177/0962280214537344. Epub 2014 June 11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Sasaki M, Yamada K, Watanabe Y, et al. Variability in absolute apparent diffusion coefficient values across different platforms may be substantial: a multivendor, multi-institutional comparison study. Radiology. 2008;249(2):624–630. doi: 10.1148/radiol.2492071681. [DOI] [PubMed] [Google Scholar]
- 21.Malyarenko D, Galban CJ, Londy FJ, et al. Multi-system repeatability and reproducibility of apparent diffusion coefficient measurement using an ice-water phantom. J Magn Reson Imaging. 2013;37(5):1238–1246. doi: 10.1002/jmri.23825. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Mulkern RV, Ricci K, Vajapeyam S, et al. Pediatric Brain Tumor Consortium Multi-site Assessment of Apparent Diffusion Coefficient z-Axis Variation Assessed with an Ice Water. Phantom Acad Radiology. doi: 10.1016/j.acra.2014.10.006. Epub 2014 Nov 27. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Tan ET, Marinelli L, Slavens ZW, King KF, Hardy CJ. Improved correction for gradient nonlinearity effects in diffusion-weighted imaging. J Magn Reson Imaging. 2013;38(2):448–453. doi: 10.1002/jmri.23942. [DOI] [PubMed] [Google Scholar]
- 24.Aja-Fernandez S, Niethammer M, Kubicki M, Shenton ME, Westin CF. Restoration of DWI data using a Rician LMMSE estimator. IEEE Trans Med Imaging. 2008;27(10):1389–1403. doi: 10.1109/TMI.2008.920609. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Sijbers J, den Dekker AJ. Maximum likelihood estimation of signal amplitude and noise variance from MR data. Magn Reson Med. 2004;51(3):586–594. doi: 10.1002/mrm.10728. [DOI] [PubMed] [Google Scholar]
- 26.Bammer R, Markl M, Barnett A, et al. Analysis and generalized correction of the effect of spatial gradient field distortions in diffusion-weighted imaging. Magn Reson Med. 2003;50(3):560–569. doi: 10.1002/mrm.10545. [DOI] [PubMed] [Google Scholar]
- 27.Malyarenko DI, Ross BD, Chenevert TL. Analysis and correction of gradient nonlinearity bias in apparent diffusion coefficient measurements. Magn Reson Med. 2014;71(3):1312–1323. doi: 10.1002/mrm.24773. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Zhu T, Hu R, Qiu X, et al. Quantification of accuracy and precision of multi-center DTI measurements: a diffusion phantom and human brain study. NeuroImage. 2011;56(3):1398–1411. doi: 10.1016/j.neuroimage.2011.02.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Janke A, Zhao H, Cowin GJ, Galloway GJ, Doddrell DM. Use of spherical harmonic deconvolution methods to compensate for nonlinear gradient effects on MRI images. Magn Reson Med. 2004;52(1):115–122. doi: 10.1002/mrm.20122. [DOI] [PubMed] [Google Scholar]
- 30.Jovicich J, Czanner S, Greve D, et al. Reliability in multi-site structural MRI studies: effects of gradient non-linearity correction on phantom and human data. NeuroImage. 2006;30(2):436–443. doi: 10.1016/j.neuroimage.2005.09.046. [DOI] [PubMed] [Google Scholar]
- 31.Malyarenko DI, Chenevert TL. Practical estimate of gradient nonlinearity for implementation of ADC bias correction. J Magn Reson imaging. 2014;40(6):1487–1495. doi: 10.1002/jmri.24486. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Gullmar D, Haueisen J, Reichenbach JR. Analysis of b-value calculations in diffusion weighted and diffusion tensor imaging. Concepts Magn Reson. 2005;25A:53–66. [Google Scholar]
- 33.Meier C, Zwanger M, Feiweier T, Porter D. Concomitant field terms for asymmetric gradient coils: consequences for diffusion, flow, and echo-planar imaging. Magn Reson Med. 2008;60(1):128–134. doi: 10.1002/mrm.21615. [DOI] [PubMed] [Google Scholar]
- 34.Reese TG, Heid O, Weisskoff RM, Wedeen VJ. Reduction of eddy-current-induced distortion in diffusion MRI using a twice-refocused spin echo. Magn Reson Med. 2003;49(1):177–182. doi: 10.1002/mrm.10308. [DOI] [PubMed] [Google Scholar]
- 35.Vannesjo SJ, Haeberlin M, Kasper L, et al. Gradient system characterization by impulse response measurements with a dynamic field camera. Magn Reson Med. 2013;69(2):583–593. doi: 10.1002/mrm.24263. [DOI] [PubMed] [Google Scholar]
- 36.National Cancer Institute. Cancer Imaging Program. QIN Network Organization; http://imaging.cancer.gov/programsandresources/specializedinitiatives/qin/qinnetwork. Updated April 25, 2012. Accessed March 30, 2015. [Google Scholar]
- 37.Clunie DA. DICOM structured reporting and cancer clinical trials results. Cancer informatics. 2007;4:33–56. doi: 10.4137/cin.s37032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Holz M, Heil SR, Sacco A. Temperature-dependent self-diffusion coefficients of water and six selected molecular liquids for calibration in accurate H-1 NMR PFG measurements. Phys Chem Chem Phys. 2000;2(20):4740–4742. [Google Scholar]
- 39.Stejskal EO, Tanner JE. Spin Diffusion Measurements: Spin Echoes in the Presence of a Time-Dependent Field Gradient. J Chem Phys. 1965;42(1):288–292. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.