Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2017 Oct 1.
Published in final edited form as: J Magn Reson Imaging. 2016 Mar 23;44(4):846–855. doi: 10.1002/jmri.25237

Variability and bias assessment in breast ADC measurement across multiple systems

KE Keenan 1, AP Peskin 2, LJ Wilmes 3, SO Aliu 3, EF Jones 3, W Li 3, J Kornak 4, DC Newitt 3, NM Hylton 3
PMCID: PMC5098898  NIHMSID: NIHMS819314  PMID: 27008431

Abstract

Purpose

To assess the ability of a recent, anatomically-designed breast phantom incorporating T1 and diffusion elements to serve as a quality control device for quantitative comparison of apparent diffusion coefficient (ADC) measurements calculated from diffusion-weighted MRI (DWI) within and across MRI systems.

Materials and Methods

A bilateral breast phantom incorporating multiple T1 and diffusion tissue mimics and a geometric distortion array was imaged with DWI on 1.5 T and 3.0 T scanners from two different manufacturers, using three different breast coils (three configurations total). Multiple measurements were acquired to assess the bias and variability of different diffusion weighted single-shot echo-planar imaging (SS-EPI) sequences on the scanner-coil systems.

Results

The repeatability of ADC measurements was mixed: the standard deviation relative to baseline across scanner-coil-sequences ranged from low variability (0.47, 95% Confidence Interval (CI): 0.22–1.00) to high variability (1.69, 95% CI: 0.17–17.26), depending on material, with the lowest and highest variability from the same scanner-coil-sequence. Assessment of image distortion showed that Right/Left measurements of the geometric distortion array were 1 to 16 % larger on the left coil side compared to the right coil side independent of scanner-coil systems, diffusion weighting, and phase-encoding direction.

Conclusions

This breast phantom can be used to measure scanner-coil-sequence bias and variability for DWI. When establishing a multi-system study, this breast phantom may be used to minimize protocol differences (e.g. due to available sequences or shimming technique), to correct for bias that cannot be minimized, and to weigh results from each system depending on respective variability.

Keywords: breast MRI, phantoms, apparent diffusion coefficient, EPI, quality control, tissue mimic

INTRODUCTION

Quantitative magnetic resonance imaging (MRI) is increasingly used for breast cancer diagnosis, staging and treatment monitoring. Several clinical trials that use MRI techniques to assess neoadjuvant treatment protocols are underway (1, 2).

Diffusion MRI is used increasingly to provide characterization of breast cancer tumors (3, 4), either in addition to or as a replacement for dynamic contrast enhanced MRI with T1 measurements and T1-weighted images (5, 6). Quality control is much needed to ensure accurate quantitation of diffusion-weighted MRI (DWI) measurements for clinical assessments (4), the importance of which is further substantiated by the bias in diffusion measurements recently observed in a multicenter trial (7). Similarly, Min et al. summarized several studies that determine a threshold ADC value between benign and malignant breast tumors (8): the reported thresholds ranged from 0.85 × 10−3 m2/s to 1.48 × 10−3 m2/s, which is too broad to be used clinically. To determine a threshold ADC value, an improved study would distinguish the variance in measurements due to the scanner, coil and sequence from differences between patients’ benign and malignant tumor tissue.

A breast phantom is a quality control device that can be used to quantify measurement variability due to system hardware, software or imaging sequence. A breast phantom was introduced that is compatible with multiple coil styles, contains diffusion mimics for fat and healthy fibroglandular tissue, and benign and malignant tumor tissue, and has an anatomically appropriate design (9), distinguishing it from previous breast phantoms that were compatible with only one coil (10), lacked diffusion mimics (11), or lacked fat tissue mimics (12). The goal of this study is to assess the ability of this breast phantom to serve as a quality control device for comparison of ADC data across scanner-coil-sequence combinations and repeated scanning (e.g. as would be required in a multi-system study or clinical trial).

METHODS

Phantom

The breast phantom evaluated in this study was created as two distinct interchangeable units: one for diffusion and distortion evaluation and one for T1 relaxation evaluation (9) (Figure 1). The interchangeable nature allows each phantom unit to be imaged on each side of the coil, thereby allowing examination of differences between the left and right sides of the coil. As described in the phantom design paper (9), the polyvinylpyrrolidone (PVP) solutions (13, 14) in the breast phantom span the range of apparent diffusion coefficient (ADC) values for malignant mass to benign lesion reported in the literature (3, 15, 16).

Figure 1.

Figure 1

Breast phantom CAD model showing A) the diffusion phantom unit with vertically oriented samples tubes and geometric distortion plate and B) the T1 phantom unit with sample spheres arranged on four isolated levels, and C) prototype prepared for imaging in a breast MR coil.

The diffusion phantom unit includes a central, axial plate for measuring distortion. The phantom was oriented with the plate in the axial imaging plane. The plate contains a grid of 10 mm diameter holes on 20 mm center-to-center spacing in both A/P and R/L directions, with four A/P rows of four or six holes.

Magnetic Resonance Imaging Techniques

Diffusion weighted imaging sets and T1-weighted (T1-w) images were collected on three clinical scanner-coil configurations (Table 1). Apparent Diffusion Coefficient (ADC) measurements were performed using single-shot echo-planar imaging (SS-EPI) as described in Table 2. T1-w images at 1.5 T used a 3-Dimensional (3D) fast gradient echo sequence with fat suppression, repetition time/echo time (TR/TE) 7.0/4.2 ms, flip angle 10 degrees, field of view (FOV) 400 mm2, resolution 0.78 mm2, and slice thickness 2 mm. T1-w images at 3.0 T used a 3D spoiled gradient echo with fat suppression, TR/TE 7.1/2.2 ms, flip angle 20 degrees, FOV 340 mm2, resolution 0.76 mm2, and slice thickness 2 mm. All sequences were acquired in the axial plane.

Table 1.

Scanner-Coil Configurations

Configuration MRI Scanner Manufacturer Breast Coil Field Strength
1.5Ta Signa, GE Healthcare, Milwaukee, WI Open, 8-channel coil, Hologic (formerly Sentinelle Medical), Toronto, Ontario 1.5 T
1.5Tb Signa, GE Healthcare, Milwaukee, WI Open, 8-channel coil, GE Healthcare, Milwaukee, WI 1.5 T
3Tc Verio, Siemens AG Medical Solutions, Erlangen, Germany Open, 16-channel coil, Hologic (formerly Sentinelle Medical), Toronto, Ontario 3.0 T

Table 2.

ADC measurement parameters

Field 1.5 T 3.0 T
Sequence 2 b-value EPI 4 b-value EPI 4 b-value EPI
Configuration(s) 1.5Ta & 1.5Tb 1.5Ta 3Tc
TR (ms) 7500 7500 14100
TE (ms) 66.4 70.4 88
Flip Angle (degrees) 90 90 90
Bandwidth (Hz) 1953.1 1953.1 1935.0
b-value (s/mm2) 0, 600 0, 100, 600, 800 0, 100, 600, 800
Freq. encode direction A/P A/P A/P
Averages 2, 8 2, 2, 4, 8 8
Slice Thickness (mm) 2.0 2.0 2.0
FOV (mm × mm) 400 × 400 400 × 400 399 × 399
Matrix size (pixels) 256 × 192 256 × 160 152 × 152
Imaging Plane Axial Axial Axial

The diffusion and T1 relaxation phantom units were swapped between the left and right imaging positions, and the acquisition was repeated to gather data from each side of the breast coil. In some cases, measurements were repeated on the same day and on subsequent days over several months to determine variation in system performance. In addition, components with the same material were in multiple locations in the diffusion phantom unit, allowing evaluation of measurement variation with position from the magnet isocenter.

Bore temperatures were measured continuously at the time of scanning using an MRI-compatible fiber-optic temperature probe (OTP-M, OpSens, Québec, Canada). Temperature information was used in the interpretation of results; temperature corrections were not applied.

ADC Calculation

ADC measurements required calculation of parametric maps from DWI data and selection of the regions of interest (ROIs). We used software developed in the IDL programming environment (Exelis Visual Information Solutions, Boulder, CO) at the University of California San Francisco (16). ADC maps were generated by fitting the equation:

lnSb=lnS0bADC (1)

using a linear least-squares approach, where Sb and S0 are the diffusion-weighted and non-diffusion-weighted signals respectively, and b is the diffusion sensitizing factor.

ROIs were created manually. ROIs for the diffusion phantom unit were delineated by rectangles measuring approximately 70 mm by 3 mm on three contiguous axial slices for the four large tubes and 55 mm by 2 mm on two contiguous axial slices for the 12 small tubes. These 2D ROIs were subsequently stacked into 3D multi-slice ROIs. In some data sets, the geometry of the tubes was distorted due to EPI artifacts. For those data sets, ROIs were rotated in the axial plane plane up to 5 degrees to ensure that the ROIs were circumscribed within the tubes.

Image Segmentation and Distortion Measurement

Segmentation was performed on the diffusion and T1-w sets of breast phantom images in order to assess differences in distortion as a function of b-value, direction of the frequency-encoding gradient (Right/Left, R/L or Anterior/Posterior, A/P) and coil position of the two units (diffusion phantom unit on Coil Left or Coil Right). The image segmentation and distortion measurement processes were fully automated and checked against manual measurements.

For each set of images, we selected the center, axial slice of the actual imaging volume, which always contained the grid of holes in the axial distortion plate in the diffusion phantom unit. To locate the rows of holes within the distortion plate, we first located the R/L boundaries of the distortion plate within this unit, and placed the rows of grid points relative to those plate boundaries. The outer boundary of the distortion plate was located by taking a threshold, defining the background to be the lowest 5 % of intensity values. By locating the top and bottom A/P position of the boundary, we estimated the R/L vertical locations of the three structural plates in the diffusion phantom that are used for tube alignment from the computer aided design (CAD) model (Figure 2 A). In an A/P area of +/− 10 pixels from each vertical estimate, we located pixels whose intensities are well below the neighboring pixel above or below by more than 5 % of the neighboring pixel intensity range. These were clustered and filled to establish the structural plate locations. From the structural plate locations, each of the four rows of grid holes is easily located.

Figure 2.

Figure 2

A) View of the components in the CAD model on the diffusion phantom unit with the three structural plates used for tube alignment noted with (*). B) Segmentation of holes in the grid using the center slice of the image set. This shows the high gradient/intensity pixels before each cluster is filled.

Between the top and bottom large structural plates and inside the diffusion phantom unit outer boundary, we collected and averaged pixel intensity gradients, using a Sobel gradient operator. Gradient magnitudes corresponding with individual pixels in the image are calculated separately in the horizontal (Gx) and vertical (Gy) directions, applying a 3 × 3 kernel:

Gx=[101202101],Gy=[121000121]

The gradient magnitude is then the square root of the sum of the squares of Gx and Gy. For pixels whose Sobel gradient magnitudes were at least 0.1 standard deviations above the slice average, we collected those with at least an intensity difference of more than 5 % of the image’s pixel intensity range between the pixel and an outside neighboring pixel. These clusters were located and filled, defining the holes. Figure 2 B shows an example of these pixel clusters before they are filled. We measured the center of each hole and all center-to-center distances between holes in the R/L and A/P directions. In each set of images, measurements from the diffusion images in both coil positions were compared with those from T1-w images, in which no distortion occurred, based on comparison with the design specifications.

Statistics

All statistical analysis was performed in R with the nlme package (cran.r-project.org). We used the generalized least squares modeling (GLS) using the nlme package function gls. GLS models are identical to linear models/multiple regression, except that they allow for non-constant (heterogeneous) variance structures across subjects. Specifically, we allow for different variances across scanner-coil-sequence combinations. This extension to linear modeling is critical to our analysis, since estimation of differences in variability between scanner-coil-sequence combinations is central to our repeatability (test-retest) analyses. In this study design, we efficiently obtain estimates under varying sets of parameters simultaneously, using a modeling approach applied to all of the different combinations, rather than generating an individual data set for each and every parameter to be estimated. We therefore considered each scanner-coil-sequence combination as a separate condition in the model.

The outcomes in all models were the median MR measurement over voxels in a 3D ROI (i.e., for a particular tissue mimic within a tube), and the predictors are the scanner-coil-sequence combination, distance of the ROI from the magnet isocenter (modeled as linear change from isocenter), and left vs. right coil position. All results are given in terms of estimates, 95 % confidence intervals (CIs), and p‐values without adjustment for multiple testing (1719). We consider p < 0.05 to constitute statistical significance, and we indicate where caution is warranted despite findings with p < 0.05.

We examined the relative standard deviation (SD) in measurements for the four different scanner-coil-sequence combinations; a lower relative standard deviation (lower variability) corresponds to a higher repeatability. Rather than providing absolute estimates and CIs of standard deviations to estimate repeatability, we provide the standard deviations in comparison to a particular baseline scanner-coil-sequence combination. The baseline scanner-coil-sequence combination for ADC was chosen to be that with measured ADC values closest to the reference measurements acquired using a spin-echo sequence with diffusion-weighted gradients (9), Configuration “1.5Ta – 2 b-value” (Tables 1 and 2). A relative standard deviation less than one is more repeatable than the baseline, and above one is less repeatable than the baseline.

To assess reproducibility, we compared the difference in estimated means to the baseline configuration mean ADC. GLS model term mean estimates (e.g. for scanner-coil-sequence effects or distance from the magnet isocenter) and CIs relate to lack of reproducibility. For example, for scanner-coil-sequence effects or distance from the magnet isocenter, the mean ADC estimate increases or decreases due to scanner-coil-sequence, or linear change in mean ADC due to distance from magnet isocenter. If the estimated mean ADC is significantly different from the baseline mean ADC, there is bias, and the measurement is not reproducible. In addition, we determined the estimated mean ADC as a percent of the baseline mean ADC to assess the clinical impact of the change, e.g. a different scanner-coil-sequence or a change in phantom position.

RESULTS

Temperature

The bore temperatures ranged from 17.53 to 24.10 °C across all configurations, and the standard deviation of temperature for each single MRI session ranged from 0.16 to 0.78 °C.

ADC Measurement Variability

This breast phantom study found no clear trends for ADC repeatability (Table 3). For both 1.5 T scanner-coil-sequence combinations, the relative SD was both lower and higher than the baseline SD, depending on the mimic material and scanner-coil-sequence configuration. The relative SD to baseline across scanner-coil-sequences ranged from low variability (25% PVP: 0.47, 95% CI: 0.22–1.00) to high variability (0% PVP: 1.69, 95% CI: 0.17–17.26), both on 1.5Tb – 2 b-value EPI. The relative SD is higher, demonstrating lower repeatability when compared to baseline for all materials on 3.0Tc – 4 b-value EPI.

Table 3.

Relative SD (repeatability) on ADC measurements across clinical MRI systems and sequences

Material Baseline Residual SD
(95% CI low, high)
(10−6 mm2/s)
1.5Ta, 2 b-value EPI
Relative SD on ADC
(95 % CI low, high)
1.5Tb, 2 b-value EPI
Relative SD on ADC
(95% CI low, high)
1.5Ta, 4 b-value EPI
Relative SD on ADC
(95% CI low, high)
3Tc, 4 b-value EPI
Water (PVP 0 %) 156.0 (94.1, 258.5) 1.69 (0.17, 17.26) 0.78 (0.33, 1.81) 1.11 (0.69, 1.79)
10 % PVP w/w in water 93.8 (74.2, 118.6) 0.63 (0.45, 0.89) 0.83 (0.66, 1.06) 1.45 (1.16, 1.81)
14 % PVP 73.8 (42.3, 128.9) 1.41 (0.54, 3.69) 1.06 (0.59, 1.88) 1.08 (0.58, 2.02)
18 % PVP 47.3 * 0.97 * 1.30 * 1.73 *
25 % PVP 72.2 (51.9, 100.4) 0.47 (0.22, 1.00) 0.90 (0.63, 1.28) 1.29 (0.94, 1.77)
40 % PVP 74.4 (56.5, 97.8) 0.71 (0.50, 1.00) 1.16 (0.87, 1.54) 1.04 (0.76, 1.42)
*

Model did not converge on confidence interval results.

Scanner-coil-sequence ADC Measurement Bias

In most cases, ADC measures were reproducible; there was little measurement bias. ADC reproducibility for the scanner-coil-sequence model was assessed relative to the baseline, 1.5Ta – 2 b-value EPI, as shown in Table 4. The mean measured ADC for the baseline case, all materials, are given in the second column of Table 4. Only three of the eighteen scanner-coil-sequence and material combinations were statistically different than baseline and therefore not reproducible: DI water, 1.5Ta – 4 b-value EPI (198.8 × 10−6 mm2/s (95% CI: 36.3, 361.3), p=0.0203); 10 % w/w PVP, 1.5Tb – 2 b-value EPI (75.3 × 10−6 mm2/s (95% CI: 34.0, 116.5), p=0.0005); and 10% w/w PVP, 1.5Ta – 4 b-value EPI (56.1 × 10−6 mm2/s (95% CI: 8.5, 103.8), p=0.0216).

Table 4.

Effect of scanner-coil-sequence configuration on ADC for scanner-coil-sequence model

Material Baseline mean ADC
(standard error)
(10−6 mm2/s)
1.5Ta, 2 b-value EPI
Estimated difference in mean ADC for 1.5Tb, 2 b-value EPI
(10−6 mm2/s) and 95% CI (low, high)
Estimated difference in mean ADC for 1.5Ta, 4 b-value EPI
(10−6 mm2/s) and 95% CI (low, high)
Estimated difference in mean ADC for 3Tc, 4 b-value EPI
(10−6 mm2/s) and 95% CI (low, high)
Water (PVP 0 %) 1464.8 (479.7) 132.3 (−566.1, 830.8)
p=0.6890
198.8 (36.3, 361.3)
p=0.0203
157.8 (−102.2, 417.8)
p=0.2124
10 % PVP w/w in water 1365.1 (42.0) 75.3 (34.0, 116.5)
p=0.0005
56.1 (8.5, 103.8)
p=0.0216
84.7 (−18.6, 188.1)
p=0.1064
14 % PVP 998.5 (138.5) 57.4 (−174.0, 288.9)
p=0.6011
4.6 (−112.1, 121.2)
p=0.9339
67.8 (−40.6, 176.1)
p=0.1996
18 % PVP 1066.6 (48.6) 65.9 *
p=0.0852
−20.1 *
p=0.6837
97.2 *
p=0.2043
25 % PVP 749.1 (53.2) 30.9 (−7.5, 69.3)
p=0.1111
6.8 (−51.9, 65.5)
p=0.8156
38.3 (−55.1, 131.7)
p=0.4093
40 % PVP 451.0 (40.6) 38.8 (−4.0, 81.5)
p=0.0745
47.6 (−24.6, 119.7)
p=0.1914
31.1 (−26.4, 89.0)
p=0.2807
*

Model did not converge on confidence interval results.

Dependence on distance from the magnet isocenter

We included distance from the magnet isocenter as a predictor in the GLS model (Table 5). We found a spatial dependence on position from magnet isocenter, as expected from the nonlinear model previously reported (7, 20), and all ADC measurements reported in this study were adjusted by the model to remove the dependence on position.

Table 5.

Effect of position from magnet isocenter on ADC for scanner-coil and scanner-coil-sequence models

Material Absolute estimated difference in mean ADC measured value for 1 cm change in position from magnet isocenter
(10−6 mm2/s) and 95% CI (low, high)
p-value
Water (PVP 0 %) 74.2 (3.2, 145.3) 0.0418
10 % PVP w/w in water 37.4 (30.0, 44.8) <0.0001
14 % PVP 63.1 (27.9, 98.4) 0.0019
18 % PVP * 47.1 (20.4, 73.7) 0.0023
25 % PVP 24.1 (13.8, 34.4) <0.0001
40 % PVP 16.3 (9.3, 23.4) <0.0001
*

Estimate, confidence interval and p-value are based on a model with the 1.5 T – Coil B data removed to enable confidence interval estimation for the estimated difference in mean ADC for 1 cm lateral change in position from coil center.

Phantom position within the coil

We included position of the phantom in the left or right side of the bilateral breast coil as a predictor in the GLS model (Table 6). In the majority of cases, the resulting change in the estimates of the mean ADC was not statistically significant. There was one statistically significant result for phantom position: 40 % w/w PVP for the ADC scanner-coil-sequence model (p=0.0352) with an estimated difference in mean ADC measured value of 8.96 % (95% CI: 0.64 – 17.27 %).

Table 6.

Effect of phantom position in right coil side on ADC for scanner-coil and scanner-coil-sequence models

Material Estimated difference in mean ADC for phantom in right coil side (10−6 mm2/s) and 95% CI (low, high) p-value Est. difference in mean ADC as a percent of baseline mean ADC value
Water (PVP 0 %) 88.3 (−502.2, 678.7) 0.7519 6.03
10 % PVP w/w in water 1.0 (−35.1, 37.0) 0.9575 0.07
14 % PVP −107.5 (−217.3, 2.4) 0.0544 −10.77
18 % PVP * −97.0 (−254.2, 60.2) 0.2036 −9.09
25 % PVP 27.9 (−0.6, 56.3) 0.0549 3.72
40 % PVP 40.4 (2.9, 77.9) 0.0352 8.96
*

Estimate, confidence interval and p-value are based on a model with the 1.5 T – Coil B data removed to enable confidence interval estimation for the distance from the magnet isocenter.

SS-EPI distortion assessment

While the majority of the differences in estimate of the means for ADC measurement with the phantom positioned in the left or right side of the coil were not significant, we did find a difference in the image distortion between the Coil Left and Coil Right sides of the SS-EPI axial diffusion images across all three scanner-coil configurations, independent of phase-encoding gradient direction (Figure 3).

Figure 3.

Figure 3

Average right-left grid hole separation distance for the SS-EPI images for each configuration (1.5Ta; 1.5Tb; 3Tc) for each distortion plate row, and the overall average distance for a configuration. In all cases the left coil side distances were greater than the right coil side distances. The black line represents the actual grid hole separation distance of 20 mm from the CAD model. For the T1-w images, no measurable difference was found from the CAD model.

Distortion was assessed on both coil sides for T1-w and SS-EPI diffusion sequences, comparing grid hole separation measurements in the R/L and A/P directions on the geometric distortion plate to known measurements from the phantom design CAD model. No measureable distortion effects were evident in either direction in the T1-w images, nor in the A/P direction in the SS-EPI diffusion images. However, we did find a difference in geometric distortion in the R/L direction between the Coil Left and Coil Right sides in the SS-EPI axial diffusion images across all three scanner-coil configurations (Figure 3). The image distortion was clearly visible as seen in Figure 4, illustrating the difference in relative apparent size of the two phantom units, and in Figure 5 showing the geometric distortion plate imaged with different sequences. Measured grid hole spacing was consistently larger on the Coil Left than the Coil Right across configurations, with mean differences across the four rows of 1.8 mm (9 %), 1.5 mm (7.5 %), and 0.7 mm (3.5 %) for Configurations 1.5Ta, 1.5Tb, and 3Tc respectively. The direction of the phase-encoding gradient in the image acquisition did cause the expected SS-EPI image distortions, but those distortions did not change the R/L grid hole spacing distortion. Distortion measurements were the same for all b-value images (b=0, 100, 600, and 800 s/mm2), ruling out effects of diffusion gradients on the observed grid hole spacing distortion. To verify that this was not an effect of the phantom itself, we retrospectively examined data from human subjects, which agreed with the phantom findings (details are provided in the Supplementary Information).

Figure 4.

Figure 4

Breast phantom slices at the same depth locations, imaged in opposite coil positions from SS-EPI with b=100 s/mm2 diffusion weighting. The diffusion side is visibly thinner in the R/L direction (A), while the T1 side is visibly thinner in the R/L direction (B).

Figure 5.

Figure 5

Central slice of the: A) T1-w image, B) SS-EPI, b=600 s/mm2 Patient Right image, and C) SS=EPI, b=600 s/mm2 Patient Left image of the diffusion side of the phantom.

DISCUSSION

Using the breast phantom, we completed an initial assessment of ADC measurement bias and variability on clinical MRI scanners, breast coils and typical clinical imaging sequences. We found that the breast phantom can be used to identify sources of bias and variability to guide quantitative measurement methods in multisystem studies. Comparing left and right sides of the coil, we identified distortions in the SS-EPI diffusion images, likely attributable to B0 shim inhomogeneity, which means repeatability of the ADC measurements depends on shimming and position in the coil.

In general, ADC measurements were not biased across scanner-coil-sequence combinations. Temperature varied in the experiments conducted on the same configuration across time and in experiments conducted across different configurations, by as much as 5 °C. Temperature fluctuations and differences across configurations may have increased bias, especially for the ADC measurements.

ADC repeatability in this study was mixed. In agreement with these results, a study with an ice-water phantom using similar protocols across multiple scanner-coil combinations found ADC variability increased at locations farther away from magnet isocenter (7), a challenge in breast imaging. Giannotti et al. found high repeatability in ADC measurement of an ice-water phantom assessed weekly over twelve weeks (12). In that study, using one scanner-coil-sequence configuration, the largest error was inter-observer error in patient imaging. Recognizing that inter-observer errors can be difficult to mitigate, studies that include multiple scanner-coil-sequence combinations could use this breast phantom during study set-up to minimize errors due to protocol selection or shim technique and increase repeatability of the ADC measurement.

Our results demonstrate that on the clinical systems we used, ADC values are dependent on position within the magnet. Our generalized linear model indicates a statistically significant effect of ROI position within the magnet on ADC estimates for all of the breast tissue-mimicking materials. ADC measurement is dependent on the diffusion-sensitizing gradients, which are known to have a nonlinear variation from the magnet isocenter (7, 20). Users should be aware that this dependence exists, and methods to mitigate gradient nonlinearity effects should be applied (21). This breast phantom enables diffusion sequences to be tested for variation in artifacts with changes in sample position within the magnet, phase encoding direction, spectral bandwidth, or gradient non-linearity corrections.

Position of the phantom in the left or right coil side was not a statistically significant predictor in the gls model, but we did find differences in image distortion between the left and right sides of the coil. Specifically, it was found that SS-EPI diffusion weighted imaging method resulted in image distortion differences between the left and right sides of the coil. The distortion was measured across different vendors, breast coils, and field strengths; at the different imaging sessions; and stayed in the left/right orientation regardless of phase-encoding direction. There were no cases for which we did not find distortion in the SS-EPI diffusion images, indicating that the effect is likely caused by the SS-EPI imaging itself and not a factor related to specific equipment.

This distortion effect could be due to eddy currents and diffusion-weighted pulses or B0 shim. If the source of the distortion were from eddy currents and diffusion-weighted pulses, the effect would scale with diffusion-weighting gradient strength (b-value) and would be along the direction of the phase-encoding gradient (22), which was not the case. A second possible cause is from the B0 shim (23), which is known to be important in breast imaging (24, 25). In this study, we used the clinical breast imaging protocol shim procedure. In SS-EPI, background gradients can combine with the series of phase-encoded blipped gradient pulses. On one coil side, the B0 shim adds to the strength of the blipped positive phase-encoded pulses and subtracts from it on the other coil side. In our study, distance was increased on the left coil side and decreased on the right coil side across two MRI system vendors. In a previous phantom study using a breast coil, geometric distortions were reported in the diffusion weighted images, although no quantitative measurements were reported (26). In agreement with our findings, the geometric distortions did not change with diffusion weighting (different b-values). These findings indicate that this breast phantom can be used to evaluate distortion correction methods (23, 27) and test improved shimming approaches, such as dynamic multi-slice shimming (25).

One limitation of this phantom is the lack of temperature control, which increases the uncertainty in ADC measurements. NIST intends to characterize and publish the ADC values of the PVP solutions at a range of bore temperatures. Coupled with an accurate temperature measurement, we can reduce measurement uncertainty and increase reproducibility of ADC measurements. Our statistical methods include the median ADC and standard deviation from each 3D ROI, but the voxel size discrepancy between 1.5 T and 3.0 T is a limitation that potentially introduced bias in the data. From this initial study, we cannot make statements about manufacturer or product performance; a more substantial study is needed to make such statements.

In conclusion, this study demonstrates that this breast phantom is a tool to identify and correct scanner-coil-sequence bias and variability. This breast phantom may be used when establishing a multisystem study to create a protocol which minimizes differences (e.g. due to available sequences or shimming technique), to correct bias that cannot be minimized, and to weight results from each system depending on respective variability.

References

  • 1.Barker AD, Sigman CC, Kelloff GJ, Hylton NM, Berry DA, Esserman LJ. I-SPY 2: an adaptive breast cancer trial design in the setting of neoadjuvant chemotherapy. Clinical pharmacology and therapeutics. 2009;86(1):97–100. doi: 10.1038/clpt.2009.68. [DOI] [PubMed] [Google Scholar]
  • 2.Turnbull L, Brown S, Harvey I, Olivier C, Drew P, Napp V, Hanby A, Brown J. Comparative effectiveness of MRI in breast cancer (COMICE) trial: a randomised controlled trial. Lancet. 2010;375(9714):563–71. doi: 10.1016/S0140-6736(09)62070-5. [DOI] [PubMed] [Google Scholar]
  • 3.Partridge SC, Mullins CD, Kurland BF, Allain MD, DeMartini WB, Eby PR, Lehman CD. Apparent diffusion coefficient values for discriminating benign and malignant breast MRI lesions: effects of lesion type and size. AJR American journal of roentgenology. 2010;194(6):1664–73. doi: 10.2214/AJR.09.3534. [DOI] [PubMed] [Google Scholar]
  • 4.Galban CJ, Ma B, Malyarenko D, Pickles MD, Heist K, Henry NL, Schott AF, Neal CH, Hylton NM, Rehemtulla A, Johnson TD, Meyer CR, Chenevert TL, Turnbull LW, Ross BD. Multi-site clinical evaluation of DW-MRI as a treatment response metric for breast cancer patients undergoing neoadjuvant chemotherapy. PLoS One. 2015;10(3):e0122151. doi: 10.1371/journal.pone.0122151. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Partridge SC, Rahbar H, Murthy R, Chai X, Kurland BF, DeMartini WB, Lehman CD. Improved diagnostic accuracy of breast MRI through combined apparent diffusion coefficients and dynamic contrast-enhanced kinetics. Magnetic resonance in medicine: official journal of the Society of Magnetic Resonance in Medicine/Society of Magnetic Resonance in Medicine. 2011;65(6):1759–67. doi: 10.1002/mrm.22762. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Turnbull LW. Dynamic contrast-enhanced MRI in the diagnosis and management of breast cancer. NMR in biomedicine. 2009;22(1):28–39. doi: 10.1002/nbm.1273. [DOI] [PubMed] [Google Scholar]
  • 7.Malyarenko D, Galban CJ, Londy FJ, Meyer CR, Johnson TD, Rehemtulla A, Ross BD, Chenevert TL. Multi-system repeatability and reproducibility of apparent diffusion coefficient measurement using an ice-water phantom. Journal of magnetic resonance imaging: JMRI. 2013;37(5):1238–46. doi: 10.1002/jmri.23825. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Min Q, Shao K, Zhai L, Liu W, Zhu C, Yuan L, Yang J. Differential diagnosis of benign and malignant breast masses using diffusion-weighted magnetic resonance imaging. World J Surg Oncol. 2015;13:32. doi: 10.1186/s12957-014-0431-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Keenan KE, Wilmes LJ, Aliu SO, Newitt DC, Jones EF, Boss MA, Stupic KF, Russek SE, Hylton NM. Design of a Breast Phantom for Quantitative MRI. Journal of Magnetic Resonance Imaging. doi: 10.1002/jmri.25214. Accepted. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Tuong B, Gardiner I. Development of a novel breast MRI phantom for quality control. AJR American journal of roentgenology. 2013;201(3):W511–5. doi: 10.2214/AJR.12.9571. [DOI] [PubMed] [Google Scholar]
  • 11.Freed M, de Zwart JA, Loud JT, El Khouli RH, Myers KJ, Greene MH, Duyn JH, Badano A. An anthropomorphic phantom for quantitative evaluation of breast MRI. Medical physics. 2011;38(2):743–53. doi: 10.1118/1.3533899. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Giannotti E, Waugh S, Priba L, Davis Z, Crowe E, Vinnicombe S. Assessment and quantification of sources of variability in breast apparent diffusion coefficient (ADC) measurements at diffusion weighted imaging. European journal of radiology. 2015;84(9):1729–36. doi: 10.1016/j.ejrad.2015.05.032. [DOI] [PubMed] [Google Scholar]
  • 13.Horkay F, Pierpaoli C, Basser PJ. inventorsPhantom for diffusion MRI Imaging USA. 2012 [Google Scholar]
  • 14.Pierpaoli C, Sarlls J, Nevo U, Basser PJ, Horkay F. Proceedings of the 17th meeting of the International Society for Magnetic Resonance in Medicine. Honolulu, HI: 2009. Polyvinylpyrrolidone (PVP) Water Solutions as Isotropic Phantoms for Diffusion MRI Studies. [Google Scholar]
  • 15.Partridge SC, Demartini WB, Kurland BF, Eby PR, White SW, Lehman CD. Differential diagnosis of mammographically and clinically occult breast lesions on diffusion-weighted MRI. Journal of magnetic resonance imaging: JMRI. 2010;31(3):562–70. doi: 10.1002/jmri.22078. [DOI] [PubMed] [Google Scholar]
  • 16.Partridge SC, Heumann EJ, Hylton NM. Semi-automated analysis for MRI of breast tumors. Studies in health technology and informatics. 1999;62:259–60. [PubMed] [Google Scholar]
  • 17.Perneger TV. What’s wrong with Bonferroni adjustments. BMJ. 1998;316(7139):1236–8. doi: 10.1136/bmj.316.7139.1236. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Rothman KJ. No adjustments are needed for multiple comparisons. Epidemiology. 1990;1(1):43–6. [PubMed] [Google Scholar]
  • 19.Savitz DA, Olshan AF. Multiple comparisons and related issues in the interpretation of epidemiologic data. Am J Epidemiol. 1995;142(9):904–8. doi: 10.1093/oxfordjournals.aje.a117737. [DOI] [PubMed] [Google Scholar]
  • 20.Bammer R, Markl M, Barnett A, Acar B, Alley MT, Pelc NJ, Glover GH, Moseley ME. Analysis and generalized correction of the effect of spatial gradient field distortions in diffusion-weighted imaging. Magnetic resonance in medicine: official journal of the Society of Magnetic Resonance in Medicine/Society of Magnetic Resonance in Medicine. 2003;50(3):560–9. doi: 10.1002/mrm.10545. [DOI] [PubMed] [Google Scholar]
  • 21.Tan ET, Marinelli L, Slavens ZW, King KF, Hardy CJ. Improved correction for gradient nonlinearity effects in diffusion-weighted imaging. Journal of magnetic resonance imaging: JMRI. 2013;38(2):448–53. doi: 10.1002/jmri.23942. [DOI] [PubMed] [Google Scholar]
  • 22.Jones DK, Cercignani M. Twenty-five pitfalls in the analysis of diffusion MRI data. NMR in biomedicine. 2010;23(7):803–20. doi: 10.1002/nbm.1543. [DOI] [PubMed] [Google Scholar]
  • 23.Jezzard P, Balaban RS. Correction for geometric distortion in echo planar images from B0 field variations. Magnetic resonance in medicine: official journal of the Society of Magnetic Resonance in Medicine/Society of Magnetic Resonance in Medicine. 1995;34(1):65–73. doi: 10.1002/mrm.1910340111. [DOI] [PubMed] [Google Scholar]
  • 24.Maril N, Collins CM, Greenman RL, Lenkinski RE. Strategies for shimming the breast. Magnetic resonance in medicine: official journal of the Society of Magnetic Resonance in Medicine/Society of Magnetic Resonance in Medicine. 2005;54(5):1139–45. doi: 10.1002/mrm.20679. [DOI] [PubMed] [Google Scholar]
  • 25.Lee SK, Tan ET, Govenkar A, Hancu I. Dynamic slice-dependent shim and center frequency update in 3 T breast diffusion weighted imaging. Magnetic resonance in medicine: official journal of the Society of Magnetic Resonance in Medicine/Society of Magnetic Resonance in Medicine. 2014;71(5):1813–8. doi: 10.1002/mrm.24824. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Sinha S, Lucas-Quesada FA, Sinha U, DeBruhl N, Bassett LW. In vivo diffusion-weighted MRI of the breast: potential for lesion characterization. Journal of magnetic resonance imaging: JMRI. 2002;15(6):693–704. doi: 10.1002/jmri.10116. [DOI] [PubMed] [Google Scholar]
  • 27.Newitt DC, Tan ET, Wilmes LJ, Chenevert TL, Kornak J, Marinelli L, Hylton N. Gradient nonlinearity correction to improve apparent diffusion coefficient accuracy and standardization in the american college of radiology imaging network 6698 breast cancer trial. Journal of magnetic resonance imaging: JMRI. 2015;42(4):908–19. doi: 10.1002/jmri.24883. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES