Multisite concordance of apparent diffusion coefficient measurements across the NCI Quantitative Imaging Network

David C Newitt; Dariya Malyarenko; Thomas L Chenevert; C Chad Quarles; Laura Bell; Andriy Fedorov; Fiona Fennessy; Michael A Jacobs; Meiyappan Solaiyappan; Stefanie Hectors; Bachir Taouli; Mark Muzi; Paul E Kinahan; Kathleen M Schmainda; Melissa A Prah; Erin N Taber; Christopher Kroenke; Wei Huang; Lori R Arlinghaus; Thomas E Yankeelov; Yue Cao; Madhava Aryal; Yi-Fen Yen; Jayashree Kalpathy-Cramer; Amita Shukla-Dave; Maggie Fung; Jiachao Liang; Michael Boss; Nola Hylton

doi:10.1117/1.JMI.5.1.011003

. 2017 Oct 10;5(1):011003. doi: 10.1117/1.JMI.5.1.011003

Multisite concordance of apparent diffusion coefficient measurements across the NCI Quantitative Imaging Network

David C Newitt ^a,^*, Dariya Malyarenko ^b, Thomas L Chenevert ^b, C Chad Quarles ^c, Laura Bell ^c, Andriy Fedorov ^d, Fiona Fennessy ^d, Michael A Jacobs ^e, Meiyappan Solaiyappan ^e, Stefanie Hectors ^f, Bachir Taouli ^f, Mark Muzi ^g, Paul E Kinahan ^g, Kathleen M Schmainda ^h, Melissa A Prah ^h, Erin N Taber ⁱ, Christopher Kroenke ⁱ, Wei Huang ⁱ, Lori R Arlinghaus ^j, Thomas E Yankeelov ^k, Yue Cao ^l, Madhava Aryal ^l, Yi-Fen Yen ^m, Jayashree Kalpathy-Cramer ^m, Amita Shukla-Dave ⁿ, Maggie Fung ^o, Jiachao Liang ^p, Michael Boss ^q,^r, Nola Hylton ^a

^aUniversity of California San Francisco, Department of Radiology and Biomedical Imaging, San Francisco, California, United States

^bUniversity of Michigan, Department of Radiology, Ann Arbor, Michigan, United States

^cBarrow Neurological Institute, Division of Imaging Research, Phoenix, Arizona, United States

^dHarvard Medical School, Brigham and Women’s Hospital, Department of Radiology, Boston, Massachusetts, United States

^eThe Johns Hopkins School of Medicine, Russell H. Morgan Department of Radiology and Radiological Science and Sidney Kimmel Comprehensive Cancer Center, Baltimore, Maryland, United States

^fTranslational and Molecular Imaging Institute, Icahn School of Medicine at Mount Sinai, New York, United States

^gUniversity of Washington, Department of Radiology, Neurology, and Radiation Oncology, Seattle, Washington, United States

^hMedical College of Wisconsin, Department of Radiology, Milwaukee, Wisconsin, United States

ⁱOregon Health and Science University, Advanced Imaging Research Center, Portland, Oregon, United States

^jVanderbilt University Medical Center, Vanderbilt University Institute of Imaging Science, Nashville, Tennessee, United States

^kThe University of Texas at Austin, Institute for Computational and Engineering Sciences, Department of Biomedical Engineering and Diagnostic Medicine, Austin, Texas, United States

^lUniversity of Michigan, Radiation Oncology, Radiology, and Biomedical Engineering, Ann Arbor, Michigan, United States

^mHarvard Medical School, Massachusetts General Hospital, Athinoula A. Martinos Center for Biomedical Imaging, Department of Radiology, Charlestown, Massachusetts, United States

ⁿMemorial Sloan-Kettering Cancer Center, Department of Medical Physics and Radiology, New York, New York, United States

^oMemorial Sloan-Kettering Cancer Center, GE Healthcare, New York, New York, United States

^pHologic Inc., Sunnyvale, California, United States

^qNational Institute of Standards and Technology, Applied Physics Division, Boulder, Colorado, United States

^rUniversity of Colorado Boulder, Department of Physics, Boulder, Colorado, United States

Address all correspondence to: David C. Newitt, E-mail: david.newitt@ucsf.edu

PMCID: PMC5633866 PMID: 29021993

Abstract.

Diffusion weighted MRI has become ubiquitous in many areas of medicine, including cancer diagnosis and treatment response monitoring. Reproducibility of diffusion metrics is essential for their acceptance as quantitative biomarkers in these areas. We examined the variability in the apparent diffusion coefficient (ADC) obtained from both postprocessing software implementations utilized by the NCI Quantitative Imaging Network and online scan time-generated ADC maps. Phantom and in vivo breast studies were evaluated for two ( ${ADC}_{2}$ ) and four ( ${ADC}_{4}$ ) $b$ -value diffusion metrics. Concordance of the majority of implementations was excellent for both phantom ADC measures and in vivo ${ADC}_{2}$ , with relative biases $< 0.1 %$ ( ${ADC}_{2}$ ) and $< 0.5 %$ (phantom ${ADC}_{4}$ ) but with higher deviations in ADC at the lowest phantom ADC values. In vivo ${ADC}_{4}$ concordance was good, with typical biases of $\pm 2 %$ to 3% but higher for online maps. Multiple b-value ADC implementations were separated into two groups determined by the fitting algorithm. Intergroup mean ADC differences ranged from negligible for phantom data to 2.8% for ${ADC}_{4}$ in vivo data. Some higher deviations were found for individual implementations and online parametric maps. Despite generally good concordance, implementation biases in ADC measures are sometimes significant and may be large enough to be of concern in multisite studies.

Keywords: apparent diffusion coefficient, reproducibility, breast MRI

1. Introduction

The controlled sensitivity of nuclear magnetic resonance, and thus of MRI, to water diffusion provides medical researchers and clinicians a unique tool for measuring microscopic properties of tissue. In the realm of cancer in particular, quantitative diffusion-weighted MRI (DWI) is playing an ever-increasing role in both diagnosis and treatment response monitoring. In addition to providing information about tissue cellularity and microstructure, DWI has the advantages of not requiring the administration of an exogenous contrast agent and of requiring reasonably short acquisition times using standard echo-planar imaging techniques.

The simplest and most commonly used model for describing the MRI sensitive diffusion process is a monoexponential MRI signal decay as a function of the diffusion weighting (“ $b$ -value”) typically achieved with a pair of field gradient pulses as described by Stejskal and Tanner¹ in 1965. This model assumes Gaussian diffusion behavior in isotropic tissue regions, characterized by an apparent diffusion coefficient (ADC) exponential decay constant. Despite the simplicity of this physical model, its practical implementation requires several choices that could affect the ADC measurements. These include masking of voxels for low signal-to-noise ratio (SNR) or poorness of fit; correction for nonideal imaging factors, such as low SNR effects, scanner nonlinearities, or diffusion weighting inaccuracies; and, for multi- $b$ -value analysis, the choice of fitting algorithm may also be a source of variability.

For validation, reproduction of results, meta-analyses in multicenter studies, and consistency across multiple exams in longitudinal studies, it is essential that different analysis implementations (AIs) produce concordant results. Numerous studies have been published addressing repeatability and reproducibility of ADC measurements, mostly addressing the important aspects of acquisition repeatability²^,³ and intra- and interreader reproducibility.⁴^,⁵ For this work, the Image Analysis and Performance Metrics Working Group of the NCI Quantitative Imaging Network (QIN)⁶ undertook the ADC Mapping Collaborative Project (ADC-CP) to determine the effects of software platform and algorithm choices on ADC measurement through the analysis of common datasets by multiple institutions. The overall goal of the project is to quantify the cross-platform concordance of DWI parametric mapping software implementations. In this study, we present the results for ADC analyses performed on phantom and in vivo breast DWI, along with evaluation of the feasibility of centralized analysis of multicenter generated DWI parametric maps.

2. Materials and Methods

Overview: The ADC-CP was initiated and coordinated by the Breast Imaging Research Program (BIRP) at the University of California San Francisco (UCSF). Participants performed a prescribed set of DWI analyses on a common set of in vivo and phantom MRI datasets, generating derived parametric maps. These were submitted to the BIRP for centralized region-of-interest (ROI) and statistical analysis. Where available, parametric maps generated at scan time by on-scanner, manufacturer-provided software (“online” maps) were included in the central analysis.

2.1. Common DWI Datasets

Three groups of DWI datasets were analyzed in the ADC-CP: two $b$ -value in vivo breast scans (Br2b), four $b$ -value in vivo breast scans (Br4b), and four $b$ -value phantom scans (Ph4b). Analysis metrics and MRI diffusion protocol details for all data are summarized in Table 1. All in vivo datasets were from the IRB approved American College of Radiology Imaging Network (ACRIN) 6698 trial⁷ and were used with the permission of ACRIN. In vivo image files were deidentified as per the requirements of the Health Insurance Portability and Accountability Act [Digital Imaging and Communication in Medicine (DICOM) standard, supplement 142], while preserving private metadata attributes necessary for DWI processing. DICOM images were curated and shared via the Cancer Imaging Archive.⁸ Each protocol group included scans from three MRI scanner manufacturers: Siemens Medical (SM), Philips Medical (PM), and General Electric Healthcare (GEHC). In vivo scans were multislice axial acquisitions with full biaxial breast coverage using standard two-dimensional (2-D) single-shot echo-planar imaging sequences. Group Br2b consisted of three studies: ID101 (GEHC, Signa HDxt, 3.0 T), ID102 (PM, Intera, 3.0 T), and ID103 (SM, Avanto, 1.5 T). Group Br4b consisted of four studies: ID201 (GEHC, Signa HDxt, 3.0 T), ID203 (GEHC, Signa HDxt, 1.5 T), ID205 (PM, Achieva, 1.5 T), and ID207 (SM, Avanto, 1.5 T). For all in vivo scans, a single $b = 0$ image was acquired and non-0 $b$ -value images were acquired with three orthogonal diffusion encoding directions. For all cases except ID203, standard on-scanner processing was used, resulting in trace images for each non-0 $b$ -value and online generated ADC maps, and only the trace images were available for analysis. For ID203, the full set of directional DWI images was preserved, and no trace images or online ADC map were calculated.

Table 1.

Dataset information for the ADC Mapping CP.

Group label	Description	$N$ studies	$b$ -values ( $s / {mm}^{2}$ )	Scanner manufacturers^a	Output parameters^b	Analysis ROIs
Br2b	Two $b$ -value, three direction bilateral axial breast	3	0, 800	GEHC, SM, PM	${ADC}_{2}$	Multislice tumor
Br4b	Four $b$ -value, three direction bilateral axial breast	4^c	0, 100, 600, 800	GEHC, SM, PM	${ADC}_{4}$	Multislice tumor
					${ADC}_{slow}$
					PerfFrac
Ph4b	Four $b$ -value, three direction diffusion phantom	3	0, 500, 900, 2000	GEHC, SM, PM	${ADC}_{4}$	1-cm-diameter circles, 13 vials
Ph4b	Four $b$ -value, three direction diffusion phantom	3	0, 500, 900, 2000	GEHC, SM, PM	${ADC}_{hi-low}$	1-cm-diameter circles, 13 vials

Open in a new tab

Manufacturers: General Electric Healthcare (GEHC), Siemens Medical (SM), Philips Medical (PM).

Output parameters: ${ADC}_{⟨ n ⟩}$ : monoexponential ADC using all $⟨ n ⟩$ $b$ -values; ${ADC}_{hi-low}$ : monoexponential ADC using only highest and lowest $b$ -values; ${ADC}_{slow}$ : monoexponential ADC using three highest $b$ -values; and PerfFrac: fraction of $b = 0$ signal attributed to fast-decaying perfusion component.

An additional GEHC study with all directional DWI images but no trace images was included in Br4b.

The Ph4b datasets were of a diffusion phantom designed and constructed by the National Institute of Standards and Technology (NIST) and High Precision Devices (HPD Inc., Boulder, Colorado).⁹^,¹⁰ This phantom consisted of an array of 13 20-mL vials in a spherical vessel filled with an ice–water mixture to maintain a controlled temperature of 0°C. Three vials were filled with water and ten vials were filled with solutions of the polymer polyvinylpyrrolidone (PVP) in deionized water,¹¹ with two vials each at PVP mass fractions of 10%, 20%, 30%, 40%, and 50%. ADC values ranged from $\sim 1.1$ to $0.12 \times 10^{- 3} {mm}^{2} / s$ . Scans were multislice coronal acquisitions at 3.0 T, using standard 2-D single-shot echo-planar imaging sequences. Diffusion encoding was applied on three orthogonal axes, with reconstruction of standard trace images at each $b$ -value. Only the trace images were provided for analysis. Three datasets were provided: ID401 (GEHC, Discovery MR750, Memorial Sloan-Kettering Cancer Center, New York, New York), ID402 (SM, Trio, University of Colorado, Boulder, Colorado), and ID403 (PM, Ingenia, University of Michigan, Ann Arbor, Michigan). All phantom images used in this study were obtained by the DWI task force of the Quantitative Imagining Biomarker Alliance (QIBA) of the Radiological Society of North America (RSNA).

2.2. ADC-CP Participants

Participants in the ADC Mapping CP included 11 QIN sites and one non-QIN commercial group. A total of 15 DWI AIs were used (Table 2). Eight platforms were on-site developed private analysis packages programmed in MATLAB^® (The MathWorks Inc., Natick, Massachusetts; six implementations “AI-MAT1” to “AI-MAT6”), IDL^® (Exelis Visual Information Solutions, Boulder, Colorado; “AI-IDL”), or C++ (“AI-C++”). Five implementations utilized free, publicly available analysis packages: 3D Slicer DWModeling module of the SlicerProstate extension¹⁹ (Brigham and Womens Hospital; two implementations “AI-3DSl1” and “AI-3DSl2”), AFNI (University of California, San Diego, Analysis of Functional Neuro Images; “AI-AFNI”), OsiriX ADCMap plugin (Stanford; “AI-OsX1”), and QIBAPhan (RSNA/University of Michigan; “AI-QIBA”). Two implementations were commercially available analysis packages: Aegis™ (Hologic Inc., Sunnyvale, California; “AI-Aegis”) and OsiriX plugin IB Diffusion™ (Imaging Biometrics, Elm Grove, Wisconsin; “AI-OsX2”). Source websites for the publicly available software packages are included in the references in Table 2.

Table 2.

DWI quantitative AIs included in the ADC Mapping CP.

AI ID	Data groups	Base language or platform	AI publicly available	Parametric map format	Multi- $b$ fit (function)^b
AI-IDL	All	IDL^®	No	DICOM	NLS-GX (curvefit)
AI-MAT1	Br2b, Br4b	MATLAB^®	No	MATLAB^®	NLS-TRF (lsqcurvefit)
AI-MAT2	All	MATLAB^®	No	MATLAB^®	Log-linear
AI-3DSl1	All^a	3D Slicer DWI Module¹²^,¹³	Yes	DICOM	NLS-LM
AI-MAT3	Br2b, Br4b	MATLAB^®	No	MATLAB^®	Log-linear (lscov)
AI-QIBA	Ph4b	QibaPhan1.3¹⁴	Yes	DICOM	Log-linear (lscov)
AI-OsX1	All^a	OsiriX-ADCMap¹⁵	Yes	DICOM	Log-linear
AI-MAT4	Br2b, Br4b	MATLAB^®	No	ANALYZE	Log-linear
AI-CPP	All	C++	No	DICOM	Log-linear
AI-AFNI	Br2b, Ph4b	AFNI¹⁶	Yes	NIfTI	NA
AI-MAT5	Br4b, Ph4b	MATLAB^®	No	NIfTI	Log-linear
AI-OsX2	All	OsiriX-IB Diffusion¹⁷	Yes	DICOM	Log-linear
AI-MAT6	All	MATLAB^®	No	MATLAB^®	Log-linear (polyfit)
AI-3DSl2	All^a	3D Slicer DWI Module¹²^,¹³	Yes	NRRD	NLS-LM
AI-Aegis	All^a	Aegis (C++)¹⁸	Yes	DICOM	Log-linear

Open in a new tab

No perfusion-fraction ( $P_{f}$ ) analysis performed on Br4b.

Multi- $b$ fitting methods: $NLS - GX = nonlinear least squares$ using gradient expansion, $NLS - TRF = NLS$ using trust-region-reflective, $NLS - LM = NLS$ using Levenberg–Marquardt, and $log-linear = linear fit$ or regression of $\log (S)$ . Base software package function name is given where known.

2.3. ADC-CP Parametric Maps

For the purpose of the ADC-CP, the basic monoexponential decay model for the MRI signal intensity from an isotropic tissue region was assumed

S (b) = S_{0} \times e^{- b \times ADC},

(1)

where $S (b)$ is the signal intensity at a diffusion weighting $b$ , $S_{0}$ is the true signal for no diffusion weighting, and ADC is the apparent diffusion coefficient. For practical considerations, methods for the derivation of the estimated ADC from a DWI acquisition can be separated into two cases: two $b$ -value analyses wherein the ADC is solved explicitly via the following equation:

ADC = {\log [S (b 1)] - \log [S (b 2)]} / (b 2 - b 1),

(2)

and multi- $b$ -value analyses where fitting of the data to Eq. (1) must be done to determine the ADC. The choice of algorithm for fitting multi- $b$ -value data, as well as the choice of any masking parameters, was left to the participating sites.

Site analysis consisted of generating a set of parametric maps from pixel-by-pixel analysis of each DWI dataset. Analyses performed for each data group are listed in Table 1. For all cases, a monoexponential ADC map utilizing all images was computed: ${ADC}_{2}$ for Br2b, and ${ADC}_{4}$ for Br4b and Ph4b groups. In addition, for the Br4b group, a perfusion minimized analysis was performed.²⁰ For this analysis, the three nonzero $b$ -values were used to estimate the “slow” or tissue diffusion signal using Eq. (1) for $b \geq 100 s / {mm}^{2}$ , giving $S_{0 slow}$ and ${ADC}_{slow}$ as the fitted parameters characterizing the slow signal decay. The fraction of the signal attributable to a fast-decaying perfusion component was then calculated as

P_{f} = [S (0) - S_{0 slow}] / S (0),

(3)

and parametric maps were generated for ${ADC}_{slow}$ and $P_{f}$ . For the Ph4b group, a two $b$ -value decay coefficient, ${ADC}_{hi-low}$ , was also calculated using only the $b = 0$ and $2000 s / {mm}^{2}$ images.

In addition to the parametric maps provided by the analysis sites, scanner manufacturers’ software (“online”) ADC maps were evaluated when they were provided with the original DWI data. This included the ${ADC}_{2}$ for the Br2b group, ${ADC}_{4}$ for the Br4b datasets with trace images (three of four studies), and ${ADC}_{4}$ for the Ph4b group.

2.4. Centralized ROI Analysis

All parametric maps were submitted to UCSF through a secure box system. No restrictions were placed on the choice of file format, and formats included DICOM ( $N = 7$ ), Neuroimaging Informatics Technology Initiative (NIfTI; $N = 2$ ),²¹ Nearly Raw Raster Data (NRRD; $N = 1$ ),²² Analyze (Mayo Clinic; $N = 1$ ),²³ and MATLAB^® ( $N = 4$ ). Prior to concordance analysis, all maps were converted to a UCSF in-house modified multiframe DICOM format allowing integer or floating point data, along with storage of an analysis mask. Slice order was detected automatically for file formats that do not include orientation information and was reversed if necessary to match the slice order of the source images. ADC scaling was detected automatically by comparison with a reference UCSF ADC map, and scaling factors were set in the metadata (DICOM rescale slope attribute) to produce ADC maps in common units of $10^{- 6} {mm}^{2} / s$ . No manipulation of the actual map pixel data was done except for floating point formats (MATLAB^® implementations) in which pixels with a “not-a-number” value were reassigned to 0.0 and masked out for analysis.

ROI analysis was performed using standardized ROIs across all parametric maps (Fig. 1). For the in vivo breast cancer scans, a multislice, whole-tumor region defined for use in the primary study was used. For the phantom scans, ROIs were defined on the middle slice of each scan using 1-cm-diameter circular regions on each of the 13 sample vials. ROIs were applied to the parametric maps yielding mean values of the diffusion metrics for each analysis platform. All centralized analysis was done using software developed by the UCSF lab in IDL^®.

2.5. Statistical Analysis

For each metric, pairwise within-subject coefficient of variation (wCV) was calculated between all implementation pairs to establish groups of implementations with similar results (intragroup $wCV < 0.1 %$ between all AI pairs). As no ground truth values could be established for the in vivo assessed DWI metrics, individual implementation concordance could only be evaluated from the percent difference of each ROI measurement from a consensus reference value for that measurement. This method was also used for the phantom scans even though reference ADC values were available, both for consistency of presentation and to avoid complications from scanner- and position-dependent ADC effects. A full analysis of the phantom ADC data relative to the ground truth reference values is presented by Malyarenko et al.²⁴ Reference value calculation for each of the metrics is described in Sec. 3. The two-tailed student’s $T$ -test was used to test for significant differences among different implementations.

3. Results

3.1. Practicalities

From the 12 participating institutions, monoexponential ADC maps for the Br2b and Br4b groups and perfusion minimized ${ADC}_{slow}$ values for Br4b were provided for 13 analysis platforms. Nine platforms from eight institutions also provided perfusion-fraction maps for the Br4b group. The Ph4b data group was analyzed on 11 platforms, 10 generating both ${ADC}_{4}$ and ${ADC}_{hi-low}$ parametric maps while one provided only ${ADC}_{4}$ . All sites were able to process DICOM image sets from all three vendors, but interpretation of the no trace, full directional data (Br4b, ID203) was challenging for several sites due to unfamiliarity with this format. After specification of the image storage order for this case, all sites were able to program their implementations to process this data, though in some cases we noted discrepancies in the results as shown in Sec. 3.2.

3.2. Breast Scans

For the Br2b ${ADC}_{2}$ metric, a majority of the AI (11 of 13) gave essentially identical results (maximum $wCV < 0.003 %$ ). For each dataset, the median ADC value from all offline results was used for the reference value for concordance. Figure 2 shows the percent difference from these reference values for each AI’s mean ROI ${ADC}_{2}$ measure for each of the three Br2b scans. AI-MAT3 had a consistent 0.12% positive bias relative to the median, while AI-Aegis varied from $- 0.04 %$ to $- 0.06 %$ . The GEHC and PM online maps were within 0.05% of the respective median values, but the SM map had a $- 1.4 %$ bias.

Fig. 2 — Concordance of two $b$ -value *in vivo* ADC measurements across 13 offline AIs and online scanner-generated maps. Plotted is the percent difference for each ROI mean value from the median value for that measurement for all offline AI. Eleven offline AI had essentially identical results ( $wCV < 0.003 %$ ) and thus show no offsets on the plot. The SM online ADC had a $- 1.4 %$ bias relative to the consensus median value.

More variations were observed among platforms in the Br4b analyses. Figure 3(a) shows graphically the pattern of agreement among platforms given by the pairwise wCV measures. A majority of implementations (9 of 13) fell into two groups when using a threshold of $wCV < 0.1 %$ among all group members. Group A consisted of three AI (AI-IDL, AI-3DSl1, and AI-3DSl2) with $wCV < 0.01 %$ , while group B consisted of six AI (AI-MAT2, AI-MAT3, AI-MAT5, AI-OsX1, AI-C++, and AI-Aegis) with $wCV < 0.1 %$ . For each dataset, a reference value was calculated as the average of the mean value for group A and the mean value for group B. Figure 3(b) shows the percent difference from these reference values for ${ADC}_{4}$ from each implementation for the Br4b datasets. ${ADC}_{4}$ values differed significantly between groups A and B [ $2.8 % \pm 0.2 %$ ( $mean \pm SD$ ), $p < 0.003$ ], and up to 5% between nongrouped sites. Two of the four nongrouped implementations, AI-MAT4 and AI-MAT6, had only small variations ( $wCV < 0.13 %$ ) from the group B values, while AI-MAT1 and AI-OsX2 showed more variability both between scans from different vendors and from the reference values. Two implementations, AI-MAT1 and AI-MAT4, had slightly anomalous results for ID203 (GEHC), believed to be due to different handling of the full directional diffusion data. Scanner-generated ${ADC}_{4}$ maps were available for the three datasets with trace images. GEHC and SM maps gave mean ROI ADC values of $+ 3.6 %$ and $- 3.3 %$ , respectively, from the reference values, while the PM online map had a 28% offset. Further investigation revealed that this large deviation was due to loss of the DICOM rescale slope data employed by PM for parametric map intensity scaling. This loss appeared to have occurred during data transfer between the scanner and the imaging site’s PACS system.

Fig. 3 — *In vivo* four $b$ -value ${ADC}_{4}$ ROI analysis results. (a) Pairwise wCV matrix for all implementations, shown graphically from $wCV < 0.1 %$ (white circles) to $wCV > 2 %$ (fully black circles), with groups A and B indicated. (b) Percent difference from the consensus ${ADC}_{4}$ values for each of four datasets, for each implementation and online map. Mean difference in ADC between groups A and B was 2.8%. The 28% deviation on the PM online ADC was due to a DICOM header corruption problem.

Results for the perfusion minimized analysis tissue ADC ( ${ADC}_{slow}$ ) were similar to the ${ADC}_{4}$ results [Figs. 4(a) and 4(b)]. For $wCV < 0.1 %$ grouping, AI-IDL switched from group A to B, and AI-MAT6 was also now included in group B. Overall differences were generally smaller than for ${ADC}_{4}$ but still statistically significant: $1.2 % \pm 0.2 %$ ( $mean \pm SD$ , $p < 0.003$ ) difference between groups A and B and maximum individual differences of any implementation $< \pm 1.3 %$ relative to the reference value. Perfusion fraction ( $P_{f}$ ) was a nonstandard metric and was implemented on nine platforms. Two groups were again evident, though with different membership [Fig. 4(c)]: group A ( $wCV = 0.04 %$ ) composed of AI-IDL and AI-MAT2 and group B ( $wCV < 0.01 %$ ) with MATLAB^® implementations AI-MAT3, AI-MAT5, and AI-MAT6, with a small difference among the groups [ $0.29 % \pm 0.10 %$ ( $mean \pm SD$ ), $p < 0.03$ ]. Figure 4(d) shows the concordance for the $P_{f}$ metric results. $P_{f}$ results from AI-MAT1 showed large deviations ( $- 16 %$ to $- 23 %$ ) from the consensus reference, indicating possible errors in the software implementation that was developed on-site for this CP. AI-C++ had a positive bias of 1.5% to 2.5%, which was found to be due to implementation of a biexponential decay model for this calculation. All other measures fell within $\pm 0.25 %$ of the reference values, except for the AI-MAT4 result for the GEHC directional diffusion dataset with a $- 0.9 %$ deviation. No online parametric maps were available for the perfusion minimized analysis.

Fig. 4 — wCV and ROI mean concordance results for the Br4b data group perfusion minimized analysis. For ${ADC}_{slow}$ , (a) shows the pairwise wCV matrix with groups with $wCV < 0.1 %$ indicated and (b) the corresponding data for differences in mean ROI ${ADC}_{slow}$ . Group results showed smaller variations than for ${ADC}_{4}$ . For $P_{f}$ , (c, d) groups were less well defined, except for the three MATLAB^® AI indicated, which were nearly identical ( $wCV < 0.01 %$ ). The small positive biases for AI-C++ were identified as due to use of a biexponential model.

3.3. Phantom Scans

Analyses of the three Ph4b phantom datasets, ID401 (GEHC), ID402 (SM), and ID403 (PM), were submitted from 11 AI for the four $b$ -value ${ADC}_{4}$ metric and 10 AI for the two $b$ -value ${ADC}_{hi-low}$ . For AI-C++, only the ID402 results were included, as a problem in the DICOM encoded ADC maps for ID401 and ID403 resulted in incorrect ROI ADC values in the centralized analysis. In a separate analysis completed after the encoding bug was fixed, these results were in concordance with the other implementations.²⁴ Online maps for ${ADC}_{4}$ were available for all three phantom datasets, but only ID403 (PM) included an online map for ${ADC}_{hi-low}$ . Results for the two $b$ -value ${ADC}_{hi-low}$ were practically identical across all implementations. The maximum pairwise wCV among postprocessing implementations using all 39 ROI measurements from the three datasets was 0.04%. Looking at the percent difference of each ROI measure from the nine site median values, AI-QIBA showed a similar clinically insignificant bias (0.05%) to that seen in the Br2b datasets for AI-MAT2. The results from the online PM ${ADC}_{hi-low}$ map were very close to the offline reference median values, yet showed a consistent but clinically insignificant bias of $0.31 \pm {0.02 \times 10}^{- 6} {mm}^{2} / s$ across all ADC values. This gave a maximum percent difference of $- 0.25 %$ for the lowest ADC sample.

For the ${ADC}_{4}$ measures, paired wCV measurements over all phantom measurements gave similar groups to the Br4b results. Differences within and between the two postprocessing implementation groups were smaller than for the breast scans. The maximum wCV was 0.04% for group A (AI-IDL, AI-Sl1, AI-Sl2, and AI-MAT6) and 0.01% for group B (AI-MAT2, Al-QIBA, Al-OsX1, and Al-MAT5), and the between-group root mean square percent difference in ADC values for all 13 ROIs was 0.29%, 0.30%, and 0.62% for GEHC, SM, and PM scans, respectively. There was no significant bias among the ROI mean ADC values from the two groups ( $p = 0.15$ , 0.07, and 0.19 for GEHC, SM, and PM scans, respectively). Figure 5 shows the differences from reference ${ADC}_{4}$ (average of the mean group A and mean group B results) for the three Ph4b datasets. While differences are in general very small ( $< 0.5 %$ ), individual excursions were as high as 5.5%, with the highest differences on the lowest 2 ADC values ( $ADC < 0.25 \times 10^{- 3} {mm}^{2} / s$ ). Only the SM online map showed a statistically significant deviation from the reference values, with a small negative bias of $- 0.31 % \pm 0.25$ ( $mean \pm SD$ , $p = 0.001$ ). Only the PM dataset analysis showed a trend with ADC value in the difference among the analysis groups, with group A tending to underestimate ADC relative to group B for higher ADC values and over estimate at lower. A linear regression of the percent difference between the groups versus the mean ROI ADC gave a slope of 1% per $1.0 \times 10^{- 3} {mm}^{2} / s$ with $R^{2} = 0.35$ .

Fig. 5 — Percent difference from reference ${ADC}_{4}$ for Ph4b measurements for (a) GEHC, (b) SM, and (c) PM scans. The reference value for ${ADC}_{4}$ for each individual ROI is the average of the groups A and B mean ${ADC}_{4}$ values for that ROI. ROIs are ordered from highest ADC (0% PVP) to lowest ADC (50% PVP), left to right, for each AI or group. Concordance is excellent, except for a few measurements on the lowest ADC vials ( $ADC < 0.25 \times 10^{- 3} {mm}^{2} / s$ ).

4. Summary and Discussion

Overall, the QIN ADC Mapping Collaborative Project demonstrated good agreement between the majority of postprocessed (“offline”) and scanner-generated (“online”) ADC implementations, while revealing several sources of discrepancies among different platforms. With the exception of isolated outliers, mostly attributable to metadata errors rather than algorithmic differences, the largest discrepancies observed were between online and offline parametric maps. The most consistent bias was for Siemens scanner acquisitions, where the online maps gave ADC values lower than consensus reference values derived from the offline maps. These ranged from $- 0.3 %$ (phantom 4b) to $- 1.4 %$ (in vivo 2b) to $- 3.5 %$ (in vivo 4b). Based on communication with Siemens, the most likely explanation is the use by the online ADC algorithm of detailed image sequence information to calculate a more accurate $b$ -value than the nominal value stored in the DICOM metadata, which is used for all offline calculations. A higher true $b$ -value, obtained by accounting for diffusion and imaging gradient cross terms, will result in a lower calculated ADC value, as we observed. The General Electric online maps for the in vivo four $b$ -value ADC also showed a marked discrepancy from the consensus reference ( $+ 3.5 %$ ), though it agreed identically with one of the offline implementations (AI-OsX2, OsiriX IB Diffusion plugin).

The biases we report for the in vivo breast scans are of comparable magnitudes to measures of repeatability and reproducibility reported in breast ADC studies. Aliu et al.² reported a wCV of 11% in a repeatability study on normal volunteers, while Spick et al.⁵ and Clauser et al.²⁵ found wCV values between 5.0% and 8.5% for breast tumor ADC measurements. In the ACRIN 6698 trial, whole-tumor ADC test–retest repeatability was 4.8%.³ Our results indicate that choices in ADC analysis algorithm or between online and offline analysis platforms will have nonnegligible effects on breast ADC measures and should be considered in addition to biases arising from image acquisition when interpreting findings in breast DWI studies.

A consistent finding was a grouping of a majority of the implementations for multi- $b$ ADC estimation into two groups with very similar results within-group but significant differences between the two groups. Based on the descriptions of the methods provided by each site, this appeared to be primarily driven by the choice between “log-linear” fitting, wherein a linear least-squares fit is done on the log of the image intensities, and a nonlinear least-squares fit of the untransformed data to the exponential diffusion equation. For the in vivo scans, the difference in implementations resulted in significant differences ( $p < 0.003$ ) of 2.8% for ${ADC}_{4}$ and 1.2% for the ${ADC}_{slow}$ in the perfusion minimized analysis. Our results are comparable to those reported by Zeilinger et al.²⁶ using different methods. While the grouping based on pairwise wCV was also apparent in the four $b$ -value phantom ADC₄, no significant difference was found for the resulting ADC measures ( $p = 0.22$ ). We speculate that this may be due to the higher noise level and heterogeneity within each ROI in the in vivo scans giving a greater sensitivity to the fitting algorithm selection, but further work is needed to identify the cause. Finally, given the lack of ground truth values for the in vivo scans, it is important not to equate discrepancies with errors in the presented work, except in those cases where specific error sources could be identified. In particular, while the choice of reference values for most of our ROI result plots as the average of the two prominent AI groups allows easy visualization of the differences between the AIs, it also can lend an appearance of preference to those AIs over the “nongrouped” results.

The QIN ADC Mapping CP also highlighted some practical challenges of multicenter ADC analyses and centralized analysis of postprocessed parametric maps. For example, several sites had to implement code for the ADC-CP to analyze the less common full directional dataset, which may have resulted in somewhat higher variability in the results for those scans. While saving of directional data for DWI is not currently a common practice in clinical trials, it may become more so in the future given ongoing work on improving reproducibility of multiplatform DWI by gradient nonlinearity correction²⁷^–²⁹ and distortion correction.³⁰ Another lesson learned was the criticality of preservation of DICOM metadata for quantitative DWI. In particular, the case of lost scaling information in a Philips scanner-generated ADC map illustrates that significant errors can result from metadata corruption. While the nature of this project resulted in easy recognition of this problem, in a clinical trial setting, it might have gone unnoticed. Finally, the centralized analysis of parametric maps for this CP was greatly complicated by the multitude of file formats currently employed for the storage of these objects. Adoption of a common format, such as the parametric map DICOM object,³¹ would aid meta-analysis of ADC data obtained from multicenter studies. Use of DICOM, specifically for ADC map storage, was addressed in a companion cooperative project.²⁴

A limitation of this study was the restriction to the monoexponential decay model, with the simple extension to a perfusion minimized ${ADC}_{slow} / P_{f}$ calculation. For in vivo situations where the simple Gaussian diffusion model breaks down, several more complex models are currently employed such as biexponential models,³² including intravoxel incoherent motion,³³^,³⁴ stretched exponentials,³⁵ and kurtosis.³⁶^,³⁷ As model complexity increases, dependency on AI choices will also increase. An additional limitation of this study stems from the choice of a single organ, the breast, for the in vivo datasets. As breast DWI is challenging, due largely to limitations in SNR, fat suppression quality, motion, and other artifacts, we consider these datasets a challenging test of the fitting algorithms’ robustness. However, the results presented are only indirectly relevant to other applications, such as neural and abdominal imaging.

In conclusion, we found that while agreement among the majority of ADC mapping implementations was good, the biases in in vivo ADC measures both between different offline implementations and between vendor-generated and offline maps are significant. Furthermore, these differences may, in some cases, be large enough to adversely affect the analysis of multisite diffusion data. For any given longitudinal (e.g., treatment response) or cross-sectional study, we would recommend that all analyses be performed on a common platform and that the output parametric map metadata reflect both the DWI data origin and the details of the applied calculation algorithm.

Acknowledgments

This research was supported by the National Institutes of Health under Grants Nos. U01CA151235, R01CA190299, U01CA166104, R01CA158079, 5P30CA006973 (IRAT), U01CA151261, U24CA180918, U01CA140204, U01CA172320, U01CA148131, U01CA176110, U01CA154602, U01CA142565, U01CA183848, U01CA154601, U01CA211205 and R50CA211270. ACRIN received funding from the National Cancer Institute under Grant Nos. U01CA079778 and U01CA080098.

Biographies

David C. Newitt is a research specialist and an assistant director of the Breast Imaging Research Program at the University of California (UC), San Francisco. He received his PhD in solid state physics under Dr. Erwin Hahn at UC Berkeley in 1993, and has worked on MRI at UCSF since then. His current focus is on use of MR-DWI and DCE for treatment response monitoring of invasive breast cancer.

Paul E. Kinahan is a professor and vice chair for research in the Department of Radiology, University of Washington, with joint appointments in radiation oncology, physics and bioengineering. He is director of UWMC PET/CT imaging physics and head of the Imaging Research Laboratory. He received his PhD in biomedical engineering in 1994 at the University of Pennsylvania.

Wei Huang is an associate professor/scientist in the Advanced Imaging Research Center at Oregon Health and Science University. He is a magnetic resonance imaging (MRI) physicist by training and has more than twenty five years’ experience in MRI and MR Spectroscopy (MRS) research. His current research focuses on imaging of underlying tumor biological functions using quantitative MRI methods for cancer detection and therapeutic monitoring.

Lori R. Arlinghaus received her BS degree in biomedical engineering from Washington University, St. Louis in 2002, her MS degree and PhD in biomedical engineering from Vanderbilt University in 2005 and 2009, respectively. She is an imaging research scientist at the Vanderbilt University, Institute of Imaging Science. Her research interests focus on clinical application of quantitative magnetic resonance imaging techniques.

Thomas E. Yankeelov is the Moncrief professor of computational oncology and professor of biomedical engineering and diagnostics at the University of Texas in Austin. He serves as a director of the Center for Computational Oncology and a director of Cancer Imaging Research. The goal of his research is to improve patient care by employing advanced in-vivo imaging methods for the early identification, assessment, and prediction of tumors' response to therapy.

Nola Hylton is a professor of radiology and biomedical imaging at UCSF and directs the Breast Imaging Research Program. She has been integrally involved in the development of MRI for breast cancer detection and diagnosis for over 20 years and works with academic and industry partners on the clinical optimization of breast MRI technologies. Her current research program focuses on the application of quantitative MRI methods to characterize breast cancer response to treatment.

Biographies for the other authors are not available.

Disclosures

Jayashree Kalpathy-Cramer is a consultant for Infotech Soft. Jiachao Liang is an employee of Hologic Inc. Maggie Fung is an employee of GEHC. Kathleen Schmainda has ownership interest in Imaging Biometrics LLC. Other coauthors have nothing to disclose. Contribution of NIST is not subject to copyright in the United States. Certain commercial equipment, instruments, and software are identified in this paper to foster understanding. Such identification does not imply recommendation or endorsement by the NIST, nor does it imply that the materials or equipment identified are necessarily the best available for the purpose.

References

1.Stejskal E. O., Tanner J. E., “Spin diffusion measurements: spin echoes in the presence of a time-dependent field gradient,” J. Chem. Phys. 42, 288–292 (1965). 10.1063/1.1695690 [DOI] [Google Scholar]
2.Aliu S. O., et al. , “Repeatability of quantitative MRI measurements in normal breast tissue,” Transl. Oncol. 7(1), 130–137 (2014). 10.1593/tlo.13841 [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Newitt D. C., et al. , “Reproducibility of ADC measures by breast DWI: results of the ACRIN 6698 trial,” in Int. Society of Magnetic Resonance in Medicine, Honolulu, Hawaii, p. 949 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Giannotti E., et al. , “Assessment and quantification of sources of variability in breast apparent diffusion coefficient (ADC) measurements at diffusion weighted imaging,” Eur. J. Radiol. 84(9), 1729–1736 (2015). 10.1016/j.ejrad.2015.05.032 [DOI] [PubMed] [Google Scholar]
5.Spick C., et al. , “Diffusion-weighted MRI of breast lesions: a prospective clinical investigation of the quantitative imaging biomarker characteristics of reproducibility, repeatability, and diagnostic accuracy,” NMR Biomed. 29(10), 1445–1453 (2016). 10.1002/nbm.v29.10 [DOI] [PubMed] [Google Scholar]
6.Clarke L. P., et al. , “The Quantitative Imaging Network: NCI’s historical perspective and planned goals,” Transl. Oncol. 7(1), 1–4 (2014). 10.1593/tlo.13832 [DOI] [PMC free article] [PubMed] [Google Scholar]
7.ACRIN-6698, “Diffusion weighted MR imaging biomarkers for assessment of breast cancer response to neoadjuvant treatment: a sub-study of the I-SPY 2 trial,” 2012, http://www.acrin.org/Portals/0/Protocols/6698/Protocol-ACRIN6698_v2.29.12_active_ForOnline.pdf (25 September 2017).
8.Clark K., et al. , “The Cancer Imaging Archive (TCIA): maintaining and operating a public information repository,” J. Digital Imaging 26(6), 1045–1057 (2013). 10.1007/s10278-013-9622-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Boss M. A., et al. , “Multicenter study of reproducibility of wide range of ADC at 0°C,” in RSNA Annual Meeting, Chicago, Illinois: (2015). [Google Scholar]
10.Palacios E. M., et al. , “Toward precision and reproducibility of diffusion tensor imaging: a multicenter diffusion phantom and traveling volunteer study,” AJNR Am. J. Neuroradiol. 38(3), 537–545 (2017). 10.3174/ajnr.A5025 [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Pierpaoli C., et al. , “Polyvinylpyrrolidone (PVP) water solutions as isotropic phantoms for diffusion MRI studies,” in ISMRM 17th Annual Meeting (2009). [Google Scholar]
12.3D Slicer Community, “3D slicer home page,” 2017, https://www.slicer.org/ (25 September 2017).
13.Fedorov A., “DW modeling documentation,” 2017, http://wiki.slicer.org/slicerWiki/index.php/Documentation/Nightly/Modules/DWModeling (25 September 2017).
14.Chenevert T., “QIBAphan Analysis Software,” 2016, https://goo.gl/xjHc6G (25 September 2017).
15.Sung K., Charles-Edwards G., “ADC map calculation,” 2014, http://web.stanford.edu/~bah/software/ADCmap (25 September 2017).
16.Center for Scientific Computation in Imaging, UCSD, “AFNI diffusion plugin,” 2012, http://csci.ucsd.edu/afni-diff-plugin (25 September 2017).
17.Imaging Biometrics, “IB diffusion,” 2017, http://www.imagingbiometrics.com/what-we-offer/product-services/ibdiffusion (25 September 2017).
18.Hologic, Inc., “Multiview software,” 2017, http://www.hologic.com/products/imaging/mammography/multiview-software (25 September 2017).
19.Fedorov A., et al. , “3D slicer as an image computing platform for the Quantitative Imaging Network,” Magn. Reson. Imaging 30(9), 1323–1341 (2012). 10.1016/j.mri.2012.05.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Lewin M., et al. , “The diffusion-weighted imaging perfusion fraction f is a potential marker of sorafenib treatment in advanced hepatocellular carcinoma: a pilot study,” Eur. Radiol. 21(2), 281–290 (2011). 10.1007/s00330-010-1914-4 [DOI] [PubMed] [Google Scholar]
21.NIH, “Neuroimaging informatics technology initiative,” 2005, https://nifti.nimh.nih.gov (25 September 2017).
22.Teem, “Nearly raw raster data (NRRD),” 2017, http://teem.sourceforge.net/nrrd (25 September 2017).
23.Mayo Clinic, “Analyze 7.5 file format,” 2017, https://rportal.mayo.edu/bir/ANALYZE75.pdf (25 September 2017).
24.Malyarenko D., et al. , “Toward uniform implementation of parametric map DICOM in multi-site quantitative diffusion imaging studies,” J. Med. Imaging (6), 011006 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Clauser P., et al. , “Is there a systematic bias of apparent diffusion coefficient (ADC) measurements of the breast if measured on different workstations? An inter- and intra-reader agreement study,” Eur. Radiol. 26(7), 2291–2296 (2016). 10.1007/s00330-015-4051-2 [DOI] [PubMed] [Google Scholar]
26.Zeilinger M. G., et al. , “Impact of post-processing methods on apparent diffusion coefficient values,” Eur. Radiol. 27(3), 946–955 (2017). 10.1007/s00330-016-4403-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Malyarenko D. I., et al. , “Demonstration of nonlinearity bias in the measurement of the apparent diffusion coefficient in multicenter trials,” Magn. Reson. Med. 75(3), 1312–1323 (2016). 10.1002/mrm.v75.3 [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Malyarenko D. I., et al. , “QIN DAWG validation of gradient nonlinearity bias correction workflow for quantitative diffusion-weighted imaging in multicenter trials,” Tomogr. J. Imaging Res. 2(4), 396–405 (2016). 10.18383/j.tom.2016.00214 [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Newitt D. C., et al. , “Gradient nonlinearity correction to improve apparent diffusion coefficient accuracy and standardization in the American College of Radiology Imaging Network 6698 breast cancer trial,” J. Magn. Reson. Imaging 42(4), 908–919 (2015). 10.1002/jmri.24883 [DOI] [PMC free article] [PubMed] [Google Scholar]
30.Graham M. S., Drobnjak I., Zhang H., “Realistic simulation of artefacts in diffusion MRI for validating post-processing correction techniques,” NeuroImage 125, 1079–1094 (2016). 10.1016/j.neuroimage.2015.11.006 [DOI] [PubMed] [Google Scholar]
31.National Electrical Manufacturers Association, “DICOM supplement 172: parametric map storage,” Digital Imaging and Communications in Medicine (DICOM) Standard, 2014, ftp://medical.nema.org/medical/dicom/final/sup172_ft2.pdf (25 September 2017).
32.Bourne R. M., et al. , “Biexponential diffusion decay in formalin-fixed prostate tissue: preliminary findings,” Magn. Reson. Med. 68(3), 954–959 (2012). 10.1002/mrm.v68.3 [DOI] [PubMed] [Google Scholar]
33.Le Bihan D., et al. , “Separation of diffusion and perfusion in intravoxel incoherent motion MR imaging,” Radiology 168(2), 497–505 (1988). 10.1148/radiology.168.2.3393671 [DOI] [PubMed] [Google Scholar]
34.Le Bihan D., et al. , “MR imaging of intravoxel incoherent motions: application to diffusion and perfusion in neurologic disorders,” Radiology 161(2), 401–407 (1986). 10.1148/radiology.161.2.3763909 [DOI] [PubMed] [Google Scholar]
35.Bennett K. M., et al. , “Characterization of continuously distributed cortical water diffusion rates with a stretched-exponential model,” Magn. Reson. Med. 50(4), 727–734 (2003). 10.1002/(ISSN)1522-2594 [DOI] [PubMed] [Google Scholar]
36.Jensen J. H., Helpern J. A., “MRI quantification of non-Gaussian water diffusion by kurtosis analysis,” NMR Biomed. 23(7), 698–710 (2010). 10.1002/nbm.1518 [DOI] [PMC free article] [PubMed] [Google Scholar]
37.Jensen J. H., et al. , “Diffusional kurtosis imaging: the quantification of non-Gaussian water diffusion by means of magnetic resonance imaging,” Magn. Reson. Med. 53(6), 1432–1440 (2005). 10.1002/(ISSN)1522-2594 [DOI] [PubMed] [Google Scholar]

[r1] 1.Stejskal E. O., Tanner J. E., “Spin diffusion measurements: spin echoes in the presence of a time-dependent field gradient,” J. Chem. Phys. 42, 288–292 (1965). 10.1063/1.1695690 [DOI] [Google Scholar]

[r2] 2.Aliu S. O., et al. , “Repeatability of quantitative MRI measurements in normal breast tissue,” Transl. Oncol. 7(1), 130–137 (2014). 10.1593/tlo.13841 [DOI] [PMC free article] [PubMed] [Google Scholar]

[r3] 3.Newitt D. C., et al. , “Reproducibility of ADC measures by breast DWI: results of the ACRIN 6698 trial,” in Int. Society of Magnetic Resonance in Medicine, Honolulu, Hawaii, p. 949 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]

[r4] 4.Giannotti E., et al. , “Assessment and quantification of sources of variability in breast apparent diffusion coefficient (ADC) measurements at diffusion weighted imaging,” Eur. J. Radiol. 84(9), 1729–1736 (2015). 10.1016/j.ejrad.2015.05.032 [DOI] [PubMed] [Google Scholar]

[r5] 5.Spick C., et al. , “Diffusion-weighted MRI of breast lesions: a prospective clinical investigation of the quantitative imaging biomarker characteristics of reproducibility, repeatability, and diagnostic accuracy,” NMR Biomed. 29(10), 1445–1453 (2016). 10.1002/nbm.v29.10 [DOI] [PubMed] [Google Scholar]

[r6] 6.Clarke L. P., et al. , “The Quantitative Imaging Network: NCI’s historical perspective and planned goals,” Transl. Oncol. 7(1), 1–4 (2014). 10.1593/tlo.13832 [DOI] [PMC free article] [PubMed] [Google Scholar]

[r7] 7.ACRIN-6698, “Diffusion weighted MR imaging biomarkers for assessment of breast cancer response to neoadjuvant treatment: a sub-study of the I-SPY 2 trial,” 2012, http://www.acrin.org/Portals/0/Protocols/6698/Protocol-ACRIN6698_v2.29.12_active_ForOnline.pdf (25 September 2017).

[r8] 8.Clark K., et al. , “The Cancer Imaging Archive (TCIA): maintaining and operating a public information repository,” J. Digital Imaging 26(6), 1045–1057 (2013). 10.1007/s10278-013-9622-7 [DOI] [PMC free article] [PubMed] [Google Scholar]

[r9] 9.Boss M. A., et al. , “Multicenter study of reproducibility of wide range of ADC at 0°C,” in RSNA Annual Meeting, Chicago, Illinois: (2015). [Google Scholar]

[r10] 10.Palacios E. M., et al. , “Toward precision and reproducibility of diffusion tensor imaging: a multicenter diffusion phantom and traveling volunteer study,” AJNR Am. J. Neuroradiol. 38(3), 537–545 (2017). 10.3174/ajnr.A5025 [DOI] [PMC free article] [PubMed] [Google Scholar]

[r11] 11.Pierpaoli C., et al. , “Polyvinylpyrrolidone (PVP) water solutions as isotropic phantoms for diffusion MRI studies,” in ISMRM 17th Annual Meeting (2009). [Google Scholar]

[r12] 12.3D Slicer Community, “3D slicer home page,” 2017, https://www.slicer.org/ (25 September 2017).

[r13] 13.Fedorov A., “DW modeling documentation,” 2017, http://wiki.slicer.org/slicerWiki/index.php/Documentation/Nightly/Modules/DWModeling (25 September 2017).

[r14] 14.Chenevert T., “QIBAphan Analysis Software,” 2016, https://goo.gl/xjHc6G (25 September 2017).

[r15] 15.Sung K., Charles-Edwards G., “ADC map calculation,” 2014, http://web.stanford.edu/~bah/software/ADCmap (25 September 2017).

[r16] 16.Center for Scientific Computation in Imaging, UCSD, “AFNI diffusion plugin,” 2012, http://csci.ucsd.edu/afni-diff-plugin (25 September 2017).

[r17] 17.Imaging Biometrics, “IB diffusion,” 2017, http://www.imagingbiometrics.com/what-we-offer/product-services/ibdiffusion (25 September 2017).

[r18] 18.Hologic, Inc., “Multiview software,” 2017, http://www.hologic.com/products/imaging/mammography/multiview-software (25 September 2017).

[r19] 19.Fedorov A., et al. , “3D slicer as an image computing platform for the Quantitative Imaging Network,” Magn. Reson. Imaging 30(9), 1323–1341 (2012). 10.1016/j.mri.2012.05.001 [DOI] [PMC free article] [PubMed] [Google Scholar]

[r20] 20.Lewin M., et al. , “The diffusion-weighted imaging perfusion fraction f is a potential marker of sorafenib treatment in advanced hepatocellular carcinoma: a pilot study,” Eur. Radiol. 21(2), 281–290 (2011). 10.1007/s00330-010-1914-4 [DOI] [PubMed] [Google Scholar]

[r21] 21.NIH, “Neuroimaging informatics technology initiative,” 2005, https://nifti.nimh.nih.gov (25 September 2017).

[r22] 22.Teem, “Nearly raw raster data (NRRD),” 2017, http://teem.sourceforge.net/nrrd (25 September 2017).

[r23] 23.Mayo Clinic, “Analyze 7.5 file format,” 2017, https://rportal.mayo.edu/bir/ANALYZE75.pdf (25 September 2017).

[r24] 24.Malyarenko D., et al. , “Toward uniform implementation of parametric map DICOM in multi-site quantitative diffusion imaging studies,” J. Med. Imaging (6), 011006 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]

[r25] 25.Clauser P., et al. , “Is there a systematic bias of apparent diffusion coefficient (ADC) measurements of the breast if measured on different workstations? An inter- and intra-reader agreement study,” Eur. Radiol. 26(7), 2291–2296 (2016). 10.1007/s00330-015-4051-2 [DOI] [PubMed] [Google Scholar]

[r26] 26.Zeilinger M. G., et al. , “Impact of post-processing methods on apparent diffusion coefficient values,” Eur. Radiol. 27(3), 946–955 (2017). 10.1007/s00330-016-4403-6 [DOI] [PMC free article] [PubMed] [Google Scholar]

[r27] 27.Malyarenko D. I., et al. , “Demonstration of nonlinearity bias in the measurement of the apparent diffusion coefficient in multicenter trials,” Magn. Reson. Med. 75(3), 1312–1323 (2016). 10.1002/mrm.v75.3 [DOI] [PMC free article] [PubMed] [Google Scholar]

[r28] 28.Malyarenko D. I., et al. , “QIN DAWG validation of gradient nonlinearity bias correction workflow for quantitative diffusion-weighted imaging in multicenter trials,” Tomogr. J. Imaging Res. 2(4), 396–405 (2016). 10.18383/j.tom.2016.00214 [DOI] [PMC free article] [PubMed] [Google Scholar]

[r29] 29.Newitt D. C., et al. , “Gradient nonlinearity correction to improve apparent diffusion coefficient accuracy and standardization in the American College of Radiology Imaging Network 6698 breast cancer trial,” J. Magn. Reson. Imaging 42(4), 908–919 (2015). 10.1002/jmri.24883 [DOI] [PMC free article] [PubMed] [Google Scholar]

[r30] 30.Graham M. S., Drobnjak I., Zhang H., “Realistic simulation of artefacts in diffusion MRI for validating post-processing correction techniques,” NeuroImage 125, 1079–1094 (2016). 10.1016/j.neuroimage.2015.11.006 [DOI] [PubMed] [Google Scholar]

[r31] 31.National Electrical Manufacturers Association, “DICOM supplement 172: parametric map storage,” Digital Imaging and Communications in Medicine (DICOM) Standard, 2014, ftp://medical.nema.org/medical/dicom/final/sup172_ft2.pdf (25 September 2017).

[r32] 32.Bourne R. M., et al. , “Biexponential diffusion decay in formalin-fixed prostate tissue: preliminary findings,” Magn. Reson. Med. 68(3), 954–959 (2012). 10.1002/mrm.v68.3 [DOI] [PubMed] [Google Scholar]

[r33] 33.Le Bihan D., et al. , “Separation of diffusion and perfusion in intravoxel incoherent motion MR imaging,” Radiology 168(2), 497–505 (1988). 10.1148/radiology.168.2.3393671 [DOI] [PubMed] [Google Scholar]

[r34] 34.Le Bihan D., et al. , “MR imaging of intravoxel incoherent motions: application to diffusion and perfusion in neurologic disorders,” Radiology 161(2), 401–407 (1986). 10.1148/radiology.161.2.3763909 [DOI] [PubMed] [Google Scholar]

[r35] 35.Bennett K. M., et al. , “Characterization of continuously distributed cortical water diffusion rates with a stretched-exponential model,” Magn. Reson. Med. 50(4), 727–734 (2003). 10.1002/(ISSN)1522-2594 [DOI] [PubMed] [Google Scholar]

[r36] 36.Jensen J. H., Helpern J. A., “MRI quantification of non-Gaussian water diffusion by kurtosis analysis,” NMR Biomed. 23(7), 698–710 (2010). 10.1002/nbm.1518 [DOI] [PMC free article] [PubMed] [Google Scholar]

[r37] 37.Jensen J. H., et al. , “Diffusional kurtosis imaging: the quantification of non-Gaussian water diffusion by means of magnetic resonance imaging,” Magn. Reson. Med. 53(6), 1432–1440 (2005). 10.1002/(ISSN)1522-2594 [DOI] [PubMed] [Google Scholar]

PERMALINK

Multisite concordance of apparent diffusion coefficient measurements across the NCI Quantitative Imaging Network

David C Newitt

Dariya Malyarenko

Thomas L Chenevert

C Chad Quarles

Laura Bell

Andriy Fedorov

Fiona Fennessy

Michael A Jacobs

Meiyappan Solaiyappan

Stefanie Hectors

Bachir Taouli

Mark Muzi

Paul E Kinahan

Kathleen M Schmainda

Melissa A Prah

Erin N Taber

Christopher Kroenke

Wei Huang

Lori R Arlinghaus

Thomas E Yankeelov

Yue Cao

Madhava Aryal

Yi-Fen Yen

Jayashree Kalpathy-Cramer

Amita Shukla-Dave

Maggie Fung

Jiachao Liang

Michael Boss

Nola Hylton

Abstract.

1. Introduction

2. Materials and Methods

2.1. Common DWI Datasets

Table 1.

2.2. ADC-CP Participants

Table 2.

2.3. ADC-CP Parametric Maps

2.4. Centralized ROI Analysis

Fig. 1.

2.5. Statistical Analysis

3. Results

3.1. Practicalities

3.2. Breast Scans

Fig. 2.

Fig. 3.

Fig. 4.

3.3. Phantom Scans

Fig. 5.

4. Summary and Discussion

Acknowledgments

Biographies

Disclosures

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases