Graphical abstract

Keywords: Magnetic resonance imaging, Quality control, Diffusion weighted imaging, Quantitative
Highlights
-
•
An established diffusion phantom was scanned across 23 magnetic resonance imaging scanners, two manufacturers, and two field strengths, and consistently showed increased error and lower signal–noise-ratio at lower apparent diffusion coefficient (ADC) values.
-
•
ADC measurements scanned in sagittal orientation consistently showed higher coefficient-of-variation across the reference ADC values.
-
•
No significant differences were seen in Bland-Altman analysis across field strengths.
Abstract
Background and purpose
Diffusion weighted imaging (DWI) and their calculated apparent diffusion coefficient (ADC) values in magnetic resonance imaging (MRI) has grown with applications across multiple anatomical sites and settings across diagnostic and oncologic therapy settings, necessitating characterizing its quantitative performance across multiple scanners. Therefore, the purpose of this study was to determine the quantitative accuracy of DWI and their calculated ADC values across a fleet of MRI systems using a NIST-traceable diffusion phantom.
Materials and methods
Three NIST/QIBA DWI phantoms were imaged with protocols provided with the phantom on a multi-vendor fleet of 23 clinical MRI scanners. User-directed regions-of-interest on each vial provided ADC measurements among a wide range of NIST-traceable ADC values. Across all the MRI systems and data, the coefficient-of-variation was calculated, and Bland-Altman analysis was performed. The measurements between phantoms were analyzed to assess their agreement to determine if multiple phantoms can be used for quality assurance across a large institution.
Results
Lower ADC values and the sagittal orientation exhibited the largest error range and coefficient-of-variation (CoV) and was directly related to SNR. From a one-way analysis of the variance (ANOVA), the mean and standard deviation of the percent errors from each phantom were not significantly different from one another. No strong differences in Bland-Altman plots were seen between 1.5T and 3T scanners, however confidence intervals were different among vendors. The ADC values among the three phantoms were not significantly different from one another.
Conclusions
A comprehensive analysis of institution-wide MRI scanners was conducted, motivating further work to utilize these results to implement a comprehensive quality assurance program and advanced scheduling system to better match similarly performing MRI scanners.
1. Introduction
Diffusion weighted imaging (DWI) is increasingly being used in oncologic applications for assessment of disease response and prediction of therapeutic effects [[1], [2], [3], [4], [5], [6], [7], [8], [9]]. In head and neck cancers, in particular, there is a growing body of literature, which supports, at least in preliminary institutional datasets, the utility of DWI for prognostication of radiotherapeutic or chemoradiotherapeutic response [[10], [11], [12], [13]]. DWI presents a unique advantage in radiation oncology as its ability to obtain functional information can be used for tumor delineation and to assess post-treatment changes [[14], [15], [16]]. Specifically, the apparent diffusion coefficient (ADC), a diffusion parameter derived from DWI, can be used as a quantitative imaging biomarker for disease response and toxicity effects [17].
One hurdle to the prominent implementation of ADC to the clinical workflow is the concern of repeatability on the same scanner and reproducibility across different scanners which may affect response assessment. To characterize this, one prior study has investigated ADC measurements across six MRI scanners across four institutions [18], however only one manufacturer was investigated. Another study investigated the use of an ice-water-based DWI phantom across a fleet of 35 MRI systems of three manufacturers and 1.5T and 3T field strengths [19]. A smaller study used a similar phantom across three Siemens scanners [20]. A recent development for more accurate ADC measurement validation is the National Institutes of Standards and Technology (NIST)-traceable Quantitative Imaging Biomarker Alliance (QIBA) diffusion phantom [21] from CaliberMRI (Boulder, Colorado, USA) which has been investigated in one study across six 3T scanners [22] and in another across one 1.5T scanner and three 3T scanners [23]. The largest of these studies investigated the CaliberMRI phantom across 12 scanners split evenly between 1.5T and 3T from the three major MRI manufacturers (i.e., Siemens, GE, and Philips) [24]. Although these studies assist in characterizing the behavior of ADC across multiple devices, manufacturers, field strengths, and acquisition parameters, it is necessary to continue these efforts especially in larger centers to ensure transferability and scalability of large-scale clinical trials and datasets.
Therefore, in our analysis, we aim to assess the harmonization and standardization of ADC as a quantitative imaging biomarker across a fleet of 23 MRI scanners from both Siemens and GE at both 1.5T and 3T field strengths. We also aim to utilize this detailed and comprehensive information to categorize our scanners by vendor, platform, or field strength by those which are closest in terms of ADC accuracy, so that, if necessary, patients can be selectively triaged to scanners which most closely approximate the machine of their initial imaging. For example, if the scanner of baseline MRI measurements exhibits underestimated ADC, then the scanner at follow-up should exhibit a similar bias, assuming the original scanner is unavailable, to minimize reproducibility errors. Taking this a step further, automated algorithms could be developed to determine these decisions quantitatively at an operations research level. Therefore, our study aims to provide this initial benchmarking across scanners using the QIBA consensus protocol as a precursor to longitudinal assessment amongst scanners for quality assurance as well as later lending to the investigation of variance introduced by specific clinical protocols and advanced acquisition techniques (e.g., multi-shot, simultaneous multi-slice encoding, non-Cartesian, etc.).
2. Materials and methods
In this study, we aimed to characterize our entire fleet of NIST/QIBA phantoms (n = 3) and MRI scanners (n = 23) which constituted nine unique platforms (Supplementary Table S1) with a total of four 1.5T platforms and four 3T platforms with four Siemens (Siemens Healthcare; Erlangen, Germany) and five GE (GE Medical Systems; Milwaukee, WI, USA) manufacturers. Note the 3T GE Signa HDxt systems had Twin Resonance Module (TRM) gradients which allowed the acquisitions to be performed in two separate performance modes (Whole and Zoom).
The NIST-traceable QIBA diffusion phantom [21] is 194 mm in diameter and holds 30 mL vials of polymer polyvinylpyrrolidone (PVP) in aqueous solution. Increasing concentrations of the polymer cause the ADC values to be reduced. The 13 vials within the phantom consist of three vials of pure water and two vials each of a PVP solution at 10, 20, 30, 40, 50 % PVP by mass fraction as shown in Supplementary Fig. S1. To eliminate the temperature sensitivity of diffusion, the phantom was filled with ice water prior to scanning and equilibrated at 0 °C according to the provided phantom instructions [21]. The temperature of the phantom was measured and verified to be within 0 ± 0.2 °C. The value of the diffusion at 0 °C as a function of PVP is given in the table below (Supplementary Table S2).
The diffusion weighted acquisition parameters followed the recommended QIBA protocol but had some platform dependent parameters [25]. The phantom was imaged using each system’s head phased-array radiofrequency receive coil which ranged from 8 to 64 channels. Imaging was performed using diffusion-weighted, single-shot echo planar imaging (EPI) acquisitions (echo time, TE = 71.8–113 ms; repetition time, TR = 10 or 15 s; field-of-view = 216 – 220 mm; matrix = 128 x 128 or 192 x 192; slice thickness = 4 mm; number of averages = 1) with four b-values (0, 500, 900, 2000 s/mm2) in simultaneous three orthogonal directions. If desired, the user may include more averages for each of the b-value images, typically including more averages with higher b-values due to the reduced SNR. Images were acquired single-slice and the NIST/QIBA phantoms were re-oriented for each slice orientation (axial, coronal, sagittal) to explore potential eddy current effects [26]. All of the scanners used in this study (across both GE and Siemens systems) included built-in 3D distortion correction methods which was utilized. The NIST/QIBA phantoms were first characterized to establish phantom equivalence via analysis of their variance. Subsequently, each scanner was evaluated. Two scanners were equipped with dual gradient modes and were evaluated in each mode. In total, after some scanners were scanned multiple times across the three phantoms to validate their equivalence, a total of 31 sets of data were collected.
The resulting images were analyzed using the QIBAPhan software [27] that generated user-directed regions-of-interest (ROI) and corresponding ADC maps from the four b-values and a mono-exponential curve fitting along with automated signal-to-noise ratio (SNR) measurements (Supplementary Fig. S1). Equal area ROIs of 1–2 cm diameter, following QIBA guidelines [28], were used. The quantitative measurements were compared to the NIST-measured ADC values provided by the phantom manufacturer. The coefficient of variation (CoV) was calculated as the standard deviation divided by the mean for each PVP concentration and slice orientation using the combined voxel values within each vial’s ROI. The number of voxels within each vial’s ROI varied from between 60 and 120 depending on the orientation and the exact acquisition parameters. A combined 3D CoV was also calculated using a square root sum-of-squares approach across each orientation. The dependency of the ADC accuracy to the SNR of the image was also examined. Bland-Altman plots and their respective limits of agreement [29,30] were used to investigate the ADC accuracy and bias between devices, manufacturers, and magnetic field strengths.
3. Results
In Fig. 1, a comparison of the three diffusion phantoms used in this study is shown. Each phantom was scanned on the same MRI system and the percent error of the ADC measurement compared to the known value was calculated. The percent error was consistently less than 10 % except for the 50 % PVP concentration which corresponded to the lowest ADC value (118.3–123.0 x10-6 mm2/s). Standard deviation also increased predictably as the ADC values decreased. In the one-way analysis of the variance (ANOVA) the ADC values between the diffusion phantoms were not statistically significant (10 % PVP: F = 0.34, p = 0.72, 20 % PVP: F = 0.88, p = 0.44, 30 % PVP: F = 0.01, p = 0.99, 40 % PVP: F = 0.31, p = 0.74, 50 % PVP: F = 0.22, p = 0.80) except for the water references (0 % PVP: F = 10.3, p < 0.05), where the errors were the smallest. The large increase in error and standard deviation appears to be directly related to the SNR (Fig. 2).
Fig. 1.
The percent error for each concentration of PVP in one MRI system that scanned all three diffusion phantoms. The median value is shown as a red horizontal line for each phantom and PVP concentration. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 2.
Scatter plot of the software-measured signal-to-noise ratio versus the percent error in the measurement. Accompanying per-axis histograms are shown on the top and right side to better visualize the average SNR (top) and percent error (right). The PVP concentrations are shown in varying shades of red to better analyze their trends. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
The CoV was calculated using all the ADC voxel values inside the ROIs for each dataset with respect to the axial, coronal, and sagittal slice orientation (Fig. 3). No measured noise is assumed to have a CoV of 0 % while a CoV of 100 % indicated that the standard deviation of the ADC values within the vial equal the mean ADC of that vial. For most scanners, the sagittal slice orientation had the highest CoV among the different orientations.
Fig. 3.
Plot of the average CoV across all PVP concentrations for each MRI system among the different slice orientations (black letters) and the combined 3D CoV (red circle). The numbers in parentheses for the x-axis labels differentiate individual scanners of the same vendor and system. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Additionally, the CoV was calculated using all the ADC voxel values inside the ROIs for each dataset with respect to the PVP concentration of each vial (Fig. 4). Across all the MRI systems and datasets, the vials with the higher PVP concentrations had the higher CoV. These vials also corresponded to the lowest SNR from Fig. 2.
Fig. 4.
Plot of the average coefficient of variation across all slice orientations for each MRI system among the different vial PVP concentrations. The numbers in parentheses for the x-axis labels differentiate individual scanners of the same vendor and system.
In Fig. 5, Fig. 6, the results from the Bland-Altman analysis are displayed as stratified by different magnetic field strengths and manufacturers, respectively. A similar visualization across platforms is shown in Supplementary Fig. S2. The 95 % limits of agreement were calculated and overlaid on each plot, as well as the means to represent any overall bias.
Fig. 5.
Bland-Altman plot analysis between 1.5T and 3T MRI systems with 95 % limits of agreement represented by dashed red lines and the mean bias represented with a solid red line. The difference was determined by the measured value minus the reference value. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 6.
Bland-Altman plot analysis between GE and Siemens MRI systems with 95% limits of agreement represented by dashed red lines and the mean bias represented with a solid red line. The difference was determined by the measured value minus the reference value. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
4. Discussion
To our knowledge, this is the largest study using the NIST/QIBA DWI phantom to characterize diffusion parameters across scanners of various manufacturers, models, and field strengths. We found that most ADC values were slightly overestimated by ∼ 5 % by following the QIBA acquisition protocol and that higher concentrations of PVP, and thus lower ADC, resulted in reduced SNR and higher CoV. Secondly, the lower ADC values also resulted in higher measured error rates compared to NIST-traceable reference values. Thirdly, the sagittal orientation produced a higher CoV across each orientation. Finally, 1.5T systems performed similarly compared to 3T systems while the GE systems tested here produced wider 95 % confidence intervals compared to Siemens upon Bland-Altman analysis.
In more detail, from Fig. 1, the large percent error seen primarily at the 50 % PVP vials or low ADC values was shown to be a consequence of the SNR dependence. The large CoV in Fig. 4 for the vials with the higher PVP concentration also showed them to be the least reliable. As indicated by Fig. 3, the sagittal orientation had higher ADC accuracy across all scanners. This could potentially be due to gradient nonlinearities within each system. The 95 % limits of agreement were similar between 1.5T and 3T acquisitions with measurements falling outside the range at the lower ADC values. Comparing between GE and Siemens MRI platforms, the Siemens’ limits were narrower with less measurements outside the 95 % confidence interval limits. This implies that GE was more likely to deviate from references values more unpredictably compared to the expected scanner population performance, especially in the lower ADC vials. However, only 12 sets of data were acquired on Siemens MRI systems compared to 19 sets on GE systems. Fig. 5, Fig. 6 showed different intervals for their respective limits of agreement. All systems seem to be close to having zero bias on average with a slightly increased error for the highly restricted, low SNR, ADC values and negative bias for high diffusion values. This overestimation in ADC could be due to the inclusion of the b-value of 0 s/mm2 which may artificially increase the ADC since it includes potential perfusion effects [31]. Therefore, future studies should investigate slightly higher lowest b-values of around 50–100 s/mm2. With the Bland-Altman analysis, it was observed that primarily lower ADC value measurements were outside the 95 % limit. With the known ADC value being small, these differences produced large percent errors, compared to the same differences at the larger ADC values producing small percent errors. The nature of this study called for a consistent method to calculate ADC values across MRI scanners, as done so through QIBAPhan. While this method investigated a robust characterization of ADC values across vendors, models, and field strengths, the ADC values found in this study may vary more in direct radiation oncology applications, as optimized clinical workflows use built-in on-board commercial software for such calculations. Finally, from the ANOVA results, the mean and standard deviation of the percent errors from each phantom, which were used interchangeably on different scanners, were not significantly different from one another. This implies that multiple phantoms can be used to build an ADC quality assurance program in a large institution without fear of significant differences between different versions of the same phantom.
One potential limitation of test fluid and DWI phantom is that their diffusion properties depend on their temperature; previous studies display an increase in ADC value with increasing temperature[32]. To get around this limitation, the NIST/QIBA phantom puts the PVP test fluid in equilibrium in an ice water bath to maintain a constant temperature of 0 °C, which is the approach used in this study. Care has been taken in the preparation and temperature measurement of the phantom prior to scanning to ensure consistent diffusion property. This is complemented by the fact that the temperature dependence is an exponential function and relatively flat at 0 °C in general for the range of PVP percentage and flatter with higher percentage of PVP. Therefore, the diffusion properties of the NIST/QIBA phantom should be relatively stable. Another potential limitation is that we did not consider an SNR-weighted fitting for the ADC calculation which may lead to bias at higher b-values where the SNR is lower [33]. Future work will investigate SNR dependency and compare measurements without the higher 2000 s/mm2 b-value. Bridging from this, the recommended QIBA profile [25] includes a lowest b-value of 0 s/mm2 which can lead to unintended perfusion effects [31]. A final limitation is the use of EPI as the DWI acquisition strategy which is known to suffer from geometric distortion, thus limiting its application in radiation oncology settings where accuracy is critical, particularly in the head and neck region with large air cavities enhancing this distortion [14]. Future studies should investigate distortion-limiting acquisitions such as RESOLVE (Siemens), BLADE (Siemens), PROPELLOR (GE), and SPLICE (Philips).
This study is an important step towards quantitative DWI implementation in the radiation oncology space. There is an unmet need for standardization of ADC values to accurately delineate tumors from normal tissue across the various MRI systems and clinical applications. For instance, by combining the effects of perfusion MRI with DWI, high b-value diffusion MRI can be utilized to diagnose glioblastoma [34]. Additionally, ADC values can be standardized to specific malignancies, including those in the head and neck [35,36], breast [37], and thoracic regions [38]. The results from this study, particularly the biases seen across varying MRI manufacturers, motivate its replication across each clinic to minimize reproducibility biases in the multi-timepoint settings common in radiation oncology. It may also be used in a retrospective setting to confirm whether multiple ADC measurements can be included in a longitudinal analysis and the associated uncertainties involved. The focus here would be to match systems with similar levels of CoV across slice orientations and ADC values to prioritize consistent measurements compared to an optimal measurement inconsistently to best follow-up the subjects. For example, the “1.5T Siemens Aera (6)” and “3T GE HDxt (2) – whole” scanners demonstrated a similar CoV across each slice orientation which is unexpected due to both cross-field strength and cross-vendor differences and, therefore, could be matched for baseline and follow-up imaging. Prior to implementation, the bias across ADC values should also be compared between the two to ensure repeatable and reproducible measurements. From the results presented here, an argument could be made that the follow-up in the same tissues can be made on either 1.5T or 3T scanners given that the measured spread in measured values is similar between the two field strengths at the tissue’s ADC. However, this should be used with caution as each software version and equipment upgrade (i.e., coils) may result in unexpected changes necessitating re-evaluation. For this reason, updating the results of this study following significant changes onto a dashboard-type display, or similar, may be helpful to monitor clinic operations from a physics point-of-view. It may be debated whether including monthly, weekly, or even daily ADC measurements along with the daily ACR phantom quality assurance measurements would be useful towards achieving these goals. With more data across different MRI platforms, future work would involve creating conversion factors to help facilitate quantitative measurements for multi-platform and multi-site imaging studies. Such conversion factors could be calculated from the percent errors seen across ADC values to adjust each scanner’s output to a synthetic 0 % error, though future studies must address the comparison between bias seen in phantoms compared to human subjects. If successful, the resulting “bias vs. ADC value” curve could be used to correct for human subject measurements for longitudinal assessment and clinical trials. It may be useful for MRI manufacturers to standardize how ADC maps are saved. For example, saving both a noise threshold for image interpretation and non-threshold ADC map for quality assurance would be useful in retrospective studies to prevent re-calculation of ADC maps. Further, the methods for image intensity normalization across the b-values should be clearly specified.
In this study, acquisition and analysis of DWI data from numerous MRI systems from a single-institution characterized ADC accuracy and reliability. With the increasing usage of DWI in the clinic, a quantitative quality assurance process should be implemented to provide improved quality control over the MRI systems which would consistently acquire the data needed to reproduce this study at a daily frequency allowing for a more comprehensive understanding of clinic shortcomings and future opportunities. Through ADC standardization across various vendors, scanners, field strengths, and acquisition parameters, radiation oncology is closer to recognizing ADC as a quantitative imaging biomarker. In characterizing ADC values across 23 different scanners as done in this study, DWI can be utilized in a larger number of clinics internationally with a more reliable performance.
5. Funding statement
LM is supported by a National Institutes of Health (NIH) Diversity Supplement (R01CA257814-02S2). NW is supported by a training fellowship from UTHealth Houston Center for Clinical and Translational Sciences T32 Program (Grant No. T32 TR004905) and a NIH National Institute of Dental and Craniofacial Research (NIDCR) Academic Industrial Partnership Grant (R01DE028290). CDF received/receives funding and salary support related to this project during the period of study execution from: the National Institutes of Health (NIH) National Institute of Biomedical Imaging and Bioengineering (NIBIB) Research Education Programs for Residents and Clinical Fellows Grant (R25EB025787-01); the National Institute for Dental and Craniofacial Research Establishing Outcome Measures Award (1R01DE025248/R56DE025248) and Academic Industrial Partnership Grant (R01DE028290); NCI Early Phase Clinical Trials in Imaging and Image-Guided Interventions Program (1R01CA218148); an NIH/NCI Cancer Center Support Grant (CCSG) Pilot Research Program Award from the UT MD Anderson CCSG Radiation Oncology and Cancer Imaging Program (P30CA016672); an NIH/NCI Head and Neck Specialized Programs of Research Excellence (SPORE) Developmental Research Program Award (P50 CA097007). CDF received/receives funding and salary support unrelated to this project during the period of study execution from: NIH Big Data to Knowledge (BD2K) Program of the National Cancer Institute (NCI) Early Stage Development of Technologies in Biomedical Computing, Informatics, and Big Data Science Award (1R01CA2148250; National Science Foundation (NSF), Division of Mathematical Sciences, Joint NIH/NSF Initiative on Quantitative Approaches to Biomedical Big Data (QuBBD) Grant (NSF 1557679); NSF Division of Civil, Mechanical, and Manufacturing Innovation (CMMI) grant (NSF 1933369); and the Sabin Family Foundation. Direct infrastructure support is provided to CDF by the multidisciplinary Stiefel Oropharyngeal Research Fund of the University of Texas MD Anderson Cancer Center Charles and Daneen Stiefel Center for Head and Neck Cancer and the Cancer Center Support Grant (P30CA016672) and the MD Anderson Program in Image-guided Cancer Therapy.
Declaration of competing interest
The authors declare the following financial interests/personal relationships which may be considered as potential competing interests: LM has received unrelated travel funding from Elekta AB. CDF has received direct industry grant support, honoraria, and travel funding from Elekta AB unrelated to this project. Part of the research was conducted at the Center for Academic Biomedical Imaging at the University of Texas MD Anderson Cancer Center with equipment support from GE Healthcare.
Acknowledgments
Part of the research was conducted at the Center for Academic Biomedical Imaging at the University of Texas MD Anderson Cancer Center with equipment support from GE Healthcare. This research is supported in part by the National Institutes of Health through MD Anderson's Cancer Center Support Grant CA016672. Appreciation is expressed to Dr. Thomas Chenevert for providing the analysis package used in this study.
This manuscript is the result of funding in whole or in part by the National Institutes of Health (NIH). It is subject to the NIH Public Access Policy. Through acceptance of this federal funding, NIH has been given a right to make this manuscript publicly available in PubMed Central upon the Official Date of Publication, as defined by NIH.
Data statement
Raw data are made available as a Microsoft Excel file via FigShare (doi:10.6084/m9.figshare.12933938); DICOM files are to be made publicly available after manuscript acceptance. The code to reproduce the analysis and figures is available at: https://github.com/Lucas-Mc/QIBA_ADC.
Footnotes
Supplementary data to this article can be found online at https://doi.org/10.1016/j.phro.2025.100856.
Contributor Information
Lucas McCullum, Email: lbmccullum@mdanderson.org.
Clifton D. Fuller, Email: cdfuller@mdanderson.org.
R. Jason Stafford, Email: jstafford@mdanderson.org.
Appendix A. Supplementary data
The following are the Supplementary data to this article:
References
- 1.Deng J., Wang Y. Quantitative magnetic resonance imaging biomarkers in oncological clinical trials: current techniques and standardization challenges. Chronic Dis Transl Med. 2017;3:8–20. doi: 10.1016/j.cdtm.2017.02.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Jaffray D.A., Chung C., Coolens C., Foltz W., Keller H., Menard C., et al. Quantitative imaging in radiation oncology: an emerging science and clinical service. Semin Radiat Oncol. 2015;25:292–304. doi: 10.1016/j.semradonc.2015.05.002. [DOI] [PubMed] [Google Scholar]
- 3.Maier S.E., Sun Y., Mulkern R.V. Diffusion imaging of brain tumors. NMR Biomed. 2010;23:849–864. doi: 10.1002/nbm.1544. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Padhani A.R. Diffusion magnetic resonance imaging in cancer patient management. Semin Radiat Oncol. 2011;21:119–140. doi: 10.1016/j.semradonc.2010.10.004. [DOI] [PubMed] [Google Scholar]
- 5.Yankeelov T.E., Mankoff D.A., Schwartz L.H., Lieberman F.S., Buatti J.M., Mountz J.M., et al. Quantitative imaging in cancer clinical trials. Clin Cancer Res Off J Am Assoc Cancer Res. 2016;22:284–290. doi: 10.1158/1078-0432.CCR-14-3336. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Koh D.-M., Collins D.J., Orton M.R. Intravoxel incoherent motion in body diffusion-weighted MRI: reality and challenges. AJR Am J Roentgenol. 2011;196:1351–1361. doi: 10.2214/AJR.10.5515. [DOI] [PubMed] [Google Scholar]
- 7.Hötker A.M., Mazaheri Y., Aras Ö., Zheng J., Moskowitz C.S., Gondo T., et al. Assessment of prostate cancer aggressiveness by use of the combination of quantitative DWI and dynamic Contrast-Enhanced MRI. AJR Am J Roentgenol. 2016;206:756–763. doi: 10.2214/AJR.15.14912. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Rubesova E, Grell AS, DeMaertelaer V, Metens T, Chao S-L, Lemort M. Quantitative diffusion imaging in breast cancer: A clinical prospective study - Rubesova - 2006 - Journal of Magnetic Resonance Imaging - Wiley Online Library n.d. https://onlinelibrary.wiley.com/doi/10.1002/jmri.20643 (accessed June 12, 2025). [DOI] [PubMed]
- 9.Thoeny H.C., Ross B.D. Predicting and monitoring cancer treatment response with DW-MRI. J Magn Reson Imaging JMRI. 2010;32:2–16. doi: 10.1002/jmri.22167. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Freling N., Crippa F., Maroldi R. Staging and follow-up of high-grade malignant salivary gland tumours: the role of traditional versus functional imaging approaches – a review. Oral Oncol. 2016;60:157–166. doi: 10.1016/j.oraloncology.2016.04.016. [DOI] [PubMed] [Google Scholar]
- 11.Galbán C.J., Mukherji S.K., Chenevert T.L., Meyer C.R., Hamstra D.A., Bland P.H., et al. A feasibility study of parametric response map analysis of diffusion-weighted magnetic resonance imaging scans of head and neck cancer patients for providing early detection of therapeutic efficacy. Transl Oncol. 2009;2:184–190. doi: 10.1593/tlo.09175. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.King AD, Chow K-K, Yu K-H, Kwok Fai Mo F, Yeung DKW, Yuan J, et al. Head and Neck Squamous Cell Carcinoma: Diagnostic Performance of Diffusion-weighted MR Imaging for the Prediction of Treatment Response | Radiology n.d. https://pubs.rsna.org/doi/10.1148/radiol.12120167?url_ver=Z39.88-2003&rfr_id=ori:rid:crossref.org&rfr_dat=cr_pub%20%200pubmed (accessed June 12, 2025).
- 13.Driessen JP, van Kempen PMW, van der Heijden GJ, Philippens MEP, Pameijer FA, Stegeman I, et al. Diffusion‐weighted imaging in head and neck squamous cell carcinomas: A systematic review - Driessen - 2015 - Head & Neck - Wiley Online Library n.d. https://onlinelibrary.wiley.com/doi/full/10.1002/hed.23575 (accessed June 12, 2025). [DOI] [PubMed]
- 14.McDonald B.A., Salzillo T., Mulder S., Ahmed S., Dresner A., Preston K., et al. Prospective evaluation of in vivo and phantom repeatability and reproducibility of diffusion-weighted MRI sequences on 1.5 T MRI-linear accelerator (MR-Linac) and MR simulator devices for head and neck cancers. Radiother Oncol J Eur Soc Ther Radiol Oncol. 2023;185 doi: 10.1016/j.radonc.2023.109717. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Joint Head and Neck Radiation Therapy-MRI Development Cooperative, the MR-Linac Consortium Head and Neck Tumor Site Group Longitudinal diffusion and volumetric kinetics of head and neck cancer magnetic resonance on a 1.5 MR-linear accelerator hybrid system: A prospective R-IDEAL stage 2a imaging biomarker characterization/pre-qualification study. Radiat Oncol. 2023;42 doi: 10.1016/j.ctro.2023.100666. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.El-Habashy D.M., Wahid K.A., He R., McDonald B., Mulder S.J., Ding Y., et al. Dataset of weekly intra-treatment diffusion weighted imaging in head and neck cancer patients treated with MR-Linac. Sci Data. 2024;11:487. doi: 10.1038/s41597-024-03217-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Messina C., Bignone R., Bruno A., Bruno A., Bruno F., Calandri M., et al. Diffusion-weighted imaging in oncology: an update. Cancers. 2020;12:1493. doi: 10.3390/cancers12061493. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Moore C., Bull C., Darekar A., Wilson D., Goodall A., Manoharan P., et al. Multi-institution longitudinal apparent diffusion coefficient measurements in a diffusion weighted imaging phantom at room temperature. Phys Imaging Radiat Oncol. 2025:35. doi: 10.1016/j.phro.2025.100814. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Malyarenko D., Galbán C.J., Londy F.J., Meyer C.R., Johnson T.D., Rehemtulla A., et al. Multi-system repeatability and reproducibility of apparent diffusion coefficient measurement using an ice-water phantom. J Magn Reson Imaging JMRI. 2013;37:1238–1246. doi: 10.1002/jmri.23825. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Sorace A.G., Wu C., Barnes S.L., Jarrett A.M., Avery S., Patt D., et al. Repeatability, reproducibility, and accuracy of quantitative mri of the breast in the community radiology setting. J Magn Reson Imaging JMRI. 2018 doi: 10.1002/jmri.26011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Quantitative MRI (qMRI) Diffusion Phantom (QIBA DWI) 2017.
- 22.Paquier Z., Chao S.-L., Bregni G., Sanchez A.V., Guiot T., Dhont J., et al. Pre-trial quality assurance of diffusion-weighted MRI for radiomic analysis and the role of harmonisation. Phys Med. 2022;103:138–146. doi: 10.1016/j.ejmp.2022.10.009. [DOI] [PubMed] [Google Scholar]
- 23.Choi S.J., Kim K.W., Ko Y., Cho Y.C., Jang J.S., Ahn H., et al. Whole process of standardization of diffusion-weighted imaging: phantom validation and clinical application according to the QIBA profile. Diagnostics. 2024;14:583. doi: 10.3390/diagnostics14060583. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Pasini S., Ringgaard S., Vendelboe T., Garcia-Ruiz L., Strittmatter A., Villa G., et al. Multi-center and multi-vendor evaluation study across 1.5 T and 3 T scanners (part 1): apparent diffusion coefficient standardization in a diffusion MRI phantom. MAGMA. 2025 doi: 10.1007/s10334-025-01256-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Shukla-Dave A, Obuchowski N, Chenevert TL, Jambawalikar S, Schwartz LH, Malyarenko D, et al. Quantitative imaging biomarkers alliance (QIBA) recommendations for improved precision of DWI and DCE‐MRI derived biomarkers in multicenter oncology trials - Shukla‐Dave - 2019 - Journal of Magnetic Resonance Imaging - Wiley Online Library n.d. https://onlinelibrary.wiley.com/doi/full/10.1002/jmri.26518 (accessed June 12, 2025). [DOI] [PMC free article] [PubMed]
- 26.Malyarenko D., Newitt D., Wilmes L., Tudorica A., Helmer K.G., Arlinghaus L.R., et al. Demonstration of nonlinearity bias in the measurement of the apparent diffusion coefficient in multicenter trials. Magn Reson Med. 2016;75:1312–1323. doi: 10.1002/mrm.25754. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Newitt D.C., Malyarenko D., Chenevert T.L., Quarles C.C., Bell L.C., Fedorov A., et al. Multisite concordance of apparent diffusion coefficient measurements across the NCI Quantitative Imaging Network. J Med Imaging. 2017;5 doi: 10.1117/1.JMI.5.1.011003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Ma B., et al. The QIBA profile for diffusion-weighted MRI: apparent diffusion coefficient as a quantitative imaging biomarker. Radiology. 2024:313. doi: 10.1148/radiol.233055. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Bland J.M., Altman D.G. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet Lond Engl. 1986;1:307–310. [PubMed] [Google Scholar]
- 30.Bland M. Oxford University Press (UK); 2015. An introduction to medical statistics. [Google Scholar]
- 31.Le Bihan D., Breton E., Lallemand D., Aubin M.L., Vignaud J., Laval-Jeantet M. Separation of diffusion and perfusion in intravoxel incoherent motion MR imaging. Radiology. 1988;168:497–505. doi: 10.1148/radiology.168.2.3393671. [DOI] [PubMed] [Google Scholar]
- 32.Amouzandeh G., Chenevert T.L., Swanson S.D., Ross B.D., Malyarenko D.I. Technical note: Temperature and concentration dependence of water diffusion in polyvinylpyrrolidone solutions. Med Phys. 2022;49:3325–3332. doi: 10.1002/mp.15556. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Blackledge M.D., Tunariu N., Zugni F., Holbrey R., Orton M.R., Ribeiro A., et al. Noise-corrected, exponentially weighted, diffusion-weighted MRI (niceDWI) improves image signal uniformity in whole-body imaging of metastatic prostate cancer. Front Oncol. 2020;10:704. doi: 10.3389/fonc.2020.00704. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Wahl D.R., Kim M.M., Aryal M.P., Hartman H., Lawrence T.S., Schipper M.J., et al. Combining perfusion and high B-value diffusion MRI to inform prognosis and predict failure patterns in glioblastoma. Int J Radiat Oncol. 2018;102:757–764. doi: 10.1016/j.ijrobp.2018.04.045. [DOI] [PubMed] [Google Scholar]
- 35.Wang J., Takashima S., Takayama F., Kawakami S., Saito A., Matsushita T., et al. Head and neck lesions: characterization with diffusion-weighted echo-planar MR imaging. Radiology. 2001;220:621–630. doi: 10.1148/radiol.2202010063. [DOI] [PubMed] [Google Scholar]
- 36.Kato H., Kanematsu M., Kato Z., Teramoto T., Mizuta K., Aoki M., et al. Necrotic cervical nodes: usefulness of diffusion-weighted MR imaging in the differentiation of suppurative lymphadenitis from malignancy. Eur J Radiol. 2013;82:e28–e35. doi: 10.1016/j.ejrad.2012.08.014. [DOI] [PubMed] [Google Scholar]
- 37.Sinha S, Lucas-Quesada FA, Sinha U, DeBruhl N, Bassett LW. In vivo diffusion‐weighted MRI of the breast: Potential for lesion characterization - Sinha - 2002 - Journal of Magnetic Resonance Imaging - Wiley Online Library n.d. https://onlinelibrary.wiley.com/doi/10.1002/jmri.10116. [DOI] [PubMed]
- 38.Shen G., Jia Z., Deng H. Apparent diffusion coefficient values of diffusion-weighted imaging for distinguishing focal pulmonary lesions and characterizing the subtype of lung cancer: a meta-analysis. Eur Radiol. 2016;26:556–566. doi: 10.1007/s00330-015-3840-y. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.






