Abstract
Background and purpose:
The apparent diffusion coefficient (ADC), a potential imaging biomarker for radiotherapy response, needs to be reproducible before translation into clinical use. The aim of this study was to evaluate the multi-centre delineation- and calculation-related ADC variation and give recommendations to minimize it.
Materials and methods:
Nine centres received identical diffusion-weighted and anatomical magnetic resonance images of different cancerous tumours (adrenal gland, pelvic oligo metastasis, pancreas, and prostate). All centres delineated the gross tumour volume (GTV), clinical target volume (CTV), and viable tumour volume (VTV), and calculated ADCs using both their local calculation methods and each of the following calculation conditions: b-values 0–500 vs. 150–500 s/mm2, region-of-interest (ROI)-based vs. voxel-based calculation, and mean vs. median. ADC variation was assessed using the mean coefficient of variation across delineations (CVD) and calculation methods (CVC). Absolute ADC differences between calculation conditions were evaluated using Friedman’s test. Recommendations for ADC calculation were formulated based on observations and discussions within the Elekta MRI-linac consortium image analysis working group.
Results:
The median (range) CVD and CVC were 0.06 (0.02–0.32) and 0.17 (0.08–0.26), respectively. The ADC estimates differed 18% between b-value sets and 4% between ROI/voxel-based calculation (p-valu es < 0.01). No significant difference was observed between mean and median (p = 0.64). Aligning calculation conditions between centres reduced CVC to 0.04 (0.01–0.16). CVD was comparable between ROI types.
Conclusion:
Overall, calculation methods had a larger impact on ADC reproducibility compared to delineation. Based on the results, significant sources of variation were identified, which should be considered when initiating new studies, in particular multi-centre investigations.
Keywords: Apparent diffusion coefficient, MRI-Linac, Adaptive radiotheray, Diffusion-weighted magnetic resonance imaging, ADC reproducibility, MRI biomarkers
Quantitative imaging biomarkers (QIBs), derived from in-vivo imaging, are useful in oncology, as they non-invasively provide quantitative information on tissue characteristics [1–3]. Development of QIBs has the potential to improve precision and reduce subjectivity of image analysis, and hereby enable a more robust association between image-derived parameters and biological and clinical parameters [4,5]. QIBs may provide spatially and temporally resolved information linked to tumour biology, which in radiotherapy may be used for improved target delineation, dose-painting and prediction and monitoring of response. Hence, QIBs may improve personalization of the treatment [6].
The advanced magnetic resonance imaging (MRI) technique, diffusion-weighted MRI (DWI), is a potential QIB for the above-mentioned radiotherapy purposes [6–9]. In standard DWI, strong magnetic gradients are applied to sensitize the MRI signal to the random motion of water molecules within the scanned object. The amount of diffusion weighting is defined by the b-value, and if at least two appropriately selected b-values are acquired, the quantitative parameter, the apparent diffusion coefficient (ADC), can be derived. ADC correlates with tissue cellularity, and have been shown to identify radio-resistant regions [10,11]. DWI and derived ADC maps are used in the clinic to guide target delineation for some tumours, and may be a future tool for dose painting [12,13]. Further, baseline ADC and ADC changes during treatment have shown potential to predict response, although lack of consistency is preventing translation to the clinic [8,14–17]. Specifically, varying acquisition protocols and analysis methods reduce ADC reproducibility, potentially hindering validation of ADC as a QIB. To overcome this problem, a standardization of measurements is needed, and large multi-centre validation trials are warranted [2,18].
Hybrid MRI linear accelerators (MRI-linac) allows daily measurement of ADC, with no or limited prolongation of the radiotherapy fractions [19,20]. As such, an MRI-linac provides an ideal platform for clinical validation of potential QIBs such as ADC. Accuracy of ADC on MRI-linac has been demonstrated using phantoms, and feasibility has been demonstrated in patients [18,21,22]. Furthermore, recommendations for MRI protocols to acquire DWI on an Elekta MRI-linac have been published [23]. The current study focused on the analysis of the acquired DWI scans to obtain an ADC value.
Different approaches to DWI analysis may introduce a variation across centres/studies. Within the Elekta MRI-linac consortium image analysis working group [24], two expected sources of variation were identified: The delineation of a region of interest (ROI), and the calculation method. Delineation uncertainty is a well-known source of uncertainty in radiotherapy and propagates as ADC variation as well [25,26]. The impact of calculation methods on ADC reproducibility has been investigated to a lesser extent [27]. The current study investigated the impact of variations in both delineations and calculation methods on the ADC reproducibility utilizing the same data, which enabled assessment of their relative contributions. The aim was to give vendor-neutral recommendations to improve ADC reproducibility, based on an evaluation of the observed ADC variation between MRI-linac centres and discussions within the working group.
Methods
Study design
Nine MRI-linac centres participated in the study using anonymized patient MRI data from four different clinical cases, acquired at one of the participating centres. At each centre, two steps were performed (Fig. 1). In step 1, an oncologist performed delineations. In step 2, each centre calculated ADC for delineations made at all centres using their local calculation method. This resulted in a 9×9 table of ADC values for each clinical case and delineation type.
Clinical cases
The study included four patients with different cancerous tumours.
Adrenal gland (76 year old male with oligo progression after systemic treatment for non-small cell lung cancer)
Pancreas (68 year old male with recurrent pancreas cancer, consolidative radiotherapy after systemic treatment)
Oligo metastasis in the pelvis (54 year old woman with recurrent ovarian cancer, consolidative radiotherapy after systemic treatment)
Prostate and adjacent seminal vesicles (74 years old man with low volume metastatic prostate cancer)
All patients received treatment on the same 1.5 T MRI-linac (Unity by Elekta, Stockholm, Sweden) at one of the participating centres. The patients were included in the MOMENTUM study (clinicaltrials.gov NCT04075305) [28]. Informed consent was obtained from all patients, and DICOM-data was anonymized and stored adhering to ethics standards.
MRI data
MRI data were acquired at fraction one, prior to beam delivery and included T2-weighted images (T2W) and DWI with the b-values 30, 80, 150, 300 and 500 s/mm2 (adrenal gland and pancreas), and 0, 30, 80, 150 and 500 s/mm2 (oligo metastasis and prostate) adhering to the normal MRI-linac workflow [29]. Sequence details are listed in Table S1 in supplementary materials. DWI were acquired twice in succession while the patient remained in position, to obtain test–retest data for repeatability estimation.
Delineation
T2W images and DWI images with b-values 150 and 500 s/mm2 were available for delineation. Provided with brief clinical case descriptions, the oncologists delineated the gross tumour volume (GTV), clinical target volume (CTV) (prostate only) and the viable tumour volume (VTV) (except for prostate) in a mutually blinded manner using the ProKnow platform (Version 1.32.0, Elekta Solutions AB, Stockholm, Sweden). The VTV was defined as the GTV excluding cystic and necrotic parts. A description of the technical data preparation is given in supplementary materials.
ADC calculation
Each centre provided a brief description of their local calculation method, including software implementation, choice of b-values, and whether a ROI- or voxel-based calculation was used. The ROI-based method refers to ADC calculation using the mean or median ROI signals of DWIs, whereas the voxel-based method refers to calculating ADC within each voxel, after which the mean or median value is calculated within the ROI. If a centre’s standard approach was to use the scanner software for ADC calculation, that centre was provided with ADC maps calculated with the scanner software using all b-values, the lowest and the highest value, and b ≥ 150 s/mm2, respectively. They were asked to choose the set best representing their normal choice.
Each centre provided ADC values for both their own and other centre’s delineations. The calculation was based on 1: the centre’s own calculation method, and 2: all combinations of the following calculation conditions: all b-values vs. b ≥ 150 s/mm2, ROI-based vs. voxel-based and mean vs. median (referred to as pre-specified calculation conditions).
Data analysis and statistics
Delineations were compared pairwise to calculate the Dice similarity coefficient (Dice) and mean surface distance (MSD). ADC variation across delineations and calculation methods was assessed using the mean coefficient of variation (CV), calculated in the following way (Cf. Fig. 1): The CV describing variation across calculation methods was calculated for each of the nine delineations, and the average of these nine values was used as a measure of variation across calculation methods (CVC). Likewise, the CV describing variation across delineations was calculated for each of the nine calculation methods, and the average was used as a measure of variation across delineations (CVD).
Retest ADC values were calculated using rigid contour propagation of GTVs between test- and retest-scans. Median ADC values within the GTVs were extracted from ADC maps calculated with the scanner software using b ≥ 150 s/mm2. The within-subject coefficient of variation (wCV) was calculated as a measure of test–retest ADC variation (ADC repeatability), as recommended by the Quantitative Imaging Biomarkers Alliance (QIBA) [30].
The ADC difference between the sets of b-values, ROI/voxel-based analysis and mean/median values, respectively, were evaluated using Friedman tests with a 5 % significance level and with Bonferroni correction for multiple testing. Only GTVs were used for this purpose.
Results
A total of 69 out of 72 expected delineated volumes (9 centres × 8 volumes) were available for the analysis. Within these volumes, a total of 4483 ADC values were obtained out of 5589 (69 delineation × 9 centres × 9 combinations of calculation conditions). The reasons for the reduced number were the following: One centre omitted calculation within two prostate volumes and two centres omitted the ROI-based calculations due to technical difficulties or limitations of their local software. One centre omitted ADC calculation using the pre-specified calculation conditions due to limited time and resources. One centre used software that reported only one decimal, which in some cases led to CV’s of zero. CV’s of zero were excluded before calculating the mean CV.
Representative delineations are presented in Fig. 2. The delineation variation was large for pancreas VTV and prostate GTV (Dice: 0.20–0.22 and MSD: 9.09–9.23 mm) compared to the remaining cases (Dice: 0.48–0.88 and MSD: 1.52–4.09 mm) (Fig. 3.A–B). A closer inspection of the prostate delineations revealed that some GTV delineations did not overlap (Fig. 2). The prostate CTV delineation variation was smaller (Dice: 0.80, MSD: 2.68 mm), despite not all centres included the seminal vesicles in the delineation. The CVD was comparable between GTV and VTV, although the delineation variation was slightly smaller for GTV compared to VTV (Fig. 3.A–B). There was a clear correlation between delineation variation and ADC variation (Fig. 3.A–B).
All centres used a voxel-based approach as their local calculation method. One centre used ADC maps generated by the scanner software, while remaining centres used in-house software for ADC calculation with a mono-exponential Stejskal-Tanner model [31]. The main differences between the local calculation methods were the choice of b-values, fitting method, and applied filtering. A full comparison of the centres’ local calculation methods is presented in Table S2 in supplementary materials.
With the centres’ own calculation methods, the median (range) CVD and CVC were 0.06 (0.02 – 0.32) and 0.17 (0.08 – 0.26), respectively (Fig. 4.A). The delineation-related variation was larger for pancreas VTV and prostate GTV (CVD: 0.15–0.32) compared to the remaining cases (CVD: 0.02–0.06). In comparison, the ADC repeatability (wCV) based on test–retest scans was estimated to 4.0% (adrenal gland), 6.6% (pancreas), 1.3% (oligo metastasis), and 15.2% (prostate). A detailed overview of the ADC variation for each delineation and calculation method is shown in Fig. S1–3 in supplementary materials.
When centers aligned their calculation methods according to any of the pre-specified calculation conditions, the calculation-related ADC variation was clearly smaller than when centres used their own choice of calculation conditions (Fig. 4.B–I compared to Fig. 4.A), with a reduction of median (range) CVC to 0.04 (0.01–0.16) (or 0.04 (0.01–0.08) with the low-agreement prostate GTV excluded).
In terms of absolute ADC, there was a trend towards larger values for calculation methods that included b-values below 150 s/mm2, (calculation methods no. 1, 4 and 9 in Figure S1 and S3 and Table S2 in supplementary materials). Averaged across all combinations of the pre-specified calculation conditions, ADC estimates were 18% larger for the full b-set compared to b ≥ 150 s/mm2 (p < 0.01) and 4% larger for ROI-based analysis compared to voxel-based (p < 0.01) (Table 1). There was no significant difference between mean and median values (p = 0.64).
Table 1.
All b-values | b ≥ 150 s/mm2 | Mean difference (%) | |||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
ROI-based | Voxel-based | ROI-based | Voxel-based | ||||||||
Mean | Median | Mean | Median | Mean | Median | Mean | Median | b-sets | ROI/vox | Mean/median | |
Adrenal gland | |||||||||||
GTV | 1.27 | 1.37 | 1.21 | 1.24 | 0.90 | 0.97 | 0.88 | 0.90 | 32.86 | 5.84 | −4.91 |
VTV | 1.26 | 1.34 | 1.23 | 1.24 | 0.88 | 0.96 | 0.87 | 0.90 | 33.69 | 4.40 | −4.47 |
Pancreas | |||||||||||
GTV | 0.95 | 0.95 | 0.88 | 0.96 | 0.88 | 0.85 | 0.81 | 0.82 | 11.05 | 4.30 | −1.59 |
VTV | 1.08 | 1.08 | 1.03 | 1.07 | 1.03 | 1.06 | 0.98 | 1.00 | 4.59 | 3.90 | −2.18 |
Oligo metastasis | |||||||||||
GTV | 1.51 | 1.41 | 1.44 | 1.34 | 1.33 | 1.33 | 1.29 | 1.19 | 10.35 | 5.99 | 5.48 |
VTV | 1.49 | 1.39 | 1.43 | 1.32 | 1.33 | 1.31 | 1.29 | 1.18 | 9.58 | 5.45 | 6.33 |
Prostate | |||||||||||
GTV | 1.41 | 1.37 | 1.35 | 1.34 | 1.07 | 0.99 | 1.03 | 1.02 | 28.41 | 1.85 | 3.09 |
CTV | 1.56 | 1.52 | 1.49 | 1.46 | 1.31 | 1.29 | 1.27 | 1.29 | 15.62 | 3.00 | 1.13 |
Mean value | 1.32 | 1.30 | 1.26 | 1.25 | 1.09 | 1.09 | 1.05 | 1.04 | 18.27 | 4.34 | 0.36 |
Discussion
This study evaluated the ADC variation related to differences in delineation and calculation methods between centres. The calculation-related variation was generally larger than delineation-related variation (Fig. 4.A), and was primarily driven by different choices of b-values. When calculation conditions (all b-values vs. b ≥ 150 s/mm2, ROI-based vs. voxel-based, and mean vs. median) were aligned between centres, the calculation-related variation was reduced to about the same level as the delineation-related variation. Furthermore, the delineation- and calculation-related ADC variation was comparable to the ADC repeatability, indicating that acquisition and post-processing of the images contribute equally to the ADC variation. The GTV and VTV performed comparably with respect to ADC reproducibility.
Overall, the observed delineation-related ADC variation largely agreed with other studies, showing CV of 0.1 and inter-observer coefficient of repeatability of 1.9–14% in pancreas [32,33], and 9.5–13.7% in prostate [34], although not directly comparable due to differences in methods. The large delineation variation of the pancreas VTV was likely due to the higher sensitivity to delineation of small volumes (Fig. 3.C). For the prostate GTV, the large delineation variation could arise from the GTV not being a standard delineation type. In fact, large variation in definitions of intraprostatic lesions has been reported in earlier studies [35,36]. Potentially, the use of a higher b-value would have improved the conspicuity of the intra-prostatic lesions. To comply with the MRI-linac recommendations, a maximum b-value of 500 s/mm2 was used [18,23]. The delineation variation in prostate may also have been overestimated as not all centres included the vesicles in the CTV (as case descriptions indicated).
Other studies have shown that the type of ROI influences both absolute ADC values, relative ADC changes during treatment, and the reproducibility of delineations [18,25,26,37]. Therefore, this study included two types of ROIs. The GTV, because it has the advantage of being available before the start of treatment in both the standard and MRI-linac radiotherapy workflow. The so-called VTV was included because it excludes non-viable parts of the tumour and may be relevant for probing the cellular response directly and assessing treatment response, as suggested by Padhani et al. [18]. Further, one study showed that ADC based on VTV was superior to GTV in stratifying between responding and non-responding patients [38]. An advantage of the VTV is that, by definition, it contains only high signal-to-noise-ratio (SNR) voxels. For tumours with no significant necrotic/cystic components, e.g. prostate, the VTV corresponds to the GTV.
Since the choice of ROI type did not influence the ADC reproducibility in the current study, selection of ROI type depends on its application in radiotherapy. While the VTV may define radio-resistant regions and be relevant for dose painting, it is not obvious which ROI is best suited for response prediction. The literature investigating the potential benefits of using GTV vs. VTV is limited [16,38]. In general, the results of the current study advocate improving delineation consistency (Fig. 3.A–B), which underlines the importance of having as precise consensus guidelines as possible. In the future, delineation variation may be reduced by automatic delineation tools including AI models, as indicated in several studies [17,39–41].
The DWI-signal is sensitive to perfusion at low b-values (below 100 s/mm2), and therefore, including low b-values in the analysis is expected to overestimate ADCs [18,42] as observed in this study also (all b-values compared to b ≥ 150 s/mm2) (Table 1). Therefore, a previous publication by the Elekta MRI-linac working group, recommended that the lowest b-value should be 100–150 s/mm2 [23]. A maximum b-value of 500 s/mm2 was also recommended to ensure sufficient SNR and a diffusion time comparable to that of a diagnostic scanner. Moreover, if notably higher b-values are included in the calculation (b > 1000 s/mm2), non-Gaussian diffusion effects may result in an underestimation of ADC, as the mono-exponential model assumes a Gaussian diffusion behaviour [43].
The ROI- and voxel-based approach have been used in previous studies and are therefore relevant from a reproducibility point-of-view [25,33,37,44,45]. It should be noted that strictly speaking, the average ADC across voxels within a ROI cannot be derived using the ROI-based approach, which is based on the mean DWI signal within the ROI. I.e. the ROI-based method is mathematically inconsistent with the exponential model of ADC calculation (when more than one voxel is present within a ROI). However, using the ROI-based method may lead to better estimates of ADC as it is expected to be more robust to motion induced misalignment of individual DWI acquired at different b-values, which if not properly corrected can lead to invalid ADC values. Further, it may improve SNR which may give a better goodness of fit of data, as was confirmed using the current data (not shown) [6]. In the current study, the ROI-based approach led to larger ADC values compared to the voxel-based approach (Table 1), while the two approaches performed comparable with respect to ADC reproducibility (Fig. 4.B–I).
The residual calculation-related ADC variation present after aligning the pre-specified calculation conditions between centres (Fig. 4.B–I) may be accounted for as use of different software implementations, including different fitting and filtering methods (Table S2 in supplementary materials). Specifically, five centres used linear least squares fitting of ln(S) as a function of b-values to estimate the ADC (Table S2 in supplementary materials). Since the SNR decreases with increasing b-value, the uncertainty of ln (S) also increases with b-values, if not accounted for by averaging signals from multiple excitations. Thus, a better approach will be to use weighted linear least squares fitting (see supplementary materials) [46]. For the voxel-based approach, five centres used filtering by excluding voxels containing non-physical values, i.e. values outside a certain range (Table S2 in supplementary materials). Alternative to this, voxels may be removed based on low SNR or poor quality of the fit, which is a more objective criterion. Contributions from fitting and filtering were not determined individually, nevertheless, in combination, they resulted in calculation-related ADC variations comparable to the delineation-related variations (points close to the dotted line in Fig. 4.B–I). This stresses the importance of excluding sources of variation whenever possible, especially if the aim is to establish common ADC cut-off values, e.g. for response prediction. Making a platform-independent software available for public download might be a way to proceed such that in-house developed software can be validated against a common software.
The SNR has also been shown to play a role in estimation of the ADC [23,47]. Although not specifically investigated in this study, it is worth mentioning a few implications. Low SNR levels lead to an underestimation of the ADC, due to the so-called noise floor present in magnitude reconstructed DWI-images [6,48]. Therefore, to allow a comparison between studies, the SNR should always be reported based on defined standards, e.g. published by the National Electrical Manufacturers Association (NEMA) [49] or QIBA [50]. For practicality, it may be sufficient to measure SNR once, if patient and coil positioning is consistent between scans [6]. Applying noise correction has been shown to reduce the ADC bias [47].
Other specific points of attention when calculating ADC include pre-processing of the image data. For example, to minimize the effect of motion, registration between b-values is recommended [51], and is available on most MRI scanner software, including the Unity MRI-linac. As a minimum, b-value images should visually be inspected for motion and artefacts. Further, as the intensity-histogram of DWI images may be “stretched” to fully utilize the storage bit depth, the stored pixel values should as a general rule be “unscaled” prior to quantitative analysis as described by Chenevert et al. [52].
A main limitation of this study is that only one patient was included per tumour type. This was deemed a necessary compromise to increase the realizability of the investigation. However, by including four tumour types instead of e.g. four tumours of the same type, we were able detect differences in the analysis-related ADC variation between tumour types. Minor limitations include that no re-positioning of the patient was performed between the test and retest scans, which may underestimate the true repeatability. ADC reproducibility may also be affected by the sequence used to acquire the images (turbo-spin-echo (TSE) vs. echo planar imaging (EPI) [53]) and the diffusion time [54], but investigations of this was outside the scope of the current study where EPI based readout was used. Moreover, as EPI is notorious for low geometric accuracy [55], a high ADC reproducibility can still lead to a misinterpretation of the extent of the GTV. The effect of geometric distortions on ADC reproducibility and GTV misalignment should be investigated in a future study.
Conclusion
This investigation provides recommendations for improving reproducibility of ADC calculations, based on observations and discussions within the Elekta MRI-linac consortium image analysis working group. These recommendations are focused towards future investigations of ADC as a potential imaging biomarker in radiotherapy. Investigations of other potential quantitative imaging biomarkers using a similar setup, and the geometric accuracy of these, are warranted.
In summary, the calculation-related ADC variation was larger than the delineation-related ADC variation. Specifically, the calculation-related ADC variation can be attributed to the choice of b-values, ROI-based/voxel-based calculation, and software implementation including fitting and filtering method. Therefore, it is recommended to align these factors in multi-centre studies, and to report details of the ADC calculation method within a study to allow comparison between studies. In general, delineation variation correlates with ADC variation, and should therefore be reduced as much as possible. Selection of GTV vs. a dedicated volume for ADC derivation seems less critical for ADC reproducibility, and should depend primarily on feasibility and the radiotherapy purpose.
Supplementary Material
Acknowledgements
We thank Marco Luzzara, Senior Director of Medical Affairs and Clinical Research, Elekta, for his support in facilitating data sharing and analysis in ProKnow, Elekta. We thank Liam Lawrence, Physical Sciences Platform, Sunnybrook Research Institute, Toronto, Canada, and Edward Taylor, Department of Medical Physics, Princess Margaret Cancer Centre, University Health Network, Toronto, Canada, for their contributions to the data analysis.
Role of funding source
AB, FM acknowledges the support of the Danish Cancer Society (Grant no. R231-356 A13852), Danish Comprehensive Cancer Center RT (Danish Cancer Society grant) (Grant no. R191-A11526), and by MANTRA (New MAgNetic resonance Technology for Response Adapted radiotherapy), a Frontline research center based at Odense University Hospital, Denmark.
AT acknowledges the support of Cancer Research UK grant numbers C7224/A28724 and C33589/A28284. Further, AT acknowledges NHS funding to the NIHR Biomedical Research Centre at The Royal Marsden and The Institute of Cancer Research. The views expressed in this publication are those of the author(s) and not necessarily those of the NHS, the National Institute for Health Research or the Department of Health and Social Care.
MLR acknowledges the support of the German Research Council (DFG, Grant no. ZI 736/2–1; PAK997/1), the University Hospital Tübingen and the Medical Faculty Tübingen.
None of the above funding sources had any role in the study design, collection, analysis and interpretation of data, writing of the report or in the decision to submit the article for publication.
Footnotes
CRediT authorship contribution statement
Anne L.H. Bisgaard: Conceptualization, Methodology, Writing – original draft, Visualization, Project administration, Formal analysis, Investigation. Rick Keesman: Conceptualization, Methodology, Investigation, Resources, Writing – review & editing. Astrid L.H.M. W. van Lier: Conceptualization, Methodology, Investigation, Writing – review & editing. Catherine Coolens: Conceptualization, Methodology, Investigation, Resources, Writing – review & editing. Petra J. van Houdt: Investigation, Resources, Writing – review & editing. Alison Tree: Investigation, Writing – review & editing. Andreas Wetscherek: Investigation, Resources, Writing – review & editing. Paul B. Romesser: Investigation, Writing – review & editing. Neelam Tyagi: Investigation, Resources, Writing – review & editing. Monica Lo Russo: Investigation, Writing – review & editing. Jonas Habrich: Investigation, Resources, Writing – review & editing. Danny Vesprini: Investigation, Writing – review & editing. Angus Z. Lau: Investigation, Resources, Writing – review & editing. Stella Mook: Investigation, Writing – review & editing. Peter Chung: Investigation, Writing – review & editing. Linda G. W. Kerkmeijer: Investigation, Writing – review & editing. Zeno A.R. Gouw: Investigation, Resources, Writing – review & editing. Ebbe L. Lorenzen: Software, Writing – review & editing. Uulke A. van der Heide: Conceptualization, Supervision, Methodology, Writing – review & editing. Tine Schytte: Investigation, Resources, Writing – review & editing. Carsten Brink: Methodology, Writing – review & editing, Funding acquisition. Faisal Mahmood: Conceptualization, Supervision, Methodology, Investigation, Resources, Writing – review & editing, Funding acquisition.
Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Appendix A. Supplementary material
Supplementary data to this article can be found online at https://doi.org/10.1016/j.radonc.2023.109803.
References
- [1].Atkinson AJ, Colburn WA, DeGruttola VG, DeMets DL, Downing GJ, Hoth DF, et al. Biomarkers and surrogate endpoints: preferred definitions and conceptual framework. Clin Pharmacol Ther 2001;69:89–95. 10.1067/mcp.2001.113989. [DOI] [PubMed] [Google Scholar]
- [2].O’Connor JPB, Aboagye EO, Adams JE, Aerts HJWL, Barrington SF, Beer AJ, et al. Imaging biomarker roadmap for cancer studies. Nat Rev Clin Oncol 2017;14:169–86. 10.1038/nrclinonc.2016.162. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [3].Kessler L, Barnhart H, Buckler A. The emerging science of quantitative imaging biomarkers terminology and definitions for scientific studies and regulatory submissions. Stat Methods Med Res 2015;24:9–26. 10.1177/0962280214537333. [DOI] [PubMed] [Google Scholar]
- [4].Sullivan DC. Imaging as a quantitative science. Radiology 2008:328–32. 10.1148/radiol.2482080242. [DOI] [PubMed] [Google Scholar]
- [5].Abramson R, Burton K, Yu J, Scalzetti E, Yankeelov T, Rosenkrantz A, et al. Methods and Challenges in Quantitative Imaging Biomarker Development Richard. Acad Radiol 2015;22:25–32. 10.1016/j.acra.2014.09.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [6].Gurney-Champion OJ, Mahmood F, van Schie M, Julian R, George B, Philippens MEP, et al. Quantitative imaging for radiotherapy purposes. Radiother Oncol 2020;146:66–75. 10.1016/j.radonc.2020.01.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [7].Koh DM, Collins DJ. Diffusion-weighted MRI in the body: applications and challenges in oncology. Am J Roentgenol 2007;188:1622–35. 10.2214/AJR.06.1403. [DOI] [PubMed] [Google Scholar]
- [8].van Houdt PJ, Yang Y, van der Heide UA. Quantitative magnetic resonance imaging for biological image-guided adaptive radiotherapy. Front Oncol 2021;10:1–9. 10.3389/fonc.2020.615643. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [9].Lips IM, van der Heide UA, Haustermans K, van Lin ENJT, Pos F, Franken SPG, et al. Single blind randomized Phase III trial to investigate the benefit of a focal lesion ablative microboost in prostate cancer (FLAME-trial): Study protocol for a randomized controlled trial. Trials 2011;12. 10.1186/1745-6215-12-255. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [10].Surov A, Meyer HJ, Wienke A. Correlation between apparent diffusion coefficient (ADC) and cellularity is different in several tumors: a meta-analysis. Oncotarget 2017;8:59492–9. 10.18632/oncotarget.17752. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [11].Henning EC, Azuma C, Sotak CH, Helmer KG, et al. Multispectral tissue characterization in a RIF-1 tumor model: monitoring the ADC and T2 responses to single-dose radiotherapy. Part II Magn Reson Med 2007;57:513–9. 10.1002/mrm.21178. [DOI] [PubMed] [Google Scholar]
- [12].Moffat BA, Chenevert TL, Lawrence TS, Meyer CR, Johnson TD, Dong Q, et al. Functional diffusion map: A noninvasive MRI biomarker for early stratification of clinical brain tumor response. PNAS 2005;102:5524–9. 10.1073/pnas.0501532102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [13].Kerkmeijer LGW, Groen VH, Pos FJ, Haustermans K, Monninkhof EM, Smeenk RJ, et al. Re: focal boost to the intraprostatic tumor in external beam radiotherapy for patients with localized prostate cancer: results from the FLAME randomized phase III Trial. J Clin Oncol 2021;39:787–96. 10.1200/JCO.20.02873. [DOI] [PubMed] [Google Scholar]
- [14].Tsien C, Cao Y, Chenevert T. Clinical applications for diffusion magnetic resonance imaging in radiotherapy. Semin Radiat Oncol 2014;24:218–26. 10.1016/j.semradonc.2014.02.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [15].Schurink NW, Lambregts DMJ, Beets-Tan RGH. Diffusion-weighted imaging in rectal cancer: Current applications and future perspectives. Br J Radiol 2019;92:20180655. 10.1259/bjr.20180655. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [16].Mahmood F, Johannesen HH, Geertsen P, Hansen RH. Repeated diffusion MRI reveals earliest time point for stratification of radiotherapy response in brain metastases. Phys Med Biol 2017;62:2990–3002. 10.1088/1361-6560/aa5249. [DOI] [PubMed] [Google Scholar]
- [17].Bisgaard ALH, Brink C, Fransen ML, Schytte T, Behrens CP, Vogelius I, et al. Robust extraction of biological information from diffusion-weighted magnetic resonance imaging during radiotherapy using semi-automatic delineation. Phys Imaging Radiat Oncol 2022;21:146–52. 10.1016/j.phro.2022.02.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [18].Padhani AR, Liu G, Mu-Koh D, Chenevert TL, Thoeny HC, Takahara T, et al. Diffusion-weighted magnetic resonance imaging as a cancer biomarker: Consensus and recommendations. Neoplasia 2009;11:102–25. 10.1593/neo.81328. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [19].Raaymakers BW, Lagendijk JJW, Overweg J, Kok JGM, Raaijmakers AJE, Kerkhof EM, et al. Integrating a 1.5 T MRI scanner with a 6 MV accelerator: proof of concept. Phys Med Biol 2009;54:N229–37. 10.1088/0031-9155/54/12/n01. [DOI] [PubMed] [Google Scholar]
- [20].Raaymakers BW, Jürgenliemk-Schulz IM, Bol GH, Glitzner M, Kotte ANTJ, Van Asselen B, et al. First patients treated with a 1.5 T MRI-Linac: Clinical proof of concept of a high-precision, high-field MRI guided radiotherapy treatment. Phys Med Biol 2017;62:L41–50. 10.1088/1361-6560/aa9517. [DOI] [PubMed] [Google Scholar]
- [21].Kooreman ES, van Houdt PJ, Nowee ME, van Pelt VWJ, Tijssen RHN, Paulson ES, et al. Feasibility and accuracy of quantitative imaging on a 1.5 T MR-linear accelerator. Radiother Oncol 2019;133:156–62. 10.1016/j.radonc.2019.01.011. [DOI] [PubMed] [Google Scholar]
- [22].Lewis B, Guta A, Mackey S, Gach HM, Mutic S, Green O, et al. Evaluation of diffusion-weighted MRI and geometric distortion on a 0.35T MR-LINAC at multiple gantry angles. J Appl Clin Med Phys 2021;22:118–25. 10.1002/acm2.13135. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [23].Kooreman ES, van Houdt PJ, Keesman R, Pos FJ, van Pelt VWJ, Nowee ME, et al. ADC measurements on the Unity MR-linac – A recommendation on behalf of the Elekta Unity MR-linac consortium. Radiother Oncol 2020;153:106–13. 10.1016/j.radonc.2020.09.046. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [24].Kerkmeijer LGW, Fuller CD, Verkooijen HM, Verheij M, Choudhury A, Harrington KJ, et al. The MRI-linear accelerator consortium: Evidence-based clinical introduction of an innovation in radiation oncology connecting researchers, methodology, data collection, quality assurance, and technical development. Front Oncol 2016;6:1–6. 10.3389/fonc.2016.00215. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [25].Lambregts DMJ, Beets GL, Maas M, Curvo-Semedo L, Kessels AGH, Thywissen T, et al. Tumour ADC measurements in rectal cancer: Effect of ROI methods on ADC values and interobserver variability. Eur Radiol 2011;21:2567–74. 10.1007/s00330-011-2220-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [26].Song M, Yue Y, Jin Y, Guo J, Zuo L, Peng H, et al. Intravoxel incoherent motion and ADC measurements for differentiating benign from malignant thyroid nodules: Utilizing the most repeatable region of interest delineation at 3.0 T. Cancer Imaging 2020;20:1–9. 10.1186/s40644-020-0289-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [27].Ghosh A, Singh T, Singla V, Bagga R, Khandelwal N. Comparison of absolute Apparent Diffusion Coefficient (ADC) values in ADC maps generated across different postprocessing software: Reproducibility in endometrial carcinoma. Am J Roentgenol 2017;209:1312–20. 10.2214/AJR.17.18002. [DOI] [PubMed] [Google Scholar]
- [28].de Mol van Otterloo SR, Christodouleas JP, Blezer ELA, Akhiat H, Brown K, Choudhury A, et al. The MOMENTUM study: an international registry for the evidence-based introduction of MR-guided adaptive therapy. Front Oncol 2020;10. 10.3389/fonc.2020.01328. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [29].Bertelsen AS, Schytte T, Møller PK, Mahmood F, Riis HL, Gottlieb KL, et al. First clinical experiences with a high field 1.5 T MR linac. Acta Oncol (Madr) 2019;58:1352–7. 10.1080/0284186X.2019.1627417. [DOI] [PubMed] [Google Scholar]
- [30].Shukla-Dave A, Obuchowski NA, Chenevert TL, Jambawalikar S, Schwartz LH, Malyarenko D, et al. Quantitative imaging biomarkers alliance (QIBA) recommendations for improved precision of DWI and DCE-MRI derived biomarkers in multicenter oncology trials. J Magn Reson Imaging 2019;49: e101–21. 10.1002/jmri.26518. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [31].Stejskal EO, Tanner JE. Spin diffusion measurements: Spin echoes in the presence of a time-dependent field gradient. J Chem Phys 1965;42:288–92. 10.1063/1.1695690. [DOI] [Google Scholar]
- [32].Ma C, Guo X, Liu L, Zhan Q, Li J, Zhu C, et al. Effect of region of interest size on ADC measurements in pancreatic adenocarcinoma. Cancer Imaging 2017;17:1–7. 10.1186/s40644-017-0116-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [33].Barral M, Soyer P, Ben Hassen W, Gayat E, Aout M, Chiaradia M, et al. Diffusion-weighted MR imaging of the normal pancreas: Reproducibility and variations of apparent diffusion coefficient measurement at 1.5-and 3.0-Tesla. Diagn Interv. Imaging 2013;94:418–27. 10.1016/j.diii.2012.12.007. [DOI] [PubMed] [Google Scholar]
- [34].Ueno Y, Tamada T, Sofue K, Urase Y, Hinata N, Fujisawa M, et al. Do the variations in ROI placement technique have influence for prostate ADC measurements? Acta Radiol Open. 2022;11:. 10.1177/20584601221086500205846012210865. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [35].Dinh CV, Steenbergen P, Ghobadi G, Heijmink SWTJP, Pos FJ, Haustermans K, et al. Magnetic resonance imaging for prostate cancer radiotherapy. Phys Medica 2016;32:446–51. 10.1016/j.ejmp.2016.01.484. [DOI] [PubMed] [Google Scholar]
- [36].Steenbergen P, Haustermans K, Lerut E, Oyen R, De Wever L, Van Den Bergh L, et al. Prostate tumor delineation using multiparametric magnetic resonance imaging: Inter-observer variability and pathology validation. Radiother Oncol 2015;115:186–90. 10.1016/j.radonc.2015.04.012. [DOI] [PubMed] [Google Scholar]
- [37].Mahmood F, Johannesen HH, Geertsen P, Opheim GF, Hansen RH. The effect of region of interest strategies on apparent diffusion coefficient assessment in patients treated with palliative radiation therapy to brain metastases. Acta Oncol (Madr) 2015;54:1529–34. 10.3109/0284186X.2015.1061211. [DOI] [PubMed] [Google Scholar]
- [38].Mahmood F, Hjorth Johannesen H, Geertsen P, Hansen RH. Diffusion MRI outlined viable tumour volume beats GTV in intra-treatment stratification of outcome. Radiother Oncol 2020;144:121–6. 10.1016/j.radonc.2019.11.012. [DOI] [PubMed] [Google Scholar]
- [39].Van Heeswijk MM, Lambregts DMJ, Van Griethuysen JJM, Oei S, Rao SX, De Graaff CAM, et al. Automated and semiautomated segmentation of rectal tumor volumes on diffusion-weighted MRI: can it replace manual volumetry? Int J Radiat Oncol Biol Phys 2016;94:824–31. 10.1016/j.ijrobp.2015.12.017. [DOI] [PubMed] [Google Scholar]
- [40].Hesamian MH, Jia W, He X, Kennedy P. Deep learning techniques for medical image segmentation: achievements and challenges. J Digit Imaging 2019;32:582–96. 10.1007/s10278-019-00227-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [41].Lenchik L, Heacock L, Weaver AA, Boutin RD, Cook TS, Itri J, et al. Automated segmentation of tissues using CT and MRI: a systematic review. Acad Radiol 2019;26:1695–706. 10.1016/j.acra.2019.07.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [42].Le Bihan D, Breton E, Lallemand D, Aubin ML, Vignaud J, Laval-Jeantet M. Separation of diffusion and perfusion in intravoxel incoherent motion MR imaging. Radiology 1988;168:497–505. 10.1148/radiology.168.2.3393671. [DOI] [PubMed] [Google Scholar]
- [43].Rosenkrantz AB, Padhani AR, Chenevert TL, Koh DM, De Keyzer F, Taouli B, et al. Body diffusion kurtosis imaging: basic principles, applications, and considerations for clinical practice. J Magn Reson Imaging 2015;42:1190–202. 10.1002/jmri.24985. [DOI] [PubMed] [Google Scholar]
- [44].Vidić I, Egnell L, Jerome NP, White NS, Karunamuni R, Rakow-Penner R, et al. Modeling the diffusion-weighted imaging signal for breast lesions in the b = 200 to 3000 s/mm2 range: quality of fit and classification accuracy for different representations. Magn Reson Med 2020;84:1011–23. 10.1002/mrm.28161. [DOI] [PubMed] [Google Scholar]
- [45].Iima M, Partridge SC, Le Bihan D. Six DWI questions you always wanted to know but were afraid to ask: clinical relevance for breast diffusion MRI. Eur Radiol 2020;30:2561–70. 10.1007/s00330-019-06648-0. [DOI] [PubMed] [Google Scholar]
- [46].De Deene Y, Van De Walle R, Achten E, De Wagter C. Mathematical analysis and experimental investigation of noise in quantitative magnetic resonance imaging applied in polymer gel dosimetry. Signal Process 1998;70:85–101. 10.1016/s0165-1684(98)00115-7. [DOI] [Google Scholar]
- [47].Dietrich O, Heiland S, Sartor K. Noise correction for the exact determination of apparent diffusion coefficients at low SNR. Magn Reson Med 2001;45:448–53. . [DOI] [PubMed] [Google Scholar]
- [48].Prah DE, Paulson ES, Nencka AS, Schmainda KM. A simple method for rectified noise floor suppression: phase- corrected real data reconstruction with application to diffusion-weighted imaging. Magn Reson Med 2010;64:418–29. 10.1002/mrm.22407. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [49].National Electrical Manufacturers and others. Determination of Signal-to-Noise Ratio (SNR) in Diagnostic Magnetic Resonance Imaging. NEMA Standards Publication MS 1–2008:1–21. https://www.nema.org/standards/view/Determination-of-Signal-to-Noise-Ratio-in-Diagnostic-Magnetic-Resonance-Imaging [accessed 17.01.2023]. [Google Scholar]
- [50].DWI MR Biomarker Committee B. QIBA Profile: Diffusion-Weighted Magnetic Resonance Imaging (DWI). Quantitative Imaging Biomarkers Alliance; 2019. https://qibawiki.rsna.org/images/7/7d/QIBADWIProfilev1.45_20170427_v5_accepted_linenumbers.pdf [accessed 17.01.2023].
- [51].Pathak R, Tian J, Thacker NA, Morris DM, Ragheb H, Saunders C, et al. Considering tumour volume for motion corrected DWI of colorectal liver metastases increases sensitivity of ADC to detect treatment-induced changes. Sci Rep 2019;9:1–10. 10.1038/s41598-019-40565-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [52].Chenevert TL, Malyarenko DI, Newitt D, Li X, Jayatilake M, Tudorica A, et al. Errors in quantitative image analysis due to platform-dependent image scaling. Transl Oncol 2014;7:65–71. 10.1593/tlo.13811. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [53].Tyagi N, Cloutier M, Zakian K, Deasy JO, Hunt M, Rimner A. Diffusion-weighted MRI of the lung at 3T evaluated using echo-planar-based and single-shot turbo spin-echo-based acquisition techniques for radiotherapy applications. J Appl Clin Med Phys 2019;20:284–92. 10.1002/acm2.12493. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [54].Reynaud O Time-dependent diffusion MRI in cancer: Tissue modeling and applications. Front Phys 2017;5:1–16. 10.3389/fphy.2017.00058. [DOI] [Google Scholar]
- [55].Le Bihan D, Poupon C, Amadon A, Lethimonnier F. Artifacts and pitfalls in diffusion MRI. J Magn Reson Imaging 2006;24:478–88. 10.1002/jmri.20683. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.