Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2020 Jan 30;15(1):e0228424. doi: 10.1371/journal.pone.0228424

Whole body MRI in multiple myeloma: Optimising image acquisition and read times

Saurabh Singh 1, Elly Pilavachi 2, Alexandra Dudek 2, Timothy J P Bray 1, Arash Latifoltojar 1, Kannan Rajesparan 2, Shonit Punwani 1, Margaret A Hall-Craggs 1,*
Editor: Pascal A T Baltzer3
PMCID: PMC6992198  PMID: 31999774

Abstract

Objective

To identify the whole-body MRI (WB-MRI) image type(s) with the highest value for assessment of multiple myeloma, in order to optimise acquisition protocols and read times.

Methods

Thirty patients with clinically-suspected MM underwent WB-MRI at 3 Tesla. Unenhanced Dixon images [fat-only (FO) and water-only (WO)], post contrast Dixon [fat-only plus contrast (FOC) and water-only plus contrast (WOC)] and diffusion weighted images (DWI) of the pelvis from all 30 patients were randomised and read by three experienced readers. For each image type, each reader identified and labelled all visible myeloma lesions. Each identified lesion was compared with a composite reference standard achieved by review of a complete imaging dataset by a further experienced consultant radiologist to determine truly positive lesions. Lesion count, true positives, sensitivity, and positive predictive value were determined. Time to read each scan set was recorded. Confidence for a diagnosis of myeloma was scored using a Likert scale. Conspicuity of focal lesions was assessed in terms of percent contrast and contrast to noise ratio (CNR).

Results

Lesion count, true positives, sensitivity and confidence scores were significantly higher when compared to other image types for DWI (P<0.0001 to 0.003), followed by WOC (significant for sensitivity (P<0.0001 to 0.004), true positives (P = 0.003 to 0.049) and positive predictive value (P< 0.0001 to 0.006)). There was no statistically significant difference in these metrics between FO and FOC. Percent contrast was highest for WOC (P = 0.001 to 0.005) and contrast to noise ratio (CNR) was highest for DWI (P = 0.03 to 0.05). Reading times were fastest for DWI across all observers (P< 0.0001 to 0.014).

Discussion

Observers detected more myeloma lesions on DWI images and WOC images when compared to other image types. We suggest that these image types should be read preferentially by radiologists to improve diagnostic accuracy and reporting efficiency.

Introduction

Whole body MRI (WB-MRI) is a valuable tool for assessing disease in patients with multiple myeloma (MM) [15], and has been recommended by both the International Myeloma Working Group (IMWG) and National Institute for Health and Care Excellence (NICE) as first line imaging for the initial assessment of disease in patients with suspected MM [6,7]. Information from MRI is also incorporated into staging systems such as the Durie-Salmon PLUS staging system [8], and the number of lesions detected correlate closely with mortality [9].

To facilitate wider use of WB-MRI, efforts need to be made to improve the ‘value’ of WB-MRI by reducing acquisition time, reading time and overall cost [10]. A strategy to improve value of WB-MRI would be to choose sequences which provide sufficient diagnostic information in a reasonable time and remove sequences which do not add any additional diagnostic information. Recent studies have reported the utility of faster sequences in whole body protocols for MM such as Dixon MRI and diffusion weighted imaging [5,1113]. These sequences have shorter acquisition times than conventional spin echo sequences and can provide additional functional information such as disease activity or bone marrow composition. In particular, Dixon imaging can generate four image types: in-phase (IP), out of phase (OP), water-only (WO) and fat-only (FO). These image types provide anatomical information comparable to conventional spin echo sequences but also allow the assessment of water and fat content of bone marrow and myeloma lesions. Dixon imaging is also recommended as a core component of imaging protocols and considered important in reducing false positive lesions on DWI [13]. Gadolinium-enhanced sequences have also been shown to be useful in the assessment of MM [14], and can be included into acquisition protocols [13]. Therefore, there are a number of image types in a typical WB-MRI protocol that provide diagnostic information, but it is unclear whether all sequences need to be read in order to detect bone myeloma lesions. There is a need to elucidate which sequences provide the most diagnostic information and, on that basis, eliminate less useful sequences from the acquisition and read protocols.

There are only a few studies which have examined the diagnostic performance of sequences and compared them systematically. A study by Weininger et al 2009 [14], compared conventional pre- and post-contrast spin echo sequences and a T2-weighted inversion recovery sequence (T2w-TIRM) in detecting myeloma lesions. They found that contrast enhanced T1W and the T2w-TIRM had the highest level of sensitivity [14]. A previous study by our group investigated the diagnostic performance of different non-contrast Dixon image types in detecting myeloma lesions. This study showed that FO and WO only pre-contrast images detected the most myeloma lesions compared to in phase and out of phase images [15]. No studies have directly compared the performance of unenhanced Dixon images, contrast enhanced Dixon images and DWI.

In this study, we aimed to evaluate radiologists’ myeloma lesion detection rates on unenhanced fat only (FO), contrast-enhanced fat only (FOC), unenhanced water only (WO), contrast-enhanced water only (WOC) and diffusion weighted images (DWI). We hypothesised that readers would detect more true positive lesions on contrast enhanced images compared to unenhanced images. We also hypothesised that diffusion weighted imaging would be most sensitive for detecting focal lesions.

Materials and methods

Subjects

Institutional review board approved this prospective study (Research Ethics Committee reference, NRES London-Bromley, 12/LO/0428) and all patients gave written informed consent.

This study prospectively enrolled thirty patients (13 males and 17 females, median age 55, age range 36–82) who were being investigated for suspected symptomatic multiple myeloma between June 2012 and September 2014. This cohort of patients has also been studied in previous publications on treatment response and detection of myeloma lesions on pre-contrast imaging [15,16]. Patients were excluded from the study if they had a history of previous malignancy or previous chemotherapy/radiotherapy, estimated GFR < 50 mL/min/1.73 m2, were unable to given informed consent or had a contraindication to MRI scanning. Further investigation showed that 26 out of 30 had MM, two a had solitary plasmacytoma and one had monoclonal gammopathy of uncertain significance. The clinical and biochemical parameters for each patient are recorded in Table 1 including baseline interphase fluorescence in situ hybridisation (FISH) and genetic risk was determined according to International Myeloma Working Group recommendations [17].

Table 1. Patient data.

Patient characteristics Number or median (range)
Age (years) 56 (36–80)
Chain isotype
IgG 17
IgA 5
Light chain 4
MGUS 1
Solitary plasmacytoma 2
Smouldering MM 1
ISS Stage
I 13
II 13
III 4
Biochemical parameters
Bone marrow percentage plasma cells 65 (0–90)
Beta-2 microglobulin (mg/l) 3.3 (1.3–11.3)
Albumin (g/l) 40 (30–53)
Creatinine 56 (77.5–105)
Genetic risk group
Low/Standard Risk 17
High Risk 9

Patient demographics, disease parameters and treatment. ISS, international staging system (15)

Scan acquisition

WB-MRI was carried out on a 3.0T wide-bore system (Ingenia; Phillips Healthcare, Best, Netherlands) using two anterior surface coils, a head coil and an integrated posterior coil. The imaging protocol was comprised of coronal pre- and post-contrast modified Dixon (Dixon) acquisitions from which fat, water, in-phase and out-of-phase images were reconstructed on the scanner using a two-point method [18] (TR 3.0ms, TE 1.02–18, flip angle 15°, slice thickness 5mm, pixel bandwidth 1992Hz, acquisition matrix 196 x 238, SENSE factor 2, number of slice 120) in addition to diffusion and post-contrast imaging covering vertex to toe using ten contiguous anatomical stations (Table 2). The Dixon images were in the coronal plane and DWI in the transverse plane. The highest b value (1000 s/mm2) images were selected based on previous studies which show that the higher b value, the higher the contrast between normal and infiltrated bone marrow [14].

Table 2. Sequence parameters (15).

Sequence Parameters
Parameters Dixon
(pre and post contrast)
DWI (b0, 100, 300, 1000 s/mm2)
Imaging Plane Coronal Transverse
Sequence type Gradient echo Single-shot spin echo with echo planar readout
Echo time (ms) 1.02/1.8 71
Repetition time (ms) 3 6371
Field of View (mm x mm) 502 x 300 500 x 306
Voxel size (mm x mm) 2.1 x 2.1 4 x 4.2
Number of Slices 120 40
Slice Thickness (mm) 5 5
Acquisition Matrix 144 x 238 124 x 72
ETL 2 39
Acceleration factor (SENSE) 2 2.5
Pixel Bandwidth (Hz) 1992 3369
Scan time (s) 17 152

Image assessment

The five individual image types including diffusion weighted images (b = 1000), unenhanced FO and WO Dixon images, and enhanced FO and WO Dixon images for all thirty patients (150 image series) were randomised and read by three readers with experience in whole body MR imaging (between three and fifteen years). In other words, the five sets of images of each individual patient were not read sequentially but in a random order. On each image series, each reader was asked to label the number of myeloma lesions present in the bony pelvis (pubis, ischium, ilium and sacrum) up to a maximum of 20. In cases where the reader assessed the disease to be diffuse or over 20 focal lesions, the patient was assigned a lesion count of 20. In addition, the readers were asked to provide a confidence score based on their degree of certainty that there were myeloma lesions in the pelvis on a 4-point Likert scale (1-no lesions, 2-indeterminate lesions, 3-likely myeloma lesions, 4-very likely myeloma lesions). The labelled images from the readers was then compared to a composite reference standard consisting of diffusion-weighted, pre- and post-contrast Dixon imaging, which had been evaluated simultaneously by a consultant radiologist with over 20 years of experience in myeloma and MR imaging. Lesions were labelled on the reference imaging as positive for myeloma if they demonstrated a combination of features that together, in the view of the experienced radiologist, made the lesion highly likely to represent myeloma. These features included abnormal marrow signal compared to background marrow (i.e. hypointense on IP and FO images, and hyperintense on WO images) and contrast enhancement and increased signal on high b value diffusion images. If this signal abnormality was localised on a background of normal bone marrow signal, the disease pattern was ascribed to be focal and where this was diffuse with only localised areas of normal bone marrow signal, the disease pattern was assessed to be diffuse. We did not assign a maximum lesion count for the reference standard and all individual lesions were labelled, enabling direct comparison for any lesions labelled by the readers. We also recorded whether patients had focal or diffuse disease (the diffuse classification included patients with focal-on-diffuse infiltration), based on the reference labelled imaging.

Each labelled lesion was compared with the reference standard to determine the number of true positive lesions (TP), false positive lesions (FP) (i.e. those that were incorrectly identified as lesions); and false negative lesions (FN) (these were the ‘reference-standard lesions’ which were not identified). This data was used to determine the mean per-set lesion count, sensitivity (TP/TP+FN), positive predictive value (TP/TP + FP) and mean confidence score.

Although other sequences were acquired during the standard whole body protocol, these were not included for assessment in the current study as these are acquired mainly for assessment of extramedullary disease (axial T2) and for measurement of response to therapy (T2, ADC and fat fraction maps before and after treatment).

Design and statistics

The study design is illustrated in Fig 1. The statistical analysis plan was based on that used in a previous study [15]. A multilevel mixed-effects linear regression model was used to compare each lesion detection metric (lesion count, sensitivity, positive predictive value and mean confidence score), across the five image types using Stata [Stata IC Version 14.1, College Station, USA]. This analysis is designed to compare metrics between groups whilst accounting for the multiple levels within the data. Image type (i.e. FO, FOC, WO, WOC, DWI) was used as the predictor variable, and the value of the specific lesion detection metric being analysed (i.e. lesion count, sensitivity, positive predictive value or mean confidence score) was used as the outcome variable. The two levels data were clustered were at the level of ‘subject’ (patient) and ‘observer’ (reader). The same analysis was used for the two subgroups of patients who had diffuse and focal disease (as determined by the reference standard assessment).

Fig 1. Study design (15).

Fig 1

Percent contrast and contrast-to-noise ratio

In patients with at least three focal lesions greater than 5 mm in diameter, visible on all image types, percent contrast and contrast-to-noise ratio (CNR) were calculated using a previously described method [11]. Specifically, circular regions of interest (ROIs) were drawn on focal lesions greater than 3mm in diameter and further three ROIs were placed in areas of bone marrow without focal lesions in the sacrum and iliac bones.

Percent contrast was calculated as:

PercentContrast=(SaSb)(Sa+Sb) [1]

where Sa is the mean signal intensity of myeloma lesions and Sb is the background marrow signal intensity.

Similarly, CNR was calculated as:

CNR=|SaSb|(Sasd+Sbsd)/2 [2]

where Sasd and Sbsd are the mean within-ROI standard deviation values for myeloma lesions and background marrow respectively. A one-way analysis of variance (ANOVA) with a post-hoc Tukey Kramer multiple comparison test was used to compare percent contrast and CNR between image series.

Patient by patient analysis

A patient by patient analysis was undertaken after the initial analysis to determine which image types add diagnostic value. The aim of this analysis is to establish whether the observer identified any additional lesions on other image types when compared to the image type which scores the highest on the metrics examined in this study such as sensitivity, positive predictive value, true positive lesion rate and confidence.

Reading time

The time taken to read each image series was recorded by one of the authors. All readers were aware they were being timed and there were no specific instructions to influence reading times. All 150 image series (five image sequences for each of the 30 patients) were randomised and therefore the sequence of patients and image types were random.

Results

Three readers read a total of 150 image series and identified 1243, 1440 and 1207 lesions respectively, compared to 1952 reference lesions. An example of a patient with a focal lesion on the five image types is shown in Fig 2. Fig 3 charts mean lesion count, sensitivity, positive predictive value and confidence for each image type and all observers. A summary of this data and true positive count for each image type for all patients (focal and diffuse disease) is given in Table 3. Sub-group analysis for focal disease only (24 patients) and diffuse disease only (6 patients) follow in Tables 4 and 5. A summary of the mean time taken for readers to read each image type for each patient is recorded in Table 6.

Fig 2. Example of a focal MM lesion.

Fig 2

There is a focal lesion in the left hemi sacrum (arrow) on the five image types. The Dixon image types (FO, WO, FOC, WOC) are in the coronal plane and the DWI image is in the transverse plane).

Fig 3. Lesion count, sensitivity, positive predictive value (PPV) and confidence for each image type.

Fig 3

Individual observers are shown in colour (see legend), and the mean value across all three observers is shown in black. Error bars indicate the 95% confidence interval.

Table 3. Summary data for all patients.

All patients (n = 30)
Lesion Count
Image type Mean Differences in means (95% CI) p-value
FO 7.8 Baseline - -
WO 7.5 -0.37 -1.9 to 1.3 0.713
FOC 7.5 -0.37 -1.9 to 1.3 0.713
WOC 8.8 1.29 -0.55 to 2.66 0.196
DWI 11.7 4.83 2.34 to 5.55 <0.001
Sensitivity
Image type Mean Differences in means (95% CI) p-value
FO 0.46 Baseline - -
WO 0.47 0.11 -0.09 to 0.102 0.909
FOC 0.46 0.04 -0.094 to 0.098 0.968
WOC 0.61 2.96 0.049 to 0.242 0.003
DWI 0.72 5.32 0.165 to 0.358 <0.001
PPV
Image type Mean Differences in means (95% CI) p-value
FO 0.75 Baseline - -
WO 0.83 1.68 -0.011 to 0.138 0.094
FOC 0.77 0.41 -0.058 to 0.088 0.683
WOC 0.94 4.55 0.095 to 0.24 <0.001
DWI 0.86 3.13 0.043 to 0.187 0.002
Confidence
Image type Mean Differences in means (95% CI) p-value
FO 2.9 Baseline - -
WO 2.6 -2.12 -0.41 to -0.02 0.034
FOC 2.7 -1.23 -0.32 to 0.07 0.22
WOC 2.9 0.67 -0.13 to 0.26 0.503
DWI 3.2 3.57 0.16 to 0.55 <0.001
True Positives
Image type Mean Differences in means (95% CI) p-value
FO 6.2 Baseline - -
WO 7.0 1.02 -0.697 to 2.208 0.308
FOC 6.6 0.45 -1.112 to 1.786 0.653
WOC 8.4 2.98 0.759 to 3.663 0.003
DWI 10.6 5.92 2.937 to 5.841 <0.001

Lesion count, true positives, sensitivity, positive predictive value and confidence were compared between the five image types for all patients, using the fat only images as the baseline. Regression analyses used image types as the predictor variable, and lesion count/TP/sensitivity/confidence were used as the outcome variable. Mean values were calculated by the regression analysis and were equal to means calculated manually from all patients and all three radiologists.

Table 4. Summary data for the diffuse subgroup.

Diffuse disease (N = 6)
Lesion Count
Image type Mean Differences in means (95% CI) p-value
FO 7.7 Baseline - -
WO 6.8 -0.28 -7.04 to 5.26 0.777
FOC 9.6 0.62 -4.21 to 8.1 0.536
WOC 9.3 0.51 -4.54 to 7.77 0.608
DWI 12.9 1.68 -0.88 to 11.43 0.093
Sensitivity
Image type Mean Differences in means (95% CI) p-value
FO 0.337 Baseline - -
WO 0.314 -0.14 -0.34 to 0.30 0.855
FOC 0.43 0.55 -0.23 to 0.41 0.582
WOC 0.48 0.92 -0.17 to 0.47 0.357
DWI 0.65 2.93 0.11 to 0.56 0.07
PPV
Image type Mean Differences in means (95% CI) p-value
FO 0.74 Baseline - -
WO 0.73 -0.08 -0.25 to 0.27 0.934
FOC 0.64 -0.79 -0.36 to 0.15 0.43
WOC 0.93 1.49 -0.06 to 0.45 0.135
DWI 0.86 0.8 -0.15 to 0.36 0.423
Confidence
Image type Mean Differences in means (95% CI) p-value
FO 2.7 Baseline - -
WO 2.6 -0.44 -.91 to 0.57 0.658
FOC 2.67 -0.29 -0.85 to 0.63 0.768
WOC 2.78 0 -0.74 to 0.74 1
DWI 3.17 1.03 -0.35 to 1.12 0.302

Lesion count, sensitivity, positive predictive value and confidence were compared between the five image types for diffuse disease only, using the fat only images as the baseline. Regression analyses used image types as the predictor variable, and lesion count/sensitivity/confidence were used as the outcome variable. Mean values were calculated by the regression analysis and were equal to means calculated manually from all patients and all three radiologists.

Table 5. Summary data for the focal subgroup.

Focal disease (N = 24)
Lesion Count
Image type Mean Differences in means (95% CI) p-value
FO 7.8 Baseline - -
WO 7.6 0.22 -2.13 to 2.67 0.826
FOC 6.9 -0.63 -3.17 to 1.63 0.531
WOC 8.7 0.94 -1.25 to 3.55 0.349
DWI 11.4 3.21 1.53 to 6.33 0.001
Sensitivity
Image type Mean Differences in means (95% CI) p-value
FO 0.49 Baseline - -
WO 0.5 0.37 -0.10 to 0.15 0.715
FOC 0.47 -0.31 -0.15 to 0.11 0.754
WOC 0.62 2.11 0.01 to 0.26 0.04
DWI 0.78 4.02 0.13 to 0.39 <0.001
PPV
Image type Mean Differences in means (95% CI) p-value
FO 0.74 Baseline - -
WO 0.86 1.92 -0.001 to 0.202 0.055
FOC 0.81 1.14 -0.04 to 0.159 0.254
WOC 0.93 3.55 0.081 to 0.28 <0.001
DWI 0.856 2.26 0.02 to 0.21 0.024
Confidence
Image type Mean Differences in means (95% CI) p-value
FO 2.9 Baseline - -
WO 2.7 -1.14 -0.05 to 0.14 0.256
FOC 2.8 -0.8 -0.47 to 0.2 0.422
WOC 3 0.49 -0.25 to 0.41 0.626
DWI 3.2 2.16 0.03 to 0.69 0.031

Lesion count, sensitivity, positive predictive value and confidence were compared between the five image types for focal disease only, using the fat only images as the baseline. Regression analyses used image types as the predictor variable, and lesion count/sensitivity/confidence were used as the outcome variable. Mean values were calculated by the regression analysis and were equal to means calculated manually from all patients and all three radiologists.

Table 6. Reader timings.

Reader Timings
Image type Mean time (s) Differences in Means (95% CI) P = value
FO 99.38 Baseline - -
WO 77.77 7.15 -35.76 to -7.61 0.003
FOC 97.17 7.07 -16.51 to 11.2 0.705
WOC 82.65 7.14 -31.88 to -3.89 0.012
DWI 60.36 7.13 -53.28 to 25.32 0.000

Reader timings were compared between the five image types for all patients, using the fat only images as the baseline. Mean values were calculated by the regression analysis, and were equal to means calculated manually from all patients and all three radiologists.

Sensitivity and positive predictive value

The mean sensitivity for each image type was 0.46 for FO, 0.47 for WO, 0.46 for FOC, 0.61 for WOC and 0.72 for DWI. Sensitivity was significantly higher on the DWI images (p values from <0.001 to 0.018). WOC was the next most sensitive at detecting lesions (p values <0.001 to 0.004). There was no significant difference between WO and FO images (p = 0.909), or between FOC and FO images (p = 0.968).

The mean positive predictive values were 0.75 for FO, 0.83 for WO, 0.77 for FOC, 0.94 for WOC, and 0.86 for DWI. Positive predictive values were significantly higher on WOC images compared to all image types (p values <0.001to 0.006) except DWI (p = 0.151). There was no significant difference in PPV for WO compared to FO (p = 0.094) and FOC compared to FO (p = 0.683).

Confidence score

The mean confidence scores were 2.9 for FO, 2.6 for WO, 2.7 for FOC, 2.9 for WOC and 3.2 for DWI (4-point Likert confidence scale: 1-no lesions, 2-indeterminate, 3-likely myeloma, 4-definitely myeloma).

Confidence scores were lowest for WO images, and FOC images, and not significant for WOC. They were, however, significantly higher for DWI images in comparison to other image types (p values from 0.000 to 0.004). Readers were more confident at identifying myeloma lesions on WOC compared to WO (p = 0.005). There was no statistically significant difference between FO and FOC images (p = 0.220).

Sub-group analysis

Of 30 patients, 25 patients had focal lesions and 5 patients had diffuse disease. A summary of the mean lesion count, sensitivity, positive predictive value and confidence score for each image type for the diffuse disease group is given in Table 3 and the focal group in Table 4.

Diffuse disease group

Lesion count and sensitivity: There was no significant difference in readers detecting diffuse disease between all image types.

There was higher sensitivity on DWI compared to FO images, but not significant (p = 0.07). There was no significant difference between WO, FOC, and WOC compared to FO (p = 0.855, p = 0.388 and p = 0.21 respectively).

Positive Predictive Value and Confidence: For positive predictive value there was no significance for WO, FOC, WOC or DWI compared to FO images (p = 0.934, p = 0.43, p = 0.135 and p = 0.423 respectively).

For confidence there was also no significance WO, FOC, WOC and DWI compared to FO images (p = 0.658, p = 0.768, p = 1, and p = 0.302 respectively).

Focal disease group

Lesion count and sensitivity: Lesions counts were significantly higher on DWI images compared to FO images in patients with focal disease (3.21, p = 0.001). There was no significant difference between the following image type comparisons FO vs WO, FOC and WOC (p = 0.826, p = 0.531, p = 0.349 respectively).

Sensitivity was significantly higher on DWI compared to FO images (4.02, p<0.0001). WOC images also demonstrated higher sensitivity compared to FO images (2.11, p = 0.04). WO and FOC images were not significantly different when compared to FO images (p = 0.715 and p = 0.754 respectively).

Positive Predictive Value and Confidence: Positive predictive value was significantly higher on DWI images compared to FO images (2.26, p = 0.024) and on WOC images compared to FO images (3.55, p<0.0001). No significant difference could be demonstrated on WO and FOC compared to FO (p = 0.055, p = 0.254, respectively).

For reader confidence DWI was significantly higher compared to FO images (2.16, p = 0.031), whilst there was so no significant difference of WO, FOC and WOC images compared to FO images (p = 0.256, p = 0.422 and p = 0.626 respectively).

Patient by patient analysis of FO, WO, WOC, DWI: As the above results show, DWI was significantly better than other image types for sensitivity, detecting true positive lesions and reader confidence. Therefore, to ascertain whether other image types would add any further diagnostic information when detecting myeloma lesions in addition to DWI, a patient by patient analysis was undertaken for each observer. The image types examined were WOC, FO and WO as these had the highest values for sensitivity, true positive lesion rate and positive predictive value. An image type was assumed to add value if the reader observed more true positive lesions or fewer false positive lesions compared to DWI. In the 90 sets of comparisons (30 patients x 3 observers), WOC added value in17, FO in 14 and WO in 6 out of 90 comparisons.

Reader timings

The average read time for each image type was: FO 98s, WO 77s, FOC 99s, WOC 84s and DWI 67s across all readers. When combined, total mean reading time for the 5 image types of the pelvis was 417s for a patient (averaged across all three readers). Compared to the baseline (FO), reading time was significantly faster for WO (7.15, p = 0.003), WOC (7.14, p = 0.012) and fastest for DWI (7.13, p<0.0001).

Percent contrast and contrast-to-noise ratio

ROI analysis revealed that CNR was highest for DWI images at 65.7 (values for each image type were, FO 58.0, WO 34.7, FOC 32.7, WOC 29.9). It was significantly higher on DWI compared to WOC (p = 0.03), FOC (p = 0.049), WO (p = 0.05).

Percent contrast for ROIs was highest for WOC at 0.61 followed by DWI (0.51), FO (0.42), FOC (0.39) and lowest for WO (0.15). These data are charted in Fig 4.

Fig 4. Comparison of percent contrast and CNR between groups.

Fig 4

The figures show the results of a post- hoc multiple comparison test from a one-way ANOVA. Estimates of Percent Contrast and CNR are shown as circles; the comparison intervals for each group are shown as the whiskers.

Discussion

In this study of whole body imaging for assessment of skeletal disease, lesion counts, true positive counts, sensitivity, positive predictive value, reader confidence and reading time were compared across pre-contrast fat only (FO) and water only (WO) Dixon images, post contrast fat only (FOC) and water only (WOC) images, and diffusion weighted imaging (DWI). We have shown that DWI and WOC Dixon images are superior to other Dixon image types in detecting myeloma lesions. For all lesions combined (diffuse and focal disease) DWI proved to be the superior compared to all other image types for the following variables: lesion count, sensitivity, PPV, confidence and true positive. WOC was the next best image type with statistically significant differences in sensitivity, PPV and true positive counts compared to other image types apart from DWI. This was also reflected in the subgroup analysis which showed that for patients with focal disease, DWI and WOC were superior to the other image types for sensitivity and PPV. DWI performed best in detecting diffuse disease in all categories apart from PPV, however the results did not reach statistical significance. Lesion detection rates are highest as well as fastest (reading time in seconds) on DWI, followed by WOC images.

The observed bone lesion detection rates correlated with the results of the conspicuity analysis, which demonstrated that the DWI sequence had the highest contrast to noise ratio, and WOC had the highest percent contrast on ROI analysis. This is likely due to the highly cellular nature of myeloma lesions which causes replacement of normal fatty marrow [11,19], leading to an increase in signal on diffusion weighted imaging and enhancement of lesions against normal hypointense bone marrow on both DWI and WOC. There was no statistical difference between FO and FOC in lesion conspicuity and lesion detection rates. This suggests that enhancement of MM lesions on FOC does not improve conspicuity whereas on WOC, the hyperintense enhancement on the background of hypointense normal bone marrow leads to increased conspicuity and better detection. Both DWI and WOC image types provide functional information about myeloma lesions such as cellular density and vascularity leading to improved lesion contrast when compared to unenhanced Dixon images. This increase in conspicuity may be particularly important in myeloma patients with a higher bone marrow cell percentage in which lesions can be difficult to detect [11,20].

To the best of our knowledge, this is the first study to compare lesion detection rates on pre-contrast and post-contrast Dixon image types and DWI. The findings of this study are consistent with previous studies and recently published guidelines for the acquisition, interpretation and reporting of Whole-Body MRI in Myeloma (MY-RADS) [21]. Diffusion weighted imaging has been established as a sensitive sequence in detecting focal myeloma lesions compared to FDG-PET in a few studies [22,23] and forms a core component of the recommended clinical protocol but suffers from the detection of false positive lesions. Dixon imaging is also recommended as a core component of imaging protocols for both anatomical and functional information such as fat fraction measurements [24]. The current guidelines recommend comparison with morphological imaging such as water only pre-contrast and fat fraction images to mitigate false positives detected on DWI. Our study has shown that post-contrast WOC is superior to WO and FO for sensitivity, positive predictive value and true positive detection rate. In addition, when a patient by patient analysis was undertaken, WOC added diagnostic value to DWI in 17 patients compared, WO (6) and FO (4). An image type was assumed to add value if the reader observed more true positive lesions or fewer false positive lesions compared to DWI, which would result in a change of management according to the International Myeloma Working Group (IMWG) guidelines. In patients with multiple myeloma lesions, detecting additional lesions may not change management but in patients with smouldering myeloma (SMM), detecting more than one unequivocal myeloma lesion puts patients in a high risk category leading to a change in management according to the IMWG guidelines [25]. Our results show that even the best performing image type (DWI) identifies fewer true positive lesions overall across all readers compared to the reference standard which included all image types. To optimise read times, a possible approach would be that radiologists assess post contrast WOC and DWI together using image linking and co-registration in order to detect truly positive lesions and reduce the false positives that DWI may detects on its own. This can be supplemented by review of the other imaging sequences, particularly where identifying a further lesion would impact on patient management. As commented above, identifying a second lesion in a patient with smouldering myeloma would have greater impact on disease management that identifying an additional lesion in a patient where multiple lesions were already seen. This reading strategy would reduce overall read times in some patients. The disadvantage of this approach is that it requires the administration of intravenous contrast which is associated with increased cost and the low risk of contrast reaction. However contrast administration also enables dynamic contrast enhanced imaging (WB-DCE) which has been shown to be of value in monitoring treatment [26,27] and as our study suggests would better reduce false positive lesions detected by DWI compared to unenhanced Dixon image types. We suggest that the choice of morphological imaging should be post contrast WOC in protocols which have post-contrast imaging and FO images in protocols which are unenhanced, in keeping with previous work and the value added shown by the patient by patient analysis [15].

In order to increase the value of MR, both acquisition time and read time need to be optimised. Our results show that for assessing the pelvis only, read times can be several minutes if all image types are assessed. If only the two most superior image types are assessed, the reading time saved would be 4 minutes 33 seconds (417s vs 143s) for a single image station. Whole body imaging in adults can require typically up to 7 stations, and the time saving for the whole scan read could potentially be in the order of tens of minutes if only 2 sequences are read rather than the entire datasets.

This study has some limitations. The reference standard included the image types that performed the best (DWI and WOC) and it may be argued that this can cause bias. However, the reference radiologist analysed all image types examined including in phase and out of phase Dixon images in addition to the five image types compared in this study, to determine a true positive lesion. Therefore, the potential bias is equal for all image types assessed and practically this represents the best possible reference standard available as the patients did not undergo another contemporaneous imaging modality (such as Pet-CT) and not all patients had equivalent follow up imaging. The scoring system used an upper limit of twenty lesions as the maximum lesions a reader detected. Therefore, in patients with high tumour load, we may not have captured differences in lesion detection. The scoring system also penalised observers for failing to detect diffuse disease which is reflected in the low sensitivity scores compared to the reference standard. Another limitation of this study is that we have not compared lesion detection rates on other sequences which are used in other MM protocols such as STIR images. However, we feel that the time saving produced by Dixon imaging is sufficient justification for removing STIR imaging from WB-MRI protocols. Only six patients had a diffuse pattern of involvement and although DWI had higher indices than other image types, this did not reach statistical significance. This study has focussed on detection of bone disease and sequences required to assess extramedullary disease has not been considered. Reading times were recorded per image type. However in clinical practice, two or more types of images may be reviewed in parallel. Consequently simply summating reading times gives an indication of the potential time saving, but is not a precise measurement of time saved to read a whole scan. In our study we have not assessed the use of fat fraction maps as in clinical practice these are used for assessment of disease response during therapy, rather than for initial lesion identification.

In conclusion, the findings of this study suggest that DWI followed by WOC offer the highest relative value in terms of scan acquisition and read times, for bone lesion assessment in patients undergoing WBMRI for suspected myeloma. Our study suggests that WOC may be better than WO in supplementing DWI in the assessment of myeloma in treatment-naïve patients.

Supporting information

S1 File. Raw data showing lesion counts, true positives, false positives, false negatives, sensitivity, PPV and confidence for each of the four Dixon image types, for each observer and each subject.

Please refer to the Materials and Methods section for more information on data structure.

(XLSX)

Acknowledgments

This work was undertaken at University College Hospitals NHS Trust and University College London. MHC, SP, TJPB and AL work within the National Institute for Health Research (NIHR) Biomedical Research Centre. The views expressed in this publication are those of the authors and not necessarily those of the NIHR, UK Department of Health. This work was supported by CRUK/EPSRC KCL/UCL Comprehensive Cancer Imaging Centre.

Data Availability

All relevant data are within the paper and its Supporting Information files.

Funding Statement

This work was undertaken at UCLH/UCL, which receives funding from the Department of Health’s the National Institute for Health Research (NIHR) Biomedical Research Centre (BRC) funding scheme. The views expressed in this publication are those of the authors and not necessarily those of the UK Department of Health. MHC, SP, TJPB and AL are supported by the NIHR University College London Hospitals BRC. This work was supported by CRUK/EPSRC KCL/UCL Comprehensive Cancer Imaging Centre. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.International Myeloma Working Group. Criteria for the classification of monoclonal gammopathies, multiple myeloma and related disorders: a report of the International Myeloma Working Group. Br J Haematol. 2003;121(5):749–57. [PubMed] [Google Scholar]
  • 2.Dimopoulos M, Terpos E, Comenzo RL, Tosi P, Beksac M, Sezer O, et al. International myeloma working group consensus statement and guidelines regarding the current role of imaging techniques in the diagnosis and monitoring of multiple Myeloma. Leuk Off J Leuk Soc Am Leuk Res Fund, UK. 2009;23(9):1545–56. [DOI] [PubMed] [Google Scholar]
  • 3.Dimopoulos M, Kyle R, Fermand JP, Rajkumar SV, San Miguel J, Chanan-Khan A, et al. Consensus recommendations for standard investigative workup: Report of the International Myeloma Workshop Consensus Panel 3. In: Blood. 2011. p. 4701–5. 10.1182/blood-2010-10-299529 [DOI] [PubMed] [Google Scholar]
  • 4.Bird JM, Owen RG, D’Sa S, Snowden JA, Pratt G, Ashcroft J, et al. Guidelines for the diagnosis and management of multiple myeloma 2011. Br J Haematol. 2011;154(1):32–75. 10.1111/j.1365-2141.2011.08573.x [DOI] [PubMed] [Google Scholar]
  • 5.Giles SL, Messiou C, Collins DJ, Morgan VA, Simpkin CJ, West S, et al. Whole-Body Diffusion-weighted MR Imaging for Assessment of Treatment Response in Myeloma. Radiology. 2014;271(3):131529. [DOI] [PubMed] [Google Scholar]
  • 6.Dimopoulos MA, Hillengass J, Usmani S, Zamagni E, Lentzsch S, Davies FE, et al. Role of Magnetic Resonance Imaging in the Management of Patients With Multiple Myeloma: A Consensus Statement. J Clin Oncol [Internet]. 2015. February 20 [cited 2018 Sep 27];33(6):657–64. Available from: http://www.ncbi.nlm.nih.gov/pubmed/25605835 10.1200/JCO.2014.57.9961 [DOI] [PubMed] [Google Scholar]
  • 7.Myeloma: diagnosis and management | Guidance and guidelines | NICE. [cited 2018 Sep 27]; Available from: https://www.nice.org.uk/guidance/ng35
  • 8.Durie BGM, Kyle R a, Belch A, Bensinger W, Blade J, Boccadoro M, et al. Myeloma management guidelines: a consensus report from the Scientific Advisors of the International Myeloma Foundation. Hematol J. 2003;4:379–98. 10.1038/sj.thj.6200312 [DOI] [PubMed] [Google Scholar]
  • 9.Walker R, Barlogie B, Haessler J, Tricot G, Anaissie E, Shaughnessy JD, et al. Magnetic resonance imaging in multiple myeloma: diagnostic and clinical implications. J Clin Oncol [Internet]. 2007. March 20 [cited 2016 Jun 30];25(9):1121–8. Available from: http://www.ncbi.nlm.nih.gov/pubmed/17296972 10.1200/JCO.2006.08.5803 [DOI] [PubMed] [Google Scholar]
  • 10.ISMRM. The MR Value Initiative. https://www.ismrm.org/the-mr-value-initiative-phase-1/
  • 11.Takasu M, Tamura T, Kaichi Y, Tanitame K, Akiyama Y, Date S, et al. Magnetic resonance evaluation of multiple myeloma at 3.0 Tesla: how do bone marrow plasma cell percentage and selection of protocols affect lesion conspicuity? PLoS One [Internet]. 2014. January [cited 2016 Apr 16];9(1):e85931 Available from: http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=3904853&tool=pmcentrez&rendertype=abstract 10.1371/journal.pone.0085931 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Dixon WT. Simple proton spectroscopic imaging. Radiology. 1984;153(1):189–94. 10.1148/radiology.153.1.6089263 [DOI] [PubMed] [Google Scholar]
  • 13.Dutoit JC, Verstraete KL. Whole-body MRI, dynamic contrast-enhanced MRI, and diffusion-weighted imaging for the staging of multiple myeloma. Skeletal Radiol [Internet]. 2017. June 13 [cited 2018 Sep 27];46(6):733–50. Available from: http://www.ncbi.nlm.nih.gov/pubmed/28289855 10.1007/s00256-017-2609-6 [DOI] [PubMed] [Google Scholar]
  • 14.Weininger M, Lauterbach B, Knop S, Pabst T, Kenn W, Hahn D, et al. Whole-body MRI of multiple myeloma: Comparison of different MRI sequences in assessment of different growth patterns. Eur J Radiol [Internet]. 2009. February [cited 2018 Sep 27];69(2):339–45. Available from: http://www.ncbi.nlm.nih.gov/pubmed/18096344 10.1016/j.ejrad.2007.10.025 [DOI] [PubMed] [Google Scholar]
  • 15.Bray TJP, Singh S, Latifoltojar A, Rajesparan K, Rahman F, Narayanan P, et al. Diagnostic utility of whole body Dixon MRI in multiple myeloma: A multi-reader study. Jiang Q, editor. PLoS One [Internet]. 2017. July 3 [cited 2018 Sep 27];12(7):e0180562 Available from: http://www.ncbi.nlm.nih.gov/pubmed/28672007 10.1371/journal.pone.0180562 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Latifoltojar A, Hall-Craggs MA, Taylor S, Yong K, Rabin N, Benger N, et al. Whole body mDixon in multiple myeloma: Quantitative derived parameters changes following chemotherapy. In: ISMRM. 2015. [Google Scholar]
  • 17.Sonneveld P, Avet-Loiseau H, Lonial S, Usmani S, Siegel D, Anderson KC, et al. Treatment of multiple myeloma with high-risk cytogenetics: a consensus of the International Myeloma Working Group. Blood [Internet]. 2016. June 16 [cited 2019 Mar 7];127(24):2955–62. Available from: http://www.bloodjournal.org/cgi/doi/10.1182/blood-2016-01-631200 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Eggers H, Brendel B, Duijndam A, Herigault G. Dual-Echo Dixon Imaging with Flexible Choice of Echo Times. [DOI] [PubMed]
  • 19.Takasu M, Tani C, Sakoda Y, Ishikawa M, Tanitame K, Date S, et al. Iterative decomposition of water and fat with echo asymmetry and least-squares estimation (IDEAL) imaging of multiple myeloma: initial clinical efficiency results. Eur Radiol [Internet]. 2012. May [cited 2016 Jun 30];22(5):1114–21. Available from: http://www.ncbi.nlm.nih.gov/pubmed/22138735 10.1007/s00330-011-2351-8 [DOI] [PubMed] [Google Scholar]
  • 20.Bäuerle T, Hillengass J, Fechtner K, Zechmann CM, Grenacher L, Moehler TM, et al. Multiple myeloma and monoclonal gammopathy of undetermined significance: importance of whole-body versus spinal MR imaging. Radiology [Internet]. 2009. August [cited 2016 Apr 16];252(2):477–85. Available from: http://www.ncbi.nlm.nih.gov/pubmed/19703885 10.1148/radiol.2522081756 [DOI] [PubMed] [Google Scholar]
  • 21.Messiou C, Hillengass J, Delorme S, Lecouvet FE, Moulopoulos LA, Collins DJ, et al. Guidelines for Acquisition, Interpretation, and Reporting of Whole-Body MRI in Myeloma: Myeloma Response Assessment and Diagnosis System (MY-RADS). Radiology [Internet]. 2019. February 26 [cited 2019 Mar 4];181949 Available from: http://pubs.rsna.org/doi/10.1148/radiol.2019181949 [DOI] [PubMed] [Google Scholar]
  • 22.Pawlyn C, Fowkes L, Otero S, Jones JR, Boyd KD, Davies FE, et al. Whole-body diffusion-weighted MRI: a new gold standard for assessing disease burden in patients with multiple myeloma? Leukemia [Internet]. 2016. June [cited 2017 Apr 3];30(6):1446–8. Available from: http://www.ncbi.nlm.nih.gov/pubmed/26648535 10.1038/leu.2015.338 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Sachpekidis C, Mosebach J, Freitag MT, Wilhelm T, Mai EK, Goldschmidt H, et al. Application of (18)F-FDG PET and diffusion weighted imaging (DWI) in multiple myeloma: comparison of functional imaging modalities. Am J Nucl Med Mol Imaging [Internet]. 2015. [cited 2019 Mar 4];5(5):479–92. Available from: http://www.ncbi.nlm.nih.gov/pubmed/26550539 [PMC free article] [PubMed] [Google Scholar]
  • 24.Messiou C, Hillengass J, Delorme S, Lecouvet FE, Moulopoulos LA, Collins DJ, et al. Guidelines for Acquisition, Interpretation, and Reporting of Whole-Body MRI in Myeloma: Myeloma Response Assessment and Diagnosis System (MY-RADS). Radiology. 2019. February;181949. [DOI] [PubMed] [Google Scholar]
  • 25.Rajkumar SV, Dimopoulos MA, Palumbo A, Blade J, Merlini G, Mateos M-V, et al. International Myeloma Working Group updated criteria for the diagnosis of multiple myeloma. Lancet Oncol [Internet]. 2014. November [cited 2015 Sep 23];15(12):e538–48. Available from: http://www.ncbi.nlm.nih.gov/pubmed/25439696 10.1016/S1470-2045(14)70442-5 [DOI] [PubMed] [Google Scholar]
  • 26.Bourillon C, Rahmouni A, Lin C, Belhadj K, Beaussart P, Vignaud A, et al. Intravoxel Incoherent Motion Diffusion-weighted Imaging of Multiple Myeloma Lesions: Correlation with Whole-Body Dynamic Contrast Agent–enhanced MR Imaging. Radiology [Internet]. 2015. December 24 [cited 2019 Mar 4];277(3):773–83. Available from: http://pubs.rsna.org/doi/10.1148/radiol.2015141728 [DOI] [PubMed] [Google Scholar]
  • 27.Dutoit JC, Claus E, Offner F, Noens L, Delanghe J, Verstraete KL. Combined evaluation of conventional MRI, dynamic contrast-enhanced MRI and diffusion weighted imaging for response evaluation of patients with multiple myeloma. Eur J Radiol [Internet]. 2016. February 1 [cited 2019 Mar 4];85(2):373–82. Available from: https://www.sciencedirect.com/science/article/pii/S0720048X15301832?via%3Dihub 10.1016/j.ejrad.2015.11.040 [DOI] [PubMed] [Google Scholar]

Decision Letter 0

Pascal A T Baltzer

13 Nov 2019

PONE-D-19-26051

Whole Body MRI in Multiple Myeloma: Optimising Image Acquisition and Read Times

PLOS ONE

Dear Professor Hall-Craggs,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

We would appreciate receiving your revised manuscript by Dec 28 2019 11:59PM. When you are ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter.

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). This letter should be uploaded as separate file and labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. This file should be uploaded as separate file and labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. This file should be uploaded as separate file and labeled 'Manuscript'.

Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

We look forward to receiving your revised manuscript.

Kind regards,

Pascal A. T. Baltzer, M.D.

Academic Editor

PLOS ONE

Journal Requirements:

1. When submitting your revision, we need you to address these additional requirements.

Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

http://www.journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and http://www.journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. We noticed you have some minor occurrence(s) of overlapping text with the following previous publication(s), which needs to be addressed:

https://doi.org/10.1371journal.pone.0180562

In your revision ensure you cite all your sources (including your own works), and quote or rephrase any duplicated text outside the Methods section. Further consideration is dependent on these concerns being addressed.

3. We have noticed that you report some instances of p = 0. Since this is not strictly possible, please report the exact p value in your results.

4. We noted in your submission details that a portion of your manuscript may have been presented or published elsewhere: The cohort of patients has been studied in previously published work. Two publications by Latifoltojar et al in 2017 studied 21 out of 30 patients in the context of changes in quantitative imaging biomarkers in response to treatment. A previous study by Bray et al 2017, studied the detection of myeloma lesions on pre-contrast imaging in this cohort. This study differs from both as we studied all available imaging including DWI and post contrast Dixon imaging in the entire 30 patient cohort. We have also included new patient by patient analysis, not performed in the paper by Bray et al 2017.  We have also focussed on MR value. We have measured read times for the scans for different readers, allowing estimation of time saving if scans are selectively reviewed..

Please clarify whether this [conference proceeding or publication] was peer-reviewed and formally published. If this work was previously peer-reviewed and published, in the cover letter please provide the reason that this work does not constitute dual publication and should be included in the current manuscript.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: A complete full body protocol is used as a reference. In comparison, the individual sequences are analyzed sequentially. They are examined in detail with regard to their diagnostic accuracy. Finally, it is examined whether the combination of the best with the less important sequence offers an incremental benefit. This approach is generally convincing and worthy of being published. This is all the more true, since the data obviously come from a database that has already been published, which is, however, sufficiently noted.

A few points of criticism I would like to mention which should be addressed by the authors.

1. It should be made clear that this is an artificially shortened protocol. In fact, the full state-of-the-art protocol includes more sequences (https://doi.org/10.1148/radiol.2019181949, figure 1). In this respect, the reference standard is somewhat misleading and the actual accuracy of the sequences is presumably lower.

2. The authors mainly focus on parameters of lesion detection (sensitivity and PPV etc.). Just as important - especially in the context of efficiency - are the negative ratings. As far as I can see, these can also be analyzed by the data, which in my opinion should be revised.

3. The terminology efficiency and effectiveness is clearly defined, highly relevant, but unfortunately - in my opinion - not outlined clearly enough in the paper. In fact, the authors convincingly demonstrate that the shortening of the full protocol produces a significantly worse diagnostic result: E.g. lesion count / sensitivity of DWI = 72 %, which means a 28% worse detection rate vs. the gold standard. This, in turn, was not measured completely (see above). Of course, it is impossible to examine all this in one paper, but it should be named more clearly and discussed more objectively. This is all the more true in the context of the clinical relevance: The exact definition of the lesion load is vital for optimized outcome (etc.) as mentioned by the authors.

4. "Fat and water image reconstructions are mandatory and should be used to generate fat fraction maps": this clear statement in the above-mentioned cited paper is somewhat contradictory to the authors' conclusions. This, too, should be quite discussed.

5. The analysis of reading time is highly relevant and should be clearer presented. In every multiparametric analysis, one sequence is read as a "main sequence" and the other parameters are evaluated as add-ons. A simple addition of the reading times as given by the independent read is thus misleading in a certain way, but at least it overestimates the potential time saving effect.

6. CNR analysis: How were lesions not detectable in all sequences analyzed. A more detailed explanation is recommended.

7. Discussion: “increased (...) morbidity for patients” introduced by contrast agents. Unclear sentence. In which context side effects of CA could play a clinically relevant role in the management of MM patients?

8. Since MRI is a multiparametric method and will probably remain so, according to the data of the authors, a more detailed presentation of the data regarding the combination of several sequences is desirable ("determine which image types add diagnostic value").

9. Bland Altman analysis of the key results is recommended for better assessment of a proportional or systematic bias.

10. The main result is potential time savings for a different MRI protocols. I recommend to present these results more clearly in a graphical illustration.

Reviewer #2: Dear authors,

i have read your paper with great interest as there is only few data on evaluating specific sequences in WB/MRI in MM. I have found methodology, reporting and the conclusions drawn concise and comprehensible. I would like to ask you to comment on a. if there was any histopathologic correlation or consequent work-up by PET/CT in these 30 patients and

b. if the readers have had ADC maps when evaluation DWI.

Overall my recommendation would be: minor revisions.

With my best regards

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files to be viewed.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2020 Jan 30;15(1):e0228424. doi: 10.1371/journal.pone.0228424.r002

Author response to Decision Letter 0


13 Dec 2019

Journal Requirements:

1. When submitting your revision, we need you to address these additional requirements.

Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

http://www.journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and http://www.journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

The guidelines have been reviewed and followed.

2. We noticed you have some minor occurrence(s) of overlapping text with the following previous publication(s), which needs to be addressed:

https://doi.org/10.1371journal.pone.0180562

In your revision ensure you cite all your sources (including your own works), and quote or rephrase any duplicated text outside the Methods section. Further consideration is dependent on these concerns being addressed.

Outside the methods (and we have acknowledged and referenced the previous paper [15]), there is very little overlapping text as the results are of a different study.

All sources have been cited.

3. We have noticed that you report some instances of p = 0. Since this is not strictly possible, please report the exact p value in your results.

The p values recorded as p = 0.000 have been amended to p<0.001.

4. We noted in your submission details that a portion of your manuscript may have been presented or published elsewhere: The cohort of patients has been studied in previously published work. Two publications by Latifoltojar et al in 2017 studied 21 out of 30 patients in the context of changes in quantitative imaging biomarkers in response to treatment. A previous study by Bray et al 2017, studied the detection of myeloma lesions on pre-contrast imaging in this cohort. This study differs from both as we studied all available imaging including DWI and post contrast Dixon imaging in the entire 30 patient cohort. We have also included new patient by patient analysis, not performed in the paper by Bray et al 2017. We have also focussed on MR value. We have measured read times for the scans for different readers, allowing estimation of time saving if scans are selectively reviewed..

Please clarify whether this [conference proceeding or publication] was peer-reviewed and formally published. If this work was previously peer-reviewed and published, in the cover letter please provide the reason that this work does not constitute dual publication and should be included in the current manuscript.

I think that the covering letter quoted from above was confusing and misleading. The paragraph below has been expanded to make it clear that the data submitted in the current study are new. Apologies for the confusion.

“The CURRENT SUBMITTED study differs from both (the previous studies ) as we studied all available imaging including DWI and post contrast Dixon imaging in the entire 30 patient cohort. We have also included new patient by patient analysis, not performed in the paper by Bray et al 2017. We have also focussed on MR value. We have measured read times for the scans for different readers, allowing estimation of time saving if scans are selectively reviewed.”

We confirm that the new data included in the submitted paper have not been published elsewhere and this is not duplicate publication.

Response to Comments to the Author

Reviewer #1: A complete full body protocol is used as a reference. In comparison, the individual sequences are analyzed sequentially. They are examined in detail with regard to their diagnostic accuracy. Finally, it is examined whether the combination of the best with the less important sequence offers an incremental benefit. This approach is generally convincing and worthy of being published. This is all the more true, since the data obviously come from a database that has already been published, which is, however, sufficiently noted.

A few points of criticism I would like to mention which should be addressed by the authors.

1. It should be made clear that this is an artificially shortened protocol. In fact, the full state-of-the-art protocol includes more sequences (https://doi.org/10.1148/radiol.2019181949, figure 1). In this respect, the reference standard is somewhat misleading and the actual accuracy of the sequences is presumably lower.

In the paper referred to by the reviewer, a different protocol was used to the one described in our study. We do not routinely image the spine, but if there is any abnormality seen in the spine on the Dixon and diffusion images, we go on to do dedicated spine images as an additional procedure. This saves time and is supported by data from the paper from our group authored by Latifoltojar (reference 16).

We do routinely do T2 images, but the main reason for doing the T2 is to assess for extramedullary disease and for treatment response following chemotherapy. In the current paper we have not included the T2 images as it well established in the literature, and in general practice, that for patients before treatment the T2 images are non-contributory for the assessment of bone lesions over and above diffusion and Dixon images.

We have clarified in the text the use of T2 images.

“Although other sequences were acquired during the standard whole body protocol, these were not included for assessment as these are acquired mainly for assessment of extramedullary disease (axial T2) and for measurement of response to therapy (T2, ADC and fat fraction maps before and after treatment).”

2. The authors mainly focus on parameters of lesion detection (sensitivity and PPV etc.). Just as important - especially in the context of efficiency - are the negative ratings. As far as I can see, these can also be analyzed by the data, which in my opinion should be revised.

In our cohort all our patients had a high clinical suspicion of myeloma and our current study was to compare the performances of different sequences and not the performance of whole body MRI. To do this we have to take a lesion and compare it on different sequences – we cannot take ‘no lesion’ [= true negative] and compare this between sequences. Consequently in this study we have no ‘true negatives’ as we are labelling lesions and not patients. We have true positives, false positives and false negatives. True negatives do not exist within this study.

3. The terminology efficiency and effectiveness is clearly defined, highly relevant, but unfortunately - in my opinion - not outlined clearly enough in the paper. In fact, the authors convincingly demonstrate that the shortening of the full protocol produces a significantly worse diagnostic result: E.g. lesion count / sensitivity of DWI = 72 %, which means a 28% worse detection rate vs. the gold standard. This, in turn, was not measured completely (see above). Of course, it is impossible to examine all this in one paper, but it should be named more clearly and discussed more objectively. This is all the more true in the context of the clinical relevance: The exact definition of the lesion load is vital for optimized outcome (etc.) as mentioned by the authors.

We agree this is an important point. We have added a sentence to elaborate on why we think preferentially reading DWI and post contrast would lead to increased reporting efficiency in the Discussion

“This can be supplemented by review of the other imaging sequences, particularly where identifying a further lesion would impact on patient management. As commented above, identifying a second lesion in a patient with smouldering myeloma would have greater impact on disease management that identifying an additional lesion in a patient where multiple lesions were already seen. This reading strategy would reduce overall read times in some patients.”

4. "Fat and water image reconstructions are mandatory and should be used to generate fat fraction maps": this clear statement in the above-mentioned cited paper is somewhat contradictory to the authors' conclusions. This, too, should be quite discussed.

We do understand this point and agree that it is a relative limitation of our study. However fat fraction maps most useful in a clinical setting for measuring disease response to therapy, and our study was about identifying lesions in disease and not quantifying response to treatment. We have added a paragraph to clarify this.

“In our study we have not assessed the use of fat fraction maps as in clinical practise these are used for assessment of disease response during therapy, rather than for initial lesion identification.”

5. The analysis of reading time is highly relevant and should be clearer presented. In every multiparametric analysis, one sequence is read as a "main sequence" and the other parameters are evaluated as add-ons. A simple addition of the reading times as given by the independent read is thus misleading in a certain way, but at least it overestimates the potential time saving effect.

We agree this is a limitation of the way reading time was measured. However as each individual image type was being assessed, we did not use a ‘’locked sequential read”. We have added a sentence expanding on this within the limitations section.

“Reading times were recorded per image type. However in clinical practice, two or more types of images may be reviewed in parallel. Consequently simply summating reading times gives an indication of the potential time saving, but is not a precise measurement of time saved to read a whole scan.”

6. CNR analysis: How were lesions not detectable in all sequences analyzed. A more detailed explanation is recommended.

For measurement of CNR we selected lesions which were visible on all sequences. This has been clarified in the manuscript.

Percent Contrast and Contrast-to-Noise Ratio

In patients with at least three focal lesions greater than 5 mm in diameter, visible on all image types, percent contrast and contrast-to-noise ratio (CNR) were calculated using a previously described method (11).

7. Discussion: “increased (...) morbidity for patients” introduced by contrast agents. Unclear sentence. In which context side effects of CA could play a clinically relevant role in the management of MM patients?

We have altered this sentence as we agree that following the cases of NSF, and the introduction of new guidelines for administration of IV contrast, there should be no increase in morbidity other than contrast reactions.

8. Since MRI is a multiparametric method and will probably remain so, according to the data of the authors, a more detailed presentation of the data regarding the combination of several sequences is desirable ("determine which image types add diagnostic value").

We agree that this would be interesting, but is beyond the scope of this study which was directed towards lesion detection. A different study would be needed to address diagnostic value/outcome analysis.

9. Bland Altman analysis of the key results is recommended for better assessment of a proportional or systematic bias.

We do not feel that Bland Altman would be the appropriate test of our data. Bland Altman is fundamentally an analysis of agreement. We are using a reference ‘gold’ standard and so we can simply compare this with other sequences.

10. The main result is potential time savings for a different MRI protocols. I recommend to present these results more clearly in a graphical illustration.

We have not really compared different MRI protocols in this study, we used one protocol only, and so it would be difficult to compare this graphically – as this is not fundamentally what we have measured. We have looked at lesion detection and read time for different sequences, and these are presented numerically in the text.

Reviewer #2: Dear authors,

i have read your paper with great interest as there is only few data on evaluating specific sequences in WB/MRI in MM. I have found methodology, reporting and the conclusions drawn concise and comprehensible. I would like to ask you to comment on a. if there was any histopathologic correlation or consequent work-up by PET/CT in these 30 patients and

b. if the readers have had ADC maps when evaluation DWI.

Overall my recommendation would be: minor revisions.

Thank you for your positive review.

The patients had their bone marrow examined but focal lesions were not biopsied.

The readers did not have ADC maps to review although these were automatically produced by the vendor software. As with most read protocols, the long b value diffusion images are used for lesion detection (the focus of this study) whereas ADC maps and change in ADC are used for assessment of treatment response.

Attachment

Submitted filename: Response to Reviewers .docx

Decision Letter 1

Pascal A T Baltzer

15 Jan 2020

Whole Body MRI in Multiple Myeloma: Optimising Image Acquisition and Read Times

PONE-D-19-26051R1

Dear Dr. Hall-Craggs,

We are pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it complies with all outstanding technical requirements.

Within one week, you will receive an e-mail containing information on the amendments required prior to publication. When all required modifications have been addressed, you will receive a formal acceptance letter and your manuscript will proceed to our production department and be scheduled for publication.

Shortly after the formal acceptance letter is sent, an invoice for payment will follow. To ensure an efficient production and billing process, please log into Editorial Manager at https://www.editorialmanager.com/pone/, click the "Update My Information" link at the top of the page, and update your user information. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, you must inform our press team as soon as possible and no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

With kind regards,

Pascal A. T. Baltzer, M.D.

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #2: Dear authors,

i am happy with the manuscript as it is right now and i have no further questions.

Best regards.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #2: No

Acceptance letter

Pascal A T Baltzer

23 Jan 2020

PONE-D-19-26051R1

Whole Body MRI in Multiple Myeloma: Optimising Image Acquisition and Read Times

Dear Dr. Hall-Craggs:

I am pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please notify them about your upcoming paper at this point, to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

For any other questions or concerns, please email plosone@plos.org.

Thank you for submitting your work to PLOS ONE.

With kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Pascal A. T. Baltzer

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 File. Raw data showing lesion counts, true positives, false positives, false negatives, sensitivity, PPV and confidence for each of the four Dixon image types, for each observer and each subject.

    Please refer to the Materials and Methods section for more information on data structure.

    (XLSX)

    Attachment

    Submitted filename: Response to Reviewers .docx

    Data Availability Statement

    All relevant data are within the paper and its Supporting Information files.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES