Skip to main content
PLOS One logoLink to PLOS One
. 2024 Mar 13;19(3):e0297911. doi: 10.1371/journal.pone.0297911

Are clinically unimportant findings qualified as benign in lumbar spine imaging reports? A content analysis of plain X-ray, CT and MRI reports

Caitlin Farmer 1,*, Romi Haas 1, Jason Wallis 1, Denise O’Connor 1, Rachelle Buchbinder 1
Editor: Aloysius Gonzaga Mubuuke2
PMCID: PMC10936854  PMID: 38478495

Abstract

Background

Lumbar spine diagnostic imaging reports may cause patient and clinician concern when clinically unimportant findings are not explicitly described as benign. Our primary aim was to determine the frequency that common, benign findings are reported in lumbar spine plain X-ray, computed tomography (CT) and magnetic resonance imaging (MRI) reports as either normal for age or likely clinically unimportant.

Methods

We obtained 600 random de-identified adult lumbar spine imaging reports (200 X-ray, 200 CT and 200 MRI) from a large radiology provider. Only reports requested for low back pain were included. From the report text, one author extracted each finding (e.g., ‘broad-based posterior disc bulge’) and whether it was present or absent (e.g., no disc bulge) until data saturation was reached, pre-defined as a minimum of 50 reports and no new/similar findings in the last ten reports within each imaging modality. Two authors independently judged whether each finding was likely clinically unimportant or important. For each likely clinicially unimportant finding they also determined if it had been explicitly reported to be benign (expressed as normal, normal for age, benign, clinically unimportant or non-significant).

Results

Data saturation was reached after coding 262 reports (80 X-ray, 82 CT, 100 MRI). Across all reports we extracted 3,598 findings. Nearly all reports included at least one clinically unimportant finding (76/80 (95%) X-ray, 80/82 (98%) CT, 99/100 (99%) MRI). Over half of the findings (n = 2,062, 57%; 272 X-Ray, 667 CT, 1123 MRI) were judged likely clinically unimportant. Most likely clinically unimportant findings (90%, n = 1,854) were reported to be present on imaging (rather than absent) and of those only 18% (n = 331) (89 (35%) X-ray, 93 (16%) CT and 149 (15%) MRI) were explicitly reported as benign.

Conclusion

Lumbar spine imaging reports frequently include findings unlikely to be clinically important without explicitly qualifying that they are benign.

Introduction

Degenerative changes, including disc bulges and facet joint degeneration, are common findings described in lumbar spine imaging reports [13]. These changes are increasingly prevalent with age and are equally as common in people with and without low back pain [1]. Longitudinal population-based studies have confirmed that such degenerative changes do not have clinically important associations with either current or future back pain, even when multiple changes are present [4, 5]. Reporting on the presence of degenerative changes in imaging reports without clarifying that they are likely clinically unimportant has the potential to lead to overdiagnosis and overtreatment, and cause patient anxiety about the seriousness and persistence of symptoms [69].

There is a paucity of guidance for radiologists on how to communicate findings of limited clinical relevance in a manner that does not alarm the reader. A scoping review that included six radiology reporting guidelines found that three of the guidelines recommended reporting the presence of normal findings, although there was limited advice about how this should be actioned [10]. For example, the Royal Australian and New Zealand College of Radiologists (RANZCR) guideline suggests that normal findings be noted when it would make a difference to the referrer or when absence of such a statement would create ambiguity [11].

In addition, only three guidelines provide guidance on communicating confidence or certainty in reports. The UK College of Radiologists guideline recommends that the level of certainty or doubt surrounding an imaging diagnosis be clearly documented in the report [12], the Canadian College of Radiologists guideline suggests the focus should be on findings that offer potential for resolution of the clinical question [13], while the RANZCR guideline recommends avoiding use of vague modifiers, such as ‘possibly represents’ [11]. These recommendations are in keeping with other literature that recommends avoiding ambigious statements or hedging vocabulary, such as ‘there appears to be…’, to minimise confusion for the reader. Using hedging vocabulary such as ‘seen’ or ‘identified’ is discouraged as it suggests that something may have been missed; for example ‘no fracture seen’ is a less certain statement than ‘no fracture’ [14, 15].

The choice of phrasing in imaging reports can influence management decisions. For example a US study providing hypothetical scenarios to clinicians found they were more likely to request further imaging if an incident 5mm liver lesion was described as ‘most likely a cyst’ compared to being a ‘benign cyst (46% and 2% respectively) [16]. Providing reassuring statements, such as ‘findings are normal for age’, or avoiding alarming descriptors, such as ‘degeneration’, ‘tear’ or ‘rupture’, have been suggested as possible ways to reduce misinterpretation of clinically unimportant findings in lumbar spine imaging reports [6, 17].

One previous study has investigated the extent to which degenerative changes are reported in lumbar spine imaging reports and how they are described [2]. Based upon examination of 120 consecutive plain X-ray reports requested in primary care, they found that almost three quarters noted the presence of degenerative changes. Only 2% of reports explicitly stated these were normal for age, while 14% indicated the changes were either ‘mild’ or ‘slight’, which may be a less explicit way of indicating to the reader that a particular finding is of limited clinical relevance. Another study performed a content analysis of plain X-Ray and MRI imaging reports of patients with persistent low back pain and explored, through interviews, which terms negatively impacted the patients’ perceived prognosis [18]. The terms ‘wear and tear’ and ‘disc space loss’ were associated with a significantly worse perceived outcome based upon patients’ interpretation that these terms signified the spine was ‘deteriorating’, ‘crumbling’, ‘collapsing’ and/or the discs were ‘wearing out’.

The primary aims of this study were to determine (a) the frequency that likely clinically unimportant findings are reported in lumbar spine plain X-ray, CT and MRI reports, and (b) the frequency that they are explicitly reported to be benign (i.e., normal, normal for age, benign, clinically unimportant or non-significant). Second, we investigated the frequency of adjectives (e.g., mild, severe) used to describe these findings and how frequently terms of uncertainty (i.e., vague modifiers, hedging vocabulary) were used.

Methods

Study design

We performed a content analysis of a random sample of fully de-identified lumbar spine plain X-ray, CT and MRI imaging reports from iMed, a large radiology service provider in Victoria, Australia.

A random sample of 600 (200 X-ray, 200 CT and 200 MRI) reports written between 1 January 2019 and 30 June 2021 were collected in July 2021 and this study was conducted over the following year. To obtain the random sample for each modality, we used the ‘Rand’ function in Excel to identify random dates within this time period for each imaging modality. If more than one report was identified for a selected day, we again used the Rand’ function in Excel to select another report at random. For days without a report, we used the next randomised date.

A research assistant, not otherwise involved in the study, extracted the complete text of the identified X-ray, CT and MRI reports, including the patient sex, date of birth, date of imaging examination, requesting clinician specialty (e.g., GP, orthopaedic surgeon, rheumatologist), where available, and reporting radiologist using a standardised MS Excel data collection form. To ensure anonymity of the reporting radiologist a unique numerical code was assigned for each radiologist. The de-identified extracted reports were then provided to the research team. No data that could identify patients, referrers or radiologists were provided.

Eligibility criteria

We included lumbar spine imaging reports for people of any age that indicated that the imaging had been requested for low back or radicular lower limb pain. Reports that covered multiple body regions (e.g., thoracolumbar spine) were included only if the report of the lumbar spine could be clearly separated from reporting on other body regions. We excluded reports of imaging performed following major trauma and those requested to explicitly rule in/out serious causes (i.e. infection, malignancy, fracture), imaging post surgery or imaging performed for monitoring purposes. We also excluded any report that included a serious finding (e.g., vertebral fracture or metastatic disease) regardless of whether the clinical notes queried the presence of such a finding.

Data extraction

From the text of the reports relevant to the lumbar spine, one author (CF), a physiotherapist with expertise in low back pain, extracted each individual finding including any adjectives describing the finding (e.g., ‘mild posterior disc bulge’). This was continued until data saturation was reached, pre-defined as no new/similar findings in the last ten reports within each imaging modality.

The same author also extracted each term of uncertainty, including vague modifiers and hedging vocabulary based, a priori, on published lists of these terms [1416, 19], and consensus among the authors for additional terms. We grouped terms of uncertainty that had similar meaning together (e.g., ‘not shown’, ‘identified’ and ‘seen’). A second author (JW or RH, both also physiotherapists with expertise in low back pain) checked all extracted data and differences were resolved by consensus.

For each individual finding, two authors (CF, JW or RH) independently determined whether the finding was likely clinically unimportant or important, based, a priori, on published evidence about the relevance of imaging findings, the report context, and/or author team (also included rheumatologist with low back pain expertise (RB) and occupational therapist (DOC)) consensus for equivocal findings. S1 Table provides the list of likely clinically unimportant findings based upon the published evidence. We grouped findings based on anatomical structure (i.e., disc, facet joint, etc.,) and pathology present (e.g., bulge, arthropathy). Findings that described the same or similar abnormality (e.g., ‘disc height loss’ and ‘disc space narrowing’) were grouped together.

For findings that are usually considered clinically unimportant (e.g., ‘disc protrusion’), if there was evidence within the context of the report of its potential importance (e.g., ‘compressing a nerve root’), or if the clinical importance of a finding was ambiguous (e.g., ‘mild to moderate canal stenosis’), we erred on being conservative and categorised the finding as likely clinically important. When a clinically unimportant finding was reported to be absent (e.g., ‘no disc bulge’) we recorded that separately.

For each likely clinically unimportant finding the same two independent authors (CF and RH or JW) recorded whether there was an explicit qualification that the finding was benign. This could have been stated as ‘normal’, ‘normal for age’, ‘benign’, ‘clinically unimportant’, ‘non-significant’ and/or other related synonyms (e.g., ‘normal alignment’ or ‘alignment satisfactory’). Any disagreements were resolved by discussion with all authors.

Sample size and data saturation

The sample size was informed by previous content analyses of lumbar spine imaging reports [2, 18, 20], and the data saturation stopping rule described by Francis et al. [21]. A pilot study investigating the content of lumbar spine imaging performed in patients presenting with back pain to an emergency department of one metropolitan hospital in Victoria, Australia, indicated that a minimum of 50 reports, and likely less than 100 reports, of each modality would be needed [20].

We identified 200 reports for each modality to allow for the various terms known to describe similar radiological findings [22], as well as potential exclusions. We extracted individual findings until data saturation had been reached, defined as a minimum of 50 reports and the point at which no new findings were identified among the last ten reports, coded separately for each imaging modality. Cumulative frequency tables listed each new term until this stopping rule was met.

Data analysis

We used descriptive statistics to summarise the demographic characteristics of patients that were imaged, reporting radiologists and imaging referrers. We also measured and reported the report word count, median number of findings per report and median number categorised as likely clinically unimportant or important, the proportion of imaging reports with at least one likely clinically unimportant finding, the proportion that qualified clinically unimportant findings as benign, adjectives used to describe the findings, and terms of uncertainty. The most common likely clinically unimportant findings and most common adjectives and terms of uncertainty were determined for each imaging modality.

Ethics

This study was approved by the Monash University Human Research Ethics Committee (Approval ID 27959). Individual participant consent was not required due to the de-identified nature of the data.

Results

Fig 1 presents a flow chart of the report coding process and a summary of the main findings. Data saturation was reached after coding 262 reports (80 X-ray, 82 CT and 100 MRI). Ninety-five reports (17 X-ray, 66 CT and 12 MRI) were excluded for reasons listed in Fig 1, most commonly because the imaging was performed due to trauma. 3598 separate findings were extracted (454, 1139 and 2005 in the X-ray, CT and MRI reports respectively). Of these, 2062 (57%) (272 in X-ray (60%), 667 in CT (59%) and 1123 in MRI (56%)) were judged to be likely clinically unimportant. Most (n = 1854, 90%) were reported to be present rather than reported to be absent, and only a minority (n = 331, 18%) were explicitly reported to be benign.

Fig 1. Flow chart of report coding process.

Fig 1

Notes: *’Benign’ findings were explicitly qualified as ‘normal’, ‘normal for age’, ‘benign’, ‘clinically unimportant’, ‘non-significant’ and/or other related synonyms. ^One MRI finding (‘CBD’) was excluded as its meaning was unclear.

Patient demographics, requesting clinician and radiologist details and imaging report characteristics by imaging modality are shown in Table 1. There were more women across all three imaging modalities and the median (range) age varied from 50 (15 to 91) years for MRI to 65 (17 to 98) years for CT reports. Most imaging was requested by GPs (50/80 (63%) X-rays, 49/82 (60%) CTs and 57/100 (57%) MRIs). One hundred and five different radiologists reported on the imaging with over 50 different radiologists for each imaging type (52 X-rays, 56 CT scans and 53 MRI scans). Only 10 radiologists reported imaging for all three imaging modalities and 39 contributed only a single report across all modalities. The median number of reports written by each radiologist was one (range for X-ray: 1 to 5, CT: 1 to 3 and MRI 1 to 6)).

Table 1. Demographics (sex and age) of patients that had imaging requested, number of reporting radiologists and imaging requestors, and word count and number of imaging findings reported, by imaging modality.

X-ray (N = 80) CT (N = 82) MRI (N = 100)
Patient demographics
 Female, N (%) 49 (61) 43 (52) 53 (53)
 Age in years, median (range) 56 (6 to 90) 65 (17 to 98) 50 (15 to 91)
Number of reporting radiologists 52 56 53
Imaging requestor, N (%) *
 General practitioner 50 (63) 49 (60) 57 (57)
 Emergency physician 7 (9) 16 (20) 14 (14)
 Chiropractor 1 (1) 5 (6) 4 (4)
 Physiotherapist 1 (1) 1 (1) 5 (5)
 Rheumatologist 2 (3) 1 (1) 0 (0)
 Orthopaedic surgeon 2 (3) 0 (0) 0 (0)
 Endocrinologist 2 (3) 1 (1) 0 (0)
 Neurologist 0 (0) 0 (0) 3 (3)
 Pain physician 0 (0) 1 (1) 4 (4)
 Unclear 12 (15) 6 (7) 13 (13)
Imaging report characteristics
Word count, median (range) 48 (8 to 167) 137 (4 to 389) 162 (56 to 355)
Number of reports that included at least one likely clinically unimportant finding, N (%) 76 (95) 80 (98) 99 (99)
Median (range) number of findings per report 5 (2 to 14) 11 (2 to 22) 16 (7 to 47)
Median (range) of clinically unimportant findings reported per report 3 (1 to 8) 5 (1 to 32) 9 (1 to 35)
Number of reports that qualified all reported clinically unimportant findings as benign, N (%) 13 (16) 8 (10) 3 (3)
Mean percent of clinically unimportant findings described as benign per report 27 15 14
Number of reports that included at least one term of uncertainty, N (%) 53 (66) 75 (92) 82 (82)
Median (range) of terms of uncertainty per report 1 (1–4) 3 (1 to 12) 3 (1 to 14)

*One osteopath, one neurosurgeon and one oncologist requested one X-ray, and one urologist and one rehabilitation physician requested a single CT scan.

^ Terms of uncertainty could include vague modifiers and hedging vocabulary

X-ray reports had the fewest number of words and findings (median (range): 48 (8 to 167) and 5 (2 to 14), respectively) and MRI scans had the most (median (range): 162 (56 to 355) and 16 (7 to 47), respectively). Nearly all reports included at least one clinically unimportant finding (76/80 (95%) X-ray, 80/82 (98%) CT, 99/100 (99%) MRI), with a median (range) of 3 (1 to 8), 5 (1 to 32) and 9 (1 to 35) per report for X-ray, CT and MRI respectively. Among reports where clinically unimportant findings were present, few reports qualified all of them as benign (13 (16%) X-ray, 8 (10%) CT, 3 (3%) MRI). The majority of reports included at least one term of uncertainty (53/80 (66%) X-Ray, 75/82 (92%) CT and 82/100 (82%) MRI reports).

Likely clinically unimportant findings, their frequency, and proportion reported to be benign

S1 Table indicates which clinically unimportant findings appeared in at least one report by imaging modality. The most common likely clinically unimportant findings that were reported to be present are shown by modality and in order of frequency in Table 2, together with the proportion that were explicitly reported as benign. Allowing for different sensitivities between imaging modalities, changes to the discs (e.g., disc height loss, disc bulge), facet joint arthropathy and degenerative changes were reported most commonly. Across all modalities these findings were reported to be benign in less than half of the reports that noted their presence. For example, disc height loss was reported to be present in 54%, 54% and 31% of X-Ray, CT and MRI reports respectively, and this change was reported to be benign in only 42%, 26% and 33% of those reports respectively. Alignment of segments of the spine, when described (39% X-ray, 40% CT and 46% MRI reports), was most frequently reported to be benign (100% X-Ray, 82% CT and 87% MRI reports that reported the finding).

Table 2. The likely clinically unimportant findings that were reported to be present by imaging modality: The number (%) of reports that indicated the specific finding was present and frequency (%) of the finding across reports in order of frequency, and the number (%) that were reported to be benign.

Number of reports where finding is present Frequency of finding across reports Number of findings reported as benign
X-ray N = 80 reports N = 253 findings N = 89 findings
N (%)^ N (%)# N (%)α
Disc height loss 43 (54) 48 (19) 20 (42)
Facet joint arthropathy 32 (40) 33 (13) 4 (12)
Lumbar spine alignment 31 (39) 31 (12) 31 (100)
SIJ degeneration 24 (30) 26 (10) 17 (65)
Scoliosis (mild) 23 (29) 24 (10) 1 (4)
General degeneration 20 (25) 23 (9) 3 (13)
Osteophytes 16 (20) 16 (6) 1 (6)
Lordosis 11 (14) 11 (4) 6 (55)
Degenerative disc disease 10 (13) 11 (4) 0 (0)
Spondylolisthesis (grade 1) 9 (11) 9 (4) 0 (0)
Congenital deformities 6 (8) 8 (3) 0 (0)
Pedicles/pars defect 6 (8) 6 (2) 6 (100)
Kyphosis (mild) 2 (3) 2 (1) 0 (0)
Bone island 1 (1) 1 (<1) 0 (0)
Pelvic tilt 1 (1) 1 (<1) 0 (0)
Disc protrusion 1 (1) 1 (<1) 0 (0)
Bony wedging 1 (1) 1 (<1) 0 (0)
Granuloma 1 (1) 1 (<1) 0 (0)
CT (n = 82 reports) N = 82 reports N = 585 findings N = 93 findings
N (%)^ N (%)# N (%)α
Facet joint arthropathy 45 (55) 88 (15) 14 (16)
Disc bulge 40 (49) 73 (13) 3 (4)
Disc height loss 44 (54) 59 (10) 15 (25)
Lumbar spine alignment 40 (40) 44 (8) 36 (82)
Osteophytes 23 (28) 40 (7) 0 (0)
General degeneration 28 (34) 36 (6) 1 (3)
Degenerative disc disease 24 (29) 32 (6) 3 (9)
Disc protrusion 17 (21) 30 (5) 0 (0)
Canal stenosis (mild) 16 (20) 24 (4) 2 (8)
Foraminal stenosis (mild) 12 (15) 23 (4) 0 (0)
SIJ degeneration 17 (21) 17 (3) 10 (59)
Nerve root irritation 9 (11) 15 (3) 0 (0)
Ligamentum flavum hypertrophy 8 (10) 15 (3) 0 (0)
Vacuum phenomenon 12 (15) 13 (2) 0 (0)
Spondylolisthesis (grade 1) 8 (10) 13 (2) 0 (0)
Scoliosis (mild) 10 (12) 12 (2) 0 (0)
Congenital deformities 9 (11) 11 (2) 2 (18)
Lordosis 9 (11) 9 (2) 4 (44)
Pars defect 6 (7) 8 (1) 1 (13)
Spondylosis 3 (4) 6 (1) 0 (0)
Thecal indentation 5 (6) 5 (1) 0 (0)
Schmorl’s node 3 (2) 4 (1) 2 (50)
Disc herniation 2 (2) 2 (<1) 0 (0)
Lateral recess stenosis (mild) 2 (2) 2 (<1) 0 (0)
Haemangioma 1 (1) 1 (<1) 0 (0)
Kyphosis (mild) 1 (1) 1 (<1) 0 (0)
Bone island 1 (1) 1 (<1) 0 (0)
Bony wedging 1 (1) 1 (<1) 0 (0)
MRI (N = 100 reports) N = 100 reports N = 1016 findings N = 149 findings
N (%)^ N (%)# N (%)α
Disc bulge 70 (70) 174 (17) 5 (3)
Facet joint arthropathy 70 (70) 152 (15) 17 (11)
Disc desiccation 44 (44) 69 (7) 9 (13)
Disc protrusion 46 (46) 62 (6) 1 (2)
Foraminal stenosis (mild) 36 (36) 61 (6) 0 (0)
Disc height loss 31 (31) 58 (6) 19 (33)
Canal stenosis (mild) 33 (33) 54 (5) 9 (17)
Nerve root irritation 33 (33) 52 (5) 5 (10)
General degeneration 34 (35) 50 (5) 23 (46)
Lumbar spine alignment 46 (46) 46 (5) 40 (87)
Thecal indentation 24 (24) 39 (4) 0 (0)
Annular fissure 27 (27) 30 (3) 0 (0)
Disc degenerative disease 20 (20) 23 (2) 1 (4)
Ligamentum flavum hypertrophy 8 (8) 16 (2) 0 (0)
Scoliosis (mild) 15 (15) 15 (2) 1 (7)
Spondylolisthesis (grade 1) 12 (12) 14 (1) 0 (0)
Congenital deformities 11 (11) 13 (1) 2 (15)
Sacroiliac joint degeneration 11 (11) 13 (1) 8 (62)
Lateral recess stenosis (mild) 11 (11) 11 (1) 1 (9)
Haemangioma 10 (10) 10 (1) 1 (10)
Lordosis 10 (10) 10 (1) 5 (50)
Modic changes (Type 2 or 3) 10 (10) 10 (1) 0 (0)
Signal change 7 (7) 10 (1) 1 (10)
Pars defect 6 (6) 6 (1) 0 (0)
Osteophytes 5 (5) 6 (1) 0 (0)
Schmorl’s node 5 (5) 6 (1) 1 (17)
Disc herniation 4 (4) 4 (<1) 0 (0)
Tarlov cyst 1 (1) 1 (<1) 0 (0)
Bony wedging 1 (1) 1 (<1) 0 (0)

Note all ‘alignment’ terms reproduced as written in the report, e.g. alignment, scoliosis, lordosis

^N = number of reports (% of all reports for same modality)

#N = number of times finding present (% of all findings for same modality)

αN = number of times finding reported as benign (% of same finding)

Likely clinically unimportant findings that were reported to be absent (e.g., ‘no disc bulge’) were in keeping with those that were reported to be present (S2 Table). S3 Table shows the findings that were considered likely clinically important by modality and in order of frequency. The most common finding for x-ray was fracture, for CT it was canal stenosis and for MRI it was foraminal stenosis.

Descriptors of likely clinically unimportant findings and their frequency

Overall, there were 50 different descriptors or groupings of descriptors used to describe likely clinically unimportant findings. The most frequent descriptors, used at least 10 times, overall and by imaging modality are shown in Fig 2. The most common descriptors were used to indicate the severity of the findings (e.g., mild, moderate, severe), and many, shown on the left of the vertical line, indicated a finding was minimal, minor, small or mild. Other commonly used descriptors were ‘degenerative’ and ‘broadbased’.

Fig 2. Most frequent descriptors of likely clinically unimportant findings overall (used 10 or more times), and by imaging modality*.

Fig 2

*Descriptors used less than ten times overall included chronic (n = 9), focal (n = 9), subtle (n = 8), advanced (n = 7), diffuse (n = 7), significant (n = 7), generalised (n = 6), limited (n = 6), large (n = 4), considerable (n = 3), decreased (n = 3), eccentric (n = 3), fatty (n = 3), gentle (n = 3), asymmetric (n = 2), florid (n = 2), partly (n = 2), stable (n = 2), crescentic (n = 1), further (n = 1), irregular (n = 1), low level (n = 1), more substantial (n = 1), non-compressive (n = 1), occasional (n = 1), prolific (n = 1), pseudo (n = 1).

Terms of uncertainty and their frequency

Table 3 shows the 22 most frequent groupings of terms of uncertainty. Despite the inclusion of more MRI reports, five groups of uncertain terms contained more terms in the CT modality than either MRI or X-ray, including the most common, ‘not shown’ (221 occurrences overall; X-ray n = 35, CT n = 104, MRI n = 82). Other groups with more occurrences in CT than X-ray or MRI were ‘appear’ (53 occurrences overall; X-ray n = 12, CT n = 24 and MRI n = 17), ‘may be’ (15 occurrences; X-ray n = 2, CT n = 7, MRI n = 6), ‘Cannot be assessed’ (11 occurrences; CT n = 8 and MRI n = 3) and ‘no abnormal’ (7 occurrences; CT n = 4 and MRI n = 3).

Table 3. Terms of uncertainty and their frequency overall and by imaging modality*.

Terms of uncertainty XR CT MRI TOTAL
Not shown; Seen; identified; noted; detected; demonstrated; evident; shown; show; shows; of note 35 104 82 221
No significant; no suspicious; no obvious; no undue; no gross; no specific; without significant; no appreciable; no convincing; no definable; no definite; no other significant; no active 8 35 53 96
No evidence of; without evidence of; no features of; no signs of; without convincing evidence of; without clear evidence of; does not show any evidence of; no further evidence of; no specific evidence; with evidence of; evidence of 10 30 39 79
Appear; appears; apparent; appearing; in appearance; has the appearance of 12 24 17 53
Very mild; very slight; very minimal; very marginal; slightly; perhaps slightly; a degree of; only mildly; only very; only limited; only very minimal; looks to be slightly; there may be slight; slightly; relatively; slight; a little; limited; lower limits of 0 9 31 40
Significant; substantial; significantly; significance; no substantial 2 8 20 30
Possible; possibly; probably; potentially; potential; possibility of; perhaps 2 10 16 28
Suggests; suggestion of; appearances suggest; suggested; suggestion; suggestive of; suggesting; would suggest; no suggestion of 3 6 12 21
Some; some impression of; sometimes; somewhat 3 6 8 17
May; may be; may represent; may be a source; may be contributing to; may benefit from; may contribute to; may have; may just; may well respond; might be; can be 2 7 6 15
Likely; most likely; almost certainly; thought to reflect; this may be due to; 3 3 7 13
Essentially; reasonably; generally; usually; equivocal 0 4 8 12
Cannot be assessed; cannot be further assessed; challenging to evaluate; it is difficult to see; clinical significance unknown; as far as can be ascertained; not well demonstrated; within the limitations; I suspect; is thought to reflect; of concern 0 8 3 11
If the patient has clinical features of; if the patient is not benefitting; is there?; if there is; recommend clinical correlation; should that fit; favoured to related to; does this correlate clinically; if this is; in an attempt to; 1 2 7 10
Could be considered; could be entertained; could be performed; could pertain to; could represent; consideration should be given 0 2 5 7
No abnormal; no abnormality 0 4 3 7
Not convincing of; disproportionate to; do not necessarily; not thought to be; in the absence of; no specific cause; do not show any 1 3 3 7
Suspicious; appearances suspicious 1 1 4 6
At least; allowing for 0 3 2 5
Contribute; contributes; consistent with; most in keeping with; identifiable 0 2 3 5
Quite; quite substantial; particularly 0 1 3 4
Would likely account for; which would be; which would account for; would account for 0 2 2 4

*Terms were grouped based on similar meanings

Discussion

We found that lumbar spine X-Ray, CT and MRI reports describe a large number of findings overall. While most were judged to be of unlikely clinical relevance, less than a fifth were explicitly reported to be benign. Descriptors such as minimal, minor, small, mild and degenerative were common and likely intended to convey a lack of clinical relevanceVague modifiers and hedging vocabulary were also common across reports.

There was variation in the proportion of each finding reported as benign, for example, ‘lumbar spine alignment’ was commonly reported as normal while some terms such as spondylolisthesis were not reported as benign in any report where it was present. Terms such as spondylolisthesis are often considered ‘pathological’ [2] which could explain why there were less likely to be explicity identified as benign. Our findings are in keeping with the single previous study that has investigated the content of lumbar spine X-ray reports [2]. We found that these issues also apply to complex imaging (CT and MRI)reports.

Previous studies have indicated that many commonly reported, likely clinically unimportant findings, such as disc bulge and disc or facet joint degeneration, are likely to be misconstrued by patients [9, 18, 23]. An online survey designed to elicit consumer understanding of terms commonly used in lumbar spine imaging reports found these terms were deemed to be serious and likely to make the majority of respondents concerned about persistence of pain [9]. Participants in another online survey also reported lower expectations of recovery and higher perceived seriousness and need for surgery if they were deemed to have a‘disc bulge’ or ‘degeneration’ as the cause of their pain versus other diagnostic labels such as an ‘episode of back pain’ or ‘lumbar sprain’ [23].

General practitioners also want clear explanations for terms found in lumbar spine imaging reports, including their clinical relevance [24]. In another study that replaced terms considered to increase patient concern with alternative less concerning terms were interpreted as showing less severe disease by general orthopaedic surgeons, orthopaedic residents and physiotherapists but not spinal surgeons [6]. Similarly, for all groups except spinal surgeons there was also a trend away from recommendations for invasive treatments such as injections and surgery, and a lower perceived likelihood of the patient requiring surgical intervention.

Strengths and limitations

We performed a content analysis of a random sample of lumbar spine plain X-ray, CT and MRI imaging reports obtained from a large radiology service provider in Victoria, Australia, and extracted all findings until data saturation was reached for each imaging modality. It is therefore likely that our results are generalisable to other providers and settings in Australia.

Decisions regarding whether findings were likely clinically important or unimportant were made independently by two authors, based a priori on published evidence and report context. Equivocal findings and differences of opinion between the author team were resolved by discussion. These data are so that readers can make their own judgments about the validity of our decisions. Similarly two authors also independently determined whether or not each finding was reported to be benign and disagreements were resolved by discussion with all authors.

We were intentionally conservative in how we categorised the likely clinical importance of imaging findings. For example, we categorised moderate to severe canal stenosis as likely clinically important as this is widely accepted in clinical practice [25]. However a recent study found that, even in combination with other degenerative findings, canal stenosis may not have a clinically important association with low back pain [4]. It is also possible that some findings we coded as likely clinically unimportant were also miscoded.

The reports were extracted in full from an electronic database but we did not have access to the imaging requests other than what was included in the reports. While the clinical notes from requests are usually included in the reports in our setting, we cannot exclude the possibility that the referring clinician provided further information by other means. We were also unable to verify the report findings, or their adherence to standard lumbar spine reporting nomenclature [26], as we did not have access to the images themselves.

Implications for practice

Although imaging is only one part of a comprehensive clinical assessment, primary care clinicians can find it difficult to understand the terminology in reports and assess the clinical relevance of findings [24]. This may lead to misinterpretation of the findings by both referring clinicians and patients and result in unwarranted anxiety, more complex imaging, overdiagnosis and unnecessary treatment. As patients are increasingly able to directly access their imaging findings, it becomes even more imperative for radiologists to consider how to report imaging findings in a way that minimises misinterpretation and uncertainty about the relevance of the reported findings.

Implications for research

Further research is needed to determine the most effective and comprehensible methods for reporting lumbar spine imaging findings in people with low back pain. Co-design with relevant stakeholders including radiologists, clinicians and patients of a standard reporting template, that appropriately considers the importance of findings, how they are described and how certainty is qualified, is one possible approach. The template could also provide guidance about where it might be appropriate to use vague modifiers or hedging terms [16]. Evaluation of such a template could consider whether the findings are understood by clinicians and patients as intended and whether it improves the quality of care compared with usual reporting. Similar to pathology reports that provide normal ranges that can vary by age, imaging reports could also include a ‘reference range’ of findings that are normal for age.

However, studies that have included explicit information about the age-related prevalence of common findings in asymptomatic populations have reported conflicting results. While early studies identified promising reductions in referrals and repeat imaging [7] and reduced opioid prescription [8] a large multi-centre randomised trial involving 250,401 participants found that a small shift in prescribing only [2729]. Changing the language to be less threatening also shows promise [6, 17] as does inclusion of explicit evidence-based management advice [30].

Conclusion

Lumbar spine imaging reports frequently include findings that are unlikely to be clinically important without explicitly indicating they are benign. A wide variety of descriptors and uncertainty terms are used to put the findings in context that may be indirectly intended to convey the lack of clinical relevance of the findings. Clearer, more explicit language may reduce misconceptions about the relevance of lumbar spine imaging findings in people with low back pain and improve quality of care and health-related outcomes.

Supporting information

S1 Table. Likely clinically unimportant findings based upon the published evidence for the relevance of imaging findings*.

*Bracketed numbers indicate the studies that determined the findings are likely clinically unimportant.

(DOCX)

pone.0297911.s001.docx (149.2KB, docx)
S2 Table. Clinically unimportant findings that were reported to exclude the presence of the pathology by number of reports and overall frequency of the finding.

(DOCX)

pone.0297911.s002.docx (17.5KB, docx)
S3 Table. Frequency and type of likely clinically important findings by modality.

^N = number of reports (% of all reports for same modality). #N = number of times finding present (% of all findings for same modality). αN = number of times finding reported as present (% of same finding).

(DOCX)

pone.0297911.s003.docx (21.6KB, docx)

Acknowledgments

Thank you to Marco Polidori, our research assistant for the report extraction and to iMed radiology network, who kindly provided a copy of the anonymised reports.

Data Availability

“We have uploaded our extracted data to Open Science Framework (available at https://osf.io/ut9am/). Due to our legal agreement with i-Med, this does not include the original radiology reports. However, requests to access these can be obtained by emailing research@i-med.com.au and then forwarded to the Monash University Human Research Ethics Committee (MUHREC) (muhrec@monash.edu).”

Funding Statement

This work was supported by a National Health and Medical Research Council (NHMRC) Program Grant: Using healthcare wisely: reducing inappropriate use of tests and treatments (APP1113532) 2017-21. RB is funded by an Australian National Health and Medical Research Council (NHMRC) L3 Investigator Fellowship (APP1194483) www.nhmrc.org.au The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Brinjikji W, Luetmer PH, Comstock B, Bresnahan BW, Chen LE, Deyo RA, et al. Systematic literature review of imaging features of spinal degeneration in asymptomatic populations. AJNR Am J Neuroradiol. 2015;36(4):811–6. doi: 10.3174/ajnr.A4173 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Thompson P, Carr E. Content analysis of general practitioner-requested lumbar spine X-ray reports. Br J Radiol. 2007;80(959):866–71. doi: 10.1259/bjr/11719131 [DOI] [PubMed] [Google Scholar]
  • 3.de Schepper EI, Koes BW, Veldhuizen EF, Oei EH, Bierma-Zeinstra SM, Luijsterburg PA. Prevalence of spinal pathology in patients presenting for lumbar MRI as referred from general practice. Fam Pract. 2016;33(1):51–6. doi: 10.1093/fampra/cmv097 [DOI] [PubMed] [Google Scholar]
  • 4.Kasch R, Truthmann J, Hancock MJ, Maher CG, Otto M, Nell C, et al. Association of Lumbar MRI Findings with Current and Future Back Pain in a Population-based Cohort Study. Spine. 2022;47(3):201–11. doi: 10.1097/BRS.0000000000004198 [DOI] [PubMed] [Google Scholar]
  • 5.Chen L, Perera RS, Radojcic MR, Beckenkamp PR, Ferreira PH, Hart DJ, et al. Association of Lumbar Spine Radiographic Changes With Severity of Back Pain-Related Disability Among Middle-aged, Community-Dwelling Women. JAMA Netw Open. 2021;4(5):e2110715. doi: 10.1001/jamanetworkopen.2021.10715 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Rajasekaran S, Dilip Chand Raja S, Pushpa BT, Ananda KB, Ajoy Prasad S, Rishi MK. The catastrophization effects of an MRI report on the patient and surgeon and the benefits of ’clinical reporting’: results from an RCT and blinded trials. Eur Spine J. 2021. doi: 10.1007/s00586-021-06809-0 [DOI] [PubMed] [Google Scholar]
  • 7.Fried JG, Andrew AS, Ring NY, Pastel DA. Changes in Primary Care Health Care Utilization after Inclusion of Epidemiologic Data in Lumbar Spine MR Imaging Reports for Uncomplicated Low Back Pain. Radiology. 2018;287(2):563–9. doi: 10.1148/radiol.2017170722 [DOI] [PubMed] [Google Scholar]
  • 8.McCullough BJ, Johnson GR, Martin BI, Jarvik JG. Lumbar MR imaging and reporting epidemiology: do epidemiologic data in reports affect clinical management? Radiology. 2012;262(3):941–6. doi: 10.1148/radiol.11110618 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Farmer C, Connor DA, Lee H, McCaffery K, Maher C, Newell D, et al. Consumer understanding of terms used in imaging reports requested for low back pain: a cross-sectional survey. BMJ Open. 2021;11(9):e049938. doi: 10.1136/bmjopen-2021-049938 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Farmer CI, Bourne AM, O’Connor D, Jarvik JG, Buchbinder R. Enhancing clinician and patient understanding of radiology reports: a scoping review of international guidelines. Insights into Imaging. 2020;11(1):62. doi: 10.1186/s13244-020-00864-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.The Royal Australian and New Zealand College of Radiologists. Clinical Radiology written report guide. Sydney, Australia: The Royal Australian and New Zealand College of Radiologists; 2017.
  • 12.The Royal College of Radiologists. Standards for interpretation and reporting of imaging investigations 2nd ed. London, UK: The Royal College of Radiologists; 2018.
  • 13.Canadian Association of Radiologists. Standard for communication of diagnostic imaging finding. Ottawa, Canada: Canadian Association of Radiologists; 2010.
  • 14.Wallis A, McCoubrie P. The radiology report—are we getting the message across? Clin Radiol. 2011;66(11):1015–22. doi: 10.1016/j.crad.2011.05.013 [DOI] [PubMed] [Google Scholar]
  • 15.Rosenkrantz AB, Kiritsy M, Kim S. How "consistent" is "consistent"? A clinician-based assessment of the reliability of expressions used by radiologists to communicate diagnostic confidence. Clinical Radiology. 2014;69(7):745–9. doi: 10.1016/j.crad.2014.03.004 [DOI] [PubMed] [Google Scholar]
  • 16.Rosenkrantz AB. Differences in Perceptions Among Radiologists, Referring Physicians, and Patients Regarding Language for Incidental Findings Reporting. AJR Am J Roentgenol. 2017;208(1):140–3. doi: 10.2214/AJR.16.16633 [DOI] [PubMed] [Google Scholar]
  • 17.Karran EL, Medalian Y, Hillier SL, Moseley GL. The impact of choosing words carefully: an online investigation into imaging reporting strategies and best practice care for low back pain. PeerJ. 2017;5:e4151. doi: 10.7717/peerj.4151 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Sloan TJ, Walsh DA. Explanatory and diagnostic labels and perceived prognosis in chronic low back pain. Spine (Phila Pa 1976). 2010;35(21):E1120–5. doi: 10.1097/BRS.0b013e3181e089a9 [DOI] [PubMed] [Google Scholar]
  • 19.Audi S, Pencharz D, Wagner T. Behind the hedges: how to convey uncertainty in imaging reports. Clin Radiol. 2021;76(2):84–7. doi: 10.1016/j.crad.2020.08.003 [DOI] [PubMed] [Google Scholar]
  • 20.Haas R, Farmer C, Bourne A, O’Connor D, Gelber N, Jarvik J, et al. 79 Assessment of the content of lumbar spine plain radiograph reports requested by emergency doctors: a pilot study. Royal Society of Medicine; 2019.
  • 21.Francis JJ, Johnston M, Robertson C, Glidewell L, Entwistle V, Eccles MP, et al. What is an adequate sample size? Operationalising data saturation for theory-based interview studies. Psychology & Health. 2010;25(10):1229–45. doi: 10.1080/08870440903194015 [DOI] [PubMed] [Google Scholar]
  • 22.Khorasani R, Bates DW, Teeger S, Rothschild JM, Adams DF, Seltzer SE. Is Terminology Used Effectively to Convey Diagnostic Certainty in Radiology Reports? Academic Radiology. 2003;10(6):685–8. doi: 10.1016/s1076-6332(03)80089-2 [DOI] [PubMed] [Google Scholar]
  • 23.O’Keeffe M, Ferreira GE, Harris IA, Darlow B, Buchbinder R, Traeger AC, et al. Effect of diagnostic labelling on management intentions for non-specific low back pain: A randomized scenario-based experiment. Eur J Pain. 2022;26(7):1532–45. doi: 10.1002/ejp.1981 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Espeland A, Baerheim A. General practitioners’ views on radiology reports of plain radiography for back pain. Scandinavian Journal of Primary Health Care. 2007;25(1):15–9. doi: 10.1080/02813430600973459 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Jarvik JG, Comstock BA, James KT, Avins AL, Bresnahan BW, Deyo RA, et al. Lumbar Imaging With Reporting Of Epidemiology (LIRE)—Protocol for a pragmatic cluster randomized trial. Contemp Clin Trials. 2015;45(Pt B):157–63. doi: 10.1016/j.cct.2015.10.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Fardon DF, Williams AL, Dohring EJ, Murtagh FR, Gabriel Rothman SL, Sze GK. Lumbar disc nomenclature: version 2.0: recommendations of the combined task forces of the North American Spine Society, the American Society of Spine Radiology, and the American Society of Neuroradiology. Spine (Phila Pa 1976). 2014;39(24):E1448–65. doi: 10.1097/BRS.0b013e3182a8866d [DOI] [PubMed] [Google Scholar]
  • 27.Jarvik JG, Meier EN, James KT, Gold LS, Tan KW, Kessler LG, et al. The Effect of Including Benchmark Prevalence Data of Common Imaging Findings in Spine Image Reports on Health Care Utilization Among Adults Undergoing Spine Imaging: A Stepped-Wedge Randomized Clinical Trial. JAMA Network Open. 2020;3(9):e2015713–e. doi: 10.1001/jamanetworkopen.2020.15713 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Suri P, Meier EN, Gold LS, Marcum ZA, Johnston SK, James KT, et al. Providing Epidemiological Data in Lumbar Spine Imaging Reports Did Not Affect Subsequent Utilization of Spine Procedures: Secondary Outcomes from a Stepped-Wedge Randomized Controlled Trial. Pain Med. 2021;22(6):1272–80. doi: 10.1093/pm/pnab065 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Marcum ZA, Gold LS, James KT, Meier EN, Turner JA, Kallmes DF, et al. Effects of Including Epidemiologic Data in Lumbar Spine Imaging Reports on Prescribing Non-Opioid Medications for Pain. Journal of General Internal Medicine. 2021. doi: 10.1007/s11606-021-06627-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Farmer C, Bourne A, Haas R, Wallis J, O’Connor D, Buchbinder R. Can modifications to how medical imaging findings are reported improve quality of care? A systematic review. Clin Radiol. 2022;77(6):428–35. doi: 10.1016/j.crad.2022.02.011 [DOI] [PubMed] [Google Scholar]

Decision Letter 0

Aloysius Gonzaga Mubuuke

4 Sep 2023

PONE-D-23-14641Are clinically unimportant findings qualified as benign in lumbar spine imaging reports? A content analysis of plain X-ray, CT and MRI reportsPLOS ONE

Dear Dr. Farmer,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Oct 19 2023 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Aloysius Gonzaga Mubuuke

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at 

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Please provide additional details regarding participant consent. In the ethics statement in the Methods and online submission information, please ensure that you have specified (1) whether consent was informed and (2) what type you obtained (for instance, written or verbal, and if verbal, how it was documented and witnessed). If your study included minors, state whether you obtained consent from parents or guardians. If the need for consent was waived by the ethics committee, please include this information.

If you are reporting a retrospective study of medical records or archived samples, please ensure that you have discussed whether all data were fully anonymized before you accessed them and/or whether the IRB or ethics committee waived the requirement for informed consent. If patients provided informed written consent to have data from their medical records used in research, please include this information.

3. We note that you have indicated that data from this study are available upon request. PLOS only allows data to be available upon request if there are legal or ethical restrictions on sharing data publicly. For more information on unacceptable data access restrictions, please see http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions. 

In your revised cover letter, please address the following prompts:

a) If there are ethical or legal restrictions on sharing a de-identified data set, please explain them in detail (e.g., data contain potentially sensitive information, data are owned by a third-party organization, etc.) and who has imposed them (e.g., an ethics committee). Please also provide contact information for a data access committee, ethics committee, or other institutional body to which data requests may be sent.

b) If there are no restrictions, please upload the minimal anonymized data set necessary to replicate your study findings as either Supporting Information files or to a stable, public repository and provide us with the relevant URLs, DOIs, or accession numbers. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories.

We will update your Data Availability statement on your behalf to reflect the information you provide.

4. Please include captions for your Supporting Information files at the end of your manuscript, and update any in-text citations to match accordingly. Please see our Supporting Information guidelines for more information: http://journals.plos.org/plosone/s/supporting-information. 

5. Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

Additional Editor Comments:

The paper addresses an important area trying to show how radiologists make their decisions using different imaging modalities in lumbar images. In addition to the comments from reviewers, the authors should strengthen the discussion by clearly explaining the implication of the findings to clinical practice in patients presenting with lumbar issues for imaging. In addition, proof-read the entire paper to refine the language.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Thank you for the opportunity to review this article. The research question is if radiologists label normal findings in imaging (X-Ray, CT, and MRI) as such in their report. For this, 200 of each modality from patients in which the imaging was indicated due to lower back pain, were acquired. The reports analyzed with the question, if the normality of likely clinically unimportant findings was reported. The main finding is that only a minority of “most likely clinically unimportant” findings were not labeled as such, which, as they argue, could lead clinicians and patients to overestimate the severity of findings and lead to potentially unwarranted escalation of therapies.

The introduction clearly introduces the topic and explains the clinical relevancy in an encompassing manner. The methods are clearly described. The reporting of the is generally very detailed and transparent of the findings, although some aspects require additional explanation.

In table 1 an extra row should be added, regarding how many percent of the clinically unimportant findings were on average reported as benign. Although this is similar to “Number of reports that qualified all reported clinically unimportant findings as benign, N(%)”, it does not allow conclusions about average percent of findings that were reported as benign.

In table 2 a brief explanation should be added, that cases of potentially clinically significant findings were excluded. Although it is described in the methods, the table adds ambiguity which gradings of the reported features were included. For example, just mentioning Foraminal stenosis, might lead readers to the conclusion that all grades of foraminal stenosis were evaluated as likely clinically unimportant. The mentioning of “Lumbar spine alignment” and “Lordosis” or” Scoliosis” leads to confusion, what exactly is meant with the term “Lumbar spine alignment”.

The discussion could be improved by discussing potential reasons for the percentage differences in labeling of features as benign. For example, normal lumbar spinal alignment and Pedicles/pars defect were report as benign 100% of the time, while no occurrence “Spondylolisthesis” was reported as benign.

From an orthopedics surgeons’ point of view, the assertion that changes like Modic Typ3, Nerve root irritation, Facet joint arthropathie, or Spondylolisthesis are benign or clinically unimportant is difficult to grasp. These might be not significantly associated with low back pain on a population’s levels, as shown in some studies, but for individual patients, they can be indicative of underlying pathology that may require intervention (conservative or otherwise).

In clinical practice, a patient’s history, physical examination findings, and imaging studies must be considered together to make informed decisions on management. Furthermore, some of these changes, might be associated with mechanical instability or degenerative processes that could progress over time. As such, dismissing them as benign or clinically unimportant without a thorough evaluation and considering the patient's unique circumstances could potentially lead to missed opportunities for early intervention and management, which in turn could impact the quality of life and functional outcomes for the patient. Given that radiologist often have limited clinical data about the patient, it seems reasonable, that radiologist would not reported these findings explicitly as benign (or normal, etc.). The discussion should aim to address these concerns or better specify the population of patients this study is aimed to cover.

An additional limitation of this study that should be reported is the missing of clinical data.

Overall the manuscript is well written and answers that set out research question. After the comments are adressed, I can recommend this manuscript for publication.

Reviewer #2: Comments

Abstract line 35; qualifying that they are benign

103-104 and this study was conducted over the 104 following year

114 To ensure anonymity of the reporting radiologist, - add coma

125; serious finding is vague; quantify or substantiate on the meaning of vague.

143-146; why did physiotherapists and rheumatologists independently determined whether finding was likely clinically unimportant or important? Why not the physicians such as orthopedic surgeons or referring clinicians? Was this appropriate?

199: The 95 were excluded; Reasons for exclusion?

212; use age range instead of median age

217; use mean/average instead of median.

264; The most common finding across imaging modalities was fracture, while for CT and MRI it was foraminal and canal stenosis and nerve root impingement; So was it fracture for MRI and CT too?

297; This study found lumbar spine X-Ray, CT and MRI reports include a large number of findings.; write; This study found that; also large number of findings is vague; do you mean “irrelevant” findings or?

383; Avoid the word- serious

408: Provided; not provide.

Overall: Try to summarise This study found lumbar spine X-Ray, CT and MRI reports include a large number of findings. the discussion; it is too long.

Check all grammatical errors and correct them before submitting.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: Yes: Rita Nassanga

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2024 Mar 13;19(3):e0297911. doi: 10.1371/journal.pone.0297911.r002

Author response to Decision Letter 0


12 Dec 2023

Thank you for your constructive and considered feedback, which we have incorporated into our manuscript. We have reviewed grammar and edited for clarity, and significantly reduced the discussion length. We have also refined formatting and labelling in accordance with PlosOne policies. Please find our full response below and in the attached 'Response to reviewers' document.

We thank the reviewers for their insightful comments on our manuscript, and answer their queries below.

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

________________________________________

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

________________________________________

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

________________________________________

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

________________________________________

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Thank you for the opportunity to review this article. The research question is if radiologists label normal findings in imaging (X-Ray, CT, and MRI) as such in their report. For this, 200 of each modality from patients in which the imaging was indicated due to lower back pain, were acquired. The reports analyzed with the question, if the normality of likely clinically unimportant findings was reported. The main finding is that only a minority of “most likely clinically unimportant” findings were not labeled as such, which, as they argue, could lead clinicians and patients to overestimate the severity of findings and lead to potentially unwarranted escalation of therapies.

The introduction clearly introduces the topic and explains the clinical relevancy in an encompassing manner. The methods are clearly described. The reporting of the is generally very detailed and transparent of the findings, although some aspects require additional explanation.

In table 1 an extra row should be added, regarding how many percent of the clinically unimportant findings were on average reported as benign. Although this is similar to “Number of reports that qualified all reported clinically unimportant findings as benign, N(%)”, it does not allow conclusions about average percent of findings that were reported as benign.

We have added a row in Table 1, “Mean percent of clinically unimportant findings described as benign per report” as requested. We also reordered one row on the table.

In table 2 a brief explanation should be added, that cases of potentially clinically significant findings were excluded. Although it is described in the methods, the table adds ambiguity which gradings of the reported features were included. For example, just mentioning Foraminal stenosis, might lead readers to the conclusion that all grades of foraminal stenosis were evaluated as likely clinically unimportant. The mentioning of “Lumbar spine alignment” and “Lordosis” or” Scoliosis” leads to confusion, what exactly is meant with the term “Lumbar spine alignment”.

Thank you for this feedback. We have added descriptors to the relevant terms (for example, ‘Scoliosis (mild)’ to make this clearer to readers, and added a note regarding alignment terms, which were reproduced as they were written in the report.

The discussion could be improved by discussing potential reasons for the percentage differences in labeling of features as benign. For example, normal lumbar spinal alignment and Pedicles/pars defect were report as benign 100% of the time, while no occurrence “Spondylolisthesis” was reported as benign.

We have revised the discussion and added a paragraph regarding the potential reasons for the range in labelling of findings as benign as suggested (lines 346-350 in track changes manuscript).

From an orthopedics surgeons’ point of view, the assertion that changes like Modic Typ3, Nerve root irritation, Facet joint arthropathie, or Spondylolisthesis are benign or clinically unimportant is difficult to grasp. These might be not significantly associated with low back pain on a population’s levels, as shown in some studies, but for individual patients, they can be indicative of underlying pathology that may require intervention (conservative or otherwise).

In clinical practice, a patient’s history, physical examination findings, and imaging studies must be considered together to make informed decisions on management. Furthermore, some of these changes, might be associated with mechanical instability or degenerative processes that could progress over time. As such, dismissing them as benign or clinically unimportant without a thorough evaluation and considering the patient's unique circumstances could potentially lead to missed opportunities for early intervention and management, which in turn could impact the quality of life and functional outcomes for the patient. Given that radiologist often have limited clinical data about the patient, it seems reasonable, that radiologist would not reported these findings explicitly as benign (or normal, etc.). The discussion should aim to address these concerns or better specify the population of patients this study is aimed to cover.

We agree that it is important to comprehensively assess an individual patient and we would expect that the GP/primary care practitioner would do so. We have made it clearer that in primary care most of these findings would be considered benign. Our judgements about relevance of individual findings were based upon the available published evidence that these findings are usually benign – as we indicated, both cross-sectional and longitudinal population-based studies have found that the changes we included as benign do not appear to be associated with either current or future back pain.

This is outlined in the methods (lines 143-158) and discussion (lines 375-381). In the following paragraph in the discussion (lines 384-389) we also indicated that we were intentionally conservative in our judgements. As well we provided these data in full (Supplementary Table 1) so that the reader can also make their own judgments about our assessments. We agree that our judgements were based upon population-based data, which is the highest form of evidence for looking at these types of associations and together with clinical judgement can be therefore extrapolated to individual patients in most instances. We agree that in a specialist setting, the specialist can review the films and make their own interpretation for individual patients.

An additional limitation of this study that should be reported is the missing of clinical data.

We have added a sentence clarifying this (lines 392-394).

Overall the manuscript is well written and answers that set out research question. After the comments are adressed, I can recommend this manuscript for publication.

Thank you for these helpful comments.

Reviewer #2: Comments

Abstract line 35; qualifying that they are benign

Corrected, thank you.

103-104 and this study was conducted over the following year

Added as requested.

114 To ensure anonymity of the reporting radiologist, - add coma

Added as requested.

125; serious finding is vague; quantify or substantiate on the meaning of vague.

We have expanded the examples of serious findings.

143-146; why did physiotherapists and rheumatologists independently determined whether finding was likely clinically unimportant or important? Why not the physicians such as orthopedic surgeons or referring clinicians? Was this appropriate?

Findings were determined as likely clinically unimportant or important based on the available evidence (Supplementary table 1). The research authors were physiotherapists and a rheumatologist. Their background was included as context about who was performing the study and the judgements. All are expert clinicians with expertise in back pain so we consider this appropriate.

199: The 95 were excluded; Reasons for exclusion?

We have clarified in the text that the reasons for exclusion are provided in Figure 1 (top box, right side of figure).

212; use age range instead of median age

We prefer to report both median and range for age.

217; use mean/average instead of median.

We prefer to report median rather than mean as the data were not normally distributed.

264; The most common finding across imaging modalities was fracture, while for CT and MRI it was foraminal and canal stenosis and nerve root impingement; So was it fracture for MRI and CT too?

We have clarified that the most common finding for x-ray was fracture, for CT it was canal stenosis, and for MRI it was foraminal stenosis.

297; This study found lumbar spine X-Ray, CT and MRI reports include a large number of findings.; write; This study found that;

We have amended this as suggested.

also large number of findings is vague; do you mean “irrelevant” findings or?

We have clarified this sentence to “This study found that lumbar spine X-Ray, CT and MRI reports describe a large number of imaging findings overall.”

383; Avoid the word- serious

We prefer to retain this word as it is based upon its use in clinical guidelines and our prior cited papers.

408: Provided; not provide.

Amended as suggested.

Overall: Try to summarise This study found lumbar spine X-Ray, CT and MRI reports include a large number of findings. the discussion; it is too long.

Check all grammatical errors and correct them before submitting.

We have shortened the discussion and checked and corrected grammatical errors.

Decision Letter 1

Aloysius Gonzaga Mubuuke

16 Jan 2024

Are clinically unimportant findings qualified as benign in lumbar spine imaging reports? A content analysis of plain X-ray, CT and MRI reports

PONE-D-23-14641R1

Dear Dr. Farmer,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Aloysius Gonzaga Mubuuke

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Acceptance letter

Aloysius Gonzaga Mubuuke

4 Mar 2024

PONE-D-23-14641R1

PLOS ONE

Dear Dr. Farmer,

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now being handed over to our production team.

At this stage, our production department will prepare your paper for publication. This includes ensuring the following:

* All references, tables, and figures are properly cited

* All relevant supporting information is included in the manuscript submission,

* There are no issues that prevent the paper from being properly typeset

If revisions are needed, the production department will contact you directly to resolve them. If no revisions are needed, you will receive an email when the publication date has been set. At this time, we do not offer pre-publication proofs to authors during production of the accepted work. Please keep in mind that we are working through a large volume of accepted articles, so please give us a few weeks to review your paper and let you know the next and final steps.

Lastly, if your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

If we can help with anything else, please email us at customercare@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Aloysius Gonzaga Mubuuke

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Table. Likely clinically unimportant findings based upon the published evidence for the relevance of imaging findings*.

    *Bracketed numbers indicate the studies that determined the findings are likely clinically unimportant.

    (DOCX)

    pone.0297911.s001.docx (149.2KB, docx)
    S2 Table. Clinically unimportant findings that were reported to exclude the presence of the pathology by number of reports and overall frequency of the finding.

    (DOCX)

    pone.0297911.s002.docx (17.5KB, docx)
    S3 Table. Frequency and type of likely clinically important findings by modality.

    ^N = number of reports (% of all reports for same modality). #N = number of times finding present (% of all findings for same modality). αN = number of times finding reported as present (% of same finding).

    (DOCX)

    pone.0297911.s003.docx (21.6KB, docx)

    Data Availability Statement

    “We have uploaded our extracted data to Open Science Framework (available at https://osf.io/ut9am/). Due to our legal agreement with i-Med, this does not include the original radiology reports. However, requests to access these can be obtained by emailing research@i-med.com.au and then forwarded to the Monash University Human Research Ethics Committee (MUHREC) (muhrec@monash.edu).”


    Articles from PLOS ONE are provided here courtesy of PLOS

    RESOURCES