Abstract
Abstract
Introduction
Acute low back pain (LBP) is a prevalent condition with various non-surgical treatment options, yet no comprehensive network meta-analysis has systematically compared their relative efficacy for pain and disability. This study aims to fill that gap by synthesising available evidence on the efficacy of different types of non-surgical interventions for acute LBP, such as various medications, manual therapies and education-based therapies. Our coprimary objectives are to (1) compare each active treatment to an inert reference for measures of LBP and related disability and (2) rank the efficacy of treatments.
Methods and analysis
We will conduct a systematic search across multiple databases, including grey literature, to identify randomised controlled trials evaluating non-surgical treatments for acute LBP. Eligible studies must report on pain and/or disability outcomes in adults. The risk of bias will be assessed using the Risk of Bias tool, and the certainty of evidence will be graded using CINeMA (Confidence in Network Meta-Analysis). We will use a frequentist network meta-analysis to pool standardised mean differences in pain and disability, employing random-effects models to account for heterogeneity. A qualitative analysis will assess study characteristics and transitivity, while a quantitative analysis will evaluate efficacy and inconsistency. Results will be presented using network geometry, p-scores, forest plots, funnel plots, Egger’s test, Q-statistics and league tables to visualise both direct and indirect evidence and to identify potential biases.
Ethics and dissemination
This review protocol does not involve any primary research with human participants, animal subjects or medical record review. Consequently, this work did not require approval from an institutional review board or ethics committee. Results will be submitted to a peer-reviewed journal and presented at conference(s). De-identified data will be made available in a public repository.
Keywords: Pain management, Exercise, Meta-Analysis, Back pain, COMPLEMENTARY MEDICINE
STRENGTHS AND LIMITATIONS OF THIS STUDY.
The study will include a comprehensive search strategy and employ standardised tools to assess trial quality and evidence certainty.
By focusing exclusively on acute non-specific low back pain, the review avoids heterogeneity that could arise from combining populations with varied durations of low back pain.
The inclusion of a broad range of non-surgical treatments enables a novel comparison of their relative efficacy using the network meta-analysis approach; yet this strategy relies on the transitivity assumption being met.
While the grouping of interventions is informed by available evidence and a multidisciplinary author team, this strategy may obscure small differences within categories such as usual care, inert treatments, exercise or others, potentially contributing to heterogeneity and inconsistency.
Introduction
Low back pain (LBP) is a common disorder that can lead to disability, diminished quality of life, inability to work and incurs high direct and indirect costs.1 2 The global prevalence of LBP ranged from 1% to 20% in 2019, with higher rates typically observed in high-income countries.3 Acute LBP, defined as symptoms lasting less than 6 weeks,4 is most often non-specific, meaning it lacks identifiable structural pathology or major neurological deficits.5,7 Compared with chronic LBP, acute LBP has a favourable prognosis and significantly improves within 6 weeks, yet its recovery slows over time.8 Among patients with acute LBP, about 69% experience a recurrence of LBP within a year,9 approximately one in four develop chronic symptoms,10 while about one in five develop persistent severe symptoms.11
Clinical practice guidelines recommend a wide range of non-surgical treatments for acute LBP, including non-steroidal anti-inflammatory drugs (NSAIDs), exercise, spinal manipulative therapy, heat and education as first-line interventions.12 13 However, guideline recommendations vary and often lack certainty, especially regarding the comparative efficacy of these interventions,1214,16 which limits clinicians’ decision-making. These limitations may be because meta-analyses mainly have focused on chronic LBP or examined mixed populations of acute, subacute and/or chronic LBP, thereby limiting insights specific to acute presentations.17,20 Furthermore, prior network meta-analyses (NMAs) have often examined interventions within single categories such as analgesic medications,21 acupuncture19 and exercises22 rather than across diverse treatment modalities. This fragmented approach does not allow for comparisons across different types of non-surgical treatments, leaving gaps in knowledge for clinicians treating acute LBP and hindering guideline synthesis and implementation.
There is a general research gap in evidence specific to acute LBP compared with chronic LBP. Illustratively, a recent meta-analysis of non-surgical treatments for LBP included 301 randomised controlled trials (RCTs), of which only 52 (17%) focused on acute LBP, while the remainder included chronic LBP or mixed-duration populations.23 The review results were not stratified by LBP duration, limiting insights into acute LBP. Similarly, a global review of 22 clinical practice guidelines for LBP found that while 13 (59%) provided recommendations for acute LBP, only 1 (5%) was focused exclusively on this population, with the remainder focusing on chronic or mixed durations.16 These findings indicate that although a body of trial evidence on acute LBP exists, it has received comparatively less attention in evidence synthesis and guideline development. As a result, clinicians and policymakers are left with less specific guidance for managing acute LBP, despite its high prevalence and potential for progression to chronic symptoms.9 11
There is a need for evidence that informs clinical decision-making and guideline development for acute LBP. Clinicians are often faced with choosing among multiple non-surgical treatment options, yet lack clear guidance on how these options compare in terms of efficacy and may encounter conflicting recommendations across clinical practice guidelines.16 An NMA that evaluates a broad range of interventions can offer relevant comparisons that inform clinical decision-making.24 Given their ability to incorporate both direct and indirect evidence across multiple treatments, NMAs are also increasingly used to inform clinical practice guidelines and health policy decisions.24,26 Identifying efficacious interventions for acute LBP may therefore help clinicians and other decision-makers to minimise pain and disability but also help prevent LBP from becoming chronic, reduce unnecessary care escalation and optimise healthcare resource use.27,29
Current practice often diverges from guideline recommendations.30 31 Many clinicians employ non-guideline-recommended treatments, such as passive modalities and opioids, which may reflect uncertainty about the most effective therapies, distrust of guidelines and/or a gap between research evidence and clinical implementation.27 32 33 Research suggests that non-concordant care is associated with higher healthcare utilisation, increased costs, worse outcomes and greater risk of developing chronic LBP.27,29 Despite the relatively good prognosis of acute LBP, this remains a potentially costly condition especially for those with high amounts of initial healthcare utilisation.34 These issues underscore the need to clarify efficacious treatments for acute LBP for clinicians, stakeholders creating practice guidelines and patients who seek to streamline their recovery and avoid unnecessary or costly interventions.
Rationale and objectives
We aim to conduct a systematic review and NMA to address the existing gap in the literature regarding the efficacy of various non-surgical treatments for acute non-specific LBP. Our co-primary objectives are to (1) rank the efficacy of treatments for LBP and disability and (2) compare each active treatment to an inert reference for measures of LBP and related disability. To our knowledge, this will be the first NMA to synthesise evidence specific to acute LBP across a broad range of non-surgical treatments.
Methods
Study reporting will adhere to The PRISMA Extension Statement for Reporting of Systematic Reviews Incorporating Network Meta-analyses of Health Care Interventions (PRISMA-NMA).35
Eligibility criteria
Given the need to meet the transitivity assumption in NMA, we plan to adopt a selective approach in defining study eligibility criteria. By focusing on specific age ranges, symptom duration, care settings and other factors, we aim to ensure that the interventions examined are comparable and applicable to a relatively homogeneous population.
Population
We will include studies of adults (≥18 years old) with a new episode of acute, non-specific LBP. Acute symptoms should be either defined as a duration of less than 6 weeks,4 stated in the eligibility criteria, or evident via the range in duration of symptoms in the baseline characteristics. Studies focused on symptoms of more than 6 weeks, for instance those that describe subacute or chronic LBP, will be excluded. We will also exclude studies that describe ‘acute’ or ‘recent onset’ symptoms without specifying the duration, as these terms can be ambiguous, considering some authors use a 12-week cut-off for acute LBP.36 37 We will not require a clean period prior to the episode as there is no consensus regarding the duration of such a washout, and this would be impractical to implement.38 Non-specific LBP will be defined as pain that is not attributed to a specific pathology or nerve compression.5,7
We will exclude studies involving surgical candidates and patients treated in acute care settings such as emergency departments, trauma care and urgent care clinics. Standardisation of the population and care setting aims.39 Standardisation of the population and care setting aims to maintain the transitivity assumption, as many non-pharmacological interventions are not available in acute settings. We will exclude patients with specific lumbar spine disorders such as radiculopathy, neurogenic claudication, inflammatory arthropathy or serious pathology like fracture, infection or neoplasm.5,7 Additionally, we will exclude animal studies and studies focused exclusively on populations with worker’s compensation injury, sick leave status or pregnancy, given potential differences in prognosis and/or available treatments.
Interventions
We will include any non-surgical interventions, such as medications, education, exercise, passive modalities, manual therapies, inert treatments like placebo or sham, usual care and others. Surgical interventions such as discectomy or fusion will be excluded. We will exclude studies that solely compare interventions within the same therapy type, such as different dosages or routes of administration of the same medication, or delivery approaches of a manual therapy, as our aim is to compare the efficacy of distinct therapy types. We will also exclude studies that allow patients to self-select treatments as opposed to being randomised.
Comparisons
We will include any available comparisons between eligible non-surgical interventions, as previously described, provided they do not involve surgical treatments or intervention arms with three or more combined therapies. Comparisons may include usual care, placebo/sham interventions, no treatment or active controls, allowing for a variety of study designs to be synthesised. This broad strategy aims to minimise comparator bias by incorporating data from optimal comparator groups as well as suboptimal comparators (ie, ineffective, minimal treatment or unknown effectiveness) that are often used in RCTs of LBP treatments.40
Outcome
Included studies must report at least one type of continuous measure for pain outcomes, such as the Numerical Pain Rating Scale or Visual Analogue Scale, and/or disability assessments, including the Oswestry Disability Index (ODI), Modified Oswestry or Roland-Morris Disability Questionnaire (RMDQ). We will require follow-up measures taken between 1 week and 3 months, as this aligns with the Cochrane Back Group’s definition of short-term follow-up41 and reflects our study’s aims. This time frame captures symptom recurrence while avoiding regression to the mean, considering that acute LBP typically shows significant improvement within 3 months.42 We will exclude studies that only report categorical outcomes (eg, mild/moderate/severe pain, levels of improvement) or unrelated measures (eg, straight leg raise, surgery rates, range of motion). Included studies must also provide or plot mean or median values along with a measure of variance (eg, SD, 95% CIs or range) for the reported outcomes, enabling data imputation or scraping. For studies less than 10 years post-publication with missing data, we will reach out to the authors’ institutional email address a maximum of two times to attempt to obtain this de-identified raw outcomes data.
Study design
We will include only RCTs to ensure robust evidence, excluding observational studies.
Other
We will exclude studies that do not provide an English abstract or are published in non-English languages. This is to avoid issues with duplicate data extraction, quality scoring and overall interpretability by the entire research team.
Information sources
We will search PubMed, Scopus, Web of Science and CINAHL from inception to the date of query. The starting date of eligible studies will therefore be unrestricted, and the maximum end date will correspond with the final search date, projected to be 15 August 2025. We will conduct citation tracking of included articles and have coauthors to submit relevant articles from their personal collections which were missed by the aforementioned searches. We will search the reference lists of previous systematic reviews of RCTs on the topic acute LBP.17,1921 22 43 44
Search strategy
The search will focus on RCTs of non-surgical treatments examining acute LBP and was created with a research librarian. An example search strategy for MEDLINE, via PubMed, is shown in online supplemental file 1.
Selection process
Article selection and de-duplication will be conducted using Covidence (Covidence systematic review software, Veritas Health Innovation, Melbourne, Australia, 2024). Articles will first be screened by title and abstract, then by full text for inclusion. Two authors will independently conduct each phase of article screening. Disagreements will be resolved by discussion. The article selection process will be visualised using a flow diagram.
Data collection process
Data extraction will be conducted independently by two authors, who will discuss and resolve any potential discrepancies, involving a third author if necessary. The following data items will be extracted into a Microsoft Excel workbook: study characteristics, including the author surname and publication year; the duration of LBP for inclusion; the age of patients; the total number of patients enrolled; the types of interventions used in each group; and the outcome assessments used for measuring pain and disability. Finally, we will extract mean post-treatment pain and disability scores along with their SDs, with the option to impute missing data if needed, for the outcome window closest to 4 weeks.
Considering we are only including randomised groups, we will conduct NMA using post-treatment scores rather than change scores. While both approaches are valid, post-treatment scores may provide more conservative estimates.45 Additionally, using post-treatment scores may streamline the data collection process and require fewer calculations, as change scores are often missing.20
Data items
Our team will strategically lump and split treatments into nodes based on both the similarity of the interventions and their efficacy, as guided by prior systematic reviews. For instance, we will consider using drug classes as individual nodes (eg, opioids, NSAIDs, corticosteroids) rather than specific medications within each class. This approach is supported by findings from a recent NMA,21 which suggested a lack of meaningful differences in efficacy between individual analgesics for acute LBP. Additionally, we will group placebo and sham therapies into a single node, as a meta-analysis demonstrated no clinically relevant between-group differences in pain outcomes among various sham interventions for LBP.46 Similarly, previous systematic reviews justify lumping different exercise therapies, given the lack of significant outcome differences between specific exercise approaches.22 47 This strategy aims to maintain a connected network, avoiding a fragmented structure that could complicate analyses. A tentative description of each possible treatment node is listed in table 1.
Table 1. Taxonomy for lumping with proposed nodes. This list is iterative and may change depending on the therapies encountered among included studies.
Node | Definition and examples | Reference(s) supporting lumping |
---|---|---|
Acetaminophen | Non-opioid analgesic used for pain | NA |
Acupuncture | Insertion of one or more filiform needles into acupoints or tender points (ie, ‘Ashi’ points) based on Traditional Chinese Medicine principles | 67 |
Anticonvulsants | Adjuvant analgesics used for pain such as gabapentin and pregabalin | 21 |
Antidepressants | Medications that may alleviate pain through mechanisms involving serotonin and norepinephrine (eg, amitriptyline, duloxetine) | 21 |
Benzodiazepines | Sedatives that may aid in pain relief through muscle relaxation or anxiolytic effects (eg, diazepam, lorazepam) | 21 |
Corticosteroids | Steroid hormone-based anti-inflammatories such as prednisone and dexamethasone | 21 |
Cold therapy (cryotherapy) | Use of freezing or near-freezing ice or gel packs, or other methods to cool the body | NA |
Dry needling | Insertion of one or more filiform needles into specific trigger points or sensitised areas within symptomatic soft tissue | 67 |
Education | Delivery of general or patient-specific advice for activities of daily living, ergonomics, activity modification, reassurance, pain neuroscience and other purposes to manage pain | 17 68 |
Exercise | Any of a variety of therapeutic movements, stretches or strengthening techniques, including directional preference, stability, range of motion exercises, yoga or Tai chi | 22 47 |
Inert | Any of a variety of inactive treatments such as placebo medications, sham spinal manipulation or needling therapies, or others | 46 |
Muscle relaxants | Medications that reduce muscle tension or spasm (eg, cyclobenzaprine, tizanidine) | 21 |
NSAIDs | Non-steroidal anti-inflammatory drugs (eg, ibuprofen, naproxen, diclofenac, meloxicam) | 21 |
Opioids | Drugs that act on opioid receptors including hydrocodone and tramadol | 21 |
Soft tissue techniques | Passive therapies addressing muscle or other soft tissue including massage | 69 |
Spinal manipulation | Manual techniques with or without thrust directed to the spinal joints (eg, chiropractic or osteopathic manipulation, flexion-distraction, Chuna, Tuina, spinal mobilisation) | 43 |
Superficial heat therapy | Application of heat therapies achieved through hot packs, towels, saunas, heat wraps or other similar methods | 70 |
Usual care (ie, standard care) | A variety of commonly used, conservative interventions that may vary by country and year, potentially determined at the discretion of a clinician. Usual care may include analgesics, NSAIDs and/or muscle relaxants, along with recommendations for activity modification and patient education | 31 71 |
NA, not applicable; NSAIDs, non-steroidal anti-inflammatory drugs.
Imputing missing data
We will attempt to include studies meeting criteria when pertinent data are missing, by imputing and/or scraping data. We will aim to include these studies because excluding them may lead to biased effect estimates, considering studies with non-significant findings are more likely to have missing variance data.48 For scraping, we will use PlotDigitizer, which has been previously validated for this purpose.49 For imputing, we will use previously established formulas.50 Briefly, this may include estimation of SD values from the mean and SE, CIs, range, as well as imputing the mean from the median and quartiles. When no other method is possible, we will substitute missing SD, which is reasonable given the short-term follow-up window of our study.51 52 We will conduct a sensitivity analysis with studies with imputed data excluded, to examine the potential impact of this strategy.
Prioritisations
We will prioritise outcome measures based on guidance from previous publications and favour commonly used measures. When an article reports multiple disability scores, such as the ODI and the RMDQ, we will prioritise the score that is most often used across studies. Regardless, data from multiple disability scales can be analysed when reported across different studies. We will synthesise the results for both, given our aim of using standardised mean differences (SMDs) to account for variations between scales. For pain assessments, we will prioritise pain at rest, rather than pain during movement or at night.
If multiple available data points are given for multiple time windows, we will adhere to the Cochrane Back Review Group’s definition of short-term follow-up as outcomes that are measured closest to 4 weeks after randomisation, from a minimum of 1 week to a maximum of 12 weeks.4 41
When both intent-to-treat and per-protocol data are available, we will prioritise the intent-to-treat analysis. This approach minimises bias by including all participants as randomised, regardless of whether they completed the study or adhered to the intervention protocol.
To maintain network connectivity, we will reclassify comparisons involving interventions combined with usual care versus usual care alone by synthesising them as active treatment versus usual care alone. This approach is necessary as it would be infeasible to consider individual components of usual care as distinct nodes, given the variability in this treatment category.31 Additionally, usual care is often used in acute LBP trials as an ethical alternative to a placebo intervention.20 46 53 Accordingly, our strategy aims to preserve relevant active treatment outcome data while assuming that outcomes are independent of usual care.
Similarly, we will list an active treatment combined with placebo (eg, Treatment A+placebo) under the same node as the active treatment (ie, Treatment A). This prevents the creation of a unique treatment node that could disconnect the network. This simplification will help ensure that outcomes for Treatment A are pooled and enable greater insights about treatment efficacy.
Geometry of the network
We will use R and the netmeta package to visualise network connections.54 55 We will create a graph that displays the individual treatment nodes, direct comparisons and the number of studies for each comparison (figure 1). We will assess whether the network is fully connected and report the total number of studies, pairwise comparisons and unique treatments. Additionally, we will discuss potential biases related to the network geometry, such as imbalances in the number of studies across comparisons.
Figure 1. Mock network graph representing the relationships between various interventions. This hypothetical network contains 12 studies, 18 pairwise comparisons and 11 interventions. Each node indicates an intervention, and to avoid misinterpreting this as actual data, hypothetical active treatments were assigned letters. The numbers of comparisons are displayed overlying the lines, which vary in thickness based on the inverse SE, reflecting the precision of the evidence. Interventions were generated using a random generator in Excel, drawing from an iterative list. Random participant numbers were also simulated, ranging from 10 to 50 per arm. Redundant arms within studies were replaced with ‘Inert’ to avoid errors, leading to a greater number of comparisons with an inert reference, which we expect given previous studies on the topic.18,21.
Risk of bias within individual studies
We will assess the risk of bias in individual studies using the Cochrane Risk of Bias 1 (RoB 1) tool, which evaluates potential bias at the study level across several domains (ie, randomisation, deviations, missing data, measurement and selection).56 Two independent reviewers will respond to the RoB 2 signalling questions to identify potential bias in trial design, conduct and reporting, with any disagreements resolved through discussion. Judgements regarding RoB for each domain will be categorised as ‘Low’, ‘Some Concerns’ or ‘High’ based on the reviewers’ assessments. Scores will be stored in Covidence due to its blinding feature and integration with RoB 1 and exported for use in R. This process will be done via Robvis57 and ggplot,58 including an individual study plot indicating the judgements for each domain and a summary plot showing the overall judgements across studies for each domain (figure 2).
Figure 2. Mock individual study risk of bias (RoB) scores (A) and summary scores (B). The (A) traffic light-style plot illustrates the risk of bias ratings for each included hypothetical study, from RoB 1 judgements. Each row represents a study, while the columns indicate the domains of bias: sequence generation (D1), allocation/concealment (D2), blinding of participants and personnel (D3), blinding of outcome assessment (D4), incomplete outcome data (D5), selective reporting (D6) and other sources of bias (D7), followed by an overall score. The summary score plot (B) presents an aggregated view of the risk of bias across all studies. The colour-blind-friendly colours indicate the following: tan indicates ‘Low’, orange represents ‘Unclear’ and red represents ‘High’ risk of bias; both plots follow the same legend.
Summary measures
Forest plots
This primary outcome will use forest plots to compare interventions against an inert reference, using SMD as the effect estimate (figure 3). These plots will visualise effect estimates from both direct evidence for pairwise comparisons for both disability and pain. SMD values with a magnitude of 0.2, 0.5 and 0.8 will be considered small, moderate and large effects, respectively,59 with negative values indicating a reduction in pain or disability.
Figure 3. Mock forest plot indicating the standardised mean difference (SMD) for various interventions compared with the inert reference group. Negative SMD values indicate a favourable effect of treatment relative to the inert reference. Hypothetical data, randomly generated in Microsoft Excel, were used for demonstrative purposes. Active treatments are labelled with letters to avoid confusion with real data. The size of the squares represents the precision of the estimates, with larger squares indicating greater precision. Error bars denote 95% CIs.
P-score
This primary outcome will include a frequentist ranking of treatments (ie, p-score) based on their efficacy for disability and pain. This measure quantifies the probability that a treatment is more efficacious than another treatment in the network.60 We will further visualise this metric using cumulative ranking plots for both pain and disability outcomes (figure 4).
Figure 4. Mock cumulative ranking plot. This figure displays the treatment rankings based on the p-scores, which reflect the probability of a treatment being more efficacious than all others in the network. The colours in the plot range from green to red, with green (left of figure; top of key) indicating higher p-scores and red (right of figure; top of key) representing lower p-scores. This ranking is based on a hypothetical network of treatments, where treatments are assigned arbitrary labels (eg, ‘A’, ‘B’, ‘C’) to avoid confusion with real-world data.
League table
As a sensitivity analysis, we will create a league table for both disability and pain outcomes, displaying pairwise comparisons of treatment efficacy ranked from most to least efficacious. Each cell will indicate the SMD effect estimate along with its 95% CIs, with direct evidence shown in the upper triangle and indirect evidence in the lower triangle. Online supplemental file 2 shows a mock league table for pairwise comparisons. Treatments are indicated in letters due to use of hypothetical data. From the top left to bottom right, these are ranked from most to least efficacious. Each cell shows the standardised mean deviation and its 95% CIs, with direct evidence shown in the upper triangle and indirect evidence in the lower triangle.
Planned methods of analysis
We will provide a qualitative overview of the included studies, assessing follow-up durations, outcome measures used and the number of patients per intervention. This will include a histogram of follow-up durations and a general overview of age distribution across studies, including variability in reported age metrics (eg, mean (SD), range, median). We will also summarise the proportion of male and female patients. This information will help determine whether the transitivity assumption is met. All analyses will be performed using R (V.4.2.2, Vienna, Austria61) using the netmeta package,54 with other packages mentioned accordingly herein.
For our statistical NMA approach, we will synthesise the effects of included interventions on post-treatment pain and disability in patients with acute LBP. For studies with multiple treatment arms, we will use pairwise comparisons of each treatment against the other treatments and an inert reference, ensuring that all relevant comparisons are included. We will use a random effects model given the expected variability in effect estimates, follow-up duration and interventions across studies.20 For completeness and to adhere to PRISMA-NMA, we will calculate and present SMDs with corresponding 95% CIs for each study-specific pairwise comparisons. These estimates will be derived without the NMA methods.
Given our goal of comparing the efficacy of a range of interventions, we will require at least 10 trials, including at least 5 direct comparisons within the network. This threshold is achievable based on prior similar NMAs19,21 and is supported by statistical considerations.62 For example, with a trial count ratio of 1:2, approximately 4.5 indirect trials provide the same precision as 1 direct trial. Accordingly, 5 direct trials offer precision equivalent to approximately 22.5 indirect trials. With a conservative estimate of 40 patients per group,20 this combination of direct and indirect evidence should yield a sufficiently large effective sample size to produce precise and clinically meaningful effect estimates.
Assessment of inconsistency
We will evaluate inconsistency using Cochran’s Q-statistic derived from a random effects model for the entire network, which compares treatment effects from direct comparisons with those inferred through indirect comparisons.63 A significant Q-statistic value (ie, greater than the df with p<0.05) indicates heterogeneity among included studies. Additionally, we will calculate I² values, with values of 25%, 50% and 75% interpreted as small, moderate and high levels of heterogeneity, respectively.64 Finally, we will assess the consistency of the network using node-splitting forest plots (figure 5). These plots will compare SMD for pain and disability outcomes in pairwise comparisons where both direct and indirect evidence are available. We will visually evaluate consistency by noting any lack of overlap between the 95% CIs, which would suggest inconsistency between the direct and indirect evidence.
Figure 5. Mock node-splitting forest plot. This plot compares hypothetical standardised mean difference (SMD) values for outcomes (ie, pain or disability) across pairwise comparisons with both direct and indirect evidence. The plot uses a random effects model to account for between-study variability. Squares represent the point estimates for each comparison, with the size of the squares proportional to the precision of the estimates (larger squares indicate more precise estimates). The 95% CIs are represented by the horizontal lines. Comparisons with overlapping CIs between direct and indirect evidence suggest consistency, while non-overlapping CIs indicate inconsistency between the two types of evidence. Treatments are represented by letters to avoid confusion with real data.
Risk of bias across studies
We will examine for potential publication bias by visually inspecting comparison-adjusted funnel plots for direct comparisons of pain and disability outcomes54 (figure 6). Comparison-adjusted funnel plots are modified from traditional funnel plots to account for heterogeneity among studies, enabling the comparison of multiple interventions within a network.65 A symmetrical funnel plot will suggest low risk of bias, while asymmetry may suggest the presence of publication bias. We will provide a statistical assessment of publication bias using Egger’s regression test, interpreting a resulting p value of <0.05 as indicative of potential bias.
Figure 6. Mock comparison-adjusted funnel plot. This plot assesses for potential publication bias for outcomes (ie, pain or disability) using hypothetical data. Each point represents a direct treatment comparison, plotted by effect size (standardised mean difference) on the x-axis and SE on the y-axis. The plot adjusts for heterogeneity between comparisons. Asymmetry in the plot may indicate the presence of publication bias. Egger’s regression test (p=0.6540) suggests an absence of publication bias in this mock dataset. Treatments are ordered with ‘Inert’ as the reference comparator, facilitating meaningful comparison across treatment pairs.
Additional analyses
We will conduct sensitivity analyses by excluding studies that had imputed or scraped values. This approach is recommended to reduce the risk of bias, as imputed data can introduce uncertainty and may not accurately reflect the original data distribution.48 51 52 By excluding these studies, we aim to assess the robustness of our main outcomes. Specifically, we will repeat our analyses for the primary outcomes, including the p-scores and forest plots, for both pain and disability outcomes.
Certainty assessment
We will evaluate the certainty of evidence for our primary forest plot outcomes for pain and disability using the CINeMA (Confidence in Network Meta-Analysis) framework, which grades confidence levels as high, moderate, low or very low.66 We will use a minimum relevant SMD threshold of ±0.1, which indicates a small effect size. We will incorporate the RoB 1 scores from our earlier analyses into CINeMA, manually assess indirectness of the evidence, enter the scores from reporting bias, while the software will automatically compute imprecision, heterogeneity and incoherence based on our NMA findings. Using the online CINeMA tool (https://cinema.ispm.unibe.ch), we will rate each comparison as having ‘No Concerns’, ‘Some Concerns’ or ‘Major Concerns’, with ratings discussed among multiple authors to ensure consensus. For assessing within-study bias, we will focus on the average RoB 1 score for each pairwise comparison as a guideline. Consistent with CINeMA guidelines, we will not downgrade our overall confidence more than once for related domains such as imprecision and heterogeneity.
Patient and public involvement
Patients and/or the public were not involved in the design, or conduct, or reporting, or dissemination plans of this research.
Ethics and dissemination
This review protocol does not involve any primary research with human participants, animal subjects or medical record review. Consequently, this work did not require approval from an institutional review board or ethics committee.
Supplementary material
Footnotes
Funding: The authors have not declared a specific grant for this research from any funding agency in the public, commercial or not-for-profit sectors.
Prepub: Prepublication history and additional supplemental material for this paper are available online. To view these files, please visit the journal online (https://doi.org/10.1136/bmjopen-2025-100520).
Provenance and peer review: Not commissioned; externally peer reviewed.
Patient consent for publication: Not applicable.
Patient and public involvement: Patients and/or the public were not involved in the design, or conduct, or reporting, or dissemination plans of this research.
References
- 1.Fatoye F, Gebrye T, Ryan CG, et al. Global and regional estimates of clinical and economic burden of low back pain in high-income countries: a systematic review and meta-analysis. Front Public Health. 2023;11:1098100. doi: 10.3389/fpubh.2023.1098100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Wu A, March L, Zheng X, et al. Global low back pain prevalence and years lived with disability from 1990 to 2017: estimates from the Global Burden of Disease Study 2017. Ann Transl Med. 2020;8:299. doi: 10.21037/atm.2020.02.175. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Fatoye F, Gebrye T, Odeyemi I. Real-world incidence and prevalence of low back pain using routinely collected data. Rheumatol Int. 2019;39:619–26. doi: 10.1007/s00296-019-04273-0. [DOI] [PubMed] [Google Scholar]
- 4.Furlan AD, Malmivaara A, Chou R, et al. 2015 Updated Method Guideline for Systematic Reviews in the Cochrane Back and Neck Group. Spine (Phila Pa 1976) 2015;40:1660–73. doi: 10.1097/BRS.0000000000001061. [DOI] [PubMed] [Google Scholar]
- 5.Balagué F, Mannion AF, Pellisé F, et al. Non-specific low back pain. The Lancet. 2012;379:482–91. doi: 10.1016/S0140-6736(11)60610-7. [DOI] [PubMed] [Google Scholar]
- 6.Kent PM, Keating JL, Buchbinder R. Searching for a conceptual framework for nonspecific low back pain. Man Ther. 2009;14:387–96. doi: 10.1016/j.math.2008.07.003. [DOI] [PubMed] [Google Scholar]
- 7.Krismer M, van Tulder M. Low back pain (non-specific) Best Practice & Research Clinical Rheumatology. 2007;21:77–91. doi: 10.1016/j.berh.2006.08.004. [DOI] [PubMed] [Google Scholar]
- 8.Wallwork SB, Braithwaite FA, O’Keeffe M, et al. The clinical course of acute, subacute and persistent low back pain: a systematic review and meta-analysis. CMAJ . 2024;196:E29–46. doi: 10.1503/cmaj.230542. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.da Silva T, Mills K, Brown BT, et al. Recurrence of low back pain is common: a prospective inception cohort study. J Physiother. 2019;65:159–65. doi: 10.1016/j.jphys.2019.04.010. [DOI] [PubMed] [Google Scholar]
- 10.Chou R, Shekelle P. Will this patient develop persistent disabling low back pain? JAMA. 2010;303:1295–302. doi: 10.1001/jama.2010.344. [DOI] [PubMed] [Google Scholar]
- 11.Kongsted A, Kent P, Axen I, et al. What have we learned from ten years of trajectory research in low back pain? BMC Musculoskelet Disord. 2016;17:220. doi: 10.1186/s12891-016-1071-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.George SZ, Fritz JM, Silfies SP, et al. Interventions for the Management of Acute and Chronic Low Back Pain: Revision 2021. J Orthop Sports Phys Ther. 2021;51:CPG1–60. doi: 10.2519/jospt.2021.0304. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Price MR, Cupler ZA, Hawk C, et al. Systematic review of guideline-recommended medications prescribed for treatment of low back pain. Chiropr Man Therap . 2022;30 doi: 10.1186/s12998-022-00435-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Pangarkar SS, Kang DG, Sandbrink F, et al. VA/DoD Clinical Practice Guideline: Diagnosis and Treatment of Low Back Pain. J Gen Intern Med. 2019;34:2620–9. doi: 10.1007/s11606-019-05086-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Oliveira CB, Maher CG, Pinto RZ, et al. Clinical practice guidelines for the management of non-specific low back pain in primary care: an updated overview. Eur Spine J . 2018;27:2791–803. doi: 10.1007/s00586-018-5673-2. [DOI] [PubMed] [Google Scholar]
- 16.Zhou T, Salman D, McGregor AH. Recent clinical practice guidelines for the management of low back pain: a global comparison. BMC Musculoskelet Disord. 2024;25:344. doi: 10.1186/s12891-024-07468-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Piano L, Ritorto V, Vigna I, et al. Individual Patient Education for Managing Acute and/or Subacute Low Back Pain: Little Additional Benefit for Pain and Function Compared to Placebo. A Systematic Review With Meta-analysis of Randomized Controlled Trials. J Orthop Sports Phys Ther. 2022;52:432–45. doi: 10.2519/jospt.2022.10698. [DOI] [PubMed] [Google Scholar]
- 18.Rubinstein SM, Terwee CB, Assendelft WJJ, et al. Spinal manipulative therapy for acute low back pain: an update of the cochrane review. Spine (Phila Pa 1976) 2013;38:E158–77. doi: 10.1097/BRS.0b013e31827dd89d. [DOI] [PubMed] [Google Scholar]
- 19.Wu B, Yang L, Fu C, et al. Efficacy and safety of acupuncture in treating acute low back pain: a systematic review and bayesian network meta-analysis. Ann Palliat Med. 2021;10:6156–67. doi: 10.21037/apm-21-551. [DOI] [PubMed] [Google Scholar]
- 20.Gianola S, Bargeri S, Del Castillo G, et al. Effectiveness of treatments for acute and subacute mechanical non-specific low back pain: a systematic review with network meta-analysis. Br J Sports Med. 2022;56:41–50. doi: 10.1136/bjsports-2020-103596. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Wewege MA, Bagg MK, Jones MD, et al. Comparative effectiveness and safety of analgesic medicines for adults with acute non-specific low back pain: systematic review and network meta-analysis. BMJ. 2023;380:e072962. doi: 10.1136/bmj-2022-072962. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.IJzelenberg W, Oosterhuis T, Hayden JA, et al. Exercise Therapy for Treatment of Acute Non-specific Low Back Pain: A Cochrane Systematic Review and Meta-analysis of Randomized Controlled Trials. Arch Phys Med Rehabil. 2024;105:1571–84. doi: 10.1016/j.apmr.2024.02.732. [DOI] [PubMed] [Google Scholar]
- 23.Cashin AG, Furlong BM, Kamper SJ, et al. Analgesic effects of non-surgical and non-interventional treatments for low back pain: a systematic review and meta-analysis of placebo-controlled randomised trials. BMJ Evid Based Med. 2025:bmjebm–2024. doi: 10.1136/bmjebm-2024-112974. [DOI] [PubMed] [Google Scholar]
- 24.Florez ID, De La Cruz-Mena JE, Veroniki A-A. Network meta-analysis: a powerful tool for clinicians, decision-makers, and methodologists. J Clin Epidemiol. 2024;176:111537. doi: 10.1016/j.jclinepi.2024.111537. [DOI] [PubMed] [Google Scholar]
- 25.Antoniou SA, Mavridis D, Tsokani S, et al. EAES Guidelines Subcommittee: Network meta-analysis as a tool in clinical practice guidelines. Surg Endosc. 2023;37:1–4. doi: 10.1007/s00464-022-09772-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Kanters S, Ford N, Druyts E, et al. Use of network meta-analysis in clinical guidelines. Bull World Health Organ. 2016;94:782–4. doi: 10.2471/BLT.16.174326. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Gaspar FW, Thiese MS, Wizner K, et al. Guideline adherence and lost workdays for acute low back pain in the California workers’ compensation system. PLoS One. 2021;16:e0253268. doi: 10.1371/journal.pone.0253268. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Hanney WJ, Masaracchio M, Liu X, et al. The Influence of Physical Therapy Guideline Adherence on Healthcare Utilization and Costs among Patients with Low Back Pain: A Systematic Review of the Literature. PLoS One. 2016;11:e0156799. doi: 10.1371/journal.pone.0156799. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Stevans JM, Delitto A, Khoja SS, et al. Risk Factors Associated With Transition From Acute to Chronic Low Back Pain in US Patients Seeking Primary Care. JAMA Netw Open . 2021;4:e2037371. doi: 10.1001/jamanetworkopen.2020.37371. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Zadro J, O’Keeffe M, Maher C. Do physical therapists follow evidence-based guidelines when managing musculoskeletal conditions? Systematic review. BMJ Open. 2019;9:e032329. doi: 10.1136/bmjopen-2019-032329. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Kamper SJ, Logan G, Copsey B, et al. What is usual care for low back pain? A systematic review of health care provided to patients with low back pain in family practice and emergency departments. Pain. 2020;161:694–702. doi: 10.1097/j.pain.0000000000001751. [DOI] [PubMed] [Google Scholar]
- 32.Ahern M, Dean CM, Dear BF, et al. Management of acute low back pain: the practices and perspectives of primary care clinicians in Australia. Aust J Prim Health. 2020;26:256–64. doi: 10.1071/PY19152. [DOI] [PubMed] [Google Scholar]
- 33.Ganesh GS, Khan AR, Khan A. A survey of Indian physiotherapists’ clinical practice patterns and adherence to clinical guidelines in the management of patients with acute low back pain. Musculoskeletal Care. 2023;21:478–90. doi: 10.1002/msc.1720. [DOI] [PubMed] [Google Scholar]
- 34.Kim LH, Vail D, Azad TD, et al. Expenditures and Health Care Utilization Among Adults With Newly Diagnosed Low Back and Lower Extremity Pain. JAMA Netw Open . 2019;2:e193676. doi: 10.1001/jamanetworkopen.2019.3676. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Hutton B, Salanti G, Caldwell DM, et al. The PRISMA Extension Statement for Reporting of Systematic Reviews Incorporating Network Meta-analyses of Health Care Interventions: Checklist and Explanations. Ann Intern Med. 2015;162:777–84. doi: 10.7326/M14-2385. [DOI] [PubMed] [Google Scholar]
- 36.Casazza BA: Diagnosis and treatment of acute low back pain. Am Fam Physician. 2012;85:343–50. [PubMed] [Google Scholar]
- 37.McIntosh G, Hall H. Low back pain (acute) BMJ Clin Evid. 2011;2011:1102. [PMC free article] [PubMed] [Google Scholar]
- 38.Norton G, McDonough CM, Cabral HJ, et al. Classification of patients with incident non-specific low back pain: implications for research. Spine J. 2016;16:567–76. doi: 10.1016/j.spinee.2015.08.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Hirshon JM, Risko N, Calvello EJB, et al. Health systems and services: the role of acute care. Bull World Health Organ. 2013;91:386–8. doi: 10.2471/BLT.12.112664. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Bejarano G, Csiernik B, Zadro JR, et al. Compared to what? An analysis of comparators in trials informing the National Institute of Clinical Excellence (NICE) low back pain guideline. Braz J Phys Ther. 2023;27:100563. doi: 10.1016/j.bjpt.2023.100563. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Furlan AD, Pennick V, Bombardier C, et al. 2009 updated method guidelines for systematic reviews in the Cochrane Back Review Group. Spine (Phila Pa 1976) 2009;34:1929–41. doi: 10.1097/BRS.0b013e3181b1c99f. [DOI] [PubMed] [Google Scholar]
- 42.Breckenridge JD, Ginn KA, Wallwork SB, et al. Do People With Chronic Musculoskeletal Pain Have Impaired Motor Imagery? A Meta-analytical Systematic Review of the Left/Right Judgment Task. J Pain. 2019;20:119–32. doi: 10.1016/j.jpain.2018.07.004. [DOI] [PubMed] [Google Scholar]
- 43.Paige NM, Miake-Lye IM, Booth MS, et al. Association of Spinal Manipulative Therapy With Clinical Benefit and Harm for Acute Low Back Pain: Systematic Review and Meta-analysis. JAMA. 2017;317:1451–60. doi: 10.1001/jama.2017.3086. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Su X, Qian H, Chen B, et al. Acupuncture for acute low back pain: a systematic review and meta-analysis. Ann Palliat Med. 2021;10:3924–36. doi: 10.21037/apm-20-1998. [DOI] [PubMed] [Google Scholar]
- 45.Fu R, Holmer HK. Change score or follow-up score? Choice of mean difference estimates could impact meta-analysis conclusions. J Clin Epidemiol. 2016;76:108–17. doi: 10.1016/j.jclinepi.2016.01.034. [DOI] [PubMed] [Google Scholar]
- 46.Strijkers RHW, Schreijenberg M, Gerger H, et al. Effectiveness of placebo interventions for patients with nonspecific low back pain: a systematic review and meta-analysis. Pain. 2021;162:2792–804. doi: 10.1097/j.pain.0000000000002272. [DOI] [PubMed] [Google Scholar]
- 47.Karlsson M, Bergenheim A, Larsson MEH, et al. Effects of exercise therapy in patients with acute low back pain: a systematic review of systematic reviews. Syst Rev. 2020;9:182. doi: 10.1186/s13643-020-01412-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Fu R, Vandermeer BW, Shamliyan TA, et al. Methods Guide for Effectiveness and Comparative Effectiveness Reviews. Agency for Healthcare Research and Quality (US); 2013. Handling continuous outcomes in quantitative synthesis. [PubMed] [Google Scholar]
- 49.Aydin O, Yassikaya MY. Validity and Reliability Analysis of the PlotDigitizer Software Program for Data Extraction from Single-Case Graphs. Perspect Behav Sci . 2022;45:239–57. doi: 10.1007/s40614-021-00284-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Chi K-Y, Li M-Y, Chen C, et al. Ten circumstances and solutions for finding the sample mean and standard deviation for meta-analysis. Syst Rev. 2023;12:62. doi: 10.1186/s13643-023-02217-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Higgins JP, Li T, Deeks JJ. Choosing effect measures and computing estimates of effect. Cochrane Handb Syst Rev Interv. 2019:143–76. [Google Scholar]
- 52.Wiebe N, Vandermeer B, Platt RW, et al. A systematic review identifies a lack of standardization in methods for handling missing variance data. J Clin Epidemiol. 2006;59:342–53. doi: 10.1016/j.jclinepi.2005.08.017. [DOI] [PubMed] [Google Scholar]
- 53.Puhl AA, Reinhart CJ, Rok ER. Injeyan HS: An examination of the observed placebo effect associated with the treatment of low back pain – a systematic review. Pain Res Manag J Can Pain Soc. 2011;16 doi: 10.1155/2011/625315. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Balduzzi S, Rücker G, Nikolakopoulou A, et al. An R Package for Network Meta-Analysis Using Frequentist Methods. J Stat Softw. 2023;106 doi: 10.18637/jss.v106.i02. [DOI] [Google Scholar]
- 55.Rücker G, Schwarzer G. Automated drawing of network plots in network meta-analysis. Res Synth Methods. 2016;7:94–107. doi: 10.1002/jrsm.1143. [DOI] [PubMed] [Google Scholar]
- 56.Higgins JPT, Altman DG, Gøtzsche PC, et al. The Cochrane Collaboration’s tool for assessing risk of bias in randomised trials. BMJ. 2011;343:d5928. doi: 10.1136/bmj.d5928. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.McGuinness LA, Higgins JPT. Risk-of-bias VISualization (robvis): An R package and Shiny web app for visualizing risk-of-bias assessments. Res Synth Methods. 2021;12:55–61. doi: 10.1002/jrsm.1411. [DOI] [PubMed] [Google Scholar]
- 58.Wickham H. Ggplot2: Elegant Graphics for Data Analysis. Springer-Verlag New York; 2016. [Google Scholar]
- 59.Page P. Beyond statistical significance: clinical interpretation of rehabilitation research literature. Int J Sports Phys Ther. 2014;9:726–36. [PMC free article] [PubMed] [Google Scholar]
- 60.Rücker G, Schwarzer G. Ranking treatments in frequentist network meta-analysis works without resampling methods. BMC Med Res Methodol. 2015;15:58. doi: 10.1186/s12874-015-0060-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.R Core Team R: a language and environment for statistical computing. r foundation for statistical computing. 2022
- 62.Thorlund K, Mills EJ. Sample size and power considerations in network meta-analysis. Syst Rev. 2012;1:41. doi: 10.1186/2046-4053-1-41. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Krahn U, Binder H, König J. A graphical tool for locating inconsistency in network meta-analyses. BMC Med Res Methodol. 2013;13:35. doi: 10.1186/1471-2288-13-35. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Higgins JPT, Thompson SG, Deeks JJ, et al. Measuring inconsistency in meta-analyses. BMJ. 2003;327:557–60. doi: 10.1136/bmj.327.7414.557. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Chaimani A, Salanti G. Using network meta-analysis to evaluate the existence of small-study effects in a network of interventions. Res Synth Methods. 2012;3:161–76. doi: 10.1002/jrsm.57. [DOI] [PubMed] [Google Scholar]
- 66.Papakonstantinou T, Nikolakopoulou A, Higgins JPT, et al. CINeMA: Software for semiautomated assessment of the confidence in the results of network meta-analysis. Campbell Syst Rev . 2020;16:e1080. doi: 10.1002/cl2.1080. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Griswold D, Wilhelm M, Donaldson M, et al. The effectiveness of superficial versus deep dry needling or acupuncture for reducing pain and disability in individuals with spine-related painful conditions: a systematic review with meta-analysis. J Man Manip Ther. 2019;27:128–40. doi: 10.1080/10669817.2019.1589030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Jones CM, Shaheed CA, Ferreira GE, et al. Advice and education provide small short-term improvements in pain and disability in people with non-specific spinal pain: a systematic review. J Physiother. 2021;67:263–70. doi: 10.1016/j.jphys.2021.08.014. [DOI] [PubMed] [Google Scholar]
- 69.Chen Z, Wu J, Wang X, et al. The effects of myofascial release technique for patients with low back pain: A systematic review and meta-analysis. Complement Ther Med. 2021;59:102737. doi: 10.1016/j.ctim.2021.102737. [DOI] [PubMed] [Google Scholar]
- 70.Qaseem A, Wilt TJ, McLean RM, et al. Noninvasive Treatments for Acute, Subacute, and Chronic Low Back Pain: A Clinical Practice Guideline From the American College of Physicians. Ann Intern Med. 2017;166:514–30. doi: 10.7326/M16-2367. [DOI] [PubMed] [Google Scholar]
- 71.Pascoe SC, Spoonemore SL, Jr, Young JL, et al. Proposing six criteria to improve reproducibility of “usual care” interventions in back pain trials: a systematic review. J Clin Epidemiol. 2022;149:227–35. doi: 10.1016/j.jclinepi.2022.05.002. [DOI] [PubMed] [Google Scholar]