Randomized controlled trials (RCTs) of back pain often prioritize efficacy or effectiveness outcomes. To determine safety, back pain RCTs, monitor adverse events. Participants’ guesses about their assigned intervention are rarely reported, and quantitative assessments of blinding success are sparse. Blinding data are challenging to locate, retrieve, or pool. Freed et al. 1 conducted a meta-analysis of 40 back pain RCTs, published between 2000 and 2019 which reported participants’ treatment guesses. A main finding was the association between correct guesses and the size of treatment effects. Even though blinding may not always be crucial to study objectives,2,3 patient-reported outcomes are susceptible to bias introduced by participants’ correct beliefs or perceptions about intervention assignment.4,5
Here, we explored associations between participants’ treatment guesses and adverse events in back pain RCTs updating the search of Freed et al. 1 We hypothesize that “active” intervention guesses are associated with higher adverse event rates, whereas “control” intervention guesses are associated with lower rates—a commonly held, although untested belief. We also hypothesize that satisfactory blinding is associated with fewer adverse events.
Freed et al. 1 analyzed 40 back pain RCTs published between 2000 and 2019. Two eligible studies not included in that review were identified after that publication. From 2020 to 2023, we searched “back pain” and “blind” in PubMed for screening. Similar searches were conducted in Ovid and Embase. Eligible articles were subjected to word searches on blinding and adverse events.
Participants’ intervention assignments (“Active” vs “Control”) were tabulated against treatment guess data (“Active” vs “Control” vs “Don’t know”). Then, a blinding index (BI) was computed.1,5–7 Credibility/expectancy scores were reported in some RCTs and converted to a BI via linear transformation (see Supplemental materials). In addition, we extracted treatment modality and timing of blinding assessment. 8 For safety outcomes, we extracted the most prominently reported adverse event for primary analysis and another one for secondary/sensitivity analyses, whenever available. Adverse events were analyzed in terms of count and rate (i.e., count/total sample size).
We calculated Pearson correlation coefficients and fitted locally estimated scatterplot smoothing (LOESS) with confidence limits between arm-specific BI and adverse events. Next, we repeated these analyses for the correlation between “sum BI” and “between-arm differences in adverse events.” BI can be interpreted as the proportion of correct guesses beyond chance within an arm with 0 indicating a “random guess”, equivalently, 50:50 correct: incorrect; 1 indicating complete correct guess; and −1 indicating complete incorrect guess.1,5,6 Sum BI represents the degree of discrepant guesses between arms, with 0 indicating no discrepancy, for example, all participants in both arms guessing they received the active treatment. Satisfactory blinding was operationalized as two different scenarios: (1) (|BIactive|≤ c and |BIcontrol|≤ c; “random guess”), or (2) (|BIsum|≤ c; “wishful thinking”).1,5,6 Two cutoffs c ={0.2,0.3} were chosen.6,9
PubMed identified 230 publications for screening, supplemented by other search approaches. Twelve RCTs meeting eligibility criteria were identified for the period between 2020 and 2023 (Supplemental materials). Among 54 back pain RCTs-with participants’ treatment guess data, 45 reported adverse events.
Correlation coefficients between BI and adverse events in active arms were −0.17 and 0.01 using count and rate, respectively. The corresponding correlations for control arms were 0.28 and 0.15. For sum BI and between-group adverse event difference, the corresponding values were 0.31 (95% confidence interval (CI): 0.02–0.55) and 0.31 (0.01–0.55); see Figure 1.
Figure 1.
Discrepant guesses between arms (sum BI) and adverse event rates.
Sum BI and between-arm adverse event rate differences (adverse event rates [active] - adverse event rates [control]) are displayed. Y-axis represents adverse event rate differences; the total number of adverse event cases were divided by total sample size within an arm, then the value of between-arm difference was calculated. Magnitude of discrepant guesses between arms (in X-axis) was measured by summed BIs (sum BI); BIsum = BIactive + BIcontrol, where sum BI closer to 0 indicates that participants in both arms believe they received the same treatment. For example, BIsum = 0 can be “wishful thinking” when both groups believe they received the active treatment, or “pessimistic thinking” when both groups believe they received control; both scenarios result in BIsum = 0, although “wishful thinking” is more common in practice.1,5,6 The regression line is fitted by LOESS with 95% confidence limits, based on the method of Cleveland and Grosse (1991). A total of 45 trials with BI and adverse event data are included. Pearson correlation coefficient is 0.31 (95% CI: 0.02–0.55). Correlation is 0.31 when count is used instead of rate, and 0.29 when weighted by total sample size for each trial. SAS version 9.4 was used for analysis and graphics, including the LOESS procedure (SAS Institute, Cary, NC).
Using 0.2 and 0.3 cutoffs, 22 and 31 out of 45 RCTs were classified as reaching “satisfactory blinding,” respectively. In the within-arm exploration, adverse events were not systematically different between the RCTs classified as “satisfactory blinding” and their not-satisfactory counterpart (i.e., the complement set). In contrast, the between-arm adverse event difference was low in the satisfactory blinding RCT group. With the 0.2 cutoff, the mean event counts were 0.5 for the satisfactory group versus 2.7 for the not-satisfactory counterpart, while the respective rates were 0.0 versus 0.08. Results tended to be robust to alternative adverse events and cutoffs; see Supplemental materials.
We explored associations between participants’ treatment guesses and adverse events in back pain RCTs. Our first hypothesis was not supported—active and control treatment guesses were not significantly correlated with increased and decreased adverse events, respectively. Correlations between arm-specific BIs and adverse events were weak, but somewhat in opposite directions, compared to correlations between arm-specific BIs and size of treatment effects reported previously. 1 When using sum BI as a measure of study-level blinding, lower between-arm difference in adverse events was observed among satisfactorily blinded RCTs.
Our interpretations warrant caution due to marked clinical heterogeneity among included RCTs. Participant characteristics, types and timing of treatment guess and safety outcomes also varied. Other RCTs—outside back pain—have found associations between patient blinding and adverse events. 10 Our findings provide a quantitative assessment of blinding and adverse events in back pain trials, suggesting that some adverse events could be due to participants’ treatment guesses about their assigned intervention. Hence, adverse events in sham- or placebo-control arms could be predominantly contextual or non-specific—namely, nocebo effect (i.e., placebo side effect).
Limitations of our study should be noted. First, stratified, subgroup, or adjusted analyses were not feasible; thus, sources of heterogeneity were not elucidated. Second, our data searches and review processes could not be as systematic as in traditional meta-analyses, owing to rarely collected and hard-to-reach blinding-related data, often buried within text or file drawer. Third, our operationalization of “satisfactory blinding” is numerical and ad hoc, so misclassification is plausible, when true status is hardly known. 6 Last, we could not address temporality between blinding and adverse events. 11
To our knowledge, this is the first exploration of participant’s treatment guesses and adverse events in back pain trials. This is noteworthy as reviews and meta-analyses on blinding generally rely on authors’ self-labeling of “single” or “double-blind”—especially, in title—rather than quantitative blinding assessments.2,10
Participant blinding and between-group comparisons can be important for safety outcomes, beyond efficacy or effectiveness outcomes. In the current patient-centered era with an emphasis on patient-reported outcomes, our findings provide empirical evidence that more discrepant treatment guesses could be associated with larger between-group differences in adverse events. Back pain trialists should foster blinding assessments and implementation procedures—including credible sham- or placebo-controls—and mitigate sources of bias in the evaluation of safety outcomes. 12
Supplemental Material
Supplemental material, sj-docx-1-ctj-10.1177_17407745241276124 for Participant’s treatment guesses and adverse events in back pain trials: Nocebo in action? by Javier Muñoz Laguna, Hyangsook Lee, Eduard Poltavskiy, Jeehyoung Kim and Heejung Bang in Clinical Trials
Acknowledgments
The authors thank Ms Caron Modeas’ English editing service. They also thank Dr Spence, Stewart, Szabo, and Jaffe who provided useful comments.
Footnotes
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding: The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: J.M.L.’s doctoral position is funded by the European Centre for Chiropractic Research Excellence (ECCRE); reports support from Pro Chiropractic Switzerland (Pro Chiropraktik Schweiz), the Spanish Chiropractic Association (Asociación Española de Quiropráctica), and the Graduate Campus of the University of Zurich (GRC Travel Grant 2023_Q1_TG_115) for conference activities; J.M.L was partly supported by the European Cooperation in Science and Technology (COST) outside this work. H.L. was partly supported by the National Research Foundation of Korea (NRF; grant nos. 2020R1A6A6017334 and RS-2023-00278131) funded by the Korea government (Ministry of Science and Information and Communication Technology). H.B. was partly supported by the National Institutes of Health (grant nos. UL1 TR001860, R01 AR076088, and R61 AT012187). Funding sources had no role in the study design or implementation.
Reproducible research statement: Study protocol: Not available. See the Supplemental material in Freed et al. (2021) or contact the first or last author of this article for further information. Statistical code, dataset, and full supplement: Available from the first or last author on reasonable request.
Disclaimer: The interpretation and reporting of these data are the responsibility of the authors and should in no way be seen as an official policy of or interpretation by funding agencies.
ORCID iDs: Javier Muñoz Laguna
https://orcid.org/0000-0003-0332-5540
Hyangsook Lee
https://orcid.org/0000-0002-1107-9389
Eduard Poltavskiy
https://orcid.org/0000-0002-4052-8593
Jeehyoung Kim
https://orcid.org/0000-0003-3380-1683
Heejung Bang
https://orcid.org/0000-0002-5755-9951
Supplemental material: Supplemental material for this article is available online.
References
- 1. Freed B, Williams B, Situ X, et al. Blinding, sham, and treatment effects in randomized controlled trials for back pain in 2000–2019: a review and meta-analytic approach. Clin Trials 2021; 18(3): 361–370. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Moustgaard H, Clayton G, Jones H, et al. Impact of blinding on estimated treatment effects in randomised clinical trials: meta-epidemiological study. BMJ 2020; 368: 16802. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Godlee F. Blinding may be unnecessary, but please divest. BMJ 2020; 368: m255. [Google Scholar]
- 4. Back pain consortium research program, https://heal.nih.gov/research/clinical-research/back-pain (accessed 31 May 2024).
- 5. Won J, Bang H, Lee H. Effect of partial vs full disclosure of potential assignment to placebo on participant blinding, perceptions of group assignment, and trial outcomes: a randomized clinical trial. JAMA Netw Open 2022; 5(3): e224050. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Moroz A, Freed B, Tiedemann L, et al. Blinding measured: a systematic review of randomized controlled trials of acupuncture. Evid Based Complement Alternat Med 2013; 2013: 708251. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Howick J, Webster RK, Rees JL, et al. TIDieR-Placebo: a guide and checklist for reporting placebo and sham controls. PLoS Med 2020; 17(9): e1003294. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. NEJM evidence: threats to blinding, https://www.youtube.com/watch?v=IFLNDmQLgVA (accessed 31 May 2024). [DOI] [PubMed]
- 9. Reid E, Kamlin OF, Orsini F, et al. Success of blinding a procedural intervention in a randomised controlled trial in preterm infants receiving respiratory support. Clin Trials 2023; 20(5): 479–485. [DOI] [PubMed] [Google Scholar]
- 10. Gupta A, Thompson D, Whitehouse A, et al. Adverse events associated with unblinded, but not with blinded, statin therapy in the Anglo-Scandinavian Cardiac Outcomes Trial—Lipid-Lowering Arm (ASCOT-LLA): a randomised double-blind placebo-controlled trial and its non-randomised non-blind extension phase. Lancet 2017; 389: 2473–2481. [DOI] [PubMed] [Google Scholar]
- 11. Betensky RA. Don’t be blinded by the blinding. NEJM Evid 2022; 1(3): EVIDe2100063. [DOI] [PubMed] [Google Scholar]
- 12. Hohenschurz-Schmidt D, Draper-Rodi J, Vase L, et al. Blinding and sham control methods in trials of physical, psychological, and self-management interventions for pain (article I): a systematic review and description of methods. Pain 2023; 164(3): 469–484. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Supplemental material, sj-docx-1-ctj-10.1177_17407745241276124 for Participant’s treatment guesses and adverse events in back pain trials: Nocebo in action? by Javier Muñoz Laguna, Hyangsook Lee, Eduard Poltavskiy, Jeehyoung Kim and Heejung Bang in Clinical Trials

