In this issue of the journal, Rosén et al. report the results of a survey of surgeons regarding placebo effects.1 In particular, they show that surgeons consider “non-specific” effects (including aspects of the surgeon-patient interaction) and “placebo effects” to be important. Their definitions of these terms are clearly stated but not universally accepted. A discussion of these terms is necessary to allow surgeons to better understand the reasons why people improve after surgery.
The terms placebo effect, non-specific effects and contextual effects all refer to responses that are separate to the specific effects of surgery: the effect that results from the anatomical and physiological changes brought about by the surgical procedure. The specific surgical effect is best measured in a high-fidelity placebo trial in which all things are equal except the specific part of the procedure in question.2 If the surgical group has better outcomes than the placebo group, it can be implied that the difference between the groups was due entirely to the specific effect of the surgery.
However, in many placebo surgical trials in orthopaedics, the difference in outcome between placebo surgery and the surgical procedure is not significant, i.e., there is no specific effect of surgery.3–7 Importantly, however, is the observation that in all of these studies, both groups (surgery and placebo surgery) show significant improvements after surgery. What is of interest, and what needs defining, is what causes the observed improvement when that improvement is not due to the specific effect of surgery.
Using the term “placebo effect” to describe the improvement seen after a placebo procedure is common. It is the definition used by Rosén et al. 1 and the definition used in my own book.8 The problem with using that definition is that it suggests that the improvement seen after placebo surgery is due to the placebo. In reality, the improvement is likely to have occurred without the placebo surgery. In fact, placebos, by definition, have no direct effect themselves.
Some of the improvement seen after placebo surgery may be due to contextual effects; what Rosén et al refer to as non-specific effects and others have referred to as “ritual” effects.9 These include patient expectations, the confidence and personality of the surgeon, and even the cost of the procedure. While contextual effects may explain some of the improvement after surgery, these effects are often short-lived and are unlikely to explain larger, sustained improvements.
Three other, often overlooked factors are likely to explain most or all of the improvement after surgery seen in placebo surgical trials, and in much of the surgery we perform in clinical practice. These factors are natural history, regression to the mean and concomitant treatment.
Natural history (what would happen regardless of treatment) explains, for example, the eventual resolution of symptoms from the common cold. It also explains most or all of the improvement in pain that occurs after injuries. Natural history also confounds the treatment of fluctuating conditions such as multiple sclerosis. Without considering natural history in trials of surgery, we may falsely attribute improvement that occurs post-surgery as being due to the surgery (the fallacy of post hoc ergo propter hoc: it follows, therefore it is because of).
Regression to the mean occurs when we select patients who are currently at one end of a spectrum and follow them. Over time, they will fall closer to the mean. A good example is provided by Daniel Kahneman in his book, Thinking, Fast and Slow. He described an experienced flight instructor who recommended punishing bad performance (those at the lower end of the spectrum) and not praising good performance (those at the high end of the spectrum), saying: “On many occasions I have praised well performing cadets… the next time they usually do worse. On the other hand, I have often screamed into a cadet’s earphone for bad execution, and in general he does better on his next try”.10 Both groups were simply regressing to the mean.
Similarly, only selecting people with severe knee pain from a pool of people with osteoarthritis of the knee (a condition in which symptoms fluctuate widely) will make any treatment look good. The average pain in that group will fall closer to the mean over time and, similarly, others in the pool who were not selected will have severe pain later. Regression to the mean is likely to explain the similar improvement reported in knee arthritis symptoms for the multitude of therapies in which a before-and-after analysis is performed.
The phenomena of natural history and regression to the mean are strong reasons for a control group in any study and why caution is advised when interpreting non-comparative studies.
Concomitant treatment is also often overlooked. In a landmark trial comparing (the then new) bone morphogenetic protein (BMP) to (traditional) bone grafting in the treatment of ununited tibia fractures, BMP (with a union rate of 75%) was deemed equally effective as bone grafting (with a union rate of 84%). However, all patients also received intramedullary nailing, which produces similar union rates when used alone as a treatment for ununited tibia fractures.11 It is quite possible that BMP (and bone grafting) added nothing, and that the perceived effectiveness was entirely due to the concomitant treatment of intramedullary nailing.
Placebo surgical trials have a great advantage in determining the effectiveness of surgery because they provide the same contextual effects in both groups, patients are blinded and all other factors are equal between groups. It is my opinion, however, that the use of the term “placebo effect” to describe all the improvement seen after surgery has led to the widespread misinterpretation that the placebo treatment causes the improvement. Surgeons need to understand all the factors that contribute to improvements seen after surgery. These include, apart from any specific effects of surgery, contextual effects around the procedure, the effect of concomitant treatments, and what would have happened anyway (natural history and regression to the mean), and are summarized in the Figure.
Surgeons should also understand that components of the Figure are not to scale. In other words, the relative role of each component will vary widely depending on the procedure. For example, in the surgical management of proximal humerus fractures, it is likely that the specific surgical effect is small, there may be some contextual effects, and the effects of natural history and concomitant treatment (post-operative physical therapy) may be strong. Conversely, the specific surgical effect of surgery to stabilize an unstable knee may be very large, and the role of other effects minor. While the definitions of some or all these effects vary, it is important for surgeons to understand the factors contributing to the observed improvement after surgery.
Ian Harris
Ingham Institute for Applied Medical Research, South Western Sydney Clinical School, UNSW Sydney, Australia
Email:
iaharris1@gmail.com
References
- 1.Rosén A, Sachs L, EKdahl A, Westberg A, Gerdhem P, Kaptchuck T J, Jensen K.. Surgeons’ behaviors and beliefs regarding placebo effects in surgery. Acta Orthop 2021; 92(x): x–x. Epub ahead of print. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Beard D J, Campbell M K, Blazeby J M, et al. . Considerations and methods for placebo controls in surgical trials (ASPIRE guidelines). The Lancet 2020; 395(10226): 828–38. [DOI] [PubMed] [Google Scholar]
- 3.Sihvonen R, Paavola M, Malmivaara A, et al. . Arthroscopic partial meniscectomy versus sham surgery for a degenerative meniscal tear. N Engl J Med 2013; 369(26): 2515–24. [DOI] [PubMed] [Google Scholar]
- 4.Moseley J B, O’Malley K, Petersen N J, et al. . A controlled trial of arthroscopic surgery for osteoarthritis of the knee. N Engl J Med 2002; 347(2): 81–8. [DOI] [PubMed] [Google Scholar]
- 5.Beard DJ, Rees J L, Cook J A, et al. . Arthroscopic subacromial decompression for subacromial shoulder pain (CSAW): a multicentre, pragmatic, parallel group, placebo-controlled, three-group, randomised surgical trial. The Lancet 2018;391(10118): 329–38. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Paavola M, Malmivaara A, Taimela S, et al. . Subacromial decompression versus diagnostic arthroscopy for shoulder impingement: randomised, placebo surgery controlled clinical trial. BMJ 2018; 362: k2860. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Schrøder C P, Skare Ø, Reikerås O, Mowinckel P, Brox J I.. Sham surgery versus labral repair or biceps tenodesis for type II SLAP lesions of the shoulder: a three-armed randomised clinical trial. Br J Sports Med 2017; 51(24): 1759–66. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Harris I. Surgery, the ultimate placebo. Sydney: NewSouth Publishing; 2016. [Google Scholar]
- 9.Green S A. Surgeons and shamans: the placebo value of ritual. Clin Orthop Relat Res 2006; 450: 249–54. [DOI] [PubMed] [Google Scholar]
- 10.Kahneman D. Thinking, fast and slow. New York: Farrar, Straus and Giroux; 2011. [Google Scholar]
- 11.Brinker M R, O’Connor D P.. Exchange nailing of ununited fractures. J Bone Joint Surg Am 2007;89(1): 177–88. [DOI] [PubMed] [Google Scholar]