Skip to main content
Osteoarthritis and Cartilage Open logoLink to Osteoarthritis and Cartilage Open
. 2020 Nov 4;2(4):100113. doi: 10.1016/j.ocarto.2020.100113

Moderators of the effects of therapeutic exercise for people with knee and hip osteoarthritis: A systematic review of sub-group analyses from randomised controlled trials

JG Quicke a,, J Runhaar a,b, DA van der Windt a, EL Healey a, NE Foster a, MA Holden a
PMCID: PMC9718168  PMID: 36474874

Summary

Objective

1) To identify potential moderators of the effect of therapeutic exercise explored in randomised controlled trials (RCTs) of knee and hip osteoarthritis (OA); 2) summarise the extent, strength and quality of evidence reported for moderators.

Design

Systematic review (PROSPERO CRD42019148074). Inclusion criteria: a) RCTs with sub-group analyses investigating potential moderator variables; b) participants with knee and/or hip OA; c) therapeutic exercise interventions compared to either no exercise control or alternative exercise intervention(s), and; d) measuring pain or physical function outcomes. Included RCTs’ risk of bias and sub-group analysis quality were assessed. Data were extracted on sub-group analyses (methods and potential moderators), outcomes (pain and function) and sub-group findings (associated statistics of potential moderator∗intervention effects). Findings were analysed using narrative synthesis.

Results

14 RCTs were included; 13 knee OA RCTs (n = 2743 participants) explored 23 potential moderators and 1 hip OA RCT (n = 203) explored 6 potential moderators. Sub-group analysis quality was mixed. Knee varus malalignment was the only moderator of therapeutic exercise compared to non-exercise control in 1 RCT (WOMAC-pain adjusted difference 12.7 in the neutral alignment sub-group and 1.8 in the malaligned sub-group, interaction term: p = 0.02). Varus thrust, knee laxity/instability, obesity and cardiac problems all moderated the effect of therapeutic exercise on pain or function compared to different comparison exercise.

Conclusions

Therapeutic exercise may be effective for reducing pain in people with knee OA and neutral alignment but not for those with varus malalignment. The exercise moderator literature is limited. More robust evidence is required to inform sub-group exercise selection.

Keywords: Osteoarthritis, Knee pain, Hip pain, Exercise, Moderators, Systematic review

1. Introduction

Therapeutic exercise (TE) is physical activity that is planned, structured, repetitive and purposeful for the improvement or maintenance of a specific health condition [1]. Strengthening and aerobic TE are recommended for people with osteoarthritis (OA), irrespective of condition severity and comorbidity [2,3]. Meta-analyses of randomised controlled trials (RCTs) of TE interventions in people with knee and/or hip osteoarthritis (OA) provide strong evidence of, on average, small-to-medium effects on pain and function compared to non-exercise controls [[4], [5], [6], [7]]. However, effect sizes may hide wide individual variability in response to TE, and there may be sub-groups showing much larger or smaller benefits.

Baseline variables that are associated with the effect of an intervention are known as moderators [8]. In people with knee and/or hip OA, it is possible that participant factors such as sociodemographics (e.g. age), clinical assessment findings (e.g. the presence of comorbidity), or biomechanical factors (e.g. joint malalignment) may moderate the effects of TE interventions, which may inform identification of sub-groups who are likely to benefit (or not). Identifying moderators may inform targeted therapy, and has the potential to improve healthcare efficiency, for example by judiciously matching specific types of therapeutic exercise to sub-groups of patients most likely to respond best or by highlighting sub-groups who respond poorly to exercise where other approaches may be more effective and cost-effective. To date, according to a scoping Pubmed and PROSPERO search, no systematic review has summarized available evidence about potential moderators of TE in knee and/or hip OA that also considers the methodological quality of these sub-group analyses. This systematic review therefore aims to: 1) identify potential moderators of the effect of TE explored or tested in RCTs of people with knee and/or hip OA; and 2) summarise the evidence reported for individual moderators. Providing a summary of the extent, strength, and quality of evidence for moderators of the effect of TE interventions for OA may help enable optimal targeting of future TE interventions and inform future moderator research by highlighting gaps in the current evidence base and areas that require further confirmatory studies.

2. Method

A comprehensive, 2-stage, prospectively registered (PROSPERO CRD42019148074) systematic review following PRISMA reporting guidelines [9] was conducted to identify RCTs or quasi-RCTs that:

a) were carried out with any attempt at sub-group analyses investigating potential moderator variables (including secondary analysis studies from RCTs); b) included participants with knee and/or hip OA (including self-reported pain/OA in adults aged 45 years and over, clinical or radiographical diagnoses); c) compared TE interventions to either no exercise control (waiting list, no treatment or usual care) or alternative TE intervention(s), and; d) measured self-report pain or physical function outcomes (see Table 1).

Table 1.

Study eligibility criteria.

Inclusion criteria Exclusion criteria
Study Methods
  • Randomised controlled trials

  • Quasi-randomised controlled trial (where the method of allocation is known, but is not considered strictly random, e.g. alternation, medical record number).

  • Observational studies/non-randomised controlled trials

Publications
  • Full text, published studies

  • All countries/languages

Participants
  • Knee and/or hip pain in adults aged 45 years and over (mean age over 45 years)

  • Knee and/or hip OA diagnosed by x-ray

  • Knee and/or hip OA diagnosed according to clinical criteria or by a health care professional

  • Self-reported knee and/or hip OA

  • N.B: If population is mixed (e.g. OA and rheumatoid arthritis, include if over 50% of participants have OA

  • Knee and/or hip pain attributable to conditions other than OA

  • Rheumatoid arthritis/other defined inflammatory rheumatological problems

  • Pre-operative or postoperative patients (people on waiting lists for knee/hip surgery or immediately following surgery)

  • People with ‘patellofemoral pain syndrome’

  • Animal based studies

  • Studies of children

Intervention
  • Any therapeutic exercise intervention (land or water based), regardless of content, duration, frequency, or intensity

  • Non exercise interventions

  • Advice only to exercise or increase physical activity, including within wider OA self-management programmes

  • Exercise or physical activity that was not specifically applied to improve or maintain knee and/or hip OA

  • Exercise combined with other treatment modalities other than advice/education/self-management/motivational techniques)

Comparator
  • Other forms of therapeutic exercise (i.e. different type, duration, frequency or intensity of exercise if sufficiently different from the intervention arm), or no exercise control group (including usual care, waiting list, placebo, attention control, or no treatment)

  • If intervention groups receive identical therapeutic exercise interventions

  • If the comparator is a different active intervention other than usual care, waiting list, placebo, attention control, or no treatment (e.g. injection, opioids, weight loss etc.)

Outcomes
  • Contains at least one self-report measure of pain and/or function

Search stage 1 was an update of a previous systematic review [6] to identify RCTs of TE for participants with hip and/or knee OA. Electronic database searches were run from the date of the previous review (March 2012) until April 2020 in Medline, Embase CINAHL, AMED, HMIC (Health Management Information Consortium), Cochrane Database of Systematic Reviews Cochrane Controlled Clinical Trials (CENTRAL), DARE (Database of Reviews of Effectiveness), NHS EED (Economic Evaluations Database), and Web of Science. The search involved combining keywords and database MeSH headings for knee and/or hip pain and OA, exercise and physical activity, and the Cochrane highly sensitive RCT search [10]. The full search strategies for Medline and Embase are shown in Appendix 1.

Iterative screening of titles and abstracts and then full texts was carried out independently by two reviewers (JQ, MH, AB, JR, EH) with conflicts resolved via discussion and a third reviewer if required (JQ, MH, DvdW). Reference lists of included RCTs were also screened to identify other potentially relevant RCTs.

Search stage 2 involved further searches based on the RCTs identified during stage 1, including reference checking and searching the publications of the first and last authors (Medline). This allowed identification of RCTs with sub-group analysis and linked secondary sub-group analysis studies from eligible RCTs in separate publications. As in stage one, screening of titles, abstracts and full texts was completed independently by two reviewers with a third reviewer resolving disagreements if necessary.

2.1. Data extraction

Data from included studies were extracted on: RCT study identification details (lead author/year of publication/country of study), participant data (number/OA diagnosis type/OA joint), intervention and control/comparator arm data (TE and comparator type), sub-group analysis carried out (methods and potential moderator sub-group variables including how they were measured), outcome measure data (self-report pain and function and the primary outcome time point these were measured). Where more than one measure of self-reported pain and physical function were reported in sub-group analyses, the highest in the hierarchy of outcome measures, as recommended by the Cochrane Musculoskeletal Review Group was used [11]. Data was also extracted on sub-group findings and associated statistics (including tests of potential moderator∗intervention effects).

2.2. Quality appraisal

Included RCTs were assessed for risk of bias. The Cochrane Risk of Bias Tool was used to assess: selection bias; performance bias; detection bias; attrition bias; reporting bias, and; other bias [12]. “Other bias” was used to cover major concerns regarding precision, contamination, and issues of sampling frame generalizability. For example, pilot studies, studies where concerning numbers of participants undertook co-interventions that may bias findings and those with highly selected sampling frames which were not likely to be generalisable to the wider OA population were considered at high risk of bias. Blinding of participants to exercise intervention was not formally evaluated since all the included studies were not able to blind the participants to the fact they were carrying out exercise. A conservative RCT “summary” risk of bias judgement was allocated based on the lowest rating for any individual risk of bias domain. Studies were not excluded based on risk of bias. Sub-group analysis quality appraisal was carried out using the criteria recommended by Pincus and colleagues [13]. These included: whether the sub-group analysis was specified a-priori; whether the selection of potential sub-group analyses was based on clinical rationale or existing research findings; whether the sub-group variable(s) was adequately measured and measured pre-randomisation; if there was a test of interaction between the potential moderator and intervention and, if so, was this statistically significant (p < 0.05). Sub-group analyses were scored between 0 and 5 depending on the number of quality criteria they satisfied. Those scoring 5 were deemed of “higher quality” evidence with scores less than five considered “insufficient” quality evidence. Findings from the risk of bias and quality appraisals were used to inform the discussion and review conclusion strength. Data extraction and quality assessment were completed independently by two reviewers (JQ, MH, AB, JR, EH) with a third reviewer resolving disagreements if necessary (JQ, MH, DvdW).

2.3. Data synthesis

In addressing objective 2 (summarising the reported moderator evidence), only the sub-group analyses that reported explicit statistical interaction tests (potential moderator∗intervention effects) were included in the final summary of evidence findings synthesis [14]. Findings were analysed for each potential moderator using narrative synthesis summarising: a) the amount of evidence (number of RCTs and participants); b) the direction, strength, and statistical significance of sub-group (moderation) effects; c) consistency of results across trials and outcome domains, and; d) level of evidence for the moderator (based on moderation analysis quality). Narrative synthesis was carried out due to the substantial heterogeneity between studies in terms of study populations, interventions, sub-group analyses undertaken and outcome measures used. Moderator findings were collated into categories by control group (non-exercise control or other TE comparator), OA phenotypes (knee or hip OA) and clinical outcome domains (pain and physical function).

3. Results

In total 149 RCTs of TE for knee and/or hip OA were identified, of which 14 conducted sub-group analysis and were included in this review. Fig. 1 provides a flow diagram of included studies.

Fig. 1.

Fig. 1

Included studies flow diagram.

3.1. Study characteristics and moderators investigated

Table 2 describes the included studies. The majority of included RCTs focused on participants with knee OA (n = 13; 2743 participants); only one RCT included participants with hip OA (n = 203 participants). The RCTs were undertaken in 7 different countries (UK n = 3, USA n = 3, Netherlands n = 3, Australia n = 2, Belgium, Japan, Sweden). TE interventions were mainly lower limb strengthening and/or aerobic exercise delivered over 6–26 weeks. Eight RCTs had a non-exercise control (including the hip OA RCT) [[15], [16], [17], [18], [19], [20], [21], [22]] and six RCTs compared different TE interventions [[23], [24], [25], [26], [27], [28]]. Most RCTs explored both pain and function outcomes (n = 11). The sub-group analyses were generally explored using outcome data at the post-intervention time-point (most commonly at 12 weeks).

Table 2.

Description of included studies.

Trial identification
(RCT author, year of publication, country, primary or secondary analysis paper)
Participant details
(OA diagnosis, number and knee/ hip joint)
Potential moderator sub-group analysis Therapeutic exercise intervention Comparator Outcomes
(Domain(s), measures and primary follow-up time point)
Was the main effect for the exercise intervention versus comparator statistically significant?#
Knee OA RCTs
Beckwee 2017: Belgium (P) Symptomatic OA (ACR); knee; N = 35 Bone marrow lesions Strength training Walking training Pain (intermittent and constant osteoarthritis pain questionnaire) PI 18/52 No
Bennell 2014 Australia (S) Bennell 2015 Symptomatic, radiographic OA (KL2+); knee; N = 92 Varus thrust
Obesity
Varus alignment
Isometric quadriceps strength
Quadriceps strengthening Neuromuscular exercise Pain (VAS pain in last week)
Function (WOMAC function scale) PI 12/52
No
No
De Rooij 2017 Netherlands (P) Symptomatic OA (ACR) and comorbidity; knee; N = 126 Obesity Comorbidity adapted aerobic and strength exercises Current medical care and exercise waiting list Pain (NRS pain in the last week; WOMAC pain scale)
Function (WOMAC function scale) PI 20/52
Yes

Yes
Ettinger 1997
USA
(S)- Foy 2005 pooled data from FAST and ADAPT RCTs
(S)- Penninx 2002
(S)- Mangani 2006
Symptomatic, radiographic OA; knee; N = 584 combined Ethnicity



Depression
Presence of comorbidity
Strength training (I1) Or Aerobic exercise (I2) training Health education control Pain (pain intensity Likert scale 1-6)
Function (FAST Physical Functioning Questionnaire Index) PI 18/12
Yes

Yes
Hay 2006
United Kingdom
(S) Legha 2019 pooled data from TOPIC and BEEP RCTs
Symptomatic OA; knee;
N = 217
Overweight/obesity
Pain elsewhere
Anxiety/depression
Cardiac problems
Diabetes Mellitus
Respiratory conditions
Community physiotherapy exercise Non exercise control Pain (WOMAC pain)
Function (WOMAC function) PI 6/12
Yes
Yes
Hay 2018
United Kingdom
(S) Legha 2019 pooled data from TOPIC and BEEP RCTs
Symptomatic OA; knee;
N = 514
Overweight/obesity
Pain elsewhere
Anxiety/depression
Cardiac problems
Diabetes Mellitus
Respiratory conditions
Individually tailored exercise (I1)

Targeted exercise adherence (I2)
Usual physiotherapy led exercise Pain (WOMAC pain)
Function (WOMAC function) PI 6/12
No
No
Hurley 2007
United Kingdom (P)
Symptomatic OA; knee; N = 418 Anxiety & depression Exercise confidence and beliefs Individualised rehabilitation (I1)
Group rehabilitation (I2)
Usual primary care Pain (WOMAC pain scale)
Function (WOMAC function scale) FU 6/12
Yes
Yes
Knoop 2013
Netherlands
(S) Knoop 2014
Symptomatic knee OA and knee instability; knee; N = 159 Upper leg muscle strength
Knee proprioception
Knee laxity
Knee instability
Tailored knee stability exercise Muscle strengthening Pain (NRS pain in the last week; WOMAC pain scale)
Function (WOMAC function scale) PI 12/52
No

No
Kudo 2013
Japan (P)
Symptomatic, radiographic OA (KL1 or more); knee; N = 209 Age
BMI
Flexion contracture
Quadriceps strength
Radiographic knee OA severity
Meniscal MRI Mink grade
Subchondral bone lesion grading
Femorotibial angle
Group exercise Home exercise Pain and function (Normalized WOMAC out of 100) considered as a pain outcome.
PI 3/12
Yes
Lim 2008
Australia (P)
Symptomatic radiographic OA; knee; N = 107 Knee varus malalignment Quadriceps strengthening Non exercise control Pain (WOMAC pain scale)
Function (WOMAC function scale) PI 12/52
Yes
No
Messier et al 2004
USA
(S)- Foy 2005 pooled data from FAST and ADAPT RCTs
Symptomatic, radiographic OA; knee; N = 584 combined Ethnicity Exercise only Healthy lifestyle education Physical Functioning Questionnaire Index) PI 18/12 No
Mikesky et al 2006
USA (P)
Symptomatic and radiographic OA and adults aged ≥ 55; knee: N = 221 Radiographic knee OA severity Strength training Range of movement exercise Pain (WOMAC pain scale)
Function (WOMAC function scale) PI 12/52
No
No
Thorstensson 2005
Sweden (P)
Symptomatic, radiographic OA (KL III or more); knee; N = 61 Pain Exercise group Non-intervention attention control Pain (KOOS pain)
Function (KOOS function) PI 6/52
No
No
Hip OA RCT
Teirlinck et al 2016
Netherlands (P)
Symptomatic OA (ACR); hip; N = 203 Age
Sex
Pain level
Education
Radiographic hip OA severity
Knee OA
Usual general practitioner care and exercise Usual general practitioner care Pain (HOOS pain)
Function (HOOS function) PI 12/12
No
Yes

Key: BOLD = primary randomised control trial report publication; ACR = American College of Rheumatology definition; BMI= Body Mass Index; FU = follow up; HOOS=Hip Disability and Osteoarthritis Outcome Score; I = intervention; KL= Kellgren Lawrence; KOOS= Knee injury and Osteoarthritis Outcome Score; MRI = magnetic resonance imaging; N = number; NRS=Numerical Rating Scale; P = data extracted from a primary randomised controlled trial report publication; PI = post intervention; S = data extracted from a randomised controlled trial secondary data analysis study publication; WOMAC; # = If “Yes” statistically significant (p < 0.05) main effect favours exercise intervention 1 over control or exercise intervention 2 at the primary outcome time point unless otherwise stated.

In knee OA RCTs, 23 potential different moderators were investigated, including: being overweight or obese (n = 5 RCTs), anxiety/depression (n = 4 RCTs) quadriceps strength (n = 3 RCTs); bone marrow lesions, ethnicity, knee OA radiographic severity, number of comorbidities, types of comorbidities (cardiac problems, diabetes mellitus, pain elsewhere, respiratory conditions), varus knee malalignment (all in n = 2 RCTs); age, comorbidity presence, exercise confidence and beliefs, femorotibial angle, knee flexion contracture, knee laxity, knee instability, knee proprioception, pain intensity, meniscal abnormality, varus thrust (all in n = 1 RCT only). In the hip OA RCT six potential moderators were investigated: age, education, sex, pain intensity, hip OA radiographic severity, presence of knee OA [22].

3.2. Moderator findings

3.2.1. Knee OA-therapeutic exercise vs non-exercise control

Of the seven RCTs that provided sub-group analysis data on potential moderators of TE effects compared to a non-exercise control [[15], [16], [17], [18], [19], [20], [21]] five explored pain outcomes and five explored function outcomes (see Table 3). Only four RCTs [[16], [17], [18], [19]] including three secondary analyses studies [[29], [30], [31]] from the Fitness Arthritis and Seniors Trial (FAST) and the Treatment Options for Pain In the Knee (TOPIK) trial explicitly carried out interaction tests for moderation and are included in the synthesis below.

Table 3.

Potential moderators investigated and moderator interaction analysis findings.

Study author Potential moderator Potential moderator details Type of sub-group analysis Was a moderator interaction analysis undertaken? Moderator interaction analysis findings**
Pain Function
Knee OA RCTs
Therapeutic exercise vs non-exercise control
De Rooij et al 2017 Obesity BMI 30 and over Description of statistical methods not provided: subgroup analysis only including patients with obesity- interaction appears not to have been tested. Unclear Similar results were found for subgroups as compared to main effects (data not shown) Similar results were found for subgroups as compared to main effects (data not shown)
Ettinger et al 1997: (Penninx et al., 2002) (Foy et al., 2005)
N.B. Used data from Ettinger et al. 1997 andMessier et al 2004
ADAPT & FAST RCTs # (Mangani et al., 2006)
Depression
Ethnicity
Comorbidity
Center for Epidemiologic
Studies Depression short scale (dichotomised at <6 and ≥ 6)
African American or Caucasian
Presence of comorbidity (defined as 2 conditions in addition to knee OA)
Repeated measures ANCOVA to test the effects of exercise vs control on pain score (3,9 and 18 month FU) depression*exercise interaction was tested.
Series of ANCOVAs. Ethnicity investigated only as prognostic factor.
ANOVA to test the effect of exercise compared to health education on percent changes in self-reported disability and knee pain from baseline to 3-, 9-, and 18-month FU. Comorbidity*exercise interaction tested, with significance level adjusted using the Bonferroni method.
Yes
No
Yes
Effects of exercise similar for high and low depression groups with a non-significant interaction term p > 0.15
N/A (no moderator interaction analyses)
No significant interaction was found for any of the follow-up times (p-values for interaction terms >0.1).
N/A (function not investigated)
N/A (no moderator interaction analyses)
No significant interaction was found for any of the follow-up times (p-values for interaction
terms >0.1).
Hay 2006
Legha 2019 pooled data from TOPIC and BEEP RCTs #
BMI overweight/obese Categorised into <25, 25–29.9, and >29.9 Mixed model with an additional term for the interaction with treatment effect Yes No statistically significant interactions found (P > 0.05) No statistically significant interactions found (P > 0.05)
Pain elsewhere Pain other than in knee
Anxiety and depression Single item from EQ-5D
Diabetes Mellitus Self-report (Y/N)
Cardiac problems Self-report (Y/N)
Respiratory conditions Self-report (Y/N)
Number of comorbidities Categorised based on the above comorbidities, 0, 1–2, 3+
Hurley et al 2007 Anxiety & depression HADS self-report depression Multilevel modelling was used to test the effect of exercise vs control. Yes N/A (pain not investigated) No significant interaction was found (p-value not reported).
Exercise confidence and beliefs Exercise health beliefs and self-efficacy questionnaire Moderator*exercise interactions were tested using separate models for depression, anxiety, exercise beliefs, and exercise self-efficacy (6 months FU).
Lim et al 2008 Knee varus malalignment ≥5° varus Two-way ANOVA was performed to test the effect of strengthening exercise vs control on change in outcome scores. Malalignment *exercise interaction was tested. Yes Malalignment was statistically associated with the effect of quadriceps strengthening (interaction term statistically significant: p = 0.024). In the neutral alignment group, strengthening participants reported a significant pain reduction over time compared with control participants (adjusted difference 12.7; P <0.001).
There was no significant pain reduction in the more malaligned group (adjusted difference 1.8; P = 0.592).
No significant interaction was found (p-value 0.062).
Thorstensson et al 2005 Pain Dichotomised around the median at baseline (<57 or 58≥/100 KOOS pain) Description of statistical methods for post-hoc subgroup analyses were not provided. Unclear Changes seen in subgroups were not different from changes seen in the total groups. N/A (function not investigated)
Therapeutic exercise vs comparison exercise
Beckwee et al 2017 Bone marrow lesions No bone marrow lesions on MRI of the tibia or femur or 1 or more bone marrow lesions present Within subgroup changes were analysed with Wilcoxon signed rank tests. Effect sizes (r) were calculated by dividing the Z score of the Wilcoxon signed rank test by the root of the number of observations. Between-subgroup differences were analysed with Mann–Whitney U-tests or Fisher's exact test. Interactions appear not to have been tested. No N/A (no moderator interaction analyses) N/A (no moderator interaction analysis and function not investigated)
Bennell et al 2014 (Bennell et al., 2015) Varus thrust Assessed by two experienced researchers from gait videos Between-group ANCOVA to test effects between strengthening and neuromuscular exercise,Moderator*exercise interactions were tested. Yes Significant interaction effect between type of exercise and varus thrust (P = 0.001). Neuromuscular exercise was more effective for people without varus thrust (effect estimates for subgroups not reported) None of the candidate moderators influenced the difference between the two types of exercise for physical function (interactions P > 0.05)
Obesity BMI 30 and over Yes Significant interaction effect between type of exercise and obesity (P = 0.023). Quadriceps strengthening was more effective for obese people than for those non-obese (effect estimates for subgroups not reported).
Varus alignment Varus malalignment of <5° or ≥5° Yes No other subgroup effects (significant interactions) reported
Isometric quadriceps strength Participants were classified as weaker (<1.32 Nm/kg for females and <1.50 Nm/kg for males) or stronger (≥1.32 Nm/kg for females or ≥1.50 Nm/kg for males)
Hay 2018
Legha 2019 data from BEEP RCT
BMI overweight/obese Categorised into <25 25–29.9, and >29.9 Mixed model with an additional term for the interaction with treatment effect Yes No significant associations were found for the effect on pain of a specific type of exercise. The presence of cardiac problems was statistically significantly associated with the effect of enhanced exercise compared to usual exercise (p0.041), enhanced exercise may be less effective than usual physiotherapy-led exercise for improving function in people with cardiac problems.
Other comorbidities and number of comorbidities were not associated with the effect of a specific type of exercise.
Pain elsewhere Pain other than in knee
Anxiety and depression Single item from EQ-5D
Diabetes Mellitus Self-report (Y/N)
Cardiac problems Self-report (Y/N)
Respiratory conditions Self-report (Y/N)
Number of comorbidities Categorised based on the above comorbidities, 0, 1–2, 3+
Knoop et al 2013 (Knoop et al., 2014) Upper leg muscle strength Assessed for extension and flexion using an isokinetic dynamometer (D) Generalized estimating equation analyses were used to compare the effect of tailored stability versus strengthening exercises. Yes No significant interaction, subgroup effect estimates not reported. Significant interaction found for upper leg muscle strength with type of exercise (p = 0.01), suggesting tailored stability exercise may be more effective than strengthening exercise in persons with higher baseline muscle strength, while strengthening exercises may be more effective in persons with lower baseline muscle strength (effect estimates for subgroups not reported).
No interactions were found for other candidate moderators
Knee proprioception Knee proprioception (proprioceptive accuracy assessed using knee joint motion detection task) (D) Moderator*treatment interactions were tested. Yes No significant interaction, subgroup effect estimates not reported.
Knee laxity Continuous, amount of total passive movement in frontal plane after fixed valgus and varus load of 7.7 Nm (D) Yes Significant interaction found for laxity with type of exercise (p < 0.001 for pain), suggesting tailored stability exercises may be more effective than strengthening exercises in those persons with lower laxity compared to those with higher laxity
Episodes of knee instability Dichotomous, none/seldom and regularly/very often Yes Significant interactions found for knee instability (p = 0.05 for pain), suggesting tailored stability exercises may be more effective than strengthening exercise in those with more frequent knee instability (effect estimates for subgroups not reported)
Kudo et al 2013 Age (<65 or ≥65) 1. Groups of home exercise and group exercise were divided into ‘more effective group’ and ‘less effective group’ according to W (improvement in the normalized WOMAC) 2. Subjects of group exercise were divided into 2 groups according to baseline factors then W compared between groups using Mann-Whitney U test. Significance set at p < 0.05. No N/A (no moderator interaction analyses) N/A (no moderator interaction analyses)
BMI (<25 or ≥25)
Flexion contracture (<5° or ≥5°)
Quadriceps strength (split by median)
Radiographic severity (KL 1–2 or 3–4)
Meniscus abnormality Meniscal MRI Mink grade (0–2 or 3)
Subchondral bone lesions MRI (Normal, spot, moderate, large)
Femorotibial angle (<178° or ≥178°)
Mikesky et al 2006 Radiographic OA severity (KL 0–1 and 2–4) Stratified analyses performed for knees with and without radiographic evidence of OA at baseline (i.e., K/L grade 2–4 and K/L grade 0–1, respectively). Interaction was not tested. No N/A (no moderator interaction analyses) N/A (no moderator interaction analyses)
Hip OA RCT
Teirlinck et al 2016 Age (<65 or ≥65) Linear mixed models for repeated measurements were used for main effects. Methods for a-priori defined exploratory subgroup analyses are not provided in detail, but interactions appear not to be tested. Unclear N/A no interaction moderator analyses N/A no interaction moderator analyses
Education Lower than higher vocational education or higher vocational education or university
Sex Male or female
Presence of Knee OA Self-report
Pain level (NRS <3 or ≥3)
Radiographic hip OA severity KL (0–1 or ≥2)

Key: BOLD = authors in bold indicate a primary randomised control trial report publication, whilst non-bold authors names indicate a linked secondary analysis publication; ANCOVA = analysis of covariance; ANOVA = analysis of variance; BMI= Body Mass Index; FU = follow up; D = a continuous measure dichotomised around the median; HOOS=Hip Disability and Osteoarthritis Outcome Score; I = intervention; KL= Kellgren Lawrence; KOOS= Knee injury and Osteoarthritis Outcome Score; MRI = magnetic resonance imaging; N = number; NRS=Numerical Rating Scale; P = data extracted from a primary randomised controlled trial report publication; PI = post intervention; S = data extracted from a randomised controlled trial secondary data analysis publication; WOMAC=Western Ontario and; # = secondary analysis study combining data from more than one similar randomised controlled trials; * = interaction; **moderator results only reported where explicit moderator interaction analyses carried out (prognostic factors of treatment outcome irrespective of intervention arm or for single treatment arms not reported).

Knee varus malalignment was the only moderator of the effect of TE on pain compared to a non-exercise control (interaction term statistically significant: p = 0.02, “high quality”level moderation analysis, from a low risk of bias RCT). TE was more effective compared to a non-exercise control, for reducing pain in people with knee OA with neutral alignment but not in those with varus malalignment (≥5° varus) [19]. The variables that have undergone exploratory sub-group moderation analyses with statistical interaction testing that did not moderate pain treatment effect included: obesity, anxiety and depression, cardiac problems, diabetes mellitus, pain elsewhere, respiratory conditions, presence of comorbidity, number of comorbidities, exercise confidence and beliefs [17,18,[29], [30], [31]].

Of the variables that were explored as potential moderators of function outcome, with interaction testing, none were statistically significant (obesity, anxiety and depression, cardiac problems, diabetes mellitus, pain elsewhere, respiratory conditions, presence of comorbidity, number of comorbidities, exercise confidence and beliefs, knee varus malalignment) [[17], [18], [19],[29], [30], [31]].

3.2.2. Knee OA-therapeutic exercise vs other exercise comparator

Six RCTs provided sub-group analysis data on potential moderators of the effects of specific types or modes of delivery of TE compared to other exercise interventions [[23], [24], [25], [26], [27], [28]]. Of these, 6 explored pain outcomes and 4 explored function outcomes. Only 3 RCTs [[24], [25], [26]] including one secondary analysis study from the Benefits of Effective Exercise for knee Pain (BEEP) trial [31] explicitly carried out interaction tests for moderation and are reported below.

Varus thrust and obesity both moderated the effect on pain of neuromuscular exercise compared to strengthening exercise (P < 0.05, “high quality” level moderation analysis, from an unclear risk of bias RCT) [24]. Neuromuscular exercise was more effective for improving pain than strengthening exercise in people with varus thrust compared to those with no varus thrust. Quadriceps strengthening was more effective for improving pain than neuromuscular exercise in people with obesity compared to those who are not obese. Knee laxity and knee instability both moderated the effect on pain of tailored stability exercises compared to strengthening exercises (P < 0.001, “insufficient” and “high quality” level moderation analysis respectively, from an unclear risk of bias RCT) [26]. Tailored stability exercise was more effective than strengthening exercise in improving pain in people with lower laxity/more frequent instability compared to those with higher laxity/less instability respectively.

Upper leg muscle strength moderated the effect of tailored stability exercise compared to strengthening exercise on function (p = 0.01, “insufficient” level moderation analysis, from an unclear risk of bias RCT [26]. Tailored stability exercise was more effective than strengthening exercise in people with higher baseline muscle strength but strengthening exercise was more effective in those with lower baseline muscle strength. Enhanced exercise was less effective than usual exercise in improving function in people with cardiac problems than in those without (p = 0.04, “insufficient” level moderation analysis, from a low risk of bias RCT) [32]. Obesity, diabetes mellitus, pain elsewhere, respiratory conditions, varus thrust, knee proprioception, knee laxity, knee instability were not found to moderate function outcome.

3.2.3. Hip OA

The hip OA RCT did not explicitly carry out interaction tests within their sub-group analyses (see Table 3) [22].

3.3. RCT risk of bias assessment

Overall summary risk of bias from included studies was generally deemed to be low or unclear. Two RCTs (14% of all studies) were judged to be at high risk of bias in one or more risk of bias domains and therefore judged as high risk of bias overall [22,28]. Seven studies included a mixture of low and unclear risk of bias domain judgements (50%), and five studies (36%) were judged to be at low risk of bias for all outcome domains (Fig. 2). Table 4 highlights the individual RCT Cochrane risk of bias tool scores for each outcome domain.

Fig. 2.

Fig. 2

Summary of risk of bias within the included primary randomised controlled trials.

Table 4.

Risk of bias within included randomised controlled trials.

3.3.

3.4. Methodological assessment of moderator analyses

The overall level of moderator evidence for individual moderators was evaluated as “high quality” in 7 of 44 moderators investigated within studies (16%) and “insufficient” in 37 of 44 (84%) (see Table 5).

Table 5.

Methodological assessment of moderator analyses (as per Pincus et al., 2011).

Study author Criteria
1. Was the subgroup analysis specified a priori? 2. Was selection of factors for analysis justified and either or both i) theory based ii) evidence based? 3. Was the subgroup variable measured prior to randomisation? 4. Was the quality of measurement of baseline factors adequate? 5. Was there an explicit test of the interaction between moderator and treatment? 6. Total Score 7. Level of moderation evidence
Therapeutic exercise vs non-exercise
De Rooij 2017
Obesity N N Y Y N 2 Insufficient
Ettinger 1997
(Penninx 2002) Depression ? Y Y Y Y 4+ Insufficient
(Foy 2005)#
Ethnicity
? Y Y Y N 3+ Insufficient
(Mangani 2006)
Comorbidity
? Y Y Y Y 4+ Insufficient
Hay 2006
(Legha 2019) Overweight/obesity N Y Y Y Y 4 Insufficient
Pain elsewhere N Y Y Y Y 4 Insufficient
Anxiety/depression N Y Y Y Y 4 Insufficient
Cardiac problems N Y Y Y Y 4 Insufficient
Diabetes Mellitus N Y Y Y Y 4 Insufficient
Respiratory conditions N Y Y Y Y 4 Insufficient
Hurley 2007
Anxiety & depression Y Y Y Y Y 5 High quality
Exercise confidence and beliefs Y N Y Y Y 4 Insufficient
Lim 2008
Knee malalignment Y Y Y Y Y 5 High quality
Teirlinck 2016
Age Y N Y Y N 3 Insufficient
Education Y N y Y N 3 Insufficient
Sex Y N Y Y N 3 Insufficient
Pain level Y N y Y N 3 Insufficient
Radiographic OA severity Y N Y Y N 3 Insufficient
Knee OA Y N Y Y N 3 Insufficient
Thorstensson 2005
Pain N N Y Y N 2 Insufficient
Therapeutic exercise versus comparison exercise
Beckwee 2017
Bone marrow lesions Y Y Y Y N 4 Insufficient
Bennell et al 2014
(Bennell 2015) Varus thrust Y Y Y Y Y 5 High quality
Obesity Y Y Y Y Y 5 High quality
Varus alignment Y Y Y Y Y 5 High quality
Isometric quadriceps strength Y Y Y Y Y 5 High quality
Hay 2018
(Legha 2019) Overweight/obesity N Y Y Y Y 4 Insufficient
Pain elsewhere N Y Y Y Y 4 Insufficient
Anxiety/depression N Y Y Y Y 4 Insufficient
Cardiac problems N Y Y Y Y 4 Insufficient
Diabetes Mellitus N Y Y Y Y 4 Insufficient
Respiratory conditions N Y Y Y Y 4 Insufficient
Knoop 2013
(Knoop 2014) Upper leg muscle strength N Y Y Y Y 4 Insufficient
Knee proprioception N Y Y Y Y 4 Insufficient
Knee laxity N Y Y Y Y 4 Insufficient
Episodes of knee instability Y Y Y Y Y 5 High quality
Kudo 2013
Age Y N Y Y N 3 Insufficient
BMI Y Y Y Y N 4 Insufficient
Flexion contracture Y Y Y Y N 4 Insufficient
Quadriceps strength Y N Y Y N 3 Insufficient
Radiographic severity Y N Y Y N 3 Insufficient
Meniscus abnormality Y Y Y Y N 4 Insufficient
Subchondral bone lesion Y Y Y Y N 4 Insufficient
Femorotibial angle Y Y Y Y N 4 Insufficient
Mikesky 2006
Radiographic OA severity Y N Y Y N 3 Insufficient

Key: BOLD = authors in bold indicate a primary randomised control trial report publication, whilst non-bold authors names indicate a linked secondary analysis publication. # = secondary analysis study combining data from more than one similar randomised controlled trials. Y = yes, N = no, ? = unclear.

Of the potential moderators investigated, 55% were justified a-priori. Although all RCTs measured their potential moderator variables adequately and before randomisation, 19 of 44 sub-group analyses investigated (43%) were done so without interaction tests between the potential moderator and intervention arm. An additional methodological factor not included in the Pincus et al. criteria [13] is whether or not studies were powered to identify sub-group effects. Only a single included study [19] reported being powered to carry out sub-group analyses, with all other studies powered only to investigate between-intervention group main effects. Furthermore, this interaction power calculation involved “change in knee adduction movement” as the outcome rather than pain or function.

3.5. Summary of moderator evidence

Table 6 synthesises the existing evidence base for moderators of the effects of TE considering clinical implications, level of evidence, summary risk of bias and consistency of findings across different outcome domains (pain and function). For this synthesis table only studies that explicitly carried out moderator∗intervention interactions are included.

Table 6.

Summary of findings- Evidence from RCTs (with interaction analyses) for moderators of the effects of therapeutic exercise for knee OA.

Moderator (RCT author) No of studies/participants analysed & participant “cases” Concise clinical summary
Pain Level of evidence/risk of bias Function Level of evidence/risk of bias Consistency of findings between outcomes?
Therapeutic exercise vs non-exercise control
Biological Moderators
Comorbidity presence
(Ettinger et al., 1997)
1 RCT
N = 435 analysed
Comorbidity cases = 197
The effectiveness of therapeutic exercise in reducing pain was similar in patients with and without comorbidity. Insufficient/low The effectiveness of therapeutic exercise in improving function was similar in patients with and without comorbidity. Insufficient/low Y
Number of comorbidities
(Hay et al., 2006)
1 RCT
N = 217 analysed
The effectiveness of therapeutic exercise in reducing pain was similar in patients irrespective of their number of comorbidities. Insufficient/unclear The effectiveness of therapeutic exercise in improving function was similar in patients irrespective of their number of comorbidities. Insufficient/unclear Y
Comorbidity types:
  • -

    Cardiac problems

  • -

    Diabetes Mellitus

  • -

    Pain elsewhere

  • -

    Respiratory conditions

  • -

    Overweight/obesity

(Hay et al., 2006)
1 RCT
N = 217 analysed
Cardiac problem cases = 105
Diabetes Mellitus cases = 16
Respiratory cases = 33
Overweight = 49
Obese = 71
The effectiveness of therapeutic exercise in reducing pain was similar in patients with and without various comorbidity types. Insufficient/unclear The effectiveness of therapeutic exercise in improving function was similar in patients irrespective of various comorbidity types. Insufficient/unclear Y
Knee varus malalignment
(Lim et al., 2008)
1 RCT
N = 107 analysed
Varus malalignment cases = 52
Therapeutic exercise was effective in reducing pain in patients with knee OA and neutral alignment but not in those with varus malalignment (>5° varus). Knee varus malalignment likely moderates the effect of therapeutic exercise for reducing pain.∗∗ High quality/low The effectiveness of therapeutic exercise in improving function was similar in patients with and without knee malalignment. High quality/low N
Psycho-social Moderators
Anxiety and Depression
(Ettinger et al., 1997/Hay et al., 2006/Hurley et al., 2007)
3 RCTs (two explored pain and two explored function outcome)
N = 438/217/342 analysed
Heterogeneous case definitions, 98/80/numbers of cases with depression were unreported
The effectiveness of therapeutic exercise interventions in reducing pain was similar in patients regardless of anxiety and depression. Insufficient/low & unclear The effectiveness of therapeutic exercise in improving function was similar in patients regardless of anxiety and depression. Mixed quality/unclear Y
Exercise confidence and beliefs
(Hurley et al., 2007)
1 RCT
N = 342 analysed
Number of cases was unreported
N/A (no available evidence) N/A The effectiveness of therapeutic exercise in improving function was similar in patients regardless of exercise confidence and beliefs. Insufficient/unclear N/A
Therapeutic exercise vs comparison exercise
Biological Moderators
Number of comorbidities
(Hay et al., 2018)
1 RCT
N = 514 analysed
The effectiveness of enhanced vs usual exercise in reducing pain in patients was similar irrespective of their number of comorbidities. Insufficient/low The effectiveness of enhanced vs usual exercise in improving function was similar in patients irrespective of their number of comorbidities. Insufficient/low Y
Comorbidity types:
  • -

    Cardiac problems

  • -

    Diabetes Mellitus

  • -

    Pain elsewhere

  • -

    Respiratory conditions

(Hay et al., 2018)
1 RCT
N = 514 analysed
Cardiac problem cases = 256
Diabetes Mellitus cases = 66
Respiratory cases = 88
The effectiveness of enhanced vs usual exercise in reducing pain was similar in patients with and without various comorbidity types. Insufficient/low The effectiveness of enhanced vs usual exercise in improving function was similar in patients irrespective of various comorbidity types except for cardiac problems.
Enhanced exercise may be less effective in improving function in patients with cardiac problems.a
Insufficient/low Y for all comorbidity types except cardiac problems
Obesity
(Bennell et al., 2014, Hay et al., 2018)
2 RCT
N = 92/514 analysed
Obesity cases = 42/189
The role of obesity as a moderator of exercise vs other exercise in reducing pain was conflicting in the literature.a
Quadriceps strengthening was more effective in reducing pain than neuromuscular exercise for patients with obesity.a
The effectiveness of enhanced vs usual exercise in reducing pain was similar in patients with and without obesity.
Mixed quality/low & unclear The effectiveness of quadriceps strengthening vs neuromuscular exercise in improving function was similar in patients with and without obesity.
The effectiveness of enhanced vs usual exercise in improving function was similar in patients with and without obesity.
Mixed quality/low & unclear Conflicting between outcomes and between studies.
Varus malalignment
(Bennell et al., 2014)
1 RCT
N = 92 analysed
Varus malalignment cases = 50
The effectiveness of quadriceps strengthening vs neuromuscular exercise in reducing pain was similar in patients regardless of varus malalignment. High quality/unclear The effectiveness of quadriceps strengthening vs neuromuscular exercise in improving function was similar in patients regardless of varus malalignment. High quality/unclear Y
Knee laxity
(Knoop et al., 2013)
1 RCT
N = 154
Knee laxity cases = approx. 77 (as dichotomised around the median by gender)
Knee joint stabilising exercise was more effective in reducing pain than strengthening alone for patients without knee laxity. Knee laxity may moderate the effectiveness of joint stabilising exercise for reducing pain.a Insufficient/unclear The effectiveness of knee joint stabilising exercise vs strengthening alone in improving function was similar in patients regardless of knee laxity. Insufficient/unclear N
Knee instability
(Knoop et al., 2013)
1 RCT
N = 154
Knee instability cases = 48
Knee joint stabilising exercise was more effective in reducing pain than strengthening alone for patients with more frequent knee instability. Knee instability may moderate the effectiveness of joint stabilising exercise for reducing pain.a Insufficient/unclear The effectiveness of knee joint stabilising exercise vs strengthening alone in improving function was similar in patients regardless of knee instability. Insufficient/unclear N
Proprioception
(Knoop et al., 2013)
1 RCT
N = 154
Proprioception cases = approx. 77 (as defined around the median by gender).
The effectiveness of joint stabilisation vs strengthening alone in reducing pain was similar in patients regardless of proprioception. Insufficient/unclear The effectiveness of knee joint stabilisation vs strengthening alone in improving function was similar in patients regardless of proprioception. Insufficient/unclear Y
Quadriceps strength
(Bennell et al., 2014, Knoop et al., 2013)
2 RCTs
Total N = 92 + 154
Heterogeneous case definitions
“stronger cases” = 55 + approx. 77
The effectiveness of therapeutic exercise vs other exercise was similar in reducing pain in patients regardless of their baseline quadriceps strength.
The effectiveness of quadriceps strengthening vs neuromuscular exercise in reducing pain was similar regardless of patient baseline quadriceps strength.
The effectiveness of knee joint stabilising exercise vs strengthening alone in reducing pain was similar regardless of patient baseline quadriceps strength.
Mixed quality/unclear The role of muscle strength as a moderator of exercise vs other exercise in improving function was conflicting in the literature.a
The effectiveness of quadriceps strengthening vs neuromuscular exercise for improving function was similar in patients regardless of quadriceps strength.
Knee joint stabilising exercise was more effective in improving function than strengthening alone in patients with higher baseline quadriceps strength.a
Mixed quality/unclear Conflicting between outcomes and between studies.
Varus thrust
(Bennell et al., 2014)
1 RCT
N = 85
Varus thrust cases = 43
Neuromuscular exercise was more effective than strengthening exercise for reducing pain in people without varus thrust. Varus thrust likely moderates the effectiveness of neuromuscular exercise for reducing pain.∗∗ High quality/unclear The effectiveness of quadriceps vs neuromuscular exercise in improving function was similar in patients regardless of varus thrust. High quality/unclear N
Psycho-social Moderators
Anxiety/depression
(Hay et al., 2018)
1RCT
N = 514
Anxiety/depression cases = 140
The effectiveness of enhanced vs usual exercise in reducing pain was similar in patients irrespective of anxiety and depression. Insufficient/low The effectiveness of enhanced vs usual exercise in improving function was similar in patients irrespective of anxiety and depression. Insufficient/low Y
a

Statistically significant moderator, with insufficient level of evidence or conflicting findings across multiple studies; ∗∗statistically significant moderator, with adequate level of evidence (high quality level of evidence moderator analysis and low risk of bias RCT). Mixed quality = data from studies with high quality and insufficient quality moderator analyses.

4. Discussion

This systematic review, according to our background scoping search, is the first to investigate moderators of the effects of TE interventions in people with knee or hip OA. It summarises the extent, strength and quality of the existing moderator evidence base and highlights gaps in the literature that warrant further investigation.

Fourteen RCTs met the systematic review eligibility criteria with most comprising participants with knee OA and only one including participants with hip OA [22]. A wide range of different potential moderators have been explored, including socio-demographic factors, clinical assessment findings, structural and biomechanical factors and psychological factors. This variation suggests uncertainty in the field and a lack of consensus regarding the sub-group variables that are most likely to be moderators. Obesity, anxiety and depression and quadriceps strength have been most commonly investigated (in three or more RCTs), however, there is conflicting evidence as to whether obesity and quadriceps strength act as moderators and anxiety and depression appear not to moderate TE effects. The level of evidence for potential moderators investigated to date was generally insufficient and findings were sometimes inconsistent between different outcome domains. Since no high quality level moderator studies have been repeated and only a single RCT reported being powered for any sub-group analysis [19], caution should be employed when interpreting the summary findings which can be considered as hypothesis generating for future research rather than definitive.

Arguments have been made in the literature both for and against the use of sub-grouping within RCTs to investigate potential moderators [32,33]. Potential benefits include the ability to target exercise therapy and help answer the question “what works best for whom” however, in order to have adequately powered RCTs to test moderators, required sample sizes are very large and existing RCTs are likely inadequately powered for sub-group analyses, may not account for family-wise error in testing multiple hypotheses, and potential moderator sub-groups are often defined without theory or consensus. However, given that very large RCTs are unlikely to be funded that are powered specifically to detect sub-group effects and not all trials are sufficiently homogeneous (in terms of participants, interventions, moderator definitions and outcome measures) to justify combining in individual participant data meta-analyses that could increase statistical power to investigate moderators [33], there remains a pragmatic need to critically investigate exploratory sub-group findings.

4.1. Moderators in RCTs of exercise versus non exercise control

Knee varus malalignment was the only moderator found to be statistically significant in a test of interaction (moderator∗intervention effects) in the category of RCTs investigating TE versus non exercise control. Lim and colleagues [19], in the only study within the review that reported being powered for any sub-group analysis, found that people with knee OA and neutral knee alignment who carried out strengthening exercises reported reduced pain, however, this was not the case for those with varus malalignment. It is possible that strengthening is clinically effective through joints with neutral alignment with load spread across knee compartments but in the presence of aberrant biomechanics the individual may overload and compress individual joint compartment surfaces that are most affected by OA [19,34] without effectively reducing pain (for example, in the presence of varus malalignment, it is plausible that strengthening exercise may overload the medial tibiofemoral compartment).

Comorbidity presence, anxiety and depression, and exercise confidence and beliefs were not found to be statistically significant moderators in interaction tests. Whilst these participant factors may not be moderators, it is also possible that the studies exploring them were insufficiently powered to detect statistically significant moderator∗intervention effects (leading to false negative findings) given the substantial sample sizes required to detect interactions. Additional moderators may exist that have not been investigated yet. This is a particular risk for moderators of TE in people with hip OA given the current dearth of research exploring moderators in this population group.

Comparing our findings to the existing literature, two existing systematic reviews have investigated moderators of TE effects in other musculoskeletal pain presentations; low back pain [33,35] and shoulder pain [36]. Gurung et al. [33] and McRobert et al. [36] investigated sub-group moderator analyses and evaluated moderator analysis quality scoring in the same way as this review [13] whilst Hayden and colleagues [35] carried out an individual participant data (IPD) meta-analysis. Gurung and colleagues [33] identified four large RCTs (total n = 7208) and found strong evidence in one or more studies for moderators of age, employment status and type, back pain status, narcotic medication use, treatment expectations and education. Hayden and colleagues [35] combined IPD from 27 RCTS (n = 3514) and found not having heavy physical demands at work and medication use moderated exercise effects for people with low back pain. McRobert et al. [36], similar to this review, found that very few RCTs had investigated moderation of the effects of interventions for shoulder pain, with none of the trials of sufficient sample size to robustly analyse moderation. Comparing the findings from these reviews to our findings has limitations as different joint pain conditions may have unique moderators (for example, knee malalignment for knee OA and painful arc for shoulder pain). Nevertheless, from these existing reviews, there is no clear and consistent pattern of moderators that have been identified across musculoskeletal pain presentations.

4.2. Moderators of therapeutic exercise versus comparison exercise

To our knowledge, no previous systematic reviews have investigated potential moderators of TE interventions compared to other TE interventions in people with OA or musculoskeletal pain. Of the three knee OA RCTs that carried out moderator analyses with interaction testing varus thrust, obesity, knee laxity and knee instability all moderated the effectiveness of different exercise interventions for knee pain whilst upper leg strength and cardiac problems moderated intervention effect on function outcome in a single RCT [[24], [25], [26]]. The authors of these RCTs offered rationale for some of these potential moderators. Looking at these findings together, a cautious case can be made supporting the hypothesis that targeting and tailoring of exercise to specific biomechanical and clinical assessment characteristics may lead to improved clinical outcomes in people with knee OA. A challenge in externally validating these findings is that sufficiently similar TE intervention versus comparison exercise RCTs are unlikely to exist (preventing secondary data analysis confirming moderator effects) or be funded in the future.

4.3. Strengths and limitations of the systematic review

Systematic review strengths included the prospective registration and comprehensive search strategy. Synthesising findings on the amount, type and level of evidence for individual moderators (including study moderation analysis quality and risk of bias) was a strength in drawing robust conclusions. In this review, only studies that look at sub-group effects using interaction tests were considered true moderator analyses. This criteria is helpful to avoid any confusion with studies who have not carried out statistical testing for moderation or those using “sub-group” or “secondary analysis” language to investigate if baseline variables are associated with clinical outcome irrespective of the treatment received (i.e. prognostic factor research not moderation).

Key limitations of the review include methodological limitations in the moderator analyses investigated in included studies. For example, many studies failed to give a priori justification for their choice of potential moderators and investigated multiple variables (increasing the potential for chance findings). Some studies did not adequately report or test for interaction (moderator∗intervention) and only a single included study was reported to be powered to detect any sub-group moderator effects. Measurement of the same potential candidate moderators (for example quadriceps strength) in different RCTs included in this systematic review was varied. This heterogeneity will lead to future challenges in harmonising and aggregating data sets. Another consideration in investigating potential moderators is the relatively common use of arbitrarily dichotomised continuous scales (see Table 3 for individual examples such as age and pain). This may lead to unwanted loss of information [37] and bias any potential moderator findings towards the null.

A further limitation of this review is the possibility of publication bias, with sub-group findings of underpowered studies being more likely to be reported when statistically significant results are found. As a result of these limitations, the review conclusions about moderators may change in light of further high-quality studies. Finally, the eligibility criteria for RCTs may also make it challenging to investigate the full range of potential moderators of TE. For example, it is possible that cognitive decline or memory loss or residing in a residential home may moderate exercise effects but may also be exclusion criteria in RCTs.

4.4. Research and clinical implications

This review highlights the need for studies that investigate potential moderators of TE effects in people with hip OA. It also highlights candidate variables (such as varus knee malalignment) for further confirmatory moderator investigation in an individual participant data analysis meta-analysis, the gold standard approach to identifying moderators of the effect of exercise for knee and hip OA [38]. By combining multiple similar RCT datasets for IPD the power to detect moderators, if they exist, can be increased. To efficiently advance the moderator literature and facilitate the future harmonisation and aggregation of IPD there is a need to reach consensus on the most likely and clinically applicable candidate moderators (for example, through Delphi studies, nominal group techniques or qualitative research involving researchers, healthcare practitioners and people with OA) and to make recommendations for the optimum ways to consistently measure and report them in future studies.

The moderator analysis methods evaluation within this review highlights areas to improve in future exploratory sub-group analyses including a priori describing the explicit rationale by which moderators are thought to act and the use of interaction tests. Learning from the existing systematic reviews of moderators in other musculoskeletal pain presentations [33,35] other candidate moderators that appear worthy of future investigation in knee and hip OA include, medication use, heavy physical demands at work, employment status, education level and treatment expectations.

An important consideration when interpreting the findings from moderator analyses is how do we evaluate if a moderator is of clinical importance or not? This was not addressed within this review. Future work could consider how best to quantify and interpret the magnitude of moderation in OA studies and make recommendations for the field.

The findings of the sub-group analyses within this systematic review are, by their nature, exploratory. As a result, caution is required in drawing clinical conclusions regarding which patients with knee or hip OA may benefit most from exercise.

5. Conclusion

Of the 149 TE RCTs reviewed, only 13 reported investigating potential moderators of the effects of TE in knee OA and one in people with hip OA. A total of 23 potential moderators have been investigated in knee OA and 6 in hip OA. Only 3 potential moderators were explored in 3 or more RCTs suggesting a lack of consensus about potential candidate moderators. Based on the knee OA RCTs to date, varus knee malalignment (findings from a single RCT with non-exercise control), obesity, cardiac problems, varus thrust, knee laxity, knee instability and upper leg strength (findings from individual RCTs with comparison exercise interventions) have some supporting evidence as moderators of TE.

This review has uncovered common sub-group analysis limitations and a dearth of evidence, particularly in hip OA. Future research is needed that draws on individual participant data meta-analyses of multiple RCTs to increase statistical power for moderator analyses before drawing firm conclusions about how different clinical sub-groups respond to TE.

Contributions

JQ was the overall lead for the work for the systematic review and was involved at all stages of the study and lead the development of this paper. JQ takes responsibility for the integrity of the work. The lead author can be contacted by email: j.g.quicke@keele.ac.uk or at the Primary Care Center Versus Arthritis, Keele University, Keele, Staffordshire, United Kingdom, ST5 5BG.

JQ, JR, DvDV, EH, NF, MH were all involved with the conception of the design. JQ, JR, DvDV, EH, MH were involved in study searching, quality assessment and data extraction. All authors were involved in the drafting, critical revision and finalisation of the manuscript. MH was lead applicant in securing funding for this project.

Role of the funding source

This research was supported by a United Kingdom, National Institute for Health Research (NIHR) School for Primary Care Research grant (SPCR grant number 351). JQ is part funded by a NIHR Clinical Research Network West Midlands, Research Scholar Fellowship. EH is part funded by the NIHR Applied Research Collaboration (ARC) West Midlands. NF was funded by an NIHR Research Professorship (NIHR-RP-011-015) and is a NIHR Senior Investigator. The views expressed are those of the author(s) and not necessarily those of the NIHR or the Department of Health and Social Care.

Declaration of competing interest

There is no conflict of interest for any of the authors.

Acknowledgements

We would like to acknowledge Dr Amy Bourton who was involved in the early stages of the work, study searching, quality assessment and data extraction. Thank you also to Dr Nadia Corp who was involved in the creation of the search filters.

Footnotes

Supplementary data to this article can be found online at https://doi.org/10.1016/j.ocarto.2020.100113.

Appendix A. Supplementary data

The following is the Supplementary data to this article:

Multimedia component 1

mmc1.docx (18.7KB, docx)

References

  • 1.Nicolson P.J.A., Bennell K.L., Dobson F.L., Van Ginckel A., Holden M.A., Hinman R.S. Interventions to increase adherence to therapeutic exercise in older adults with low back pain and/or hip/knee osteoarthritis: a systematic review and meta-analysis. Br. J. Sports Med. 2017;51:791–799. doi: 10.1136/bjsports-2016-096458. [DOI] [PubMed] [Google Scholar]
  • 2.National Institute for Health and Care Excellence . 2014. Osteoarthritis: Care and Management.www.nice.org.uk/CG177 [Google Scholar]
  • 3.Bannuru R.R., Osani M.C., Vaysbrot E.E., Arden N.K., Bennell K., Bierma-Zeinstra S.M.A., et al. OARSI guidelines for the non-surgical management of knee, hip, and polyarticular osteoarthritis. Osteoarthr. Cartil. 2019;27(11):1578–1589. doi: 10.1016/j.joca.2019.06.011. [DOI] [PubMed] [Google Scholar]
  • 4.Fransen M., McConnell S., Hernandez-Molina G., Reichenbach S. Exercise for osteoarthritis of the hip. Cochrane Database Syst. Rev. 2014;22(4):CD007912. doi: 10.1002/14651858.CD007912.pub2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Fransen M., McConnell S., Harmer A.R., Van der Esch M., Simic M., Bennell K.L. Exercise for osteoarthritis of the knee: a Cochrane systematic review. Br. J. Sports Med. 2015;49(24):1554–1557. doi: 10.1136/bjsports-2015-095424. [DOI] [PubMed] [Google Scholar]
  • 6.Uthman O.A., van der Windt D.A., Jordan J.L., Dziedzic K.S., Healey E.L., Peat G.M., et al. Exercise for lower limb osteoarthritis: systematic review incorporating trial sequential analysis and network meta-analysis. Br. Med. J. 2013;347:f5555. doi: 10.1136/bmj.f5555. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Ferreira M., Reijneveld-van de Vendel E.A.E., Teirlinck C.H., Runhaar J., van Middlekoop M. Do we need another trial on exercise in patients with knee osteoarthritis?: No new trials on exercise in knee OA. Osteoarthr. Cartil. 2019;27(9):1266–1269. doi: 10.1016/j.joca.2019.04.020. [DOI] [PubMed] [Google Scholar]
  • 8.Kraemer H.C., Wilson G.T., Fairburn C.G., Agras W.S. vol. 59. Archives of General Psychiatry. American Medical Association; 2002. pp. 877–883. (Mediators and Moderators of Treatment Effects in Randomized Clinical Trials). [DOI] [PubMed] [Google Scholar]
  • 9.Liberati A., Altman D.G., Tetzlaff J., Mullrow C., Gøtzche P.C., Ioannidis J.P.A., et al. The PRISMA statement for reporting systematic reviews and meta-analyses of studies that evaluate healthcare interventions: explanation and elaboration. BMJ. 2009;339:b2700. doi: 10.1136/bmj.b2700. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Higgins J., Green S. Wiley-Blackwell; Wiltshire: 2009. Cochrane Handbook for Systematic Reviews of Interventions. [Google Scholar]
  • 11.Cochrane Musculoskeletal Group. Proposed Outcomes. http://musculoskeletal.cochrane.org/proposed-outcomes (accessed 5 May 2017).
  • 12.Higgins J.P.T., Altman D.G., Gotzsche P.C., Juni P., Moher D., Oxman A.D., et al. The Cochrane Collaboration's tool for assessing risk of bias in randomised trials. Br. Med. J. 2011;343:d5928. doi: 10.1136/bmj.d5928. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Pincus T., Miles C., Froud R., Underwood M., Carnes D., Taylor S.J. Methodological criteria for the assessment of moderators in systematic reviews of randomised controlled trials: a consensus study. BMC Med. Res. Methodol. 2011;11:14. doi: 10.1186/1471-2288-11-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Van Hoorn R., Tummers M., Booth A., Gerhardus A., Rehfuess E., Hind D., et al. The development of CHAMP: a checklist for the appraisal of moderators and predictors. BMC Med. Res. Methodol. 2017;17(1):173. doi: 10.1186/s12874-017-0451-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.de Rooij M., van der Leeden M., Cheung J., van der Esch M., Häkkinen A., Haverkamp D., et al. Efficacy of tailored exercise therapy on physical functioning in patients with knee osteoarthritis and comorbidity: a randomized controlled trial. Arthritis Care Res. 2017;69(6):807–816. doi: 10.1002/acr.23013. [DOI] [PubMed] [Google Scholar]
  • 16.Ettinger W.H., Burns R., Messier S.P., Applegate W., Rejeski W.J., Morgan T., et al. A randomized trial comparing aerobic exercise and resistance exercise with a health education program in older adults with knee osteoarthritis. J. Am. Med. Assoc. 1997;277(1):25–31. [PubMed] [Google Scholar]
  • 17.Hay E.M., Foster N.E., Thomas E., Peat G., Phelan M., Yates H.E., et al. Effectiveness of community physiotherapy and enhanced pharmacy review for knee pain in people aged over 55 presenting to primary care: pragmatic randomised trial. BMJ. 2006;333(7576):995. doi: 10.1136/bmj.38977.590752.0B. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Hurley M.V., Walsh N.E., Mitchell H.L., Pimm T.J., Patel A., Williamson E., et al. Clinical effectiveness of a rehabilitation program integrating exercise, self-management, and active coping strategies for chronic knee pain: a cluster randomized trial. Arthritis Rheum. 2007;57(7):1211–1219. doi: 10.1002/art.22995. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Lim B.W., Hinman R.S., Wrigley T.V., Sharma L., Bennell K.L. Does knee malalignment mediate the effects of quadriceps strengthening on knee adduction moment, pain, and function in medial knee osteoarthritis? A randomized controlled trial. Arthritis Care Res. 2008;59(7):943–951. doi: 10.1002/art.23823. [DOI] [PubMed] [Google Scholar]
  • 20.Messier S.P., Loeser R.F., Miller G.D., Morgan T.M., Rejeski W.J., Sevick M.A., et al. Exercise and dietary weight loss in overweight and obese older adults with knee osteoarthritis: the arthritis, diet, and activity promotion trial. Arthritis Rheum. 2004;50(5):1501–1510. doi: 10.1002/art.20256. [DOI] [PubMed] [Google Scholar]
  • 21.Thorstensson C.A., Roos E.M., Petersson I.F., Ekdahl C. Six-week high-intensity exercise program for middle-aged patients with knee osteoarthritis: a randomized controlled trial [ISRCTN20244858] BMC Muscoskel. Disord. 2005;6:27. doi: 10.1186/1471-2474-6-27. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Teirlinck C.H., Luijsterburg P.A.J., Dekker J., Bohnen A.M., Verhaar J.A.N., Koopmanschap M.A., et al. Effectiveness of exercise therapy added to general practitioner care in patients with hip osteoarthritis: a pragmatic randomized controlled trial. Osteoarthr. Cartil. 2016;24(1):82–90. doi: 10.1016/j.joca.2015.07.023. [DOI] [PubMed] [Google Scholar]
  • 23.Beckwée D., Vaes P., Raeymaeckers S., Shahabpour M., Scheerlinck T., Bautmans I. Exercise in knee osteoarthritis: do treatment outcomes relate to bone marrow lesions? A randomized trial. Disabil. Rehabil. 2017;39(18):1847–1855. doi: 10.1080/09638288.2016.1212110. [DOI] [PubMed] [Google Scholar]
  • 24.Bennell K.L., Dobson F., Roos E.M., Skou S.T., Hodges P., Wrigley T.V., et al. Influence of biomechanical characteristics on pain and function outcomes from exercise in medial knee osteoarthritis and varus malalignment: exploratory analyses from a randomized controlled trial. Arthritis Care Res. 2015;67(9):1281–1288. doi: 10.1002/acr.22558. [DOI] [PubMed] [Google Scholar]
  • 25.Hay E., Dziedzic K., Foster N., Peat G., van der Windt D., Bartlem B., et al. NIHR Journals Library; Southampton (UK): 2018 Jul. Optimal Primary Care Management of Clinical Osteoarthritis and Joint Pain in Older People: a Mixed-Methods Programme of Systematic Reviews, Observational and Qualitative Studies, and Randomised Controlled Trials. [PubMed] [Google Scholar]
  • 26.Knoop J., Van Der Leeden M., Roorda L.D., Thorstensson C.A., Van Der Esch M., Peter W.F., et al. Knee joint stabilization therapy in patients with osteoarthritis of the knee and knee instability: subgroup analyses in a randomized, controlled trial. J. Rehabil. Med. 2014;46(7):703–707. doi: 10.2340/16501977-1809. [DOI] [PubMed] [Google Scholar]
  • 27.Kudo M., Watanabe K., Otsubo H., Kamiya T., Kaneko F., Katayose M., et al. Analysis of effectiveness of therapeutic exercise for knee osteoarthritis and possible factors affecting outcome. J. Orthop. Sci. 2013;18(6):932–939. doi: 10.1007/s00776-013-0443-9. [DOI] [PubMed] [Google Scholar]
  • 28.Mikesky A.E., Mazzuca S.A., Brandt K.D., Perkins S.M., Damush T., Lane K.A. Effects of strength training on the incidence and progression of knee osteoarthritis. Arthritis Care Res. 2006;55(5):690–699. doi: 10.1002/art.22245. [DOI] [PubMed] [Google Scholar]
  • 29.Penninx B.W., Rejeski W.J., Pandya J., Miller M.E., Di Bari M., Applegate W.B., et al. Exercise and depressive symptoms: a comparison of aerobic and resistance exercise effects on emotional and physical function in older persons with high and low depressive symptomatology. J. Gerontol. 2002;57(2):124–132. doi: 10.1093/geronb/57.2.p124. [DOI] [PubMed] [Google Scholar]
  • 30.Mangani I., Cesari M., Kritchevsky S.B., Maraldi C., Carter C.S., Atkinson H.H., et al. Physical exercise and comorbidity. Results from the fitness and arthritis in Seniors trial (FAST) Aging Clin. Exp. Res. 2006;18(5):374–380. doi: 10.1007/BF03324833. [DOI] [PubMed] [Google Scholar]
  • 31.Legha A., Burke D.L., Foster N.E., van der Windt D.A., Quicke J.G., Healey E.L., et al. Do comorbidities predict pain and function in knee osteoarthritis following an exercise intervention, and do they moderate the effect of exercise? Analyses of data from three randomized controlled trials. Muscoskel. Care. 2020;18(1):3–11. doi: 10.1002/msc.1425. [DOI] [PubMed] [Google Scholar]
  • 32.Foster N.E., Hill J.C., Hay E.M. Subgrouping patients with low back pain in primary care: are we getting any better at it? Man. Ther. 2011;16(1):3–8. doi: 10.1016/j.math.2010.05.013. [DOI] [PubMed] [Google Scholar]
  • 33.Gurung T., Ellard D.R., Mistry D., Patel S., Underwood M. Identifying potential moderators for response to treatment in low back pain: a systematic review. Physiotherapy. 2015;101(3):243–251. doi: 10.1016/j.physio.2015.01.006. [DOI] [PubMed] [Google Scholar]
  • 34.Sharma L., Dunlop D.D., Cahue S., Song J., Hayes K.W. Quadriceps strength and osteoarthritis progression in malaligned and lax knees. Ann. Intern. Med. 2003;138(8):613–619. doi: 10.7326/0003-4819-138-8-200304150-00006. [DOI] [PubMed] [Google Scholar]
  • 35.Hayden J.A., Wilson M.N., Stewart S., Cartwright J.L., Smith A.O., Riley R.D., et al. Exercise treatment effect modifiers in persistent low back pain: an individual participant data meta-analysis of 3514 participants from 27 randomised controlled trials. Br. J. Sports Med. 2020;54(21):1277–1278. doi: 10.1136/bjsports-2019-101205. bjsports-2019-101205. [DOI] [PubMed] [Google Scholar]
  • 36.Mc Robert C. Primary care decision-making for shoulder pain: identifying treatment effect moderators using clinical expertise. Keele Res. Repo. - Keele Univ. p. 15-67. Available from: https://eprints.keele.ac.uk/5162/.
  • 37.Altman D.G., Royston P. Statistics Notes 52: the cost of dichotomising continuous variables. Br. Med. J. 2006;332:1080. doi: 10.1136/bmj.332.7549.1080. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Holden M.A., Burke D.L., Runhaar J., Van Der Windt D., Riley R.D., Dziedzic K., et al. Subgrouping and TargetEd Exercise pRogrammes for knee and hip OsteoArthritis (STEER OA): a systematic review update and individual participant data meta-analysis protocol. BMJ Open. 2017;7(12) doi: 10.1136/bmjopen-2017-018971. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Multimedia component 1

mmc1.docx (18.7KB, docx)

Articles from Osteoarthritis and Cartilage Open are provided here courtesy of Elsevier

RESOURCES