Abstract
Background
The Sarcopenia Quality of Life (SarQoL) questionnaire is a disease-specific sarcopenia quality of life tool. We aimed to independently assess SarQoL with a particular focus on its suitability as a clinical trial outcome measure.
Methods
We analysed data from the UK Sarcopenia Network and Registry. Measures of physical performance and lean mass were collected at baseline. SarQoL and the Strength, Assistance, Rise, Climb - Falls (SARC-F) questionnaire (to assess functional ability) were collected at both baseline and six-month follow-up. Global changes in fitness and quality of life at 6 months were elicited on seven-point Likert scales. Internal consistency was assessed using Cronbach’s alpha. Responsiveness (Cohen’s d and Guyatt coefficients) and minimum clinically important differences were calculated for participants reporting slight improvement or worsening in their global scores. Concurrent validity was assessed by correlating baseline SarQoL scores with measures of physical performance and functional ability.
Results
We analysed data from 147 participants, 125 of whom underwent follow up assessment; mean age 78 years; 72 (49%) were women. Internal consistency was good; Cronbach’s alpha was 0.944 at baseline and 0.732 at telephone follow-up. Correlation between baseline and follow-up SarQoL was weak (r = 0.27; p = 0.03). The minimum clinically important improvement ranged from 5 to 21 points giving trial sample size estimates of 25–100 participants. SarQoL scores were moderately correlated with handgrip (r = 0.37; p < 0.001), SARC-F (r = − 0.45; p < 0.001), short physical performance battery (r = 0.48; p < 0.001) and 4-m walk speed (r = 0.48; p < 0.001).
Conclusions
SarQoL has acceptable performance in older UK participants with probable sarcopenia and is sufficiently responsive for use in clinical trials for sarcopenia.
Keywords: Sarcopenia, Quality of life, Validity, Responsiveness, Minimum clinical important difference
Introduction
Sarcopenia, the age-related loss of muscle mass and function, is common and clinically important [1]. It is associated with an increased risk of falls, future disability and dependency, hospital admission and earlier death [2–4]. Finding new ways to prevent or treat sarcopenia is therefore an important area of research. Conducting clinical trials for sarcopenia is however not straightforward. Measuring muscle mass and muscle function are important efficacy outcomes in sarcopenia trials, but health related quality of life is also a key consideration [5]. Whilst there are many generic tools for measuring health related quality of life, until recently tools designed specifically for measuring quality of life in people with sarcopenia were lacking.
The SarQoL questionnaire was introduced in 2015 [6] to fill this gap. Originally derived in French and tested with older people from a Belgian outpatient clinic, it has been translated into multiple languages including English [7]. The questionnaire consists of 55 items organised into 22 questions across seven domains: Physical and mental health, Locomotion, Body composition, Functionality, Activities of daily living, Leisure activities and Fears.
Although some validation work has been performed [8–10], there is a need to test the internal consistency, responsiveness (how much the measure changes when a real change in health occurs) and concurrent validity (how the measure compares to measures that would be expected to be related to the measure under test) in a UK population of older people with sarcopenia. In particular, there is a need to derive the minimum clinically important difference for SarQoL using currently recommended anchor-based methods [11] to enable sample size calculations for clinical trials, but also to enable interpretation of effect sizes derived from intervention studies.
The aim of this analysis was therefore to test the internal consistency, responsiveness and concurrent validity of the SarQoL tool in a cohort of older people with sarcopenia in the UK.
Methods
Study population
We analysed data from the UK Sarcopenia Network and Registry (SarcNet) pilot study; the study design and baseline data have been reported previously [12]. SarcNet was designed as an observational study with a baseline visit and a six-month follow up. The target population was people aged 65 and over with self-reported impairment in physical function. To ensure that as many potentially eligible participants were included in SarcNet, we used a SARC-F score [13] of 3 or more out of 10 at telephone pre-screening (in line with our previous LACE randomised controlled trial and with recent data on the ability of SARC-F to detect patients with probable sarcopenia in similar populations) [13–15] rather than the more commonly used cutoff of 4. Participants were recruited from primary care organisations (General Practices) in the UK and assessed at six hospital-based recruitment sites. Exclusion criteria were life expectancy of less than 6 months in the judgement of the local investigator, participation in an interventional study within the last 30 days. Other exclusion criteria were: presence of a permanent pacemaker with an atrial sensing lead or presence of an implantable cardioverter-defibrillator, peripheral oedema present above knee level or fever at the baseline visit (all contraindications to bioimpedance testing). Previous analysis of baseline data [12] showed that 94% of those in SarcNet fulfilled the 2019 European Working Group on Sarcopenia in Older People (EWGSOP) criteria [16] for probable sarcopenia.
Measures collected
The SarQoL questionnaire (in its English translation) [7] was administered by the research nurse as part of the SarcNet study at baseline and at six-month follow-up to assess quality of life. Baseline visits were conducted face-to-face (in a research clinic or in the participants own home). Due to limitations on face-to-face research activity imposed in the UK as part of the pandemic response to COVID-19, all but eight follow-up visits were conducted by telephone by the research nurses. The SARC-F score was collected by telephone at pre-screening and at the six-month follow-up encounter (whether face to face or by telephone) by the research nurses.
At the baseline visit, maximum hand grip strength was measured using a Jamar hydraulic dynamometer (Lafayette Instrument Company, USA) [17]. Three measurements were taken on each hand and the maximum value was used for analysis. Appendicular lean muscle mass was measured using the Akern 101 bioimpedance system (Akern SRL, Pontassieve, Italy). Resistance and reactance were recorded and the Sergi equation [18] was used to derive appendicular skeletal muscle mass index (appendicular skeletal muscle mass divided by height squared). The Short Physical Performance Battery (SPPB) [19] was conducted, comprising side-by-side, semi-tandem and tandem balance tests, gait speed over 4-m walk distance, and five times sit to stand time from a chair without using arms to assist. At the six-month follow-up visit, all participants were asked two questions to assess global change in a) fitness, and b) quality of life. Change in fitness was assessed by the response on a 7-point Likert scale (much worse to much better) to the statement “Since the first visit, my overall fitness is…”. Change in quality of life was assessed by the response on a similar 7-point Likert scale to the statement “Since the first visit, my overall quality of life is…”. Due to only small numbers of participants recording the most extreme changes on the Likert scale, responses for ‘much better’ and ‘much worse’ were amalgamated with ‘better’ and ‘worse’ respectively. Few participants reported minimal global improvement, thus an additional category of ‘any improvement’ (including those reporting slightly better, better or much better) was also derived.
Statistical analysis
All statistical analyses were conducted using SPSS v26 (IBM, New York, USA). A two-sided p value of < 0.05 was taken as significant for all analyses. Descriptive statistics were generated for the full baseline group, those that dropped out before 6 months, and those that underwent follow up by telephone and face to face at 6 months; means and standard deviations were reported for normally-distributed continuous variables, medians and interquartile ranges were reported for continuous variables that were not normally distributed on visual inspection. Baseline characteristics of those undergoing telephone follow-up were compared with those undergoing face-to-face follow-up and with those who did not undergo assessment at 6 months; Student’s t-test and Mann-Whitney U tests were used for normally and non-normally distributed continuous variables, and Pearson’s chi-squared test was used to compare categorical variables. The internal consistency (a measure of whether individual questionnaire items are related to each other) of SarcQol was analysed using Cronbach’s alpha. The baseline, follow-up and telephone follow-up populations were analysed separately, and internal consistency within each of the SarQoL domains was also assessed. Correlations between each subdomain and both the total baseline SarQoL score and the score without the index subdomain were calculated using Pearson’s correlation coefficient to test whether a single subdomain could substitute for the total score.
Correlation between baseline and six-month follow-up values was assessed using Pearson’s correlation coefficient. This value is a key factor in calculating sample sizes for trial analyses that adjust for baseline values and is more informative for planning trial sample sizes than the intraclass correlation coefficient commonly calculated as part of psychometric assessment of measurement tools. A high correlation between baseline and follow-up values (i.e. a more stable measure) enables greater precision in detecting changes between groups, and the sample size can be reduced according to the formula (1 - r2) for a single time-point follow-up trial [20]. Responsiveness to change was calculated in two ways. Cohen’s d (equivalent to effect size) was calculated as the mean SarQoL difference between baseline and follow up) / pooled SD of baseline and follow up SarQoL [21]. Cohen’s d was calculated separately for groups reporting a slight improvement, a slight worsening, or any improvement. Guyatt’s responsiveness coefficient was calculated for the same groups using the mean change in SarQoL score for each group / SD of the change in SarQoL score in the group showing ‘no change’ on the Likert scale [22]. For comparison, responsiveness was also calculated for the SARC-F score in the same way.
Concurrent validity was assessed by calculating the correlation between baseline SarQol scores (total and individual domains), measures of physical performance (maximal grip strength, SPPB score and 4 m walk speed), and function in daily life measured by the SARC-F score. Pearson’s correlation coefficients were calculated as all data were normally distributed. Finally, a range of sample size calculations were performed to show the number of participants that would need to be recruited to detect different Minimum Clinically Important Difference (MCID) values for SarQoL under different assumptions.
Results
Data were available for 147 participants at baseline, and 125 participants underwent six-month follow up. Eight follow-up visits were conducted face-to-face; the others were conducted by telephone. The flow of participants through the study is shown in Fig. 1. The mean time between baseline and follow up was 6.5 (SD 1.1) months. Details of the baseline characteristics of the whole group, those undergoing six-month follow-up by telephone or face-to-face, and those who dropped out before six-month follow-up are given in Table 1.
Table 1.
All included participants (n = 147) |
Participants by 6 month follow up status | |||
---|---|---|---|---|
Telephone (n = 117) |
Face-to-face (n = 8) |
Dropped out (n = 22) |
||
Mean age (years) (SD) | 77.6 (7.3) | 77.0 (7.2) | 78.9 (6.0) | 79.8 (7.8) |
Female sex (%) | 72 (49) | 56 (48) | 3 (38) | 13 (59) |
Mean SPPB (SD) | 5.5 (2.3) | 5.8 (2.4) | 6.4 (1.8) | 5.6 (2.5) |
Max handgrip strength (kg) (SD) | ||||
Men |
27.2 (9.4) (n = 70) |
26.8 (9.2) (n = 61) | 32.8 (12.6) (n = 5) |
26.6 (9.2) (n = 9) |
Women |
15.7 (6.1) (n = 69) |
15.5 (6.4) (n = 56) | 16.7 (3.1) (n = 3) |
16.5 (5.2) (n = 13) |
Mean walk speed (m/s) (SD) |
0.61 (0.24) (n = 142) | 0.61 (0.24) (n = 113) | 0.64 (0.21) (n = 8) |
0.62 (0.23) (n = 21) |
Median chair stand time (s) [IQR] |
24.0 [32.0–17.4] (n = 106) | 23.0 [14.5] (n = 84) | 21.8 [26.0] (n = 6) |
28.0 [7.6] (n = 16) |
Mean SARC-F score (SD) | 5.28 (1.82) | 5.28 (1.85) | 4.38 (1.19) | 5.59 (1.82) |
Mean skeletal muscle mass index (kg/m2) (SD) | ||||
Men |
7.84 (1.21) (n = 70) |
7.82 (1.11) (n = 57) | 7.75 (0.92) (n = 5) |
8.03 (2.01) (n = 8) |
Women |
6.65 (1.13) (n = 69) |
6.76 (1.23) (n = 54) | 6.12 (0.30) (n = 3) |
6.30 (0.61) (n = 12) |
SarQoL score (SD) (best = 100) |
50.9 (11.9) | 51.1 (12.6) | 52.6 (7.6) | 49.5 (9.3) |
Physical and mental health | 55.5 (15.6) | 55.4 (15.9) | 57.9 (7.8) | 55.3 (16.3) |
Locomotion | 48.8 (18.1) | 49.1 (18.7) | 43.8 (12.8) | 48.8 (17.2) |
Body composition | 58.4 (17.2) | 58.0 (17.7) | 59.9 (12.4) | 59.8 (16.8) |
Functionality | 52.6 (12.8) | 52.7 (13.5) | 55.1 (7.7) | 51.4 (10.6) |
Activities of daily living | 46.1 (14.9) | 46.2 (15.4) | 52.3 (10.9) | 43.3 (13.1) |
Leisure activities | 36.1 (17.3) | 36.5 (18.2) | 31.2 (5.9) | 36.3 (15.1) |
Fears | 67.5 (19.1) | 68.9 (19.8) | 68.8 (22.2) | 59.7 (11.5)* |
*p < 0.05 compared to telephone follow-up group
Subdomain scores all normalised to 0–100 scale (best = 100)
Internal consistency and subdomain correlations
Table 2 shows the results of internal consistency testing using Cronbach’s alpha. The full SarQoL score showed acceptable levels of consistency at baseline (alpha = 0.944) but showed less consistency at the follow up visit (alpha = 0.732). No differences in consistency were seen when confining the follow up analysis only to those undergoing telephone assessment at the six-month visit. Although consistency within most subdomains was good, consistency within the body composition and leisure activities domains was poor. Table 3 shows correlations between each subdomain and the total baseline score; the functionality and activities of daily living domains had the highest correlation with the total score.
Table 2.
Baseline data (n = 144) (95% CI) |
Follow up data (n = 125) (95% CI) |
Telephone follow up data (n = 117) (95% CI) |
|
---|---|---|---|
SarQoL total | 0.94 (0.93, 0.96) | 0.71 [0.63,0.78) | 0.73 [0.66, 0.80) |
Physical and mental health | 0.76 (0.70, 0.82) | 0.83 [0.79, 0.87) | 0.83 [0.78, 0.87) |
Locomotion | 0.89 (0.86, 0.92) | 0.30 [0.10, 0.47) | 0.28 [0.64, 0.46) |
Body composition | 0.17 (−0.07, 0.37) | 0.09 [− 0.21, 0.32) | 0.09 [− 0.22, 0.33) |
Functionality | 0.87 (0.83, 0.90) | 0.86 [0.83, 0.90) | 0.86 [0.82, 0.89) |
Activities of daily living | 0.88 (0.85, 0.91) | 0.88 [0.85, 0.91) | 0.88 [085, 0.91) |
Leisure activities | 0.39 (0.15, 0.56) | 0.45 [0.22, 0.61) | 0.42 [0.17, 0.60) |
Fears | n/aa | n/aa | n/aa |
aunable to calculate as only one question in this domain
Table 3.
Subdomain | Correlation with total SarQoL score |
Correlation with mean of all subdomains excluding the subdomain under test | ||
---|---|---|---|---|
r | p | r | p | |
Physical and mental health | 0.80 | < 0.001 | 0.72 | < 0.001 |
Locomotion | 0.84 | < 0.001 | 0.66 | < 0.001 |
Body composition | 0.59 | < 0.001 | 0.52 | < 0.001 |
Functionality | 0.88 | < 0.001 | 0.74 | < 0.001 |
Activities of daily living | 0.87 | < 0.001 | 0.66 | < 0.001 |
Leisure activities | 0.33 | < 0.001 | 0.28 | < 0.001 |
Fears | 0.32 | < 0.001 | 0.24 | < 0.001 |
Change in SarQoL over time
Follow up SarQoL scores at 6 months were weakly correlated with scores at baseline (r = 0.27, p = 0.03). In contrast, follow-up SARC-F scores were much more closely correlated with baseline SARC-F scores (r = 0.63, p < 0.001). Figure 2 shows the changes in the SarQoL score between baseline and follow-up for each category of global change in fitness or quality of life; details of the changes for each subdomain and for the SARC-F score are shown in Table 4. More participants reported worsening of global fitness or quality of life than reported improvement; no change in global fitness or quality of life were the most commonly selected categories for global change. Point estimates for SarQoL and for SARC-F in those reporting ‘no change’ were close to zero, suggesting no systematic bias in how scores changed between face-to-face assessment at baseline and telephone assessment at follow-up. Of the subdomains, functionality mapped most closely to the total score, both in terms of a stable functionality score in those reporting no global change, and also in the degree of improvement or deterioration in scores.
Table 4.
A. By global change in overall fitness | ||||||
Worse (n = 30) |
Slightly worse (n = 25) |
About the same (n = 46) |
Slightly better (n = 5) |
Better (n = 11) |
Any improvement (n = 16) |
|
Overall SarQoL |
−8.5 (15.9) |
−0.5 (16.0) |
−0.3 (12.3) |
21.3 (19.0) |
14.4 (17.0) |
16.6 (17.3) |
Physical and mental health |
−6.2 (20.4) |
2.4 (19.0) |
5.8 (17.5) |
32.7 (13.1) | 17.4 (18.4) |
22.1 (18.0) |
Locomotion |
−7.8 (23.7) |
−0.5 (19.2) |
−4.3 (23.1) |
25.0 (31.7) | 16.4 (23.5) |
19.1 (25.5) |
Body composition |
−3.9 (22.5) |
4.1 (22.1) |
12.4 (25.6) |
22.5 (12.0) | 14.0 (20.1) |
16.7 (18.0) |
Functionality |
−7.4 (15.5) |
−0.7 (17.1) |
− 0.9 (11.4) |
15.5 (25.5) | 14.7 (14.9) |
14.9 (17.9) |
Activities of daily living | −11.9 (17.2) |
−0.9 (19.4) |
−4.5 (14.9) |
20.5 (17.7) | 12.0 (22.3) |
14.7 (20.8) |
Leisure activities | −11.6 (29.3) |
0.7 (17.0) |
6.9 (21.8) |
20.0 (13.9) | 12.1 (19.8) |
14.5 (18.1) |
Fears |
−6.7 (22.9) |
3.5 (29.9) |
4.9 (32.2) |
7.5 (30.1) |
6.8 (32.8) |
7.0 (30.9) |
SARC-F | 0.83 (1.49) | 0.28 (1.99) | −0.07 (1.67) | −1.00 (2.12) | −1.64 (1.69) |
−1.44 (1.79) |
B. By global change in overall quality of life | ||||||
Worse (n = 26) |
Slightly worse (n = 15) |
About the same (n = 63) |
Slightly better (n = 3) |
Better (n = 10) |
Any improvement (n = 13) |
|
Overall SarQoL |
−7.6 (17.5) |
−5.1 (11.8) |
2.7 (15.2) |
5.3 (5.0) |
10.5 (18.1) |
9.3 (15.9) |
Physical and mental health | −6.0 (22.1) |
−3.0 (19.2) |
7.6 (17.7) |
18.5 (22.2) | 16.3 (21.1) |
16.8 (20.4) |
Locomotion | −8.4 (26.5) |
−11.0 (15.0) |
−0.3 (24.0) |
5.6 (12.7) |
12.9 (25.6) |
11.2 (23.0) |
Body composition | −3.0 (25.8) |
−1.9 (18.5) |
11.5 (24.2) |
16.7 (7.2) |
15.8 (19.8) |
16.0 (17.4) |
Functionality | −6.4 (17.0) |
−5.0 (15.2) |
1.5 (15.0) |
−1.1 (14.2) | 11.1 (16.3) |
8.3 (16.2) |
Activities of daily living | −13.2 (18.5) |
−3.8 (18.8) |
0.4 (18.0) |
−1.0 (11.5) |
4.9 (20.4) |
3.6 (18.4) |
Leisure activities | −10.9 (30.8) |
0.0 (15.4) |
5.0 (21.2) |
27.7 (9.6) |
10.0 (22.4) |
14.1 (21.3) |
Fears | −9.6 (19.8) |
−1.7 (29.8) |
7.7 (31.1) |
0.0 (25.0) |
1.3 (36.1) |
1.0 (32.9) |
SARC-F | 0.60 (1.65) |
0.33 (2.22) |
−0.08 (1.60) |
2.33 (1.53) | −1.70 (1.83) |
−0.77 (2.45) |
Values are mean and standard deviation
Responsiveness
Table 5 shows measures of responsiveness calculated separately for ‘slight improvement’ and ‘slight worsening’ in global fitness and global quality of life. Results for all categories of improvement combined are also presented, as numbers in each individual improvement category were small. The SarQoL tended to be more responsive to change than the SARC-F, and both tools were more responsive to improvement than to worsening.
Table 5.
Cohens d for slightly worse | Cohens d for slightly better | Cohens d for any improvement | Guyatt for slightly worse | Guyatt for slightly better | Guyatt for any improvement | |
---|---|---|---|---|---|---|
A. By global change in overall fitness | ||||||
SarQoL | 0.03 | 2.29 | 1.11 | 0.04 | 1.73 | 1.35 |
SARC-F | 0.04 | 0.48 | 0.29 | 0.17 | 0.60 | 0.86 |
B. By global change in overall quality of life | ||||||
SarQoL | 0.36 | 0.98 | 0.75 | 0.34 | 0.35 | 0.61 |
SARC-F | 0.06 | 1.38 | 0.19 | 0.21 | 1.46a | 0.48 |
aparadoxical worsening
MCIDs and sample size estimates
Table 6 depicts the anchor-based minimum clinically important differences for the SarQoL and SARC-F, and reports trial sample sizes that would be needed to detect these differences with 80 and 90% power, with and without adjustment for the correlation between baseline and follow-up, given an alpha of 0.05.
Table 6.
MCID | SD | Sample size | |||||
---|---|---|---|---|---|---|---|
80% power, unadjusted | 80% power, adjusted | 90% power, unadjusted | 90% power, adjusted | ||||
SarQoL– change in fitness | Slight improvement | 21 | 19 | 26 | 24 | 36 | 34 |
Any improvement | 17 | 17 | 32 | 30 | 44 | 42 | |
Slight worsening | 1 | 16 | 8038 | 7476 | 10,760 | 10,008 | |
SarQoL – change in QoL | Slight improvement | 5 | 5 | 32 | 30 | 44 | 42 |
Any improvement | 9 | 16 | 100 | 94 | 134 | 126 | |
Slight worsening | 5 | 12 | 182 | 170 | 244 | 228 | |
SARC-F – change in fitness | Slight improvement | 1.0 | 2.1 | 140 | 86 | 186 | 114 |
Any improvement | 1.4 | 1.8 | 52 | 32 | 70 | 44 | |
Slight worsening | 0.3 | 2.0 | 1396 | 852 | 1868 | 1140 | |
SARC-F – change in QoL | Slight improvement | 2.3 | 1.5 | 14 | 10 | 18 | 12 |
Any improvement | 0.8 | 2.5 | 308 | 188 | 412 | 252 | |
Slight worsening | 0.3 | 2.2 | 1690 | 1032 | 2262 | 1380 |
MCID Minimum clinically important difference. Sample size is total sample size for a two arm trial. Calculations assume 1:1 randomisation, alpha 0.05. Adjusted: Sample size multiplied by (1 – r2) where r = 0.27 for correlation between baseline and follow-up SarQoL scores, and r = 0.63 for correlation between baseline and follow-up SARC-F scores. Where calculations deliver odd numbers, sample size is rounded up to nearest even number
Concurrent validity
Table 7 shows correlations between the total SarQoL score, measures of physical performance and the SARC-F score. The SarQoL score showed moderate correlations with each of the other measures as expected.
Table 7.
SARC-F | Max grip | SPPB | Walk speed | |||||
---|---|---|---|---|---|---|---|---|
r | p | r | p | r | p | r | p | |
SarQoL total | −0.45 | < 0.001 | 0.37 | < 0.001 | 0.48 | < 0.001 | 0.48 | < 0.001 |
Physical and mental health | −0.35 | < 0.001 | 0.29 | 0.001 | 0.33 | < 0.001 | 0.26 | 0.004 |
Locomotion | −0.35 | < 0.001 | 0.21 | 0.19 | 0.44 | < 0.001 | 0.51 | < 0.001 |
Body composition | −0.27 | 0.002 | 0.18 | 0.05 | 0.25 | 0.006 | 0.19 | 0.04 |
Functionality | −0.44 | < 0.001 | 0.34 | < 0.001 | 0.45 | < 0.001 | 0.42 | < 0.001 |
Activities of daily living | −0.43 | < 0.001 | 0.42 | < 0.001 | 0.46 | < 0.001 | 0.48 | < 0.001 |
Leisure activities | −0.09 | 0.34 | 0.22 | 0.02 | 0.29 | 0.001 | 0.26 | 0.004 |
Fears | −0.26 | < 0.001 | 0.20 | 0.02 | 0.18 | 0.04 | 0.11 | 0.22 |
Discussion
Our analysis is the first to validate the SarQoL score in a specific group of older patients in the UK with probable sarcopenia – the group who form the target population for the use of this tool in clinical trials and other studies of sarcopenia. We found that the SarQoL tool had good internal consistency, with better consistency at the baseline visit (where SarQoL was administered face-to-face) than at the follow up assessment (where it was administered by telephone in almost all participants). Responsiveness to change was variable, with small numbers limiting the robustness of the analyses, but the SarQol questionnaire appeared to be more sensitive to improvement than to deterioration, with sample sizes of 25–100 required to detect clinically significant improvements from interventions. Only a weak correlation was seen between baseline and follow up SarQoL scores, suggesting that adjustments for baseline values of the SarQoL in analyses of clinical trial data are unlikely to improve trial power by a large amount. The moderate correlations between SarQoL and measures of physical performance give good evidence of concurrent validity; higher correlations would not be expected given that the constructs of physical performance and daily activities are related to, but distinct from, the construct of sarcopenia-related quality of life.
Our findings complement and extend previous development work on the SarQoL tool. Most previous validation studies for the SarQoL questionnaire have been conducted by the research group that originally designed the questionnaire, and the current analysis is one of the few independent validation studies that have been performed to date. A previous analysis in a UK-based population of healthy older people [7] (not selected for activity limitation or sarcopenia) showed good internal consistency (Cronbach’s alpha 0.88); SarQoL scores were lower in the small number of participants with sarcopenia than those without, and SarQoL scores showed close correlation with related domains (e.g. physical function) of other health status tools including the Short Form 36 (SF-36) questionnaire. Reliability was high (intraclass correlation coefficient 0.95), in part due to the short gap of only 2 weeks between testing and retesting. No attempt was made to test responsiveness to change. Similar results were obtained in older Belgian outpatients using similar methods [8].
One previous study used longitudinal follow-up data to test responsiveness in a small group of patients (n = 43) with sarcopenia [9], but this analysis relied on correlating change in SarQol over time with change in related health status tools (SF-36 and EuroQoL 5-dimension tool), rather than by using currently recommended anchor-based methods. Although these data showed good correlation between change in SarQoL and other measures over the two-year follow up period, these data support construct validity of the SarQoL rather than responsiveness per se, and do not enable derivation of a minimum clinically important difference. Pooled data from nine studies [10] was used to derive the smallest detectable change (estimated at 7 points); the test-retest interval in this analysis was short (2 weeks in all studies), and a standard-error based method was used which is not optimal for deriving the minimum clinically important difference [11].
Our study has a number of strengths. Firstly, we studied a group of older people with levels of physical function representative of those seen in primary and secondary care services, almost all of whom had a diagnosis of probable sarcopenia [12]. This group is the group for which a sarcopenia quality of life tool would be deployed in both research and clinical practice. We used a 6 month follow up interval for both baseline to follow-up correlation and for responsiveness testing. This interval reflects the interval between visits that would be used in clinical trials or between assessments in clinical practice. Such an interval also increases the chances that improvement or deterioration will have occurred. We used an anchor-based approach to determine responsiveness to change and to estimate minimum clinically important differences, reflecting current recommendations in this field.
A number of limitations of our analysis also require highlighting. The number of participants who improved between baseline and follow up was small. This likely reflects the natural history of sarcopenia but was also likely to be due to the effects of movement restrictions imposed to combat the COVID-19 pandemic [23]. We were unable to administer the SarQoL face-to-face for most participants at follow up because of the pandemic, and the lack of face-to-face visits also precluded collection of some of the other planned outcome measures, particularly physical performance measures. It is possible that the different mode of questionnaire delivery at follow-up may have introduced more variability in the scores, reducing the observed consistency or responsiveness. Conversely, consistency and responsiveness were still acceptable despite heterogeneity in the mode of administration, suggesting that in research or clinical practice participants could be given a choice of completing the questionnaire face-to-face or by telephone. We relied on natural change in sarcopenia over the six-month follow-up rather than the response to an intervention, and future studies should assess the responsiveness to change for the SarQoL after resistance training (the intervention with the best evidence for improving sarcopenia) [24, 25]. We elected not to measure reliability in this analysis as we did not re-administer the SarQoL within a sufficiently short period after the baseline visit to be confident that participants had remained clinically stable [26].
The responsiveness to change for the SarQoL may enable smaller sample sizes to be used in trials than some generic quality of life tools (the EQ-5D for instance typically requires sample sizes of 200–300 to enable detection of the minimum clinically important difference of 0.074 points [27]). However, disease-specific tools such as SarQoL complement, but cannot replace, generic health status or quality of life for older people. Such generic measures are still essential to assess broader health status in older people who will typically suffer from multiple long-term conditions [28].
Although previous studies have noted that the SarQoL took participants only 10 min to complete, our anecdotal experience in this study suggests that participants who are more functionally impaired may take longer. As only one of a battery of tests that might be administered during a trial or other study visit, it is worth considering if it is possible to reduce the burden on study participants by reducing the size of the questionnaire. Of note, the subdomain of ‘function’ within SarQoL delivers a similar distribution of normalised scores to the full SarQoL questionnaire and correlates highly with the total score; use of this subset of questions could potentially enable less burdensome data capture, albeit with the loss of some aspects of quality of life captured elsewhere in the full SarQoL. These findings parallel those from studies of other health status measures such as the SF-36, where the physical function subdomain shows a close correlation with grip strength [29]. Our results suggest that the SarQoL can be administered face-to-face and by telephone as well as by self-completion as studied previously; this flexibility of delivery is important for effective trial delivery both during a pandemic when face-to-face visits may be impossible, but also in non-pandemic times, when remote trial delivery can improve participation and retention [30].
Conclusions
Taken together with previous validation studies of the SarQoL questionnaire, our results suggest that SarQoL has acceptable properties for use in clinical trials of sarcopenia interventions as part of a suite of outcomes. To date, few trials have included SarQoL as a trial outcome, and it is only by using SarQoL in clinical trials that we will be able to fully assess its performance as an outcome measure in this context.
Acknowledgements
With thanks to all the participants, participating primary care surgeries and the NIHR Primary Care Research Network. We would like to thank Maggie Rice, Haja Tholley, Lindsay Wilkes, Abigail Jones, Farhat Mahmood and Ikhlaq Jacob for their efforts in recruiting participants.
Abbreviations
- BIA
Bioimpedance analysis
- EWGSOP
European Working Group on Sarcopenia in Older People
- MCID
Minimum Clinically Important Difference
- SARC-F
Strength, Assistance, Rise, Climb – Falls questionnaire
- SarcNet
Sarcopenia Network and registry
- SarQoL
Sarcopenia quality of life questionnaire
- SD
Standard deviation
- SF-36
Short-Form 36 questionnaire
- SPPB
Short Physical Performance Battery
Authors’ contributions
MDW, AAS and RMD designed the SarcNet registry. APC, SVH, CMcD, DS, BS, ALT and AT contributed to recruitment, data collection and management of the SarcNet registry. PH and MDW conducted the analysis described in this paper and drafted the manuscript. RMD, APC, SVH, CMcD, DS, BS, ALT, AT and AAS critically revised the manuscript. All authors read and approved the final manuscript.
Funding
The SarcNet study was funded by the NIHR Newcastle Biomedical Research Centre. A.C. is part-funded by the National Institute for Health Research Applied Research Collaboration Yorkshire & Humber and Health Data Research UK, an initiative funded by UK Research and Innovation Councils, NIHR and the UK devolved administrations and leading medical research charities.
Availability of data and materials
The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request, subject to completion of a Data Access agreement with Newcastle University.
Declarations
Ethics approval and consent to participate
The SarcNet study was approved by the North East—Tyne and Wear South NHS Research Ethics Committee (reference 18/NE/0314). Written informed consent was obtained from all participants and the study was conducted according to the principles of the Declaration of Helsinki.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Footnotes
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Cruz-Jentoft AJ, Sayer AA. Sarcopenia. Lancet. 2019;393:2636–2646. doi: 10.1016/S0140-6736(19)31138-9. [DOI] [PubMed] [Google Scholar]
- 2.Lunt E, Ong T, Gordon AL, Greenhaff PL, Gladman JRF. The clinical usefulness of muscle mass and strength measures in older people: a systematic review. Age Ageing. 2021;50:88–95. doi: 10.1093/ageing/afaa123. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Liu P, Hao Q, Hai S, Wang H, Cao L, Dong B. Sarcopenia as a predictor of all-cause mortality among community-dwelling older people: a systematic review and meta-analysis. Maturitas. 2017;103:16–22. doi: 10.1016/j.maturitas.2017.04.007. [DOI] [PubMed] [Google Scholar]
- 4.Yeung SSY, Reijnierse EM, Pham VK, Trappenburg MC, Lim WK, Meskers CGM, et al. Sarcopenia and its association with falls and fractures in older adults: a systematic review and meta-analysis. J Cachexia Sarcopenia Muscle. 2019;10:485–500. doi: 10.1002/jcsm.12411. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Kilsby AJ, Sayer AA, Witham MD. Selecting potential pharmacological interventions in sarcopenia. Drugs Aging. 2017;34:233–240. doi: 10.1007/s40266-017-0444-z. [DOI] [PubMed] [Google Scholar]
- 6.Beaudart C, Biver E, Reginster JY, Rizzoli R, Rolland Y, Bautmans I, et al. Development of a self-administrated quality of life questionnaire for sarcopenia in elderly subjects: the SarQoL. Age Ageing. 2015;44:960–966. doi: 10.1093/ageing/afv133. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Beaudart C, Edwards M, Moss C, Reginster JY, Moon R, Parsons C, et al. English translation and validation of the SarQoL®, a quality of life questionnaire specific for sarcopenia. Age Ageing. 2017;46:271–276. doi: 10.1093/ageing/afw192. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Beaudart C, Biver E, Reginster JY, Rizzoli R, Rolland Y, Bautmans I, et al. Validation of the SarQoL, a specific health-related quality of life questionnaire for sarcopenia. J Cachexia Sarcopenia Muscle. 2017;8:238–244. doi: 10.1002/jcsm.12149. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Geerinck A, Bruyere O, Locquet M, Reginster J-Y, Beaudart C. Evaluation of the responsiveness of the SarQoL questionnaire, a patient-reported outcome measure specific to sarcopenia. Adv Ther. 2018;35:1842–1858. doi: 10.1007/s12325-018-0820-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Geerinck A, Alekna V, Beaudart C, Bautmans I, Cooper C, De Souza OF, et al. Standard error of measurement and smallest detectable change of the sarcopenia quality of life (SarQoL) questionnaire: an analysis of subjects from 9 validation studies. PLoS One. 2019;14:e0216065. doi: 10.1371/journal.pone.0216065. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Cook JA, Julious SA, Sones W, Hampson LV, Hewitt C, Berlin JA, et al. DELTA2 guidance on choosing the target difference and undertaking and reporting the sample size calculation for a randomised controlled trial. BMJ. 2018;363:k3750. doi: 10.1136/bmj.k3750. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Witham MD, Heslop P, Dodds RM, Clegg AP, Hope SV, McDonald C, et al. Developing a UK sarcopenia registry: recruitment and baseline characteristics of the SarcNet pilot. Age Ageing. 2021;50:1762–1769. doi: 10.1093/ageing/afab084. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Malmstrom TK, Morley JE. Sarcopenia: the target population. J Frailty Aging. 2013;2:55–56. doi: 10.14283/jfa.2013.8. [DOI] [PubMed] [Google Scholar]
- 14.Witham MD, Achison M, Aspray TJ, Avenell A, Band MM, Donnan PT, et al. Recruitment strategies for sarcopenia trials – lessons from the LACE randomised controlled trial. JCSM Rapid Commun. 2021;4:93–102. doi: 10.1002/rco2.38. [DOI] [Google Scholar]
- 15.Dodds RM, Murray JC, Robinson SM, Sayer AA. The identification of probable sarcopenia in early old age based on the SARC-F tool and clinical suspicion: findings from the 1946 British birth cohort. Eur Geriatr Med. 2020;11:433–441. doi: 10.1007/s41999-020-00310-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Cruz-Jentoft AJ, Bahat G, Bauer J, Boirie Y, Bruyère O, Cederholm T, et al. Sarcopenia: revised European consensus on definition and diagnosis. Age Ageing. 2019;48:16–31. doi: 10.1093/ageing/afy169. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Roberts HC, Denison HJ, Martin HJ, Patel HP, Syddall H, Cooper C, et al. A review of the measurement of grip strength in clinical and epidemiological studies: towards a standardised approach. Age Ageing. 2011;40:423–429. doi: 10.1093/ageing/afr051. [DOI] [PubMed] [Google Scholar]
- 18.Sergi G, De Rui M, Veronese N, Bolzetta F, Berton L, Carraro S, et al. Assessing appendicular skeletal muscle mass with bioelectrical impedance analysis in free-living Caucasian older adults. Clin Nutr. 2015;34:667–673. doi: 10.1016/j.clnu.2014.07.010. [DOI] [PubMed] [Google Scholar]
- 19.Guralnik JM, Simonsick EM, Ferrucci L, Glynn RJ, Berkman LF, Blazer DG, et al. A short physical performance battery assessing lower extremity function: association with self-reported disability and prediction of mortality and nursing home admission. J Gerontol. 1994;49:M85–M94. doi: 10.1093/geronj/49.2.M85. [DOI] [PubMed] [Google Scholar]
- 20.Walters SJ, Jacques RM, Dos Anjos Henriques-Cadby IB, Candlish J, Totton N, Xian MTS. Sample size estimation for randomised controlled trials with repeated assessment of patient-reported outcomes: what correlation between baseline and follow-up outcomes should we assume? Trials. 2019;20:566. doi: 10.1186/s13063-019-3671-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Cohen J. Statistical power analysis for the behavioral sciences. Hillsdale: Hove Lawrence Erlbaum; 1988. [Google Scholar]
- 22.Guyatt G, Walter S, Norman G. Measuring change over time: assessing the usefulness of evaluative instruments. J Chronic Dis. 1987;40:171–178. doi: 10.1016/0021-9681(87)90069-5. [DOI] [PubMed] [Google Scholar]
- 23.Richardson SJ, Carroll CB, Close J, Gordon AL, O'Brien J, Quinn TJ, et al. Research with older people in a world with COVID-19: identification of current and future priorities, challenges and opportunities. Age Ageing. 2020;49:901–906. doi: 10.1093/ageing/afaa149. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Dent E, Morley JE, Cruz-Jentoft AJ, Arai H, Kritchevsky SB, Guralnik J, et al. International clinical practice guidelines for sarcopenia (ICFSR): screening, diagnosis and management. J Nutr Health Aging. 2018;22:1148–1161. doi: 10.1007/s12603-018-1139-9. [DOI] [PubMed] [Google Scholar]
- 25.Tsekoura M, Billis E, Tsepis E, Dimitriadis Z, Matzaroglou C, Tyllianakis M, et al. The effects of group and home-based exercise programs in elderly with sarcopenia: a randomized controlled trial. J Clin Med. 2018;7:480. doi: 10.3390/jcm7120480. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Terwee CB, Bot SDM, de Boer MR, van der Windt DA, Knol DL, Dekker J, et al. Quality criteria were proposed for measurement properties of health status questionnaires. J Clin Epidemiol. 2007;60:34–42. doi: 10.1016/j.jclinepi.2006.03.012. [DOI] [PubMed] [Google Scholar]
- 27.Band MM, Sumukadas D, Struthers AD, Avenell A, Donnan PT, Kemp PR, et al. Leucine and ACE inhibitors as therapies for sarcopenia (LACE trial): study protocol for a randomised controlled trial. Trials. 2018;19:6. doi: 10.1186/s13063-017-2390-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Song X, Wu J, Yu C, Dong W, Lv J, Guo Y, et al. Association between multiple comorbidities and self-rated health status in middle-aged and elderly Chinese: the China Kadoorie biobank study. BMC Public Health. 2018;18:744. doi: 10.1186/s12889-018-5632-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Sayer AA, Syddall HE, Martin HJ, Dennison EM, Roberts HC, Cooper C. Is grip strength associated with health-related quality of life? Findings from the Hertfordshire cohort study. Age Ageing. 2006;35:409–415. doi: 10.1093/ageing/afl024. [DOI] [PubMed] [Google Scholar]
- 30.NIHR Remote Trial Delivery Working Group: Preliminary Guidance. https://sites.google.com/nihr.ac.uk/remotetrialdelivery/home. Accessed 7 Oct 2021.
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request, subject to completion of a Data Access agreement with Newcastle University.