Abstract
Introduction
The purpose of this study is to determine the validity, reliability, and responsiveness of the Malay Shoulder Pain and Disability Index (M-SPADI) in Malay speakers suffering from shoulder pain.
Materials and methods
The M-SPADI, the Numerical Rating Scale (NRS), and measurements of shoulder active range of motion (AROM) were completed by 140 patients with shoulder pain (68 with rotator cuff pathology and 72 with other shoulder pathology). Thirty-four patients were retested for test-retest reliability with M-SPADI after an average of 9.2 days. M-SPADI was performed on twenty-one individuals three months after completing treatment for rotator cuff disorders to assess response.
Results
The results of exploratory factor analysis revealed a bidimensional structure for M-SPADI. M-SPADI disability score was significantly greater in patients with rotator cuff pathologies (median = 31.87, IQR 82.50) than in patients with other shoulder pathologies (median = 20.00, IQR 23.84). In multi-group factor analysis, measurement invariance revealed no significant difference between the two groups (p>0.05). There was a significant positive correlation between M-SPADI and NRS (Pain = 0.86, Disability = 0.75, Total = 0.82, p=0.005), and a significant negative correlation between M-SPADI and shoulder AROM (Pain = -0.34 to -0.67, Disability =-0.44 to -0.73, Total =0.43 to -0.72, p=0.005). M-SPADI had a high degree of internal consistency (Cronbach's 0.92 for pain and 0.95 for disability). Test-retest reliability was moderate to excellent (ICC Pain = 0.84, ICC Disability = 0.78, ICC Total = 0.81, p=0.001), and the smallest detectable change ranges (Pain = 8.74, Disability = 3.21, Total = 3.83) were less than the minimal detectable change ranges (Pain = 21.57, Disability = 6.82, Total = 8.79). The area under the receiver operating characteristic curve (AUC) for M-SPADI was greater than 0.90 (Pain = 0.99, Disability = 0.94, Total = 0.96).
Conclusion
The M-SPADI has established construct validity, internal consistency, test-retest reliability, and responsiveness. The M-SPADI is a reliable and valid instrument for evaluating shoulder pain among Malay-speaking individuals. In addition, the M-SPADI disability subscale may be useful for monitoring functional score changes in patients with rotator cuff pathology.
Keywords: measurement properties, SPADI, reliability, responsiveness, validity
Introduction
Patient-reported outcomes measures (PROM) are structured, validated questionnaires that patients complete to evaluate their perceived level of impairment, disability, and health-related quality of life1. It is frequently used in research as a primary or secondary outcome measure2. Numerous PROMs exist to quantify pain and physical function or activity. Three types of PROM commonly used for the shoulder are upper extremity region-specific (e.g. Disability of Arm, Shoulder, and Hand [DASH]), shoulder-specific (e.g. American Shoulder and Elbow Surgeon Score [ASES] and Shoulder Pain and Disability Index [SPADI])and disease-specific (e.g. Western Ontario Rotator Cuff Index [WORC] and Oxford Shoulder Instability Score [OSIS])3. The evaluation of pain and physical function were identified as key outcome domains in most studies4. The Shoulder Pain and Disability Index (SPADI) is one of the most frequently used PROMs among patients with shoulder pain5.
The SPADI was developed in 1991 and since then has been validated in groups of patients with rotator cuff and subacromial pathology, calcific tendinitis, adhesive capsulitis, and non-specific shoulder pain6. It was also responsive to change in various clinical settings, such as treatment for adhesive capsulitis, subacromial impingement, and shoulder arthroplasty7,8. The SPADI is self-administered and takes between 2 and 5 minutes to complete9. It has been translated and validated into many languages, including Danish10, Dutch11, Spanish12, Telugu13, Chinese14, and Slovene15.
The Malay SPADI (M-SPADI) had previously been validated in the general population, that include patients with non-specified shoulder pain and patients without shoulder pain16. The previous study only considered a limited set of psychometric properties including content validity, convergent validity, known-group validity (shoulder pain and no shoulder pain groups), hypothesis testing, and test-retest reliability. The Consensus-based Standards for the Selection of Health Measurement Instruments (COSMIN) guidelines proposed ten criteria that can be used to determine whether a study meets the standards for good methodologic quality: content validity, internal structure (including structural validity and internal consistency), reliability, measurement error, hypotheses testing, cross-cultural validity, criterion validity, and responsiveness17.
Hence, this study evaluated the translated Malay SPADI (M-SPADI) in the context of the COSMIN guidelines to provide additional methodologic quality assessment, particularly for patients with shoulder pain.
Materials and Methods
The research was conducted using a prospective cohort validation design that was approved by the Universiti Malaya Medical Centre (UMMC), Medical Research Ethics Committee (MECID No. 201977-7623). The study was conducted in accordance with the Helsinki Declaration, and prior to participation, all patients provided written informed consent. Patients were recruited from an outpatient clinic in a university hospital between June 2020 and May 2021. Patients had to be over the age of 18, able to speak in Malay and presented with shoulder pain to be considered for the study. Exclusion criteria included injuries to the neck, elbow, wrist, or hand, unwillingness to provide informed consent, or a diagnosis of a mental disorder.
A physician from the research team completed the screening, which included a review of the medical record for radiological results and clinical assessments. Following screening, patients who agreed to participate must provide written informed consent. Patients sociodemographic information including age, gender, occupation, level of education, and duration of shoulder discomfort we gathered using a standardised clinical research form. Participants were then required to complete the M-SPADI in Appendix 1 and verbal Numerical Rating Scale (NRS). The patients' active range of motion (AROM) of the shoulder was also evaluated in all functional planes for the shoulder. This includes flexion, extension, abduction, adduction and internal and external rotation.
From the group of patients mentioned above, patients who had no invasive procedures or new treatments in the weeks following their enrolment in the study were asked to complete the M-SPADI again seven days later to assess test-retest reliability. Patients who received treatment for shoulder pain, on the other hand, were approached three months later to repeat the M-SPADI to assess responsiveness.
In this study two important tools used were the M-SPADI questionnaire and the Numerical Rating Scale (NRS). The M-SPADI is a self-administered questionnaire developed to assess shoulder pain and impairment in adults, and it was translated from the original SPADI by Roach et al6. It consists of thirteen items that evaluate two distinct domains. The first five questions assess patients' pain, while the following eight are on functional disability. The M-SPADI scoring includes the Total M-SPADI, the combined score for both domains, since few questions assess both domain16. The M-SPADI questionnaire uses an 11-point numerical scale on which the individual rates their level of pain or disability on a scale of 0 to 10 using the anchors “no pain to worst pain imaginable” or “no difficulty to so difficult it requires assistance”18. The sum of the subscale item scores was divided by the maximum possible subscale score and multiplied by 100 to obtain a score out of 100. A score was considered invalid if more than two items were missing6. The scores range from 0 to 100, with no cut-off point to indicate severity, as they were intended to measure current conditions and change over time5.
The NRS is a frequently used tool to assess pain intensity. Typically, 0 denotes "no pain," while 10 denotes "the worst pain imaginable"19. It allows for a more detailed comparison of pain levels than the VAS, which allows for an infinite number of possible responses. Numerous studies have established strong correlations between the NRS and other tools for assessing pain. Moreover, the NRS can be used easily with a good compliance rate20,21.
The validity study sample size was estimated based on the recommendations for the participant-to-item ratio of a minimum of 5:1 (five participants to one item)22. In addition, Mokkink et al suggested that the sample size should be more than 100 to be qualified for analysis23. Thus, in this study, 140 patients with shoulder pain were recruited.
For the M-SPADI test-retest reliability, the estimated sample size was calculated using the computer software downloaded from http://wnarifin.github.io24. Intercorrelation coefficient (ICC) with an expected ICC of 0.95, precision of 0.5, confidence level of 95%, number of repetitions per subject of 2, and a drop-out rate of 10%, the minimum sample size needed was 20.
All statistical analyses of the M-SPADI measurement properties were conducted using IBM SPSS Statistics for Windows, version 27 (IBM Corp., Armonk, NY, USA) and smart-PLS version 3.2.8. The significance level for all statistical tests was set at 0.05, and the normality of all data was determined using Q-Q plots and the Kolmogorov-Smirnov test. Non-parametric tests were performed when data was not normally distributed.
Construct validity was evaluated as convergent, discriminant, and known-group validity. In addition, we include measurement of invariance analysis. Convergent validity assesses the degree of agreement between multiple indicators of the same construct. To establish convergent validity, the indicator's factor loading must be greater than 0.7, composite reliability (CR) or Cronbach’s more than 0.6, and the average variance extracted (AVE) greater than 0.525. The degree to which two constructs are empirically distinct is termed discriminant validity. It also quantifies the degree to which the overlapping structures differ. To determine discriminant validity, the Fornell and Larcker criterion was used. According to the criteria, the square root of the AVE of each construct should be greater than the correlations with other latent constructs26. Additionally, the group validity was assessed by comparing the rank means of M-SPADI scores among patients with rotator cuff pathology with other causes of shoulder pain. The U test was used to determine the validity. We hypothesised that patients with rotator cuff pathology would have higher M-SPADI disability scores than patients with other shoulder pain and that the difference would be statistically significant (p<0.05).
Measurement invariance refers to the degree to which respondents from different groups with the same latent trait level respond similarly to a given subscale23. Measurement invariance was calculated using a multiple group factor analysis. Positive results were expected when there were no significant differences between the group factors27. We hypothesised that there would be no significant differences in the factors between patients with rotator cuff pathology and those with other shoulder pathologies.
For criterion validity, Spearman's correlation was used to determine the correlation between M-SPADI and the NRS, as well as the correlation between M-SPADI and shoulder AROM. The relationships were classified as weak (Spearman's rho 0.30–0.50), or strong (Spearman's rho >0.70)28. We hypothesised a high positive correlation between the M-SPADI and the NRS. Furthermore, M-SPADI would have a significant negative correlation with shoulder AROM.
The reliability tests consist of internal consistency, test-retest reliability, and measurement error. The M-SPADI internal consistency was assessed using Cronbach's alpha (α) and average inter-item correlation for each M-SPADI subscale. Cronbach's α≥0.70 for each subscale was considered good22. The average inter-item correlation is a criterion for determining whether individual questions on a test or questionnaire produce consistent, appropriate results; items measuring the same general construct or concept are compared for similar scores. The ideal average inter-item correlation range is 0.15 to 0.5029. Correlation <0.15 suggested items are not well correlated and do not accurately measure the same construct or idea. Correlation greater than 0.50 indicates items are so close to each other that they are redundant30.
For test-retest reliability, based on the average measurement model, the Cronbach’s α must be more than 0.70 for all subscales to be considered reliable. The interclass correlation coefficient (ICC), calculated with a 95% confidence interval, must be more than 0.70 to be considered satisfactory31. The interval between repeated assessments was determined to be between seven and fourteen days to prevent recall and ensure there was no clinical change32.
Absolute reliability was determined by assessing the measurement error. The Standard Error of Measurement (SEM) and the Smallest Detectable Change (SDC) were analysed. The standard error of the mean (SEM) denotes the standard deviation (SD) of repeated measurements in a single patient, and N = sample size (SEM = SD√N)33. SDC is the smallest change sufficient to establish that the observed change is real and not attributable to measurement error (SDC = SEM x √N x1.96)34. A measure of variability associated with the SEM is the minimal detectable change (MDC), which is the smallest detectable change that can be considered above the measurement error with a given level of 95% confidence. The SDC should be less than the MDC for a good measurement error test17.
For responsiveness, the receiver operating curve (ROC) method was used to examine the ability of the M-SPADI to detect treatment response over time. The patients were classified as “improved” if the NRS score was <3 and “unchanged” if the NRS >435. The assessment was conducted three months after individuals with rotator cuff tendinopathy had treatment with regenerative therapy and exercise. In ROC curve analysis, sensitivity and specificity are plotted at several cut-off points, and the area under the curve (AUC) can be estimated. AUCs greater than 0.70 are thought to be adequate for demonstrating that outcome measures can distinguish between improved and unimproved patients36.
Results
One hundred forty patients were enrolled in the validity study. Sixty-eight participants were diagnosed with rotator cuff pathologies and 72 with other shoulder conditions. The Q-Q plots and Kolmogorov-Smirnov tests revealed the M-SPADI scores, NRS scores, and shoulder AROM were not normally distributed. Analysis based on a non-parametric test found no significant differences in the sociodemographic characteristics between the two groups, as shown in Table I.
Table I:
Patients’ sociodemographic characteristic.
Parameters | Patients with rotator cuff pathology (n=68) | Patients with other shoulder pathology (n=72) | p-value |
---|---|---|---|
Age (years), Median (IQR) | 51.58 (56.75) | 54.12 (69.19) | 0.344* |
Gender, N (%) | |||
Male | 34 (50%) | 38 (53%) | 0.866** |
Female | 34 (50%) | 34 (47%) | |
Occupation, N (%) | |||
Employed | 43 (63%) | 45 (62%) | 0.964** |
Retired | 6 (9%) | 7 (10%) | |
Homemaker | 15 (22%) | 17 (24%) | |
Other | 4 (6%) | 3 (4%) | |
Education, N (%) | |||
Primary education | 2 (3%) | 1 (1%) | 0.786** |
Secondary education | 19 (28%) | 22 (31%) | |
Tertiary education | 68 (69%) | 72 (68%) | |
Duration of pain in weeks, Median (IQR) | 28 (1036.00) | 24 (519.70) | 0.351* |
Abbreviation - IQR: Inter-quartile range, N: number
Notes: * Mann-Whitney test, ** chi-square. p- value is considered statistically significance if <0.05
Convergent validity was demonstrated by an AVE greater than 0.5 for both items, composite reliability (CR) greater than 0.7, factor loading greater than 0.6, and Cronbach's alpha greater than 0.7. Table II illustrates a good fit to convergence for all of the constructs. Thus, all of the items are measuring what they were designed to measure.
Table II:
Construct validity determine by convergent and discriminant validity.
Construct | Convergent | Discriminant (Fornell-Lacker) | |||
---|---|---|---|---|---|
Cronbach’s α | CR | AVE | M-SPADI Pain | M-SPADI Disability | |
M-SPADI Pain | 0.923 | 0.942 | 0.766 | 0.875 | - |
M-SPADI Disability | 0.984 | 0.957 | 0.736 | 0.834* | 0.858 |
Abbreviation - CR: composite reliability, AVE: extracted average variance (AVE), M-SPADI: Malay Shoulder Pain and Disability Index
Notes: * The number represented the correlation of a shared construct less than the correlation of each construct
Discriminant validity was assessed based on the Fornell-Larcker criterion. The criteria showed the correlation of M-SPADI Pain with M-SPADI Disability were less than the square root of the individual AVE, thus successfully establishing the discriminant validity of the two factors (pain and disability) (Table II).
The M-SPADI Disability was significantly higher in patients with rotator cuff pathology (median = 31.87, IQR 82.50) compared to other shoulder pathologies (median = 20.00, IQR 23.84) (U = 1933.5, z = -2.15, p = 0.032). Similarly, the M-SPADI total was significantly higher in patients with rotator cuff pathology (median = 36.92, IQR = 79.23) than in other shoulder conditions (median = 24.62, IQR = 85.38), U = 1951.5, z =-2.07, p = 0.038 (Table III).
Table III:
Rotator cuff Pathology (n=68) | Other shoulder pain (n=72) | ||||
---|---|---|---|---|---|
Score | Mean Rank | U | z | p | |
M-SPADI Pain | 77.10 | 64.26 | 1999.00 | -1.873 | 0.061 |
M-SPADI Disability | 78.07 | 63.35 | 1933.50 | -2.146 | 0.032* |
M-SPADI Total | 77.80 | 63.60 | 1951.50 | -2.070 | 0.038* |
Multi-group analysis | Path coefficient | p | |||
M-SPADI Pain and M-SPADI Disability | 0.019 | 0.703 | |||
M-SPADI Pain and Total M-SPADI | 0.018 | 0.508 | |||
M-SPADI Disability and M-SPADI Total | 0.022 | 0.425 |
Note: * p- value <0.05.
A multi-group analysis using confirmatory factor analysis revealed that patients with rotator cuff pathologies and other shoulder pathologies had no statistically significant differences between group factors for the M-SPADI Pain, M-SPADI Disability, and M-SPADI Total, p>0.05 (Table III).
Criterion validity indicates a substantial positive correlation between M-SPADI Pain, M-SPADI Disability, and M-SPADI Total and NRS. Moreover, a strong inverse correlation was observed between M-SPADI Pain, Disability, Total and shoulder AROM. All AROM associations were from moderate to high, except for shoulder extension (-0.36 to -0.44), which ranged from weak to moderate (Table IV).
Table IV:
Correlation between M-SPADI with NRS and M-SPADI with active range of motion (AROM) for Criterion validity.
Scale | M-SPADI Pain | M-SPADI Disability | M-SPADI Total |
---|---|---|---|
NRS | .858* | .747* | .815* |
AROM | |||
Forward flexion | -.668* | -.730* | -.722* |
Abduction | -.636* | -.731* | -.713* |
Extension | -.358* | -.444* | -.434* |
Internal rotation 90° | -.540* | -.676* | -.645* |
External rotation 90° | -.511* | -.608* | -.593* |
Abbreviation - NRS: Numeric Pain Rating Scale, AROM: Active range of motion
Notes: * Significance correlations, p<0.05
For internal consistency, calculations of the average inter-item correlation between the five items on the M-SPADI Pain questionnaire reveal that all items correlate. The average range of inter-item correlation was 0.26 indicating that inter-item correlation were fairly consistent. Similarly, the average inter-item correlation between the eight M-SPADI Disability items indicates that all items correlate with one another with the average inter-item correlation range of 0.265, indicating good consistency (Table V). The two-part questionnaire, which was completed by 140 patients, revealed that the M-SPADI pain subscale (Cronbach α = 0.92), and the disability subscale, (Cronbach α = 0.948), had excellent internal consistency.
Table V:
Internal consistency of the M-SPADI Pain and M-SPADI Disability measured with Cronbach’s α and Average Inter-Item Correlation.
Scales | Cronbach's α | Average Inter-item Correlation | |
---|---|---|---|
Range | Average | ||
M-SPADI Pain | 0.923 | 0.259 | 0.805 |
M-SPADI Disability | 0.948 | 0.265 | 0.764 |
Abbreviation - M-SPADI: Malay Shoulder Pain and Disability Index
Notes: Cronbach's α>0.70 and range of average inter-item correlation between 0.15 to 0.50 was considered good22.
Thirty-four patients completed the test-retest with a mean time of 9.20± 3.80 days between the first and second tests. The Cronbach’s α was more than 0.70 in all subscales. The ICC results of the M-SPADI pain subscale, disability subscale, and total scores (ICC Pain =0.84, ICC Disability =0.78, ICC Total =0.81, p<0.001) revealed good to excellent test-retest reliability (Table VI). The measurement error associated with the M-SPADI was shown in Table VI. In M-SPADI Pain (SDC=8.74) and M-SPADI Disability (SDC=3.21), the SDC was lower than the MDC. The Bland-Altmann analysis was omitted because the values for the difference between test and retest scores were not normally distributed37.
Table VI:
Test-retest reliability of the M-SPADI assessed by Cronbach’s α, Interclass correlation coefficient and measurement error.
Scales | Cronbach's α | Interclass Correlation Coefficient | Measurement Error | |||
---|---|---|---|---|---|---|
ICC | 95% CI | SEM | MDC | SDC | ||
M-SPADI Pain | 0.83 | 0.84 | 0.67, 0.92 | 7.78 | 21.57 | 8.74 |
M-SPADI Disability | 0.78 | 0.78 | 0.56, 0.89 | 2.46 | 6.82 | 3.21 |
M-SPADI Total | 0.81 | 0.81 | 0.62, 0.91 | 3.17 | 8.79 | 3.83 |
Abbreviation – ICC: Interclass correlation coefficient, SEM: Standard Error of Measurement, M-SPADI: Malay Shoulder Pain and Disability Index, MDC: Minimal detectable change, SDC: Smallest detectable change
Twenty-one patients completed the responsiveness test at three months’ post-treatment with regenerative therapy and shoulder rotator cuff strengthening exercise. The AUC for MSPADI pain, disability, and total SPADI was more than 0.90, indicating that the M-SPADI is highly responsive (Table VII and Fig. 1).
Table VII:
Responsiveness of patients with rotator cuff pathology after treatment with regenerative therapy and exercise.
Scale | AUC | 95% Confidence interval | Sensitivity | Specificity |
---|---|---|---|---|
M-SPADI Pain | 0.99 | 0.95-1.00 | 1.00 | 0.23 |
M-SPADI Disability | 0.94 | 0.85-1.00 | 1.00 | 0.15 |
SPADI Total | 0.96 | 0.88-1.00 | 1.00 | 0.23 |
Abbreviation - AUC: Area under curve, M-SPADI: Malay Shoulder Pain and Disability Index
Fig. 1:
Responsiveness: Area under the curve for M-SPADI Pain, M-SPADI Disability and M-SPADI Total at three months after treatment.
Discussion
In this study the translated M-SPADI was evaluated using the COSMIN guidelines. Aside from COSMIN, there are other instruments available to evaluate the measurement characteristics of patient-reported outcome measures including "Evaluating Measures of Patient-Reported Outcomes (EMPRO)," "Quality Criteria for Measurement Properties" and “Scientific Advisory Committee Medical Outcomes Trust (SACMOT)"38. Among these, COSMIN was reported to be the most comprehensive and frequently used to assess the psychometric properties of self-reported measures39. As a thorough and rigorous assessment method developed by an international team of experts by Delphi agreement, the COSMIN guideline offers multiple strengths17. COSMIN has the following advantages: the instructions for conducting and analysing patients' reported outcome measures were thoroughly explained and are readily available online. In addition, it offers guidelines for Classical Test Theory (CTT) and Item Response Theory (IRT)38. Although some concerns were raised on the complexity of COSMIN as it involves multiple steps of evaluations, COSMIN provides a robust description of interpretability, burden, and alternative modes of administration compared to the rest39.
The M-SPADI revealed high validity and reliability, as well as adequate responsiveness, in patients with shoulder pain. According to the structural validity, the M-SPADI has bidimensional structure; the MSPADI pain and disability16. The questionnaire's bi-dimensionality were also identified by the original SPADI6, the Slovene SPADI15, and the Chinese SPADI14 and in two large study participants using the Rasch model10,40,41.
The M-SPADI known-group validity found that patients with shoulder pain had considerably greater pain and disability scores than those without shoulder pain16. In this study, we found two main clusters of shoulder pain, which were: rotator cuff related pathologies and other shoulder pathologies which include bursitis, acromioclavicular joint disease, adhesive capsulitis, instability and majority was non-specific shoulder pain. Thus, a known group validity between the two groups were performed. Patients with rotator cuff disease had significantly higher M-SPADI Disability and M-SPADI Total scores than patients with other shoulder pathologies. A similar finding was reported by Tran et al, which found that patients with rotator cuff pathology had the highest SPADI score compared to other pathologies. They suggested poor response to treatment among patient with rotator cuff pathology as were the cause of this observation42. Similarly, the current study found patients with rotator cuff pathology have longer duration of the condition which could be associated with higher M-SPADI score. Moreover, rotator cuff pathology is often a chronic condition that is challenging to treat43,44 and often associated with pain and functional limitation45,46. In contrast patients with other shoulder pathologies such as subacromial bursitis and adhesive capsulitis usually respond well to treatment (e.g. corticosteroid injection and physiotherapy modalities)47. Our findings may reflect that M-SPADI could be utilised to assess rotator cuff pathology comparable with rotator cuff-specific shoulder questionnaires such as the Western Ontario Rotator Cuff (WORC) Index, since patients with rotator cuff pathology showed significantly higher disability scores compared to other shoulder pathologies. Furthermore, there were studies that reported good validity and reliability of SPADI for patients with rotator cuff disease40,48. There was no other translated SPADI that could be compare for the known group validity to the present study.
The cross-cultural validity or measurement invariance was defined broadly in the COSMIN guidelines as “the degree to which the performance of items on a translated or culturally adapted instrument is a sufficient reflection of the performance of the items in the original version or of a different ethnic group, gender, or patient population”17. There were various ways assessing the measurement invariance for the SPADI. Thoomes-de Graaf et al11 compared the Dutch SPADI in patients with a high initial pain score level of >7 who were absent from work with patients with a lower initial pain score level of less than 7 who were not absent from work. In his study, patients with a high initial pain score with work absence had a significantly higher mean SPADI score than those with a low initial pain score. Other study compared pain on the dominant side versus the non-dominant side found significantly higher SPADI pain scores on the dominant side than the non-dominant side40. In this study, the measurement invariance was assessed by comparing patients with rotator cuff pathology and with other types of shoulder pain. According to the multi-group factor analysis, there was no significant difference in measurement invariance between the two groups. This measurement invariance is important to evaluate if patients who have different traits (rotator cuff problems and other shoulder problems) understand the questionnaire in the same way17.
In this study, M-SPADI was found to be strongly correlated with NRS and negatively correlated with active range of motion in terms of criterion validity. This finding is comparable with the M-SPADI Spanish12 and China14 which demonstrated a significant correlation of M-SPADI with NRS. Spearman's correlation validated the hypothesis that M-SPADI subscales and overall scores were negatively correlated with shoulder AROM. SPADI has previously been associated with NRS and AROM in the Original SPADI6, Telugu SPADI13, and Tamil SPADI49.
All of the items in M-SPADI Pain and M-SPADI Disability had good internal consistency, with a Cronbach's alpha greater than 0.90. As a result, neither the pain subscale nor the disability subscale questions should be removed, nor should additional questions be added. Three items were removed from the Spanish SPADI to achieve the best fit model for confirmatory factor analysis12. Our findings with Cronbach’s alpha of more than 0.90 consistent with previous research using the SPADI questionnaire in other shoulder diseases48,49.
The test-retest reliability for M-SPADI was rated as good to excellent and was superior to that of the original SPADI, which had moderate test-retest reliability6. The difference could be due to smaller sample size of 37 patients in the latter study and the patients with non-specific shoulder pain was recruited. A recent systematic review reports test-retest reliability of SPADI ranges from ICC = 0.85-0.92, similar to this study5. The test-retest is important because it determines the consistency of the patients responses to the questionnaire over time31.
Only a few studies examined the SPADI's measurement error50-53. The term "measurement error" refers to the systematic and random variation in a patient's score that cannot be attributed to true changes in the construct being measured23. The SEM in this study was between 2.46 and 7.78, which is consistent with previous studies that reported SEM between 2.1 and 8.950,51. Previous studies reported the MDC values from 12.2 to 23.1 (50-52). The MDCs for M-SPADI Disability (6.82) and M-SPADI Total (8.79) were low in this study, indicating that the majority of patients had similar pathology (rotator cuff). According to the COSMIN guidelines, the MDC should ideally be greater than the SDC23.
A limited number of studies assessed the responsiveness of the SPADI overtime or after treatment32. Responsiveness is defined as the ability of a PROM to detect change over time in the construct being measured23. The AUC in this study was comparable with other studies, which ranged from 0.80 to 0.9250,51,54.
In this study, the AUC for patients with rotator cuff tendinopathy treated with regenerative therapy injection combined with rotator cuff exercise program, was greater than 0.90 at three months. AUC greater than 0.90 indicates that the M-SPADI could discriminate between patients who responded to treatment and those who did not respond to treatment as assessed by NRS. According to Kc et al, the overall quality of the SPADI responsiveness study was moderate, with inconsistent findings56. In contrast our study demonstrated excellent responsiveness to intervention. However, the comparison was difficult to perform because the majority of studies did not include information on the treatment received by the patients6,49,55,56.
There are several limitations that need to be addressed in this study. First, we did not verify whether patients remained stable during the 7-day period between the test and the retest, which could have influenced the measurement error. However, the seven-day period between assessments used is commonly used in previous studies50,55. Second, previous studies compared the SPADI construct validity with other validated questionnaires48,51,54. However, given the absence of other validated Malay shoulder questionnaire comparison could not be made. For the purpose of determining responsiveness, we only analysed patients diagnosed with rotator cuff tendinopathy, as this is the most common condition seen in our setting. The patients were treated with regenerative therapy and exercise for chronic pain. The strength of this study is that it supplements the results of the previous study by including a more comprehensive psychometric properties assessment which include the criterion validity, measurement invariance and responsiveness16. Additionally, this study demonstrates previously undetermined known-group validity and measurement invariance, which compares patients with and without rotator cuff problems.
Validated health-related patient-reported outcome measures (PROMs) is essential in promoting patient-centred care as it provides the clinician with abundant information to assist decision-making in the various aspects of patient care including diagnostic process, treatment selection, monitoring treatment progress, and facilitating effective communication with patients. There are at least fifty PROMs available on shoulder function, with the Shoulder Pain and Disability Index (SPADI) being one of the most frequently referenced and studied3,57. The SPADI questionnaire is brief, simple to use, easy to comprehend, and had substantial evidence for validity, reliability, and responsiveness56. Moreover, the SPADI has been translated into at least fifteen other languages. The M-SPADI, is the first shoulder-specific questionnaire translated into Malay that is valid and reliable, and could be utilised in clinical and research settings to assess pain and disability among Malay-speaking individuals with shoulder pain. The findings of this study are comparable to those of other international studies employing the SPADI questionnaire and would be useful for future large cross-cultural and cross-regional study.
Conclusion
The M-SPADI is a bi-dimensional instrument with moderate/good/excellent validity, reliability, and responsiveness for research and monitoring patient response to treatment. M-SPADI may be used to monitor the progression of disability and functions in patients with rotator cuff pathology, including their post-treatment response and progression.
Acknowledgements
The authors would like to thank Dr Krishnavanie Shunmugam from the Department of English Language, Faculty of Languages and Linguistics, University of Malaya for the guidance in the translation and validation process and Dr Juliana Usman from the Department of Biomedical Engineering, Faculty of Engineering, University of Malaya for the guidance to complete the manuscript.
Appendix
Appendix 1: Malay Shoulder Pain and Disability Index (M-SPADI).
Conflict of Interest
The authors declare no potential conflict of interest.
References
- 1.Kingsley C, Patel S. Patient-reported outcome measures and patient-reported experience measures. BJA Education. 2017;17(4):137–44. doi: 10.1093/bjaed/mkw060. doi: [DOI] [Google Scholar]
- 2.Churruca K, Pomare C, Ellis LA, Long JC, Henderson SB, Murphy LED et al. Patient-reported outcome measures (PROMs): A review of generic and condition-specific measures and a discussion of trends and issues. Health Expect. 2021;24(4):1015–24. doi: 10.111/hex.13254. doi: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Padua R, de Girolamo L, Grassi A, Cucchi D. Choosing patient-reported outcome measures for shoulder pathology. EFORT Open Rev. 2021;6(9):779–87. doi: 10.1302/2058-5241.6.200109. doi: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Shigley C, Green A. Shoulder conditions and health related quality of life and utility: a current concepts review. JSES Int. 2022;6(1):167–74. doi: 10.1016/j.jseint.2021.10.006. doi: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Buchbinder R, Page MJ, Huang H, Verhagen AP, Beaton D, Kopkow C et al. A Preliminary Core Domain Set for Clinical Trials of Shoulder Disorders: A Report from the OMERACT 2016 Shoulder Core Outcome Set Special Interest Group. J Rheumatol. 2017;44(12):1880–3. doi: 10.3899/jrheum.161123. doi: [DOI] [PubMed] [Google Scholar]
- 6.Roach KE, Budiman-Mak E, Songsiridej N, Lertratanakul Y. Development of a shoulder pain and disability index. Arthritis Care Res. 1991;4(4):143–9. [PubMed] [Google Scholar]
- 7.Wang W, Jia ZY, Liu J, Xie QY, Cui J, Zheng W et al. Cross-cultural adaptation and validation of the Chinese version of the shoulder pain and disability index in patients with symptomatic shoulder pain: A prospective case series. Medicine (Baltimore). 2018;97(26):e11227. doi: 10.1097/MD.0000000000011227. doi: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Chester R, Jerosch-Herold C, Lewis J, Shepstone L. The SPADI and QuickDASH Are Similarly Responsive in Patients Undergoing Physical Therapy for Shoulder Pain. J Orthop Sports Phys Ther. 2017;47(8):538–47. doi: 10.2519/jospt.2017.7195. doi: [DOI] [PubMed] [Google Scholar]
- 9.Paul A, Lewis M, Shadforth MF, Croft PR, Van Der Windt, Hay EM. A comparison of four shoulder-specific questionnaires in primary care. Ann Rheum Dis. 2004;63(10):1293–9. doi: 10.1136/ard.2003.012088. doi: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Christensen KB, Thorborg K, Holmich P, Clausen MB. Rasch validation of the Danish version of the shoulder pain and disability index (SPADI) in patients with rotator cuff-related disorders. Qual Life Res. 2019;28(3):795–800. doi: 10.1007/s11136-018-2052-8. doi: [DOI] [PubMed] [Google Scholar]
- 11.Thoomes-de Graaf M, Scholten-Peeters GG, Duijn E, Karel Y, Koes BW, Verhagen AP. The Dutch Shoulder Pain and Disability Index (SPADI): a reliability and validation study. Qual Life Res. 2015;24(6):1515–9. doi: 10.1007/s11136-014-0879-1. doi: [DOI] [PubMed] [Google Scholar]
- 12.Luque-Suarez A, Rondon-Ramos A, Fernandez-Sanchez M, Roach KE, Morales-Asencio JM. Spanish version of SPADI (shoulder pain and disability index) in musculoskeletal shoulder pain: a new 10-items version after confirmatory factor analysis. Health Qual Life Outcomes. 2016;14:32. doi: 10.1186/s12955-016-0436-4. doi: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Gadam Y, Subramanian S, Apparao P, Kumar S, Neerukonda S, Kambarthi N. Reliability and Validity of the Indian (Telugu) Version of the Shoulder Pain and Disability Index. J Clin Diagnostic Res. 2018;12 doi: 10.7860/JCDR/2018/31340.11268. YC01-YC4. doi: [DOI] [Google Scholar]
- 14.Yao M, Yang L, Cao ZY, Cheng SD, Tian SL, Sun YL et al. Translation and cross-cultural adaptation of the Shoulder Pain and Disability Index (SPADI) into Chinese. Clin Rheumatol. 2017;36(6):1419–26. doi: 10.1007/s10067-017-3562-4. doi: [DOI] [PubMed] [Google Scholar]
- 15.Jamnik H, Spevak MK. Shoulder Pain and Disability Index: validation of Slovene version. Int J Rehabil Res. 2008;31(4):33741. doi: 10.1097/MRR.0b013e3282fcae09. doi; [DOI] [PubMed] [Google Scholar]
- 16.Ho CA, Yun Ling JC, Abdul Karim S. Cross-cultural adaptation and measurement properties of the Malay Shoulder Pain and Disability Index. PLoS One. 2022;17(3):e0265198. doi: 10.1371/journal.pone.0265198. doi: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Prinsen CAC, Mokkink LB, Bouter LM, Alonso J, Patrick DL, de Vet HCW et al. COSMIN guideline for systematic reviews of patient-reported outcome measures. Qual Life Res. 2018;27(5):1147–57. doi: 10.1007/s11136-018-1798-3. doi: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Williams JW, Jr, Holleman DR, Jr, Simel DL. Measuring shoulder function with the Shoulder Pain and Disability Index. J Rheumatol. 1995;22(4):727–32. [PubMed] [Google Scholar]
- 19.Haefeli M, Elfering A. Pain assessment. Eur Spine J. 2006;1(Suppl 1):S17–24. doi: 10.1007/s00586-006-1044-x. 15 Suppl. doi: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Chien CW, Bagraith KS, Khan A, Deen M, Strong J. Comparative responsiveness of verbal and numerical rating scales to measure pain intensity in patients with chronic pain. J Pain. 2013;14(12):1653–62. doi: 10.1016/j.jpain.2013.08.006. doi: [DOI] [PubMed] [Google Scholar]
- 21.Pathak A, Sharma S, Jensen MP. The utility and validity of pain intensity rating scales for use in developing countries. Pain Rep. 2018;3(5):e672. doi: 10.1097/PR9.0000000000000672. doi: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Tsang S, Royse CF, Terkawi AS. Guidelines for developing, translating, and validating a questionnaire in perioperative and pain medicine. Saudi J Anaesth. 2017;11(Suppl 1):S80–S9. doi: 10.4103/sja.SJA_203_17. doi: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Mokkink LB, Terwee CB, Patrick DL, Alonso J, Stratford PW, Knol DL et al. The COSMIN study reached international consensus on taxonomy, terminology, and definitions of measurement properties for health-related patient-reported outcomes. J Clin Epidemiol. 2010;63(7):737–45. doi: 10.1016/j.clinepi.2010.02.006. doi: [DOI] [PubMed] [Google Scholar]
- 24.Arifin WN. A Web-based Sample Size Calculator for Reliability Studies. Education Med J. 2018;10(3):67–76. doi: 10.21315/eimj2018.10.3.8. doi: [DOI] [Google Scholar]
- 25.Cheung GW, Wang C. Current Approaches for Assessing Convergent and Discriminant Validity with SEM: Issues and Solutions. Acad Manag Proc. 2017;2017(1):12706. doi: 10.5465/AMBPP.2017.12706abstract. doi: [DOI] [Google Scholar]
- 26.Hair JF, Jr, Ringle CM, Sarstedt M. Partial Least Squares Structural Equation Modeling: Rigorous Applications, Better Results and Higher Acceptance. Long Range Plann. 2013;46:1–12. doi: 10.1016/j.lrp.2013.01.001. doi: [DOI] [Google Scholar]
- 27.Gregorich SE. Do self-report instruments allow meaningful comparisons across diverse population groups? Testing measurement invariance using the confirmatory factor analysis framework. Med Care. 2006;44(11 Suppl 3):S78–S94. doi: 10.1097/01.mlr.0000245545.12228.8f. doi: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Akoglu H. User's guide to correlation coefficients. Turk J Emerg Med. 2018;18(3):91–3. doi: 10.1016/j.tjem.2018.08.001. doi: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Clark LA, Watson D. Constructing validity: Basic issues in objective scale development. Psychol Assess. 1995;7:309–19. doi: 10.1037/1040-3590.7.3.309. doi: [DOI] [Google Scholar]
- 30.Onwuegbuzie AJ, Daniel LG. A Framework for Reporting and Interpreting Internal Consistency Reliability Estimates. Meas Eval Couns Dev. 2002;35(2):89–103. doi: 10.1080/07481756.2002.12069052. doi: [DOI] [Google Scholar]
- 31.Koo TK, Li MY. A Guideline of Selecting and Reporting Intraclass Correlation Coefficients for Reliability Research. J Chiropr Med. 2016;15(2):155–63. doi: 10.1016/j.jcm.2016.02.012. doi: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Terwee CB, Bot SD, de Boer MR, van der Windt DA, Knol DL, Dekker J et al. Quality criteria were proposed for measurement properties of health status questionnaires. J Clin Epidemiol. 2007;60(1):34–42. doi: 10.1016/j.clinepi.2006.03.012. doi: [DOI] [PubMed] [Google Scholar]
- 33.Palta M, Chen HY, Kaplan RM, Feeny D, Cherepanov D, Fryback DG. Standard Error of Measurement of 5 Health Utility Indexes across the Range of Health for Use in Estimating Reliability and Responsiveness. Med Decis Making. 2011;31(2):2609. doi: 10.1177/0272989X10380925. doi: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Bland JM, Altman DG. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet. 1986;1(8476):307–10. [PubMed] [Google Scholar]
- 35.Boonstra AM, Stewart RE, Koke AJ, Oosterwijk RF, Swaan JL, Schreurs KM et al. Cut-Off Points for Mild, Moderate, and Severe Pain on the Numeric Rating Scale for Pain in Patients with Chronic Musculoskeletal Pain: Variability and Influence of Sex and Catastrophizing. Front Psychol. 2016;7:1466. doi: 10.3389/fpsyg.2016.01466. doi: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Mandrekar JN. Receiver Operating Characteristic Curve in Diagnostic Test Assessment. J Thorac Oncol. 2010;5(9):1315–6. doi: 10.1097/JTO.0b013e3181ec173d. doi: [DOI] [PubMed] [Google Scholar]
- 37.Chen LA, Kao CL. Parametric and nonparametric improvements in Bland and Altman's assessment of agreement method. Stat Med. 2021;40(9):2155–76. doi: 10.1002/sim.8895. doi: [DOI] [PubMed] [Google Scholar]
- 38.Rosenkoetter U, Tate RL. Assessing Features of Psychometric Assessment Instruments: A Comparison of the COSMIN Checklist with Other Critical Appraisal Tools. Brain Impair. 2018;19(1):103–18. doi: 10.1017/BrImp.2017.29. doi: [DOI] [Google Scholar]
- 39.Lorente S, Viladrich C, Vives J, Losilla JM. Tools to assess the measurement properties of quality of life instruments: a meta-review. BMJ Open. 2020;10:e036038. doi: 10.1136/bmjopen-2019-036038. doi: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Boake BR, Childs TK, Soules TD, Zervos DL, Vincent JI, MacDermid JC. Rasch analysis of The Shoulder Pain and Disability Index (SPADI) in a postrepair rotator cuff sample. J Hand Ther. 2021;34(4):612–8. doi: 10.1016/j.jht.2020.09.001. doi: [DOI] [PubMed] [Google Scholar]
- 41.Hair JF, Jr, Hult GTM, Ringle CM, Sarstedt M, Danks NP, Ray S. An Introduction to Structural Equation Modeling. Hair JF Jr, Hult GTM, Ringle CM, Sarstedt M, Danks NP, Ray S, . Partial Least Squares Structural Equation Modeling (PLSSEM) Using R: A Workbook. Cham: Springer Cham; 2021:1–29. In. editors. [Google Scholar]
- 42.Tran G, Hensor EM, Ray A, Kingsbury SR, O’Connor P, Conaghan PG. Ultrasound-detected pathologies cluster into groups with different clinical outcomes: Data from 3000 community referrals for shoulder pain. Arthritis Res Ther. 2017;19(1):30. doi: 10.1186/s13075-017-1235-y. doi: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Desjardins-Charbonneau A, Roy JS, Dionne CE, Fremont P, MacDermid JC, Desmeules F. The Efficacy of Manual Therapy for Rotator Cuff Tendinopathy: A Systematic Review and Meta-analysis. J Orthop Sports Phys Ther. 2015;45(5):330–50. doi: 10.2519/jospt.2015.5455. [DOI] [PubMed] [Google Scholar]
- 44.Boorman RS, More KD, Hollinshead RM, Wiley JP, Mohtadi NG, Lo IKY et al. What happens to patients when we do not repair their cuff tears? Five-year rotator cuff quality-of-life index outcomes following nonoperative treatment of patients with full-thickness rotator cuff tears. J Shoulder Elbow Surg. 2018;27(3):444–8. doi: 10.1016/j.jse.2017.10.009. [DOI] [PubMed] [Google Scholar]
- 45.Lewis J, McCreesh K, Roy JS, Ginn K. Rotator Cuff Tendinopathy: Navigating the Diagnosis-Management Conundrum. J Orthop Sports Phys Ther. 2015;45(11):923–37. doi: 10.2519/jospt.2015.5941. doi: [DOI] [PubMed] [Google Scholar]
- 46.Ulack C, Suarez J, Brown L, Ring D, Wallace S, Teisberg E. What are People That Seek Care for Rotator Cuff Tendinopathy Experiencing in Their Daily Life? J Patient Exp. 2022;9:23743735211069811. doi: 10.1177/23743735211069811. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.ElMeligie MM, Allam NM, Yehia RM, Ashour AA. Systematic review and meta-analysis on the effectiveness of ultrasound-guided versus landmark corticosteroid injection in the treatment of shoulder pain: an update. J Ultrasound. 2022 doi: 10.1007/s40477-022-00684-1. doi: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Vrouva S, Batistaki C, Koutsioumpa E, Kostopoulos D, Stamoulis E, Kostopanagiotou G. The Greek version of Shoulder Pain and Disability Index (SPADI): translation, cultural adaptation, and validation in patients with rotator cuff tear. J Orthop Traumatol. 2016;17(4):315–26. doi: 10.1007/s10195-016-0425-8. doi: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Jeldi AJ, Aseer AL, Dhandapani AG, Roach KE. Cross-cultural adaptation, reliability and validity of an Indian (Tamil) version for the shoulder pain and disability index. Hong Kong Physiother J. 2012;30(2):99–104. doi: 10.1016/j.hpj.2012.06.001. doi: [DOI] [Google Scholar]
- 50.Spanou A, Mamais I, Lamnisos D, Stasinopoulos D. Reliability and validity of the Greek shoulder pain and disability index in patients with shoulder pain. Disabil Rehabil. 2020;42(9):1299–304. doi: 10.1080/09638288.2018.1519728. doi: [DOI] [PubMed] [Google Scholar]
- 51.Thoomes-de Graaf M, Scholten-Peeters W, Duijn E, Karel Y, de Vet HC, Koes B et al. The Responsiveness and Interpretability of the Shoulder Pain and Disability Index. J Orthop Sports Phys Ther. 2017;47(4):278–86. doi: 10.2519/jospt.2017.7079. doi: [DOI] [PubMed] [Google Scholar]
- 52.Ekeberg OM, Bautz-Holter E, Keller A, Tveita EK, Juel NG, Brox JI. A questionnaire found disease-specific WORC index is not more responsive than SPADI and OSS in rotator cuff disease. J Clin Epidemiol. 2010;63(5):575–84. doi: 10.1016/j.jclinepi.2009.07.012. doi: [DOI] [PubMed] [Google Scholar]
- 53.Tveita EK, Ekeberg OM, Juel NG, Bautz-Holter E. Responsiveness of the shoulder pain and disability index in patients with adhesive capsulitis. BMC Musculoskelet Disord. 2008;9:161. doi: 10.1186/1471-2474-9-161. doi: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Christiansen DH, Andersen JH, Haahr JP. Cross-cultural adaption and measurement properties of the Danish version of the Shoulder Pain and Disability Index. Clin Rehabil. 2013;27(4):355–60. doi: 10.1177/0269215512456220. doi: [DOI] [PubMed] [Google Scholar]
- 55.Staples MP, Forbes A, Green S, Buchbinder R. Shoulder-specific disability measures showed acceptable construct validity and responsiveness. J Clin Epidemiol. 2010;63(2):163–70. doi: 10.1016/j.jclinepi.2009.03.023. doi: [DOI] [PubMed] [Google Scholar]
- 56.Kc S, Sharma S, Ginn KA, Reed D. Measurement properties of translated versions of the Shoulder Pain and Disability Index: A systematic review. Clin Rehabil. 2021;35(3):410–22. doi: 10.1177/0269215520963199. doi: [DOI] [PubMed] [Google Scholar]
- 57.Martins J, Napoles BV, Hoffman CB, Oliveira AS. The Brazilian version of Shoulder Pain and Disability Index - translation, cultural adaptation and reliability. Rev Bras Fisioter. 2010;14(6):527–36. [PubMed] [Google Scholar]